Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 7.1 KiB |
Average record size in memory | 72.3 B |
Variable types
Numeric | 4 |
---|---|
Categorical | 4 |
Dataset
Description | 고지혈증 환자의 검사 기록을 OMOP CDM 형식으로 생산한 데이터 |
---|---|
Author | 가톨릭대학교 서울성모병원 |
URL | http://cmcdata.net/data/dataset/_measurement_2020-omop-cdm |
measurement_type_concept_id has constant value "" | Constant |
unit_concept_id is highly overall correlated with unit_source_value | High correlation |
unit_source_value is highly overall correlated with measurement_concept_id and 1 other fields | High correlation |
measurement_id is highly overall correlated with measurement_date | High correlation |
measurement_concept_id is highly overall correlated with unit_source_value | High correlation |
measurement_date is highly overall correlated with measurement_id | High correlation |
measurement_id has unique values | Unique |
measurement_concept_id has 28 (28.0%) zeros | Zeros |
value_as_number has 2 (2.0%) zeros | Zeros |
Reproduction
Analysis started | 2023-10-08 18:57:46.870516 |
---|---|
Analysis finished | 2023-10-08 18:57:52.249045 |
Duration | 5.38 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
measurement_id
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 89571.75 |
Minimum | 44866 |
---|---|
Maximum | 119049 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 44866 |
---|---|
5-th percentile | 53165.9 |
Q1 | 59459.5 |
median | 99846 |
Q3 | 103106.5 |
95-th percentile | 119040.05 |
Maximum | 119049 |
Range | 74183 |
Interquartile range (IQR) | 43647 |
Descriptive statistics
Standard deviation | 24635.097 |
---|---|
Coefficient of variation (CV) | 0.275032 |
Kurtosis | -1.1150824 |
Mean | 89571.75 |
Median Absolute Deviation (MAD) | 7388.5 |
Skewness | -0.66391852 |
Sum | 8957175 |
Variance | 6.0688801 × 108 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
44866 | 1 | 1.0% |
102500 | 1 | 1.0% |
103106 | 1 | 1.0% |
103105 | 1 | 1.0% |
103087 | 1 | 1.0% |
102526 | 1 | 1.0% |
102525 | 1 | 1.0% |
102523 | 1 | 1.0% |
102522 | 1 | 1.0% |
102519 | 1 | 1.0% |
Other values (90) | 90 |
Value | Count | Frequency (%) |
44866 | 1 | |
44867 | 1 | |
44869 | 1 | |
44872 | 1 | |
53164 | 1 | |
53166 | 1 | |
53167 | 1 | |
53168 | 1 | |
53169 | 1 | |
53170 | 1 |
Value | Count | Frequency (%) |
119049 | 1 | |
119044 | 1 | |
119043 | 1 | |
119042 | 1 | |
119041 | 1 | |
119040 | 1 | |
116868 | 1 | |
116865 | 1 | |
116853 | 1 | |
116850 | 1 |
measurement_concept_id
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 26 |
---|---|
Distinct (%) | 26.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2170940.1 |
Minimum | 0 |
---|---|
Maximum | 3036887 |
Zeros | 28 |
Zeros (%) | 28.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 3011903 |
Q3 | 3020416 |
95-th percentile | 3024929 |
Maximum | 3036887 |
Range | 3036887 |
Interquartile range (IQR) | 3020416 |
Descriptive statistics
Standard deviation | 1360660.6 |
---|---|
Coefficient of variation (CV) | 0.62676101 |
Kurtosis | -1.031168 |
Mean | 2170940.1 |
Median Absolute Deviation (MAD) | 10998 |
Skewness | -0.99483847 |
Sum | 2.1709401 × 108 |
Variance | 1.8513973 × 1012 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 28 | |
3000905 | 6 | 6.0% |
3013682 | 6 | 6.0% |
3020416 | 5 | 5.0% |
3000963 | 5 | 5.0% |
3023314 | 5 | 5.0% |
3024929 | 5 | 5.0% |
3016723 | 5 | 5.0% |
3019550 | 5 | 5.0% |
3023103 | 5 | 5.0% |
Other values (16) | 25 |
Value | Count | Frequency (%) |
0 | 28 | |
3000905 | 6 | 6.0% |
3000963 | 5 | 5.0% |
3004410 | 2 | 2.0% |
3004501 | 2 | 2.0% |
3006923 | 2 | 2.0% |
3007070 | 1 | 1.0% |
3007220 | 1 | 1.0% |
3009966 | 1 | 1.0% |
3010156 | 2 | 2.0% |
Value | Count | Frequency (%) |
3036887 | 1 | 1.0% |
3035995 | 1 | 1.0% |
3026910 | 2 | 2.0% |
3024929 | 5 | |
3024128 | 1 | 1.0% |
3023314 | 5 | |
3023103 | 5 | |
3022192 | 1 | 1.0% |
3020416 | 5 | |
3019550 | 5 |
measurement_date
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 11 |
---|---|
Distinct (%) | 11.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 201343.75 |
Minimum | 201106 |
---|---|
Maximum | 201505 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 201106 |
---|---|
5-th percentile | 201112 |
Q1 | 201204 |
median | 201406 |
Q3 | 201408 |
95-th percentile | 201505 |
Maximum | 201505 |
Range | 399 |
Interquartile range (IQR) | 204 |
Descriptive statistics
Standard deviation | 145.27218 |
---|---|
Coefficient of variation (CV) | 0.00072151326 |
Kurtosis | -1.0257128 |
Mean | 201343.75 |
Median Absolute Deviation (MAD) | 5 |
Skewness | -0.75677841 |
Sum | 20134375 |
Variance | 21104.008 |
Monotonicity | Increasing |
Value | Count | Frequency (%) |
201408 | 23 | |
201112 | 20 | |
201504 | 13 | |
201401 | 10 | |
201405 | 7 | 7.0% |
201505 | 6 | 6.0% |
201204 | 5 | 5.0% |
201106 | 4 | 4.0% |
201404 | 4 | 4.0% |
201407 | 4 | 4.0% |
Value | Count | Frequency (%) |
201106 | 4 | 4.0% |
201112 | 20 | |
201204 | 5 | 5.0% |
201401 | 10 | |
201404 | 4 | 4.0% |
201405 | 7 | 7.0% |
201407 | 4 | 4.0% |
201408 | 23 | |
201409 | 4 | 4.0% |
201504 | 13 |
Value | Count | Frequency (%) |
201505 | 6 | 6.0% |
201504 | 13 | |
201409 | 4 | 4.0% |
201408 | 23 | |
201407 | 4 | 4.0% |
201405 | 7 | 7.0% |
201404 | 4 | 4.0% |
201401 | 10 | |
201204 | 5 | 5.0% |
201112 | 20 |
measurement_type_concept_id
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
44818702 |
---|
Length
Max length | 8 |
---|---|
Median length | 8 |
Mean length | 8 |
Min length | 8 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 44818702 |
---|---|
2nd row | 44818702 |
3rd row | 44818702 |
4th row | 44818702 |
5th row | 44818702 |
Common Values
Value | Count | Frequency (%) |
44818702 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
44818702 | 100 |
operator_concept_id
Categorical
Distinct | 3 |
---|---|
Distinct (%) | 3.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
<NA> | |
---|---|
4171756 | |
4172704 | 5 |
Length
Max length | 7 |
---|---|
Median length | 4 |
Mean length | 4.51 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 83 | |
4171756 | 12 | 12.0% |
4172704 | 5 | 5.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 83 | |
4171756 | 12 | 12.0% |
4172704 | 5 | 5.0% |
value_as_number
Real number (ℝ)
ZEROS
 
Distinct | 53 |
---|---|
Distinct (%) | 53.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 44.98 |
Minimum | 0 |
---|---|
Maximum | 392 |
Zeros | 2 |
Zeros (%) | 2.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1 |
Q1 | 4 |
median | 13 |
Q3 | 41.75 |
95-th percentile | 168.2 |
Maximum | 392 |
Range | 392 |
Interquartile range (IQR) | 37.75 |
Descriptive statistics
Standard deviation | 71.378629 |
---|---|
Coefficient of variation (CV) | 1.586897 |
Kurtosis | 8.8991991 |
Mean | 44.98 |
Median Absolute Deviation (MAD) | 11 |
Skewness | 2.6996555 |
Sum | 4498 |
Variance | 5094.9087 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 13 | 13.0% |
4 | 10 | 10.0% |
11 | 5 | 5.0% |
24 | 4 | 4.0% |
12 | 4 | 4.0% |
10 | 3 | 3.0% |
5 | 3 | 3.0% |
13 | 3 | 3.0% |
19 | 3 | 3.0% |
140 | 2 | 2.0% |
Other values (43) | 50 |
Value | Count | Frequency (%) |
0 | 2 | 2.0% |
1 | 13 | |
2 | 2 | 2.0% |
3 | 1 | 1.0% |
4 | 10 | |
5 | 3 | 3.0% |
6 | 2 | 2.0% |
7 | 2 | 2.0% |
9 | 1 | 1.0% |
10 | 3 | 3.0% |
Value | Count | Frequency (%) |
392 | 1 | |
371 | 1 | |
239 | 1 | |
192 | 1 | |
172 | 1 | |
168 | 1 | |
152 | 1 | |
144 | 1 | |
141 | 1 | |
140 | 2 |
unit_concept_id
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
0 | |
---|---|
8554 |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.42 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 8554 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 8554 |
Common Values
Value | Count | Frequency (%) |
0 | 86 | |
8554 | 14 | 14.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 86 | |
8554 | 14 | 14.0% |
unit_source_value
Categorical
HIGH CORRELATION
 
Distinct | 11 |
---|---|
Distinct (%) | 11.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
㎎/㎗ | |
---|---|
% | |
초 | |
10^9/L | |
mmol/L | |
Other values (6) |
Length
Max length | 7 |
---|---|
Median length | 6 |
Mean length | 3.43 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | % |
---|---|
2nd row | <NA> |
3rd row | 초 |
4th row | 초 |
5th row | % |
Common Values
Value | Count | Frequency (%) |
㎎/㎗ | 18 | |
% | 14 | |
초 | 14 | |
10^9/L | 14 | |
mmol/L | 10 | |
U/ℓ | 9 | |
<NA> | 7 | 7.0% |
10^12/L | 5 | 5.0% |
g/㎗ | 5 | 5.0% |
㎎/ℓ | 2 | 2.0% |
Length
Value | Count | Frequency (%) |
㎎/㎗ | 18 | |
14 | ||
초 | 14 | |
10^9/l | 14 | |
mmol/l | 10 | |
u/ℓ | 9 | |
na | 7 | 7.0% |
10^12/l | 5 | 5.0% |
g/㎗ | 5 | 5.0% |
㎎/ℓ | 2 | 2.0% |
measurement_id | measurement_concept_id | measurement_date | operator_concept_id | value_as_number | unit_concept_id | unit_source_value | |
---|---|---|---|---|---|---|---|
measurement_id | 1.000 | 0.394 | 0.835 | 0.000 | 0.000 | 0.000 | 0.000 |
measurement_concept_id | 0.394 | 1.000 | 0.544 | 0.000 | 0.243 | 0.207 | 0.979 |
measurement_date | 0.835 | 0.544 | 1.000 | 0.000 | 0.000 | 0.000 | 0.367 |
operator_concept_id | 0.000 | 0.000 | 0.000 | 1.000 | 0.306 | 0.000 | 0.682 |
value_as_number | 0.000 | 0.243 | 0.000 | 0.306 | 1.000 | 0.432 | 0.468 |
unit_concept_id | 0.000 | 0.207 | 0.000 | 0.000 | 0.432 | 1.000 | 1.000 |
unit_source_value | 0.000 | 0.979 | 0.367 | 0.682 | 0.468 | 1.000 | 1.000 |
operator_concept_id | unit_concept_id | unit_source_value | |
---|---|---|---|
operator_concept_id | 1.000 | 0.000 | 0.411 |
unit_concept_id | 0.000 | 1.000 | 0.955 |
unit_source_value | 0.411 | 0.955 | 1.000 |
measurement_id | measurement_concept_id | measurement_date | value_as_number | operator_concept_id | unit_concept_id | unit_source_value | |
---|---|---|---|---|---|---|---|
measurement_id | 1.000 | 0.013 | 0.988 | -0.060 | 0.000 | 0.000 | 0.000 |
measurement_concept_id | 0.013 | 1.000 | 0.024 | 0.161 | 0.000 | 0.132 | 0.833 |
measurement_date | 0.988 | 0.024 | 1.000 | -0.078 | 0.000 | 0.000 | 0.177 |
value_as_number | -0.060 | 0.161 | -0.078 | 1.000 | 0.470 | 0.450 | 0.252 |
operator_concept_id | 0.000 | 0.000 | 0.000 | 0.470 | 1.000 | 0.000 | 0.411 |
unit_concept_id | 0.000 | 0.132 | 0.000 | 0.450 | 0.000 | 1.000 | 0.955 |
unit_source_value | 0.000 | 0.833 | 0.177 | 0.252 | 0.411 | 0.955 | 1.000 |
measurement_id | measurement_concept_id | measurement_date | measurement_type_concept_id | operator_concept_id | value_as_number | unit_concept_id | unit_source_value | |
---|---|---|---|---|---|---|---|---|
0 | 44866 | 0 | 201106 | 44818702 | <NA> | 113 | 8554 | % |
1 | 44867 | 0 | 201106 | 44818702 | <NA> | 1 | 0 | <NA> |
2 | 44869 | 0 | 201106 | 44818702 | <NA> | 10 | 0 | 초 |
3 | 44872 | 0 | 201106 | 44818702 | <NA> | 24 | 0 | 초 |
4 | 53164 | 3004410 | 201112 | 44818702 | <NA> | 5 | 8554 | % |
5 | 53166 | 3000905 | 201112 | 44818702 | <NA> | 6 | 0 | 10^9/L |
6 | 53167 | 3020416 | 201112 | 44818702 | <NA> | 4 | 0 | 10^12/L |
7 | 53168 | 3000963 | 201112 | 44818702 | <NA> | 13 | 0 | g/㎗ |
8 | 53169 | 3023314 | 201112 | 44818702 | 4171756 | 39 | 8554 | % |
9 | 53170 | 3024929 | 201112 | 44818702 | <NA> | 192 | 0 | 10^9/L |
measurement_id | measurement_concept_id | measurement_date | measurement_type_concept_id | operator_concept_id | value_as_number | unit_concept_id | unit_source_value | |
---|---|---|---|---|---|---|---|---|
90 | 116850 | 0 | 201504 | 44818702 | <NA> | 11 | 0 | 초 |
91 | 116853 | 0 | 201504 | 44818702 | <NA> | 29 | 0 | 초 |
92 | 116865 | 3035995 | 201504 | 44818702 | <NA> | 50 | 0 | U/ℓ |
93 | 116868 | 3010156 | 201504 | 44818702 | <NA> | 0 | 0 | ㎎/ℓ |
94 | 119040 | 3000905 | 201505 | 44818702 | <NA> | 5 | 0 | 10^9/L |
95 | 119041 | 3020416 | 201505 | 44818702 | 4171756 | 3 | 0 | 10^12/L |
96 | 119042 | 3000963 | 201505 | 44818702 | 4171756 | 9 | 0 | g/㎗ |
97 | 119043 | 3023314 | 201505 | 44818702 | 4171756 | 28 | 8554 | % |
98 | 119044 | 3024929 | 201505 | 44818702 | <NA> | 152 | 0 | 10^9/L |
99 | 119049 | 3013682 | 201505 | 44818702 | 4172704 | 24 | 0 | ㎎/㎗ |