Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 6.2 KiB |
Average record size in memory | 63.3 B |
Variable types
Numeric | 3 |
---|---|
Categorical | 4 |
Dataset
Description | 당뇨 환자의 관찰 기록을 OMOP CDM 형식으로 생산한 데이터 |
---|---|
Author | 가톨릭대학교 서울성모병원 |
URL | http://cmcdata.net/data/dataset/diabetes_observation_2020-omop-cdm |
observation_type_concept_id has constant value "" | Constant |
unit_source_value is highly overall correlated with value_as_number and 2 other fields | High correlation |
unit_concept_id is highly overall correlated with value_as_number and 2 other fields | High correlation |
observation_concept_id is highly overall correlated with value_as_number and 2 other fields | High correlation |
value_as_number is highly overall correlated with observation_concept_id and 2 other fields | High correlation |
Reproduction
Analysis started | 2023-10-08 18:55:37.006834 |
---|---|
Analysis finished | 2023-10-08 18:55:46.963316 |
Duration | 9.96 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
observation_id
Real number (ℝ)
Distinct | 88 |
---|---|
Distinct (%) | 88.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6.8805297 × 1015 |
Minimum | 1.6388867 × 1011 |
---|---|
Maximum | 2.458449 × 1016 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 1.6388867 × 1011 |
---|---|
5-th percentile | 1.9236646 × 1011 |
Q1 | 2.9695147 × 1011 |
median | 4.1953879 × 1011 |
Q3 | 2.4566575 × 1016 |
95-th percentile | 2.4577635 × 1016 |
Maximum | 2.458449 × 1016 |
Range | 2.4584326 × 1016 |
Interquartile range (IQR) | 2.4566278 × 1016 |
Descriptive statistics
Standard deviation | 1.1088393 × 1016 |
---|---|
Coefficient of variation (CV) | 1.611561 |
Kurtosis | -1.0311147 |
Mean | 6.8805297 × 1015 |
Median Absolute Deviation (MAD) | 1.6163166 × 1011 |
Skewness | 0.99494495 |
Sum | 6.8805297 × 1017 |
Variance | 1.2295247 × 1032 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
24568190000200000 | 3 | 3.0% |
24567340001120000 | 3 | 3.0% |
24570190000100000 | 2 | 2.0% |
24567400001060000 | 2 | 2.0% |
24584490000020000 | 2 | 2.0% |
24575940001180000 | 2 | 2.0% |
24570840001370000 | 2 | 2.0% |
24579060001110000 | 2 | 2.0% |
24572750000540000 | 2 | 2.0% |
24577560000980000 | 2 | 2.0% |
Other values (78) | 78 |
Value | Count | Frequency (%) |
163888670001 | 1 | |
163888680004 | 1 | |
163888730001 | 1 | |
192366410002 | 1 | |
192366420001 | 1 | |
192366460001 | 1 | |
192393520001 | 1 | |
199363230001 | 1 | |
199363270001 | 1 | |
199363280001 | 1 |
Value | Count | Frequency (%) |
24584490000020000 | 2 | |
24580530001380000 | 1 | |
24579060001110000 | 2 | |
24577560000980000 | 2 | |
24575950000040000 | 1 | |
24575940001180000 | 2 | |
24575410000840000 | 1 | |
24572750000540000 | 2 | |
24570840001370000 | 2 | |
24570190000100000 | 2 |
observation_concept_id
Categorical
HIGH CORRELATION
 
Distinct | 4 |
---|---|
Distinct (%) | 4.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
4060831 | |
---|---|
4062019 | |
4099154 | |
4177340 |
Length
Max length | 7 |
---|---|
Median length | 7 |
Mean length | 7 |
Min length | 7 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 4060831 |
---|---|
2nd row | 4062019 |
3rd row | 4099154 |
4th row | 4177340 |
5th row | 4060831 |
Common Values
Value | Count | Frequency (%) |
4060831 | 27 | |
4062019 | 27 | |
4099154 | 23 | |
4177340 | 23 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
4060831 | 27 | |
4062019 | 27 | |
4099154 | 23 | |
4177340 | 23 |
observation_date
Real number (ℝ)
Distinct | 26 |
---|---|
Distinct (%) | 26.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 201568.27 |
Minimum | 201208 |
---|---|
Maximum | 201912 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 201208 |
---|---|
5-th percentile | 201303 |
Q1 | 201405 |
median | 201606 |
Q3 | 201706 |
95-th percentile | 201911 |
Maximum | 201912 |
Range | 704 |
Interquartile range (IQR) | 301 |
Descriptive statistics
Standard deviation | 202.54202 |
---|---|
Coefficient of variation (CV) | 0.0010048309 |
Kurtosis | -1.0953559 |
Mean | 201568.27 |
Median Absolute Deviation (MAD) | 196.5 |
Skewness | 0.10014988 |
Sum | 20156827 |
Variance | 41023.27 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
201403 | 8 | 8.0% |
201606 | 7 | 7.0% |
201607 | 5 | 5.0% |
201412 | 4 | 4.0% |
201912 | 4 | 4.0% |
201208 | 4 | 4.0% |
201801 | 4 | 4.0% |
201911 | 4 | 4.0% |
201509 | 4 | 4.0% |
201702 | 4 | 4.0% |
Other values (16) | 52 |
Value | Count | Frequency (%) |
201208 | 4 | |
201303 | 4 | |
201304 | 4 | |
201312 | 4 | |
201403 | 8 | |
201405 | 2 | 2.0% |
201406 | 4 | |
201408 | 4 | |
201412 | 4 | |
201503 | 4 |
Value | Count | Frequency (%) |
201912 | 4 | |
201911 | 4 | |
201901 | 2 | |
201811 | 4 | |
201804 | 2 | |
201801 | 4 | |
201711 | 2 | |
201710 | 1 | 1.0% |
201706 | 4 | |
201705 | 4 |
observation_type_concept_id
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
44814644 |
---|
Length
Max length | 8 |
---|---|
Median length | 8 |
Mean length | 8 |
Min length | 8 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 44814644 |
---|---|
2nd row | 44814644 |
3rd row | 44814644 |
4th row | 44814644 |
5th row | 44814644 |
Common Values
Value | Count | Frequency (%) |
44814644 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
44814644 | 100 |
value_as_number
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 67 |
---|---|
Distinct (%) | 67.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 104.51 |
Minimum | 43 |
---|---|
Maximum | 180 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 43 |
---|---|
5-th percentile | 51 |
Q1 | 68 |
median | 90.5 |
Q3 | 144.25 |
95-th percentile | 169 |
Maximum | 180 |
Range | 137 |
Interquartile range (IQR) | 76.25 |
Descriptive statistics
Standard deviation | 41.072858 |
---|---|
Coefficient of variation (CV) | 0.3930041 |
Kurtosis | -1.4160967 |
Mean | 104.51 |
Median Absolute Deviation (MAD) | 32.5 |
Skewness | 0.28830259 |
Sum | 10451 |
Variance | 1686.9797 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
80 | 7 | 7.0% |
110 | 4 | 4.0% |
156 | 3 | 3.0% |
61 | 3 | 3.0% |
158 | 3 | 3.0% |
130 | 3 | 3.0% |
51 | 3 | 3.0% |
133 | 2 | 2.0% |
74 | 2 | 2.0% |
169 | 2 | 2.0% |
Other values (57) | 68 |
Value | Count | Frequency (%) |
43 | 1 | 1.0% |
46 | 1 | 1.0% |
50 | 1 | 1.0% |
51 | 3 | |
53 | 1 | 1.0% |
54 | 1 | 1.0% |
56 | 2 | |
58 | 2 | |
60 | 2 | |
61 | 3 |
Value | Count | Frequency (%) |
180 | 1 | |
178 | 1 | |
172 | 1 | |
170 | 1 | |
169 | 2 | |
164 | 2 | |
163 | 1 | |
162 | 1 | |
160 | 2 | |
159 | 1 |
unit_concept_id
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | 3.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
8876 | |
---|---|
9529 | |
8582 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 8876 |
---|---|
2nd row | 8876 |
3rd row | 9529 |
4th row | 8582 |
5th row | 8876 |
Common Values
Value | Count | Frequency (%) |
8876 | 54 | |
9529 | 23 | |
8582 | 23 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
8876 | 54 | |
9529 | 23 | |
8582 | 23 |
unit_source_value
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | 3.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
mmhg | |
---|---|
KG | |
CM |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.08 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | mmhg |
---|---|
2nd row | mmhg |
3rd row | KG |
4th row | CM |
5th row | mmhg |
Common Values
Value | Count | Frequency (%) |
mmhg | 54 | |
KG | 23 | |
CM | 23 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
mmhg | 54 | |
kg | 23 | |
cm | 23 |
observation_id | observation_concept_id | observation_date | value_as_number | unit_concept_id | unit_source_value | |
---|---|---|---|---|---|---|
observation_id | 1.000 | 0.322 | 0.166 | 0.000 | 0.141 | 0.141 |
observation_concept_id | 0.322 | 1.000 | 0.000 | 0.886 | 1.000 | 1.000 |
observation_date | 0.166 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 |
value_as_number | 0.000 | 0.886 | 0.000 | 1.000 | 0.844 | 0.844 |
unit_concept_id | 0.141 | 1.000 | 0.000 | 0.844 | 1.000 | 1.000 |
unit_source_value | 0.141 | 1.000 | 0.000 | 0.844 | 1.000 | 1.000 |
unit_source_value | unit_concept_id | observation_concept_id | |
---|---|---|---|
unit_source_value | 1.000 | 1.000 | 0.995 |
unit_concept_id | 1.000 | 1.000 | 0.995 |
observation_concept_id | 0.995 | 0.995 | 1.000 |
observation_id | observation_date | value_as_number | observation_concept_id | unit_concept_id | unit_source_value | |
---|---|---|---|---|---|---|
observation_id | 1.000 | 0.492 | 0.132 | 0.217 | 0.237 | 0.237 |
observation_date | 0.492 | 1.000 | 0.085 | 0.000 | 0.000 | 0.000 |
value_as_number | 0.132 | 0.085 | 1.000 | 0.732 | 0.731 | 0.731 |
observation_concept_id | 0.217 | 0.000 | 0.732 | 1.000 | 0.995 | 0.995 |
unit_concept_id | 0.237 | 0.000 | 0.731 | 0.995 | 1.000 | 1.000 |
unit_source_value | 0.237 | 0.000 | 0.731 | 0.995 | 1.000 | 1.000 |
observation_id | observation_concept_id | observation_date | observation_type_concept_id | value_as_number | unit_concept_id | unit_source_value | |
---|---|---|---|---|---|---|---|
0 | 289546300003 | 4060831 | 201412 | 44814644 | 75 | 8876 | mmhg |
1 | 289546290003 | 4062019 | 201412 | 44814644 | 141 | 8876 | mmhg |
2 | 24570190000100000 | 4099154 | 201412 | 44814644 | 51 | 9529 | KG |
3 | 24570190000100000 | 4177340 | 201412 | 44814644 | 150 | 8582 | CM |
4 | 505140510002 | 4060831 | 201811 | 44814644 | 74 | 8876 | mmhg |
5 | 505140500001 | 4062019 | 201811 | 44814644 | 156 | 8876 | mmhg |
6 | 24584490000020000 | 4099154 | 201811 | 44814644 | 69 | 9529 | KG |
7 | 24584490000020000 | 4177340 | 201811 | 44814644 | 163 | 8582 | CM |
8 | 231778410001 | 4060831 | 201312 | 44814644 | 80 | 8876 | mmhg |
9 | 24566320001200000 | 4062019 | 201312 | 44814644 | 145 | 8876 | mmhg |
observation_id | observation_concept_id | observation_date | observation_type_concept_id | value_as_number | unit_concept_id | unit_source_value | |
---|---|---|---|---|---|---|---|
90 | 365384910001 | 4099154 | 201606 | 44814644 | 46 | 9529 | KG |
91 | 24575410000840000 | 4177340 | 201606 | 44814644 | 148 | 8582 | CM |
92 | 163888680004 | 4060831 | 201208 | 44814644 | 80 | 8876 | mmhg |
93 | 163888670001 | 4062019 | 201208 | 44814644 | 130 | 8876 | mmhg |
94 | 163888730001 | 4099154 | 201208 | 44814644 | 61 | 9529 | KG |
95 | 24561540000210000 | 4177340 | 201208 | 44814644 | 164 | 8582 | CM |
96 | 248142950001 | 4060831 | 201403 | 44814644 | 60 | 8876 | mmhg |
97 | 248142940001 | 4062019 | 201403 | 44814644 | 110 | 8876 | mmhg |
98 | 24567400001060000 | 4099154 | 201403 | 44814644 | 54 | 9529 | KG |
99 | 24567400001060000 | 4177340 | 201403 | 44814644 | 158 | 8582 | CM |