Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 6.2 KiB |
Average record size in memory | 63.3 B |
Variable types
Numeric | 3 |
---|---|
Categorical | 4 |
Dataset
Description | 알코올 사용장애 환자의 관찰 기록을 OMOP CDM 형식으로 생산한 데이터 |
---|---|
Author | 가톨릭대학교 서울성모병원 |
URL | http://cmcdata.net/data/dataset/alcohol_observation_2020-omop-cdm |
observation_type_concept_id has constant value "" | Constant |
unit_concept_id is highly overall correlated with value_as_number and 2 other fields | High correlation |
unit_source_value is highly overall correlated with value_as_number and 2 other fields | High correlation |
observation_concept_id is highly overall correlated with value_as_number and 2 other fields | High correlation |
value_as_number is highly overall correlated with observation_concept_id and 2 other fields | High correlation |
Reproduction
Analysis started | 2023-10-08 18:56:30.035754 |
---|---|
Analysis finished | 2023-10-08 18:56:32.826761 |
Duration | 2.79 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
observation_id
Real number (ℝ)
Distinct | 85 |
---|---|
Distinct (%) | 85.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.1057774 × 1016 |
Minimum | 9.643633 × 1010 |
---|---|
Maximum | 2.458426 × 1016 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 9.643633 × 1010 |
---|---|
5-th percentile | 1.0894539 × 1011 |
Q1 | 2.5217985 × 1011 |
median | 5.7121808 × 1011 |
Q3 | 2.457347 × 1016 |
95-th percentile | 2.4580041 × 1016 |
Maximum | 2.458426 × 1016 |
Range | 2.4584164 × 1016 |
Interquartile range (IQR) | 2.4573218 × 1016 |
Descriptive statistics
Standard deviation | 1.2286038 × 1016 |
---|---|
Coefficient of variation (CV) | 1.1110769 |
Kurtosis | -1.9987369 |
Mean | 1.1057774 × 1016 |
Median Absolute Deviation (MAD) | 4.6227269 × 1011 |
Skewness | 0.20408207 |
Sum | 1.1057774 × 1018 |
Variance | 1.5094672 × 1032 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
24573470000580000 | 2 | 2.0% |
24576200001000000 | 2 | 2.0% |
24580240000560000 | 2 | 2.0% |
24576600000350000 | 2 | 2.0% |
24574630000330000 | 2 | 2.0% |
24571000000170000 | 2 | 2.0% |
24567060000950000 | 2 | 2.0% |
24575830000230000 | 2 | 2.0% |
24578820001000000 | 2 | 2.0% |
24579130000460000 | 2 | 2.0% |
Other values (75) | 80 |
Value | Count | Frequency (%) |
96436330006 | 1 | |
98268710001 | 1 | |
99237860001 | 1 | |
99411780001 | 1 | |
108945380001 | 1 | |
108945390001 | 1 | |
130448900001 | 1 | |
130448910001 | 1 | |
141816890001 | 1 | |
141816900001 | 1 |
Value | Count | Frequency (%) |
24584260000040000 | 1 | |
24582470000780000 | 1 | |
24580610000130000 | 1 | |
24580240000560000 | 2 | |
24580030000390000 | 1 | |
24579130000460000 | 2 | |
24578820001000000 | 2 | |
24577650000420000 | 2 | |
24576600000350000 | 2 | |
24576200001000000 | 2 |
observation_concept_id
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
4099154 | |
---|---|
4177340 |
Length
Max length | 7 |
---|---|
Median length | 7 |
Mean length | 7 |
Min length | 7 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 4099154 |
---|---|
2nd row | 4177340 |
3rd row | 4099154 |
4th row | 4177340 |
5th row | 4099154 |
Common Values
Value | Count | Frequency (%) |
4099154 | 52 | |
4177340 | 48 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
4099154 | 52 | |
4177340 | 48 |
observation_date
Real number (ℝ)
Distinct | 39 |
---|---|
Distinct (%) | 39.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 201533.11 |
Minimum | 201104 |
---|---|
Maximum | 201911 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 201104 |
---|---|
5-th percentile | 201104 |
Q1 | 201312 |
median | 201556 |
Q3 | 201709.5 |
95-th percentile | 201910 |
Maximum | 201911 |
Range | 807 |
Interquartile range (IQR) | 397.5 |
Descriptive statistics
Standard deviation | 254.39327 |
---|---|
Coefficient of variation (CV) | 0.0012622902 |
Kurtosis | -1.0771982 |
Mean | 201533.11 |
Median Absolute Deviation (MAD) | 155 |
Skewness | -0.1511825 |
Sum | 20153311 |
Variance | 64715.937 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
201104 | 7 | 7.0% |
201402 | 5 | 5.0% |
201605 | 4 | 4.0% |
201404 | 4 | 4.0% |
201403 | 4 | 4.0% |
201709 | 4 | 4.0% |
201910 | 4 | 4.0% |
201904 | 4 | 4.0% |
201706 | 3 | 3.0% |
201905 | 3 | 3.0% |
Other values (29) | 58 |
Value | Count | Frequency (%) |
201104 | 7 | |
201107 | 2 | 2.0% |
201112 | 2 | 2.0% |
201203 | 2 | 2.0% |
201204 | 2 | 2.0% |
201211 | 2 | 2.0% |
201301 | 2 | 2.0% |
201305 | 2 | 2.0% |
201309 | 1 | 1.0% |
201310 | 2 | 2.0% |
Value | Count | Frequency (%) |
201911 | 2 | |
201910 | 4 | |
201905 | 3 | |
201904 | 4 | |
201812 | 2 | |
201811 | 2 | |
201809 | 2 | |
201805 | 2 | |
201804 | 2 | |
201711 | 2 |
observation_type_concept_id
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
44814644 |
---|
Length
Max length | 8 |
---|---|
Median length | 8 |
Mean length | 8 |
Min length | 8 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 44814644 |
---|---|
2nd row | 44814644 |
3rd row | 44814644 |
4th row | 44814644 |
5th row | 44814644 |
Common Values
Value | Count | Frequency (%) |
44814644 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
44814644 | 100 |
value_as_number
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 76 |
---|---|
Distinct (%) | 76.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 115.0885 |
Minimum | 29 |
---|---|
Maximum | 189 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 29 |
---|---|
5-th percentile | 51.765 |
Q1 | 65.75 |
median | 92 |
Q3 | 168 |
95-th percentile | 178.1 |
Maximum | 189 |
Range | 160 |
Interquartile range (IQR) | 102.25 |
Descriptive statistics
Standard deviation | 52.25436 |
---|---|
Coefficient of variation (CV) | 0.45403633 |
Kurtosis | -1.8201058 |
Mean | 115.0885 |
Median Absolute Deviation (MAD) | 43.5 |
Skewness | 0.035975827 |
Sum | 11508.85 |
Variance | 2730.5182 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
170.0 | 4 | 4.0% |
175.0 | 4 | 4.0% |
168.0 | 4 | 4.0% |
167.0 | 3 | 3.0% |
65.0 | 3 | 3.0% |
75.0 | 3 | 3.0% |
70.0 | 2 | 2.0% |
176.0 | 2 | 2.0% |
172.0 | 2 | 2.0% |
173.0 | 2 | 2.0% |
Other values (66) | 71 |
Value | Count | Frequency (%) |
29.0 | 1 | |
38.0 | 1 | |
46.0 | 1 | |
51.0 | 1 | |
51.1 | 1 | |
51.8 | 1 | |
52.0 | 1 | |
52.8 | 1 | |
53.8 | 1 | |
54.0 | 2 |
Value | Count | Frequency (%) |
189.0 | 1 | 1.0% |
186.0 | 1 | 1.0% |
184.3 | 1 | 1.0% |
181.0 | 1 | 1.0% |
180.0 | 1 | 1.0% |
178.0 | 1 | 1.0% |
177.6 | 1 | 1.0% |
177.0 | 1 | 1.0% |
176.0 | 2 | |
175.0 | 4 |
unit_concept_id
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
9529 | |
---|---|
8582 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 9529 |
---|---|
2nd row | 8582 |
3rd row | 9529 |
4th row | 8582 |
5th row | 9529 |
Common Values
Value | Count | Frequency (%) |
9529 | 52 | |
8582 | 48 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
9529 | 52 | |
8582 | 48 |
unit_source_value
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
KG | |
---|---|
CM |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | KG |
---|---|
2nd row | CM |
3rd row | KG |
4th row | CM |
5th row | KG |
Common Values
Value | Count | Frequency (%) |
KG | 52 | |
CM | 48 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
kg | 52 | |
cm | 48 |
observation_id | observation_concept_id | observation_date | value_as_number | unit_concept_id | unit_source_value | |
---|---|---|---|---|---|---|
observation_id | 1.000 | 0.000 | 0.705 | 0.192 | 0.000 | 0.000 |
observation_concept_id | 0.000 | 1.000 | 0.000 | 1.000 | 0.999 | 0.999 |
observation_date | 0.705 | 0.000 | 1.000 | 0.245 | 0.000 | 0.000 |
value_as_number | 0.192 | 1.000 | 0.245 | 1.000 | 1.000 | 1.000 |
unit_concept_id | 0.000 | 0.999 | 0.000 | 1.000 | 1.000 | 0.999 |
unit_source_value | 0.000 | 0.999 | 0.000 | 1.000 | 0.999 | 1.000 |
unit_concept_id | unit_source_value | observation_concept_id | |
---|---|---|---|
unit_concept_id | 1.000 | 0.980 | 0.980 |
unit_source_value | 0.980 | 1.000 | 0.980 |
observation_concept_id | 0.980 | 0.980 | 1.000 |
observation_id | observation_date | value_as_number | observation_concept_id | unit_concept_id | unit_source_value | |
---|---|---|---|---|---|---|
observation_id | 1.000 | 0.442 | 0.068 | 0.000 | 0.000 | 0.000 |
observation_date | 0.442 | 1.000 | 0.043 | 0.000 | 0.000 | 0.000 |
value_as_number | 0.068 | 0.043 | 1.000 | 0.964 | 0.964 | 0.964 |
observation_concept_id | 0.000 | 0.000 | 0.964 | 1.000 | 0.980 | 0.980 |
unit_concept_id | 0.000 | 0.000 | 0.964 | 0.980 | 1.000 | 0.980 |
unit_source_value | 0.000 | 0.000 | 0.964 | 0.980 | 0.980 | 1.000 |
observation_id | observation_concept_id | observation_date | observation_type_concept_id | value_as_number | unit_concept_id | unit_source_value | |
---|---|---|---|---|---|---|---|
0 | 24573470000580000 | 4099154 | 201511 | 44814644 | 92.0 | 9529 | KG |
1 | 24573470000580000 | 4177340 | 201511 | 44814644 | 189.0 | 8582 | CM |
2 | 144652320001 | 4099154 | 201204 | 44814644 | 65.0 | 9529 | KG |
3 | 144652310001 | 4177340 | 201204 | 44814644 | 166.0 | 8582 | CM |
4 | 24575110000630000 | 4099154 | 201605 | 44814644 | 75.0 | 9529 | KG |
5 | 24575110000630000 | 4177340 | 201605 | 44814644 | 178.0 | 8582 | CM |
6 | 509727380001 | 4099154 | 201812 | 44814644 | 74.0 | 9529 | KG |
7 | 509727370001 | 4177340 | 201812 | 44814644 | 180.0 | 8582 | CM |
8 | 24565700000280000 | 4099154 | 201310 | 44814644 | 75.0 | 9529 | KG |
9 | 222981690001 | 4177340 | 201310 | 44814644 | 175.0 | 8582 | CM |
observation_id | observation_concept_id | observation_date | observation_type_concept_id | value_as_number | unit_concept_id | unit_source_value | |
---|---|---|---|---|---|---|---|
90 | 495829870001 | 4099154 | 201809 | 44814644 | 75.3 | 9529 | KG |
91 | 495829860001 | 4177340 | 201809 | 44814644 | 172.9 | 8582 | CM |
92 | 233429680003 | 4099154 | 201312 | 44814644 | 52.8 | 9529 | KG |
93 | 24566430001620000 | 4177340 | 201312 | 44814644 | 161.0 | 8582 | CM |
94 | 24569710001510000 | 4099154 | 201411 | 44814644 | 74.1 | 9529 | KG |
95 | 24569710001510000 | 4177340 | 201411 | 44814644 | 184.3 | 8582 | CM |
96 | 24564400000380000 | 4099154 | 201305 | 44814644 | 61.45 | 9529 | KG |
97 | 204336800001 | 4177340 | 201305 | 44814644 | 160.2 | 8582 | CM |
98 | 24562440000190000 | 4099154 | 201211 | 44814644 | 51.8 | 9529 | KG |
99 | 24562440000190000 | 4177340 | 201211 | 44814644 | 161.6 | 8582 | CM |