Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 6.1 KiB |
Average record size in memory | 62.3 B |
Variable types
Numeric | 3 |
---|---|
Categorical | 4 |
Dataset
Description | Diagnosis records for hyperlipidemia patients, produced in OMOP CDM format |
---|---|
Author | The Catholic University of Korea, Seoul St. Mary's Hospital (가톨릭대학교 서울성모병원) |
URL | http://cmcdata.net/data/dataset/_condition_occurrence_2020-omop-cdm |
Alerts
condition_status_source_value has constant value "C" | Constant |
condition_status_concept_id has constant value "4230359" | Constant |
condition_source_value is highly overall correlated with condition_concept_id and condition_type_concept_id | High correlation |
condition_type_concept_id is highly overall correlated with condition_source_value | High correlation |
condition_occurrence_id is highly overall correlated with condition_start_date | High correlation |
condition_concept_id is highly overall correlated with condition_source_value | High correlation |
condition_start_date is highly overall correlated with condition_occurrence_id | High correlation |
condition_occurrence_id has unique values | Unique |
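The Constant and Unique alerts above come down to counting distinct values per column. The following is a minimal sketch of that idea using only the first ten sample rows listed at the end of this report; the helper name `classify` is hypothetical and this is not the profiler's own implementation.

```python
# First ten sample rows from this report, restricted to five columns.
columns = [
    "condition_occurrence_id",
    "condition_concept_id",
    "condition_source_value",
    "condition_status_source_value",
    "condition_status_concept_id",
]
rows = [
    (70252909, 321318, "I20", "C", 4230359),
    (72035567, 443731, "E11", "C", 4230359),
    (58418326, 321318, "I20", "C", 4230359),
    (37121513, 80502, "M81", "C", 4230359),
    (53511004, 378416, "H35", "C", 4230359),
    (36109157, 4159131, "E78", "C", 4230359),
    (58580276, 443731, "E11", "C", 4230359),
    (38485973, 80502, "M81", "C", 4230359),
    (46695791, 378416, "H35", "C", 4230359),
    (54236221, 4294549, "F98", "C", 4230359),
]


def classify(values):
    """Return 'Constant', 'Unique', or None based on the distinct count."""
    distinct = len(set(values))
    if distinct == 1:
        return "Constant"
    if distinct == len(values):
        return "Unique"
    return None


# Map each column name to its alert label (or None when no alert applies).
alerts = {
    name: classify([row[i] for row in rows])
    for i, name in enumerate(columns)
}
```

On these ten rows the sketch flags the same columns as the full 100-row report: the two status columns as constant and `condition_occurrence_id` as unique.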
Reproduction
Analysis started | 2023-10-08 18:57:36.465424 |
---|---|
Analysis finished | 2023-10-08 18:57:39.555659 |
Duration | 3.09 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
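A report like this one can be regenerated with ydata-profiling from a tabular export of the dataset. The sketch below assumes the data is available as a CSV file; the function name `build_report`, the file paths, and the report title are hypothetical, and the configuration actually used for this report is not reproduced here.

```python
def build_report(csv_path, html_path="condition_occurrence_report.html"):
    """Profile a CSV export and write an HTML report (paths are hypothetical)."""
    # Imported inside the function so that defining it does not require
    # pandas or ydata-profiling to be installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    profile = ProfileReport(df, title="condition_occurrence profile")
    profile.to_file(html_path)
    return html_path
```

Calling `build_report("condition_occurrence.csv")` would write the HTML file next to the script, assuming both packages are installed and the CSV exists.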
condition_occurrence_id
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 52078455 |
Minimum | 30177580 |
Maximum | 75781836 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 30177580 |
---|---|
5-th percentile | 32141348 |
Q1 | 37121454 |
median | 53523540 |
Q3 | 64609179 |
95-th percentile | 75547625 |
Maximum | 75781836 |
Range | 45604256 |
Interquartile range (IQR) | 27487725 |
Descriptive statistics
Standard deviation | 14434387 |
---|---|
Coefficient of variation (CV) | 0.2771662 |
Kurtosis | -1.3597282 |
Mean | 52078455 |
Median Absolute Deviation (MAD) | 14217894 |
Skewness | 0.026315487 |
Sum | 5.2078455 × 10⁹ |
Variance | 2.0835154 × 10¹⁴ |
Monotonicity | Not monotonic |
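Several of the figures above are derived from others in the same tables, which makes them easy to cross-check. The sketch below recomputes the range, IQR, coefficient of variation, variance, and sum from the reported minimum, maximum, quartiles, mean, and standard deviation; because the inputs are rounded as printed, the derived values agree only up to rounding.

```python
import math

# Values copied from the condition_occurrence_id tables in this report.
n = 100
mean = 52078455
std = 14434387
minimum, maximum = 30177580, 75781836
q1, q3 = 37121454, 64609179

value_range = maximum - minimum  # Range
iqr = q3 - q1                    # Interquartile range (IQR)
cv = std / mean                  # Coefficient of variation (CV)
variance = std ** 2              # Variance, up to rounding of std
total = mean * n                 # Sum, up to rounding of the mean

# Compare the rounded CV with the reported figure.
cv_matches = math.isclose(cv, 0.2771662, rel_tol=1e-6)
```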
Value | Count | Frequency (%) |
70252909 | 1 | 1.0% |
72035568 | 1 | 1.0% |
70252907 | 1 | 1.0% |
75781833 | 1 | 1.0% |
47823829 | 1 | 1.0% |
41149888 | 1 | 1.0% |
55895296 | 1 | 1.0% |
67741743 | 1 | 1.0% |
72035569 | 1 | 1.0% |
40995285 | 1 | 1.0% |
Other values (90) | 90 | 90.0% |
Value | Count | Frequency (%) |
30177580 | 1 | 1.0% |
30177581 | 1 | 1.0% |
30177582 | 1 | 1.0% |
31348976 | 1 | 1.0% |
31348977 | 1 | 1.0% |
32183052 | 1 | 1.0% |
32183053 | 1 | 1.0% |
32183055 | 1 | 1.0% |
33110890 | 1 | 1.0% |
33177936 | 1 | 1.0% |
Value | Count | Frequency (%) |
75781836 | 1 | 1.0% |
75781835 | 1 | 1.0% |
75781834 | 1 | 1.0% |
75781833 | 1 | 1.0% |
75547627 | 1 | 1.0% |
75547625 | 1 | 1.0% |
72035570 | 1 | 1.0% |
72035569 | 1 | 1.0% |
72035568 | 1 | 1.0% |
72035567 | 1 | 1.0% |
condition_concept_id
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 17 |
---|---|
Distinct (%) | 17.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1658690.3 |
Minimum | 75860 |
Maximum | 4294549 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 75860 |
---|---|
5-th percentile | 80502 |
Q1 | 193518 |
median | 443731 |
Q3 | 4098483 |
95-th percentile | 4212516 |
Maximum | 4294549 |
Range | 4218689 |
Interquartile range (IQR) | 3904965 |
Descriptive statistics
Standard deviation | 1882554.6 |
---|---|
Coefficient of variation (CV) | 1.1349645 |
Kurtosis | -1.676965 |
Mean | 1658690.3 |
Median Absolute Deviation (MAD) | 363229 |
Skewness | 0.57936763 |
Sum | 1.6586903 × 10⁸ |
Variance | 3.5440117 × 10¹² |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
80502 | 22 | 22.0% |
443731 | 19 | 19.0% |
4098483 | 12 | 12.0% |
321318 | 7 | 7.0% |
378416 | 6 | 6.0% |
4159131 | 6 | 6.0% |
4193704 | 5 | 5.0% |
201820 | 5 | 5.0% |
193518 | 3 | 3.0% |
4127568 | 3 | 3.0% |
Other values (7) | 12 | 12.0% |
Value | Count | Frequency (%) |
75860 | 2 | 2.0% |
80502 | 22 | 22.0% |
193518 | 3 | 3.0% |
201820 | 5 | 5.0% |
321318 | 7 | 7.0% |
378416 | 6 | 6.0% |
443731 | 19 | 19.0% |
4001645 | 1 | 1.0% |
4098483 | 12 | 12.0% |
4102985 | 1 | 1.0% |
Value | Count | Frequency (%) |
4294549 | 2 | 2.0% |
4224741 | 2 | 2.0% |
4212516 | 2 | 2.0% |
4193704 | 5 | 5.0% |
4174977 | 2 | 2.0% |
4159131 | 6 | 6.0% |
4127568 | 3 | 3.0% |
4102985 | 1 | 1.0% |
4098483 | 12 | 12.0% |
4001645 | 1 | 1.0% |
condition_start_date
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 31 |
---|---|
Distinct (%) | 31.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 201569.14 |
Minimum | 201111 |
Maximum | 201908 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 201111 |
---|---|
5-th percentile | 201205.85 |
Q1 | 201311 |
median | 201609 |
Q3 | 201803 |
95-th percentile | 201908 |
Maximum | 201908 |
Range | 797 |
Interquartile range (IQR) | 492 |
Descriptive statistics
Standard deviation | 243.9246 |
---|---|
Coefficient of variation (CV) | 0.0012101287 |
Kurtosis | -1.2398727 |
Mean | 201569.14 |
Median Absolute Deviation (MAD) | 199 |
Skewness | -0.29656554 |
Sum | 20156914 |
Variance | 59499.213 |
Monotonicity | Not monotonic |
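The profiler typed `condition_start_date` as a real number, but its values (201111 through 201908) look like year-month codes. Under that assumption, the numeric range of 797 is not a time span. The sketch below decodes such values and counts the months between the reported minimum and maximum; the YYYYMM interpretation and both helper names are assumptions, not something the report states.

```python
def split_yyyymm(value):
    """Split an integer such as 201812 into (year, month); YYYYMM is assumed."""
    year, month = divmod(value, 100)
    if not 1 <= month <= 12:
        raise ValueError(f"{value} is not a valid YYYYMM code")
    return year, month


def months_between(start, end):
    """Number of calendar months from one YYYYMM code to another."""
    (y0, m0), (y1, m1) = split_yyyymm(start), split_yyyymm(end)
    return (y1 - y0) * 12 + (m1 - m0)


# Reported minimum and maximum of condition_start_date.
span_in_months = months_between(201111, 201908)
```

If the interpretation holds, converting the column to a proper date type before profiling would give more meaningful summary statistics than the mean and standard deviation shown above.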
Value | Count | Frequency (%) |
201808 | 10 | 10.0% |
201609 | 7 | 7.0% |
201908 | 6 | 6.0% |
201209 | 5 | 5.0% |
201903 | 4 | 4.0% |
201311 | 4 | 4.0% |
201307 | 4 | 4.0% |
201706 | 4 | 4.0% |
201801 | 4 | 4.0% |
201803 | 4 | 4.0% |
Other values (21) | 48 | 48.0% |
Value | Count | Frequency (%) |
201111 | 3 | 3.0% |
201203 | 2 | 2.0% |
201206 | 3 | 3.0% |
201209 | 5 | 5.0% |
201212 | 3 | 3.0% |
201304 | 2 | 2.0% |
201307 | 4 | 4.0% |
201309 | 1 | 1.0% |
201311 | 4 | 4.0% |
201403 | 2 | 2.0% |
Value | Count | Frequency (%) |
201908 | 6 | 6.0% |
201903 | 4 | 4.0% |
201812 | 3 | 3.0% |
201808 | 10 | 10.0% |
201807 | 1 | 1.0% |
201803 | 4 | 4.0% |
201801 | 4 | 4.0% |
201709 | 3 | 3.0% |
201706 | 4 | 4.0% |
201705 | 2 | 2.0% |
condition_type_concept_id
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 8 |
---|---|
Median length | 8 |
Mean length | 8 |
Min length | 8 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 44786629 |
---|---|
2nd row | 44786627 |
3rd row | 44786629 |
4th row | 44786629 |
5th row | 44786627 |
Common Values
Value | Count | Frequency (%) |
44786629 | 63 | 63.0% |
44786627 | 37 | 37.0% |
condition_source_value
Categorical
HIGH CORRELATION
 
Distinct | 13 |
---|---|
Distinct (%) | 13.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 1 |
---|---|
Unique (%) | 1.0% |
Sample
1st row | I20 |
---|---|
2nd row | E11 |
3rd row | I20 |
4th row | M81 |
5th row | H35 |
Common Values
Value | Count | Frequency (%) |
E11 | 24 | 24.0% |
M81 | 23 | 23.0% |
E78 | 18 | 18.0% |
H35 | 8 | 8.0% |
I20 | 7 | 7.0% |
E14 | 5 | 5.0% |
N32 | 3 | 3.0% |
K56 | 3 | 3.0% |
F98 | 2 | 2.0% |
K59 | 2 | 2.0% |
Other values (3) | 5 | 5.0% |
condition_status_source_value
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | C |
---|---|
2nd row | C |
3rd row | C |
4th row | C |
5th row | C |
Common Values
Value | Count | Frequency (%) |
C | 100 | 100.0% |
condition_status_concept_id
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 7 |
---|---|
Median length | 7 |
Mean length | 7 |
Min length | 7 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 4230359 |
---|---|
2nd row | 4230359 |
3rd row | 4230359 |
4th row | 4230359 |
5th row | 4230359 |
Common Values
Value | Count | Frequency (%) |
4230359 | 100 | 100.0% |
Variable | condition_occurrence_id | condition_concept_id | condition_start_date | condition_type_concept_id | condition_source_value |
---|---|---|---|---|---|
condition_occurrence_id | 1.000 | 0.198 | 0.962 | 0.000 | 0.318 |
condition_concept_id | 0.198 | 1.000 | 0.153 | 0.231 | 0.957 |
condition_start_date | 0.962 | 0.153 | 1.000 | 0.000 | 0.230 |
condition_type_concept_id | 0.000 | 0.231 | 0.000 | 1.000 | 0.951 |
condition_source_value | 0.318 | 0.957 | 0.230 | 0.951 | 1.000 |
Variable | condition_source_value | condition_type_concept_id |
---|---|---|
condition_source_value | 1.000 | 0.907 |
condition_type_concept_id | 0.907 | 1.000 |
Variable | condition_occurrence_id | condition_concept_id | condition_start_date | condition_type_concept_id | condition_source_value |
---|---|---|---|---|---|
condition_occurrence_id | 1.000 | -0.100 | 0.999 | 0.000 | 0.149 |
condition_concept_id | -0.100 | 1.000 | -0.097 | 0.131 | 0.779 |
condition_start_date | 0.999 | -0.097 | 1.000 | 0.000 | 0.030 |
condition_type_concept_id | 0.000 | 0.131 | 0.000 | 1.000 | 0.907 |
condition_source_value | 0.149 | 0.779 | 0.030 | 0.907 | 1.000 |
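One common way to measure association between two categorical columns, such as `condition_source_value` and `condition_type_concept_id`, is Cramér's V. The sketch below implements the plain (uncorrected) form from a contingency table and applies it to the first ten sample rows of this report. Whether the matrices above were computed with exactly this statistic, or with a bias-corrected variant, is an assumption, and ten rows cannot reproduce the reported coefficients.

```python
import math
from collections import Counter


def cramers_v(xs, ys):
    """Uncorrected Cramér's V for two equal-length sequences of categories."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    x_totals, y_totals = Counter(xs), Counter(ys)
    chi2 = 0.0
    for x, nx in x_totals.items():
        for y, ny in y_totals.items():
            expected = nx * ny / n
            # Counter returns 0 for pairs that never occur together.
            chi2 += (joint[(x, y)] - expected) ** 2 / expected
    k = min(len(x_totals), len(y_totals)) - 1
    return math.sqrt(chi2 / (n * k)) if k > 0 else 0.0


# condition_source_value and condition_type_concept_id, sample rows 0 to 9.
source = ["I20", "E11", "I20", "M81", "H35", "E78", "E11", "M81", "H35", "F98"]
ctype = [44786629, 44786627, 44786629, 44786629, 44786627,
         44786629, 44786627, 44786629, 44786627, 44786629]

v_sample = cramers_v(source, ctype)
```

In these ten rows every diagnosis code appears with a single type concept, so the statistic is at its maximum of 1. The coefficients reported for all 100 rows are lower, which is consistent with sample row 97, where H35 appears with the other type concept.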
First rows
Row | condition_occurrence_id | condition_concept_id | condition_start_date | condition_type_concept_id | condition_source_value | condition_status_source_value | condition_status_concept_id |
---|---|---|---|---|---|---|---|
0 | 70252909 | 321318 | 201812 | 44786629 | I20 | C | 4230359 |
1 | 72035567 | 443731 | 201903 | 44786627 | E11 | C | 4230359 |
2 | 58418326 | 321318 | 201705 | 44786629 | I20 | C | 4230359 |
3 | 37121513 | 80502 | 201311 | 44786629 | M81 | C | 4230359 |
4 | 53511004 | 378416 | 201609 | 44786627 | H35 | C | 4230359 |
5 | 36109157 | 4159131 | 201307 | 44786629 | E78 | C | 4230359 |
6 | 58580276 | 443731 | 201706 | 44786627 | E11 | C | 4230359 |
7 | 38485973 | 80502 | 201403 | 44786629 | M81 | C | 4230359 |
8 | 46695791 | 378416 | 201509 | 44786627 | H35 | C | 4230359 |
9 | 54236221 | 4294549 | 201610 | 44786629 | F98 | C | 4230359 |
Last rows
Row | condition_occurrence_id | condition_concept_id | condition_start_date | condition_type_concept_id | condition_source_value | condition_status_source_value | condition_status_concept_id |
---|---|---|---|---|---|---|---|
90 | 32183053 | 201820 | 201206 | 44786629 | E14 | C | 4230359 |
91 | 36510933 | 378416 | 201309 | 44786627 | H35 | C | 4230359 |
92 | 53671092 | 4294549 | 201609 | 44786629 | F98 | C | 4230359 |
93 | 67741742 | 193518 | 201808 | 44786627 | K56 | C | 4230359 |
94 | 53523541 | 4098483 | 201609 | 44786629 | E78 | C | 4230359 |
95 | 36109156 | 201820 | 201307 | 44786629 | E14 | C | 4230359 |
96 | 30177582 | 4159131 | 201111 | 44786629 | E78 | C | 4230359 |
97 | 46695793 | 4224741 | 201509 | 44786629 | H35 | C | 4230359 |
98 | 63273384 | 321318 | 201801 | 44786629 | I20 | C | 4230359 |
99 | 53523539 | 443731 | 201609 | 44786627 | E11 | C | 4230359 |