Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 100 |
Missing cells | 3 |
Missing cells (%) | 0.3% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 7.8 KiB |
Average record size in memory | 80.3 B |
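The missing-cell percentage follows directly from the counts in this table. A minimal sketch, assuming a "cell" is one variable value in one observation, so the dataset has 9 × 100 cells; the variable names are illustrative only:

```python
# Counts copied from the Dataset statistics table.
n_variables = 9
n_observations = 100
missing_cells = 3

# Assumption: every observation contributes one cell per variable.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"{missing_pct:.1f}%")  # 0.3%
```

The same 3 missing cells amount to 3.0% of the single TG_VAL column, because there the denominator is 100 observations rather than 900 cells.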
Variable types
Numeric | 7 |
---|---|
DateTime | 2 |
Dataset
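Description | Contains laboratory data for evaluating the association between the blood test results of patients with diabetes and their comorbidities. The test items are HbA1c, TC, TG, HDL, and LDL, which allow assessment of nephropathy, retinopathy, myocardial infarction, cataract, and vascular disease. - HbA1c (glycated hemoglobin): hemoglobin in red blood cells to which glucose has bound. An ordinary blood glucose test shows only the glucose level at the time of testing, whereas HbA1c reflects the average blood glucose over 3 months. - Total Cholesterol (TC): all cholesterol in the blood. - Triglyceride (TG): the amount of triglycerides in the blood. Why blood triglycerides rise is not clear, but elevated levels are associated with an increased risk of progression to cardiovascular disease. - HDL (High Density Lipoprotein Cholesterol): high-density lipoprotein cholesterol, also called good cholesterol, which absorbs cholesterol and carries it back to the liver. High HDL cholesterol can lower the risk of heart disease and stroke. - LDL (Low Density Lipoprotein Cholesterol): low-density lipoprotein cholesterol, also called bad cholesterol. It makes up most of the body's cholesterol, and high levels raise the risk of heart disease and stroke. |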
---|---|
Author | 가톨릭대학교 은평성모병원 |
URL | http://cmcdata.net/data/dataset/diabetes_coexlab-eunpyeong |
Alerts
A1C_VAL is highly overall correlated with A1C_VAL_C | High correlation |
A1C_VAL_C is highly overall correlated with A1C_VAL | High correlation |
TC_VAL is highly overall correlated with LDL_VAL | High correlation |
LDL_VAL is highly overall correlated with TC_VAL | High correlation |
TG_VAL has 3 (3.0%) missing values | Missing |
RID has unique values | Unique |
Reproduction
Analysis started | 2023-10-08 18:55:55.787139 |
---|---|
Analysis finished | 2023-10-08 18:56:09.893502 |
Duration | 14.11 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
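A report of this kind can be regenerated with a few lines of Python. The sketch below is an assumption-laden outline, not the configuration actually used here: the CSV path, the report title, and the `build_report` helper are hypothetical, and the settings stored in config.json are not reproduced, so library defaults apply.

```python
from pathlib import Path


def build_report(csv_path: str, out_path: str = "report.html") -> Path:
    """Profile a CSV export of the dataset and write an HTML report.

    Hypothetical helper: the file layout is assumed, not taken from the source.
    """
    # Imported lazily so this module still loads where the libraries are absent.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: both date columns are stored as text and need parsing.
    df = pd.read_csv(csv_path, parse_dates=["A1C_DATE", "TC/TG/HDL/LDL_DATE"])

    # Default settings; the original config.json is not available here.
    report = ProfileReport(df, title="Diabetes comorbidity lab data")
    report.to_file(out_path)
    return Path(out_path)
```

Calling `build_report("diabetes_coexlab.csv")` with a local copy of the data would write `report.html`; the file name is a placeholder.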
RID
Real number (ℝ)
UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 50.5 |
Minimum | 1 |
---|---|
Maximum | 100 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 5.95 |
Q1 | 25.75 |
median | 50.5 |
Q3 | 75.25 |
95-th percentile | 95.05 |
Maximum | 100 |
Range | 99 |
Interquartile range (IQR) | 49.5 |
Descriptive statistics
Standard deviation | 29.011492 |
---|---|
Coefficient of variation (CV) | 0.57448499 |
Kurtosis | -1.2 |
Mean | 50.5 |
Median Absolute Deviation (MAD) | 25 |
Skewness | 0 |
Sum | 5050 |
Variance | 841.66667 |
Monotonicity | Strictly increasing |
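Because RID is strictly increasing with 100 distinct values between 1 and 100, it can only be the integers 1 to 100, which makes the figures above easy to check by hand. The sketch below assumes the sample variance (divisor n − 1) and linear interpolation for percentiles; those choices reproduce the reported numbers, but they are inferred rather than stated in the report.

```python
import statistics

# RID must be 1..100: 100 distinct values, minimum 1, maximum 100.
rid = list(range(1, 101))


def percentile(values, p):
    """Percentile using linear interpolation between the two closest ranks."""
    ordered = sorted(values)
    pos = (len(ordered) - 1) * p
    lower = int(pos)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (pos - lower)


mean = statistics.mean(rid)
variance = statistics.variance(rid)  # sample variance, divisor n - 1
std = statistics.stdev(rid)
cv = std / mean                      # coefficient of variation
median = statistics.median(rid)
mad = statistics.median(abs(x - median) for x in rid)  # median absolute deviation
iqr = percentile(rid, 0.75) - percentile(rid, 0.25)

print(mean, round(variance, 5), round(std, 6), round(cv, 8), mad, iqr)
```

An identifier column like this carries no clinical information, so its statistics are mainly a sanity check on the profiling run.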
Value | Count | Frequency (%) |
1 | 1 | 1.0% |
65 | 1 | 1.0% |
75 | 1 | 1.0% |
74 | 1 | 1.0% |
73 | 1 | 1.0% |
72 | 1 | 1.0% |
71 | 1 | 1.0% |
70 | 1 | 1.0% |
69 | 1 | 1.0% |
68 | 1 | 1.0% |
Other values (90) | 90 | 90.0% |
Value | Count | Frequency (%) |
1 | 1 | 1.0% |
2 | 1 | 1.0% |
3 | 1 | 1.0% |
4 | 1 | 1.0% |
5 | 1 | 1.0% |
6 | 1 | 1.0% |
7 | 1 | 1.0% |
8 | 1 | 1.0% |
9 | 1 | 1.0% |
10 | 1 | 1.0% |
Value | Count | Frequency (%) |
100 | 1 | 1.0% |
99 | 1 | 1.0% |
98 | 1 | 1.0% |
97 | 1 | 1.0% |
96 | 1 | 1.0% |
95 | 1 | 1.0% |
94 | 1 | 1.0% |
93 | 1 | 1.0% |
92 | 1 | 1.0% |
91 | 1 | 1.0% |
A1C_DATE
Date
Distinct | 95 |
---|---|
Distinct (%) | 95.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Minimum | 2015-10-01 00:00:00 |
---|---|
Maximum | 2020-01-07 00:00:00 |
A1C_VAL
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 52 |
---|---|
Distinct (%) | 52.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 8.052 |
Minimum | 4.7 |
---|---|
Maximum | 14.7 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 4.7 |
---|---|
5-th percentile | 5.695 |
Q1 | 6.5 |
median | 7.2 |
Q3 | 9.3 |
95-th percentile | 12.805 |
Maximum | 14.7 |
Range | 10 |
Interquartile range (IQR) | 2.8 |
Descriptive statistics
Standard deviation | 2.1686695 |
---|---|
Coefficient of variation (CV) | 0.26933302 |
Kurtosis | 0.36793447 |
Mean | 8.052 |
Median Absolute Deviation (MAD) | 1.1 |
Skewness | 1.0440809 |
Sum | 805.2 |
Variance | 4.7031273 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
6.6 | 6 | 6.0% |
6.5 | 6 | 6.0% |
7.1 | 5 | 5.0% |
8.0 | 5 | 5.0% |
6.8 | 4 | 4.0% |
10.0 | 3 | 3.0% |
7.2 | 3 | 3.0% |
8.3 | 3 | 3.0% |
11.0 | 3 | 3.0% |
5.8 | 3 | 3.0% |
Other values (42) | 59 | 59.0% |
Value | Count | Frequency (%) |
4.7 | 1 | 1.0% |
5.4 | 1 | 1.0% |
5.5 | 1 | 1.0% |
5.6 | 2 | 2.0% |
5.7 | 2 | 2.0% |
5.8 | 3 | 3.0% |
5.9 | 1 | 1.0% |
6.0 | 3 | 3.0% |
6.1 | 1 | 1.0% |
6.2 | 3 | 3.0% |
Value | Count | Frequency (%) |
14.7 | 1 | 1.0% |
13.2 | 1 | 1.0% |
13.1 | 2 | 2.0% |
12.9 | 1 | 1.0% |
12.8 | 1 | 1.0% |
12.4 | 2 | 2.0% |
11.5 | 1 | 1.0% |
11.1 | 1 | 1.0% |
11.0 | 3 | 3.0% |
10.9 | 1 | 1.0% |
A1C_VAL_C
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 11 |
---|---|
Distinct (%) | 11.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 7.64 |
Minimum | 4 |
---|---|
Maximum | 14 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 4 |
---|---|
5-th percentile | 5 |
Q1 | 6 |
median | 7 |
Q3 | 9 |
95-th percentile | 12 |
Maximum | 14 |
Range | 10 |
Interquartile range (IQR) | 3 |
Descriptive statistics
Standard deviation | 2.2134575 |
---|---|
Coefficient of variation (CV) | 0.28971956 |
Kurtosis | 0.1877693 |
Mean | 7.64 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 0.94712288 |
Sum | 764 |
Variance | 4.8993939 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
6 | 29 | 29.0% |
7 | 18 | 18.0% |
8 | 15 | 15.0% |
5 | 10 | 10.0% |
10 | 8 | 8.0% |
9 | 6 | 6.0% |
11 | 5 | 5.0% |
12 | 4 | 4.0% |
13 | 3 | 3.0% |
4 | 1 | 1.0% |
Value | Count | Frequency (%) |
4 | 1 | 1.0% |
5 | 10 | 10.0% |
6 | 29 | 29.0% |
7 | 18 | 18.0% |
8 | 15 | 15.0% |
9 | 6 | 6.0% |
10 | 8 | 8.0% |
11 | 5 | 5.0% |
12 | 4 | 4.0% |
13 | 3 | 3.0% |
Value | Count | Frequency (%) |
14 | 1 | 1.0% |
13 | 3 | 3.0% |
12 | 4 | 4.0% |
11 | 5 | 5.0% |
10 | 8 | 8.0% |
9 | 6 | 6.0% |
8 | 15 | 15.0% |
7 | 18 | 18.0% |
6 | 29 | 29.0% |
5 | 10 | 10.0% |
TC/TG/HDL/LDL_DATE
Date
Distinct | 92 |
---|---|
Distinct (%) | 92.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Minimum | 2015-10-01 00:00:00 |
---|---|
Maximum | 2020-01-13 00:00:00 |
TC_VAL
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 76 |
---|---|
Distinct (%) | 76.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 170.59 |
Minimum | 77 |
---|---|
Maximum | 412 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 77 |
---|---|
5-th percentile | 102.9 |
Q1 | 131.75 |
median | 164.5 |
Q3 | 196.5 |
95-th percentile | 265.6 |
Maximum | 412 |
Range | 335 |
Interquartile range (IQR) | 64.75 |
Descriptive statistics
Standard deviation | 55.756704 |
---|---|
Coefficient of variation (CV) | 0.32684626 |
Kurtosis | 2.9505278 |
Mean | 170.59 |
Median Absolute Deviation (MAD) | 34 |
Skewness | 1.2493937 |
Sum | 17059 |
Variance | 3108.81 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
114 | 4 | 4.0% |
154 | 3 | 3.0% |
178 | 3 | 3.0% |
166 | 3 | 3.0% |
159 | 3 | 3.0% |
115 | 2 | 2.0% |
155 | 2 | 2.0% |
209 | 2 | 2.0% |
189 | 2 | 2.0% |
174 | 2 | 2.0% |
Other values (66) | 74 | 74.0% |
Value | Count | Frequency (%) |
77 | 1 | 1.0% |
79 | 1 | 1.0% |
84 | 1 | 1.0% |
99 | 1 | 1.0% |
101 | 1 | 1.0% |
103 | 1 | 1.0% |
104 | 1 | 1.0% |
108 | 2 | 2.0% |
110 | 1 | 1.0% |
111 | 1 | 1.0% |
Value | Count | Frequency (%) |
412 | 1 | 1.0% |
329 | 1 | 1.0% |
296 | 1 | 1.0% |
291 | 1 | 1.0% |
277 | 1 | 1.0% |
265 | 1 | 1.0% |
260 | 1 | 1.0% |
255 | 1 | 1.0% |
244 | 1 | 1.0% |
243 | 1 | 1.0% |
TG_VAL
Real number (ℝ)
MISSING
 
Distinct | 78 |
---|---|
Distinct (%) | 80.4% |
Missing | 3 |
Missing (%) | 3.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 168.31959 |
Minimum | 49 |
---|---|
Maximum | 839 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 49 |
---|---|
5-th percentile | 58.6 |
Q1 | 82 |
median | 118 |
Q3 | 179 |
95-th percentile | 619.4 |
Maximum | 839 |
Range | 790 |
Interquartile range (IQR) | 97 |
Descriptive statistics
Standard deviation | 158.04565 |
---|---|
Coefficient of variation (CV) | 0.93896173 |
Kurtosis | 9.0201562 |
Mean | 168.31959 |
Median Absolute Deviation (MAD) | 42 |
Skewness | 2.9942776 |
Sum | 16327 |
Variance | 24978.428 |
Monotonicity | Not monotonic |
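TG_VAL is the only variable with missing values, and that changes some denominators. The sketch below checks, under the assumption that the mean and Distinct (%) are computed over the 97 non-missing values while Missing (%) uses all 100 observations, that the reported figures are mutually consistent. The variable names are illustrative.

```python
# Figures copied from the TG_VAL tables above.
n_observations = 100
n_missing = 3
n_distinct = 78
value_sum = 16327
std = 158.04565

n_present = n_observations - n_missing       # 97 usable values

mean = value_sum / n_present                 # mean over non-missing values
distinct_pct = 100 * n_distinct / n_present  # distinct share of non-missing values
missing_pct = 100 * n_missing / n_observations
cv = std / mean                              # coefficient of variation

print(round(mean, 5), f"{distinct_pct:.1f}%", f"{missing_pct:.1f}%", round(cv, 4))
```

The large gap between the mean (about 168) and the median (118), together with a skewness near 3, reflects the handful of values above 600 in the upper tail.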
Value | Count | Frequency (%) |
78 | 3 | 3.0% |
95 | 2 | 2.0% |
191 | 2 | 2.0% |
104 | 2 | 2.0% |
71 | 2 | 2.0% |
82 | 2 | 2.0% |
59 | 2 | 2.0% |
75 | 2 | 2.0% |
169 | 2 | 2.0% |
136 | 2 | 2.0% |
Other values (68) | 76 | 76.0% |
(Missing) | 3 | 3.0% |
Value | Count | Frequency (%) |
49 | 1 | 1.0% |
50 | 1 | 1.0% |
55 | 2 | 2.0% |
57 | 1 | 1.0% |
59 | 2 | 2.0% |
62 | 1 | 1.0% |
68 | 1 | 1.0% |
71 | 2 | 2.0% |
72 | 1 | 1.0% |
74 | 2 | 2.0% |
Value | Count | Frequency (%) |
839 | 1 | 1.0% |
829 | 1 | 1.0% |
752 | 1 | 1.0% |
686 | 1 | 1.0% |
661 | 1 | 1.0% |
609 | 1 | 1.0% |
328 | 1 | 1.0% |
295 | 1 | 1.0% |
287 | 1 | 1.0% |
265 | 1 | 1.0% |
HDL_VAL
Real number (ℝ)
Distinct | 40 |
---|---|
Distinct (%) | 40.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 43.27 |
Minimum | 9 |
---|---|
Maximum | 74 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 9 |
---|---|
5-th percentile | 26.95 |
Q1 | 35.75 |
median | 44 |
Q3 | 51 |
95-th percentile | 58.05 |
Maximum | 74 |
Range | 65 |
Interquartile range (IQR) | 15.25 |
Descriptive statistics
Standard deviation | 11.188789 |
---|---|
Coefficient of variation (CV) | 0.25858074 |
Kurtosis | 0.51957714 |
Mean | 43.27 |
Median Absolute Deviation (MAD) | 8 |
Skewness | -0.15377708 |
Sum | 4327 |
Variance | 125.18899 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
52 | 9 | 9.0% |
50 | 6 | 6.0% |
47 | 6 | 6.0% |
36 | 5 | 5.0% |
51 | 5 | 5.0% |
49 | 5 | 5.0% |
43 | 4 | 4.0% |
44 | 4 | 4.0% |
39 | 4 | 4.0% |
40 | 4 | 4.0% |
Other values (30) | 48 | 48.0% |
Value | Count | Frequency (%) |
9 | 1 | 1.0% |
15 | 1 | 1.0% |
25 | 2 | 2.0% |
26 | 1 | 1.0% |
27 | 2 | 2.0% |
28 | 3 | 3.0% |
29 | 3 | 3.0% |
30 | 1 | 1.0% |
31 | 2 | 2.0% |
32 | 2 | 2.0% |
Value | Count | Frequency (%) |
74 | 1 | 1.0% |
71 | 1 | 1.0% |
68 | 1 | 1.0% |
61 | 1 | 1.0% |
59 | 1 | 1.0% |
58 | 2 | 2.0% |
57 | 2 | 2.0% |
56 | 1 | 1.0% |
54 | 2 | 2.0% |
53 | 2 | 2.0% |
LDL_VAL
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 73 |
---|---|
Distinct (%) | 73.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 95.46 |
Minimum | 28 |
---|---|
Maximum | 231 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 28 |
---|---|
5-th percentile | 41.95 |
Q1 | 68.75 |
median | 87 |
Q3 | 115.25 |
95-th percentile | 166.4 |
Maximum | 231 |
Range | 203 |
Interquartile range (IQR) | 46.5 |
Descriptive statistics
Standard deviation | 39.268673 |
---|---|
Coefficient of variation (CV) | 0.41136259 |
Kurtosis | 0.83643485 |
Mean | 95.46 |
Median Absolute Deviation (MAD) | 22.5 |
Skewness | 0.88642099 |
Sum | 9546 |
Variance | 1542.0287 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
80 | 4 | 4.0% |
81 | 3 | 3.0% |
62 | 3 | 3.0% |
104 | 3 | 3.0% |
102 | 3 | 3.0% |
107 | 3 | 3.0% |
69 | 3 | 3.0% |
34 | 2 | 2.0% |
86 | 2 | 2.0% |
79 | 2 | 2.0% |
Other values (63) | 72 | 72.0% |
Value | Count | Frequency (%) |
28 | 1 | 1.0% |
34 | 2 | 2.0% |
38 | 1 | 1.0% |
41 | 1 | 1.0% |
42 | 1 | 1.0% |
44 | 1 | 1.0% |
47 | 1 | 1.0% |
48 | 2 | 2.0% |
51 | 1 | 1.0% |
54 | 2 | 2.0% |
Value | Count | Frequency (%) |
231 | 1 | 1.0% |
194 | 1 | 1.0% |
193 | 1 | 1.0% |
180 | 1 | 1.0% |
174 | 1 | 1.0% |
166 | 1 | 1.0% |
163 | 2 | 2.0% |
154 | 1 | 1.0% |
153 | 1 | 1.0% |
152 | 1 | 1.0% |
Correlations
| | RID | A1C_DATE | A1C_VAL | A1C_VAL_C | TC/TG/HDL/LDL_DATE | TC_VAL | TG_VAL | HDL_VAL | LDL_VAL |
---|---|---|---|---|---|---|---|---|---|
RID | 1.000 | 0.941 | 0.302 | 0.315 | 0.953 | 0.000 | 0.242 | 0.000 | 0.000 |
A1C_DATE | 0.941 | 1.000 | 0.943 | 0.939 | 0.999 | 0.944 | 0.972 | 0.960 | 0.889 |
A1C_VAL | 0.302 | 0.943 | 1.000 | 0.979 | 0.902 | 0.000 | 0.000 | 0.083 | 0.000 |
A1C_VAL_C | 0.315 | 0.939 | 0.979 | 1.000 | 0.898 | 0.000 | 0.000 | 0.000 | 0.113 |
TC/TG/HDL/LDL_DATE | 0.953 | 0.999 | 0.902 | 0.898 | 1.000 | 0.938 | 0.735 | 0.798 | 0.863 |
TC_VAL | 0.000 | 0.944 | 0.000 | 0.000 | 0.938 | 1.000 | 0.629 | 0.444 | 0.856 |
TG_VAL | 0.242 | 0.972 | 0.000 | 0.000 | 0.735 | 0.629 | 1.000 | 0.000 | 0.000 |
HDL_VAL | 0.000 | 0.960 | 0.083 | 0.000 | 0.798 | 0.444 | 0.000 | 1.000 | 0.000 |
LDL_VAL | 0.000 | 0.889 | 0.000 | 0.113 | 0.863 | 0.856 | 0.000 | 0.000 | 1.000 |
| | RID | A1C_VAL | A1C_VAL_C | TC_VAL | TG_VAL | HDL_VAL | LDL_VAL |
---|---|---|---|---|---|---|---|
RID | 1.000 | -0.113 | -0.121 | -0.092 | -0.070 | -0.004 | -0.031 |
A1C_VAL | -0.113 | 1.000 | 0.978 | 0.079 | 0.101 | -0.226 | 0.120 |
A1C_VAL_C | -0.121 | 0.978 | 1.000 | 0.081 | 0.080 | -0.210 | 0.123 |
TC_VAL | -0.092 | 0.079 | 0.081 | 1.000 | 0.412 | 0.332 | 0.881 |
TG_VAL | -0.070 | 0.101 | 0.080 | 0.412 | 1.000 | -0.172 | 0.207 |
HDL_VAL | -0.004 | -0.226 | -0.210 | 0.332 | -0.172 | 1.000 | 0.210 |
LDL_VAL | -0.031 | 0.120 | 0.123 | 0.881 | 0.207 | 0.210 | 1.000 |
First rows
| | RID | A1C_DATE | A1C_VAL | A1C_VAL_C | TC/TG/HDL/LDL_DATE | TC_VAL | TG_VAL | HDL_VAL | LDL_VAL |
---|---|---|---|---|---|---|---|---|---|
0 | 1 | 2019-08-19 | 5.7 | 5 | 2019-08-19 | 103 | 234 | 28 | 34 |
1 | 2 | 2019-07-23 | 6.5 | 7 | 2019-07-23 | 143 | 59 | 50 | 73 |
2 | 3 | 2017-10-11 | 6.7 | 6 | 2017-10-13 | 111 | 127 | 53 | 34 |
3 | 4 | 2017-04-17 | 6.5 | 6 | 2017-04-17 | 162 | 103 | 47 | 97 |
4 | 5 | 2019-04-16 | 4.7 | 4 | 2019-04-16 | 175 | 125 | 51 | 87 |
5 | 6 | 2019-06-20 | 9.9 | 9 | 2019-06-20 | 133 | 109 | 34 | 81 |
6 | 7 | 2019-05-29 | 6.4 | 6 | 2019-05-29 | 194 | 96 | 71 | 100 |
7 | 8 | 2018-10-02 | 9.2 | 9 | 2018-10-02 | 178 | 116 | 49 | 116 |
8 | 9 | 2019-10-18 | 8.0 | 8 | 2019-10-18 | 141 | 91 | 52 | 72 |
9 | 10 | 2018-11-08 | 6.3 | 6 | 2018-11-08 | 166 | 98 | 27 | 124 |
Last rows
| | RID | A1C_DATE | A1C_VAL | A1C_VAL_C | TC/TG/HDL/LDL_DATE | TC_VAL | TG_VAL | HDL_VAL | LDL_VAL |
---|---|---|---|---|---|---|---|---|---|
90 | 91 | 2019-11-23 | 6.4 | 6 | 2019-11-23 | 110 | 49 | 47 | 48 |
91 | 92 | 2019-02-25 | 10.0 | 10 | 2019-02-25 | 119 | 78 | 39 | 148 |
92 | 93 | 2019-08-27 | 6.2 | 6 | 2019-08-27 | 177 | 191 | 52 | 89 |
93 | 94 | 2019-12-06 | 7.6 | 7 | 2019-12-06 | 142 | <NA> | 49 | 62 |
94 | 95 | 2020-01-03 | 6.4 | 6 | 2020-01-03 | 178 | 169 | 58 | 86 |
95 | 96 | 2016-01-13 | 6.5 | 6 | 2016-01-13 | 99 | 106 | 36 | 42 |
96 | 97 | 2019-06-21 | 5.8 | 5 | 2019-06-21 | 202 | 201 | 47 | 126 |
97 | 98 | 2019-06-13 | 6.7 | 6 | 2019-06-13 | 159 | 104 | 52 | 79 |
98 | 99 | 2016-06-27 | 10.0 | 10 | 2016-06-27 | 236 | 175 | 52 | 152 |
99 | 100 | 2019-05-22 | 8.8 | 8 | 2019-05-22 | 155 | 191 | 48 | 82 |
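The near-perfect correlation between A1C_VAL and A1C_VAL_C suggests that one column is derived from the other. In the 20 rows shown, A1C_VAL_C looks like A1C_VAL with the decimal part dropped. The sketch below tests that guess on those rows only; the truncation rule is an assumption, not something the dataset description states.

```python
# (A1C_VAL, A1C_VAL_C) pairs copied from the first and last rows shown above.
pairs = [
    (5.7, 5), (6.5, 7), (6.7, 6), (6.5, 6), (4.7, 4),
    (9.9, 9), (6.4, 6), (9.2, 9), (8.0, 8), (6.3, 6),
    (6.4, 6), (10.0, 10), (6.2, 6), (7.6, 7), (6.4, 6),
    (6.5, 6), (5.8, 5), (6.7, 6), (10.0, 10), (8.8, 8),
]

# Assumed rule: A1C_VAL_C is A1C_VAL truncated to an integer.
mismatches = [(val, cat) for val, cat in pairs if int(val) != cat]

print(f"{len(pairs) - len(mismatches)} of {len(pairs)} rows match")
print("exceptions:", mismatches)
```

On these rows the rule holds for all but one pair (6.5 recorded as 7), so it should be treated as a working hypothesis to verify against the full data rather than a documented definition.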