Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 7.6 KiB |
Average record size in memory | 77.3 B |
Variable types
Text | 1 |
---|---|
DateTime | 4 |
Numeric | 4 |
Dataset
Description | 알코올 사용 장애 환자들이 시행한 혈액 검사를 이용하여 당뇨, 고지혈증 질환과의 관련성을 평가할 수 있는 검사 데이터를 포함함. 검체 채취 일장, 접수 일자를 이용하여 처방시점으로 부터의 기간을 계산한 시점 데이터를 생성함. 검사항목은HbA1c, Glucose, HDL Cholesterol, LDL Cholesterol 등의 검사항목이 포함됨 - HbA1c(당화혈색소) :혈액 속 적혈구 내 혈색소에 포도당 일부가 결합한 상태. 일반 혈당 검사가 검사 시점 혈당만을 알 수 있는데 반해 당화혈색소를 통해 3개월 간의 평균 혈당을 알 수 있음 - LDL(Low Density Lipoprotein) Cholesterol : 나쁜 콜레스테롤이라고도 불리는 저밀도 지단백 콜레스테롤. 신체 콜레스테롤의 대부분을 차지하며 수치가 높으면 심장질환 및 뇌놀중 위험이 높아짐 - HDL(High Density Lipoprotein) Cholesterol : 좋은 콜레스테롤이라고도 불리는 고밀도 지단백 콜레스테롤로 콜레스테롤을 흡수하여 간으로 다시 운반함. 높은 HDL cholesterol은 심장질환과 뇌졸중 위험을 낮출 수 있음 |
---|---|
Author | 가톨릭대학교 서울성모병원 |
URL | http://cmcdata.net/data/dataset/coexistence-disease-analysis-blood-test-data-alcohol-use-disorder |
Reproduction
Analysis started | 2023-10-08 18:56:30.201794 |
---|---|
Analysis finished | 2023-10-08 18:56:34.782063 |
Duration | 4.58 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
RID
Text
UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
r0000002 | 1 | 1.0% |
r0000368 | 1 | 1.0% |
r0000434 | 1 | 1.0% |
r0000432 | 1 | 1.0% |
r0000425 | 1 | 1.0% |
r0000413 | 1 | 1.0% |
r0000411 | 1 | 1.0% |
r0000401 | 1 | 1.0% |
r0000398 | 1 | 1.0% |
r0000397 | 1 | 1.0% |
Other values (90) | 90 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 439 | |
R | 100 | 12.5% |
2 | 45 | 5.6% |
4 | 41 | 5.1% |
1 | 38 | 4.8% |
3 | 34 | 4.2% |
5 | 27 | 3.4% |
6 | 23 | 2.9% |
7 | 21 | 2.6% |
9 | 17 | 2.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 700 | |
Uppercase Letter | 100 | 12.5% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 439 | |
2 | 45 | 6.4% |
4 | 41 | 5.9% |
1 | 38 | 5.4% |
3 | 34 | 4.9% |
5 | 27 | 3.9% |
6 | 23 | 3.3% |
7 | 21 | 3.0% |
9 | 17 | 2.4% |
8 | 15 | 2.1% |
Uppercase Letter
Value | Count | Frequency (%) |
R | 100 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 700 | |
Latin | 100 | 12.5% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 439 | |
2 | 45 | 6.4% |
4 | 41 | 5.9% |
1 | 38 | 5.4% |
3 | 34 | 4.9% |
5 | 27 | 3.9% |
6 | 23 | 3.3% |
7 | 21 | 3.0% |
9 | 17 | 2.4% |
8 | 15 | 2.1% |
Latin
Value | Count | Frequency (%) |
R | 100 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 800 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 439 | |
R | 100 | 12.5% |
2 | 45 | 5.6% |
4 | 41 | 5.1% |
1 | 38 | 4.8% |
3 | 34 | 4.2% |
5 | 27 | 3.4% |
6 | 23 | 2.9% |
7 | 21 | 2.6% |
9 | 17 | 2.1% |
A1C_DCT
Date
Distinct | 97 |
---|---|
Distinct (%) | 97.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Minimum | 2008-11-03 00:00:00 |
---|---|
Maximum | 2018-07-31 00:00:00 |
A1C_SRC
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 71 |
---|---|
Distinct (%) | 71.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 134.413 |
Minimum | 77 |
---|---|
Maximum | 403 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 77 |
---|---|
5-th percentile | 84.9 |
Q1 | 97.75 |
median | 113.5 |
Q3 | 147 |
95-th percentile | 249.84 |
Maximum | 403 |
Range | 326 |
Interquartile range (IQR) | 49.25 |
Descriptive statistics
Standard deviation | 58.071189 |
---|---|
Coefficient of variation (CV) | 0.43203551 |
Kurtosis | 6.8768221 |
Mean | 134.413 |
Median Absolute Deviation (MAD) | 20 |
Skewness | 2.3589926 |
Sum | 13441.3 |
Variance | 3372.263 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
111.0 | 5 | 5.0% |
97.0 | 4 | 4.0% |
113.0 | 3 | 3.0% |
147.0 | 3 | 3.0% |
91.0 | 3 | 3.0% |
86.0 | 3 | 3.0% |
110.0 | 2 | 2.0% |
112.0 | 2 | 2.0% |
170.0 | 2 | 2.0% |
105.0 | 2 | 2.0% |
Other values (61) | 71 |
Value | Count | Frequency (%) |
77.0 | 1 | 1.0% |
78.0 | 1 | 1.0% |
79.0 | 1 | 1.0% |
82.0 | 1 | 1.0% |
83.0 | 1 | 1.0% |
85.0 | 1 | 1.0% |
86.0 | 3 | |
87.0 | 1 | 1.0% |
89.0 | 2 | |
91.0 | 3 |
Value | Count | Frequency (%) |
403.0 | 1 | |
380.0 | 1 | |
277.0 | 1 | |
268.0 | 1 | |
262.0 | 1 | |
249.2 | 1 | |
238.0 | 1 | |
237.4 | 1 | |
234.0 | 1 | |
213.0 | 1 |
GLC_DCT
Date
Distinct | 98 |
---|---|
Distinct (%) | 98.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Minimum | 2008-10-23 00:00:00 |
---|---|
Maximum | 2018-07-31 00:00:00 |
GLC_SRC
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 42 |
---|---|
Distinct (%) | 42.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6.433 |
Minimum | 4.2 |
---|---|
Maximum | 13 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 4.2 |
---|---|
5-th percentile | 4.695 |
Q1 | 5.4 |
median | 5.95 |
Q3 | 7.125 |
95-th percentile | 10.02 |
Maximum | 13 |
Range | 8.8 |
Interquartile range (IQR) | 1.725 |
Descriptive statistics
Standard deviation | 1.6425562 |
---|---|
Coefficient of variation (CV) | 0.25533285 |
Kurtosis | 2.7920744 |
Mean | 6.433 |
Median Absolute Deviation (MAD) | 0.65 |
Skewness | 1.5838866 |
Sum | 643.3 |
Variance | 2.6979909 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
5.5 | 11 | 11.0% |
5.4 | 7 | 7.0% |
5.7 | 5 | 5.0% |
7.6 | 5 | 5.0% |
6.4 | 4 | 4.0% |
6.5 | 4 | 4.0% |
6.3 | 4 | 4.0% |
5.1 | 4 | 4.0% |
4.7 | 3 | 3.0% |
5.0 | 3 | 3.0% |
Other values (32) | 50 |
Value | Count | Frequency (%) |
4.2 | 1 | 1.0% |
4.3 | 1 | 1.0% |
4.4 | 1 | 1.0% |
4.6 | 2 | |
4.7 | 3 | |
4.9 | 2 | |
5.0 | 3 | |
5.1 | 4 | |
5.2 | 2 | |
5.3 | 2 |
Value | Count | Frequency (%) |
13.0 | 1 | |
11.2 | 1 | |
10.8 | 2 | |
10.4 | 1 | |
10.0 | 1 | |
9.4 | 2 | |
9.3 | 1 | |
8.6 | 1 | |
8.3 | 1 | |
8.2 | 2 |
HDL_DCT
Date
Distinct | 97 |
---|---|
Distinct (%) | 97.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Minimum | 2008-11-03 00:00:00 |
---|---|
Maximum | 2018-07-31 00:00:00 |
HDL_SRC
Real number (ℝ)
Distinct | 46 |
---|---|
Distinct (%) | 46.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 44.09 |
Minimum | 3 |
---|---|
Maximum | 94 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 3 |
---|---|
5-th percentile | 9 |
Q1 | 36 |
median | 44.5 |
Q3 | 54 |
95-th percentile | 71.15 |
Maximum | 94 |
Range | 91 |
Interquartile range (IQR) | 18 |
Descriptive statistics
Standard deviation | 17.549408 |
---|---|
Coefficient of variation (CV) | 0.39803601 |
Kurtosis | 0.43752756 |
Mean | 44.09 |
Median Absolute Deviation (MAD) | 9.5 |
Skewness | -0.18746109 |
Sum | 4409 |
Variance | 307.98172 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
41 | 7 | 7.0% |
47 | 6 | 6.0% |
49 | 6 | 6.0% |
65 | 5 | 5.0% |
54 | 4 | 4.0% |
39 | 4 | 4.0% |
10 | 4 | 4.0% |
40 | 4 | 4.0% |
29 | 3 | 3.0% |
55 | 3 | 3.0% |
Other values (36) | 54 |
Value | Count | Frequency (%) |
3 | 1 | 1.0% |
7 | 1 | 1.0% |
8 | 2 | |
9 | 2 | |
10 | 4 | |
24 | 1 | 1.0% |
27 | 2 | |
28 | 1 | 1.0% |
29 | 3 | |
30 | 1 | 1.0% |
Value | Count | Frequency (%) |
94 | 1 | 1.0% |
83 | 1 | 1.0% |
75 | 1 | 1.0% |
74 | 2 | 2.0% |
71 | 1 | 1.0% |
68 | 1 | 1.0% |
66 | 2 | 2.0% |
65 | 5 | |
64 | 2 | 2.0% |
61 | 1 | 1.0% |
LDL_DCT
Date
Distinct | 98 |
---|---|
Distinct (%) | 98.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Minimum | 2008-10-23 00:00:00 |
---|---|
Maximum | 2018-07-31 00:00:00 |
LDL_SRC
Real number (ℝ)
Distinct | 72 |
---|---|
Distinct (%) | 72.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 98.61 |
Minimum | 5 |
---|---|
Maximum | 195 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 5 |
---|---|
5-th percentile | 28.95 |
Q1 | 70 |
median | 99 |
Q3 | 124 |
95-th percentile | 165.35 |
Maximum | 195 |
Range | 190 |
Interquartile range (IQR) | 54 |
Descriptive statistics
Standard deviation | 39.881616 |
---|---|
Coefficient of variation (CV) | 0.40443785 |
Kurtosis | -0.2955984 |
Mean | 98.61 |
Median Absolute Deviation (MAD) | 29 |
Skewness | 0.010456013 |
Sum | 9861 |
Variance | 1590.5433 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
70 | 3 | 3.0% |
93 | 3 | 3.0% |
117 | 3 | 3.0% |
100 | 3 | 3.0% |
69 | 3 | 3.0% |
135 | 2 | 2.0% |
98 | 2 | 2.0% |
78 | 2 | 2.0% |
55 | 2 | 2.0% |
121 | 2 | 2.0% |
Other values (62) | 75 |
Value | Count | Frequency (%) |
5 | 1 | |
17 | 1 | |
20 | 1 | |
23 | 1 | |
28 | 1 | |
29 | 1 | |
38 | 1 | |
42 | 1 | |
46 | 2 | |
48 | 2 |
Value | Count | Frequency (%) |
195 | 1 | |
182 | 1 | |
177 | 1 | |
175 | 1 | |
172 | 1 | |
165 | 1 | |
164 | 1 | |
159 | 1 | |
154 | 1 | |
149 | 1 |
RID | A1C_DCT | A1C_SRC | GLC_DCT | GLC_SRC | HDL_DCT | HDL_SRC | LDL_DCT | LDL_SRC | |
---|---|---|---|---|---|---|---|---|---|
RID | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
A1C_DCT | 1.000 | 1.000 | 0.000 | 1.000 | 0.000 | 1.000 | 0.965 | 1.000 | 0.951 |
A1C_SRC | 1.000 | 0.000 | 1.000 | 0.889 | 0.635 | 0.872 | 0.000 | 0.889 | 0.000 |
GLC_DCT | 1.000 | 1.000 | 0.889 | 1.000 | 0.000 | 1.000 | 0.990 | 1.000 | 0.959 |
GLC_SRC | 1.000 | 0.000 | 0.635 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 |
HDL_DCT | 1.000 | 1.000 | 0.872 | 1.000 | 0.000 | 1.000 | 0.965 | 1.000 | 0.844 |
HDL_SRC | 1.000 | 0.965 | 0.000 | 0.990 | 0.000 | 0.965 | 1.000 | 0.990 | 0.326 |
LDL_DCT | 1.000 | 1.000 | 0.889 | 1.000 | 0.000 | 1.000 | 0.990 | 1.000 | 0.959 |
LDL_SRC | 1.000 | 0.951 | 0.000 | 0.959 | 0.000 | 0.844 | 0.326 | 0.959 | 1.000 |
A1C_SRC | GLC_SRC | HDL_SRC | LDL_SRC | |
---|---|---|---|---|
A1C_SRC | 1.000 | 0.629 | -0.165 | -0.073 |
GLC_SRC | 0.629 | 1.000 | -0.048 | -0.113 |
HDL_SRC | -0.165 | -0.048 | 1.000 | 0.210 |
LDL_SRC | -0.073 | -0.113 | 0.210 | 1.000 |
RID | A1C_DCT | A1C_SRC | GLC_DCT | GLC_SRC | HDL_DCT | HDL_SRC | LDL_DCT | LDL_SRC | |
---|---|---|---|---|---|---|---|---|---|
0 | R0000002 | 2011-09-23 | 113.0 | 2011-09-23 | 6.4 | 2011-09-23 | 54 | 2011-09-23 | 118 |
1 | R0000004 | 2014-10-24 | 160.0 | 2014-10-24 | 6.3 | 2014-10-24 | 52 | 2014-10-24 | 23 |
2 | R0000009 | 2013-01-10 | 111.0 | 2013-01-10 | 6.0 | 2013-01-10 | 29 | 2013-01-10 | 154 |
3 | R0000020 | 2015-09-23 | 133.0 | 2015-09-23 | 5.5 | 2015-09-23 | 34 | 2015-10-02 | 93 |
4 | R0000032 | 2009-02-05 | 249.2 | 2009-01-27 | 7.1 | 2009-01-24 | 64 | 2009-01-24 | 129 |
5 | R0000039 | 2011-04-05 | 147.0 | 2011-04-22 | 7.8 | 2011-04-05 | 40 | 2011-04-05 | 141 |
6 | R0000044 | 2016-09-27 | 93.0 | 2016-09-27 | 5.1 | 2016-09-27 | 47 | 2016-09-27 | 182 |
7 | R0000053 | 2016-05-12 | 97.0 | 2016-05-12 | 5.5 | 2016-05-12 | 49 | 2016-05-12 | 133 |
8 | R0000057 | 2017-11-21 | 109.0 | 2017-11-21 | 6.3 | 2017-11-21 | 47 | 2017-11-21 | 175 |
9 | R0000060 | 2016-08-16 | 95.0 | 2016-08-16 | 5.8 | 2016-08-16 | 41 | 2016-08-29 | 84 |
RID | A1C_DCT | A1C_SRC | GLC_DCT | GLC_SRC | HDL_DCT | HDL_SRC | LDL_DCT | LDL_SRC | |
---|---|---|---|---|---|---|---|---|---|
90 | R0000503 | 2011-03-23 | 380.0 | 2011-03-22 | 7.2 | 2011-03-22 | 8 | 2011-03-22 | 109 |
91 | R0000505 | 2009-05-16 | 111.0 | 2009-05-12 | 4.7 | 2009-05-28 | 38 | 2009-05-28 | 107 |
92 | R0000507 | 2009-05-13 | 111.0 | 2009-05-14 | 5.4 | 2009-05-13 | 55 | 2009-05-13 | 103 |
93 | R0000514 | 2011-03-23 | 113.0 | 2011-03-23 | 7.0 | 2011-03-23 | 39 | 2011-03-23 | 128 |
94 | R0000516 | 2017-04-25 | 116.0 | 2017-04-25 | 6.0 | 2017-04-25 | 3 | 2017-04-25 | 17 |
95 | R0000517 | 2010-01-25 | 86.0 | 2010-01-25 | 5.4 | 2010-01-25 | 32 | 2010-01-25 | 149 |
96 | R0000520 | 2013-06-17 | 99.0 | 2013-06-17 | 6.2 | 2013-06-17 | 35 | 2013-06-17 | 46 |
97 | R0000526 | 2010-11-16 | 403.0 | 2010-11-16 | 10.8 | 2010-11-20 | 42 | 2010-11-20 | 130 |
98 | R0000527 | 2013-07-01 | 170.0 | 2013-07-15 | 6.4 | 2013-07-01 | 38 | 2013-07-01 | 117 |
99 | R0000528 | 2017-03-21 | 111.0 | 2017-03-21 | 5.2 | 2017-03-21 | 94 | 2017-03-21 | 69 |