Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 6.1 KiB |
Average record size in memory | 62.3 B |
Variable types
Text | 1 |
---|---|
DateTime | 1 |
Numeric | 5 |
Dataset
Description | 당뇨병 환자들이 시행한 혈액 검사 중에 간, 신장 기능 평가할 수 있는 검사 데이터를 포함함. 검사항목은 Bun, Creatinine, AST(GOT), ALT(GPT), MDRD-eGFR - AST(Aspartate aminotransferase. GOT(Glutamic Oxalacetic Transaminase)), ALT(alanine aminotransferase, GPT(glutamic pyruvate transaminase)): 간세포 손상을 반영하는 아미노전이효소(Aminotransferases)로 기본적인 간기능검사 항목임 - BUN(Blood Urea Nitrogen): 간세포 손상이나 신장의 기능을 평가할 수 있는 항목 - Creatinine: 근육에서 크레틴(Creatine)으로부터 생성되며 신장 기능 이외의 영향이 적어 신기능을 평가하는데 유용함 - MDRD-eGFR(Modification of Diet in Renal Disease Study, MDRD-Estimated Glomerular Filtration Rate, eGFR): 혈액 내 크레아티닌 수치를 측정하고 그 결과를 MDRD공식을 사용하여 계산해 신장이 얼마나 잘 기능 하는지를 나태내는 수치 |
---|---|
Author | 가톨릭대학교 서울성모병원 |
URL | http://cmcdata.net/data/dataset/diabetes_lab |
Reproduction
Analysis started | 2023-10-08 18:57:13.256801 |
---|---|
Analysis finished | 2023-10-08 18:57:21.459808 |
Duration | 8.2 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
RID
Text
UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
r0000001 | 1 | 1.0% |
r0000063 | 1 | 1.0% |
r0000074 | 1 | 1.0% |
r0000073 | 1 | 1.0% |
r0000072 | 1 | 1.0% |
r0000071 | 1 | 1.0% |
r0000070 | 1 | 1.0% |
r0000069 | 1 | 1.0% |
r0000068 | 1 | 1.0% |
r0000067 | 1 | 1.0% |
Other values (90) | 90 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 519 | |
R | 100 | 12.5% |
1 | 21 | 2.6% |
3 | 20 | 2.5% |
4 | 20 | 2.5% |
5 | 20 | 2.5% |
6 | 20 | 2.5% |
7 | 20 | 2.5% |
8 | 20 | 2.5% |
9 | 20 | 2.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 700 | |
Uppercase Letter | 100 | 12.5% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 519 | |
1 | 21 | 3.0% |
3 | 20 | 2.9% |
4 | 20 | 2.9% |
5 | 20 | 2.9% |
6 | 20 | 2.9% |
7 | 20 | 2.9% |
8 | 20 | 2.9% |
9 | 20 | 2.9% |
2 | 20 | 2.9% |
Uppercase Letter
Value | Count | Frequency (%) |
R | 100 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 700 | |
Latin | 100 | 12.5% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 519 | |
1 | 21 | 3.0% |
3 | 20 | 2.9% |
4 | 20 | 2.9% |
5 | 20 | 2.9% |
6 | 20 | 2.9% |
7 | 20 | 2.9% |
8 | 20 | 2.9% |
9 | 20 | 2.9% |
2 | 20 | 2.9% |
Latin
Value | Count | Frequency (%) |
R | 100 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 800 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 519 | |
R | 100 | 12.5% |
1 | 21 | 2.6% |
3 | 20 | 2.5% |
4 | 20 | 2.5% |
5 | 20 | 2.5% |
6 | 20 | 2.5% |
7 | 20 | 2.5% |
8 | 20 | 2.5% |
9 | 20 | 2.5% |
BUN/Cr_DATE
Date
Distinct | 62 |
---|---|
Distinct (%) | 62.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Minimum | 2009-06-01 00:00:00 |
---|---|
Maximum | 2019-05-01 00:00:00 |
BUN_VAL
Real number (ℝ)
Distinct | 80 |
---|---|
Distinct (%) | 80.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 16.849 |
Minimum | 5.7 |
---|---|
Maximum | 84 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 5.7 |
---|---|
5-th percentile | 9.18 |
Q1 | 11.8 |
median | 15.25 |
Q3 | 19.375 |
95-th percentile | 26.12 |
Maximum | 84 |
Range | 78.3 |
Interquartile range (IQR) | 7.575 |
Descriptive statistics
Standard deviation | 9.129954 |
---|---|
Coefficient of variation (CV) | 0.54186919 |
Kurtosis | 29.572725 |
Mean | 16.849 |
Median Absolute Deviation (MAD) | 3.7 |
Skewness | 4.4307295 |
Sum | 1684.9 |
Variance | 83.35606 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
17.9 | 3 | 3.0% |
11.0 | 3 | 3.0% |
12.1 | 3 | 3.0% |
11.9 | 3 | 3.0% |
17.7 | 2 | 2.0% |
14.3 | 2 | 2.0% |
14.7 | 2 | 2.0% |
10.8 | 2 | 2.0% |
21.1 | 2 | 2.0% |
11.8 | 2 | 2.0% |
Other values (70) | 76 |
Value | Count | Frequency (%) |
5.7 | 1 | |
7.2 | 1 | |
7.6 | 1 | |
7.8 | 1 | |
8.8 | 1 | |
9.2 | 1 | |
9.4 | 1 | |
9.5 | 2 | |
9.8 | 2 | |
9.9 | 2 |
Value | Count | Frequency (%) |
84.0 | 1 | |
42.8 | 1 | |
37.3 | 1 | |
31.1 | 1 | |
30.3 | 1 | |
25.9 | 1 | |
24.9 | 1 | |
24.8 | 1 | |
24.7 | 1 | |
23.6 | 1 |
Cr_VAL
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 58 |
---|---|
Distinct (%) | 58.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.9646 |
Minimum | 0.48 |
---|---|
Maximum | 5.91 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0.48 |
---|---|
5-th percentile | 0.58 |
Q1 | 0.72 |
median | 0.87 |
Q3 | 1.0125 |
95-th percentile | 1.4415 |
Maximum | 5.91 |
Range | 5.43 |
Interquartile range (IQR) | 0.2925 |
Descriptive statistics
Standard deviation | 0.58045162 |
---|---|
Coefficient of variation (CV) | 0.6017537 |
Kurtosis | 54.033165 |
Mean | 0.9646 |
Median Absolute Deviation (MAD) | 0.15 |
Skewness | 6.587708 |
Sum | 96.46 |
Variance | 0.33692408 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.69 | 5 | 5.0% |
0.79 | 4 | 4.0% |
1.05 | 3 | 3.0% |
0.92 | 3 | 3.0% |
0.87 | 3 | 3.0% |
1.01 | 3 | 3.0% |
0.78 | 3 | 3.0% |
0.84 | 3 | 3.0% |
0.9 | 3 | 3.0% |
0.83 | 3 | 3.0% |
Other values (48) | 67 |
Value | Count | Frequency (%) |
0.48 | 1 | |
0.5 | 1 | |
0.51 | 1 | |
0.56 | 1 | |
0.58 | 2 | |
0.59 | 1 | |
0.6 | 1 | |
0.61 | 2 | |
0.62 | 1 | |
0.65 | 1 |
Value | Count | Frequency (%) |
5.91 | 1 | |
2.22 | 1 | |
2.14 | 1 | |
1.98 | 1 | |
1.47 | 1 | |
1.44 | 1 | |
1.43 | 1 | |
1.41 | 1 | |
1.31 | 1 | |
1.27 | 1 |
AST_VAL
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 34 |
---|---|
Distinct (%) | 34.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 28.59 |
Minimum | 12 |
---|---|
Maximum | 196 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 12 |
---|---|
5-th percentile | 15 |
Q1 | 18 |
median | 22 |
Q3 | 29 |
95-th percentile | 64.5 |
Maximum | 196 |
Range | 184 |
Interquartile range (IQR) | 11 |
Descriptive statistics
Standard deviation | 23.3403 |
---|---|
Coefficient of variation (CV) | 0.81637985 |
Kurtosis | 28.546768 |
Mean | 28.59 |
Median Absolute Deviation (MAD) | 5 |
Skewness | 4.7253139 |
Sum | 2859 |
Variance | 544.7696 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20 | 7 | 7.0% |
18 | 7 | 7.0% |
15 | 7 | 7.0% |
19 | 7 | 7.0% |
17 | 6 | 6.0% |
21 | 5 | 5.0% |
28 | 5 | 5.0% |
22 | 5 | 5.0% |
24 | 4 | 4.0% |
36 | 4 | 4.0% |
Other values (24) | 43 |
Value | Count | Frequency (%) |
12 | 1 | 1.0% |
14 | 3 | |
15 | 7 | |
16 | 4 | |
17 | 6 | |
18 | 7 | |
19 | 7 | |
20 | 7 | |
21 | 5 | |
22 | 5 |
Value | Count | Frequency (%) |
196 | 1 | |
119 | 1 | |
85 | 1 | |
75 | 1 | |
74 | 1 | |
64 | 1 | |
55 | 1 | |
53 | 1 | |
50 | 1 | |
45 | 1 |
ALT_VAL
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 51 |
---|---|
Distinct (%) | 51.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 34.83 |
Minimum | 7 |
---|---|
Maximum | 218 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 7 |
---|---|
5-th percentile | 13 |
Q1 | 18 |
median | 24 |
Q3 | 37.25 |
95-th percentile | 91.25 |
Maximum | 218 |
Range | 211 |
Interquartile range (IQR) | 19.25 |
Descriptive statistics
Standard deviation | 31.136407 |
---|---|
Coefficient of variation (CV) | 0.8939537 |
Kurtosis | 15.066775 |
Mean | 34.83 |
Median Absolute Deviation (MAD) | 8 |
Skewness | 3.3896001 |
Sum | 3483 |
Variance | 969.47586 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
18 | 7 | 7.0% |
20 | 6 | 6.0% |
19 | 6 | 6.0% |
16 | 5 | 5.0% |
13 | 4 | 4.0% |
21 | 4 | 4.0% |
29 | 4 | 4.0% |
28 | 3 | 3.0% |
24 | 3 | 3.0% |
14 | 3 | 3.0% |
Other values (41) | 55 |
Value | Count | Frequency (%) |
7 | 1 | 1.0% |
9 | 1 | 1.0% |
11 | 1 | 1.0% |
12 | 1 | 1.0% |
13 | 4 | |
14 | 3 | |
15 | 2 | 2.0% |
16 | 5 | |
17 | 2 | 2.0% |
18 | 7 |
Value | Count | Frequency (%) |
218 | 1 | |
175 | 1 | |
100 | 1 | |
96 | 2 | |
91 | 1 | |
90 | 1 | |
72 | 1 | |
71 | 1 | |
67 | 1 | |
66 | 1 |
MDRD_VAL
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 97 |
---|---|
Distinct (%) | 97.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 81.9759 |
Minimum | 9.62 |
---|---|
Maximum | 146.34 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 9.62 |
---|---|
5-th percentile | 36.9555 |
Q1 | 70.7375 |
median | 82.88 |
Q3 | 94.6825 |
95-th percentile | 121.599 |
Maximum | 146.34 |
Range | 136.72 |
Interquartile range (IQR) | 23.945 |
Descriptive statistics
Standard deviation | 22.999817 |
---|---|
Coefficient of variation (CV) | 0.28056803 |
Kurtosis | 1.215002 |
Mean | 81.9759 |
Median Absolute Deviation (MAD) | 12.165 |
Skewness | -0.34966167 |
Sum | 8197.59 |
Variance | 528.99157 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
88.8 | 2 | 2.0% |
85.66 | 2 | 2.0% |
78.84 | 2 | 2.0% |
98.58 | 1 | 1.0% |
80.17 | 1 | 1.0% |
90.22 | 1 | 1.0% |
64.94 | 1 | 1.0% |
9.62 | 1 | 1.0% |
62.76 | 1 | 1.0% |
52.28 | 1 | 1.0% |
Other values (87) | 87 |
Value | Count | Frequency (%) |
9.62 | 1 | |
22.53 | 1 | |
24.78 | 1 | |
34.76 | 1 | |
36.49 | 1 | |
36.98 | 1 | |
46.45 | 1 | |
47.08 | 1 | |
52.28 | 1 | |
53.48 | 1 |
Value | Count | Frequency (%) |
146.34 | 1 | |
131.06 | 1 | |
126.28 | 1 | |
125.67 | 1 | |
122.91 | 1 | |
121.53 | 1 | |
116.96 | 1 | |
114.62 | 1 | |
109.69 | 1 | |
108.71 | 1 |
RID | BUN/Cr_DATE | BUN_VAL | Cr_VAL | AST_VAL | ALT_VAL | MDRD_VAL | |
---|---|---|---|---|---|---|---|
RID | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
BUN/Cr_DATE | 1.000 | 1.000 | 0.000 | 0.000 | 0.490 | 0.575 | 0.636 |
BUN_VAL | 1.000 | 0.000 | 1.000 | 0.826 | 0.093 | 0.000 | 0.667 |
Cr_VAL | 1.000 | 0.000 | 0.826 | 1.000 | 0.000 | 0.000 | 0.900 |
AST_VAL | 1.000 | 0.490 | 0.093 | 0.000 | 1.000 | 0.879 | 0.000 |
ALT_VAL | 1.000 | 0.575 | 0.000 | 0.000 | 0.879 | 1.000 | 0.000 |
MDRD_VAL | 1.000 | 0.636 | 0.667 | 0.900 | 0.000 | 0.000 | 1.000 |
BUN_VAL | Cr_VAL | AST_VAL | ALT_VAL | MDRD_VAL | |
---|---|---|---|---|---|
BUN_VAL | 1.000 | 0.372 | -0.192 | -0.188 | -0.358 |
Cr_VAL | 0.372 | 1.000 | 0.190 | 0.191 | -0.746 |
AST_VAL | -0.192 | 0.190 | 1.000 | 0.736 | -0.070 |
ALT_VAL | -0.188 | 0.191 | 0.736 | 1.000 | 0.030 |
MDRD_VAL | -0.358 | -0.746 | -0.070 | 0.030 | 1.000 |
RID | BUN/Cr_DATE | BUN_VAL | Cr_VAL | AST_VAL | ALT_VAL | MDRD_VAL | |
---|---|---|---|---|---|---|---|
0 | R0000001 | 2009-09 | 23.1 | 0.59 | 20 | 14 | 98.58 |
1 | R0000002 | 2011-12 | 22.0 | 0.79 | 39 | 34 | 70.76 |
2 | R0000003 | 2009-12 | 17.9 | 0.92 | 20 | 14 | 84.8 |
3 | R0000004 | 2017-06 | 15.4 | 0.87 | 85 | 71 | 99.86 |
4 | R0000005 | 2009-07 | 24.7 | 1.01 | 18 | 21 | 54.51 |
5 | R0000006 | 2015-07 | 13.3 | 0.78 | 15 | 16 | 73.01 |
6 | R0000007 | 2017-08 | 11.4 | 0.84 | 24 | 41 | 103.39 |
7 | R0000008 | 2015-12 | 15.1 | 0.75 | 119 | 175 | 79.94 |
8 | R0000009 | 2010-08 | 12.0 | 0.94 | 36 | 72 | 88.44 |
9 | R0000010 | 2015-11 | 9.9 | 0.72 | 17 | 23 | 80.55 |
RID | BUN/Cr_DATE | BUN_VAL | Cr_VAL | AST_VAL | ALT_VAL | MDRD_VAL | |
---|---|---|---|---|---|---|---|
90 | R0000091 | 2016-04 | 12.8 | 0.72 | 31 | 67 | 121.53 |
91 | R0000092 | 2009-09 | 14.1 | 0.71 | 18 | 18 | 80.92 |
92 | R0000093 | 2015-11 | 16.6 | 0.66 | 16 | 13 | 88.8 |
93 | R0000094 | 2013-06 | 18.0 | 1.07 | 17 | 28 | 81.15 |
94 | R0000095 | 2019-02 | 13.6 | 0.69 | 19 | 28 | 85.93 |
95 | R0000096 | 2017-01 | 18.5 | 0.58 | 15 | 21 | 103.08 |
96 | R0000097 | 2014-11 | 20.3 | 0.62 | 18 | 44 | 114.62 |
97 | R0000098 | 2016-02 | 25.9 | 0.78 | 22 | 18 | 101.19 |
98 | R0000099 | 2015-10 | 13.2 | 0.5 | 26 | 24 | 126.28 |
99 | R0000100 | 2013-10 | 12.1 | 0.86 | 20 | 22 | 91.99 |