Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 100 |
Missing cells | 4 |
Missing cells (%) | 0.7% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 5.3 KiB |
Average record size in memory | 54.3 B |
Variable types
Numeric | 5 |
---|---|
DateTime | 1 |
Dataset
Description | 당뇨병 환자들이 시행한 혈액 검사 중에 약물 부작용을 평가할 수 있는 검사 데이터를 포함함. 검사항목은 AST(GOT), ALT(GPT), Bun, Creatinine 등 당뇨병의 간독성, 신독성등 다양한 부작용을 평가할 수 있는 주요 검사항목이 포함됨 - AST(Aspartate aminotransferase. GOT(Glutamic Oxalacetic Transaminase)), ALT(alanine aminotransferase, GPT(glutamic pyruvate transaminase)): 간세포 손상을 반영하는 아미노전이효소(Aminotransferases)로 기본적인 간기능검사 항목임 -BUN(Blood Urea Nitrogen): 간세포 손상이나 신장의 기능을 평가할 수 있는 항목 - Creatinine: 근육에서 크레틴(Creatine)으로부터 생성되며 신장 기능 이외의 영향이 적어 신기능을 평가하는데 유용함 |
---|---|
Author | 가톨릭대학교 은평성모병원 |
URL | http://cmcdata.net/data/dataset/diabetes_sideeffects-eunpyeong |
BUN_VAL is highly overall correlated with Cr_VAL | High correlation |
Cr_VAL is highly overall correlated with BUN_VAL | High correlation |
AST_VAL is highly overall correlated with ALT_VAL | High correlation |
ALT_VAL is highly overall correlated with AST_VAL | High correlation |
AST_VAL has 2 (2.0%) missing values | Missing |
ALT_VAL has 2 (2.0%) missing values | Missing |
RID has unique values | Unique |
Reproduction
Analysis started | 2023-10-08 18:57:29.106924 |
---|---|
Analysis finished | 2023-10-08 18:57:35.157148 |
Duration | 6.05 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
RID
Real number (ℝ)
UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 50.5 |
Minimum | 1 |
---|---|
Maximum | 100 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 5.95 |
Q1 | 25.75 |
median | 50.5 |
Q3 | 75.25 |
95-th percentile | 95.05 |
Maximum | 100 |
Range | 99 |
Interquartile range (IQR) | 49.5 |
Descriptive statistics
Standard deviation | 29.011492 |
---|---|
Coefficient of variation (CV) | 0.57448499 |
Kurtosis | -1.2 |
Mean | 50.5 |
Median Absolute Deviation (MAD) | 25 |
Skewness | 0 |
Sum | 5050 |
Variance | 841.66667 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | 1.0% |
65 | 1 | 1.0% |
75 | 1 | 1.0% |
74 | 1 | 1.0% |
73 | 1 | 1.0% |
72 | 1 | 1.0% |
71 | 1 | 1.0% |
70 | 1 | 1.0% |
69 | 1 | 1.0% |
68 | 1 | 1.0% |
Other values (90) | 90 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
100 | 1 | |
99 | 1 | |
98 | 1 | |
97 | 1 | |
96 | 1 | |
95 | 1 | |
94 | 1 | |
93 | 1 | |
92 | 1 | |
91 | 1 |
BUN/Cr_DATE
Date
Distinct | 91 |
---|---|
Distinct (%) | 91.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Minimum | 2015-10-01 00:00:00 |
---|---|
Maximum | 2020-01-31 00:00:00 |
BUN_VAL
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 84 |
---|---|
Distinct (%) | 84.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20.07 |
Minimum | 7.6 |
---|---|
Maximum | 121.6 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 7.6 |
---|---|
5-th percentile | 9.86 |
Q1 | 13.3 |
median | 16.65 |
Q3 | 22.675 |
95-th percentile | 35.655 |
Maximum | 121.6 |
Range | 114 |
Interquartile range (IQR) | 9.375 |
Descriptive statistics
Standard deviation | 13.680654 |
---|---|
Coefficient of variation (CV) | 0.68164695 |
Kurtosis | 31.387032 |
Mean | 20.07 |
Median Absolute Deviation (MAD) | 4.35 |
Skewness | 4.7435752 |
Sum | 2007 |
Variance | 187.1603 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
15.3 | 3 | 3.0% |
14.4 | 3 | 3.0% |
12.3 | 2 | 2.0% |
18.4 | 2 | 2.0% |
17.5 | 2 | 2.0% |
16.9 | 2 | 2.0% |
14.1 | 2 | 2.0% |
16.6 | 2 | 2.0% |
24.8 | 2 | 2.0% |
13.3 | 2 | 2.0% |
Other values (74) | 78 |
Value | Count | Frequency (%) |
7.6 | 1 | |
8.5 | 1 | |
9.0 | 2 | |
9.1 | 1 | |
9.9 | 2 | |
10.2 | 1 | |
10.5 | 1 | |
10.7 | 1 | |
11.2 | 1 | |
11.4 | 2 |
Value | Count | Frequency (%) |
121.6 | 1 | |
68.0 | 1 | |
44.4 | 1 | |
36.8 | 1 | |
36.7 | 1 | |
35.6 | 1 | |
35.3 | 1 | |
34.7 | 1 | |
33.1 | 1 | |
31.8 | 1 |
Cr_VAL
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 65 |
---|---|
Distinct (%) | 65.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.2227 |
Minimum | 0.48 |
---|---|
Maximum | 9.07 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0.48 |
---|---|
5-th percentile | 0.559 |
Q1 | 0.72 |
median | 0.9 |
Q3 | 1.0725 |
95-th percentile | 2.733 |
Maximum | 9.07 |
Range | 8.59 |
Interquartile range (IQR) | 0.3525 |
Descriptive statistics
Standard deviation | 1.325764 |
---|---|
Coefficient of variation (CV) | 1.0842921 |
Kurtosis | 21.848524 |
Mean | 1.2227 |
Median Absolute Deviation (MAD) | 0.18 |
Skewness | 4.4874165 |
Sum | 122.27 |
Variance | 1.7576502 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.92 | 4 | 4.0% |
1.03 | 4 | 4.0% |
0.95 | 4 | 4.0% |
0.72 | 4 | 4.0% |
1.0 | 3 | 3.0% |
0.58 | 3 | 3.0% |
0.87 | 3 | 3.0% |
0.76 | 3 | 3.0% |
0.89 | 3 | 3.0% |
0.96 | 2 | 2.0% |
Other values (55) | 67 |
Value | Count | Frequency (%) |
0.48 | 1 | 1.0% |
0.49 | 1 | 1.0% |
0.52 | 1 | 1.0% |
0.53 | 1 | 1.0% |
0.54 | 1 | 1.0% |
0.56 | 2 | |
0.58 | 3 | |
0.59 | 2 | |
0.61 | 2 | |
0.62 | 1 | 1.0% |
Value | Count | Frequency (%) |
9.07 | 1 | |
8.32 | 1 | |
6.49 | 1 | |
3.63 | 1 | |
3.55 | 1 | |
2.69 | 2 | |
2.51 | 1 | |
2.11 | 1 | |
1.73 | 1 | |
1.58 | 1 |
AST_VAL
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 36 |
---|---|
Distinct (%) | 36.7% |
Missing | 2 |
Missing (%) | 2.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 33.112245 |
Minimum | 12 |
---|---|
Maximum | 564 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 12 |
---|---|
5-th percentile | 14.7 |
Q1 | 18 |
median | 23 |
Q3 | 31.75 |
95-th percentile | 71.6 |
Maximum | 564 |
Range | 552 |
Interquartile range (IQR) | 13.75 |
Descriptive statistics
Standard deviation | 56.328543 |
---|---|
Coefficient of variation (CV) | 1.7011394 |
Kurtosis | 83.472625 |
Mean | 33.112245 |
Median Absolute Deviation (MAD) | 6 |
Skewness | 8.8318612 |
Sum | 3245 |
Variance | 3172.9048 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
16 | 7 | 7.0% |
25 | 7 | 7.0% |
19 | 6 | 6.0% |
15 | 6 | 6.0% |
20 | 5 | 5.0% |
21 | 5 | 5.0% |
27 | 5 | 5.0% |
18 | 5 | 5.0% |
23 | 4 | 4.0% |
12 | 4 | 4.0% |
Other values (26) | 44 |
Value | Count | Frequency (%) |
12 | 4 | |
13 | 1 | 1.0% |
15 | 6 | |
16 | 7 | |
17 | 4 | |
18 | 5 | |
19 | 6 | |
20 | 5 | |
21 | 5 | |
22 | 3 |
Value | Count | Frequency (%) |
564 | 1 | |
89 | 1 | |
86 | 1 | |
75 | 2 | |
71 | 1 | |
61 | 1 | |
52 | 1 | |
50 | 1 | |
47 | 2 | |
42 | 1 |
ALT_VAL
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 41 |
---|---|
Distinct (%) | 41.8% |
Missing | 2 |
Missing (%) | 2.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 33.285714 |
Minimum | 6 |
---|---|
Maximum | 375 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 6 |
---|---|
5-th percentile | 11.85 |
Q1 | 18 |
median | 24.5 |
Q3 | 35.5 |
95-th percentile | 63 |
Maximum | 375 |
Range | 369 |
Interquartile range (IQR) | 17.5 |
Descriptive statistics
Standard deviation | 39.187627 |
---|---|
Coefficient of variation (CV) | 1.1773107 |
Kurtosis | 60.537129 |
Mean | 33.285714 |
Median Absolute Deviation (MAD) | 7.5 |
Skewness | 7.0752103 |
Sum | 3262 |
Variance | 1535.6701 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
21 | 7 | 7.0% |
19 | 5 | 5.0% |
31 | 5 | 5.0% |
24 | 4 | 4.0% |
14 | 4 | 4.0% |
17 | 4 | 4.0% |
18 | 4 | 4.0% |
29 | 3 | 3.0% |
61 | 3 | 3.0% |
45 | 3 | 3.0% |
Other values (31) | 56 |
Value | Count | Frequency (%) |
6 | 1 | 1.0% |
10 | 2 | |
11 | 2 | |
12 | 3 | |
13 | 3 | |
14 | 4 | |
15 | 3 | |
16 | 1 | 1.0% |
17 | 4 | |
18 | 4 |
Value | Count | Frequency (%) |
375 | 1 | 1.0% |
99 | 1 | 1.0% |
98 | 1 | 1.0% |
68 | 1 | 1.0% |
63 | 3 | |
61 | 3 | |
59 | 2 | |
53 | 1 | 1.0% |
52 | 1 | 1.0% |
51 | 2 |
RID | BUN/Cr_DATE | BUN_VAL | Cr_VAL | AST_VAL | ALT_VAL | |
---|---|---|---|---|---|---|
RID | 1.000 | 0.726 | 0.266 | 0.214 | 0.131 | 0.000 |
BUN/Cr_DATE | 0.726 | 1.000 | 0.975 | 0.970 | 0.000 | 0.000 |
BUN_VAL | 0.266 | 0.975 | 1.000 | 0.948 | 0.000 | 0.000 |
Cr_VAL | 0.214 | 0.970 | 0.948 | 1.000 | 0.000 | 0.000 |
AST_VAL | 0.131 | 0.000 | 0.000 | 0.000 | 1.000 | 0.779 |
ALT_VAL | 0.000 | 0.000 | 0.000 | 0.000 | 0.779 | 1.000 |
RID | BUN_VAL | Cr_VAL | AST_VAL | ALT_VAL | |
---|---|---|---|---|---|
RID | 1.000 | 0.063 | -0.068 | -0.011 | 0.216 |
BUN_VAL | 0.063 | 1.000 | 0.711 | -0.078 | -0.179 |
Cr_VAL | -0.068 | 0.711 | 1.000 | -0.032 | -0.043 |
AST_VAL | -0.011 | -0.078 | -0.032 | 1.000 | 0.652 |
ALT_VAL | 0.216 | -0.179 | -0.043 | 0.652 | 1.000 |
RID | BUN/Cr_DATE | BUN_VAL | Cr_VAL | AST_VAL | ALT_VAL | |
---|---|---|---|---|---|---|
0 | 1 | 2019-08-19T00:00:00 | 29.4 | 1.4 | 23 | 11 |
1 | 2 | 2016-12-27T00:00:00 | 15.3 | 0.95 | 89 | 99 |
2 | 3 | 2019-07-23T00:00:00 | 12.0 | 0.56 | 24 | 13 |
3 | 4 | 2017-10-31T00:00:00 | 12.7 | 0.87 | 21 | 27 |
4 | 5 | 2019-11-01T00:00:00 | 35.3 | 1.49 | 25 | 14 |
5 | 6 | 2017-04-14T00:00:00 | 9.1 | 0.75 | 71 | 44 |
6 | 7 | 2019-04-16T00:00:00 | 30.3 | 2.51 | 25 | 18 |
7 | 8 | 2019-06-20T00:00:00 | 22.9 | 1.03 | 32 | 22 |
8 | 9 | 2016-09-06T00:00:00 | 14.6 | 0.84 | 18 | 12 |
9 | 10 | 2019-05-21T00:00:00 | 25.4 | 1.47 | 36 | 31 |
RID | BUN/Cr_DATE | BUN_VAL | Cr_VAL | AST_VAL | ALT_VAL | |
---|---|---|---|---|---|---|
90 | 91 | 2019-08-19T00:00:00 | 18.2 | 0.73 | 18 | 21 |
91 | 92 | 2016-04-04T00:00:00 | 31.8 | 1.03 | <NA> | <NA> |
92 | 93 | 2019-04-26T00:00:00 | 14.7 | 0.72 | 20 | 20 |
93 | 94 | 2020-01-30T00:00:00 | 29.9 | 3.63 | 38 | 27 |
94 | 95 | 2019-05-23T00:00:00 | 121.6 | 3.55 | 15 | 28 |
95 | 96 | 2019-05-22T00:00:00 | 30.6 | 0.86 | 35 | 45 |
96 | 97 | 2019-10-10T00:00:00 | 8.5 | 0.48 | 12 | 15 |
97 | 98 | 2019-11-05T00:00:00 | 13.4 | 0.66 | 41 | 26 |
98 | 99 | 2018-07-10T00:00:00 | 17.5 | 0.92 | 31 | 53 |
99 | 100 | 2018-08-16T00:00:00 | 14.1 | 0.76 | 47 | 52 |