Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 100 |
Missing cells | 102 |
Missing cells (%) | 17.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 5.3 KiB |
Average record size in memory | 54.3 B |
Variable types
Text | 1 |
---|---|
Numeric | 4 |
Unsupported | 1 |
Dataset
Description | 당뇨 환자들의 최초 진단 시점의 키, 몸무게와 같은 신체 계측 정보와 수축기/이완기 혈압을 포함하는 생체 징후 데이터. 키와 몸무게 데이터를 이용한 Body Mass Index(BMI)를 생성할 수 있으며 혈압 데이터를 이용하여 고혈압 여부를 판단할 수 있음 |
---|---|
Author | 가톨릭대학교 서울성모병원 |
URL | http://cmcdata.net/data/dataset/diabetes_vital |
BDHT is highly overall correlated with BDWT | High correlation |
BDWT is highly overall correlated with BDHT | High correlation |
SYSTOLIC is highly overall correlated with DIASTOLIC | High correlation |
DIASTOLIC is highly overall correlated with SYSTOLIC | High correlation |
BDWT has 2 (2.0%) missing values | Missing |
Unnamed: 5 has 100 (100.0%) missing values | Missing |
RID has unique values | Unique |
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
Analysis started | 2023-10-08 18:56:05.636806 |
---|---|
Analysis finished | 2023-10-08 18:56:12.896214 |
Duration | 7.26 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
RID
Text
UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
r0000001 | 1 | 1.0% |
r0000063 | 1 | 1.0% |
r0000074 | 1 | 1.0% |
r0000073 | 1 | 1.0% |
r0000072 | 1 | 1.0% |
r0000071 | 1 | 1.0% |
r0000070 | 1 | 1.0% |
r0000069 | 1 | 1.0% |
r0000068 | 1 | 1.0% |
r0000067 | 1 | 1.0% |
Other values (90) | 90 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 519 | |
R | 100 | 12.5% |
1 | 21 | 2.6% |
3 | 20 | 2.5% |
4 | 20 | 2.5% |
5 | 20 | 2.5% |
6 | 20 | 2.5% |
7 | 20 | 2.5% |
8 | 20 | 2.5% |
9 | 20 | 2.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 700 | |
Uppercase Letter | 100 | 12.5% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 519 | |
1 | 21 | 3.0% |
3 | 20 | 2.9% |
4 | 20 | 2.9% |
5 | 20 | 2.9% |
6 | 20 | 2.9% |
7 | 20 | 2.9% |
8 | 20 | 2.9% |
9 | 20 | 2.9% |
2 | 20 | 2.9% |
Uppercase Letter
Value | Count | Frequency (%) |
R | 100 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 700 | |
Latin | 100 | 12.5% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 519 | |
1 | 21 | 3.0% |
3 | 20 | 2.9% |
4 | 20 | 2.9% |
5 | 20 | 2.9% |
6 | 20 | 2.9% |
7 | 20 | 2.9% |
8 | 20 | 2.9% |
9 | 20 | 2.9% |
2 | 20 | 2.9% |
Latin
Value | Count | Frequency (%) |
R | 100 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 800 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 519 | |
R | 100 | 12.5% |
1 | 21 | 2.6% |
3 | 20 | 2.5% |
4 | 20 | 2.5% |
5 | 20 | 2.5% |
6 | 20 | 2.5% |
7 | 20 | 2.5% |
8 | 20 | 2.5% |
9 | 20 | 2.5% |
BDHT
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 61 |
---|---|
Distinct (%) | 61.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 162.037 |
Minimum | 142.9 |
---|---|
Maximum | 181 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 142.9 |
---|---|
5-th percentile | 147 |
Q1 | 155.075 |
median | 162.5 |
Q3 | 168.275 |
95-th percentile | 176.05 |
Maximum | 181 |
Range | 38.1 |
Interquartile range (IQR) | 13.2 |
Descriptive statistics
Standard deviation | 8.9937538 |
---|---|
Coefficient of variation (CV) | 0.055504322 |
Kurtosis | -0.82054362 |
Mean | 162.037 |
Median Absolute Deviation (MAD) | 6.35 |
Skewness | 0.00051419387 |
Sum | 16203.7 |
Variance | 80.887607 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
165.0 | 6 | 6.0% |
163.0 | 5 | 5.0% |
172.0 | 4 | 4.0% |
158.0 | 4 | 4.0% |
168.0 | 4 | 4.0% |
160.0 | 4 | 4.0% |
147.0 | 4 | 4.0% |
153.0 | 3 | 3.0% |
157.0 | 3 | 3.0% |
174.0 | 3 | 3.0% |
Other values (51) | 60 |
Value | Count | Frequency (%) |
142.9 | 1 | 1.0% |
146.0 | 1 | 1.0% |
147.0 | 4 | |
148.0 | 1 | 1.0% |
149.0 | 2 | |
149.2 | 1 | 1.0% |
150.0 | 2 | |
150.6 | 2 | |
151.0 | 1 | 1.0% |
152.0 | 1 | 1.0% |
Value | Count | Frequency (%) |
181.0 | 1 | 1.0% |
179.5 | 1 | 1.0% |
178.0 | 2 | |
177.0 | 1 | 1.0% |
176.0 | 1 | 1.0% |
175.5 | 1 | 1.0% |
175.0 | 1 | 1.0% |
174.9 | 1 | 1.0% |
174.0 | 3 | |
173.0 | 1 | 1.0% |
BDWT
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 64 |
---|---|
Distinct (%) | 65.3% |
Missing | 2 |
Missing (%) | 2.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 66.109694 |
Minimum | 43 |
---|---|
Maximum | 135 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 43 |
---|---|
5-th percentile | 47.485 |
Q1 | 57 |
median | 63 |
Q3 | 74.75 |
95-th percentile | 84.9425 |
Maximum | 135 |
Range | 92 |
Interquartile range (IQR) | 17.75 |
Descriptive statistics
Standard deviation | 14.279624 |
---|---|
Coefficient of variation (CV) | 0.21599894 |
Kurtosis | 4.9732653 |
Mean | 66.109694 |
Median Absolute Deviation (MAD) | 9 |
Skewness | 1.5526674 |
Sum | 6478.75 |
Variance | 203.90766 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
63.0 | 5 | 5.0% |
68.0 | 5 | 5.0% |
62.0 | 5 | 5.0% |
59.0 | 3 | 3.0% |
74.0 | 3 | 3.0% |
72.0 | 3 | 3.0% |
57.0 | 3 | 3.0% |
76.0 | 3 | 3.0% |
61.4 | 2 | 2.0% |
75.0 | 2 | 2.0% |
Other values (54) | 64 |
Value | Count | Frequency (%) |
43.0 | 1 | |
45.7 | 1 | |
47.0 | 2 | |
47.4 | 1 | |
47.5 | 1 | |
48.0 | 1 | |
48.7 | 1 | |
49.1 | 1 | |
49.5 | 1 | |
50.0 | 1 |
Value | Count | Frequency (%) |
135.0 | 1 | |
109.4 | 1 | |
98.1 | 1 | |
94.0 | 1 | |
90.0 | 1 | |
84.05 | 1 | |
84.0 | 1 | |
83.5 | 1 | |
83.0 | 2 | |
82.1 | 1 |
SYSTOLIC
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 41 |
---|---|
Distinct (%) | 41.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 127.46 |
Minimum | 95 |
---|---|
Maximum | 191 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 95 |
---|---|
5-th percentile | 104.95 |
Q1 | 115.75 |
median | 123 |
Q3 | 139.25 |
95-th percentile | 155.25 |
Maximum | 191 |
Range | 96 |
Interquartile range (IQR) | 23.5 |
Descriptive statistics
Standard deviation | 17.443948 |
---|---|
Coefficient of variation (CV) | 0.13685821 |
Kurtosis | 0.97937086 |
Mean | 127.46 |
Median Absolute Deviation (MAD) | 11.5 |
Skewness | 0.81495383 |
Sum | 12746 |
Variance | 304.29131 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
110 | 15 | 15.0% |
120 | 12 | 12.0% |
130 | 9 | 9.0% |
140 | 5 | 5.0% |
131 | 4 | 4.0% |
134 | 4 | 4.0% |
117 | 3 | 3.0% |
119 | 3 | 3.0% |
115 | 3 | 3.0% |
122 | 3 | 3.0% |
Other values (31) | 39 |
Value | Count | Frequency (%) |
95 | 1 | 1.0% |
96 | 1 | 1.0% |
99 | 1 | 1.0% |
100 | 1 | 1.0% |
104 | 1 | 1.0% |
105 | 1 | 1.0% |
106 | 1 | 1.0% |
110 | 15 | |
115 | 3 | 3.0% |
116 | 1 | 1.0% |
Value | Count | Frequency (%) |
191 | 1 | |
174 | 1 | |
166 | 1 | |
160 | 2 | |
155 | 1 | |
154 | 2 | |
152 | 1 | |
151 | 2 | |
150 | 2 | |
149 | 1 |
DIASTOLIC
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 33 |
---|---|
Distinct (%) | 33.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 76.1 |
Minimum | 60 |
---|---|
Maximum | 119 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 60 |
---|---|
5-th percentile | 60 |
Q1 | 69.75 |
median | 75.5 |
Q3 | 80 |
95-th percentile | 97.05 |
Maximum | 119 |
Range | 59 |
Interquartile range (IQR) | 10.25 |
Descriptive statistics
Standard deviation | 11.31594 |
---|---|
Coefficient of variation (CV) | 0.1486983 |
Kurtosis | 1.5096128 |
Mean | 76.1 |
Median Absolute Deviation (MAD) | 5.5 |
Skewness | 1.0076088 |
Sum | 7610 |
Variance | 128.05051 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
70 | 18 | |
80 | 17 | |
60 | 8 | 8.0% |
82 | 4 | 4.0% |
67 | 4 | 4.0% |
76 | 3 | 3.0% |
69 | 3 | 3.0% |
64 | 3 | 3.0% |
95 | 3 | 3.0% |
72 | 3 | 3.0% |
Other values (23) | 34 |
Value | Count | Frequency (%) |
60 | 8 | |
62 | 1 | 1.0% |
63 | 2 | 2.0% |
64 | 3 | 3.0% |
65 | 2 | 2.0% |
66 | 2 | 2.0% |
67 | 4 | 4.0% |
69 | 3 | 3.0% |
70 | 18 | |
71 | 2 | 2.0% |
Value | Count | Frequency (%) |
119 | 1 | 1.0% |
109 | 1 | 1.0% |
99 | 2 | |
98 | 1 | 1.0% |
97 | 1 | 1.0% |
95 | 3 | |
94 | 1 | 1.0% |
91 | 1 | 1.0% |
90 | 2 | |
89 | 1 | 1.0% |
Unnamed: 5
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 100 |
---|---|
Missing (%) | 100.0% |
Memory size | 1.0 KiB |
RID | BDHT | BDWT | SYSTOLIC | DIASTOLIC | |
---|---|---|---|---|---|
RID | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
BDHT | 1.000 | 1.000 | 0.534 | 0.000 | 0.440 |
BDWT | 1.000 | 0.534 | 1.000 | 0.000 | 0.270 |
SYSTOLIC | 1.000 | 0.000 | 0.000 | 1.000 | 0.770 |
DIASTOLIC | 1.000 | 0.440 | 0.270 | 0.770 | 1.000 |
BDHT | BDWT | SYSTOLIC | DIASTOLIC | |
---|---|---|---|---|
BDHT | 1.000 | 0.606 | 0.244 | 0.247 |
BDWT | 0.606 | 1.000 | 0.252 | 0.261 |
SYSTOLIC | 0.244 | 0.252 | 1.000 | 0.630 |
DIASTOLIC | 0.247 | 0.261 | 0.630 | 1.000 |
RID | BDHT | BDWT | SYSTOLIC | DIASTOLIC | Unnamed: 5 | |
---|---|---|---|---|---|---|
0 | R0000001 | 160.5 | 61.4 | 130 | 70 | <NA> |
1 | R0000002 | 150.6 | 62.6 | 110 | 60 | <NA> |
2 | R0000003 | 172.0 | 73.6 | 120 | 80 | <NA> |
3 | R0000004 | 168.7 | 63.0 | 146 | 76 | <NA> |
4 | R0000005 | 152.0 | 68.0 | 130 | 80 | <NA> |
5 | R0000006 | 165.0 | 75.6 | 116 | 70 | <NA> |
6 | R0000007 | 157.0 | 79.0 | 145 | 65 | <NA> |
7 | R0000008 | 158.0 | 68.0 | 122 | 64 | <NA> |
8 | R0000009 | 163.0 | 52.0 | 110 | 60 | <NA> |
9 | R0000010 | 165.0 | 72.0 | 120 | 60 | <NA> |
RID | BDHT | BDWT | SYSTOLIC | DIASTOLIC | Unnamed: 5 | |
---|---|---|---|---|---|---|
90 | R0000091 | 179.5 | 94.0 | 151 | 109 | <NA> |
91 | R0000092 | 165.0 | 63.0 | 174 | 119 | <NA> |
92 | R0000093 | 172.0 | 72.0 | 131 | 81 | <NA> |
93 | R0000094 | 168.5 | 75.25 | 160 | 90 | <NA> |
94 | R0000095 | 174.0 | 66.0 | 134 | 88 | <NA> |
95 | R0000096 | 168.0 | 68.0 | 139 | 99 | <NA> |
96 | R0000097 | 160.0 | 50.0 | 117 | 67 | <NA> |
97 | R0000098 | 147.0 | 63.0 | 110 | 70 | <NA> |
98 | R0000099 | 174.0 | 74.0 | 130 | 80 | <NA> |
99 | R0000100 | 168.2 | 78.2 | 119 | 79 | <NA> |