Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 4.4 KiB |
Average record size in memory | 45.3 B |
Variable types
Text | 1 |
---|---|
Numeric | 4 |
Dataset
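Description | Vital-sign data for hyperlipidemia patients at the time of their first prescription, including anthropometric measurements such as height and weight and systolic/diastolic blood pressure. Body Mass Index (BMI) can be derived from the height and weight data, and hypertension status can be determined from the blood pressure data. |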
---|---|
Author | The Catholic University of Korea, Seoul St. Mary's Hospital (가톨릭대학교 서울성모병원) |
URL | http://cmcdata.net/data/dataset/vital-signs-dyslipidemia-data |
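The description notes that BMI can be derived from height and weight and that hypertension status can be judged from blood pressure. A minimal sketch of both follows, assuming `BDHT` is recorded in centimetres and `BDWT` in kilograms (the report does not state units) and using a 140/90 mmHg cutoff, which is a common convention rather than something this dataset specifies. The function names are illustrative, and the three rows are copied from the sample rows shown in this report.

```python
def bmi(height_cm: float, weight_kg: float) -> float:
    """Body Mass Index = weight (kg) / height (m) squared."""
    height_m = height_cm / 100.0  # assumption: BDHT is recorded in cm
    return weight_kg / (height_m ** 2)


def is_hypertensive(systolic: float, diastolic: float,
                    sys_cutoff: float = 140, dia_cutoff: float = 90) -> bool:
    """Flag a reading at or above either cutoff (assumed 140/90 mmHg)."""
    return systolic >= sys_cutoff or diastolic >= dia_cutoff


# (RID, BDHT, BDWT, SYSTOLIC, DIASTOLIC) rows taken from the report's sample
rows = [
    ("R0000076", 158.0, 64.0, 141, 90),
    ("R0000082", 168.5, 62.0, 130, 70),
    ("R0000083", 155.8, 46.5, 140, 60),
]

# Derive (RID, BMI rounded to one decimal, hypertension flag) per record
derived = [
    (rid, round(bmi(ht, wt), 1), is_hypertensive(sbp, dbp))
    for rid, ht, wt, sbp, dbp in rows
]

for rid, bmi_value, flag in derived:
    print(rid, bmi_value, flag)
```

Using "or" between the two cutoffs treats an elevated systolic or diastolic reading alone as sufficient; a stricter definition would need a different rule and, in practice, repeated measurements.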
Reproduction
Analysis started | 2023-10-08 18:55:39.066970 |
---|---|
Analysis finished | 2023-10-08 18:55:49.498324 |
Duration | 10.43 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
RID
Text
UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
r0000076 | 1 | 1.0% |
r0000290 | 1 | 1.0% |
r0000329 | 1 | 1.0% |
r0000327 | 1 | 1.0% |
r0000322 | 1 | 1.0% |
r0000321 | 1 | 1.0% |
r0000320 | 1 | 1.0% |
r0000318 | 1 | 1.0% |
r0000315 | 1 | 1.0% |
r0000314 | 1 | 1.0% |
Other values (90) | 90 | 90.0% |
Most occurring characters
Value | Count | Frequency (%) |
0 | 422 | 52.8% |
R | 100 | 12.5% |
2 | 51 | 6.4% |
1 | 48 | 6.0% |
3 | 48 | 6.0% |
5 | 27 | 3.4% |
4 | 26 | 3.2% |
8 | 22 | 2.8% |
7 | 20 | 2.5% |
6 | 20 | 2.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 700 | 87.5% |
Uppercase Letter | 100 | 12.5% |
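These category counts follow from the RID format: each identifier in the sample rows is one uppercase letter followed by seven digits, so 100 identifiers give 100 letters and 700 digits. As a sketch of how such a tally can be produced, the snippet below counts Unicode general categories with Python's standard `unicodedata` module over ten RIDs copied from the report's sample rows. It is an illustration only, not ydata-profiling's actual implementation.

```python
import unicodedata
from collections import Counter

# Ten identifiers copied from the first sample rows of the report
rids = ["R0000076", "R0000082", "R0000083", "R0000085", "R0000087",
        "R0000096", "R0000104", "R0000110", "R0000112", "R0000115"]

# 'Nd' = Decimal Number, 'Lu' = Uppercase Letter
category_counts = Counter(
    unicodedata.category(ch) for rid in rids for ch in rid
)
total = sum(category_counts.values())

for category, count in category_counts.most_common():
    print(f"{category}: {count} ({count / total:.1%})")
```

For these ten identifiers the tally is 70 decimal digits (`Nd`) and 10 uppercase letters (`Lu`), which gives the same 12.5% share of uppercase letters that the report shows for the full column.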
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 422 | 60.3% |
2 | 51 | 7.3% |
1 | 48 | 6.9% |
3 | 48 | 6.9% |
5 | 27 | 3.9% |
4 | 26 | 3.7% |
8 | 22 | 3.1% |
7 | 20 | 2.9% |
6 | 20 | 2.9% |
9 | 16 | 2.3% |
Uppercase Letter
Value | Count | Frequency (%) |
R | 100 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 700 | 87.5% |
Latin | 100 | 12.5% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 422 | 60.3% |
2 | 51 | 7.3% |
1 | 48 | 6.9% |
3 | 48 | 6.9% |
5 | 27 | 3.9% |
4 | 26 | 3.7% |
8 | 22 | 3.1% |
7 | 20 | 2.9% |
6 | 20 | 2.9% |
9 | 16 | 2.3% |
Latin
Value | Count | Frequency (%) |
R | 100 | 100.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 800 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 422 | 52.8% |
R | 100 | 12.5% |
2 | 51 | 6.4% |
1 | 48 | 6.0% |
3 | 48 | 6.0% |
5 | 27 | 3.4% |
4 | 26 | 3.2% |
8 | 22 | 2.8% |
7 | 20 | 2.5% |
6 | 20 | 2.5% |
BDHT
Real number (ℝ)
Distinct | 55 |
---|---|
Distinct (%) | 55.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 162.734 |
Minimum | 150 |
---|---|
Maximum | 183 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 150 |
---|---|
5-th percentile | 151.975 |
Q1 | 156.075 |
median | 163 |
Q3 | 168.625 |
95-th percentile | 176.1 |
Maximum | 183 |
Range | 33 |
Interquartile range (IQR) | 12.55 |
Descriptive statistics
Standard deviation | 7.8114164 |
---|---|
Coefficient of variation (CV) | 0.048001133 |
Kurtosis | -0.45128775 |
Mean | 162.734 |
Median Absolute Deviation (MAD) | 6 |
Skewness | 0.40785802 |
Sum | 16273.4 |
Variance | 61.018226 |
Monotonicity | Not monotonic |
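Several of the figures above are functions of the others: the range is maximum minus minimum, the IQR is Q3 minus Q1, the variance is the squared standard deviation, the CV is the standard deviation divided by the mean, and the sum is the mean times the number of observations. The following sketch recomputes them from the values reported for BDHT as a consistency check. The variable names are illustrative.

```python
import math

# Values as reported for BDHT in this section
n = 100
minimum, maximum = 150.0, 183.0
q1, q3 = 156.075, 168.625
mean, std = 162.734, 7.8114164

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range
variance = std ** 2               # Variance
cv = std / mean                   # Coefficient of variation
total = mean * n                  # Sum

# Compare against the reported figures, allowing for rounding in the report
assert math.isclose(value_range, 33.0)
assert math.isclose(iqr, 12.55)
assert math.isclose(variance, 61.018226, rel_tol=1e-6)
assert math.isclose(cv, 0.048001133, rel_tol=1e-6)
assert math.isclose(total, 16273.4)
```

The same relations hold for the BDWT, SYSTOLIC, and DIASTOLIC sections, so the check can be repeated by substituting their reported values.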
Common values
Value | Count | Frequency (%) |
165.0 | 8 | 8.0% |
163.0 | 6 | 6.0% |
158.0 | 5 | 5.0% |
170.0 | 5 | 5.0% |
153.0 | 4 | 4.0% |
172.0 | 3 | 3.0% |
156.0 | 3 | 3.0% |
169.0 | 3 | 3.0% |
153.5 | 3 | 3.0% |
167.0 | 3 | 3.0% |
Other values (45) | 57 | 57.0% |
Minimum 10 values
Value | Count | Frequency (%) |
150.0 | 2 | 2.0% |
150.2 | 1 | 1.0% |
151.0 | 1 | 1.0% |
151.5 | 1 | 1.0% |
152.0 | 2 | 2.0% |
153.0 | 4 | 4.0% |
153.2 | 1 | 1.0% |
153.5 | 3 | 3.0% |
153.7 | 1 | 1.0% |
154.0 | 1 | 1.0% |
Maximum 10 values
Value | Count | Frequency (%) |
183.0 | 1 | 1.0% |
182.0 | 1 | 1.0% |
180.0 | 1 | 1.0% |
178.0 | 2 | 2.0% |
176.0 | 1 | 1.0% |
175.0 | 2 | 2.0% |
174.0 | 1 | 1.0% |
173.1 | 1 | 1.0% |
172.0 | 3 | 3.0% |
171.0 | 1 | 1.0% |
BDWT
Real number (ℝ)
Distinct | 57 |
---|---|
Distinct (%) | 57.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 65.3225 |
Minimum | 46.5 |
---|---|
Maximum | 90 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 46.5 |
---|---|
5-th percentile | 51.95 |
Q1 | 58.7375 |
median | 64 |
Q3 | 71.2 |
95-th percentile | 82.9725 |
Maximum | 90 |
Range | 43.5 |
Interquartile range (IQR) | 12.4625 |
Descriptive statistics
Standard deviation | 9.3748357 |
---|---|
Coefficient of variation (CV) | 0.14351618 |
Kurtosis | -0.27415835 |
Mean | 65.3225 |
Median Absolute Deviation (MAD) | 6 |
Skewness | 0.37750014 |
Sum | 6532.25 |
Variance | 87.887544 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
64.0 | 5 | 5.0% |
75.0 | 5 | 5.0% |
70.0 | 5 | 5.0% |
63.0 | 5 | 5.0% |
58.0 | 4 | 4.0% |
62.0 | 4 | 4.0% |
53.0 | 3 | 3.0% |
69.0 | 3 | 3.0% |
57.0 | 3 | 3.0% |
80.0 | 3 | 3.0% |
Other values (47) | 60 | 60.0% |
Minimum 10 values
Value | Count | Frequency (%) |
46.5 | 1 | 1.0% |
48.0 | 1 | 1.0% |
50.0 | 2 | 2.0% |
51.0 | 1 | 1.0% |
52.0 | 1 | 1.0% |
52.8 | 1 | 1.0% |
53.0 | 3 | 3.0% |
53.4 | 1 | 1.0% |
53.75 | 1 | 1.0% |
54.0 | 2 | 2.0% |
Maximum 10 values
Value | Count | Frequency (%) |
90.0 | 1 | 1.0% |
87.3 | 1 | 1.0% |
84.4 | 1 | 1.0% |
84.0 | 1 | 1.0% |
83.4 | 1 | 1.0% |
82.95 | 1 | 1.0% |
80.2 | 1 | 1.0% |
80.0 | 3 | 3.0% |
79.4 | 1 | 1.0% |
76.0 | 1 | 1.0% |
SYSTOLIC
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 32 |
---|---|
Distinct (%) | 32.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 127.88 |
Minimum | 100 |
---|---|
Maximum | 150 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 100 |
---|---|
5-th percentile | 106.9 |
Q1 | 120 |
median | 130 |
Q3 | 140 |
95-th percentile | 146.05 |
Maximum | 150 |
Range | 50 |
Interquartile range (IQR) | 20 |
Descriptive statistics
Standard deviation | 12.801736 |
---|---|
Coefficient of variation (CV) | 0.10010741 |
Kurtosis | -0.74962024 |
Mean | 127.88 |
Median Absolute Deviation (MAD) | 10 |
Skewness | -0.28859456 |
Sum | 12788 |
Variance | 163.88444 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
140 | 17 | 17.0% |
120 | 15 | 15.0% |
130 | 14 | 14.0% |
110 | 6 | 6.0% |
100 | 4 | 4.0% |
112 | 3 | 3.0% |
150 | 3 | 3.0% |
136 | 3 | 3.0% |
121 | 3 | 3.0% |
129 | 2 | 2.0% |
Other values (22) | 30 | 30.0% |
Minimum 10 values
Value | Count | Frequency (%) |
100 | 4 | 4.0% |
105 | 1 | 1.0% |
107 | 1 | 1.0% |
110 | 6 | 6.0% |
112 | 3 | 3.0% |
113 | 1 | 1.0% |
115 | 2 | 2.0% |
116 | 1 | 1.0% |
117 | 2 | 2.0% |
120 | 15 | 15.0% |
Maximum 10 values
Value | Count | Frequency (%) |
150 | 3 | 3.0% |
148 | 1 | 1.0% |
147 | 1 | 1.0% |
146 | 1 | 1.0% |
145 | 2 | 2.0% |
144 | 1 | 1.0% |
143 | 2 | 2.0% |
142 | 1 | 1.0% |
141 | 2 | 2.0% |
140 | 17 | 17.0% |
DIASTOLIC
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 29 |
---|---|
Distinct (%) | 29.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 74.05 |
Minimum | 50 |
---|---|
Maximum | 90 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 50 |
---|---|
5-th percentile | 59.95 |
Q1 | 70 |
median | 74.5 |
Q3 | 80 |
95-th percentile | 90 |
Maximum | 90 |
Range | 40 |
Interquartile range (IQR) | 10 |
Descriptive statistics
Standard deviation | 9.6172982 |
---|---|
Coefficient of variation (CV) | 0.12987574 |
Kurtosis | -0.45872115 |
Mean | 74.05 |
Median Absolute Deviation (MAD) | 5.5 |
Skewness | -0.29019658 |
Sum | 7405 |
Variance | 92.492424 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
80 | 27 | 27.0% |
70 | 17 | 17.0% |
90 | 10 | 10.0% |
60 | 9 | 9.0% |
71 | 4 | 4.0% |
83 | 3 | 3.0% |
67 | 3 | 3.0% |
72 | 2 | 2.0% |
74 | 2 | 2.0% |
78 | 2 | 2.0% |
Other values (19) | 21 | 21.0% |
Minimum 10 values
Value | Count | Frequency (%) |
50 | 1 | 1.0% |
52 | 1 | 1.0% |
54 | 1 | 1.0% |
55 | 1 | 1.0% |
59 | 1 | 1.0% |
60 | 9 | 9.0% |
61 | 1 | 1.0% |
62 | 1 | 1.0% |
64 | 1 | 1.0% |
65 | 2 | 2.0% |
Maximum 10 values
Value | Count | Frequency (%) |
90 | 10 | 10.0% |
86 | 1 | 1.0% |
85 | 1 | 1.0% |
83 | 3 | 3.0% |
82 | 1 | 1.0% |
81 | 1 | 1.0% |
80 | 27 | 27.0% |
78 | 2 | 2.0% |
77 | 2 | 2.0% |
76 | 1 | 1.0% |
Correlations
Variable | RID | BDHT | BDWT | SYSTOLIC | DIASTOLIC |
---|---|---|---|---|---|
RID | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
BDHT | 1.000 | 1.000 | 0.600 | 0.000 | 0.271 |
BDWT | 1.000 | 0.600 | 1.000 | 0.215 | 0.211 |
SYSTOLIC | 1.000 | 0.000 | 0.215 | 1.000 | 0.619 |
DIASTOLIC | 1.000 | 0.271 | 0.211 | 0.619 | 1.000 |
Variable | BDHT | BDWT | SYSTOLIC | DIASTOLIC |
---|---|---|---|---|
BDHT | 1.000 | 0.491 | -0.099 | 0.096 |
BDWT | 0.491 | 1.000 | -0.016 | 0.198 |
SYSTOLIC | -0.099 | -0.016 | 1.000 | 0.528 |
DIASTOLIC | 0.096 | 0.198 | 0.528 | 1.000 |
First rows
Row | RID | BDHT | BDWT | SYSTOLIC | DIASTOLIC |
---|---|---|---|---|---|
0 | R0000076 | 158.0 | 64.0 | 141 | 90 |
1 | R0000082 | 168.5 | 62.0 | 130 | 70 |
2 | R0000083 | 155.8 | 46.5 | 140 | 60 |
3 | R0000085 | 182.0 | 75.0 | 110 | 70 |
4 | R0000087 | 163.0 | 60.0 | 130 | 80 |
5 | R0000096 | 169.0 | 62.0 | 136 | 85 |
6 | R0000104 | 158.0 | 75.3 | 140 | 80 |
7 | R0000110 | 169.4 | 65.9 | 142 | 81 |
8 | R0000112 | 175.0 | 70.0 | 120 | 70 |
9 | R0000115 | 158.6 | 59.4 | 140 | 90 |
Last rows
Row | RID | BDHT | BDWT | SYSTOLIC | DIASTOLIC |
---|---|---|---|---|---|
90 | R0000384 | 170.0 | 69.0 | 110 | 70 |
91 | R0000394 | 152.0 | 50.0 | 120 | 70 |
92 | R0000395 | 167.0 | 73.0 | 136 | 83 |
93 | R0000404 | 162.0 | 62.0 | 145 | 74 |
94 | R0000405 | 165.0 | 63.0 | 125 | 80 |
95 | R0000408 | 175.0 | 65.6 | 120 | 70 |
96 | R0000409 | 173.1 | 84.4 | 140 | 80 |
97 | R0000415 | 156.0 | 54.0 | 121 | 71 |
98 | R0000423 | 159.0 | 62.0 | 130 | 80 |
99 | R0000425 | 165.0 | 69.0 | 117 | 76 |
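The report does not state which coefficient each correlation matrix uses. As a sketch of how pairwise coefficients of this kind can be computed, the snippet below implements Pearson's r and Spearman's rank correlation (Pearson on average ranks) with the standard library only, applied to the 20 first and last rows shown above. Because it uses 20 of the 100 observations, its results will not reproduce the matrices in this report. The function names are illustrative.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def average_ranks(values):
    """1-based ranks, with tied values sharing their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman correlation: Pearson applied to average ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# BDHT, BDWT, SYSTOLIC, DIASTOLIC from the 20 sample rows in this report
sample = [
    (158.0, 64.0, 141, 90), (168.5, 62.0, 130, 70), (155.8, 46.5, 140, 60),
    (182.0, 75.0, 110, 70), (163.0, 60.0, 130, 80), (169.0, 62.0, 136, 85),
    (158.0, 75.3, 140, 80), (169.4, 65.9, 142, 81), (175.0, 70.0, 120, 70),
    (158.6, 59.4, 140, 90), (170.0, 69.0, 110, 70), (152.0, 50.0, 120, 70),
    (167.0, 73.0, 136, 83), (162.0, 62.0, 145, 74), (165.0, 63.0, 125, 80),
    (175.0, 65.6, 120, 70), (173.1, 84.4, 140, 80), (156.0, 54.0, 121, 71),
    (159.0, 62.0, 130, 80), (165.0, 69.0, 117, 76),
]
names = ["BDHT", "BDWT", "SYSTOLIC", "DIASTOLIC"]
columns = {name: [row[i] for row in sample] for i, name in enumerate(names)}

for a in names:
    cells = " | ".join(f"{spearman(columns[a], columns[b]):.3f}" for b in names)
    print(f"{a} | {cells} |")
```

Spearman is often preferred for readings such as blood pressure that cluster on round values, since it depends only on rank order; swapping `spearman` for `pearson` in the final loop prints the linear coefficients instead.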