Overview

Dataset statistics

Number of variables5
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.5 KiB
Average record size in memory46.3 B

Variable types

Numeric5

Dataset

Description당뇨 환자들의 최초 진단시점점의 키, 몸무게와 같은 신체 계측 정보와 수축기/이완기 혈압을 포함하는 생체 징후 데이터. 키와 몸무게 데이터를 이용한 Body Mass Index(BMI)를 생성할 수 있으며 혈압 데이터를 이용하여 고혈압 여부를 판단할 수 있음
Author가톨릭대학교 은평성모병원
URLhttp://cmcdata.net/data/dataset/diabetes_vital-eunpyeong

Alerts

SYSTOLIC is highly overall correlated with DIASTOLICHigh correlation
DIASTOLIC is highly overall correlated with SYSTOLICHigh correlation
RID has unique valuesUnique

Reproduction

Analysis started2023-10-08 18:56:30.942515
Analysis finished2023-10-08 18:56:36.923739
Duration5.98 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

RID
Real number (ℝ)

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean50.5
Minimum1
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-10-09T03:56:37.050992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.95
Q125.75
median50.5
Q375.25
95-th percentile95.05
Maximum100
Range99
Interquartile range (IQR)49.5

Descriptive statistics

Standard deviation29.011492
Coefficient of variation (CV)0.57448499
Kurtosis-1.2
Mean50.5
Median Absolute Deviation (MAD)25
Skewness0
Sum5050
Variance841.66667
MonotonicityStrictly increasing
2023-10-09T03:56:37.308417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.0%
65 1
 
1.0%
75 1
 
1.0%
74 1
 
1.0%
73 1
 
1.0%
72 1
 
1.0%
71 1
 
1.0%
70 1
 
1.0%
69 1
 
1.0%
68 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
1 1
1.0%
2 1
1.0%
3 1
1.0%
4 1
1.0%
5 1
1.0%
6 1
1.0%
7 1
1.0%
8 1
1.0%
9 1
1.0%
10 1
1.0%
ValueCountFrequency (%)
100 1
1.0%
99 1
1.0%
98 1
1.0%
97 1
1.0%
96 1
1.0%
95 1
1.0%
94 1
1.0%
93 1
1.0%
92 1
1.0%
91 1
1.0%

BDHT
Real number (ℝ)

Distinct83
Distinct (%)83.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean160.696
Minimum141.5
Maximum186.7
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-10-09T03:56:37.838751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum141.5
5-th percentile146.395
Q1152.975
median160.25
Q3168.05
95-th percentile174.05
Maximum186.7
Range45.2
Interquartile range (IQR)15.075

Descriptive statistics

Standard deviation9.5459433
Coefficient of variation (CV)0.05940374
Kurtosis-0.72771125
Mean160.696
Median Absolute Deviation (MAD)7.75
Skewness0.11024353
Sum16069.6
Variance91.125034
MonotonicityNot monotonic
2023-10-09T03:56:38.212662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
168.0 4
 
4.0%
171.0 3
 
3.0%
154.0 2
 
2.0%
155.2 2
 
2.0%
159.0 2
 
2.0%
173.0 2
 
2.0%
169.4 2
 
2.0%
164.7 2
 
2.0%
152.4 2
 
2.0%
172.4 2
 
2.0%
Other values (73) 77
77.0%
ValueCountFrequency (%)
141.5 1
1.0%
142.7 1
1.0%
143.8 1
1.0%
144.5 1
1.0%
146.3 1
1.0%
146.4 1
1.0%
147.0 1
1.0%
147.5 1
1.0%
147.8 1
1.0%
148.1 1
1.0%
ValueCountFrequency (%)
186.7 1
1.0%
179.2 1
1.0%
178.3 1
1.0%
176.0 1
1.0%
175.0 1
1.0%
174.0 1
1.0%
173.1 1
1.0%
173.0 2
2.0%
172.4 2
2.0%
172.3 1
1.0%

BDWT
Real number (ℝ)

Distinct83
Distinct (%)83.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean70.664
Minimum41
Maximum616
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-10-09T03:56:38.617285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum41
5-th percentile48.095
Q156.025
median63.475
Q373.7
95-th percentile88.2875
Maximum616
Range575
Interquartile range (IQR)17.675

Descriptive statistics

Standard deviation56.351863
Coefficient of variation (CV)0.79746212
Kurtosis91.040006
Mean70.664
Median Absolute Deviation (MAD)8.55
Skewness9.3318279
Sum7066.4
Variance3175.5325
MonotonicityNot monotonic
2023-10-09T03:56:38.858156image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
74.0 4
 
4.0%
66.0 3
 
3.0%
55.8 3
 
3.0%
51.7 3
 
3.0%
73.0 3
 
3.0%
64.0 2
 
2.0%
57.7 2
 
2.0%
61.0 2
 
2.0%
62.0 2
 
2.0%
59.6 2
 
2.0%
Other values (73) 74
74.0%
ValueCountFrequency (%)
41.0 1
 
1.0%
44.5 1
 
1.0%
45.8 1
 
1.0%
47.2 1
 
1.0%
48.0 1
 
1.0%
48.1 1
 
1.0%
49.6 1
 
1.0%
49.7 1
 
1.0%
50.0 1
 
1.0%
51.7 3
3.0%
ValueCountFrequency (%)
616.0 1
1.0%
97.1 1
1.0%
94.2 1
1.0%
92.0 1
1.0%
89.0 1
1.0%
88.25 1
1.0%
86.1 1
1.0%
85.8 1
1.0%
83.9 1
1.0%
83.2 1
1.0%

SYSTOLIC
Real number (ℝ)

HIGH CORRELATION 

Distinct29
Distinct (%)29.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean132.81
Minimum90
Maximum481
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-10-09T03:56:39.063397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum90
5-th percentile100
Q1120
median130
Q3140
95-th percentile165.25
Maximum481
Range391
Interquartile range (IQR)20

Descriptive statistics

Standard deviation40.120449
Coefficient of variation (CV)0.30208906
Kurtosis58.068361
Mean132.81
Median Absolute Deviation (MAD)10
Skewness6.7983324
Sum13281
Variance1609.6504
MonotonicityNot monotonic
2023-10-09T03:56:39.275303image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=29)
ValueCountFrequency (%)
120 25
25.0%
130 17
17.0%
140 12
12.0%
110 9
 
9.0%
100 6
 
6.0%
160 3
 
3.0%
150 3
 
3.0%
145 2
 
2.0%
170 2
 
2.0%
137 2
 
2.0%
Other values (19) 19
19.0%
ValueCountFrequency (%)
90 1
 
1.0%
100 6
 
6.0%
103 1
 
1.0%
110 9
 
9.0%
111 1
 
1.0%
115 1
 
1.0%
118 1
 
1.0%
119 1
 
1.0%
120 25
25.0%
122 1
 
1.0%
ValueCountFrequency (%)
481 1
 
1.0%
212 1
 
1.0%
180 1
 
1.0%
170 2
2.0%
165 1
 
1.0%
162 1
 
1.0%
160 3
3.0%
159 1
 
1.0%
158 1
 
1.0%
150 3
3.0%

DIASTOLIC
Real number (ℝ)

HIGH CORRELATION 

Distinct25
Distinct (%)25.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean83.9
Minimum50
Maximum802
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-10-09T03:56:39.491783image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum50
5-th percentile60
Q170
median80
Q380
95-th percentile93
Maximum802
Range752
Interquartile range (IQR)10

Descriptive statistics

Standard deviation73.243402
Coefficient of variation (CV)0.87298453
Kurtosis96.07348
Mean83.9
Median Absolute Deviation (MAD)10
Skewness9.7064455
Sum8390
Variance5364.596
MonotonicityNot monotonic
2023-10-09T03:56:39.779482image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%)
80 33
33.0%
70 18
18.0%
90 10
 
10.0%
60 10
 
10.0%
84 3
 
3.0%
75 3
 
3.0%
82 2
 
2.0%
76 2
 
2.0%
69 2
 
2.0%
93 2
 
2.0%
Other values (15) 15
15.0%
ValueCountFrequency (%)
50 1
 
1.0%
56 1
 
1.0%
59 1
 
1.0%
60 10
10.0%
63 1
 
1.0%
66 1
 
1.0%
68 1
 
1.0%
69 2
 
2.0%
70 18
18.0%
74 1
 
1.0%
ValueCountFrequency (%)
802 1
 
1.0%
100 1
 
1.0%
99 1
 
1.0%
95 1
 
1.0%
93 2
 
2.0%
92 1
 
1.0%
90 10
10.0%
88 1
 
1.0%
84 3
 
3.0%
83 1
 
1.0%

Interactions

2023-10-09T03:56:35.011559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-10-09T03:56:31.179863image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-10-09T03:56:32.213958image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-10-09T03:56:33.078880image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-10-09T03:56:34.060817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-10-09T03:56:35.319460image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-10-09T03:56:31.510848image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-10-09T03:56:32.396056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-10-09T03:56:33.270374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-10-09T03:56:34.214299image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-10-09T03:56:35.677438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-10-09T03:56:31.703949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-10-09T03:56:32.587608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-10-09T03:56:33.498589image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-10-09T03:56:34.372008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-10-09T03:56:35.938142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-10-09T03:56:31.857414image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-10-09T03:56:32.739371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-10-09T03:56:33.720805image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-10-09T03:56:34.537143image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-10-09T03:56:36.201399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-10-09T03:56:32.037979image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-10-09T03:56:32.906961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-10-09T03:56:33.907581image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-10-09T03:56:34.817209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-10-09T03:56:40.128815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
RIDBDHTBDWTSYSTOLICDIASTOLIC
RID1.0000.2910.0410.1740.041
BDHT\t0.2911.0000.0000.4010.000
BDWT\t0.0410.0001.0000.0000.693
SYSTOLIC0.1740.4010.0001.0000.000
DIASTOLIC0.0410.0000.6930.0001.000
2023-10-09T03:56:40.438668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
RIDBDHTBDWTSYSTOLICDIASTOLIC
RID1.0000.0390.0780.026-0.077
BDHT\t0.0391.0000.4930.1540.263
BDWT\t0.0780.4931.0000.0160.071
SYSTOLIC0.0260.1540.0161.0000.570
DIASTOLIC-0.0770.2630.0710.5701.000

Missing values

2023-10-09T03:56:36.608290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-10-09T03:56:36.835522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

RIDBDHTBDWTSYSTOLICDIASTOLIC
01154.271.5514082
12162.058.812080
23153.749.611070
34160.855.417090
45186.794.214595
56154.982.011060
67148.159.6517056
78167.261.011070
89147.855.814050
910172.262.812080
RIDBDHTBDWTSYSTOLICDIASTOLIC
9091141.561.512070
9192160.262.012070
9293155.074.012070
9394166.364.813090
9495174.066.021290
9596151.762.013068
9697149.745.814080
9798173.092.015875
9899164.783.211070
99100171.878.414080