Overview

Dataset statistics

Number of variables3
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.7 KiB
Average record size in memory27.3 B

Variable types

Numeric1
Categorical2

Dataset

Description병원정보시스템에 저장되어 있는 전체 데이터로 부터 고지혈증 연구를 위한 선정기준을 적용한 쿼리문을 생성하여 추출한 코호트의 인구통계학적 정보 데이터임. 스타틴을 최초 처방받은 환자들의 최초 처방 당시의 연령, 성별 데이터를 이용하여 연령대별 특성과 성별 특성을 분석할 수 있음. -SEX : 0은 남자, 1은 여자로 구분 하였음
Author가톨릭대학교 은평성모병원
URLhttp://cmcdata.net/data/dataset/demographic-data-dyslipidemia-eunpyeong

Alerts

일련번호 has unique valuesUnique

Reproduction

Analysis started2023-10-08 18:57:07.303751
Analysis finished2023-10-08 18:57:07.955968
Duration0.65 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

일련번호
Real number (ℝ)

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean50.5
Minimum1
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-10-09T03:57:08.100434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.95
Q125.75
median50.5
Q375.25
95-th percentile95.05
Maximum100
Range99
Interquartile range (IQR)49.5

Descriptive statistics

Standard deviation29.011492
Coefficient of variation (CV)0.57448499
Kurtosis-1.2
Mean50.5
Median Absolute Deviation (MAD)25
Skewness0
Sum5050
Variance841.66667
MonotonicityStrictly increasing
2023-10-09T03:57:08.444123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.0%
65 1
 
1.0%
75 1
 
1.0%
74 1
 
1.0%
73 1
 
1.0%
72 1
 
1.0%
71 1
 
1.0%
70 1
 
1.0%
69 1
 
1.0%
68 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
1 1
1.0%
2 1
1.0%
3 1
1.0%
4 1
1.0%
5 1
1.0%
6 1
1.0%
7 1
1.0%
8 1
1.0%
9 1
1.0%
10 1
1.0%
ValueCountFrequency (%)
100 1
1.0%
99 1
1.0%
98 1
1.0%
97 1
1.0%
96 1
1.0%
95 1
1.0%
94 1
1.0%
93 1
1.0%
92 1
1.0%
91 1
1.0%

Age_grp
Categorical

Distinct5
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
70대
34 
60대
29 
50대
21 
40대
80대

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row50대
2nd row50대
3rd row70대
4th row50대
5th row80대

Common Values

ValueCountFrequency (%)
70대 34
34.0%
60대 29
29.0%
50대 21
21.0%
40대 9
 
9.0%
80대 7
 
7.0%

Length

2023-10-09T03:57:08.726457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-10-09T03:57:08.937053image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
70대 34
34.0%
60대 29
29.0%
50대 21
21.0%
40대 9
 
9.0%
80대 7
 
7.0%

SEX
Categorical

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
1
57 
0
43 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row1

Common Values

ValueCountFrequency (%)
1 57
57.0%
0 43
43.0%

Length

2023-10-09T03:57:09.156467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-10-09T03:57:09.394714image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 57
57.0%
0 43
43.0%

Interactions

2023-10-09T03:57:07.492647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-10-09T03:57:09.510765image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일련번호Age_grpSEX
일련번호1.0000.0000.248
Age_grp0.0001.0000.306
SEX0.2480.3061.000
2023-10-09T03:57:09.662028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
SEXAge_grp
SEX1.0000.368
Age_grp0.3681.000
2023-10-09T03:57:09.981465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일련번호Age_grpSEX
일련번호1.0000.0000.180
Age_grp0.0001.0000.368
SEX0.1800.3681.000

Missing values

2023-10-09T03:57:07.698941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-10-09T03:57:07.904905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

일련번호Age_grpSEX
0150대0
1250대0
2370대0
3450대0
4580대1
5660대0
6760대0
7840대0
8970대0
91070대0
일련번호Age_grpSEX
909160대1
919250대1
929360대0
939440대0
949570대1
959640대0
969750대0
979860대1
989970대1
9910050대1