Overview

Dataset statistics

Number of variables3
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.7 KiB
Average record size in memory27.3 B

Variable types

Numeric1
Categorical2

Dataset

Description병원정보시스템에 저장되어 있는 전체 데이터에서 ICD-10 코드 중 F101, F102, F103, F104, F109의 진단코드를 가진 환자와 K700, K701, K703, K7030, K7031, K7041, K709의 진단코드를 가진 환자들을 추출한 코호트의 인구통계학적 정보 데이터임. 환자들의 최초 처방 당시의 연령, 성별 데이터를 이용하여 연령대별 특성과 성별 특성을 분석할 수 있음. -SEX : 0은 남자, 1은 여자로 구분 하였음
Author가톨릭대학교 은평성모병원
URLhttp://cmcdata.net/data/dataset/demographic-data-alcohol-use-disorder-eunpyeong

Alerts

RID has unique valuesUnique

Reproduction

Analysis started2023-10-08 18:56:06.708157
Analysis finished2023-10-08 18:56:08.486846
Duration1.78 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

RID
Real number (ℝ)

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean50.5
Minimum1
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-10-09T03:56:09.191407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.95
Q125.75
median50.5
Q375.25
95-th percentile95.05
Maximum100
Range99
Interquartile range (IQR)49.5

Descriptive statistics

Standard deviation29.011492
Coefficient of variation (CV)0.57448499
Kurtosis-1.2
Mean50.5
Median Absolute Deviation (MAD)25
Skewness0
Sum5050
Variance841.66667
MonotonicityStrictly increasing
2023-10-09T03:56:09.609627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.0%
65 1
 
1.0%
75 1
 
1.0%
74 1
 
1.0%
73 1
 
1.0%
72 1
 
1.0%
71 1
 
1.0%
70 1
 
1.0%
69 1
 
1.0%
68 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
1 1
1.0%
2 1
1.0%
3 1
1.0%
4 1
1.0%
5 1
1.0%
6 1
1.0%
7 1
1.0%
8 1
1.0%
9 1
1.0%
10 1
1.0%
ValueCountFrequency (%)
100 1
1.0%
99 1
1.0%
98 1
1.0%
97 1
1.0%
96 1
1.0%
95 1
1.0%
94 1
1.0%
93 1
1.0%
92 1
1.0%
91 1
1.0%

AGE
Categorical

Distinct7
Distinct (%)7.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
50대
37 
60대
22 
40대
15 
30대
11 
70대
11 
Other values (2)

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row50대
2nd row50대
3rd row30대
4th row40대
5th row20대

Common Values

ValueCountFrequency (%)
50대 37
37.0%
60대 22
22.0%
40대 15
15.0%
30대 11
 
11.0%
70대 11
 
11.0%
20대 2
 
2.0%
80대 2
 
2.0%

Length

2023-10-09T03:56:09.855587image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-10-09T03:56:10.103890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
50대 37
37.0%
60대 22
22.0%
40대 15
15.0%
30대 11
 
11.0%
70대 11
 
11.0%
20대 2
 
2.0%
80대 2
 
2.0%

SEX
Categorical

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
0
82 
1
18 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 82
82.0%
1 18
 
18.0%

Length

2023-10-09T03:56:10.401499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-10-09T03:56:10.676879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 82
82.0%
1 18
 
18.0%

Interactions

2023-10-09T03:56:07.403903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-10-09T03:56:10.833540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
RIDAGESEX
RID1.0000.1940.000
AGE0.1941.0000.106
SEX0.0000.1061.000
2023-10-09T03:56:11.068119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
SEXAGE
SEX1.0000.108
AGE0.1081.000
2023-10-09T03:56:11.257962image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
RIDAGESEX
RID1.0000.0920.000
AGE0.0921.0000.108
SEX0.0000.1081.000

Missing values

2023-10-09T03:56:07.944998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-10-09T03:56:08.373117image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

RIDAGESEX
0150대0
1250대0
2330대0
3440대0
4520대0
5630대1
6750대0
7860대0
8940대0
91050대0
RIDAGESEX
909130대0
919250대0
929360대0
939440대0
949550대0
959630대0
969750대0
979840대1
989930대0
9910060대0