Overview

Dataset statistics

Number of variables4
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.4 KiB
Average record size in memory34.3 B

Variable types

Numeric1
Categorical3

Dataset

Description알코올 사용 장애 환자들의 최초 진단과와 최초 진단명과 진단코드 데이터. 진단과로는 소화기내과, 정신건강의학과, 응급의학과, 가정의학과 심장내과 등이 포함되어 환자 유입 경로를 분석할 수 있음. 진단코드는 ICD-11 코드와 SNOMED-CT 코드로 매핑됨.
Author가톨릭대학교 은평성모병원
URLhttp://cmcdata.net/data/dataset/diagnosis-data-alcohol-use-disorder-eunpyeong

Alerts

RID has unique valuesUnique

Reproduction

Analysis started2023-10-08 18:57:57.624094
Analysis finished2023-10-08 18:57:58.302634
Duration0.68 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

RID
Real number (ℝ)

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean50.5
Minimum1
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-10-09T03:57:58.448041image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.95
Q125.75
median50.5
Q375.25
95-th percentile95.05
Maximum100
Range99
Interquartile range (IQR)49.5

Descriptive statistics

Standard deviation29.011492
Coefficient of variation (CV)0.57448499
Kurtosis-1.2
Mean50.5
Median Absolute Deviation (MAD)25
Skewness0
Sum5050
Variance841.66667
MonotonicityStrictly increasing
2023-10-09T03:57:58.701629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.0%
65 1
 
1.0%
75 1
 
1.0%
74 1
 
1.0%
73 1
 
1.0%
72 1
 
1.0%
71 1
 
1.0%
70 1
 
1.0%
69 1
 
1.0%
68 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
1 1
1.0%
2 1
1.0%
3 1
1.0%
4 1
1.0%
5 1
1.0%
6 1
1.0%
7 1
1.0%
8 1
1.0%
9 1
1.0%
10 1
1.0%
ValueCountFrequency (%)
100 1
1.0%
99 1
1.0%
98 1
1.0%
97 1
1.0%
96 1
1.0%
95 1
1.0%
94 1
1.0%
93 1
1.0%
92 1
1.0%
91 1
1.0%

DEPTNM
Categorical

Distinct7
Distinct (%)7.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
정신건강의학과
57 
응급의학과
21 
소화기내과
16 
신경외과
 
2
내분비내과
 
2
Other values (2)
 
2

Length

Max length7
Median length7
Mean length6.11
Min length4

Unique

Unique2 ?
Unique (%)2.0%

Sample

1st row정신건강의학과
2nd row정신건강의학과
3rd row정신건강의학과
4th row신경외과
5th row정신건강의학과

Common Values

ValueCountFrequency (%)
정신건강의학과 57
57.0%
응급의학과 21
 
21.0%
소화기내과 16
 
16.0%
신경외과 2
 
2.0%
내분비내과 2
 
2.0%
완화의학과 1
 
1.0%
정형외과 1
 
1.0%

Length

2023-10-09T03:57:58.925777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-10-09T03:57:59.136537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정신건강의학과 57
57.0%
응급의학과 21
 
21.0%
소화기내과 16
 
16.0%
신경외과 2
 
2.0%
내분비내과 2
 
2.0%
완화의학과 1
 
1.0%
정형외과 1
 
1.0%

DIAGCD
Categorical

Distinct5
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
F102
69 
F104
14 
F101
F109
 
4
F103
 
4

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowF102
2nd rowF102
3rd rowF102
4th rowF102
5th rowF102

Common Values

ValueCountFrequency (%)
F102 69
69.0%
F104 14
 
14.0%
F101 9
 
9.0%
F109 4
 
4.0%
F103 4
 
4.0%

Length

2023-10-09T03:57:59.300094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-10-09T03:57:59.463814image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
f102 69
69.0%
f104 14
 
14.0%
f101 9
 
9.0%
f109 4
 
4.0%
f103 4
 
4.0%

DIAG_date
Categorical

Distinct6
Distinct (%)6.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2019-01-01T00:00:00
41 
2017-01-01T00:00:00
16 
2018-01-01T00:00:00
16 
2015-01-01T00:00:00
11 
2020-01-01T00:00:00
10 

Length

Max length19
Median length19
Mean length19
Min length19

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2015-01-01T00:00:00
2nd row2019-01-01T00:00:00
3rd row2017-01-01T00:00:00
4th row2018-01-01T00:00:00
5th row2018-01-01T00:00:00

Common Values

ValueCountFrequency (%)
2019-01-01T00:00:00 41
41.0%
2017-01-01T00:00:00 16
 
16.0%
2018-01-01T00:00:00 16
 
16.0%
2015-01-01T00:00:00 11
 
11.0%
2020-01-01T00:00:00 10
 
10.0%
2016-01-01T00:00:00 6
 
6.0%

Length

2023-10-09T03:57:59.621882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-10-09T03:57:59.799513image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2019-01-01t00:00:00 41
41.0%
2017-01-01t00:00:00 16
 
16.0%
2018-01-01t00:00:00 16
 
16.0%
2015-01-01t00:00:00 11
 
11.0%
2020-01-01t00:00:00 10
 
10.0%
2016-01-01t00:00:00 6
 
6.0%

Interactions

2023-10-09T03:57:57.923955image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-10-09T03:57:59.947791image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
RIDDEPTNMDIAGCDDIAG_date
RID1.0000.0000.3800.203
DEPTNM0.0001.0000.4050.251
DIAGCD0.3800.4051.0000.369
DIAG_date0.2030.2510.3691.000
2023-10-09T03:58:00.091889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
DEPTNMDIAGCDDIAG_date
DEPTNM1.0000.2700.149
DIAGCD0.2701.0000.258
DIAG_date0.1490.2581.000
2023-10-09T03:58:00.213248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
RIDDEPTNMDIAGCDDIAG_date
RID1.0000.0000.1600.101
DEPTNM0.0001.0000.2700.149
DIAGCD0.1600.2701.0000.258
DIAG_date0.1010.1490.2581.000

Missing values

2023-10-09T03:57:58.133196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-10-09T03:57:58.249845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

RIDDEPTNMDIAGCDDIAG_date
01정신건강의학과F1022015-01-01T00:00:00
12정신건강의학과F1022019-01-01T00:00:00
23정신건강의학과F1022017-01-01T00:00:00
34신경외과F1022018-01-01T00:00:00
45정신건강의학과F1022018-01-01T00:00:00
56응급의학과F1022019-01-01T00:00:00
67정신건강의학과F1022019-01-01T00:00:00
78정신건강의학과F1092015-01-01T00:00:00
89정신건강의학과F1022019-01-01T00:00:00
910정신건강의학과F1022019-01-01T00:00:00
RIDDEPTNMDIAGCDDIAG_date
9091응급의학과F1022017-01-01T00:00:00
9192소화기내과F1022019-01-01T00:00:00
9293정신건강의학과F1022019-01-01T00:00:00
9394정신건강의학과F1022017-01-01T00:00:00
9495정신건강의학과F1022019-01-01T00:00:00
9596정신건강의학과F1022019-01-01T00:00:00
9697정신건강의학과F1022015-01-01T00:00:00
9798소화기내과F1042016-01-01T00:00:00
9899소화기내과F1042019-01-01T00:00:00
99100정신건강의학과F1012018-01-01T00:00:00