Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 278 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 25 |
Duplicate rows (%) | 9.0% |
Total size in memory | 16.1 KiB |
Average record size in memory | 59.5 B |
Variable types
Numeric | 1 |
---|---|
Categorical | 6 |
Dataset
Description | 국립정신건강센터 입원환자 기초정보 데이터 입니다. 연령대, 성별, 상병코드, 상병명, 보험유형, 진료연도, 진료월 등이 포함되어 있습니다. |
---|---|
Author | 보건복지부 국립정신건강센터 |
URL | https://www.data.go.kr/data/15059739/fileData.do |
진료년도 has constant value "" | Constant |
Dataset has 25 (9.0%) duplicate rows | Duplicates |
상병명 is highly overall correlated with 상병코드 | High correlation |
상병코드 is highly overall correlated with 상병명 | High correlation |
Reproduction
Analysis started | 2023-12-12 09:09:53.566948 |
---|---|
Analysis finished | 2023-12-12 09:09:54.389088 |
Duration | 0.82 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
연령대
Real number (ℝ)
Distinct | 8 |
---|---|
Distinct (%) | 2.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 43.417266 |
Minimum | 10 |
---|---|
Maximum | 80 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.6 KiB |
Quantile statistics
Minimum | 10 |
---|---|
5-th percentile | 20 |
Q1 | 30 |
median | 50 |
Q3 | 50 |
95-th percentile | 70 |
Maximum | 80 |
Range | 70 |
Interquartile range (IQR) | 20 |
Descriptive statistics
Standard deviation | 15.512147 |
---|---|
Coefficient of variation (CV) | 0.3572806 |
Kurtosis | -0.72539903 |
Mean | 43.417266 |
Median Absolute Deviation (MAD) | 10 |
Skewness | -0.057879162 |
Sum | 12070 |
Variance | 240.6267 |
Monotonicity | Increasing |
Value | Count | Frequency (%) |
50 | 75 | |
60 | 54 | |
30 | 50 | |
40 | 42 | |
20 | 38 | |
70 | 10 | 3.6% |
80 | 5 | 1.8% |
10 | 4 | 1.4% |
Value | Count | Frequency (%) |
10 | 4 | 1.4% |
20 | 38 | |
30 | 50 | |
40 | 42 | |
50 | 75 | |
60 | 54 | |
70 | 10 | 3.6% |
80 | 5 | 1.8% |
Value | Count | Frequency (%) |
80 | 5 | 1.8% |
70 | 10 | 3.6% |
60 | 54 | |
50 | 75 | |
40 | 42 | |
30 | 50 | |
20 | 38 | |
10 | 4 | 1.4% |
성별
Categorical
Distinct | 2 |
---|---|
Distinct (%) | 0.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.3 KiB |
남 | |
---|---|
여 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 남 |
---|---|
2nd row | 남 |
3rd row | 남 |
4th row | 여 |
5th row | 남 |
Common Values
Value | Count | Frequency (%) |
남 | 162 | |
여 | 116 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
남 | 162 | |
여 | 116 |
상병코드
Categorical
HIGH CORRELATION
 
Distinct | 39 |
---|---|
Distinct (%) | 14.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.3 KiB |
U07.1 | |
---|---|
F20.0 | |
F20.9 | |
F29 | |
F31.2 | |
Other values (34) |
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 4.8489209 |
Min length | 3 |
Unique
Unique | 14 ? |
---|---|
Unique (%) | 5.0% |
Sample
1st row | F31.9 |
---|---|
2nd row | F32.1 |
3rd row | F91.8 |
4th row | F91.8 |
5th row | F10.2 |
Common Values
Value | Count | Frequency (%) |
U07.1 | 101 | |
F20.0 | 24 | 8.6% |
F20.9 | 23 | 8.3% |
F29 | 17 | 6.1% |
F31.2 | 12 | 4.3% |
F32.1 | 12 | 4.3% |
F10.2 | 9 | 3.2% |
F32.9 | 7 | 2.5% |
F32.2 | 7 | 2.5% |
F31.9 | 7 | 2.5% |
Other values (29) | 59 |
Length
Value | Count | Frequency (%) |
u07.1 | 101 | |
f20.0 | 24 | 8.6% |
f20.9 | 23 | 8.3% |
f29 | 17 | 6.1% |
f31.2 | 12 | 4.3% |
f32.1 | 12 | 4.3% |
f10.2 | 9 | 3.2% |
f32.9 | 7 | 2.5% |
f32.2 | 7 | 2.5% |
f31.9 | 7 | 2.5% |
Other values (29) | 59 |
상병명
Categorical
HIGH CORRELATION
 
Distinct | 40 |
---|---|
Distinct (%) | 14.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.3 KiB |
Coronavirus disease 2019, virus identified [COVID-19, virus identified] | |
---|---|
Paranoid schizophrenia | |
Schizophrenia, unspecified | |
Unspecified nonorganic psychosis | |
Bipolar affective disorder, current episode manic with psychotic symptoms | |
Other values (35) |
Length
Max length | 104 |
---|---|
Median length | 97 |
Mean length | 49.568345 |
Min length | 19 |
Unique
Unique | 14 ? |
---|---|
Unique (%) | 5.0% |
Sample
1st row | Bipolar affective disorder, unspecified |
---|---|
2nd row | Moderate depressive episode |
3rd row | Other conduct disorders |
4th row | Other conduct disorders |
5th row | Dependence syndrome of alcohol |
Common Values
Value | Count | Frequency (%) |
Coronavirus disease 2019, virus identified [COVID-19, virus identified] | 98 | |
Paranoid schizophrenia | 24 | 8.6% |
Schizophrenia, unspecified | 23 | 8.3% |
Unspecified nonorganic psychosis | 17 | 6.1% |
Bipolar affective disorder, current episode manic with psychotic symptoms | 12 | 4.3% |
Moderate depressive episode | 12 | 4.3% |
Dependence syndrome of alcohol | 9 | 3.2% |
Depressive episode, unspecified | 7 | 2.5% |
Severe depressive episode without psychotic symptoms | 7 | 2.5% |
Bipolar affective disorder, unspecified | 7 | 2.5% |
Other values (30) | 62 |
Length
Value | Count | Frequency (%) |
virus | 196 | 12.8% |
identified | 196 | 12.8% |
disease | 103 | 6.7% |
coronavirus | 101 | 6.6% |
2019 | 98 | 6.4% |
covid-19 | 98 | 6.4% |
unspecified | 62 | 4.0% |
schizophrenia | 53 | 3.5% |
episode | 51 | 3.3% |
disorder | 47 | 3.1% |
Other values (68) | 530 |
보험유형
Categorical
Distinct | 6 |
---|---|
Distinct (%) | 2.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.3 KiB |
건강보험 | |
---|---|
의료급여1종 | |
의료급여2종 | |
의료급여2종장애 | 8 |
건강보험장애인 | 4 |
Length
Max length | 8 |
---|---|
Median length | 7 |
Mean length | 5.1007194 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 건강보험 |
---|---|
2nd row | 건강보험 |
3rd row | 건강보험 |
4th row | 건강보험 |
5th row | 건강보험 |
Common Values
Value | Count | Frequency (%) |
건강보험 | 129 | |
의료급여1종 | 118 | |
의료급여2종 | 16 | 5.8% |
의료급여2종장애 | 8 | 2.9% |
건강보험장애인 | 4 | 1.4% |
일반 | 3 | 1.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
건강보험 | 129 | |
의료급여1종 | 118 | |
의료급여2종 | 16 | 5.8% |
의료급여2종장애 | 8 | 2.9% |
건강보험장애인 | 4 | 1.4% |
일반 | 3 | 1.1% |
진료년도
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.3 KiB |
2020 |
---|
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2020 |
---|---|
2nd row | 2020 |
3rd row | 2020 |
4th row | 2020 |
5th row | 2020 |
Common Values
Value | Count | Frequency (%) |
2020 | 278 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2020 | 278 |
진료월
Categorical
Distinct | 5 |
---|---|
Distinct (%) | 1.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.3 KiB |
4 | |
---|---|
8 | |
5 | |
7 | |
6 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 7 |
---|---|
2nd row | 5 |
3rd row | 6 |
4th row | 8 |
5th row | 4 |
Common Values
Value | Count | Frequency (%) |
4 | 74 | |
8 | 73 | |
5 | 57 | |
7 | 45 | |
6 | 29 | 10.4% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
4 | 74 | |
8 | 73 | |
5 | 57 | |
7 | 45 | |
6 | 29 | 10.4% |
연령대 | 성별 | 상병코드 | 상병명 | 보험유형 | 진료월 | |
---|---|---|---|---|---|---|
연령대 | 1.000 | 0.000 | 0.711 | 0.742 | 0.230 | 0.140 |
성별 | 0.000 | 1.000 | 0.407 | 0.432 | 0.251 | 0.000 |
상병코드 | 0.711 | 0.407 | 1.000 | 1.000 | 0.724 | 0.579 |
상병명 | 0.742 | 0.432 | 1.000 | 1.000 | 0.717 | 0.611 |
보험유형 | 0.230 | 0.251 | 0.724 | 0.717 | 1.000 | 0.304 |
진료월 | 0.140 | 0.000 | 0.579 | 0.611 | 0.304 | 1.000 |
진료월 | 상병명 | 상병코드 | 보험유형 | 성별 | |
---|---|---|---|---|---|
진료월 | 1.000 | 0.292 | 0.291 | 0.210 | 0.000 |
상병명 | 0.292 | 1.000 | 0.998 | 0.381 | 0.320 |
상병코드 | 0.291 | 0.998 | 1.000 | 0.385 | 0.318 |
보험유형 | 0.210 | 0.381 | 0.385 | 1.000 | 0.179 |
성별 | 0.000 | 0.320 | 0.318 | 0.179 | 1.000 |
연령대 | 성별 | 상병코드 | 상병명 | 보험유형 | 진료월 | |
---|---|---|---|---|---|---|
연령대 | 1.000 | 0.000 | 0.341 | 0.344 | 0.129 | 0.085 |
성별 | 0.000 | 1.000 | 0.318 | 0.320 | 0.179 | 0.000 |
상병코드 | 0.341 | 0.318 | 1.000 | 0.998 | 0.385 | 0.291 |
상병명 | 0.344 | 0.320 | 0.998 | 1.000 | 0.381 | 0.292 |
보험유형 | 0.129 | 0.179 | 0.385 | 0.381 | 1.000 | 0.210 |
진료월 | 0.085 | 0.000 | 0.291 | 0.292 | 0.210 | 1.000 |
연령대 | 성별 | 상병코드 | 상병명 | 보험유형 | 진료년도 | 진료월 | |
---|---|---|---|---|---|---|---|
0 | 10 | 남 | F31.9 | Bipolar affective disorder, unspecified | 건강보험 | 2020 | 7 |
1 | 10 | 남 | F32.1 | Moderate depressive episode | 건강보험 | 2020 | 5 |
2 | 10 | 남 | F91.8 | Other conduct disorders | 건강보험 | 2020 | 6 |
3 | 10 | 여 | F91.8 | Other conduct disorders | 건강보험 | 2020 | 8 |
4 | 20 | 남 | F10.2 | Dependence syndrome of alcohol | 건강보험 | 2020 | 4 |
5 | 20 | 남 | F20.0 | Paranoid schizophrenia | 건강보험 | 2020 | 7 |
6 | 20 | 남 | F20.0 | Paranoid schizophrenia | 건강보험 | 2020 | 8 |
7 | 20 | 남 | F20.9 | Schizophrenia, unspecified | 건강보험 | 2020 | 8 |
8 | 20 | 남 | F25.1 | Schizoaffective disorder, depressive type | 건강보험 | 2020 | 7 |
9 | 20 | 남 | F25.1 | Schizoaffective disorder, depressive type | 건강보험 | 2020 | 8 |
연령대 | 성별 | 상병코드 | 상병명 | 보험유형 | 진료년도 | 진료월 | |
---|---|---|---|---|---|---|---|
268 | 70 | 남 | U07.1 | Coronavirus disease 2019[COVID-19] | 의료급여1종 | 2020 | 4 |
269 | 70 | 남 | U07.1 | Coronavirus disease 2019, virus identified [COVID-19, virus identified] | 의료급여1종 | 2020 | 4 |
270 | 70 | 남 | U07.1 | Coronavirus disease 2019, virus identified [COVID-19, virus identified] | 의료급여1종 | 2020 | 5 |
271 | 70 | 여 | F25.1 | Schizoaffective disorder, depressive type | 건강보험 | 2020 | 4 |
272 | 70 | 여 | F32.1 | Moderate depressive episode | 건강보험 | 2020 | 7 |
273 | 80 | 남 | F00.1 | Dementia in Alzheimer’s disease with late onset(G30.1†) | 건강보험 | 2020 | 8 |
274 | 80 | 남 | F10.2 | Dependence syndrome of alcohol | 건강보험 | 2020 | 7 |
275 | 80 | 남 | F10.2 | Dependence syndrome of alcohol | 건강보험 | 2020 | 8 |
276 | 80 | 남 | F22.0 | Delusional disorder | 건강보험 | 2020 | 4 |
277 | 80 | 여 | F32.1 | Moderate depressive episode | 의료급여1종 | 2020 | 8 |
Most frequently occurring
연령대 | 성별 | 상병코드 | 상병명 | 보험유형 | 진료년도 | 진료월 | # duplicates | |
---|---|---|---|---|---|---|---|---|
14 | 50 | 남 | U07.1 | Coronavirus disease 2019, virus identified [COVID-19, virus identified] | 의료급여1종 | 2020 | 4 | 9 |
21 | 60 | 남 | U07.1 | Coronavirus disease 2019, virus identified [COVID-19, virus identified] | 의료급여1종 | 2020 | 4 | 8 |
22 | 60 | 남 | U07.1 | Coronavirus disease 2019, virus identified [COVID-19, virus identified] | 의료급여1종 | 2020 | 5 | 7 |
15 | 50 | 남 | U07.1 | Coronavirus disease 2019, virus identified [COVID-19, virus identified] | 의료급여1종 | 2020 | 5 | 6 |
17 | 50 | 여 | U07.1 | Coronavirus disease 2019, virus identified [COVID-19, virus identified] | 의료급여1종 | 2020 | 4 | 6 |
18 | 50 | 여 | U07.1 | Coronavirus disease 2019, virus identified [COVID-19, virus identified] | 의료급여1종 | 2020 | 5 | 5 |
7 | 30 | 남 | U07.1 | Coronavirus disease 2019, virus identified [COVID-19, virus identified] | 의료급여1종 | 2020 | 4 | 4 |
23 | 60 | 여 | U07.1 | Coronavirus disease 2019, virus identified [COVID-19, virus identified] | 의료급여1종 | 2020 | 4 | 4 |
24 | 60 | 여 | U07.1 | Coronavirus disease 2019, virus identified [COVID-19, virus identified] | 의료급여1종 | 2020 | 5 | 4 |
6 | 30 | 남 | U07.1 | Coronavirus disease 2019, virus identified [COVID-19, virus identified] | 건강보험 | 2020 | 8 | 3 |