Overview

Dataset statistics

Number of variables8
Number of observations7375
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory482.7 KiB
Average record size in memory67.0 B

Variable types

Categorical5
Numeric3

Dataset

Description2010년 ~ 2021년간 정신질환 상병 11개 그룹*에 대한 연도별, 건강보험가입자격별, 성별, 연령별 진료실인원(명), 진료건수(건), 총진료비(천원) *상병분류(11개 그룹) ①F00-F09 ②F10-F19 ③F20-F29 ④F30-F39 ⑤F40-F48 ⑥F50-F59 ⑦F60-F69 ⑧F70-F79 ⑨F80-F89 ⑩F90-F98 ⑪F99 1. 상병분류 : 정신질환 상병 11개 그룹 2. 진료연도 : 2010~2021년 3. 건강보험 가입구분 : 지역가입자, 직장가입자 4. 성별 : 남녀 5. 연령 : (65세 미만) 5세 단위, (65세 이상) 한 그룹 ※ 데이터 변경('23.4.14일 제공 건에 데이터 추가) 1. 데이터 발췌기간 추가(2010~2018년 데이터 추가) - (기존) 2019~2021년 - (변경) 2010~2021년 2. 데이터 항목 추가(진료건수, 총진료비 추가) - (기존) 주상병코드-진료년도-가입자구분-성별-연령-진료실인원(명) - (변경) 주상병코드-진료년도-가입자구분-성별-연령-진료실인원(명)-진료건수(건)-총진료비(천원)
URLhttps://www.data.go.kr/data/15113343/fileData.do

Alerts

진료실인원(명) is highly overall correlated with 진료건수(건) and 1 other fieldsHigh correlation
진료건수(건) is highly overall correlated with 진료실인원(명) and 1 other fieldsHigh correlation
총진료비(천원) is highly overall correlated with 진료실인원(명) and 1 other fieldsHigh correlation
총진료비(천원) is highly skewed (γ1 = 20.35347344)Skewed

Reproduction

Analysis started2023-12-12 12:21:36.344667
Analysis finished2023-12-12 12:21:39.186952
Duration2.84 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

주상병코드
Categorical

Distinct11
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size57.7 KiB
F00-F09
672 
F30-F39
672 
F40-F48
672 
F50-F59
672 
F60-F69
672 
Other values (6)
4015 

Length

Max length7
Median length7
Mean length6.6355254
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowF00-F09
2nd rowF00-F09
3rd rowF00-F09
4th rowF00-F09
5th rowF00-F09

Common Values

ValueCountFrequency (%)
F00-F09 672
9.1%
F30-F39 672
9.1%
F40-F48 672
9.1%
F50-F59 672
9.1%
F60-F69 672
9.1%
F70-F79 672
9.1%
F80-F89 672
9.1%
F90-F98 672
9.1%
F99 672
9.1%
F10-F19 671
9.1%

Length

2023-12-12T21:21:39.282692image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
f00-f09 672
9.1%
f30-f39 672
9.1%
f40-f48 672
9.1%
f50-f59 672
9.1%
f60-f69 672
9.1%
f70-f79 672
9.1%
f80-f89 672
9.1%
f90-f98 672
9.1%
f99 672
9.1%
f10-f19 671
9.1%

진료년도
Categorical

Distinct12
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size57.7 KiB
2010년
616 
2012년
616 
2014년
615 
2015년
615 
2017년
615 
Other values (7)
4298 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2010년
2nd row2010년
3rd row2010년
4th row2010년
5th row2010년

Common Values

ValueCountFrequency (%)
2010년 616
8.4%
2012년 616
8.4%
2014년 615
8.3%
2015년 615
8.3%
2017년 615
8.3%
2018년 615
8.3%
2013년 614
8.3%
2016년 614
8.3%
2019년 614
8.3%
2020년 614
8.3%
Other values (2) 1227
16.6%

Length

2023-12-12T21:21:39.451334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2010년 616
8.4%
2012년 616
8.4%
2014년 615
8.3%
2015년 615
8.3%
2017년 615
8.3%
2018년 615
8.3%
2013년 614
8.3%
2016년 614
8.3%
2019년 614
8.3%
2020년 614
8.3%
Other values (2) 1227
16.6%

가입자구분
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size57.7 KiB
직장
3689 
지역
3686 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row지역
2nd row지역
3rd row지역
4th row지역
5th row지역

Common Values

ValueCountFrequency (%)
직장 3689
50.0%
지역 3686
50.0%

Length

2023-12-12T21:21:39.645615image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:21:39.769494image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
직장 3689
50.0%
지역 3686
50.0%

성별
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size57.7 KiB
남자
3692 
여자
3683 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row남자
2nd row남자
3rd row남자
4th row남자
5th row남자

Common Values

ValueCountFrequency (%)
남자 3692
50.1%
여자 3683
49.9%

Length

2023-12-12T21:21:39.880271image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:21:39.996334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
남자 3692
50.1%
여자 3683
49.9%

연령
Categorical

Distinct14
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size57.7 KiB
10-14세
528 
15-19세
528 
20-24세
528 
25-29세
528 
30-34세
528 
Other values (9)
4735 

Length

Max length6
Median length6
Mean length5.7182373
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0-4세
2nd row5-9세
3rd row10-14세
4th row15-19세
5th row20-24세

Common Values

ValueCountFrequency (%)
10-14세 528
 
7.2%
15-19세 528
 
7.2%
20-24세 528
 
7.2%
25-29세 528
 
7.2%
30-34세 528
 
7.2%
35-39세 528
 
7.2%
40-44세 528
 
7.2%
45-49세 528
 
7.2%
50-54세 528
 
7.2%
55-59세 528
 
7.2%
Other values (4) 2095
28.4%

Length

2023-12-12T21:21:40.143242image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
10-14세 528
 
6.7%
15-19세 528
 
6.7%
20-24세 528
 
6.7%
25-29세 528
 
6.7%
30-34세 528
 
6.7%
35-39세 528
 
6.7%
40-44세 528
 
6.7%
45-49세 528
 
6.7%
50-54세 528
 
6.7%
55-59세 528
 
6.7%
Other values (5) 2623
33.2%

진료실인원(명)
Real number (ℝ)

HIGH CORRELATION 

Distinct3580
Distinct (%)48.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5252.4452
Minimum1
Maximum387085
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size64.9 KiB
2023-12-12T21:21:40.322023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile19
Q198.5
median512
Q34206
95-th percentile22910.5
Maximum387085
Range387084
Interquartile range (IQR)4107.5

Descriptive statistics

Standard deviation16895.071
Coefficient of variation (CV)3.2166106
Kurtosis162.59502
Mean5252.4452
Median Absolute Deviation (MAD)481
Skewness10.479419
Sum38736783
Variance2.8544342 × 108
MonotonicityNot monotonic
2023-12-12T21:21:40.503361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
53 32
 
0.4%
13 31
 
0.4%
41 31
 
0.4%
19 30
 
0.4%
56 30
 
0.4%
21 29
 
0.4%
50 29
 
0.4%
48 27
 
0.4%
1 27
 
0.4%
17 27
 
0.4%
Other values (3570) 7082
96.0%
ValueCountFrequency (%)
1 27
0.4%
2 24
0.3%
3 11
0.1%
4 24
0.3%
5 16
0.2%
6 13
0.2%
7 22
0.3%
8 21
0.3%
9 22
0.3%
10 20
0.3%
ValueCountFrequency (%)
387085 1
< 0.1%
366000 1
< 0.1%
363060 1
< 0.1%
333242 1
< 0.1%
294757 1
< 0.1%
266550 1
< 0.1%
235737 1
< 0.1%
219252 1
< 0.1%
208388 1
< 0.1%
196062 1
< 0.1%

진료건수(건)
Real number (ℝ)

HIGH CORRELATION 

Distinct4918
Distinct (%)66.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean29738.433
Minimum1
Maximum2183920
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size64.9 KiB
2023-12-12T21:21:40.707196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile36
Q1317
median2095
Q323695.5
95-th percentile135948.5
Maximum2183920
Range2183919
Interquartile range (IQR)23378.5

Descriptive statistics

Standard deviation96125.462
Coefficient of variation (CV)3.2323647
Kurtosis164.03053
Mean29738.433
Median Absolute Deviation (MAD)2043
Skewness10.527596
Sum2.1932095 × 108
Variance9.2401045 × 109
MonotonicityNot monotonic
2023-12-12T21:21:40.854929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
28 20
 
0.3%
9 17
 
0.2%
1 16
 
0.2%
23 16
 
0.2%
41 15
 
0.2%
4 15
 
0.2%
18 14
 
0.2%
92 14
 
0.2%
14 14
 
0.2%
2 14
 
0.2%
Other values (4908) 7220
97.9%
ValueCountFrequency (%)
1 16
0.2%
2 14
0.2%
3 11
0.1%
4 15
0.2%
5 6
 
0.1%
6 4
 
0.1%
7 8
0.1%
8 9
0.1%
9 17
0.2%
10 6
 
0.1%
ValueCountFrequency (%)
2183920 1
< 0.1%
2073728 1
< 0.1%
2071808 1
< 0.1%
1910147 1
< 0.1%
1701252 1
< 0.1%
1519995 1
< 0.1%
1346882 1
< 0.1%
1254574 1
< 0.1%
1195605 1
< 0.1%
1128030 1
< 0.1%

총진료비(천원)
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct7295
Distinct (%)98.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4548413.9
Minimum9
Maximum1.0308651 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size64.9 KiB
2023-12-12T21:21:41.002952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum9
5-th percentile2536.1
Q135296.5
median287765
Q32194154.5
95-th percentile11850874
Maximum1.0308651 × 109
Range1.0308651 × 109
Interquartile range (IQR)2158858

Descriptive statistics

Standard deviation36487228
Coefficient of variation (CV)8.0219673
Kurtosis475.78059
Mean4548413.9
Median Absolute Deviation (MAD)281527
Skewness20.353473
Sum3.3544553 × 1010
Variance1.3313178 × 1015
MonotonicityNot monotonic
2023-12-12T21:21:41.155848image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
7096 3
 
< 0.1%
16663 3
 
< 0.1%
2273 3
 
< 0.1%
12443 2
 
< 0.1%
149407 2
 
< 0.1%
138040 2
 
< 0.1%
62983 2
 
< 0.1%
5697 2
 
< 0.1%
16074 2
 
< 0.1%
1316 2
 
< 0.1%
Other values (7285) 7352
99.7%
ValueCountFrequency (%)
9 1
< 0.1%
17 1
< 0.1%
18 1
< 0.1%
21 2
< 0.1%
25 1
< 0.1%
33 1
< 0.1%
35 1
< 0.1%
37 1
< 0.1%
44 1
< 0.1%
47 1
< 0.1%
ValueCountFrequency (%)
1030865119 1
< 0.1%
1016868741 1
< 0.1%
1006704376 1
< 0.1%
995896862 1
< 0.1%
916208754 1
< 0.1%
814682853 1
< 0.1%
703734994 1
< 0.1%
618884278 1
< 0.1%
607082655 1
< 0.1%
581032393 1
< 0.1%

Interactions

2023-12-12T21:21:38.542179image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:21:37.254672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:21:37.759288image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:21:38.681276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:21:37.431419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:21:37.896558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:21:38.804031image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:21:37.623172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:21:38.020037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:21:41.266124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
주상병코드진료년도가입자구분성별연령진료실인원(명)진료건수(건)총진료비(천원)
주상병코드1.0000.0000.0000.0000.0000.1970.2150.160
진료년도0.0001.0000.0000.0000.0000.0000.0000.000
가입자구분0.0000.0001.0000.0000.0000.0880.1050.024
성별0.0000.0000.0001.0000.0000.0790.0890.048
연령0.0000.0000.0000.0001.0000.3290.2910.191
진료실인원(명)0.1970.0000.0880.0790.3291.0000.9930.963
진료건수(건)0.2150.0000.1050.0890.2910.9931.0000.959
총진료비(천원)0.1600.0000.0240.0480.1910.9630.9591.000
2023-12-12T21:21:41.388641image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연령가입자구분진료년도주상병코드성별
연령1.0000.0000.0000.0000.000
가입자구분0.0001.0000.0000.0000.000
진료년도0.0000.0001.0000.0000.000
주상병코드0.0000.0000.0001.0000.000
성별0.0000.0000.0000.0001.000
2023-12-12T21:21:41.529127image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
진료실인원(명)진료건수(건)총진료비(천원)주상병코드진료년도가입자구분성별연령
진료실인원(명)1.0000.9840.9180.0850.0000.0680.0600.139
진료건수(건)0.9841.0000.9450.0930.0000.0800.0680.121
총진료비(천원)0.9180.9451.0000.0690.0000.0180.0370.078
주상병코드0.0850.0930.0691.0000.0000.0000.0000.000
진료년도0.0000.0000.0000.0001.0000.0000.0000.000
가입자구분0.0680.0800.0180.0000.0001.0000.0000.000
성별0.0600.0680.0370.0000.0000.0001.0000.000
연령0.1390.1210.0780.0000.0000.0000.0001.000

Missing values

2023-12-12T21:21:38.961747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:21:39.117367image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

주상병코드진료년도가입자구분성별연령진료실인원(명)진료건수(건)총진료비(천원)
0F00-F092010년지역남자0-4세27312049
1F00-F092010년지역남자5-9세43775605
2F00-F092010년지역남자10-14세8226118678
3F00-F092010년지역남자15-19세15940738788
4F00-F092010년지역남자20-24세18163884044
5F00-F092010년지역남자25-29세236897238417
6F00-F092010년지역남자30-34세3051212259404
7F00-F092010년지역남자35-39세4461473541388
8F00-F092010년지역남자40-44세6012171763366
9F00-F092010년지역남자45-49세81129211083172
주상병코드진료년도가입자구분성별연령진료실인원(명)진료건수(건)총진료비(천원)
7365F992021년직장여자20-24세17159855791
7366F992021년직장여자25-29세17055146854
7367F992021년직장여자30-34세11928125381
7368F992021년직장여자35-39세11830229174
7369F992021년직장여자40-44세9825525671
7370F992021년직장여자45-49세7216315352
7371F992021년직장여자50-54세9425036340
7372F992021년직장여자55-59세8516942373
7373F992021년직장여자60-64세8517143534
7374F992021년직장여자65세 이상379633314150