Overview

Dataset statistics

Number of variables: 10
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 888.7 KiB
Average record size in memory: 91.0 B
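
The average record size can be cross-checked from the total memory figure. The snippet below is a minimal sketch of that arithmetic, assuming 1 KiB = 1024 bytes; the variable names are illustrative and not part of the report.

```python
# Figures copied from the dataset statistics above.
total_kib = 888.7
n_rows = 10_000

# Convert KiB to bytes (1 KiB = 1024 B) and divide by the number of observations.
avg_record_bytes = total_kib * 1024 / n_rows
print(f"{avg_record_bytes:.1f} B")
```

The result agrees with the reported 91.0 B per record.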

Variable types

DateTime: 1
Categorical: 6
Numeric: 3

Dataset

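Description:
1. 진료년월 (treatment year-month): monthly, 2020 to 2022, by date of treatment.
2. 주소지 (address): nationwide total plus 17 provinces and metropolitan cities, based on the patient's registered residence.
3. 주상병코드 (primary diagnosis code): F20 to F41, F70 to F90, and F95 (primary diagnosis names are given in the file itself).
4. 연령 (age): under 20, or 20 and over (as of year-end).
5. 요양기관종별 (type of care institution): tertiary general hospital, general hospital, hospital, psychiatric hospital, clinic, and others.
6. 진료과목코드 (department code): psychiatry, or other than psychiatry.
7. Figures are National Health Insurance benefit claims for each category (Medical Aid excluded); Korean medicine classifications, pharmacies, and non-covered services are also excluded.
- Reflects payments made through June 2023.
※ These disease statistics are extracted by primary diagnosis from claims in which care institutions assigned a provisional diagnosis, based on the patient's complaints and symptoms, before the diagnosis was confirmed. They may therefore differ from the final confirmed disease.
※ Extracted on 2024-02-21 in response to a citizen's public data request.
Author: 국민건강보험공단 (National Health Insurance Service)
URL: https://www.data.go.kr/data/15126812/fileData.do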

Alerts

주상병코드 is highly overall correlated with 주상병명 (High correlation)
주상병명 is highly overall correlated with 주상병코드 (High correlation)
진료인원(명) is highly overall correlated with 진료건수(건) and 1 other fields (High correlation)
진료건수(건) is highly overall correlated with 진료인원(명) and 1 other fields (High correlation)
진료비(천원) is highly overall correlated with 진료인원(명) and 1 other fields (High correlation)
진료인원(명) is highly skewed (γ1 = 33.95306179) (Skewed)
진료건수(건) is highly skewed (γ1 = 34.74413441) (Skewed)
진료비(천원) is highly skewed (γ1 = 27.41465436) (Skewed)
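
The γ1 values in the alerts are skewness coefficients. As a rough illustration of why a few very large rows among many small ones push skewness this high, the sketch below computes the moment-based skewness g1 = m3 / m2^1.5 in plain Python. The toy data and the function name are hypothetical, and the profiler's exact estimator (for example, a bias-adjusted variant) may differ slightly from this one.

```python
def skewness(values):
    """Moment-based skewness g1 = m3 / m2**1.5 (no bias adjustment)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n  # second central moment
    m3 = sum((v - mean) ** 3 for v in values) / n  # third central moment
    return m3 / m2 ** 1.5

# Symmetric data has no skew.
symmetric = [1, 2, 3, 4, 5]

# Hypothetical toy data: mostly small counts plus one aggregate-sized row.
heavy_tail = [1] * 50 + [2] * 30 + [7] * 19 + [250_000]

print(skewness(symmetric))
print(skewness(heavy_tail))
```

In this dataset the large rows plausibly come from aggregate groups such as the 전국 (nationwide) totals, which sit alongside many groups with only one or two patients.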

Reproduction

Analysis started: 2024-03-14 17:01:58.114462
Analysis finished: 2024-03-14 17:02:03.547070
Duration: 5.43 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

진료년월
DateTime

Distinct: 36
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2020-01-01 00:00:00
Maximum: 2022-12-01 00:00:00
Histogram with fixed size bins (bins=36)

주소지
Categorical

Distinct: 18
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
전국: 2337
경기도: 718
서울특별시: 649
경상남도: 499
인천광역시: 486
Other values (13): 5311

Length

Max length: 7
Median length: 5
Mean length: 3.9059
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 강원도
2nd row: 부산광역시
3rd row: 광주광역시
4th row: 전라남도
5th row: 서울특별시

Common Values

Value | Count | Frequency (%)
전국 | 2337 | 23.4%
경기도 | 718 | 7.2%
서울특별시 | 649 | 6.5%
경상남도 | 499 | 5.0%
인천광역시 | 486 | 4.9%
충청북도 | 480 | 4.8%
전라남도 | 477 | 4.8%
부산광역시 | 468 | 4.7%
충청남도 | 463 | 4.6%
대구광역시 | 451 | 4.5%
Other values (8) | 2972 | 29.7%

Length

Histogram of lengths of the category

주상병코드
Categorical

HIGH CORRELATION 

Distinct: 32
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
F41: 658
F32: 565
F84: 521
F31: 506
F90: 486
Other values (27): 7264

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: F70
2nd row: F23
3rd row: F39
4th row: F32
5th row: F78

Common Values

Value | Count | Frequency (%)
F41 | 658 | 6.6%
F32 | 565 | 5.7%
F84 | 521 | 5.2%
F31 | 506 | 5.1%
F90 | 486 | 4.9%
F20 | 447 | 4.5%
F95 | 434 | 4.3%
F70 | 417 | 4.2%
F33 | 399 | 4.0%
F34 | 398 | 4.0%
Other values (22) | 5169 | 51.7%

Length

Histogram of lengths of the category

주상병명
Categorical

HIGH CORRELATION 

Distinct: 32
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
기타불안장애: 658
우울에피소드: 565
전반발달장애: 521
양극성정동장애: 506
운동과다장애: 486
Other values (27): 7264

Length

Max length: 13
Median length: 12
Mean length: 7.5771
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 경도정신지체
2nd row: 급성및일과성정신병장애
3rd row: 상세불명의기분[정동]장애
4th row: 우울에피소드
5th row: 기타정신지체

Common Values

Value | Count | Frequency (%)
기타불안장애 | 658 | 6.6%
우울에피소드 | 565 | 5.7%
전반발달장애 | 521 | 5.2%
양극성정동장애 | 506 | 5.1%
운동과다장애 | 486 | 4.9%
조현병 | 447 | 4.5%
틱장애 | 434 | 4.3%
경도정신지체 | 417 | 4.2%
재발성우울장애 | 399 | 4.0%
지속성기분[정동]장애 | 398 | 4.0%
Other values (22) | 5169 | 51.7%

Length

Histogram of lengths of the category

연령
Categorical

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
20세 이상: 6015
20세 미만: 3985

Length

Max length: 6
Median length: 6
Mean length: 6
Min length: 6

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 20세 미만
2nd row: 20세 이상
3rd row: 20세 이상
4th row: 20세 이상
5th row: 20세 이상

Common Values

Value | Count | Frequency (%)
20세 이상 | 6015 | 60.2%
20세 미만 | 3985 | 39.9%
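
The two percentages for 연령 add up to 100.1% because each share is rounded on its own. The sketch below reproduces the displayed figures, assuming half-up rounding to one decimal place; the rounding mode is an assumption chosen because it matches the table, not something the report states.

```python
from decimal import Decimal, ROUND_HALF_UP

# Counts copied from the Common Values table for 연령.
counts = {"20세 이상": 6015, "20세 미만": 3985}
n_rows = sum(counts.values())

def pct(count, total):
    """Percentage of total, rounded half-up to one decimal place."""
    exact = Decimal(count) * 100 / Decimal(total)
    return exact.quantize(Decimal("0.1"), rounding=ROUND_HALF_UP)

shares = {value: pct(count, n_rows) for value, count in counts.items()}
rounded_total = sum(shares.values())

print(shares)         # the exact shares are 60.15 and 39.85
print(rounded_total)  # the rounded shares no longer sum to exactly 100
```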

Length

Histogram of lengths of the category

Common Values (Plot)


Words

Value | Count | Frequency (%)
20세 | 10000 | 50.0%
이상 | 6015 | 30.1%
미만 | 3985 | 19.9%
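
The last table for 연령 counts whitespace-separated tokens rather than whole category values, which is why 20세 appears 10000 times. A minimal sketch of that counting, assuming a plain whitespace split (the profiler's actual tokenizer may apply further normalization):

```python
from collections import Counter

# Value counts for 연령, copied from the Common Values table.
value_counts = {"20세 이상": 6015, "20세 미만": 3985}

# Split each category on whitespace and weight every token by its row count.
word_counts = Counter()
for value, count in value_counts.items():
    for word in value.split():
        word_counts[word] += count

total = sum(word_counts.values())
for word, count in word_counts.most_common():
    print(word, count, f"{100 * count / total:.1f}%")
```

The percentages are taken over all 20000 tokens, not over the 10000 rows.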

요양기관종별
Categorical

Distinct: 13
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
의원: 2358
종합병원: 1991
상급종합병원: 1909
병원: 1689
요양병원: 1163
Other values (8): 890

Length

Max length: 13
Median length: 6
Mean length: 3.7832
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 상급종합병원
2nd row: 병원
3rd row: 정신병원
4th row: 보건소
5th row: 요양병원

Common Values

Value | Count | Frequency (%)
의원 | 2358 | 23.6%
종합병원 | 1991 | 19.9%
상급종합병원 | 1909 | 19.1%
병원 | 1689 | 16.9%
요양병원 | 1163 | 11.6%
정신병원 | 259 | 2.6%
보건의료원(병원화보건소) | 242 | 2.4%
한방병원 | 161 | 1.6%
보건소 | 79 | 0.8%
치과병원 | 69 | 0.7%
Other values (3) | 80 | 0.8%

Length

Histogram of lengths of the category

진료과목코드
Categorical

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
정신건강의학과: 6428
정신건강의학과 이외: 3572

Length

Max length: 10
Median length: 7
Mean length: 8.0716
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 정신건강의학과 이외
2nd row: 정신건강의학과
3rd row: 정신건강의학과
4th row: 정신건강의학과 이외
5th row: 정신건강의학과

Common Values

Value | Count | Frequency (%)
정신건강의학과 | 6428 | 64.3%
정신건강의학과 이외 | 3572 | 35.7%

Length

Histogram of lengths of the category

Common Values (Plot)


Words

Value | Count | Frequency (%)
정신건강의학과 | 10000 | 73.7%
이외 | 3572 | 26.3%

진료인원(명)
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct: 1041
Distinct (%): 10.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 439.6388
Minimum: 1
Maximum: 269942
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 2
Median: 7
Q3: 41
95-th percentile: 776.2
Maximum: 269942
Range: 269941
Interquartile range (IQR): 39

Descriptive statistics

Standard deviation: 6230.403
Coefficient of variation (CV): 14.17164
Kurtosis: 1267.6926
Mean: 439.6388
Median Absolute Deviation (MAD): 6
Skewness: 33.953062
Sum: 4396388
Variance: 38817921
Monotonicity: Not monotonic
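
Several of the figures for 진료인원(명) are tied together by simple identities (sum = mean × n, variance = standard deviation squared, CV = standard deviation / mean, IQR = Q3 − Q1, range = maximum − minimum). The sketch below recomputes them from the reported inputs as a consistency check; the dictionary and its keys are illustrative names only.

```python
# Figures copied from the 진료인원(명) statistics above.
n, mean, std = 10_000, 439.6388, 6230.403
q1, q3, minimum, maximum = 2, 41, 1, 269_942

# Each derived quantity is recomputed from the reported inputs.
checks = {
    "sum": mean * n,             # reported: 4396388
    "variance": std ** 2,        # reported: 38817921
    "cv": std / mean,            # reported: 14.17164
    "iqr": q3 - q1,              # reported: 39
    "range": maximum - minimum,  # reported: 269941
}

for name, value in checks.items():
    print(f"{name}: {value:.7g}")
```

A median of 7 against a mean of about 440 is the same heavy right tail that the skewness alert flags.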
Histogram with fixed size bins (bins=50)

Common Values

Value | Count | Frequency (%)
1 | 2095 | 20.9%
2 | 970 | 9.7%
3 | 663 | 6.6%
4 | 457 | 4.6%
5 | 335 | 3.4%
6 | 278 | 2.8%
7 | 246 | 2.5%
8 | 229 | 2.3%
10 | 165 | 1.7%
9 | 164 | 1.6%
Other values (1031) | 4398 | 44.0%

Minimum 10 values

Value | Count | Frequency (%)
1 | 2095 | 20.9%
2 | 970 | 9.7%
3 | 663 | 6.6%
4 | 457 | 4.6%
5 | 335 | 3.4%
6 | 278 | 2.8%
7 | 246 | 2.5%
8 | 229 | 2.3%
9 | 164 | 1.6%
10 | 165 | 1.7%

Maximum 10 values

Value | Count | Frequency (%)
269942 | 1 | < 0.1%
260251 | 1 | < 0.1%
236735 | 1 | < 0.1%
220031 | 1 | < 0.1%
196812 | 1 | < 0.1%
194852 | 1 | < 0.1%
173721 | 1 | < 0.1%
58121 | 1 | < 0.1%
57077 | 1 | < 0.1%
48739 | 1 | < 0.1%

진료건수(건)
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct: 1195
Distinct (%): 11.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 658.815
Minimum: 1
Maximum: 434750
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 2
Median: 10
Q3: 58
95-th percentile: 1103.15
Maximum: 434750
Range: 434749
Interquartile range (IQR): 56

Descriptive statistics

Standard deviation: 9964.3028
Coefficient of variation (CV): 15.124584
Kurtosis: 1326.6203
Mean: 658.815
Median Absolute Deviation (MAD): 9
Skewness: 34.744134
Sum: 6588150
Variance: 99287331
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Common Values

Value | Count | Frequency (%)
1 | 1637 | 16.4%
2 | 867 | 8.7%
3 | 595 | 5.9%
4 | 485 | 4.9%
5 | 298 | 3.0%
6 | 295 | 2.9%
7 | 238 | 2.4%
8 | 236 | 2.4%
9 | 204 | 2.0%
10 | 173 | 1.7%
Other values (1185) | 4972 | 49.7%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1637 | 16.4%
2 | 867 | 8.7%
3 | 595 | 5.9%
4 | 485 | 4.9%
5 | 298 | 3.0%
6 | 295 | 2.9%
7 | 238 | 2.4%
8 | 236 | 2.4%
9 | 204 | 2.0%
10 | 173 | 1.7%

Maximum 10 values

Value | Count | Frequency (%)
434750 | 1 | < 0.1%
433519 | 1 | < 0.1%
381601 | 1 | < 0.1%
373601 | 1 | < 0.1%
292119 | 1 | < 0.1%
291519 | 1 | < 0.1%
269736 | 1 | < 0.1%
105970 | 1 | < 0.1%
97072 | 1 | < 0.1%
88165 | 1 | < 0.1%

진료비(천원)
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct: 5383
Distinct (%): 53.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 59514.356
Minimum: 0
Maximum: 26018714
Zeros: 3
Zeros (%): < 0.1%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 18
Q1: 159
Median: 1076.5
Q3: 7802
95-th percentile: 127951.85
Maximum: 26018714
Range: 26018714
Interquartile range (IQR): 7643

Descriptive statistics

Standard deviation: 670704.69
Coefficient of variation (CV): 11.269629
Kurtosis: 865.2059
Mean: 59514.356
Median Absolute Deviation (MAD): 1048.5
Skewness: 27.414654
Sum: 5.9514356 × 10^8
Variance: 4.4984479 × 10^11
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Common Values

Value | Count | Frequency (%)
16 | 91 | 0.9%
12 | 84 | 0.8%
23 | 66 | 0.7%
5 | 55 | 0.5%
14 | 48 | 0.5%
25 | 43 | 0.4%
6 | 39 | 0.4%
27 | 38 | 0.4%
17 | 34 | 0.3%
18 | 33 | 0.3%
Other values (5373) | 9469 | 94.7%

Minimum 10 values

Value | Count | Frequency (%)
0 | 3 | < 0.1%
3 | 1 | < 0.1%
4 | 14 | 0.1%
5 | 55 | 0.5%
6 | 39 | 0.4%
7 | 22 | 0.2%
8 | 18 | 0.2%
9 | 11 | 0.1%
10 | 5 | 0.1%
11 | 32 | 0.3%

Maximum 10 values

Value | Count | Frequency (%)
26018714 | 1 | < 0.1%
25631322 | 1 | < 0.1%
22034220 | 1 | < 0.1%
20666916 | 1 | < 0.1%
19859347 | 1 | < 0.1%
19041601 | 1 | < 0.1%
14474906 | 1 | < 0.1%
14417275 | 1 | < 0.1%
12538888 | 1 | < 0.1%
12402863 | 1 | < 0.1%

Interactions


Correlations

 | 진료년월 | 주소지 | 주상병코드 | 주상병명 | 연령 | 요양기관종별 | 진료과목코드 | 진료인원(명) | 진료건수(건) | 진료비(천원)
진료년월 | 1.000 | 0.588 | 0.103 | 0.103 | 0.035 | 0.286 | 0.118 | 0.183 | 0.157 | 0.195
주소지 | 0.588 | 1.000 | 0.156 | 0.156 | 0.071 | 0.241 | 0.204 | 0.000 | 0.000 | 0.000
주상병코드 | 0.103 | 0.156 | 1.000 | 1.000 | 0.367 | 0.253 | 0.239 | 0.039 | 0.083 | 0.100
주상병명 | 0.103 | 0.156 | 1.000 | 1.000 | 0.367 | 0.253 | 0.239 | 0.039 | 0.083 | 0.100
연령 | 0.035 | 0.071 | 0.367 | 0.367 | 1.000 | 0.160 | 0.090 | 0.011 | 0.028 | 0.034
요양기관종별 | 0.286 | 0.241 | 0.253 | 0.253 | 0.160 | 1.000 | 0.219 | 0.000 | 0.000 | 0.051
진료과목코드 | 0.118 | 0.204 | 0.239 | 0.239 | 0.090 | 0.219 | 1.000 | 0.023 | 0.031 | 0.044
진료인원(명) | 0.183 | 0.000 | 0.039 | 0.039 | 0.011 | 0.000 | 0.023 | 1.000 | 0.955 | 0.919
진료건수(건) | 0.157 | 0.000 | 0.083 | 0.083 | 0.028 | 0.000 | 0.031 | 0.955 | 1.000 | 0.941
진료비(천원) | 0.195 | 0.000 | 0.100 | 0.100 | 0.034 | 0.051 | 0.044 | 0.919 | 0.941 | 1.000

 | 주상병코드 | 연령 | 주상병명 | 주소지 | 요양기관종별 | 진료과목코드
주상병코드 | 1.000 | 0.292 | 1.000 | 0.043 | 0.082 | 0.190
연령 | 0.292 | 1.000 | 0.292 | 0.056 | 0.149 | 0.058
주상병명 | 1.000 | 0.292 | 1.000 | 0.043 | 0.082 | 0.190
주소지 | 0.043 | 0.056 | 0.043 | 1.000 | 0.085 | 0.160
요양기관종별 | 0.082 | 0.149 | 0.082 | 0.085 | 1.000 | 0.204
진료과목코드 | 0.190 | 0.058 | 0.190 | 0.160 | 0.204 | 1.000

 | 진료인원(명) | 진료건수(건) | 진료비(천원) | 주소지 | 주상병코드 | 주상병명 | 연령 | 요양기관종별 | 진료과목코드
진료인원(명) | 1.000 | 0.977 | 0.873 | 0.000 | 0.016 | 0.016 | 0.012 | 0.000 | 0.025
진료건수(건) | 0.977 | 1.000 | 0.869 | 0.000 | 0.036 | 0.036 | 0.020 | 0.000 | 0.022
진료비(천원) | 0.873 | 0.869 | 1.000 | 0.000 | 0.035 | 0.035 | 0.025 | 0.023 | 0.033
주소지 | 0.000 | 0.000 | 0.000 | 1.000 | 0.043 | 0.043 | 0.056 | 0.085 | 0.160
주상병코드 | 0.016 | 0.036 | 0.035 | 0.043 | 1.000 | 1.000 | 0.292 | 0.082 | 0.190
주상병명 | 0.016 | 0.036 | 0.035 | 0.043 | 1.000 | 1.000 | 0.292 | 0.082 | 0.190
연령 | 0.012 | 0.020 | 0.025 | 0.056 | 0.292 | 0.292 | 1.000 | 0.149 | 0.058
요양기관종별 | 0.000 | 0.000 | 0.023 | 0.085 | 0.082 | 0.082 | 0.149 | 1.000 | 0.204
진료과목코드 | 0.025 | 0.022 | 0.033 | 0.160 | 0.190 | 0.190 | 0.058 | 0.204 | 1.000
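
The report does not say which association measure each matrix uses. As an illustration of why 주상병코드 and 주상병명 score exactly 1.000 against each other, the sketch below computes Cramér's V (one common measure for pairs of categorical variables, assumed here for illustration and written without bias correction) on hypothetical toy rows. When every code maps to exactly one name, V reaches its maximum of 1.

```python
from collections import Counter
import math

def cramers_v(xs, ys):
    """Cramér's V (no bias correction) for two paired categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row = Counter(xs)
    col = Counter(ys)
    chi2 = 0.0
    for x in row:
        for y in col:
            expected = row[x] * col[y] / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row), len(col)) - 1
    return math.sqrt(chi2 / (n * k))

# Hypothetical toy rows; the code-to-name mapping is one-to-one.
names = {"F41": "기타불안장애", "F32": "우울에피소드", "F84": "전반발달장애", "F20": "조현병"}
codes = ["F41", "F32", "F84", "F41", "F32", "F20"]
labels = [names[c] for c in codes]
ages = ["20세 이상", "20세 미만", "20세 미만", "20세 이상", "20세 이상", "20세 이상"]

v_code_name = cramers_v(codes, labels)  # one-to-one mapping
v_code_age = cramers_v(codes, ages)     # partial association only
print(v_code_name, v_code_age)
```

Because the two columns carry the same information, one of them can be dropped before modeling without losing anything.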

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

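Index | 진료년월 | 주소지 | 주상병코드 | 주상병명 | 연령 | 요양기관종별 | 진료과목코드 | 진료인원(명), 진료건수(건), 진료비(천원) (digits run together in the source)
87284 | 2021-01 | 강원도 | F70 | 경도정신지체 | 20세 미만 | 상급종합병원 | 정신건강의학과 이외 | 35766
69279 | 2020-10 | 부산광역시 | F23 | 급성및일과성정신병장애 | 20세 이상 | 병원 | 정신건강의학과 | 23159
96280 | 2021-03 | 광주광역시 | F39 | 상세불명의기분[정동]장애 | 20세 이상 | 정신병원 | 정신건강의학과 | 78637
32201 | 2020-02 | 전라남도 | F32 | 우울에피소드 | 20세 이상 | 보건소 | 정신건강의학과 이외 | 3421
28494 | 2020-02 | 서울특별시 | F78 | 기타정신지체 | 20세 이상 | 요양병원 | 정신건강의학과 | 9919788
73961 | 2020-10 | 제주특별자치도 | F80 | 말하기와언어의특정발달장애 | 20세 이상 | 의원 | 정신건강의학과 | 2274
69663 | 2020-10 | 대구광역시 | F32 | 우울에피소드 | 20세 이상 | 의원 | 정신건강의학과 이외 | 63987636798
55297 | 2020-07 | 울산광역시 | F29 | 상세불명의비기질성정신병 | 20세 이상 | 의원 | 정신건강의학과 | 32673569
81984 | 2020-12 | 경기도 | F90 | 운동과다장애 | 20세 이상 | 요양병원 | 정신건강의학과 | 37516884
59592 | 2020-08 | 인천광역시 | F29 | 상세불명의비기질성정신병 | 20세 미만 | 의원 | 정신건강의학과 | 2289

Index | 진료년월 | 주소지 | 주상병코드 | 주상병명 | 연령 | 요양기관종별 | 진료과목코드 | 진료인원(명), 진료건수(건), 진료비(천원) (digits run together in the source)
98874 | 2021-03 | 전라남도 | F29 | 상세불명의비기질성정신병 | 20세 이상 | 병원 | 정신건강의학과 이외 | 2216
45214 | 2020-05 | 세종특별자치시 | F20 | 조현병 | 20세 미만 | 의원 | 정신건강의학과 | 25272
54477 | 2020-07 | 인천광역시 | F32 | 우울에피소드 | 20세 이상 | 상급종합병원 | 정신건강의학과 | 57664784663
6599 | 2020-11 | 전국 | F84 | 전반발달장애 | 20세 미만 | 의원 | 정신건강의학과 이외 | 21962727451
59709 | 2020-08 | 인천광역시 | F41 | 기타불안장애 | 20세 미만 | 종합병원 | 정신건강의학과 | 26373368
95839 | 2021-03 | 인천광역시 | F23 | 급성및일과성정신병장애 | 20세 이상 | 의원 | 정신건강의학과 | 1016764
90811 | 2021-02 | 인천광역시 | F84 | 전반발달장애 | 20세 이상 | 정신병원 | 정신건강의학과 | 22232
16026 | 2022-02 | 전국 | F34 | 지속성기분[정동]장애 | 20세 이상 | 상급종합병원 | 정신건강의학과 이외 | 9911642792
87364 | 2021-01 | 강원도 | F84 | 전반발달장애 | 20세 이상 | 종합병원 | 정신건강의학과 | 20212237
67380 | 2020-09 | 전라북도 | F21 | 조현형장애 | 20세 이상 | 상급종합병원 | 정신건강의학과 | 22120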