Overview

Dataset statistics

Number of variables5
Number of observations211
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.8 KiB
Average record size in memory42.6 B

Variable types

Categorical4
Numeric1

Dataset

Descriptionㅇ 상세 내용- 진료년도 : 2020, 2021, 2022년- 보험자 : 건강보험- 성별구분 : 남자, 여자- 진단코드 : 그룹1 - I60, I61, I62, I63, I65, I66그룹2 - I20, I21, I22, I23, I25.2- 수가코드(건강보험요양급여비용에서 정한 수가코드) : M6636, M6637, M6638, M6639ㅇ 참조- 진료일기준(한의분류 제외, 약국 제외), 연령(연말기준)- 건강보험 급여실적(의료급여 제외)으로, 비급여는 제외(2023년 6월 심사분까지 반영)- 연간 동일 그룹 내의 중복 환자는 1건으로 count- 요양기관에서 환자진료중 진단명이 확정되지 않은 상태에서의 호소, 증세 등에 따라일차진단명을 부여하고 청구한 내역중 주진단명 기준으로 발췌한 것이므로 최종확정된 질병과는 다를수 있음
Author국민건강보험공단
URLhttps://www.data.go.kr/data/15125096/fileData.do

Reproduction

Analysis started2023-12-12 22:34:24.526788
Analysis finished2023-12-12 22:34:25.008741
Duration0.48 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

진료연도
Categorical

Distinct3
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
2021년
72 
2020년
70 
2022년
69 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020년
2nd row2020년
3rd row2020년
4th row2020년
5th row2021년

Common Values

ValueCountFrequency (%)
2021년 72
34.1%
2020년 70
33.2%
2022년 69
32.7%

Length

2023-12-13T07:34:25.078013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:34:25.192704image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021년 72
34.1%
2020년 70
33.2%
2022년 69
32.7%
Distinct2
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
1
115 
2
96 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row2
4th row2
5th row1

Common Values

ValueCountFrequency (%)
1 115
54.5%
2 96
45.5%

Length

2023-12-13T07:34:25.307184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:34:25.396853image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 115
54.5%
2 96
45.5%

성별
Categorical

Distinct2
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
남자
108 
여자
103 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row남자
2nd row여자
3rd row남자
4th row여자
5th row남자

Common Values

ValueCountFrequency (%)
남자 108
51.2%
여자 103
48.8%

Length

2023-12-13T07:34:25.506419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:34:25.615889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
남자 108
51.2%
여자 103
48.8%

연령
Categorical

Distinct21
Distinct (%)10.0%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
전체
 
12
65-69세
 
12
35-39세
 
12
40-44세
 
12
45-49세
 
12
Other values (16)
151 

Length

Max length6
Median length6
Mean length5.7630332
Min length2

Unique

Unique2 ?
Unique (%)0.9%

Sample

1st row전체
2nd row전체
3rd row전체
4th row전체
5th row전체

Common Values

ValueCountFrequency (%)
전체 12
 
5.7%
65-69세 12
 
5.7%
35-39세 12
 
5.7%
40-44세 12
 
5.7%
45-49세 12
 
5.7%
55-59세 12
 
5.7%
60-64세 12
 
5.7%
50-54세 12
 
5.7%
70-74세 12
 
5.7%
75-79세 12
 
5.7%
Other values (11) 91
43.1%

Length

2023-12-13T07:34:25.745969image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
전체 12
 
5.7%
70-74세 12
 
5.7%
95-99세 12
 
5.7%
90-94세 12
 
5.7%
85-89세 12
 
5.7%
65-69세 12
 
5.7%
75-79세 12
 
5.7%
80-84세 12
 
5.7%
50-54세 12
 
5.7%
60-64세 12
 
5.7%
Other values (11) 91
43.1%

진료인원(명)
Real number (ℝ)

Distinct145
Distinct (%)68.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean290.59716
Minimum1
Maximum4348
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.0 KiB
2023-12-13T07:34:25.896857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1.5
Q114.5
median90
Q3325
95-th percentile841.5
Maximum4348
Range4347
Interquartile range (IQR)310.5

Descriptive statistics

Standard deviation653.46193
Coefficient of variation (CV)2.2486866
Kurtosis21.229337
Mean290.59716
Median Absolute Deviation (MAD)85
Skewness4.4378828
Sum61316
Variance427012.49
MonotonicityNot monotonic
2023-12-13T07:34:26.022808image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 11
 
5.2%
2 8
 
3.8%
7 6
 
2.8%
3 6
 
2.8%
5 5
 
2.4%
8 4
 
1.9%
149 4
 
1.9%
22 3
 
1.4%
21 3
 
1.4%
9 3
 
1.4%
Other values (135) 158
74.9%
ValueCountFrequency (%)
1 11
5.2%
2 8
3.8%
3 6
2.8%
4 2
 
0.9%
5 5
2.4%
6 2
 
0.9%
7 6
2.8%
8 4
 
1.9%
9 3
 
1.4%
10 3
 
1.4%
ValueCountFrequency (%)
4348 1
0.5%
4224 1
0.5%
4033 1
0.5%
3072 1
0.5%
2996 1
0.5%
2840 1
0.5%
2349 1
0.5%
2175 1
0.5%
2041 1
0.5%
897 1
0.5%

Interactions

2023-12-13T07:34:24.746046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T07:34:26.100747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
진료연도상병코드그룹성별연령진료인원(명)
진료연도1.0000.0000.0000.0000.000
상병코드그룹0.0001.0000.0000.0000.214
성별0.0000.0001.0000.0000.353
연령0.0000.0000.0001.0000.544
진료인원(명)0.0000.2140.3530.5441.000
2023-12-13T07:34:26.223982image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
진료연도성별연령상병코드그룹
진료연도1.0000.0000.0000.000
성별0.0001.0000.0000.000
연령0.0000.0001.0000.000
상병코드그룹0.0000.0000.0001.000
2023-12-13T07:34:26.340764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
진료인원(명)진료연도상병코드그룹성별연령
진료인원(명)1.0000.0000.1580.2610.250
진료연도0.0001.0000.0000.0000.000
상병코드그룹0.1580.0001.0000.0000.000
성별0.2610.0000.0001.0000.000
연령0.2500.0000.0000.0001.000

Missing values

2023-12-13T07:34:24.859847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T07:34:24.968302image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

진료연도상병코드그룹성별연령진료인원(명)
02020년1남자전체2840
12020년1여자전체2041
22020년2남자전체4033
32020년2여자전체790
42021년1남자전체2996
52021년1여자전체2175
62021년2남자전체4224
72021년2여자전체897
82022년1남자전체3072
92022년1여자전체2349
진료연도상병코드그룹성별연령진료인원(명)
2012022년2여자50-54세30
2022022년2여자55-59세37
2032022년2여자60-64세76
2042022년2여자65-69세107
2052022년2여자70-74세129
2062022년2여자75-79세167
2072022년2여자80-84세157
2082022년2여자85-89세106
2092022년2여자90-94세40
2102022년2여자95-99세6