Overview

Dataset statistics

Number of variables8
Number of observations59
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.1 KiB
Average record size in memory71.2 B

Variable types

Categorical3
Numeric5

Dataset

Description계절성질환(한랭 및 온열질환) 월별 진료비 통계 / 진료일자 기준(심사분은 각 진료년+4개월) (예) 진료년월: 2020.1월~12월, 심사년월: 2020.1월~2021.4월 / 보험자: 건강보험
URLhttps://www.data.go.kr/data/15102398/fileData.do

Alerts

주상병코드 is highly overall correlated with 상병명High correlation
상병명 is highly overall correlated with 주상병코드High correlation
환자수 is highly overall correlated with 명세서청구건수 and 3 other fieldsHigh correlation
명세서청구건수 is highly overall correlated with 환자수 and 3 other fieldsHigh correlation
입내원일수 is highly overall correlated with 환자수 and 3 other fieldsHigh correlation
보험자부담금 is highly overall correlated with 환자수 and 3 other fieldsHigh correlation
요양급여비용총액 is highly overall correlated with 환자수 and 3 other fieldsHigh correlation
보험자부담금 has unique valuesUnique
요양급여비용총액 has unique valuesUnique

Reproduction

Analysis started2023-12-12 08:59:32.857816
Analysis finished2023-12-12 08:59:35.902928
Duration3.05 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

진료년월
Categorical

Distinct7
Distinct (%)11.9%
Missing0
Missing (%)0.0%
Memory size604.0 B
2022-07
10 
2022-08
10 
2022-09
10 
2022-06
2022-12
Other values (2)
13 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022-06
2nd row2022-06
3rd row2022-06
4th row2022-06
5th row2022-06

Common Values

ValueCountFrequency (%)
2022-07 10
16.9%
2022-08 10
16.9%
2022-09 10
16.9%
2022-06 9
15.3%
2022-12 7
11.9%
2023-01 7
11.9%
2022-11 6
10.2%

Length

2023-12-12T17:59:35.978379image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:59:36.149996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022-07 10
16.9%
2022-08 10
16.9%
2022-09 10
16.9%
2022-06 9
15.3%
2022-12 7
11.9%
2023-01 7
11.9%
2022-11 6
10.2%

주상병코드
Categorical

HIGH CORRELATION 

Distinct17
Distinct (%)28.8%
Missing0
Missing (%)0.0%
Memory size604.0 B
T675
T673
T671
T670
T677
Other values (12)
39 

Length

Max length7
Median length4
Mean length4.0508475
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowT675
2nd rowT678
3rd rowT672
4th rowT670
5th rowT677

Common Values

ValueCountFrequency (%)
T675 4
 
6.8%
T673 4
 
6.8%
T671 4
 
6.8%
T670 4
 
6.8%
T677 4
 
6.8%
T672 4
 
6.8%
T674 4
 
6.8%
T676 4
 
6.8%
T678 4
 
6.8%
T679 3
 
5.1%
Other values (7) 20
33.9%

Length

2023-12-12T17:59:36.312758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
t675 4
 
6.8%
t672 4
 
6.8%
t673 4
 
6.8%
t676 4
 
6.8%
t674 4
 
6.8%
t678 4
 
6.8%
t677 4
 
6.8%
t670 4
 
6.8%
t671 4
 
6.8%
t679 3
 
5.1%
Other values (7) 20
33.9%

상병명
Categorical

HIGH CORRELATION 

Distinct17
Distinct (%)28.8%
Missing0
Missing (%)0.0%
Memory size604.0 B
상세불명의열탈진
탈수성열탈진
열실신
열사병및일사병
열성부종
Other values (12)
39 

Length

Max length13
Median length8
Mean length6.6610169
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row상세불명의열탈진
2nd row열및빛의기타영향
3rd row열경련
4th row열사병및일사병
5th row열성부종

Common Values

ValueCountFrequency (%)
상세불명의열탈진 4
 
6.8%
탈수성열탈진 4
 
6.8%
열실신 4
 
6.8%
열사병및일사병 4
 
6.8%
열성부종 4
 
6.8%
열경련 4
 
6.8%
염분상실에의한열탈진 4
 
6.8%
일과성열피로 4
 
6.8%
열및빛의기타영향 4
 
6.8%
열및빛의상세불명의영향 3
 
5.1%
Other values (7) 20
33.9%

Length

2023-12-12T17:59:36.452932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
상세불명의열탈진 4
 
6.8%
열경련 4
 
6.8%
탈수성열탈진 4
 
6.8%
일과성열피로 4
 
6.8%
염분상실에의한열탈진 4
 
6.8%
열및빛의기타영향 4
 
6.8%
열성부종 4
 
6.8%
열사병및일사병 4
 
6.8%
열실신 4
 
6.8%
열및빛의상세불명의영향 3
 
5.1%
Other values (7) 20
33.9%

환자수
Real number (ℝ)

HIGH CORRELATION 

Distinct49
Distinct (%)83.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean551.88136
Minimum1
Maximum4355
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size663.0 B
2023-12-12T17:59:36.595347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q17.5
median112
Q3417
95-th percentile3640.4
Maximum4355
Range4354
Interquartile range (IQR)409.5

Descriptive statistics

Standard deviation1060.9906
Coefficient of variation (CV)1.9224976
Kurtosis6.0461344
Mean551.88136
Median Absolute Deviation (MAD)108
Skewness2.5744034
Sum32561
Variance1125701.1
MonotonicityNot monotonic
2023-12-12T17:59:36.751258image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=49)
ValueCountFrequency (%)
2 5
 
8.5%
4 4
 
6.8%
5 3
 
5.1%
65 2
 
3.4%
249 1
 
1.7%
53 1
 
1.7%
105 1
 
1.7%
10 1
 
1.7%
40 1
 
1.7%
148 1
 
1.7%
Other values (39) 39
66.1%
ValueCountFrequency (%)
1 1
 
1.7%
2 5
8.5%
3 1
 
1.7%
4 4
6.8%
5 3
5.1%
6 1
 
1.7%
9 1
 
1.7%
10 1
 
1.7%
12 1
 
1.7%
28 1
 
1.7%
ValueCountFrequency (%)
4355 1
1.7%
4309 1
1.7%
3671 1
1.7%
3637 1
1.7%
2221 1
1.7%
2038 1
1.7%
1414 1
1.7%
1401 1
1.7%
1116 1
1.7%
1108 1
1.7%

명세서청구건수
Real number (ℝ)

HIGH CORRELATION 

Distinct50
Distinct (%)84.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean686.16949
Minimum1
Maximum5119
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size663.0 B
2023-12-12T17:59:36.909416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q19.5
median129
Q3461.5
95-th percentile4562.2
Maximum5119
Range5118
Interquartile range (IQR)452

Descriptive statistics

Standard deviation1320.985
Coefficient of variation (CV)1.9251584
Kurtosis5.0452473
Mean686.16949
Median Absolute Deviation (MAD)125
Skewness2.4386422
Sum40484
Variance1745001.4
MonotonicityNot monotonic
2023-12-12T17:59:37.076607image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2 5
 
8.5%
4 3
 
5.1%
6 2
 
3.4%
12 2
 
3.4%
42 2
 
3.4%
4558 1
 
1.7%
55 1
 
1.7%
134 1
 
1.7%
11 1
 
1.7%
172 1
 
1.7%
Other values (40) 40
67.8%
ValueCountFrequency (%)
1 1
 
1.7%
2 5
8.5%
3 1
 
1.7%
4 3
5.1%
5 1
 
1.7%
6 2
 
3.4%
7 1
 
1.7%
8 1
 
1.7%
11 1
 
1.7%
12 2
 
3.4%
ValueCountFrequency (%)
5119 1
1.7%
5066 1
1.7%
4600 1
1.7%
4558 1
1.7%
3336 1
1.7%
3120 1
1.7%
1760 1
1.7%
1724 1
1.7%
1264 1
1.7%
1200 1
1.7%

입내원일수
Real number (ℝ)

HIGH CORRELATION 

Distinct49
Distinct (%)83.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean752.28814
Minimum1
Maximum5123
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size663.0 B
2023-12-12T17:59:37.235866image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q110
median166
Q3474.5
95-th percentile4594.2
Maximum5123
Range5122
Interquartile range (IQR)464.5

Descriptive statistics

Standard deviation1368.4388
Coefficient of variation (CV)1.8190355
Kurtosis4.0818674
Mean752.28814
Median Absolute Deviation (MAD)162
Skewness2.2544743
Sum44385
Variance1872624.8
MonotonicityNot monotonic
2023-12-12T17:59:37.405227image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=49)
ValueCountFrequency (%)
2 5
 
8.5%
4 3
 
5.1%
12 2
 
3.4%
42 2
 
3.4%
219 2
 
3.4%
6 2
 
3.4%
314 1
 
1.7%
4590 1
 
1.7%
327 1
 
1.7%
13 1
 
1.7%
Other values (39) 39
66.1%
ValueCountFrequency (%)
1 1
 
1.7%
2 5
8.5%
3 1
 
1.7%
4 3
5.1%
5 1
 
1.7%
6 2
 
3.4%
7 1
 
1.7%
8 1
 
1.7%
12 2
 
3.4%
13 1
 
1.7%
ValueCountFrequency (%)
5123 1
1.7%
5070 1
1.7%
4632 1
1.7%
4590 1
1.7%
3996 1
1.7%
3375 1
1.7%
2032 1
1.7%
1815 1
1.7%
1773 1
1.7%
1538 1
1.7%

보험자부담금
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct59
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean40677702
Minimum12280
Maximum4.4305456 × 108
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size663.0 B
2023-12-12T17:59:37.604765image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum12280
5-th percentile30827
Q1336405
median8161040
Q346601190
95-th percentile1.6196499 × 108
Maximum4.4305456 × 108
Range4.4304228 × 108
Interquartile range (IQR)46264785

Descriptive statistics

Standard deviation77945535
Coefficient of variation (CV)1.9161735
Kurtosis13.667243
Mean40677702
Median Absolute Deviation (MAD)8128400
Skewness3.3909904
Sum2.3999844 × 109
Variance6.0755065 × 1015
MonotonicityNot monotonic
2023-12-12T17:59:37.820009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
32655820 1
 
1.7%
111990 1
 
1.7%
8161040 1
 
1.7%
100876360 1
 
1.7%
12280 1
 
1.7%
3516540 1
 
1.7%
586420 1
 
1.7%
3100590 1
 
1.7%
18512150 1
 
1.7%
4925310 1
 
1.7%
Other values (49) 49
83.1%
ValueCountFrequency (%)
12280 1
1.7%
19130 1
1.7%
25760 1
1.7%
31390 1
1.7%
32640 1
1.7%
51280 1
1.7%
83690 1
1.7%
89000 1
1.7%
89260 1
1.7%
94760 1
1.7%
ValueCountFrequency (%)
443054560 1
1.7%
310141450 1
1.7%
198624890 1
1.7%
157891670 1
1.7%
121691090 1
1.7%
118490170 1
1.7%
100876360 1
1.7%
95644380 1
1.7%
89108690 1
1.7%
85089040 1
1.7%

요양급여비용총액
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct59
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean54681650
Minimum24430
Maximum5.7024352 × 108
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size663.0 B
2023-12-12T17:59:38.008392image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum24430
5-th percentile45581
Q1516870
median10149460
Q363508625
95-th percentile2.2291763 × 108
Maximum5.7024352 × 108
Range5.7021909 × 108
Interquartile range (IQR)62991755

Descriptive statistics

Standard deviation1.0194015 × 108
Coefficient of variation (CV)1.8642478
Kurtosis12.541609
Mean54681650
Median Absolute Deviation (MAD)10090220
Skewness3.2460047
Sum3.2262174 × 109
Variance1.0391794 × 1016
MonotonicityNot monotonic
2023-12-12T17:59:38.188397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
46260090 1
 
1.7%
182190 1
 
1.7%
10149460 1
 
1.7%
130984890 1
 
1.7%
45680 1
 
1.7%
4884360 1
 
1.7%
729420 1
 
1.7%
4140490 1
 
1.7%
24637880 1
 
1.7%
7069610 1
 
1.7%
Other values (49) 49
83.1%
ValueCountFrequency (%)
24430 1
1.7%
36660 1
1.7%
44690 1
1.7%
45680 1
1.7%
59240 1
1.7%
68980 1
1.7%
116600 1
1.7%
118090 1
1.7%
127160 1
1.7%
127760 1
1.7%
ValueCountFrequency (%)
570243520 1
1.7%
401971060 1
1.7%
270416680 1
1.7%
217639960 1
1.7%
165811790 1
1.7%
155490980 1
1.7%
134688580 1
1.7%
130984890 1
1.7%
119282700 1
1.7%
112730850 1
1.7%

Interactions

2023-12-12T17:59:34.857947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:59:33.263621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:59:33.702381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:59:34.084021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:59:34.484584image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:59:34.954807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:59:33.352627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:59:33.783845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:59:34.160850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:59:34.558137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:59:35.033642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:59:33.432873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:59:33.855828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:59:34.241623image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:59:34.637554image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:59:35.113999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:59:33.516695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:59:33.928410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:59:34.327981image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:59:34.718011image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:59:35.195592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:59:33.619463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:59:34.008643image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:59:34.405966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:59:34.783480image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T17:59:38.323512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
진료년월주상병코드상병명환자수명세서청구건수입내원일수보험자부담금요양급여비용총액
진료년월1.0000.0000.0000.4540.6040.3950.1960.186
주상병코드0.0001.0001.0000.5830.6650.5910.5070.507
상병명0.0001.0001.0000.5830.6650.5910.5070.507
환자수0.4540.5830.5831.0000.9720.9900.8530.853
명세서청구건수0.6040.6650.6650.9721.0000.9170.9130.913
입내원일수0.3950.5910.5910.9900.9171.0000.8170.817
보험자부담금0.1960.5070.5070.8530.9130.8171.0001.000
요양급여비용총액0.1860.5070.5070.8530.9130.8171.0001.000
2023-12-12T17:59:38.476151image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
주상병코드진료년월상병명
주상병코드1.0000.0001.000
진료년월0.0001.0000.000
상병명1.0000.0001.000
2023-12-12T17:59:38.613918image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
환자수명세서청구건수입내원일수보험자부담금요양급여비용총액진료년월주상병코드상병명
환자수1.0000.9970.9910.8560.8570.2570.2530.253
명세서청구건수0.9971.0000.9940.8610.8620.2440.3310.331
입내원일수0.9910.9941.0000.8940.8940.2170.2580.258
보험자부담금0.8560.8610.8941.0000.9990.0720.2320.232
요양급여비용총액0.8570.8620.8940.9991.0000.0720.2320.232
진료년월0.2570.2440.2170.0720.0721.0000.0000.000
주상병코드0.2530.3310.2580.2320.2320.0001.0001.000
상병명0.2530.3310.2580.2320.2320.0001.0001.000

Missing values

2023-12-12T17:59:35.705465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:59:35.851249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

진료년월주상병코드상병명환자수명세서청구건수입내원일수보험자부담금요양급여비용총액
02022-06T675상세불명의열탈진2492603143265582046260090
12022-06T678열및빛의기타영향222111990182190
22022-06T672열경련44456367700808960960
32022-06T670열사병및일사병22724542785089040112730850
42022-06T677열성부종91212223530304530
52022-06T671열실신303135872664012319660
62022-06T674염분상실에의한열탈진57646824444403381510
72022-06T676일과성열피로19421821942423205891450
82022-06T673탈수성열탈진832109911032127732029126630
92022-07T675상세불명의열탈진110812001538157891670217639960
진료년월주상병코드상병명환자수명세서청구건수입내원일수보험자부담금요양급여비용총액
492023-01T690침수병및침족병283636519890732390
502022-11T33-T35동상2374334411174900017174320
512022-11T68저체온증65831172972628040597310
522022-11T69저하된온도의기타영향28234934949758307138730
532022-12T33-T35동상222131203375121691090165811790
542022-12T68저체온증122141230118490170155490980
552022-12T69저하된온도의기타영향4355511951236985174097881670
562023-01T33-T35동상203833363996198624890270416680
572023-01T68저체온증11212930089108690119282700
582023-01T69저하된온도의기타영향3671460046326926996097004840