Overview

Dataset statistics

Number of variables: 7
Number of observations: 385
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 23.4 KiB
Average record size in memory: 62.3 B

Variable types

Categorical: 1
Text: 1
Numeric: 5

Dataset

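Description: Medical expense statistics by rare disease, based on date of treatment (reviewed claims cover each treatment year plus 4 months). Example: treatment months January to December 2020, review months January 2020 to April 2021. Insurer: National Health Insurance. Care institution type: pharmacies excluded. Note: compiled for rare diseases that have a diagnosis code; see the usage notes for the calculation criteria.
URL: https://www.data.go.kr/data/15072785/fileData.do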

Alerts

진료년도 has constant value "2022" (Constant)
환자수 is highly overall correlated with 명세서 청구건수 and 3 other fields (High correlation)
명세서 청구건수 is highly overall correlated with 환자수 and 3 other fields (High correlation)
입내원일수 is highly overall correlated with 환자수 and 3 other fields (High correlation)
보험자부담금 is highly overall correlated with 환자수 and 3 other fields (High correlation)
요양급여비용총액 is highly overall correlated with 환자수 and 3 other fields (High correlation)
희귀질환 상병코드 has unique values (Unique)
보험자부담금 has unique values (Unique)
요양급여비용총액 has unique values (Unique)
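
The Constant and Unique alerts come down to counting distinct values per column. The following is a minimal sketch of that idea in plain Python, using the first five rows listed in this report's Sample sections. The function name `classify_column` is hypothetical, and this is not a description of how ydata-profiling implements its alerts.

```python
def classify_column(values):
    """Label a column by its distinct-value count (illustrative helper only)."""
    distinct = len(set(values))
    if distinct == 1:
        return "constant"   # every row holds the same value
    if distinct == len(values):
        return "unique"     # no value repeats
    return "mixed"

# First five rows as shown in the report's Sample sections.
year = ["2022"] * 5
code = ["D66", "D67", "D70", "D71", "G10"]

labels = {
    "진료년도": classify_column(year),
    "희귀질환 상병코드": classify_column(code),
}
print(labels)
```

A constant column such as 진료년도 carries no information for modelling, while a fully unique text column such as 희귀질환 상병코드 behaves like a row identifier.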

Reproduction

Analysis started: 2023-12-12 07:23:02.979994
Analysis finished: 2023-12-12 07:23:06.526991
Duration: 3.55 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
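
The reported duration can be checked from the two timestamps. This is a small sketch using only the standard library; the format string `FMT` is an assumption about how the timestamps are written, inferred from the values shown here.

```python
from datetime import datetime

# Assumed timestamp layout, inferred from the two values in the report.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-12-12 07:23:02.979994", FMT)
finished = datetime.strptime("2023-12-12 07:23:06.526991", FMT)

# Elapsed wall-clock time in seconds.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```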

Variables

진료년도 (treatment year)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 3.1 KiB

Value | Count
2022 | 385

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2022
2nd row: 2022
3rd row: 2022
4th row: 2022
5th row: 2022

Common Values

Value | Count | Frequency (%)
2022 | 385 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot; the single value 2022 covers all 385 rows (100.0%)]

희귀질환 상병코드 (rare disease diagnosis code)
Text

UNIQUE

Distinct: 385
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 3.1 KiB

Length

Max length: 4
Median length: 4
Mean length: 3.9844156
Min length: 3
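
Because every code is either 3 or 4 characters long (the reported minimum and maximum), the mean length also fixes how many codes of each length exist. A short worked check, using the row count and the "Total characters" figure reported for this variable; the variable names are illustrative only.

```python
n_codes = 385        # number of observations
total_chars = 1534   # "Total characters" reported for this variable

mean_length = total_chars / n_codes

# With lengths limited to 3 or 4, each 3-character code is one character
# short of the 4-character maximum, so the shortfall counts those codes.
three_char_codes = 4 * n_codes - total_chars
four_char_codes = n_codes - three_char_codes

print(round(mean_length, 7), three_char_codes, four_char_codes)
```

Under these figures only a handful of codes are three characters long, which is why the median length is 4.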

Characters and Unicode

Total characters: 1534
Distinct characters: 24
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 1
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
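
As a minimal sketch of what such character properties look like, the standard-library `unicodedata` module can report the general category of each character. The five codes below are the sample rows shown for this variable; the tally itself is only an illustration, not the report's own computation.

```python
import unicodedata
from collections import Counter

# Sample rows listed for this variable in the report.
codes = ["D66", "D67", "D70", "D71", "G10"]

# "Lu" is Uppercase Letter and "Nd" is Decimal Number in Unicode's
# general-category abbreviations.
category_counts = Counter(
    unicodedata.category(ch) for code in codes for ch in code
)
print(category_counts)
```

Each sample code has one leading letter followed by digits, which is consistent with the report listing exactly two categories, Uppercase Letter and Decimal Number.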

Unique

Unique: 385
Unique (%): 100.0%

Sample

1st row: D66
2nd row: D67
3rd row: D70
4th row: D71
5th row: G10

Value | Count | Frequency (%)
d66 | 1 | 0.3%
k508 | 1 | 0.3%
q131 | 1 | 0.3%
q112 | 1 | 0.3%
q070 | 1 | 0.3%
q062 | 1 | 0.3%
q059 | 1 | 0.3%
q058 | 1 | 0.3%
q057 | 1 | 0.3%
q056 | 1 | 0.3%
Other values (375) | 375 | 97.4%

Most occurring characters

Value | Count | Frequency (%)
1 | 164 | 10.7%
0 | 155 | 10.1%
2 | 153 | 10.0%
Q | 139 | 9.1%
8 | 138 | 9.0%
7 | 119 | 7.8%
3 | 105 | 6.8%
4 | 101 | 6.6%
5 | 85 | 5.5%
6 | 72 | 4.7%
Other values (14) | 303 | 19.8%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 1149 | 74.9%
Uppercase Letter | 385 | 25.1%

Most frequent character per category

Uppercase Letter
Value | Count | Frequency (%)
Q | 139 | 36.1%
D | 59 | 15.3%
E | 58 | 15.1%
G | 47 | 12.2%
M | 34 | 8.8%
I | 11 | 2.9%
N | 9 | 2.3%
H | 8 | 2.1%
K | 6 | 1.6%
L | 5 | 1.3%
Other values (4) | 9 | 2.3%

Decimal Number
Value | Count | Frequency (%)
1 | 164 | 14.3%
0 | 155 | 13.5%
2 | 153 | 13.3%
8 | 138 | 12.0%
7 | 119 | 10.4%
3 | 105 | 9.1%
4 | 101 | 8.8%
5 | 85 | 7.4%
6 | 72 | 6.3%
9 | 57 | 5.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 1149 | 74.9%
Latin | 385 | 25.1%

Most frequent character per script

Latin
Value | Count | Frequency (%)
Q | 139 | 36.1%
D | 59 | 15.3%
E | 58 | 15.1%
G | 47 | 12.2%
M | 34 | 8.8%
I | 11 | 2.9%
N | 9 | 2.3%
H | 8 | 2.1%
K | 6 | 1.6%
L | 5 | 1.3%
Other values (4) | 9 | 2.3%

Common
Value | Count | Frequency (%)
1 | 164 | 14.3%
0 | 155 | 13.5%
2 | 153 | 13.3%
8 | 138 | 12.0%
7 | 119 | 10.4%
3 | 105 | 9.1%
4 | 101 | 8.8%
5 | 85 | 7.4%
6 | 72 | 6.3%
9 | 57 | 5.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 1534 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
1 | 164 | 10.7%
0 | 155 | 10.1%
2 | 153 | 10.0%
Q | 139 | 9.1%
8 | 138 | 9.0%
7 | 119 | 7.8%
3 | 105 | 6.8%
4 | 101 | 6.6%
5 | 85 | 5.5%
6 | 72 | 4.7%
Other values (14) | 303 | 19.8%

환자수 (number of patients)
Real number (ℝ)

HIGH CORRELATION

Distinct: 296
Distinct (%): 76.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2616.2364
Minimum: 1
Maximum: 296218
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 3.5 KiB

Quantile statistics

Minimum: 1
5-th percentile: 4
Q1: 44
median: 223
Q3: 1110
95-th percentile: 8860
Maximum: 296218
Range: 296217
Interquartile range (IQR): 1066

Descriptive statistics

Standard deviation: 16059.268
Coefficient of variation (CV): 6.1383095
Kurtosis: 293.46612
Mean: 2616.2364
Median Absolute Deviation (MAD): 206
Skewness: 16.288751
Sum: 1007251
Variance: 2.579001 × 10^8
Monotonicity: Not monotonic
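
Several of these figures are tied together by simple identities: mean = sum / n, CV = standard deviation / mean, variance = standard deviation squared, IQR = Q3 - Q1, and range = maximum - minimum. The sketch below recomputes them from the values reported for 환자수; the inputs are rounded as printed in the report, so the results agree only up to that rounding.

```python
n = 385                        # number of observations
total = 1007251                # Sum
std = 16059.268                # Standard deviation (as printed)
q1, q3 = 44, 1110              # quartiles
minimum, maximum = 1, 296218

mean = total / n               # arithmetic mean
cv = std / mean                # coefficient of variation
variance = std ** 2            # variance is the squared standard deviation
iqr = q3 - q1                  # interquartile range
value_range = maximum - minimum

print(mean, cv, variance, iqr, value_range)
```

A CV above 6 together with a median of 223 against a mean of about 2616 points to a strongly right-skewed distribution, in which a few diseases account for most patients.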
[Figure: Histogram with fixed size bins (bins=50)]

Common Values

Value | Count | Frequency (%)
5 | 8 | 2.1%
4 | 7 | 1.8%
3 | 7 | 1.8%
1 | 5 | 1.3%
27 | 5 | 1.3%
19 | 4 | 1.0%
28 | 4 | 1.0%
16 | 4 | 1.0%
34 | 3 | 0.8%
44 | 3 | 0.8%
Other values (286) | 335 | 87.0%

Minimum 10 values

Value | Count | Frequency (%)
1 | 5 | 1.3%
2 | 3 | 0.8%
3 | 7 | 1.8%
4 | 7 | 1.8%
5 | 8 | 2.1%
6 | 2 | 0.5%
7 | 2 | 0.5%
8 | 2 | 0.5%
9 | 2 | 0.5%
10 | 1 | 0.3%

Maximum 10 values

Value | Count | Frequency (%)
296218 | 1 | 0.3%
67113 | 1 | 0.3%
50339 | 1 | 0.3%
28126 | 1 | 0.3%
27536 | 1 | 0.3%
26429 | 1 | 0.3%
24005 | 1 | 0.3%
23786 | 1 | 0.3%
20858 | 1 | 0.3%
19272 | 1 | 0.3%

명세서 청구건수 (number of claim statements)
Real number (ℝ)

HIGH CORRELATION

Distinct: 363
Distinct (%): 94.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 9827.0805
Minimum: 1
Maximum: 486487
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 3.5 KiB

Quantile statistics

Minimum: 1
5-th percentile: 27.2
Q1: 276
median: 1145
Q3: 6825
95-th percentile: 36854
Maximum: 486487
Range: 486486
Interquartile range (IQR): 6549

Descriptive statistics

Standard deviation: 33112.132
Coefficient of variation (CV): 3.369478
Kurtosis: 116.54921
Mean: 9827.0805
Median Absolute Deviation (MAD): 1067
Skewness: 9.2622323
Sum: 3783426
Variance: 1.0964133 × 10^9
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=50)]

Common Values

Value | Count | Frequency (%)
3 | 5 | 1.3%
598 | 3 | 0.8%
64 | 2 | 0.5%
7 | 2 | 0.5%
8359 | 2 | 0.5%
165 | 2 | 0.5%
73 | 2 | 0.5%
61 | 2 | 0.5%
212 | 2 | 0.5%
569 | 2 | 0.5%
Other values (353) | 361 | 93.8%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 0.3%
3 | 5 | 1.3%
5 | 1 | 0.3%
6 | 1 | 0.3%
7 | 2 | 0.5%
9 | 1 | 0.3%
11 | 1 | 0.3%
12 | 1 | 0.3%
13 | 1 | 0.3%
15 | 1 | 0.3%

Maximum 10 values

Value | Count | Frequency (%)
486487 | 1 | 0.3%
171316 | 1 | 0.3%
170226 | 1 | 0.3%
158541 | 1 | 0.3%
137534 | 1 | 0.3%
120137 | 1 | 0.3%
117334 | 1 | 0.3%
108627 | 1 | 0.3%
107529 | 1 | 0.3%
103705 | 1 | 0.3%

입내원일수 (inpatient and outpatient visit days)
Real number (ℝ)

HIGH CORRELATION

Distinct: 361
Distinct (%): 93.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 13614.036
Minimum: 1
Maximum: 520419
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 3.5 KiB

Quantile statistics

Minimum: 1
5-th percentile: 28.4
Q1: 375
median: 1467
Q3: 8912
95-th percentile: 56037.8
Maximum: 520419
Range: 520418
Interquartile range (IQR): 8537

Descriptive statistics

Standard deviation: 40334.212
Coefficient of variation (CV): 2.9626931
Kurtosis: 70.44948
Mean: 13614.036
Median Absolute Deviation (MAD): 1386
Skewness: 7.05201
Sum: 5241404
Variance: 1.6268486 × 10^9
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=50)]

Common Values

Value | Count | Frequency (%)
28 | 3 | 0.8%
3 | 3 | 0.8%
329 | 2 | 0.5%
238 | 2 | 0.5%
24 | 2 | 0.5%
367 | 2 | 0.5%
326 | 2 | 0.5%
5 | 2 | 0.5%
891 | 2 | 0.5%
1384 | 2 | 0.5%
Other values (351) | 363 | 94.3%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 0.3%
3 | 3 | 0.8%
5 | 2 | 0.5%
6 | 1 | 0.3%
7 | 1 | 0.3%
8 | 1 | 0.3%
9 | 1 | 0.3%
11 | 1 | 0.3%
12 | 1 | 0.3%
13 | 1 | 0.3%

Maximum 10 values

Value | Count | Frequency (%)
520419 | 1 | 0.3%
210318 | 1 | 0.3%
200894 | 1 | 0.3%
193985 | 1 | 0.3%
188185 | 1 | 0.3%
174838 | 1 | 0.3%
165322 | 1 | 0.3%
162054 | 1 | 0.3%
144177 | 1 | 0.3%
125987 | 1 | 0.3%

보험자부담금 (amount paid by the insurer)
Real number (ℝ)

HIGH CORRELATION, UNIQUE

Distinct: 385
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3.5849669 × 10^9
Minimum: 28800
Maximum: 1.4 × 10^11
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 3.5 KiB

Quantile statistics

Minimum: 28800
5-th percentile: 2464760
Q1: 57812410
median: 2.8629527 × 10^8
Q3: 1.8402158 × 10^9
95-th percentile: 1.6789716 × 10^10
Maximum: 1.4 × 10^11
Range: 1.3999997 × 10^11
Interquartile range (IQR): 1.7824034 × 10^9

Descriptive statistics

Standard deviation: 1.1201186 × 10^10
Coefficient of variation (CV): 3.1244879
Kurtosis: 65.178703
Mean: 3.5849669 × 10^9
Median Absolute Deviation (MAD): 2.7930636 × 10^8
Skewness: 6.8863507
Sum: 1.3802123 × 10^12
Variance: 1.2546656 × 10^20
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=50)]

Common Values

Value | Count | Frequency (%)
140000000000 | 1 | 0.3%
80166300 | 1 | 0.3%
37039940 | 1 | 0.3%
58445310 | 1 | 0.3%
1294365360 | 1 | 0.3%
26216080 | 1 | 0.3%
2324538440 | 1 | 0.3%
8988640 | 1 | 0.3%
24307240 | 1 | 0.3%
19951740 | 1 | 0.3%
Other values (375) | 375 | 97.4%

Minimum 10 values

Value | Count | Frequency (%)
28800 | 1 | 0.3%
47610 | 1 | 0.3%
143530 | 1 | 0.3%
144430 | 1 | 0.3%
203700 | 1 | 0.3%
282030 | 1 | 0.3%
297620 | 1 | 0.3%
726400 | 1 | 0.3%
798690 | 1 | 0.3%
887620 | 1 | 0.3%

Maximum 10 values

Value | Count | Frequency (%)
140000000000 | 1 | 0.3%
69596792960 | 1 | 0.3%
63168459700 | 1 | 0.3%
55683333390 | 1 | 0.3%
54740079160 | 1 | 0.3%
51707865120 | 1 | 0.3%
43034408700 | 1 | 0.3%
37513997470 | 1 | 0.3%
34936797640 | 1 | 0.3%
34067900720 | 1 | 0.3%

요양급여비용총액 (total medical care benefit cost)
Real number (ℝ)

HIGH CORRELATION, UNIQUE

Distinct: 385
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.1970792 × 10^9
Minimum: 32000
Maximum: 1.55 × 10^11
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 3.5 KiB

Quantile statistics

Minimum: 32000
5-th percentile: 2693698
Q1: 70171920
median: 3.5380882 × 10^8
Q3: 2.0462736 × 10^9
95-th percentile: 1.8637632 × 10^10
Maximum: 1.55 × 10^11
Range: 1.5499997 × 10^11
Interquartile range (IQR): 1.9761017 × 10^9

Descriptive statistics

Standard deviation: 1.3129101 × 10^10
Coefficient of variation (CV): 3.1281519
Kurtosis: 57.483725
Mean: 4.1970792 × 10^9
Median Absolute Deviation (MAD): 3.4581828 × 10^8
Skewness: 6.5981895
Sum: 1.6158755 × 10^12
Variance: 1.7237329 × 10^20
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=50)]

Common Values

Value | Count | Frequency (%)
155000000000 | 1 | 0.3%
91323210 | 1 | 0.3%
56268860 | 1 | 0.3%
80341260 | 1 | 0.3%
1469856820 | 1 | 0.3%
30011120 | 1 | 0.3%
2536624240 | 1 | 0.3%
10876690 | 1 | 0.3%
29256320 | 1 | 0.3%
24774160 | 1 | 0.3%
Other values (375) | 375 | 97.4%

Minimum 10 values

Value | Count | Frequency (%)
32000 | 1 | 0.3%
67810 | 1 | 0.3%
192930 | 1 | 0.3%
212730 | 1 | 0.3%
226300 | 1 | 0.3%
449530 | 1 | 0.3%
635020 | 1 | 0.3%
806000 | 1 | 0.3%
1019490 | 1 | 0.3%
1033880 | 1 | 0.3%

Maximum 10 values

Value | Count | Frequency (%)
155000000000 | 1 | 0.3%
101021000000 | 1 | 0.3%
77526319750 | 1 | 0.3%
61810186700 | 1 | 0.3%
60505539180 | 1 | 0.3%
57404102200 | 1 | 0.3%
49493906950 | 1 | 0.3%
47477151610 | 1 | 0.3%
40018074830 | 1 | 0.3%
39595512510 | 1 | 0.3%

Interactions

[Figures: 25 pairwise interaction plots, one for each pairing of the five numeric variables]

Correlations

Variable | 환자수 | 명세서 청구건수 | 입내원일수 | 보험자부담금 | 요양급여비용총액
환자수 | 1.000 | 0.756 | 0.804 | 0.559 | 0.701
명세서 청구건수 | 0.756 | 1.000 | 0.859 | 0.616 | 0.761
입내원일수 | 0.804 | 0.859 | 1.000 | 0.834 | 0.737
보험자부담금 | 0.559 | 0.616 | 0.834 | 1.000 | 0.961
요양급여비용총액 | 0.701 | 0.761 | 0.737 | 0.961 | 1.000

Variable | 환자수 | 명세서 청구건수 | 입내원일수 | 보험자부담금 | 요양급여비용총액
환자수 | 1.000 | 0.954 | 0.943 | 0.836 | 0.851
명세서 청구건수 | 0.954 | 1.000 | 0.989 | 0.898 | 0.909
입내원일수 | 0.943 | 0.989 | 1.000 | 0.922 | 0.931
보험자부담금 | 0.836 | 0.898 | 0.922 | 1.000 | 0.999
요양급여비용총액 | 0.851 | 0.909 | 0.931 | 0.999 | 1.000
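
The two matrices give noticeably different values for the same variable pairs, and the source does not say which coefficient each one holds. As an illustration only, the sketch below shows one common reason two correlation measures disagree on skewed data: a linear coefficient (Pearson) against a rank-based one (Spearman). Treating the report's matrices as these two measures is an assumption, the helper functions are hypothetical, and the toy numbers are made up rather than taken from the dataset.

```python
import math

def pearson(xs, ys):
    """Pearson correlation: covariance scaled by both standard deviations."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

def ranks(values):
    """Ranks 1..n in ascending order; assumes no tied values."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, idx in enumerate(order, start=1):
        result[idx] = rank
    return result

def spearman(xs, ys):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(ranks(xs), ranks(ys))

# Hypothetical toy data: monotone, but with one extreme value.
x = [1, 2, 3, 4, 5, 6]
y = [1, 4, 9, 16, 25, 10000]

print(round(pearson(x, y), 3), round(spearman(x, y), 3))
```

On the toy data the rank-based value is essentially 1 while the linear value is much lower, because a single extreme point dominates the linear fit. The numeric variables in this report are heavily right-skewed, so a similar effect could explain why the second matrix is higher throughout, but that remains an inference.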

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

진료년도 | 희귀질환 상병코드 | 환자수 | 명세서 청구건수 | 입내원일수 | 보험자부담금 | 요양급여비용총액
02022D6617183049231616140000000000155000000000
12022D67395597762005474007916060505539180
22022D701691436070569661369628794015919579930
32022D711079731786528293110591926350
42022G1034718741312311114070301335467120
52022G35270020548347691394179823015703625810
62022A3195869238772767418402158003089941250
72022A8101251044836513211042701535268340
82022D12629621848648752041963168459700101021000000
92022D55035778868563107990540

진료년도 | 희귀질환 상병코드 | 환자수 | 명세서 청구건수 | 입내원일수 | 보험자부담금 | 요양급여비용총액
3752022Q911814114182687609134460
3762022Q91241521684384113046772990
3772022Q9145595931048403225340
3782022Q915336369333801033880
3792022Q916133203700226300
3802022Q91717251355124900190126575990
3812022Q922361452326178022064749370
3822022Q92310012951321105057230118008990
3832022Q928193363361726276020630760
3842022Q932322253674276493047552990