Overview

Dataset statistics

Number of variables8
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory761.7 KiB
Average record size in memory78.0 B

Variable types

Categorical2
Text1
Numeric5

Dataset

Description상병별 의료기관종별 진료비 통계 / 진료일자 기준(심사분은 각 진료년+4개월) (예) 진료년월: 2022.1월~12월, 심사년월: 2022.1월~2023.4월 / 보험자: 건강보험 / 요양기관 종별: 약국 제외 / 한방상병 제외
URLhttps://www.data.go.kr/data/15072874/fileData.do

Alerts

진료년도 has constant value ""Constant
환자수 is highly overall correlated with 명세서청구건수 and 3 other fieldsHigh correlation
명세서청구건수 is highly overall correlated with 환자수 and 3 other fieldsHigh correlation
입내원일수 is highly overall correlated with 환자수 and 3 other fieldsHigh correlation
요양급여비용총액 is highly overall correlated with 환자수 and 3 other fieldsHigh correlation
보험자부담금 is highly overall correlated with 환자수 and 3 other fieldsHigh correlation
환자수 is highly skewed (γ1 = 30.32486203)Skewed
명세서청구건수 is highly skewed (γ1 = 38.88151125)Skewed
입내원일수 is highly skewed (γ1 = 35.30682569)Skewed
요양급여비용총액 is highly skewed (γ1 = 21.17987743)Skewed
보험자부담금 is highly skewed (γ1 = 20.49601808)Skewed

Reproduction

Analysis started2023-12-12 12:15:25.741923
Analysis finished2023-12-12 12:15:30.241561
Duration4.5 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

진료년도
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2022
10000 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022
2nd row2022
3rd row2022
4th row2022
5th row2022

Common Values

ValueCountFrequency (%)
2022 10000
100.0%

Length

2023-12-12T21:15:30.320539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:15:30.414593image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022 10000
100.0%
Distinct1671
Distinct (%)16.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T21:15:30.893136image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters30000
Distinct characters32
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)0.3%

Sample

1st rowR32
2nd rowA59
3rd rowF72
4th rowM00
5th rowK21
ValueCountFrequency (%)
k04 13
 
0.1%
r09 12
 
0.1%
k30 12
 
0.1%
b07 12
 
0.1%
s03 12
 
0.1%
k06 12
 
0.1%
i15 12
 
0.1%
m79 12
 
0.1%
s09 12
 
0.1%
k51 12
 
0.1%
Other values (1661) 9879
98.8%
2023-12-12T21:15:31.557551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 2500
 
8.3%
1 2264
 
7.5%
2 2169
 
7.2%
3 2069
 
6.9%
4 2009
 
6.7%
5 1948
 
6.5%
6 1920
 
6.4%
8 1800
 
6.0%
7 1668
 
5.6%
9 1653
 
5.5%
Other values (22) 10000
33.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 20000
66.7%
Uppercase Letter 10000
33.3%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
S 658
 
6.6%
R 638
 
6.4%
M 591
 
5.9%
K 579
 
5.8%
T 557
 
5.6%
C 557
 
5.6%
D 515
 
5.1%
L 513
 
5.1%
Z 501
 
5.0%
I 500
 
5.0%
Other values (12) 4391
43.9%
Decimal Number
ValueCountFrequency (%)
0 2500
12.5%
1 2264
11.3%
2 2169
10.8%
3 2069
10.3%
4 2009
10.0%
5 1948
9.7%
6 1920
9.6%
8 1800
9.0%
7 1668
8.3%
9 1653
8.3%

Most occurring scripts

ValueCountFrequency (%)
Common 20000
66.7%
Latin 10000
33.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
S 658
 
6.6%
R 638
 
6.4%
M 591
 
5.9%
K 579
 
5.8%
T 557
 
5.6%
C 557
 
5.6%
D 515
 
5.1%
L 513
 
5.1%
Z 501
 
5.0%
I 500
 
5.0%
Other values (12) 4391
43.9%
Common
ValueCountFrequency (%)
0 2500
12.5%
1 2264
11.3%
2 2169
10.8%
3 2069
10.3%
4 2009
10.0%
5 1948
9.7%
6 1920
9.6%
8 1800
9.0%
7 1668
8.3%
9 1653
8.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 30000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 2500
 
8.3%
1 2264
 
7.5%
2 2169
 
7.2%
3 2069
 
6.9%
4 2009
 
6.7%
5 1948
 
6.5%
6 1920
 
6.4%
8 1800
 
6.0%
7 1668
 
5.6%
9 1653
 
5.5%
Other values (22) 10000
33.3%
Distinct14
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
종합병원
1161 
상급종합병원
1161 
의원
1129 
병원
1118 
요양병원
991 
Other values (9)
4440 

Length

Max length6
Median length5
Mean length3.9542
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row요양병원
2nd row요양병원
3rd row의원
4th row한방병원
5th row보건진료소

Common Values

ValueCountFrequency (%)
종합병원 1161
11.6%
상급종합병원 1161
11.6%
의원 1129
11.3%
병원 1118
11.2%
요양병원 991
9.9%
한방병원 803
8.0%
보건의료원 725
7.2%
보건지소 626
6.3%
보건소 623
6.2%
정신요양병원 564
5.6%
Other values (4) 1099
11.0%

Length

2023-12-12T21:15:31.786441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
종합병원 1161
11.6%
상급종합병원 1161
11.6%
의원 1129
11.3%
병원 1118
11.2%
요양병원 991
9.9%
한방병원 803
8.0%
보건의료원 725
7.2%
보건지소 626
6.3%
보건소 623
6.2%
정신요양병원 564
5.6%
Other values (4) 1099
11.0%

환자수
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct3328
Distinct (%)33.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean21166.466
Minimum1
Maximum12319809
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T21:15:31.964515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q16
median79
Q31325.25
95-th percentile43359.75
Maximum12319809
Range12319808
Interquartile range (IQR)1319.25

Descriptive statistics

Standard deviation213877.1
Coefficient of variation (CV)10.104526
Kurtosis1338.7885
Mean21166.466
Median Absolute Deviation (MAD)78
Skewness30.324862
Sum2.1166466 × 108
Variance4.5743416 × 1010
MonotonicityNot monotonic
2023-12-12T21:15:32.121530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1019
 
10.2%
2 521
 
5.2%
3 349
 
3.5%
4 252
 
2.5%
5 194
 
1.9%
6 177
 
1.8%
8 126
 
1.3%
7 122
 
1.2%
9 121
 
1.2%
10 106
 
1.1%
Other values (3318) 7013
70.1%
ValueCountFrequency (%)
1 1019
10.2%
2 521
5.2%
3 349
 
3.5%
4 252
 
2.5%
5 194
 
1.9%
6 177
 
1.8%
7 122
 
1.2%
8 126
 
1.3%
9 121
 
1.2%
10 106
 
1.1%
ValueCountFrequency (%)
12319809 1
< 0.1%
6092906 1
< 0.1%
5780060 1
< 0.1%
5766409 1
< 0.1%
4828526 1
< 0.1%
4113245 1
< 0.1%
3739786 1
< 0.1%
3284446 1
< 0.1%
3270492 1
< 0.1%
2924804 1
< 0.1%

명세서청구건수
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct4017
Distinct (%)40.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean57231.894
Minimum1
Maximum42305457
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T21:15:32.306894image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q113
median182
Q33471.75
95-th percentile116318.75
Maximum42305457
Range42305456
Interquartile range (IQR)3458.75

Descriptive statistics

Standard deviation711753.32
Coefficient of variation (CV)12.436306
Kurtosis1940.72
Mean57231.894
Median Absolute Deviation (MAD)181
Skewness38.881511
Sum5.7231894 × 108
Variance5.0659279 × 1011
MonotonicityNot monotonic
2023-12-12T21:15:32.492095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 732
 
7.3%
2 375
 
3.8%
3 268
 
2.7%
4 210
 
2.1%
5 162
 
1.6%
6 154
 
1.5%
8 124
 
1.2%
7 119
 
1.2%
9 97
 
1.0%
10 86
 
0.9%
Other values (4007) 7673
76.7%
ValueCountFrequency (%)
1 732
7.3%
2 375
3.8%
3 268
 
2.7%
4 210
 
2.1%
5 162
 
1.6%
6 154
 
1.5%
7 119
 
1.2%
8 124
 
1.2%
9 97
 
1.0%
10 86
 
0.9%
ValueCountFrequency (%)
42305457 1
< 0.1%
34712159 1
< 0.1%
21201276 1
< 0.1%
16725597 1
< 0.1%
13126643 1
< 0.1%
10535482 1
< 0.1%
9842303 1
< 0.1%
8383783 1
< 0.1%
8305678 1
< 0.1%
8251914 1
< 0.1%

입내원일수
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct4402
Distinct (%)44.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean66335.11
Minimum0
Maximum42304434
Zeros96
Zeros (%)1.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T21:15:32.690697image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q115
median287.5
Q35522
95-th percentile149969.35
Maximum42304434
Range42304434
Interquartile range (IQR)5507

Descriptive statistics

Standard deviation750983.45
Coefficient of variation (CV)11.321055
Kurtosis1621.6349
Mean66335.11
Median Absolute Deviation (MAD)286.5
Skewness35.306826
Sum6.633511 × 108
Variance5.6397614 × 1011
MonotonicityNot monotonic
2023-12-12T21:15:32.849512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 687
 
6.9%
2 355
 
3.5%
3 230
 
2.3%
4 195
 
1.9%
5 142
 
1.4%
6 134
 
1.3%
8 116
 
1.2%
7 108
 
1.1%
0 96
 
1.0%
9 83
 
0.8%
Other values (4392) 7854
78.5%
ValueCountFrequency (%)
0 96
 
1.0%
1 687
6.9%
2 355
3.5%
3 230
 
2.3%
4 195
 
1.9%
5 142
 
1.4%
6 134
 
1.3%
7 108
 
1.1%
8 116
 
1.2%
9 83
 
0.8%
ValueCountFrequency (%)
42304434 1
< 0.1%
34745307 1
< 0.1%
21264864 1
< 0.1%
20331469 1
< 0.1%
16739088 1
< 0.1%
13127317 1
< 0.1%
10513701 1
< 0.1%
9818297 1
< 0.1%
9538237 1
< 0.1%
8385604 1
< 0.1%

요양급여비용총액
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct9065
Distinct (%)90.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.3553187 × 109
Minimum0
Maximum1.67 × 1012
Zeros22
Zeros (%)0.2%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T21:15:33.062965image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile16399.5
Q1419680
median17762685
Q35.5515356 × 108
95-th percentile1.8407241 × 1010
Maximum1.67 × 1012
Range1.67 × 1012
Interquartile range (IQR)5.5473388 × 108

Descriptive statistics

Standard deviation3.8131756 × 1010
Coefficient of variation (CV)7.1203524
Kurtosis653.58124
Mean5.3553187 × 109
Median Absolute Deviation (MAD)17746035
Skewness21.179877
Sum5.3553187 × 1013
Variance1.4540308 × 1021
MonotonicityNot monotonic
2023-12-12T21:15:33.260273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5610 101
 
1.0%
11220 62
 
0.6%
4930 56
 
0.6%
16370 37
 
0.4%
22440 33
 
0.3%
16830 30
 
0.3%
0 22
 
0.2%
33660 21
 
0.2%
11870 19
 
0.2%
3570 18
 
0.2%
Other values (9055) 9601
96.0%
ValueCountFrequency (%)
0 22
 
0.2%
1250 1
 
< 0.1%
3570 18
 
0.2%
4180 2
 
< 0.1%
4300 1
 
< 0.1%
4380 1
 
< 0.1%
4930 56
0.6%
5100 3
 
< 0.1%
5610 101
1.0%
5780 1
 
< 0.1%
ValueCountFrequency (%)
1670000000000 1
< 0.1%
1300000000000 1
< 0.1%
1010000000000 1
< 0.1%
936000000000 1
< 0.1%
757000000000 1
< 0.1%
689000000000 1
< 0.1%
687000000000 1
< 0.1%
662000000000 1
< 0.1%
537000000000 1
< 0.1%
532000000000 1
< 0.1%

보험자부담금
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct9315
Distinct (%)93.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.0868228 × 109
Minimum0
Maximum1.26 × 1012
Zeros22
Zeros (%)0.2%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T21:15:33.508578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile11750
Q1311987.5
median13178395
Q34.0589192 × 108
95-th percentile1.3223037 × 1010
Maximum1.26 × 1012
Range1.26 × 1012
Interquartile range (IQR)4.0557993 × 108

Descriptive statistics

Standard deviation2.9529768 × 1010
Coefficient of variation (CV)7.2256051
Kurtosis603.39569
Mean4.0868228 × 109
Median Absolute Deviation (MAD)13166140
Skewness20.496018
Sum4.0868228 × 1013
Variance8.720072 × 1020
MonotonicityNot monotonic
2023-12-12T21:15:33.995021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4510 47
 
0.5%
5110 40
 
0.4%
9870 34
 
0.3%
4030 29
 
0.3%
0 22
 
0.2%
9020 21
 
0.2%
10320 19
 
0.2%
2670 18
 
0.2%
4430 18
 
0.2%
7170 14
 
0.1%
Other values (9305) 9738
97.4%
ValueCountFrequency (%)
0 22
0.2%
1000 1
 
< 0.1%
2580 2
 
< 0.1%
2670 18
0.2%
2700 1
 
< 0.1%
2870 1
 
< 0.1%
2910 1
 
< 0.1%
3210 1
 
< 0.1%
3410 1
 
< 0.1%
3510 1
 
< 0.1%
ValueCountFrequency (%)
1260000000000 1
< 0.1%
950000000000 1
< 0.1%
739000000000 1
< 0.1%
729000000000 1
< 0.1%
700000000000 1
< 0.1%
607000000000 1
< 0.1%
541000000000 1
< 0.1%
520000000000 1
< 0.1%
405000000000 1
< 0.1%
402000000000 1
< 0.1%

Interactions

2023-12-12T21:15:29.236401image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:15:27.047126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:15:27.656322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:15:28.166907image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:15:28.646394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:15:29.362887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:15:27.162616image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:15:27.775750image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:15:28.266773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:15:28.755517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:15:29.487425image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:15:27.271106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:15:27.885115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:15:28.358976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:15:28.840044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:15:29.600812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:15:27.403424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:15:27.976949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:15:28.438391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:15:28.935095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:15:29.734403image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:15:27.544404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:15:28.073897image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:15:28.546146image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:15:29.087396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:15:34.121838image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
의료기관종별환자수명세서청구건수입내원일수요양급여비용총액보험자부담금
의료기관종별1.0000.1120.1250.0840.0530.049
환자수0.1121.0000.8510.8530.7620.617
명세서청구건수0.1250.8511.0000.9820.8680.714
입내원일수0.0840.8530.9821.0000.9190.939
요양급여비용총액0.0530.7620.8680.9191.0000.958
보험자부담금0.0490.6170.7140.9390.9581.000
2023-12-12T21:15:34.242391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
환자수명세서청구건수입내원일수요양급여비용총액보험자부담금의료기관종별
환자수1.0000.9840.9590.9140.9110.055
명세서청구건수0.9841.0000.9830.9390.9370.047
입내원일수0.9590.9831.0000.9660.9660.037
요양급여비용총액0.9140.9390.9661.0000.9990.022
보험자부담금0.9110.9370.9660.9991.0000.021
의료기관종별0.0550.0470.0370.0220.0211.000

Missing values

2023-12-12T21:15:29.895398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:15:30.118514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

진료년도주상병코드의료기관종별환자수명세서청구건수입내원일수요양급여비용총액보험자부담금
110712022R32요양병원1445157507601632720441993880
3372022A59요양병원1126291126780775460
37582022F72의원104384059228471688880362696090
80182022M00한방병원46852723186144024107150
67192022K21보건진료소543168317152779959022635390
49132022H53보건지소81313306170223870
70262022K61의원450561639291739882777370074022091256150
30802022E71한방병원140119870107900
48952022H51종합병원2294835528417091052250140
61522022J34치과의원91010246760173260
진료년도주상병코드의료기관종별환자수명세서청구건수입내원일수요양급여비용총액보험자부담금
86702022M81병원1690583534213559033118731292019162499790
39262022G03보건의료원1111665011750
123642022S76병원10495292124412637910135702808536000
83282022M35병원117348856992569281470468635800
90802022N31보건진료소2445100037500
95602022O00정신요양병원122203500122200
53952022I34종합병원5949199352501181726130507061990580
17552022C82의원37620392464151963560140643130
131762022T75상급종합병원4145741253821587450656205500
124182022S82병원7351738751377436312200000000091549420040