Overview

Dataset statistics

Number of variables7
Number of observations3486
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory204.4 KiB
Average record size in memory60.0 B

Variable types

Numeric4
Categorical2
Text1

Dataset

Description○ 내용: 해당년도 안전손상질환 관련 의료 이용이 있는 환자의 비율○ 대상: 해당년도 직장가입자○ 산출식-분자: 안전손상질환 관련 주상병코드로 의료 이용이 있는 환자 수-분모: 직장가입자 수※ 안전손상질환 상병코드 : S02, S03, S12, S13, S23, S32, S33, S42, S43, S52, S53, S56, S57, S62, S63, S66, S67, S68, S72, S73, S82, S83, S220, S221, S222, S223, S224, S228, S229, S300
Author국민건강보험공단
URLhttps://www.data.go.kr/data/15089378/fileData.do

Alerts

지표명 has constant value ""Constant
분모(명) is highly overall correlated with 분자(명)High correlation
분자(명) is highly overall correlated with 분모(명)High correlation

Reproduction

Analysis started2023-12-12 09:21:02.852625
Analysis finished2023-12-12 09:21:05.729607
Duration2.88 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

지표연도
Real number (ℝ)

Distinct13
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2015.006
Minimum2009
Maximum2021
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size30.8 KiB
2023-12-12T18:21:05.792749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2009
5-th percentile2009
Q12012
median2015
Q32018
95-th percentile2021
Maximum2021
Range12
Interquartile range (IQR)6

Descriptive statistics

Standard deviation3.7367028
Coefficient of variation (CV)0.0018544375
Kurtosis-1.2104475
Mean2015.006
Median Absolute Deviation (MAD)3
Skewness-0.0011248425
Sum7024311
Variance13.962948
MonotonicityIncreasing
2023-12-12T18:21:05.957476image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%)
2015 270
 
7.7%
2016 270
 
7.7%
2013 269
 
7.7%
2014 269
 
7.7%
2011 268
 
7.7%
2012 268
 
7.7%
2017 268
 
7.7%
2018 268
 
7.7%
2019 268
 
7.7%
2020 268
 
7.7%
Other values (3) 800
22.9%
ValueCountFrequency (%)
2009 266
7.6%
2010 266
7.6%
2011 268
7.7%
2012 268
7.7%
2013 269
7.7%
2014 269
7.7%
2015 270
7.7%
2016 270
7.7%
2017 268
7.7%
2018 268
7.7%
ValueCountFrequency (%)
2021 268
7.7%
2020 268
7.7%
2019 268
7.7%
2018 268
7.7%
2017 268
7.7%
2016 270
7.7%
2015 270
7.7%
2014 269
7.7%
2013 269
7.7%
2012 268
7.7%

시도
Categorical

Distinct18
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size27.4 KiB
경기도
575 
서울특별시
338 
경상북도
325 
전라남도
299 
경상남도
295 
Other values (13)
1654 

Length

Max length7
Median length5
Mean length4.10786
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row전국
2nd row서울특별시
3rd row서울특별시
4th row서울특별시
5th row서울특별시

Common Values

ValueCountFrequency (%)
경기도 575
16.5%
서울특별시 338
9.7%
경상북도 325
9.3%
전라남도 299
8.6%
경상남도 295
8.5%
강원도 247
7.1%
충청남도 225
 
6.5%
부산광역시 221
 
6.3%
전라북도 208
 
6.0%
충청북도 189
 
5.4%
Other values (8) 564
16.2%

Length

2023-12-12T18:21:06.142941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기도 575
16.5%
서울특별시 338
9.7%
경상북도 325
9.3%
전라남도 299
8.6%
경상남도 295
8.5%
강원도 247
7.1%
충청남도 225
 
6.5%
부산광역시 221
 
6.3%
전라북도 208
 
6.0%
충청북도 189
 
5.4%
Other values (8) 564
16.2%
Distinct239
Distinct (%)6.9%
Missing0
Missing (%)0.0%
Memory size27.4 KiB
2023-12-12T18:21:06.617409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length3
Mean length3.3646013
Min length2

Characters and Unicode

Total characters11729
Distinct characters145
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row전체
2nd row전체
3rd row종로구
4th row중구
5th row용산구
ValueCountFrequency (%)
전체 230
 
5.9%
동구 78
 
2.0%
중구 78
 
2.0%
남구 75
 
1.9%
서구 65
 
1.7%
북구 65
 
1.7%
창원시 57
 
1.5%
수원시 52
 
1.3%
청주시 40
 
1.0%
고양시 39
 
1.0%
Other values (237) 3125
80.0%
2023-12-12T18:21:07.311942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1380
 
11.8%
1288
 
11.0%
1123
 
9.6%
418
 
3.6%
315
 
2.7%
300
 
2.6%
295
 
2.5%
286
 
2.4%
271
 
2.3%
260
 
2.2%
Other values (135) 5793
49.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 11311
96.4%
Space Separator 418
 
3.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1380
 
12.2%
1288
 
11.4%
1123
 
9.9%
315
 
2.8%
300
 
2.7%
295
 
2.6%
286
 
2.5%
271
 
2.4%
260
 
2.3%
256
 
2.3%
Other values (134) 5537
49.0%
Space Separator
ValueCountFrequency (%)
418
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 11311
96.4%
Common 418
 
3.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1380
 
12.2%
1288
 
11.4%
1123
 
9.9%
315
 
2.8%
300
 
2.7%
295
 
2.6%
286
 
2.5%
271
 
2.4%
260
 
2.3%
256
 
2.3%
Other values (134) 5537
49.0%
Common
ValueCountFrequency (%)
418
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 11311
96.4%
ASCII 418
 
3.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1380
 
12.2%
1288
 
11.4%
1123
 
9.9%
315
 
2.8%
300
 
2.7%
295
 
2.6%
286
 
2.5%
271
 
2.4%
260
 
2.3%
256
 
2.3%
Other values (134) 5537
49.0%
ASCII
ValueCountFrequency (%)
418
100.0%

지표명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size27.4 KiB
안전손상의료이용률
3486 

Length

Max length9
Median length9
Mean length9
Min length9

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row안전손상의료이용률
2nd row안전손상의료이용률
3rd row안전손상의료이용률
4th row안전손상의료이용률
5th row안전손상의료이용률

Common Values

ValueCountFrequency (%)
안전손상의료이용률 3486
100.0%

Length

2023-12-12T18:21:07.480020image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:21:07.594820image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
안전손상의료이용률 3486
100.0%

분모(명)
Real number (ℝ)

HIGH CORRELATION 

Distinct3416
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean162150.92
Minimum1359
Maximum17273414
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size30.8 KiB
2023-12-12T18:21:07.734450image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1359
5-th percentile5361.75
Q113893.5
median41592.5
Q379534.75
95-th percentile437919.75
Maximum17273414
Range17272055
Interquartile range (IQR)65641.25

Descriptive statistics

Standard deviation954657.81
Coefficient of variation (CV)5.8874646
Kurtosis208.06553
Mean162150.92
Median Absolute Deviation (MAD)30365
Skewness13.771639
Sum5.6525812 × 108
Variance9.1137154 × 1011
MonotonicityNot monotonic
2023-12-12T18:21:07.917387image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
6435 3
 
0.1%
47547 3
 
0.1%
6080 2
 
0.1%
5947 2
 
0.1%
10651 2
 
0.1%
15009 2
 
0.1%
67472 2
 
0.1%
87673 2
 
0.1%
141161 2
 
0.1%
45578 2
 
0.1%
Other values (3406) 3464
99.4%
ValueCountFrequency (%)
1359 1
< 0.1%
1537 2
0.1%
1617 1
< 0.1%
1625 1
< 0.1%
1654 1
< 0.1%
1666 1
< 0.1%
1699 1
< 0.1%
1739 1
< 0.1%
1851 1
< 0.1%
1865 1
< 0.1%
ValueCountFrequency (%)
17273414 1
< 0.1%
16995762 1
< 0.1%
16563389 1
< 0.1%
15940192 1
< 0.1%
15441440 1
< 0.1%
15006289 1
< 0.1%
14485092 1
< 0.1%
14259200 1
< 0.1%
13635522 1
< 0.1%
13094050 1
< 0.1%

분자(명)
Real number (ℝ)

HIGH CORRELATION 

Distinct3226
Distinct (%)92.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean35555.978
Minimum222
Maximum3907620
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size30.8 KiB
2023-12-12T18:21:08.086348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum222
5-th percentile1118.5
Q13078.75
median9008
Q317938.25
95-th percentile95913
Maximum3907620
Range3907398
Interquartile range (IQR)14859.5

Descriptive statistics

Standard deviation210839.11
Coefficient of variation (CV)5.9297797
Kurtosis221.64489
Mean35555.978
Median Absolute Deviation (MAD)6659
Skewness14.176064
Sum1.2394814 × 108
Variance4.4453132 × 1010
MonotonicityNot monotonic
2023-12-12T18:21:08.247604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
6080 4
 
0.1%
17772 3
 
0.1%
1350 3
 
0.1%
1607 3
 
0.1%
1632 3
 
0.1%
1307 3
 
0.1%
2172 3
 
0.1%
1585 3
 
0.1%
2934 3
 
0.1%
1313 3
 
0.1%
Other values (3216) 3455
99.1%
ValueCountFrequency (%)
222 1
< 0.1%
252 1
< 0.1%
272 1
< 0.1%
298 1
< 0.1%
305 1
< 0.1%
306 1
< 0.1%
315 1
< 0.1%
328 1
< 0.1%
332 1
< 0.1%
333 1
< 0.1%
ValueCountFrequency (%)
3907620 1
< 0.1%
3848827 1
< 0.1%
3767789 1
< 0.1%
3630473 1
< 0.1%
3534350 1
< 0.1%
3436629 1
< 0.1%
3296852 1
< 0.1%
3238459 1
< 0.1%
3065337 1
< 0.1%
2832768 1
< 0.1%

지표값(퍼센트)
Real number (ℝ)

Distinct1021
Distinct (%)29.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean22.14889
Minimum11.28
Maximum30.07
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size30.8 KiB
2023-12-12T18:21:08.452481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum11.28
5-th percentile16.4925
Q120.97
median22.505
Q323.84
95-th percentile25.7175
Maximum30.07
Range18.79
Interquartile range (IQR)2.87

Descriptive statistics

Standard deviation2.6481536
Coefficient of variation (CV)0.11956146
Kurtosis1.736601
Mean22.14889
Median Absolute Deviation (MAD)1.425
Skewness-0.96463962
Sum77211.03
Variance7.0127176
MonotonicityNot monotonic
2023-12-12T18:21:08.913754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
22.96 15
 
0.4%
23.47 15
 
0.4%
22.62 14
 
0.4%
21.57 13
 
0.4%
22.48 13
 
0.4%
23.52 11
 
0.3%
22.34 11
 
0.3%
24.63 11
 
0.3%
21.87 11
 
0.3%
21.9 11
 
0.3%
Other values (1011) 3361
96.4%
ValueCountFrequency (%)
11.28 1
< 0.1%
11.5 1
< 0.1%
11.73 1
< 0.1%
11.79 1
< 0.1%
11.95 1
< 0.1%
12.17 1
< 0.1%
12.31 1
< 0.1%
12.42 2
0.1%
12.54 1
< 0.1%
12.85 1
< 0.1%
ValueCountFrequency (%)
30.07 1
< 0.1%
29.63 1
< 0.1%
29.62 1
< 0.1%
29.25 1
< 0.1%
29.24 1
< 0.1%
29.18 1
< 0.1%
29.11 1
< 0.1%
29.05 1
< 0.1%
29.03 1
< 0.1%
28.9 1
< 0.1%

Interactions

2023-12-12T18:21:04.963835image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:21:03.352580image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:21:03.877273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:21:04.390243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:21:05.075564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:21:03.472522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:21:04.022617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:21:04.504748image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:21:05.210820image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:21:03.615275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:21:04.135879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:21:04.676849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:21:05.342793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:21:03.766499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:21:04.247036image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:21:04.823879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T18:21:09.036285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지표연도시도분모(명)분자(명)지표값(퍼센트)
지표연도1.0000.0000.0000.0000.264
시도0.0001.0000.7050.6920.432
분모(명)0.0000.7051.0000.9570.022
분자(명)0.0000.6920.9571.0000.048
지표값(퍼센트)0.2640.4320.0220.0481.000
2023-12-12T18:21:09.168620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지표연도분모(명)분자(명)지표값(퍼센트)시도
지표연도1.0000.1220.1650.3380.000
분모(명)0.1221.0000.9930.0190.410
분자(명)0.1650.9931.0000.1110.377
지표값(퍼센트)0.3380.0190.1111.0000.180
시도0.0000.4100.3770.1801.000

Missing values

2023-12-12T18:21:05.506252image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:21:05.674168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

지표연도시도시군구지표명분모(명)분자(명)지표값(퍼센트)
02009전국전체안전손상의료이용률11414267170151014.91
12009서울특별시전체안전손상의료이용률395743252240613.2
22009서울특별시종로구안전손상의료이용률2427783014612.42
32009서울특별시중구안전손상의료이용률4584165694512.42
42009서울특별시용산구안전손상의료이용률1185371609613.58
52009서울특별시성동구안전손상의료이용률1003231363513.59
62009서울특별시광진구안전손상의료이용률63403861613.59
72009서울특별시동대문구안전손상의료이용률776531050413.53
82009서울특별시중랑구안전손상의료이용률37263573615.39
92009서울특별시성북구안전손상의료이용률46337595912.86
지표연도시도시군구지표명분모(명)분자(명)지표값(퍼센트)
34762021경상남도고성군안전손상의료이용률15038348223.15
34772021경상남도남해군안전손상의료이용률8207159119.39
34782021경상남도하동군안전손상의료이용률8608184021.38
34792021경상남도산청군안전손상의료이용률8509159018.69
34802021경상남도함양군안전손상의료이용률8520178620.96
34812021경상남도거창군안전손상의료이용률12766269821.13
34822021경상남도합천군안전손상의료이용률8968190421.23
34832021제주특별자치도전체안전손상의료이용률1856253725220.07
34842021제주특별자치도제주시안전손상의료이용률1419622831019.94
34852021제주특별자치도서귀포시안전손상의료이용률43663894220.48