Overview

Dataset statistics

Number of variables: 6
Number of observations: 260
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 12.8 KiB
Average record size in memory: 50.5 B

Variable types

Text: 2
DateTime: 1
Numeric: 2
Categorical: 1

Dataset

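Description: The "List of Health Insurance Benefit and Non-Benefit Procedures and Benefit Relative Value Scores" (「건강보험 행위 급여·비급여 목록표 및 급여 상대가치점수」) provided by the Health Insurance Review & Assessment Service (건강보험심사평가원). It is available on the service's website under 제도정책 (Policies) > 보험인정기준 (Insurance Coverage Criteria).
Author: Health Insurance Review & Assessment Service (건강보험심사평가원)
URL: https://www.data.go.kr/data/15067458/fileData.do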

Alerts

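적용일자 (effective date) has constant value "2023-01-01" [Constant]
단가 (unit price) is highly overall correlated with 상대가치점수 (relative value score) [High correlation]
상대가치점수 (relative value score) is highly overall correlated with 단가 (unit price) [High correlation]
수가코드 (fee code) has unique values [Unique]
한글명 (Korean name) has unique values [Unique]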

Reproduction

Analysis started: 2023-12-12 22:05:11.645434
Analysis finished: 2023-12-12 22:05:12.263959
Duration: 0.62 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
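
As a quick consistency check, the reported duration can be recomputed from the two timestamps above. The snippet is a minimal sketch using only the Python standard library; the variable names are illustrative and not part of the report.

```python
from datetime import datetime

# Timestamps copied from the Reproduction section of the report.
FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2023-12-12 22:05:11.645434", FMT)
finished = datetime.strptime("2023-12-12 22:05:12.263959", FMT)

# Elapsed wall-clock time in seconds, formatted with two decimals
# to match the precision shown in the report.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```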

Variables

수가코드 (fee code)
Text

UNIQUE 

Distinct: 260
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.2 KiB

Length

Max length: 8
Median length: 8
Mean length: 7.4923077
Min length: 5

Characters and Unicode

Total characters: 1948
Distinct characters: 12
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
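
To illustrate how such character properties can be derived, the sketch below classifies every character of the five sample 수가코드 values by Unicode general category, using Python's standard `unicodedata` module. The helper name `category_counts` is hypothetical, and the profiling tool may compute these properties differently. Note that `unicodedata` returns two-letter codes (`Lu`, `Nd`) where the report shows long names (Uppercase Letter, Decimal Number).

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count characters per Unicode general category across all values."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts


# The five sample rows listed for the 수가코드 column.
sample_codes = ["Z1000", "Z1000001", "Z2000", "Z2000010", "Z2000011"]
code_counts = category_counts(sample_codes)
print(code_counts)
```

For these five values the count is 5 uppercase letters (`Lu`) and 29 decimal digits (`Nd`), the same two categories the report lists for the full column.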

Unique

Unique: 260
Unique (%): 100.0%

Sample

1st row: Z1000
2nd row: Z1000001
3rd row: Z2000
4th row: Z2000010
5th row: Z2000011

Value               Count  Frequency (%)
z1000               1      0.4%
z4200050            1      0.4%
z4221010            1      0.4%
z4201011            1      0.4%
z4201020            1      0.4%
z4201021            1      0.4%
z4201030            1      0.4%
z4201050            1      0.4%
z4220               1      0.4%
z4220010            1      0.4%
Other values (250)  250    96.2%

Most occurring characters

Value             Count  Frequency (%)
0                 561    28.8%
1                 423    21.7%
Z                 260    13.3%
4                 255    13.1%
2                 167    8.6%
3                 142    7.3%
5                 59     3.0%
6                 35     1.8%
7                 15     0.8%
8                 14     0.7%
Other values (2)  17     0.9%

Most occurring categories

Value             Count  Frequency (%)
Decimal Number    1685   86.5%
Uppercase Letter  263    13.5%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
0      561    33.3%
1      423    25.1%
4      255    15.1%
2      167    9.9%
3      142    8.4%
5      59     3.5%
6      35     2.1%
7      15     0.9%
8      14     0.8%
9      14     0.8%

Uppercase Letter
Value  Count  Frequency (%)
Z      260    98.9%
H      3      1.1%

Most occurring scripts

Value   Count  Frequency (%)
Common  1685   86.5%
Latin   263    13.5%

Most frequent character per script

Common
Value  Count  Frequency (%)
0      561    33.3%
1      423    25.1%
4      255    15.1%
2      167    9.9%
3      142    8.4%
5      59     3.5%
6      35     2.1%
7      15     0.9%
8      14     0.8%
9      14     0.8%

Latin
Value  Count  Frequency (%)
Z      260    98.9%
H      3      1.1%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  1948   100.0%

Most frequent character per block

ASCII
Value             Count  Frequency (%)
0                 561    28.8%
1                 423    21.7%
Z                 260    13.3%
4                 255    13.1%
2                 167    8.6%
3                 142    7.3%
5                 59     3.0%
6                 35     1.8%
7                 15     0.8%
8                 14     0.7%
Other values (2)  17     0.9%

한글명 (Korean name)
Text

UNIQUE 

Distinct: 260
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.2 KiB

Length

Max length: 31
Median length: 26
Mean length: 20.761538
Min length: 7

Characters and Unicode

Total characters: 5398
Distinct characters: 90
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
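
The less common categories reported for this column, such as Letter Number, are easier to interpret with a concrete value. The sketch below lists the general category and official Unicode name of each character in one 한글명 value taken from the report's sample rows. It assumes the trailing roman numeral is U+2160, which would be consistent with the Number Forms block listed in the report; the variable names are illustrative.

```python
import unicodedata

# One 한글명 value from the report's sample rows; the trailing roman numeral
# is written as an escape (assumed to be U+2160) so the code point is explicit.
name = "코로나19 대면투약관리료\u2160"

# Pair each character with its general category code and its Unicode name.
properties = [(ch, unicodedata.category(ch), unicodedata.name(ch)) for ch in name]

for ch, category, char_name in properties:
    print(repr(ch), category, char_name)
```

Hangul syllables come out as `Lo` (Other Letter), the digits as `Nd`, the space as `Zs` (Space Separator), and the roman numeral as `Nl` (Letter Number).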

Unique

Unique: 260
Unique (%): 100.0%

Sample

1st row: 약국관리료(방문당)
2nd row: 약국관리료(방문당)-차등수가제외
3rd row: 조제기본료(방문당)
4th row: 조제기본료(방문당)-야간
5th row: 조제기본료(방문당)-야간,차등수가제외

Value               Count  Frequency (%)
처방조제-내복약      175    30.4%
이상                64     11.1%
61일분              7      1.2%
26일분              7      1.2%
81일분              7      1.2%
71일분              7      1.2%
41일분              7      1.2%
91일분              7      1.2%
16일분              7      1.2%
21일분              7      1.2%
Other values (266)  281    48.8%
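
The frequencies in this table are shares of all word tokens rather than of rows: the counts sum to 576 tokens, so 175 / 576 gives 30.4%. The sketch below shows one way such a word count could be produced. It assumes tokens are lowercased and split on whitespace only, which is consistent with the table but not confirmed by the report, and it uses four 한글명 values from the report's sample rows.

```python
from collections import Counter

# Four 한글명 values taken from the report's sample rows.
names = [
    "처방조제-내복약 91일분 이상-심야",
    "처방조제-내복약 91일분 이상-심야,차등수가제외",
    "처방조제-내복약 91일분 이상-토요09-13",
    "처방조제-내복약 91일분 이상-공휴",
]

# Assumption: tokens are lowercased and split on whitespace only,
# so hyphenated or comma-joined parts stay in a single token.
word_counts = Counter(word for name in names for word in name.lower().split())

# Frequency is the token count divided by the total number of tokens.
total = sum(word_counts.values())
for word, count in word_counts.most_common(2):
    print(f"{word}\t{count}\t{count / total:.1%}")
```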

Most occurring characters

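Value              Count  Frequency (%)
-                  537    9.9%
[character lost]   328    6.1%
(space)            316    5.9%
[character lost]   260    4.8%
[character lost]   249    4.6%
[character lost]   245    4.5%
[character lost]   238    4.4%
[character lost]   227    4.2%
[character lost]   219    4.1%
[character lost]   212    3.9%
Other values (80)  2567   47.6%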

Most occurring categories

Value              Count  Frequency (%)
Other Letter       3702   68.6%
Decimal Number     605    11.2%
Dash Punctuation   537    9.9%
Space Separator    316    5.9%
Other Punctuation  92     1.7%
Close Punctuation  72     1.3%
Open Punctuation   72     1.3%
Letter Number      2      < 0.1%

Most frequent character per category

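Other Letter
Value              Count  Frequency (%)
[character lost]   328    8.9%
[character lost]   260    7.0%
[character lost]   249    6.7%
[character lost]   245    6.6%
[character lost]   238    6.4%
[character lost]   227    6.1%
[character lost]   219    5.9%
[character lost]   212    5.7%
[character lost]   204    5.5%
[character lost]   143    3.9%
Other values (62)  1377   37.2%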

Decimal Number
Value  Count  Frequency (%)
1      193    31.9%
0      99     16.4%
3      64     10.6%
9      60     9.9%
2      42     6.9%
6      42     6.9%
5      35     5.8%
4      28     4.6%
7      21     3.5%
8      21     3.5%

Other Punctuation
Value  Count  Frequency (%)
,      91     98.9%
·      1      1.1%

Letter Number
Value  Count  Frequency (%)
Ⅰ      1      50.0%
Ⅱ      1      50.0%

Dash Punctuation
Value  Count  Frequency (%)
-      537    100.0%

Space Separator
Value    Count  Frequency (%)
(space)  316    100.0%

Close Punctuation
Value  Count  Frequency (%)
)      72     100.0%

Open Punctuation
Value  Count  Frequency (%)
(      72     100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  3702   68.6%
Common  1694   31.4%
Latin   2      < 0.1%

Most frequent character per script

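Hangul
Value              Count  Frequency (%)
[character lost]   328    8.9%
[character lost]   260    7.0%
[character lost]   249    6.7%
[character lost]   245    6.6%
[character lost]   238    6.4%
[character lost]   227    6.1%
[character lost]   219    5.9%
[character lost]   212    5.7%
[character lost]   204    5.5%
[character lost]   143    3.9%
Other values (62)  1377   37.2%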

Common
Value             Count  Frequency (%)
-                 537    31.7%
(space)           316    18.7%
1                 193    11.4%
0                 99     5.8%
,                 91     5.4%
)                 72     4.3%
(                 72     4.3%
3                 64     3.8%
9                 60     3.5%
2                 42     2.5%
Other values (6)  148    8.7%

Latin
Value  Count  Frequency (%)
Ⅰ      1      50.0%
Ⅱ      1      50.0%

Most occurring blocks

Value         Count  Frequency (%)
Hangul        3702   68.6%
ASCII         1693   31.4%
Number Forms  2      < 0.1%
None          1      < 0.1%

Most frequent character per block

ASCII
Value             Count  Frequency (%)
-                 537    31.7%
(space)           316    18.7%
1                 193    11.4%
0                 99     5.8%
,                 91     5.4%
)                 72     4.3%
(                 72     4.3%
3                 64     3.8%
9                 60     3.5%
2                 42     2.5%
Other values (5)  147    8.7%
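
Hangul
Value              Count  Frequency (%)
[character lost]   328    8.9%
[character lost]   260    7.0%
[character lost]   249    6.7%
[character lost]   245    6.6%
[character lost]   238    6.4%
[character lost]   227    6.1%
[character lost]   219    5.9%
[character lost]   212    5.7%
[character lost]   204    5.5%
[character lost]   143    3.9%
Other values (62)  1377   37.2%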

Number Forms
Value  Count  Frequency (%)
Ⅰ      1      50.0%
Ⅱ      1      50.0%

None
Value  Count  Frequency (%)
·      1      100.0%

적용일자 (effective date)
Date

CONSTANT 

Distinct: 1
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.2 KiB
Minimum: 2023-01-01 00:00:00
Maximum: 2023-01-01 00:00:00
Histogram with fixed size bins (bins=1)

단가 (unit price)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 132
Distinct (%): 50.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6846.2692
Minimum: 20
Maximum: 30810
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.4 KiB

Quantile statistics

Minimum: 20
5-th percentile: 120
Q1: 1612.5
Median: 4160
Q3: 9730
95-th percentile: 20890
Maximum: 30810
Range: 30790
Interquartile range (IQR): 8117.5

Descriptive statistics

Standard deviation: 7206.541
Coefficient of variation (CV): 1.0526231
Kurtosis: 1.6711584
Mean: 6846.2692
Median Absolute Deviation (MAD): 3360
Skewness: 1.4669576
Sum: 1780030
Variance: 51934234
Monotonicity: Not monotonic
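
Several of these figures follow directly from others in the same section: the mean is the sum divided by the number of observations, the range and IQR are differences of quantiles, the CV is the standard deviation divided by the mean, and the variance is the squared standard deviation. The sketch below recomputes them from the reported 단가 values as a sanity check. It uses only plain Python, and the variable names are illustrative.

```python
# Values copied from the 단가 section of the report.
n = 260
total = 1780030
q1, q3 = 1612.5, 9730
minimum, maximum = 20, 30810
std = 7206.541

mean = total / n                 # reported mean: 6846.2692
value_range = maximum - minimum  # reported range: 30790
iqr = q3 - q1                    # reported IQR: 8117.5
cv = std / mean                  # reported CV: 1.0526231
variance = std ** 2              # reported variance: 51934234

print(round(mean, 4), value_range, iqr, round(cv, 7), round(variance))
```

The recomputed variance differs from the reported one in the last digit because the standard deviation shown in the report is itself rounded.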
Histogram with fixed size bins (bins=50)

Common values
Value               Count  Frequency (%)
2060                9      3.5%
220                 6      2.3%
2180                6      2.3%
170                 4      1.5%
330                 4      1.5%
3160                4      1.5%
80                  4      1.5%
3360                4      1.5%
7000                3      1.2%
3650                3      1.2%
Other values (122)  213    81.9%

Minimum 10 values
Value  Count  Frequency (%)
20     1      0.4%
30     1      0.4%
50     2      0.8%
60     1      0.4%
80     4      1.5%
110    3      1.2%
120    2      0.8%
170    4      1.5%
180    1      0.4%
220    6      2.3%

Maximum 10 values
Value  Count  Frequency (%)
30810  2      0.8%
29830  2      0.8%
29000  2      0.8%
28170  2      0.8%
27210  2      0.8%
22640  2      0.8%
20890  2      0.8%
20020  3      1.2%
19390  3      1.2%
18850  3      1.2%

상대가치점수 (relative value score)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 136
Distinct (%): 52.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 70.143308
Minimum: 0.18
Maximum: 315.64
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.4 KiB

Quantile statistics

Minimum: 0.18
5-th percentile: 1.18
Q1: 16.485
Median: 42.65
Q3: 99.65
95-th percentile: 214.08
Maximum: 315.64
Range: 315.46
Interquartile range (IQR): 83.165

Descriptive statistics

Standard deviation: 73.842784
Coefficient of variation (CV): 1.0527417
Kurtosis: 1.6706636
Mean: 70.143308
Median Absolute Deviation (MAD): 34.425
Skewness: 1.4668192
Sum: 18237.26
Variance: 5452.7568
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Common values
Value               Count  Frequency (%)
21.06               6      2.3%
2.21                6      2.3%
22.36               6      2.3%
32.4                4      1.5%
1.7                 4      1.5%
3.4                 4      1.5%
34.4                4      1.5%
205.17              3      1.2%
37.35               3      1.2%
68.02               3      1.2%
Other values (126)  217    83.5%

Minimum 10 values
Value  Count  Frequency (%)
0.18   1      0.4%
0.26   1      0.4%
0.51   2      0.8%
0.59   1      0.4%
0.77   3      1.2%
0.85   1      0.4%
1.11   3      1.2%
1.18   2      0.8%
1.7    4      1.5%
1.81   1      0.4%

Maximum 10 values
Value   Count  Frequency (%)
315.64  2      0.8%
305.64  2      0.8%
297.14  2      0.8%
288.64  2      0.8%
278.84  2      0.8%
231.94  2      0.8%
214.08  2      0.8%
205.17  3      1.2%
198.67  3      1.2%
193.14  3      1.2%

분류번호 (classification number)
Categorical

Distinct: 42
Distinct (%): 16.2%
Missing: 0
Missing (%): 0.0%
Memory size: 2.2 KiB

Value              Count
약2                 12
약4가(1)(저)        7
약3                 7
약4가(2)            7
약4가(1)(처)        7
Other values (37)  220

Length

Max length: 9
Median length: 9
Mean length: 7.8730769
Min length: 2

Unique

Unique: 5
Unique (%): 1.9%

Sample

1st row: 약1
2nd row: 약1
3rd row: 약2
4th row: 약2
5th row: 약2

Common Values

Value              Count  Frequency (%)
약2                 12     4.6%
약4가(1)(저)        7      2.7%
약3                 7      2.7%
약4가(2)            7      2.7%
약4가(1)(처)        7      2.7%
약4가(1)주2         7      2.7%
약4가(1)(가)        7      2.7%
약4가(1)(나)        7      2.7%
약4가(1)(라)        7      2.7%
약4가(1)(카)        7      2.7%
Other values (32)  185    71.2%

Length

Histogram of lengths of the category

Value              Count  Frequency (%)
약2                 12     4.6%
약4가(1)(커         7      2.7%
약4나(2)주2         7      2.7%
약4가(1)(저         7      2.7%
약4가(1)(거         7      2.7%
약4가(2)주2         7      2.7%
약4가(3             7      2.7%
약4가(1)(하         7      2.7%
약4나(1)주3         7      2.7%
약4나(1             7      2.7%
Other values (32)  185    71.2%

Interactions

[Figures: four pairwise interaction plots of the numeric variables 단가 and 상대가치점수]

Correlations

[Figure: correlation heatmap]

Variable      단가    상대가치점수  분류번호
단가           1.000  1.000        0.815
상대가치점수    1.000  1.000        0.815
분류번호        0.815  0.815        1.000

[Figure: correlation heatmap]

Variable      단가    상대가치점수  분류번호
단가           1.000  1.000        0.414
상대가치점수    1.000  1.000        0.414
분류번호        0.414  0.414        1.000
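
The correlation of 1.000 between 단가 and 상대가치점수 suggests that one column is a scaled version of the other. The sketch below checks this on the (score, price) pairs from the report's sample rows. The conversion factor of 97.6 KRW per point and the rounding to the nearest 10 KRW are assumptions inferred from those rows, not values stated in the report; `predicted_price` is a hypothetical helper.

```python
# (상대가치점수, 단가) pairs copied from the report's sample rows.
sample = [
    (7.45, 730), (16.26, 1590), (21.14, 2060), (4.88, 480),
    (22.93, 2240), (27.81, 2710), (315.64, 30810), (47.35, 4620),
    (205.17, 20020), (6.42, 630), (9.04, 880), (27.21, 2660),
    (31.95, 3120), (63.91, 6240),
]

# Assumption (not stated in the report): KRW per relative value point.
CONVERSION_FACTOR = 97.6


def predicted_price(score: float) -> int:
    """Scale the score by the assumed factor and round to the nearest 10 KRW."""
    return int(round(score * CONVERSION_FACTOR / 10)) * 10


# Rows where the assumed rule does not reproduce the listed price.
mismatches = [(s, p) for s, p in sample if predicted_price(s) != p]

# The raw price/score ratio varies slightly because prices are rounded.
ratios = [p / s for s, p in sample]
print(f"ratio range: {min(ratios):.2f} to {max(ratios):.2f}")
print(f"mismatches: {mismatches}")
```

If the assumed rule holds for the whole dataset, the two columns carry the same information, and one of them could be dropped before modelling.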

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index  수가코드  한글명  적용일자  단가  상대가치점수  분류번호
0  Z1000  약국관리료(방문당)  2023-01-01  730  7.45  약1
1  Z1000001  약국관리료(방문당)-차등수가제외  2023-01-01  730  7.45  약1
2  Z2000  조제기본료(방문당)  2023-01-01  1590  16.26  약2
3  Z2000010  조제기본료(방문당)-야간  2023-01-01  2060  21.14  약2
4  Z2000011  조제기본료(방문당)-야간,차등수가제외  2023-01-01  2060  21.14  약2
5  Z2000030  조제기본료(방문당)-토요09-13  2023-01-01  480  4.88  약2
6  Z2000050  조제기본료(방문당)-공휴  2023-01-01  2060  21.14  약2
7  Z2000600  조제기본료(방문당)-6세미만  2023-01-01  2240  22.93  약2
8  Z2000610  조제기본료(방문당)-6세미만,야간  2023-01-01  2710  27.81  약2
9  Z2000611  조제기본료(방문당)-6세미만,야간,차등수가제외  2023-01-01  2710  27.81  약2

Last rows

Index  수가코드  한글명  적용일자  단가  상대가치점수  분류번호
250  Z4391020  처방조제-내복약 91일분 이상-심야  2023-01-01  30810  315.64  약4가(1)(커)
251  Z4391021  처방조제-내복약 91일분 이상-심야,차등수가제외  2023-01-01  30810  315.64  약4가(1)(커)
252  Z4391030  처방조제-내복약 91일분 이상-토요09-13  2023-01-01  4620  47.35  약4가(1)(커)
253  Z4391050  처방조제-내복약 91일분 이상-공휴  2023-01-01  20020  205.17  약4가(1)(커)
254  Z5000  의약품관리료(방문당)  2023-01-01  630  6.42  약5
255  Z5001  의약품관리료(방문당)-마약류 포함하여 조제투약하는 경우  2023-01-01  880  9.04  약5주
256  Z7001  야간조제관리료  2023-01-01  2660  27.21  약7
257  ZH001  코로나19 투약·안전관리료  2023-01-01  3120  31.95  코로나19
258  ZH003  코로나19 대면투약관리료Ⅰ  2023-01-01  3120  31.95  코로나19
259  ZH004  코로나19 대면투약관리료Ⅱ  2023-01-01  6240  63.91  코로나19