Overview

Dataset statistics

Number of variables: 8
Number of observations: 132
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 8.6 KiB
Average record size in memory: 67.0 B

Variable types

Categorical: 3
Text: 2
Numeric: 2
DateTime: 1

Dataset

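Description: Non-covered (non-reimbursable) fee schedule and drug list data from Naju National Hospital (국립나주병원), Ministry of Health and Welfare (보건복지부). It covers fees for medical certificates (diagnosis certificates, opinion letters, and confirmation letters), psychological tests, EEG tests, dental procedures, and medications.
Author: 보건복지부 국립나주병원 (Ministry of Health and Welfare, Naju National Hospital)
URL: https://www.data.go.kr/data/15042750/fileData.do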

Alerts

종료일자 has constant value "9999-12-31" (Constant)
기본수량 is highly overall correlated with 비용 (High correlation)
비용 is highly overall correlated with 기본수량 (High correlation)
구분 is highly overall correlated with 단위 (High correlation)
단위 is highly overall correlated with 구분 (High correlation)
수가코드 has unique values (Unique)
명칭 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 06:05:08.657129
Analysis finished: 2023-12-12 06:05:09.734015
Duration: 1.08 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
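
The reported duration can be cross-checked against the two timestamps above. The following is a minimal Python sketch using only the standard library; the variable names are illustrative and not part of the report.

```python
from datetime import datetime

# Timestamps copied from the Reproduction section of the report.
started = datetime.fromisoformat("2023-12-12 06:05:08.657129")
finished = datetime.fromisoformat("2023-12-12 06:05:09.734015")

# Elapsed wall-clock time in seconds.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} s")
```

Rounded to two decimals, the difference agrees with the duration the report states.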

Variables

구분
Categorical

HIGH CORRELATION 

Distinct: 12
Distinct (%): 9.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB

Value | Count
약제 | 50
제증명료 진단서 | 15
치과처치 | 12
심리검사 | 12
기타검사항목 | 11
Other values (7) | 32

Length

Max length: 8
Median length: 7
Mean length: 4.1515152
Min length: 2

Unique

Unique: 1
Unique (%): 0.8%

Sample

1st row: 제증명료 진단서
2nd row: 제증명료 진단서
3rd row: 제증명료 진단서
4th row: 제증명료 진단서
5th row: 제증명료 진단서

Common Values

Value | Count | Frequency (%)
약제 | 50 | 37.9%
제증명료 진단서 | 15 | 11.4%
치과처치 | 12 | 9.1%
심리검사 | 12 | 9.1%
기타검사항목 | 11 | 8.3%
정신요법 | 8 | 6.1%
주사료 | 6 | 4.5%
제증명료 소견서 | 5 | 3.8%
뇌파검사 | 5 | 3.8%
제증명료 확인서 | 4 | 3.0%
Other values (2) | 4 | 3.0%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
약제 | 50 | 31.4%
제증명료 | 27 | 17.0%
진단서 | 15 | 9.4%
치과처치 | 12 | 7.5%
심리검사 | 12 | 7.5%
기타검사항목 | 11 | 6.9%
정신요법 | 8 | 5.0%
주사료 | 6 | 3.8%
소견서 | 5 | 3.1%
뇌파검사 | 5 | 3.1%
Other values (3) | 8 | 5.0%

수가코드
Text

UNIQUE 

Distinct: 132
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB

Length

Max length: 12
Median length: 9
Mean length: 6.1515152
Min length: 3

Characters and Unicode

Total characters: 812
Distinct characters: 39
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
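
As a rough illustration of how such category counts arise, the sketch below tallies Unicode general categories with Python's standard `unicodedata` module. It uses only the five sample 수가코드 values shown in this section (AQL001, AQL001-1, AQL001C, AQL002, AQL002-1), so its counts are not the report's totals. The helper name `category_counts` is hypothetical.

```python
import unicodedata
from collections import Counter

def category_counts(values):
    """Count Unicode general categories over all characters in values."""
    return Counter(unicodedata.category(ch) for value in values for ch in value)

# The five sample codes shown in the report, not the full column.
sample_codes = ["AQL001", "AQL001-1", "AQL001C", "AQL002", "AQL002-1"]
counts = category_counts(sample_codes)

# Lu = Uppercase Letter, Nd = Decimal Number, Pd = Dash Punctuation
for code, n in counts.most_common():
    print(code, n)
```

`unicodedata.category` returns two-letter codes; the report spells the same categories out in full (for example `Lu` as "Uppercase Letter").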

Unique

Unique: 132
Unique (%): 100.0%

Sample

1st row: AQL001
2nd row: AQL001-1
3rd row: AQL001C
4th row: AQL002
5th row: AQL002-1

Value | Count | Frequency (%)
aql001 | 1 | 0.8%
gnz006307 | 1 | 0.8%
lc00241 | 1 | 0.8%
lc00240 | 1 | 0.8%
lc00237 | 1 | 0.8%
lc00234 | 1 | 0.8%
kcom001-1 | 1 | 0.8%
jpost10 | 1 | 0.8%
jpfm300 | 1 | 0.8%
jgc400 | 1 | 0.8%
Other values (122) | 122 | 92.4%

Most occurring characters

Value | Count | Frequency (%)
0 | 113 | 13.9%
D | 68 | 8.4%
A | 60 | 7.4%
L | 50 | 6.2%
N | 37 | 4.6%
1 | 33 | 4.1%
S | 33 | 4.1%
F | 31 | 3.8%
C | 30 | 3.7%
Q | 29 | 3.6%
Other values (29) | 328 | 40.4%

Most occurring categories

Value | Count | Frequency (%)
Uppercase Letter | 549 | 67.6%
Decimal Number | 253 | 31.2%
Dash Punctuation | 4 | 0.5%
Open Punctuation | 3 | 0.4%
Close Punctuation | 3 | 0.4%

Most frequent character per category

Uppercase Letter
Value | Count | Frequency (%)
D | 68 | 12.4%
A | 60 | 10.9%
L | 50 | 9.1%
N | 37 | 6.7%
S | 33 | 6.0%
F | 31 | 5.6%
C | 30 | 5.5%
Q | 29 | 5.3%
I | 24 | 4.4%
O | 22 | 4.0%
Other values (16) | 165 | 30.1%

Decimal Number
Value | Count | Frequency (%)
0 | 113 | 44.7%
1 | 33 | 13.0%
2 | 22 | 8.7%
9 | 18 | 7.1%
6 | 17 | 6.7%
3 | 17 | 6.7%
4 | 12 | 4.7%
5 | 8 | 3.2%
8 | 8 | 3.2%
7 | 5 | 2.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 4 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 3 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 3 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 549 | 67.6%
Common | 263 | 32.4%

Most frequent character per script

Latin
Value | Count | Frequency (%)
D | 68 | 12.4%
A | 60 | 10.9%
L | 50 | 9.1%
N | 37 | 6.7%
S | 33 | 6.0%
F | 31 | 5.6%
C | 30 | 5.5%
Q | 29 | 5.3%
I | 24 | 4.4%
O | 22 | 4.0%
Other values (16) | 165 | 30.1%

Common
Value | Count | Frequency (%)
0 | 113 | 43.0%
1 | 33 | 12.5%
2 | 22 | 8.4%
9 | 18 | 6.8%
6 | 17 | 6.5%
3 | 17 | 6.5%
4 | 12 | 4.6%
5 | 8 | 3.0%
8 | 8 | 3.0%
7 | 5 | 1.9%
Other values (3) | 10 | 3.8%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 812 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 113 | 13.9%
D | 68 | 8.4%
A | 60 | 7.4%
L | 50 | 6.2%
N | 37 | 4.6%
1 | 33 | 4.1%
S | 33 | 4.1%
F | 31 | 3.8%
C | 30 | 3.7%
Q | 29 | 3.6%
Other values (29) | 328 | 40.4%

명칭
Text

UNIQUE 

Distinct: 132
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB

Length

Max length: 33
Median length: 26.5
Mean length: 14.287879
Min length: 4

Characters and Unicode

Total characters: 1886
Distinct characters: 329
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 132
Unique (%): 100.0%

Sample

1st row: 일반진단서
2nd row: 일반진단서(영문)
3rd row: 일반진단서(사본,추가1통당)
4th row: 상해진단서(3주미만)
5th row: 병무용진단서

Value | Count | Frequency (%)
인지행동치료 | 5 | 2.4%
대한 | 5 | 2.4%
집단에 | 5 | 2.4%
입원 | 3 | 1.4%
심리적 | 3 | 1.4%
경두개 | 3 | 1.4%
자기 | 3 | 1.4%
의무기록 | 2 | 0.9%
집단상담10회기 | 2 | 0.9%
[value not preserved] | 2 | 0.9%
Other values (171) | 178 | 84.4%

Most occurring characters

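Value | Count | Frequency (%)
( | 102 | 5.4%
) | 102 | 5.4%
(space) | 82 | 4.3%
1 | 45 | 2.4%
[Hangul character not preserved] | 42 | 2.2%
0 | 38 | 2.0%
[Hangul character not preserved] | 35 | 1.9%
[Hangul character not preserved] | 34 | 1.8%
m | 32 | 1.7%
g | 31 | 1.6%
Other values (319) | 1343 | 71.2%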

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1295 | 68.7%
Decimal Number | 122 | 6.5%
Open Punctuation | 106 | 5.6%
Close Punctuation | 106 | 5.6%
Lowercase Letter | 101 | 5.4%
Space Separator | 82 | 4.3%
Uppercase Letter | 40 | 2.1%
Other Punctuation | 28 | 1.5%
Math Symbol | 4 | 0.2%
Dash Punctuation | 2 | 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[not preserved] | 42 | 3.2%
[not preserved] | 35 | 2.7%
[not preserved] | 34 | 2.6%
[not preserved] | 29 | 2.2%
[not preserved] | 28 | 2.2%
[not preserved] | 24 | 1.9%
[not preserved] | 20 | 1.5%
[not preserved] | 20 | 1.5%
[not preserved] | 18 | 1.4%
[not preserved] | 17 | 1.3%
Other values (265) | 1028 | 79.4%

Uppercase Letter
Value | Count | Frequency (%)
T | 5 | 12.5%
S | 5 | 12.5%
M | 4 | 10.0%
C | 4 | 10.0%
I | 3 | 7.5%
H | 2 | 5.0%
R | 2 | 5.0%
V | 2 | 5.0%
P | 2 | 5.0%
L | 2 | 5.0%
Other values (8) | 9 | 22.5%

Lowercase Letter
Value | Count | Frequency (%)
m | 32 | 31.7%
g | 31 | 30.7%
l | 7 | 6.9%
t | 5 | 5.0%
e | 5 | 5.0%
a | 3 | 3.0%
p | 3 | 3.0%
o | 3 | 3.0%
b | 3 | 3.0%
i | 2 | 2.0%
Other values (5) | 7 | 6.9%

Decimal Number
Value | Count | Frequency (%)
1 | 45 | 36.9%
0 | 38 | 31.1%
5 | 18 | 14.8%
3 | 8 | 6.6%
2 | 6 | 4.9%
8 | 2 | 1.6%
4 | 2 | 1.6%
6 | 2 | 1.6%
9 | 1 | 0.8%

Other Punctuation
Value | Count | Frequency (%)
/ | 11 | 39.3%
, | 6 | 21.4%
: | 6 | 21.4%
. | 3 | 10.7%
% | 2 | 7.1%

Open Punctuation
Value | Count | Frequency (%)
( | 102 | 96.2%
[ | 4 | 3.8%

Close Punctuation
Value | Count | Frequency (%)
) | 102 | 96.2%
] | 4 | 3.8%

Space Separator
Value | Count | Frequency (%)
(space) | 82 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 4 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 2 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1295 | 68.7%
Common | 450 | 23.9%
Latin | 141 | 7.5%

Most frequent character per script

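Hangul
Value | Count | Frequency (%)
[not preserved] | 42 | 3.2%
[not preserved] | 35 | 2.7%
[not preserved] | 34 | 2.6%
[not preserved] | 29 | 2.2%
[not preserved] | 28 | 2.2%
[not preserved] | 24 | 1.9%
[not preserved] | 20 | 1.5%
[not preserved] | 20 | 1.5%
[not preserved] | 18 | 1.4%
[not preserved] | 17 | 1.3%
Other values (265) | 1028 | 79.4%

Latin
Value | Count | Frequency (%)
m | 32 | 22.7%
g | 31 | 22.0%
l | 7 | 5.0%
T | 5 | 3.5%
t | 5 | 3.5%
S | 5 | 3.5%
e | 5 | 3.5%
M | 4 | 2.8%
C | 4 | 2.8%
a | 3 | 2.1%
Other values (23) | 40 | 28.4%

Common
Value | Count | Frequency (%)
( | 102 | 22.7%
) | 102 | 22.7%
(space) | 82 | 18.2%
1 | 45 | 10.0%
0 | 38 | 8.4%
5 | 18 | 4.0%
/ | 11 | 2.4%
3 | 8 | 1.8%
, | 6 | 1.3%
: | 6 | 1.3%
Other values (11) | 32 | 7.1%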

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1295 | 68.7%
ASCII | 591 | 31.3%

Most frequent character per block

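ASCII
Value | Count | Frequency (%)
( | 102 | 17.3%
) | 102 | 17.3%
(space) | 82 | 13.9%
1 | 45 | 7.6%
0 | 38 | 6.4%
m | 32 | 5.4%
g | 31 | 5.2%
5 | 18 | 3.0%
/ | 11 | 1.9%
3 | 8 | 1.4%
Other values (44) | 122 | 20.6%

Hangul
Value | Count | Frequency (%)
[not preserved] | 42 | 3.2%
[not preserved] | 35 | 2.7%
[not preserved] | 34 | 2.6%
[not preserved] | 29 | 2.2%
[not preserved] | 28 | 2.2%
[not preserved] | 24 | 1.9%
[not preserved] | 20 | 1.5%
[not preserved] | 20 | 1.5%
[not preserved] | 18 | 1.4%
[not preserved] | 17 | 1.3%
Other values (265) | 1028 | 79.4%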

기본수량
Real number (ℝ)

HIGH CORRELATION 

Distinct: 22
Distinct (%): 16.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 35.39697
Minimum: 1
Maximum: 1000
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.3 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 1
Median: 1
Q3: 10
95-th percentile: 201.5
Maximum: 1000
Range: 999
Interquartile range (IQR): 9

Descriptive statistics

Standard deviation: 112.15514
Coefficient of variation (CV): 3.1684956
Kurtosis: 44.351471
Mean: 35.39697
Median Absolute Deviation (MAD): 0
Skewness: 5.9723412
Sum: 4672.4
Variance: 12578.776
Monotonicity: Not monotonic
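
Several of these figures are derived from one another: the coefficient of variation is the standard deviation divided by the mean, the variance is the squared standard deviation, and the sum is the mean times the number of observations. The sketch below recomputes them from the rounded values printed in the report, so small rounding differences are expected. The variable names are illustrative.

```python
# Values copied from the 기본수량 statistics in the report.
n = 132
mean = 35.39697
std = 112.15514

cv = std / mean        # coefficient of variation
variance = std ** 2    # variance is the squared standard deviation
total = mean * n       # sum of all values

print(round(cv, 4), round(variance, 2), round(total, 1))
```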
Histogram with fixed size bins (bins=22)

Value | Count | Frequency (%)
1.0 | 81 | 61.4%
10.0 | 9 | 6.8%
100.0 | 8 | 6.1%
5.0 | 7 | 5.3%
20.0 | 4 | 3.0%
250.0 | 3 | 2.3%
3.0 | 3 | 2.3%
80.0 | 2 | 1.5%
2.5 | 2 | 1.5%
150.0 | 1 | 0.8%
Other values (12) | 12 | 9.1%

Minimum 10 values
Value | Count | Frequency (%)
1.0 | 81 | 61.4%
1.4 | 1 | 0.8%
2.5 | 2 | 1.5%
3.0 | 3 | 2.3%
5.0 | 7 | 5.3%
6.0 | 1 | 0.8%
10.0 | 9 | 6.8%
15.0 | 1 | 0.8%
20.0 | 4 | 3.0%
25.0 | 1 | 0.8%

Maximum 10 values
Value | Count | Frequency (%)
1000.0 | 1 | 0.8%
500.0 | 1 | 0.8%
400.0 | 1 | 0.8%
250.0 | 3 | 2.3%
240.0 | 1 | 0.8%
170.0 | 1 | 0.8%
150.0 | 1 | 0.8%
100.0 | 8 | 6.1%
90.0 | 1 | 0.8%
80.0 | 2 | 1.5%

단위
Categorical

HIGH CORRELATION 

Distinct: 10
Distinct (%): 7.6%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB

Value | Count
[value not preserved] | 48
mg | 42
[value not preserved] | 26
ml | 5
g | 5
Other values (5) | 6

Length

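Max length: 4
Median length: 1
Mean length: 1.4090909
Min length: 1

Unique

Unique: 4
Unique (%): 3.0%

Sample

1st row: [value not preserved]
2nd row: [value not preserved]
3rd row: [value not preserved]
4th row: [value not preserved]
5th row: [value not preserved]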

Common Values

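Value | Count | Frequency (%)
[value not preserved] | 48 | 36.4%
mg | 42 | 31.8%
[value not preserved] | 26 | 19.7%
ml | 5 | 3.8%
g | 5 | 3.8%
T | 2 | 1.5%
[value not preserved] | 1 | 0.8%
Btl | 1 | 0.8%
Tube | 1 | 0.8%
Amp | 1 | 0.8%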

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
[value not preserved] | 48 | 36.4%
mg | 42 | 31.8%
[value not preserved] | 26 | 19.7%
ml | 5 | 3.8%
g | 5 | 3.8%
t | 2 | 1.5%
[value not preserved] | 1 | 0.8%
btl | 1 | 0.8%
tube | 1 | 0.8%
amp | 1 | 0.8%

비용
Real number (ℝ)

HIGH CORRELATION 

Distinct: 81
Distinct (%): 61.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 60334.795
Minimum: 1
Maximum: 1505140
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.3 KiB

Quantile statistics

Minimum: 1
5-th percentile: 17.2
Q1: 226
Median: 5000
Q3: 20000
95-th percentile: 288477.5
Maximum: 1505140
Range: 1505139
Interquartile range (IQR): 19774

Descriptive statistics

Standard deviation: 217428.98
Coefficient of variation (CV): 3.603708
Kurtosis: 31.208697
Mean: 60334.795
Median Absolute Deviation (MAD): 4971.5
Skewness: 5.4604858
Sum: 7964193
Variance: 4.7275363 × 10^10
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
20000 | 13 | 9.8%
50000 | 8 | 6.1%
1000 | 6 | 4.5%
10000 | 5 | 3.8%
15000 | 5 | 3.8%
5000 | 5 | 3.8%
6400 | 5 | 3.8%
30000 | 4 | 3.0%
41 | 3 | 2.3%
400000 | 2 | 1.5%
Other values (71) | 76 | 57.6%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 0.8%
6 | 1 | 0.8%
8 | 1 | 0.8%
12 | 1 | 0.8%
14 | 2 | 1.5%
15 | 1 | 0.8%
19 | 1 | 0.8%
20 | 1 | 0.8%
27 | 1 | 0.8%
30 | 1 | 0.8%

Maximum 10 values
Value | Count | Frequency (%)
1505140 | 1 | 0.8%
1434580 | 1 | 0.8%
1175080 | 1 | 0.8%
600000 | 1 | 0.8%
400000 | 2 | 1.5%
300000 | 1 | 0.8%
279050 | 1 | 0.8%
200000 | 1 | 0.8%
120000 | 1 | 0.8%
100000 | 2 | 1.5%

적용일자
DateTime

Distinct: 69
Distinct (%): 52.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB
Minimum: 2005-12-22 00:00:00
Maximum: 2023-02-01 00:00:00
Histogram with fixed size bins (bins=50)

종료일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB

Value | Count
9999-12-31 | 132

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 9999-12-31
2nd row: 9999-12-31
3rd row: 9999-12-31
4th row: 9999-12-31
5th row: 9999-12-31

Common Values

Value | Count | Frequency (%)
9999-12-31 | 132 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
9999-12-31 | 132 | 100.0%
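
The constant `9999-12-31` reads like a "no end date" sentinel, although the dataset description does not say so; that interpretation is an assumption. The value is also the largest date Python's standard `datetime.date` can represent, which the minimal sketch below uses to flag open-ended rows. The names `OPEN_ENDED` and `is_open_ended` are hypothetical.

```python
from datetime import date

# Assumption: 9999-12-31 marks a fee that has no scheduled end date.
OPEN_ENDED = date(9999, 12, 31)

def is_open_ended(end_date_text):
    """Return True if an ISO date string equals the sentinel end date."""
    return date.fromisoformat(end_date_text) == OPEN_ENDED

print(is_open_ended("9999-12-31"))
print(OPEN_ENDED == date.max)
```

Because every row carries the same value, a check like this would mark all 132 records as open-ended.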

Interactions

[Figures: four pairwise interaction plots, not reproduced in this text version]

Correlations

Variable | 구분 | 기본수량 | 단위 | 비용 | 적용일자
구분 | 1.000 | 0.000 | 0.845 | 0.370 | 0.946
기본수량 | 0.000 | 1.000 | 0.000 | 0.000 | 0.434
단위 | 0.845 | 0.000 | 1.000 | 0.000 | 0.914
비용 | 0.370 | 0.000 | 0.000 | 1.000 | 0.000
적용일자 | 0.946 | 0.434 | 0.914 | 0.000 | 1.000

Variable | 구분 | 단위
구분 | 1.000 | 0.563
단위 | 0.563 | 1.000

Variable | 기본수량 | 비용 | 구분 | 단위
기본수량 | 1.000 | -0.713 | 0.000 | 0.000
비용 | -0.713 | 1.000 | 0.147 | 0.000
구분 | 0.000 | 0.147 | 1.000 | 0.563
단위 | 0.000 | 0.000 | 0.563 | 1.000
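
The -0.713 between 기본수량 and 비용 in the last matrix matches the high-correlation alert in the Overview. If that coefficient is a Spearman rank correlation (an assumption, since this text does not name the method behind each matrix), it can be computed as the Pearson correlation of the ranks. The sketch below uses only the six 주사료 rows visible in the Sample section, so its result is not expected to reproduce the report's figure. All function names are illustrative.

```python
from math import sqrt

def average_ranks(values):
    """Rank values from 1..n, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared = (start + end) / 2 + 1
        for k in range(start, end + 1):
            ranks[order[k]] = shared
        start = end + 1
    return ranks

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (spread_x * spread_y)

def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))

# 기본수량 and 비용 for sample rows 125 to 130 (주사료 only).
quantity = [100.0, 1000.0, 80.0, 20.0, 1.0, 10.0]
cost = [680, 1590, 290, 248, 228, 4543]
rho = spearman(quantity, cost)
print(round(rho, 3))
```

Six rows from a single category say little about the full 132-row relationship; the example only shows the mechanics of the rank-based calculation.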

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Row | 구분 | 수가코드 | 명칭 | 기본수량 | 단위 | 비용 | 적용일자 | 종료일자
0 | 제증명료 진단서 | AQL001 | 일반진단서 | 1.0 |  | 10000 | 2006-03-08 | 9999-12-31
1 | 제증명료 진단서 | AQL001-1 | 일반진단서(영문) | 1.0 |  | 10000 | 2021-05-04 | 9999-12-31
2 | 제증명료 진단서 | AQL001C | 일반진단서(사본,추가1통당) | 1.0 |  | 1000 | 2006-03-14 | 9999-12-31
3 | 제증명료 진단서 | AQL002 | 상해진단서(3주미만) | 1.0 |  | 50000 | 2006-04-01 | 9999-12-31
4 | 제증명료 진단서 | AQL002-1 | 병무용진단서 | 1.0 |  | 20000 | 2017-01-01 | 9999-12-31
5 | 제증명료 진단서 | AQL003 | 사망진단서 | 1.0 |  | 10000 | 2006-03-08 | 9999-12-31
6 | 제증명료 진단서 | AQL004C | 상해진단서(3주미만/사본추가1통당) | 1.0 |  | 1000 | 2006-03-15 | 9999-12-31
7 | 제증명료 진단서 | AQL006 | 장애진단서(정신장애) | 1.0 |  | 15000 | 2006-03-15 | 9999-12-31
8 | 제증명료 소견서 | AQL006A | 간질장애소견서 | 1.0 |  | 5000 | 2011-08-11 | 9999-12-31
9 | 제증명료 진단서 | AQL007 | 장애진단서(정신지체, 발달장애) | 1.0 |  | 40000 | 2006-03-15 | 9999-12-31

Row | 구분 | 수가코드 | 명칭 | 기본수량 | 단위 | 비용 | 적용일자 | 종료일자
122 | 뇌파검사 | TMS | 경두개 자기 자극술(TMS) | 1.0 |  | 20000 | 2016-01-07 | 9999-12-31
123 | 뇌파검사 | TMS10 | 경두개 자기 자극술(TMS)10회 | 1.0 |  | 200000 | 2016-09-29 | 9999-12-31
124 | 뇌파검사 | TMS5 | 경두개 자기 자극술(TMS)5회 | 1.0 |  | 100000 | 2016-09-29 | 9999-12-31
125 | 주사료 | WDPH | 페니톤주100mg(한림) | 100.0 | mg | 680 | 2018-10-06 | 9999-12-31
126 | 주사료 | WDW10 | 10%포도당주사액1L(대한) | 1000.0 | ml | 1590 | 2019-11-07 | 9999-12-31
127 | 주사료 | WGM | 국제겐타마이신주80mg | 80.0 | mg | 290 | 2018-03-01 | 9999-12-31
128 | 주사료 | WKCL | 염화칼륨주사액(대한) | 20.0 | ml | 248 | 2018-03-01 | 9999-12-31
129 | 주사료 | WNA | 대한염화나트륨주2.34g | 1.0 | g | 228 | 2018-03-01 | 9999-12-31
130 | 주사료 | WOLAN | 자이프렉사 주사 10mg( 한국릴리) | 10.0 | mg | 4543 | 2017-10-01 | 9999-12-31
131 | 마취료 | XQ014 | 리도카인주(휴온스)1.8ml | 1.0 | Amp | 240 | 2018-03-01 | 9999-12-31