Overview

Dataset statistics

Number of variables: 7
Number of observations: 100
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 5.9 KiB
Average record size in memory: 60.3 B

Variable types

Categorical: 3
Numeric: 2
Text: 2

Alerts

적용시작일자 has constant value "20200101" (Constant)
급여구분명 has constant value "급여" (Constant)
상대가치점수 is highly overall correlated with 단가(원) (High correlation)
단가(원) is highly overall correlated with 상대가치점수 (High correlation)
한글명 has unique values (Unique)
수가코드 has unique values (Unique)

Reproduction

Analysis started: 2023-12-10 11:27:34.365560
Analysis finished: 2023-12-10 11:27:35.645895
Duration: 1.28 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
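
A report of this kind can be regenerated along the following lines. This is a minimal sketch, not the configuration that produced this report: the function name `build_report`, the CSV path, the output path, the report title, and the `dtype` choices are all assumptions made for illustration. Only `pandas.read_csv`, `ProfileReport(df, title=...)`, and `ProfileReport.to_file` are library calls.

```python
def build_report(csv_path, html_path):
    """Profile a CSV file and write an HTML report (hypothetical helper)."""
    # Imports are deferred so this module can be loaded even where
    # pandas or ydata-profiling is not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: reading the date and code columns as strings keeps
    # values such as "20200101" and "Z1000" from being parsed as numbers.
    df = pd.read_csv(csv_path, dtype={"적용시작일자": str, "수가코드": str})

    report = ProfileReport(df, title="Profiling report")
    report.to_file(html_path)  # writes a standalone HTML file
    return report
```

A call such as `build_report("fees.csv", "report.html")` would then write the report, where both file names are placeholders.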

Variables

적용시작일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 8
Median length: 8
Mean length: 8
Min length: 8

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 20200101
2nd row: 20200101
3rd row: 20200101
4th row: 20200101
5th row: 20200101

Common Values

Value | Count | Frequency (%)
20200101 | 100 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

상대가치점수
Real number (ℝ)

HIGH CORRELATION 

Distinct: 52
Distinct (%): 52.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 36.7858
Minimum: 3.28
Maximum: 98.84
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 3.28
5-th percentile: 5.787
Q1: 18.8425
median: 33.745
Q3: 53.26
95-th percentile: 86.18
Maximum: 98.84
Range: 95.56
Interquartile range (IQR): 34.4175
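
The last two rows of the quantile table follow from the others: the range is maximum minus minimum, and the IQR is Q3 minus Q1. The sketch below recomputes both from the figures reported for 상대가치점수; the helper name `spread` is an assumption introduced only for this illustration.

```python
def spread(minimum, q1, q3, maximum):
    """Return (range, interquartile range) from four quantile values."""
    return maximum - minimum, q3 - q1

# Values copied from the quantile table for 상대가치점수.
value_range, iqr = spread(minimum=3.28, q1=18.8425, q3=53.26, maximum=98.84)

# Rounded to the precision shown in the report.
print(round(value_range, 2), round(iqr, 4))
```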

Descriptive statistics

Standard deviation: 23.908378
Coefficient of variation (CV): 0.64993497
Kurtosis: -0.098723114
Mean: 36.7858
Median Absolute Deviation (MAD): 17.155
Skewness: 0.7256715
Sum: 3678.58
Variance: 571.61054
Monotonicity: Not monotonic
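
Several of these descriptive statistics are linked by simple identities: the mean is the sum divided by the number of observations, the coefficient of variation is the standard deviation divided by the mean, and the variance is the square of the standard deviation. The sketch below checks those identities against the reported figures. It uses only the rounded values printed in the report, so agreement holds up to rounding.

```python
# Figures copied from the report for 상대가치점수.
n_observations = 100
total = 3678.58
std_dev = 23.908378

mean = total / n_observations   # reported as 36.7858
cv = std_dev / mean             # reported as 0.64993497
variance = std_dev ** 2         # reported as 571.61054

print(f"mean={mean:.4f} cv={cv:.6f} variance={variance:.3f}")
```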
[Figure: Histogram with fixed size bins (bins=50)]

Common Values

Value | Count | Frequency (%)
22.36 | 6 | 6.0%
34.4 | 4 | 4.0%
33.09 | 3 | 3.0%
46.9 | 3 | 3.0%
21.14 | 3 | 3.0%
37.35 | 3 | 3.0%
64.25 | 3 | 3.0%
27.81 | 3 | 3.0%
42.65 | 3 | 3.0%
59.4 | 3 | 3.0%
Other values (42) | 66 | 66.0%

Minimum 10 values

Value | Count | Frequency (%)
3.28 | 1 | 1.0%
4.88 | 2 | 2.0%
5.16 | 2 | 2.0%
5.82 | 1 | 1.0%
6.67 | 2 | 2.0%
7.45 | 2 | 2.0%
7.64 | 1 | 1.0%
8.62 | 1 | 1.0%
9.84 | 1 | 1.0%
10.82 | 1 | 1.0%

Maximum 10 values

Value | Count | Frequency (%)
98.84 | 2 | 2.0%
91.38 | 2 | 2.0%
86.18 | 2 | 2.0%
81.94 | 2 | 2.0%
72.16 | 2 | 2.0%
65.62 | 2 | 2.0%
64.25 | 3 | 3.0%
59.4 | 3 | 3.0%
57.46 | 2 | 2.0%
56.02 | 3 | 3.0%

한글명
Text

UNIQUE 

Distinct: 100
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 30
Median length: 25
Mean length: 17.78
Min length: 10

Characters and Unicode

Total characters: 1778
Distinct characters: 60
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
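
As an illustration of those character properties, the sketch below counts Unicode general categories for one 한글명 value taken from the sample rows, using Python's standard `unicodedata` module. The helper name `category_counts` is an assumption made for this example, and the profiler's own implementation may differ. The two-letter codes map to the category names used in this report: `Lo` is Other Letter, `Nd` Decimal Number, `Pd` Dash Punctuation, `Po` Other Punctuation, `Ps` Open Punctuation, `Pe` Close Punctuation, and `Zs` Space Separator.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count the Unicode general category of each character in text."""
    return Counter(unicodedata.category(ch) for ch in text)

# One 한글명 value copied from the sample rows of this report.
sample = "조제기본료(방문당)-야간.차등수가제외"
counts = category_counts(sample)

for code, count in counts.most_common():
    print(code, count)
```

Applied to every value in the column, the same tally would yield the per-category totals listed under "Most occurring categories".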

Unique

Unique: 100
Unique (%): 100.0%

Sample

1st row: 약국관리료(방문당)
2nd row: 약국관리료(방문당)-차등수가제외
3rd row: 조제기본료(방문당)
4th row: 조제기본료(방문당)-야간
5th row: 조제기본료(방문당)-야간.차등수가제외

Words

Value | Count | Frequency (%)
처방조제-내복약 | 70 | 40.7%
조제투약 | 2 | 1.2%
5일분-심야.차등수가제외 | 1 | 0.6%
7일분-야간 | 1 | 0.6%
7일분 | 1 | 0.6%
6일분-공휴 | 1 | 0.6%
6일분-토요09-13 | 1 | 0.6%
6일분-심야.차등수가제외 | 1 | 0.6%
6일분-심야 | 1 | 0.6%
6일분-야간.차등수가제외 | 1 | 0.6%
Other values (92) | 92 | 53.5%

Most occurring characters

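Value | Count | Frequency (%)
- | 186 | 10.5%
(character missing) | 121 | 6.8%
(character missing) | 99 | 5.6%
(character missing) | 93 | 5.2%
(character missing) | 92 | 5.2%
(character missing) | 86 | 4.8%
(character missing) | 79 | 4.4%
(character missing) | 78 | 4.4%
(space) | 72 | 4.0%
(character missing) | 70 | 3.9%
Other values (50) | 802 | 45.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1284 | 72.2%
Dash Punctuation | 186 | 10.5%
Decimal Number | 140 | 7.9%
Space Separator | 72 | 4.0%
Other Punctuation | 40 | 2.2%
Open Punctuation | 28 | 1.6%
Close Punctuation | 28 | 1.6%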

Most frequent character per category

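Other Letter

Value | Count | Frequency (%)
(character missing) | 121 | 9.4%
(character missing) | 99 | 7.7%
(character missing) | 93 | 7.2%
(character missing) | 92 | 7.2%
(character missing) | 86 | 6.7%
(character missing) | 79 | 6.2%
(character missing) | 78 | 6.1%
(character missing) | 70 | 5.5%
(character missing) | 70 | 5.5%
(character missing) | 54 | 4.2%
Other values (35) | 442 | 34.4%

Decimal Number

Value | Count | Frequency (%)
1 | 28 | 20.0%
0 | 21 | 15.0%
9 | 21 | 15.0%
3 | 21 | 15.0%
6 | 14 | 10.0%
2 | 7 | 5.0%
4 | 7 | 5.0%
7 | 7 | 5.0%
5 | 7 | 5.0%
8 | 7 | 5.0%

Dash Punctuation

Value | Count | Frequency (%)
- | 186 | 100.0%

Space Separator

Value | Count | Frequency (%)
(space) | 72 | 100.0%

Other Punctuation

Value | Count | Frequency (%)
. | 40 | 100.0%

Open Punctuation

Value | Count | Frequency (%)
( | 28 | 100.0%

Close Punctuation

Value | Count | Frequency (%)
) | 28 | 100.0%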

Most occurring scripts

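Value | Count | Frequency (%)
Hangul | 1284 | 72.2%
Common | 494 | 27.8%

Most frequent character per script

Hangul

Value | Count | Frequency (%)
(character missing) | 121 | 9.4%
(character missing) | 99 | 7.7%
(character missing) | 93 | 7.2%
(character missing) | 92 | 7.2%
(character missing) | 86 | 6.7%
(character missing) | 79 | 6.2%
(character missing) | 78 | 6.1%
(character missing) | 70 | 5.5%
(character missing) | 70 | 5.5%
(character missing) | 54 | 4.2%
Other values (35) | 442 | 34.4%

Common

Value | Count | Frequency (%)
- | 186 | 37.7%
(space) | 72 | 14.6%
. | 40 | 8.1%
1 | 28 | 5.7%
( | 28 | 5.7%
) | 28 | 5.7%
0 | 21 | 4.3%
9 | 21 | 4.3%
3 | 21 | 4.3%
6 | 14 | 2.8%
Other values (5) | 35 | 7.1%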

Most occurring blocks

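Value | Count | Frequency (%)
Hangul | 1284 | 72.2%
ASCII | 494 | 27.8%

Most frequent character per block

ASCII

Value | Count | Frequency (%)
- | 186 | 37.7%
(space) | 72 | 14.6%
. | 40 | 8.1%
1 | 28 | 5.7%
( | 28 | 5.7%
) | 28 | 5.7%
0 | 21 | 4.3%
9 | 21 | 4.3%
3 | 21 | 4.3%
6 | 14 | 2.8%
Other values (5) | 35 | 7.1%

Hangul

Value | Count | Frequency (%)
(character missing) | 121 | 9.4%
(character missing) | 99 | 7.7%
(character missing) | 93 | 7.2%
(character missing) | 92 | 7.2%
(character missing) | 86 | 6.7%
(character missing) | 79 | 6.2%
(character missing) | 78 | 6.1%
(character missing) | 70 | 5.5%
(character missing) | 70 | 5.5%
(character missing) | 54 | 4.2%
Other values (35) | 442 | 34.4%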

수가코드
Text

UNIQUE 

Distinct: 100
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 8
Median length: 8
Mean length: 7.52
Min length: 5

Characters and Unicode

Total characters: 752
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 100
Unique (%): 100.0%

Sample

1st row: Z1000
2nd row: Z1000001
3rd row: Z2000
4th row: Z2000010
5th row: Z2000011

Words

Value | Count | Frequency (%)
z1000 | 1 | 1.0%
z4105021 | 1 | 1.0%
z4107010 | 1 | 1.0%
z4107 | 1 | 1.0%
z4106050 | 1 | 1.0%
z4106030 | 1 | 1.0%
z4106021 | 1 | 1.0%
z4106020 | 1 | 1.0%
z4106011 | 1 | 1.0%
z4106010 | 1 | 1.0%
Other values (90) | 90 | 90.0%

Most occurring characters

Value | Count | Frequency (%)
0 | 286 | 38.0%
1 | 150 | 19.9%
Z | 100 | 13.3%
4 | 86 | 11.4%
2 | 46 | 6.1%
3 | 28 | 3.7%
5 | 21 | 2.8%
6 | 14 | 1.9%
7 | 7 | 0.9%
8 | 7 | 0.9%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 652 | 86.7%
Uppercase Letter | 100 | 13.3%

Most frequent character per category

Decimal Number

Value | Count | Frequency (%)
0 | 286 | 43.9%
1 | 150 | 23.0%
4 | 86 | 13.2%
2 | 46 | 7.1%
3 | 28 | 4.3%
5 | 21 | 3.2%
6 | 14 | 2.1%
7 | 7 | 1.1%
8 | 7 | 1.1%
9 | 7 | 1.1%

Uppercase Letter

Value | Count | Frequency (%)
Z | 100 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 652 | 86.7%
Latin | 100 | 13.3%

Most frequent character per script

Common

Value | Count | Frequency (%)
0 | 286 | 43.9%
1 | 150 | 23.0%
4 | 86 | 13.2%
2 | 46 | 7.1%
3 | 28 | 4.3%
5 | 21 | 3.2%
6 | 14 | 2.1%
7 | 7 | 1.1%
8 | 7 | 1.1%
9 | 7 | 1.1%

Latin

Value | Count | Frequency (%)
Z | 100 | 100.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 752 | 100.0%

Most frequent character per block

ASCII

Value | Count | Frequency (%)
0 | 286 | 38.0%
1 | 150 | 19.9%
Z | 100 | 13.3%
4 | 86 | 11.4%
2 | 46 | 6.1%
3 | 28 | 3.7%
5 | 21 | 2.8%
6 | 14 | 1.9%
7 | 7 | 0.9%
8 | 7 | 0.9%

수가분류번호
Categorical

Distinct: 16
Distinct (%): 16.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 9
Median length: 9
Mean length: 7.44
Min length: 2

Unique

Unique: 2
Unique (%): 2.0%

Sample

1st row: 약1
2nd row: 약1
3rd row: 약2
4th row: 약2
5th row: 약2

Common Values

Value | Count | Frequency (%)
약2 | 12 | 12.0%
약3 | 7 | 7.0%
약4가(1)주2 | 7 | 7.0%
약4가(1)(가) | 7 | 7.0%
약4가(1)(나) | 7 | 7.0%
약4가(1)(다) | 7 | 7.0%
약4가(1)(라) | 7 | 7.0%
약4가(1)(마) | 7 | 7.0%
약4가(1)(바) | 7 | 7.0%
약4가(1)(사) | 7 | 7.0%
Other values (6) | 25 | 25.0%

Length

[Figure: Histogram of lengths of the category]

급여구분명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 급여
2nd row: 급여
3rd row: 급여
4th row: 급여
5th row: 급여

Common Values

Value | Count | Frequency (%)
급여 | 100 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

단가(원)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 52
Distinct (%): 52.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3237.8
Minimum: 290
Maximum: 8700
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 290
5-th percentile: 507
Q1: 1660
median: 2970
Q3: 4690
95-th percentile: 7580
Maximum: 8700
Range: 8410
Interquartile range (IQR): 3030

Descriptive statistics

Standard deviation: 2103.4914
Coefficient of variation (CV): 0.64966686
Kurtosis: -0.098620398
Mean: 3237.8
Median Absolute Deviation (MAD): 1510
Skewness: 0.72496716
Sum: 323780
Variance: 4424675.9
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=50)]

Common Values

Value | Count | Frequency (%)
1970 | 6 | 6.0%
3030 | 4 | 4.0%
2910 | 3 | 3.0%
4130 | 3 | 3.0%
1860 | 3 | 3.0%
3290 | 3 | 3.0%
5650 | 3 | 3.0%
2450 | 3 | 3.0%
3750 | 3 | 3.0%
5230 | 3 | 3.0%
Other values (42) | 66 | 66.0%

Minimum 10 values

Value | Count | Frequency (%)
290 | 1 | 1.0%
430 | 2 | 2.0%
450 | 2 | 2.0%
510 | 1 | 1.0%
590 | 2 | 2.0%
660 | 2 | 2.0%
670 | 1 | 1.0%
760 | 1 | 1.0%
870 | 1 | 1.0%
950 | 1 | 1.0%

Maximum 10 values

Value | Count | Frequency (%)
8700 | 2 | 2.0%
8040 | 2 | 2.0%
7580 | 2 | 2.0%
7210 | 2 | 2.0%
6350 | 2 | 2.0%
5770 | 2 | 2.0%
5650 | 3 | 3.0%
5230 | 3 | 3.0%
5060 | 2 | 2.0%
4930 | 3 | 3.0%

Interactions

[Figures: four pairwise interaction plots]

Correlations

[Figure: correlation heatmap]

 | 상대가치점수 | 한글명 | 수가코드 | 수가분류번호 | 단가(원)
상대가치점수 | 1.000 | 1.000 | 1.000 | 0.713 | 1.000
한글명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
수가코드 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
수가분류번호 | 0.713 | 1.000 | 1.000 | 1.000 | 0.716
단가(원) | 1.000 | 1.000 | 1.000 | 0.716 | 1.000

[Figure: correlation heatmap]

 | 상대가치점수 | 단가(원) | 수가분류번호
상대가치점수 | 1.000 | 1.000 | 0.361
단가(원) | 1.000 | 1.000 | 0.361
수가분류번호 | 0.361 | 0.361 | 1.000

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

First rows

Row | 적용시작일자 | 상대가치점수 | 한글명 | 수가코드 | 수가분류번호 | 급여구분명 | 단가(원)
0 | 20200101 | 7.45 | 약국관리료(방문당) | Z1000 | 약1 | 급여 | 660
1 | 20200101 | 7.45 | 약국관리료(방문당)-차등수가제외 | Z1000001 | 약1 | 급여 | 660
2 | 20200101 | 16.26 | 조제기본료(방문당) | Z2000 | 약2 | 급여 | 1430
3 | 20200101 | 21.14 | 조제기본료(방문당)-야간 | Z2000010 | 약2 | 급여 | 1860
4 | 20200101 | 21.14 | 조제기본료(방문당)-야간.차등수가제외 | Z2000011 | 약2 | 급여 | 1860
5 | 20200101 | 4.88 | 조제기본료(방문당)-토요09-13 | Z2000030 | 약2 | 급여 | 430
6 | 20200101 | 21.14 | 조제기본료(방문당)-공휴 | Z2000050 | 약2 | 급여 | 1860
7 | 20200101 | 22.93 | 조제기본료(방문당)-6세미만 | Z2000600 | 약2 | 급여 | 2020
8 | 20200101 | 27.81 | 조제기본료(방문당)-6세미만.야간 | Z2000610 | 약2 | 급여 | 2450
9 | 20200101 | 27.81 | 조제기본료(방문당)-6세미만.야간.차등수가제외 | Z2000611 | 약2 | 급여 | 2450

Last rows

Row | 적용시작일자 | 상대가치점수 | 한글명 | 수가코드 | 수가분류번호 | 급여구분명 | 단가(원)
90 | 20200101 | 91.38 | 처방조제-내복약 9일분-심야.차등수가제외 | Z4109021 | 약4가(1)(자) | 급여 | 8040
91 | 20200101 | 13.71 | 처방조제-내복약 9일분-토요09-13 | Z4109030 | 약4가(1)(자) | 급여 | 1210
92 | 20200101 | 59.4 | 처방조제-내복약 9일분-공휴 | Z4109050 | 약4가(1)(자) | 급여 | 5230
93 | 20200101 | 49.42 | 처방조제-내복약 10일분 | Z4110 | 약4가(1)(차) | 급여 | 4350
94 | 20200101 | 64.25 | 처방조제-내복약 10일분-야간 | Z4110010 | 약4가(1)(차) | 급여 | 5650
95 | 20200101 | 64.25 | 처방조제-내복약 10일분-야간.차등수가제외 | Z4110011 | 약4가(1)(차) | 급여 | 5650
96 | 20200101 | 98.84 | 처방조제-내복약 10일분-심야 | Z4110020 | 약4가(1)(차) | 급여 | 8700
97 | 20200101 | 98.84 | 처방조제-내복약 10일분-심야.차등수가제외 | Z4110021 | 약4가(1)(차) | 급여 | 8700
98 | 20200101 | 14.83 | 처방조제-내복약 10일분-토요09-13 | Z4110030 | 약4가(1)(차) | 급여 | 1310
99 | 20200101 | 64.25 | 처방조제-내복약 10일분-공휴 | Z4110050 | 약4가(1)(차) | 급여 | 5650
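
The high-correlation alert for 상대가치점수 and 단가(원) can be spot-checked on the sample rows alone. The sketch below computes the price-to-score ratio and a Pearson coefficient for the 20 sampled rows. It covers only those rows, not the full 100 observations, and the `pearson` helper is written out here for illustration rather than taken from the profiler.

```python
import math

# (상대가치점수, 단가(원)) pairs copied from the first and last sample rows.
pairs = [
    (7.45, 660), (7.45, 660), (16.26, 1430), (21.14, 1860), (21.14, 1860),
    (4.88, 430), (21.14, 1860), (22.93, 2020), (27.81, 2450), (27.81, 2450),
    (91.38, 8040), (13.71, 1210), (59.4, 5230), (49.42, 4350), (64.25, 5650),
    (64.25, 5650), (98.84, 8700), (98.84, 8700), (14.83, 1310), (64.25, 5650),
]

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

scores = [score for score, _ in pairs]
prices = [price for _, price in pairs]

# Price per point for each sampled row.
ratios = [price / score for score, price in pairs]
r = pearson(scores, prices)

print(f"ratio range: {min(ratios):.2f} to {max(ratios):.2f}, r = {r:.5f}")
```

In these rows the ratio stays within a narrow band, which suggests that 단가(원) is close to a fixed multiple of 상대가치점수. That reading is an inference from the sample and is not stated anywhere in the report.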