Overview

Dataset statistics

Number of variables7
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.9 KiB
Average record size in memory60.3 B

Variable types

Categorical3
Numeric2
Text2

Alerts

적용시작일자 has constant value ""Constant
급여구분명 has constant value ""Constant
상대가치점수 is highly overall correlated with 단가(원)High correlation
단가(원) is highly overall correlated with 상대가치점수High correlation
한글명 has unique valuesUnique
수가코드 has unique valuesUnique

Reproduction

Analysis started2023-12-10 11:27:50.791037
Analysis finished2023-12-10 11:27:51.949807
Duration1.16 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

적용시작일자
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
20200101
100 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row20200101
2nd row20200101
3rd row20200101
4th row20200101
5th row20200101

Common Values

ValueCountFrequency (%)
20200101 100
100.0%

Length

2023-12-10T20:27:52.042158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T20:27:52.167943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
20200101 100
100.0%

상대가치점수
Real number (ℝ)

HIGH CORRELATION 

Distinct53
Distinct (%)53.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean43.7878
Minimum3.28
Maximum110.42
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T20:27:52.331650image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3.28
5-th percentile6.67
Q122.24
median39.875
Q360.6125
95-th percentile98.84
Maximum110.42
Range107.14
Interquartile range (IQR)38.3725

Descriptive statistics

Standard deviation27.416324
Coefficient of variation (CV)0.62611786
Kurtosis-0.39195398
Mean43.7878
Median Absolute Deviation (MAD)19.525
Skewness0.57049954
Sum4378.78
Variance751.6548
MonotonicityNot monotonic
2023-12-10T20:27:52.567311image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
22.36 6
 
6.0%
34.4 4
 
4.0%
37.35 3
 
3.0%
71.77 3
 
3.0%
56.02 3
 
3.0%
53.26 3
 
3.0%
64.25 3
 
3.0%
14.22 3
 
3.0%
46.9 3
 
3.0%
42.65 3
 
3.0%
Other values (43) 66
66.0%
ValueCountFrequency (%)
3.28 1
1.0%
5.16 2
2.0%
5.82 1
1.0%
6.67 2
2.0%
7.64 1
1.0%
8.62 1
1.0%
9.84 1
1.0%
10.82 1
1.0%
10.94 1
1.0%
12.29 1
1.0%
ValueCountFrequency (%)
110.42 2
2.0%
104.64 2
2.0%
98.84 2
2.0%
91.38 2
2.0%
86.18 2
2.0%
81.94 2
2.0%
72.16 2
2.0%
71.77 3
3.0%
68.02 3
3.0%
65.62 2
2.0%

한글명
Text

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T20:27:52.913450image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length23
Mean length17.91
Min length10

Characters and Unicode

Total characters1791
Distinct characters52
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique100 ?
Unique (%)100.0%

Sample

1st row복약지도료(방문당)
2nd row복약지도료(방문당)-야간
3rd row복약지도료(방문당)-야간,차등수가제외
4th row복약지도료(방문당)-심야
5th row복약지도료(방문당)-심야,차등수가제외
ValueCountFrequency (%)
처방조제-내복약 84
45.2%
조제투약 2
 
1.1%
복약지도료(방문당 1
 
0.5%
9일분-야간 1
 
0.5%
9일분 1
 
0.5%
8일분-공휴 1
 
0.5%
8일분-토요09-13 1
 
0.5%
8일분-심야,차등수가제외 1
 
0.5%
8일분-심야 1
 
0.5%
8일분-야간,차등수가제외 1
 
0.5%
Other values (92) 92
49.5%
2023-12-10T20:27:53.495986image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 200
 
11.2%
123
 
6.9%
104
 
5.8%
100
 
5.6%
99
 
5.5%
95
 
5.3%
93
 
5.2%
92
 
5.1%
86
 
4.8%
84
 
4.7%
Other values (42) 715
39.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1281
71.5%
Dash Punctuation 200
 
11.2%
Decimal Number 161
 
9.0%
Space Separator 86
 
4.8%
Other Punctuation 35
 
2.0%
Open Punctuation 14
 
0.8%
Close Punctuation 14
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
123
 
9.6%
104
 
8.1%
100
 
7.8%
99
 
7.7%
95
 
7.4%
93
 
7.3%
92
 
7.2%
84
 
6.6%
84
 
6.6%
56
 
4.4%
Other values (27) 351
27.4%
Decimal Number
ValueCountFrequency (%)
1 49
30.4%
3 21
13.0%
9 21
13.0%
0 21
13.0%
2 14
 
8.7%
7 7
 
4.3%
5 7
 
4.3%
4 7
 
4.3%
6 7
 
4.3%
8 7
 
4.3%
Dash Punctuation
ValueCountFrequency (%)
- 200
100.0%
Space Separator
ValueCountFrequency (%)
86
100.0%
Other Punctuation
ValueCountFrequency (%)
, 35
100.0%
Open Punctuation
ValueCountFrequency (%)
( 14
100.0%
Close Punctuation
ValueCountFrequency (%)
) 14
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1281
71.5%
Common 510
 
28.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
123
 
9.6%
104
 
8.1%
100
 
7.8%
99
 
7.7%
95
 
7.4%
93
 
7.3%
92
 
7.2%
84
 
6.6%
84
 
6.6%
56
 
4.4%
Other values (27) 351
27.4%
Common
ValueCountFrequency (%)
- 200
39.2%
86
16.9%
1 49
 
9.6%
, 35
 
6.9%
3 21
 
4.1%
9 21
 
4.1%
0 21
 
4.1%
( 14
 
2.7%
) 14
 
2.7%
2 14
 
2.7%
Other values (5) 35
 
6.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1281
71.5%
ASCII 510
 
28.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 200
39.2%
86
16.9%
1 49
 
9.6%
, 35
 
6.9%
3 21
 
4.1%
9 21
 
4.1%
0 21
 
4.1%
( 14
 
2.7%
) 14
 
2.7%
2 14
 
2.7%
Other values (5) 35
 
6.9%
Hangul
ValueCountFrequency (%)
123
 
9.6%
104
 
8.1%
100
 
7.8%
99
 
7.7%
95
 
7.4%
93
 
7.3%
92
 
7.2%
84
 
6.6%
84
 
6.6%
56
 
4.4%
Other values (27) 351
27.4%

수가코드
Text

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T20:27:53.953041image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length8
Mean length7.52
Min length5

Characters and Unicode

Total characters752
Distinct characters11
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique100 ?
Unique (%)100.0%

Sample

1st rowZ3000
2nd rowZ3000010
3rd rowZ3000011
4th rowZ3000020
5th rowZ3000021
ValueCountFrequency (%)
z3000 1
 
1.0%
z4107021 1
 
1.0%
z4109010 1
 
1.0%
z4109 1
 
1.0%
z4108050 1
 
1.0%
z4108030 1
 
1.0%
z4108021 1
 
1.0%
z4108020 1
 
1.0%
z4108011 1
 
1.0%
z4108010 1
 
1.0%
Other values (90) 90
90.0%
2023-12-10T20:27:54.625090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 249
33.1%
1 183
24.3%
Z 100
13.3%
4 100
13.3%
2 43
 
5.7%
3 28
 
3.7%
5 21
 
2.8%
7 7
 
0.9%
6 7
 
0.9%
9 7
 
0.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 652
86.7%
Uppercase Letter 100
 
13.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 249
38.2%
1 183
28.1%
4 100
15.3%
2 43
 
6.6%
3 28
 
4.3%
5 21
 
3.2%
7 7
 
1.1%
6 7
 
1.1%
9 7
 
1.1%
8 7
 
1.1%
Uppercase Letter
ValueCountFrequency (%)
Z 100
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 652
86.7%
Latin 100
 
13.3%

Most frequent character per script

Common
ValueCountFrequency (%)
0 249
38.2%
1 183
28.1%
4 100
15.3%
2 43
 
6.6%
3 28
 
4.3%
5 21
 
3.2%
7 7
 
1.1%
6 7
 
1.1%
9 7
 
1.1%
8 7
 
1.1%
Latin
ValueCountFrequency (%)
Z 100
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 752
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 249
33.1%
1 183
24.3%
Z 100
13.3%
4 100
13.3%
2 43
 
5.7%
3 28
 
3.7%
5 21
 
2.8%
7 7
 
0.9%
6 7
 
0.9%
9 7
 
0.9%
Distinct16
Distinct (%)16.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
약3
약4가(1)주2
약4가(1)(가)
약4가(1)(나)
약4가(1)(다)
Other values (11)
65 

Length

Max length9
Median length9
Mean length8.42
Min length2

Unique

Unique2 ?
Unique (%)2.0%

Sample

1st row약3
2nd row약3
3rd row약3
4th row약3
5th row약3

Common Values

ValueCountFrequency (%)
약3 7
 
7.0%
약4가(1)주2 7
 
7.0%
약4가(1)(가) 7
 
7.0%
약4가(1)(나) 7
 
7.0%
약4가(1)(다) 7
 
7.0%
약4가(1)(라) 7
 
7.0%
약4가(1)(마) 7
 
7.0%
약4가(1)(바) 7
 
7.0%
약4가(1)(사) 7
 
7.0%
약4가(1)(아) 7
 
7.0%
Other values (6) 30
30.0%

Length

2023-12-10T20:27:54.820627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
약3 7
 
7.0%
약4가(1)주2 7
 
7.0%
약4가(1)(가 7
 
7.0%
약4가(1)(나 7
 
7.0%
약4가(1)(다 7
 
7.0%
약4가(1)(라 7
 
7.0%
약4가(1)(마 7
 
7.0%
약4가(1)(바 7
 
7.0%
약4가(1)(사 7
 
7.0%
약4가(1)(아 7
 
7.0%
Other values (6) 30
30.0%

급여구분명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
급여
100 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row급여
2nd row급여
3rd row급여
4th row급여
5th row급여

Common Values

ValueCountFrequency (%)
급여 100
100.0%

Length

2023-12-10T20:27:54.977233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T20:27:55.085226image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
급여 100
100.0%

단가(원)
Real number (ℝ)

HIGH CORRELATION 

Distinct53
Distinct (%)53.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3854.1
Minimum290
Maximum9720
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T20:27:55.238575image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum290
5-th percentile590
Q11960
median3510
Q35335
95-th percentile8700
Maximum9720
Range9430
Interquartile range (IQR)3375

Descriptive statistics

Standard deviation2412.718
Coefficient of variation (CV)0.62601334
Kurtosis-0.39091819
Mean3854.1
Median Absolute Deviation (MAD)1720
Skewness0.57034228
Sum385410
Variance5821208.3
MonotonicityNot monotonic
2023-12-10T20:27:55.708815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1970 6
 
6.0%
3030 4
 
4.0%
3290 3
 
3.0%
6320 3
 
3.0%
4930 3
 
3.0%
4690 3
 
3.0%
5650 3
 
3.0%
1250 3
 
3.0%
4130 3
 
3.0%
3750 3
 
3.0%
Other values (43) 66
66.0%
ValueCountFrequency (%)
290 1
1.0%
450 2
2.0%
510 1
1.0%
590 2
2.0%
670 1
1.0%
760 1
1.0%
870 1
1.0%
950 1
1.0%
960 1
1.0%
1080 1
1.0%
ValueCountFrequency (%)
9720 2
2.0%
9210 2
2.0%
8700 2
2.0%
8040 2
2.0%
7580 2
2.0%
7210 2
2.0%
6350 2
2.0%
6320 3
3.0%
5990 3
3.0%
5770 2
2.0%

Interactions

2023-12-10T20:27:51.378321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:27:51.107072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:27:51.504052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:27:51.234745image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T20:27:55.820796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
상대가치점수한글명수가코드수가분류번호단가(원)
상대가치점수1.0001.0001.0000.7181.000
한글명1.0001.0001.0001.0001.000
수가코드1.0001.0001.0001.0001.000
수가분류번호0.7181.0001.0001.0000.721
단가(원)1.0001.0001.0000.7211.000
2023-12-10T20:27:55.948432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
상대가치점수단가(원)수가분류번호
상대가치점수1.0001.0000.366
단가(원)1.0001.0000.366
수가분류번호0.3660.3661.000

Missing values

2023-12-10T20:27:51.669916image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T20:27:51.875206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

적용시작일자상대가치점수한글명수가코드수가분류번호급여구분명단가(원)
02020010110.94복약지도료(방문당)Z3000약3급여960
12020010114.22복약지도료(방문당)-야간Z3000010약3급여1250
22020010114.22복약지도료(방문당)-야간,차등수가제외Z3000011약3급여1250
32020010121.88복약지도료(방문당)-심야Z3000020약3급여1930
42020010121.88복약지도료(방문당)-심야,차등수가제외Z3000021약3급여1930
5202001013.28복약지도료(방문당)-토요09-13Z3000030약3급여290
62020010114.22복약지도료(방문당)-공휴Z3000050약3급여1250
7202001016.67처방조제-내복약-가루약 조제투약Z4010약4가(1)주1급여590
8202001016.67직접조제-내복약-가루약 조제투약Z4020약4나(1)주2급여590
92020010117.2처방조제-내복약-포장단위(병,팩)지급Z4100약4가(1)주2급여1510
적용시작일자상대가치점수한글명수가코드수가분류번호급여구분명단가(원)
9020200101104.64처방조제-내복약 11일분-심야,차등수가제외Z4111021약4가(1)(카)급여9210
912020010115.7처방조제-내복약 11일분-토요09-13Z4111030약4가(1)(카)급여1380
922020010168.02처방조제-내복약 11일분-공휴Z4111050약4가(1)(카)급여5990
932020010155.21처방조제-내복약 12일분Z4112약4가(1)(타)급여4860
942020010171.77처방조제-내복약 12일분-야간Z4112010약4가(1)(타)급여6320
952020010171.77처방조제-내복약 12일분-야간,차등수가제외Z4112011약4가(1)(타)급여6320
9620200101110.42처방조제-내복약 12일분-심야Z4112020약4가(1)(타)급여9720
9720200101110.42처방조제-내복약 12일분-심야,차등수가제외Z4112021약4가(1)(타)급여9720
982020010116.56처방조제-내복약 12일분-토요09-13Z4112030약4가(1)(타)급여1460
992020010171.77처방조제-내복약 12일분-공휴Z4112050약4가(1)(타)급여6320