Overview

Dataset statistics

Number of variables12
Number of observations10000
Missing cells7494
Missing cells (%)6.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.0 MiB
Average record size in memory105.0 B

Variable types

Text9
DateTime1
Numeric1
Categorical1

Dataset

Description치료재료 급여비급여목록 및 급여상한금액표 / 치료재료마스터 데이터셋은 치료재료 급여 및 비급여 품목별 상한금액, 규격, 단위, 제조사 등의 정보 제공
Author건강보험심사평가원
URLhttps://www.data.go.kr/data/15067463/fileData.do

Alerts

비고2 is highly imbalanced (54.3%)Imbalance
재질 has 105 (1.1%) missing valuesMissing
비고1 has 7354 (73.5%) missing valuesMissing
코드 has unique valuesUnique
상한금액 has 4528 (45.3%) zerosZeros

Reproduction

Analysis started2024-04-20 12:36:52.601956
Analysis finished2024-04-20 12:36:58.582906
Duration5.98 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Text

Distinct591
Distinct (%)5.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-20T21:36:59.964753image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length20
Mean length15.2263
Min length4

Characters and Unicode

Total characters152263
Distinct characters44
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique585 ?
Unique (%)5.9%

Sample

1st row급여 품목(인체조직 포함)
2nd row(비급여 품목)삭제 및 삭제예정 품목
3rd row삭제 및 삭제 예정 품목
4th row100분의 852미만 본인부담 품목
5th row급여 품목(인체조직 포함)
ValueCountFrequency (%)
포함 5621
14.8%
품목(인체조직 5621
14.8%
급여 4843
12.7%
품목 4378
11.5%
3751
9.9%
급여중지 3682
9.7%
예정 3053
8.0%
삭제 2424
6.4%
비급여 1475
 
3.9%
품목)삭제 697
 
1.8%
Other values (590) 2493
6.6%
2024-04-20T21:37:02.433209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
28038
18.4%
10696
 
7.0%
10696
 
7.0%
10002
 
6.6%
10002
 
6.6%
( 6319
 
4.2%
) 6319
 
4.2%
6204
 
4.1%
5621
 
3.7%
5621
 
3.7%
Other values (34) 52745
34.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 107702
70.7%
Space Separator 28038
 
18.4%
Open Punctuation 6319
 
4.2%
Close Punctuation 6319
 
4.2%
Decimal Number 3885
 
2.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10696
 
9.9%
10696
 
9.9%
10002
 
9.3%
10002
 
9.3%
6204
 
5.8%
5621
 
5.2%
5621
 
5.2%
5621
 
5.2%
5621
 
5.2%
5621
 
5.2%
Other values (21) 31997
29.7%
Decimal Number
ValueCountFrequency (%)
0 1330
34.2%
1 993
25.6%
2 346
 
8.9%
3 188
 
4.8%
4 186
 
4.8%
6 180
 
4.6%
5 178
 
4.6%
9 173
 
4.5%
7 157
 
4.0%
8 154
 
4.0%
Space Separator
ValueCountFrequency (%)
28038
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6319
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6319
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 107702
70.7%
Common 44561
29.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10696
 
9.9%
10696
 
9.9%
10002
 
9.3%
10002
 
9.3%
6204
 
5.8%
5621
 
5.2%
5621
 
5.2%
5621
 
5.2%
5621
 
5.2%
5621
 
5.2%
Other values (21) 31997
29.7%
Common
ValueCountFrequency (%)
28038
62.9%
( 6319
 
14.2%
) 6319
 
14.2%
0 1330
 
3.0%
1 993
 
2.2%
2 346
 
0.8%
3 188
 
0.4%
4 186
 
0.4%
6 180
 
0.4%
5 178
 
0.4%
Other values (3) 484
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 107702
70.7%
ASCII 44561
29.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
28038
62.9%
( 6319
 
14.2%
) 6319
 
14.2%
0 1330
 
3.0%
1 993
 
2.2%
2 346
 
0.8%
3 188
 
0.4%
4 186
 
0.4%
6 180
 
0.4%
5 178
 
0.4%
Other values (3) 484
 
1.1%
Hangul
ValueCountFrequency (%)
10696
 
9.9%
10696
 
9.9%
10002
 
9.3%
10002
 
9.3%
6204
 
5.8%
5621
 
5.2%
5621
 
5.2%
5621
 
5.2%
5621
 
5.2%
5621
 
5.2%
Other values (21) 31997
29.7%

코드
Text

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-20T21:37:03.659159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length8
Mean length8.0001
Min length8

Characters and Unicode

Total characters80001
Distinct characters36
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10000 ?
Unique (%)100.0%

Sample

1st rowTFF03095
2nd rowBM5018CU
3rd rowC2100004
4th rowK9203030
5th rowJ2401801
ValueCountFrequency (%)
tff03095 1
 
< 0.1%
k7222123 1
 
< 0.1%
k6043014 1
 
< 0.1%
m3203118 1
 
< 0.1%
tbe52002 1
 
< 0.1%
tbl11078 1
 
< 0.1%
k8311022 1
 
< 0.1%
j3202003 1
 
< 0.1%
c3014032 1
 
< 0.1%
m6710025 1
 
< 0.1%
Other values (9990) 9990
99.9%
2024-04-20T21:37:05.350611image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 20344
25.4%
1 11049
13.8%
2 7236
 
9.0%
3 5976
 
7.5%
4 4632
 
5.8%
5 3926
 
4.9%
6 3342
 
4.2%
7 3208
 
4.0%
B 2655
 
3.3%
8 2495
 
3.1%
Other values (26) 15138
18.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 64270
80.3%
Uppercase Letter 15731
 
19.7%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
B 2655
16.9%
K 1997
12.7%
M 1896
12.1%
C 1784
11.3%
J 1238
7.9%
T 964
 
6.1%
L 796
 
5.1%
F 700
 
4.4%
E 615
 
3.9%
G 389
 
2.5%
Other values (16) 2697
17.1%
Decimal Number
ValueCountFrequency (%)
0 20344
31.7%
1 11049
17.2%
2 7236
 
11.3%
3 5976
 
9.3%
4 4632
 
7.2%
5 3926
 
6.1%
6 3342
 
5.2%
7 3208
 
5.0%
8 2495
 
3.9%
9 2062
 
3.2%

Most occurring scripts

ValueCountFrequency (%)
Common 64270
80.3%
Latin 15731
 
19.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
B 2655
16.9%
K 1997
12.7%
M 1896
12.1%
C 1784
11.3%
J 1238
7.9%
T 964
 
6.1%
L 796
 
5.1%
F 700
 
4.4%
E 615
 
3.9%
G 389
 
2.5%
Other values (16) 2697
17.1%
Common
ValueCountFrequency (%)
0 20344
31.7%
1 11049
17.2%
2 7236
 
11.3%
3 5976
 
9.3%
4 4632
 
7.2%
5 3926
 
6.1%
6 3342
 
5.2%
7 3208
 
5.0%
8 2495
 
3.9%
9 2062
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 80001
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 20344
25.4%
1 11049
13.8%
2 7236
 
9.0%
3 5976
 
7.5%
4 4632
 
5.8%
5 3926
 
4.9%
6 3342
 
4.2%
7 3208
 
4.0%
B 2655
 
3.3%
8 2495
 
3.1%
Other values (26) 15138
18.9%
Distinct205
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2001-05-01 00:00:00
Maximum2023-11-01 00:00:00
2024-04-20T21:37:05.591802image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-20T21:37:06.102875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

품명
Text

Distinct7257
Distinct (%)72.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-20T21:37:07.308522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length205
Median length86
Mean length18.1032
Min length2

Characters and Unicode

Total characters181032
Distinct characters503
Distinct categories14 ?
Distinct scripts4 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6103 ?
Unique (%)61.0%

Sample

1st rowFASCIA
2nd rowADFLEX BANDAGE
3rd rowTIBIALIS POSTERIOR TENDON
4th rowTHE ARTERY COMPRESSION TOURNIQUET (FOR RADIAL ARTERY)
5th rowTRIANGLE TIP KNIFE J
ValueCountFrequency (%)
plate 596
 
2.2%
screw 355
 
1.3%
catheter 329
 
1.2%
system 309
 
1.2%
set 223
 
0.8%
splint 196
 
0.7%
bone 178
 
0.7%
cancellous 159
 
0.6%
plus 156
 
0.6%
locking 154
 
0.6%
Other values (7409) 23904
90.0%
2024-04-20T21:37:09.001570image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
16768
 
9.3%
E 16636
 
9.2%
A 12071
 
6.7%
T 11317
 
6.3%
I 10735
 
5.9%
R 10058
 
5.6%
S 9837
 
5.4%
O 9545
 
5.3%
L 9544
 
5.3%
N 8714
 
4.8%
Other values (493) 65807
36.4%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 145325
80.3%
Space Separator 16768
 
9.3%
Other Letter 12014
 
6.6%
Decimal Number 1980
 
1.1%
Dash Punctuation 1648
 
0.9%
Other Punctuation 1070
 
0.6%
Close Punctuation 993
 
0.5%
Open Punctuation 993
 
0.5%
Letter Number 86
 
< 0.1%
Lowercase Letter 62
 
< 0.1%
Other values (4) 93
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
490
 
4.1%
446
 
3.7%
418
 
3.5%
331
 
2.8%
320
 
2.7%
307
 
2.6%
294
 
2.4%
236
 
2.0%
223
 
1.9%
222
 
1.8%
Other values (407) 8727
72.6%
Uppercase Letter
ValueCountFrequency (%)
E 16636
11.4%
A 12071
 
8.3%
T 11317
 
7.8%
I 10735
 
7.4%
R 10058
 
6.9%
S 9837
 
6.8%
O 9545
 
6.6%
L 9544
 
6.6%
N 8714
 
6.0%
C 7691
 
5.3%
Other values (17) 39177
27.0%
Lowercase Letter
ValueCountFrequency (%)
α 10
16.1%
m 8
12.9%
e 7
11.3%
r 6
9.7%
o 5
8.1%
i 5
8.1%
β 4
 
6.5%
n 3
 
4.8%
u 3
 
4.8%
h 2
 
3.2%
Other values (6) 9
14.5%
Decimal Number
ValueCountFrequency (%)
0 440
22.2%
3 409
20.7%
2 298
15.1%
1 295
14.9%
5 179
9.0%
4 141
 
7.1%
6 86
 
4.3%
7 72
 
3.6%
8 35
 
1.8%
9 24
 
1.2%
Other Punctuation
ValueCountFrequency (%)
, 435
40.7%
. 282
26.4%
/ 246
23.0%
& 52
 
4.9%
: 20
 
1.9%
" 15
 
1.4%
' 13
 
1.2%
· 5
 
0.5%
% 1
 
0.1%
* 1
 
0.1%
Math Symbol
ValueCountFrequency (%)
+ 41
73.2%
~ 10
 
17.9%
2
 
3.6%
1
 
1.8%
< 1
 
1.8%
> 1
 
1.8%
Letter Number
ValueCountFrequency (%)
43
50.0%
26
30.2%
13
 
15.1%
3
 
3.5%
1
 
1.2%
Other Symbol
ValueCountFrequency (%)
15
71.4%
4
 
19.0%
° 2
 
9.5%
Close Punctuation
ValueCountFrequency (%)
) 979
98.6%
] 14
 
1.4%
Open Punctuation
ValueCountFrequency (%)
( 979
98.6%
[ 14
 
1.4%
Space Separator
ValueCountFrequency (%)
16768
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1648
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 15
100.0%
Other Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 145458
80.3%
Common 23545
 
13.0%
Hangul 12014
 
6.6%
Greek 15
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
490
 
4.1%
446
 
3.7%
418
 
3.5%
331
 
2.8%
320
 
2.7%
307
 
2.6%
294
 
2.4%
236
 
2.0%
223
 
1.9%
222
 
1.8%
Other values (407) 8727
72.6%
Latin
ValueCountFrequency (%)
E 16636
11.4%
A 12071
 
8.3%
T 11317
 
7.8%
I 10735
 
7.4%
R 10058
 
6.9%
S 9837
 
6.8%
O 9545
 
6.6%
L 9544
 
6.6%
N 8714
 
6.0%
C 7691
 
5.3%
Other values (35) 39310
27.0%
Common
ValueCountFrequency (%)
16768
71.2%
- 1648
 
7.0%
) 979
 
4.2%
( 979
 
4.2%
0 440
 
1.9%
, 435
 
1.8%
3 409
 
1.7%
2 298
 
1.3%
1 295
 
1.3%
. 282
 
1.2%
Other values (28) 1012
 
4.3%
Greek
ValueCountFrequency (%)
α 10
66.7%
β 4
 
26.7%
Ι 1
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 168886
93.3%
Hangul 12014
 
6.6%
Number Forms 87
 
< 0.1%
None 24
 
< 0.1%
CJK Compat 15
 
< 0.1%
Letterlike Symbols 4
 
< 0.1%
Math Operators 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
16768
 
9.9%
E 16636
 
9.9%
A 12071
 
7.1%
T 11317
 
6.7%
I 10735
 
6.4%
R 10058
 
6.0%
S 9837
 
5.8%
O 9545
 
5.7%
L 9544
 
5.7%
N 8714
 
5.2%
Other values (60) 53661
31.8%
Hangul
ValueCountFrequency (%)
490
 
4.1%
446
 
3.7%
418
 
3.5%
331
 
2.8%
320
 
2.7%
307
 
2.6%
294
 
2.4%
236
 
2.0%
223
 
1.9%
222
 
1.8%
Other values (407) 8727
72.6%
Number Forms
ValueCountFrequency (%)
43
49.4%
26
29.9%
13
 
14.9%
3
 
3.4%
1
 
1.1%
1
 
1.1%
CJK Compat
ValueCountFrequency (%)
15
100.0%
None
ValueCountFrequency (%)
α 10
41.7%
· 5
20.8%
β 4
 
16.7%
° 2
 
8.3%
1
 
4.2%
1
 
4.2%
Ι 1
 
4.2%
Letterlike Symbols
ValueCountFrequency (%)
4
100.0%
Math Operators
ValueCountFrequency (%)
2
100.0%

규격
Text

Distinct2073
Distinct (%)20.8%
Missing21
Missing (%)0.2%
Memory size156.2 KiB
2024-04-20T21:37:09.992946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length226
Median length3
Mean length6.6363363
Min length1

Characters and Unicode

Total characters66224
Distinct characters169
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1463 ?
Unique (%)14.7%

Sample

1st row100CM² 이상
2nd row9.0CMX15.0CM(5.0CMX10.0CM)
3rd row전규격
4th row전규격
5th row전규격
ValueCountFrequency (%)
전규격 5039
35.5%
x 1075
 
7.6%
10cm 212
 
1.5%
type 140
 
1.0%
215cm 134
 
0.9%
이하 133
 
0.9%
straight 123
 
0.9%
5cm 122
 
0.9%
needle 120
 
0.8%
double 118
 
0.8%
Other values (1846) 6966
49.1%
2024-04-20T21:37:11.464288image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5054
 
7.6%
5052
 
7.6%
5052
 
7.6%
M 4925
 
7.4%
C 4294
 
6.5%
4254
 
6.4%
0 4097
 
6.2%
1 2871
 
4.3%
5 2675
 
4.0%
X 2298
 
3.5%
Other values (159) 25652
38.7%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 21843
33.0%
Other Letter 18153
27.4%
Decimal Number 15861
24.0%
Space Separator 4254
 
6.4%
Other Punctuation 2769
 
4.2%
Other Symbol 758
 
1.1%
Lowercase Letter 749
 
1.1%
Close Punctuation 531
 
0.8%
Open Punctuation 530
 
0.8%
Dash Punctuation 487
 
0.7%
Other values (2) 289
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5054
27.8%
5052
27.8%
5052
27.8%
754
 
4.2%
479
 
2.6%
449
 
2.5%
449
 
2.5%
272
 
1.5%
138
 
0.8%
138
 
0.8%
Other values (74) 316
 
1.7%
Uppercase Letter
ValueCountFrequency (%)
M 4925
22.5%
C 4294
19.7%
X 2298
10.5%
E 1419
 
6.5%
T 895
 
4.1%
H 845
 
3.9%
L 843
 
3.9%
P 728
 
3.3%
O 671
 
3.1%
A 659
 
3.0%
Other values (17) 4266
19.5%
Lowercase Letter
ValueCountFrequency (%)
c 240
32.0%
m 163
21.8%
x 137
18.3%
g 49
 
6.5%
p 35
 
4.7%
s 31
 
4.1%
l 24
 
3.2%
e 21
 
2.8%
n 12
 
1.6%
d 8
 
1.1%
Other values (10) 29
 
3.9%
Decimal Number
ValueCountFrequency (%)
0 4097
25.8%
1 2871
18.1%
5 2675
16.9%
2 1924
12.1%
4 1213
 
7.6%
3 1150
 
7.3%
7 583
 
3.7%
6 558
 
3.5%
8 525
 
3.3%
9 260
 
1.6%
Other values (3) 5
 
< 0.1%
Other Punctuation
ValueCountFrequency (%)
. 1054
38.1%
" 651
23.5%
, 577
20.8%
/ 401
 
14.5%
* 66
 
2.4%
: 7
 
0.3%
# 6
 
0.2%
& 5
 
0.2%
2
 
0.1%
Other Symbol
ValueCountFrequency (%)
632
83.4%
58
 
7.7%
49
 
6.5%
10
 
1.3%
7
 
0.9%
2
 
0.3%
Math Symbol
ValueCountFrequency (%)
× 166
63.1%
~ 54
 
20.5%
+ 41
 
15.6%
= 1
 
0.4%
1
 
0.4%
Space Separator
ValueCountFrequency (%)
4254
100.0%
Close Punctuation
ValueCountFrequency (%)
) 531
100.0%
Open Punctuation
ValueCountFrequency (%)
( 530
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 487
100.0%
Other Number
ValueCountFrequency (%)
² 26
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 25479
38.5%
Latin 22583
34.1%
Hangul 18153
27.4%
Greek 9
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5054
27.8%
5052
27.8%
5052
27.8%
754
 
4.2%
479
 
2.6%
449
 
2.5%
449
 
2.5%
272
 
1.5%
138
 
0.8%
138
 
0.8%
Other values (74) 316
 
1.7%
Latin
ValueCountFrequency (%)
M 4925
21.8%
C 4294
19.0%
X 2298
10.2%
E 1419
 
6.3%
T 895
 
4.0%
H 845
 
3.7%
L 843
 
3.7%
P 728
 
3.2%
O 671
 
3.0%
A 659
 
2.9%
Other values (35) 5006
22.2%
Common
ValueCountFrequency (%)
4254
16.7%
0 4097
16.1%
1 2871
11.3%
5 2675
10.5%
2 1924
 
7.6%
4 1213
 
4.8%
3 1150
 
4.5%
. 1054
 
4.1%
" 651
 
2.6%
632
 
2.5%
Other values (28) 4958
19.5%
Greek
ValueCountFrequency (%)
Φ 6
66.7%
μ 3
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 47104
71.1%
Hangul 18153
 
27.4%
CJK Compat 758
 
1.1%
None 207
 
0.3%
Punctuation 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
5054
27.8%
5052
27.8%
5052
27.8%
754
 
4.2%
479
 
2.6%
449
 
2.5%
449
 
2.5%
272
 
1.5%
138
 
0.8%
138
 
0.8%
Other values (74) 316
 
1.7%
ASCII
ValueCountFrequency (%)
M 4925
 
10.5%
C 4294
 
9.1%
4254
 
9.0%
0 4097
 
8.7%
1 2871
 
6.1%
5 2675
 
5.7%
X 2298
 
4.9%
2 1924
 
4.1%
E 1419
 
3.0%
4 1213
 
2.6%
Other values (60) 17134
36.4%
CJK Compat
ValueCountFrequency (%)
632
83.4%
58
 
7.7%
49
 
6.5%
10
 
1.3%
7
 
0.9%
2
 
0.3%
None
ValueCountFrequency (%)
× 166
80.2%
² 26
 
12.6%
Φ 6
 
2.9%
μ 3
 
1.4%
2
 
1.0%
2
 
1.0%
1
 
0.5%
1
 
0.5%
Punctuation
ValueCountFrequency (%)
2
100.0%

단위
Text

Distinct66
Distinct (%)0.7%
Missing4
Missing (%)< 0.1%
Memory size156.2 KiB
2024-04-20T21:37:12.138151image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length3
Mean length2.9867947
Min length1

Characters and Unicode

Total characters29856
Distinct characters52
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique34 ?
Unique (%)0.3%

Sample

1st row1EA
2nd row1EA
3rd row1EA
4th row1EA
5th row1EA
ValueCountFrequency (%)
1ea 8934
89.2%
1장 339
 
3.4%
1set 149
 
1.5%
1roll 118
 
1.2%
편측 111
 
1.1%
1회 80
 
0.8%
1매 62
 
0.6%
45
 
0.4%
1kit 30
 
0.3%
cm2 22
 
0.2%
Other values (51) 121
 
1.2%
2024-04-20T21:37:12.896971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 9769
32.7%
E 9113
30.5%
A 8961
30.0%
339
 
1.1%
L 244
 
0.8%
T 199
 
0.7%
S 166
 
0.6%
R 124
 
0.4%
O 122
 
0.4%
111
 
0.4%
Other values (42) 708
 
2.4%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 19106
64.0%
Decimal Number 9888
33.1%
Other Letter 778
 
2.6%
Dash Punctuation 45
 
0.2%
Space Separator 20
 
0.1%
Other Punctuation 11
 
< 0.1%
Open Punctuation 3
 
< 0.1%
Close Punctuation 3
 
< 0.1%
Lowercase Letter 2
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
E 9113
47.7%
A 8961
46.9%
L 244
 
1.3%
T 199
 
1.0%
S 166
 
0.9%
R 124
 
0.6%
O 122
 
0.6%
I 52
 
0.3%
K 38
 
0.2%
C 27
 
0.1%
Other values (10) 60
 
0.3%
Other Letter
ValueCountFrequency (%)
339
43.6%
111
 
14.3%
111
 
14.3%
98
 
12.6%
62
 
8.0%
13
 
1.7%
9
 
1.2%
8
 
1.0%
6
 
0.8%
6
 
0.8%
Other values (4) 15
 
1.9%
Decimal Number
ValueCountFrequency (%)
1 9769
98.8%
2 47
 
0.5%
0 27
 
0.3%
7 11
 
0.1%
5 11
 
0.1%
3 8
 
0.1%
4 7
 
0.1%
9 4
 
< 0.1%
6 3
 
< 0.1%
8 1
 
< 0.1%
Other Punctuation
ValueCountFrequency (%)
/ 7
63.6%
, 4
36.4%
Lowercase Letter
ValueCountFrequency (%)
e 1
50.0%
t 1
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 45
100.0%
Space Separator
ValueCountFrequency (%)
20
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 19108
64.0%
Common 9970
33.4%
Hangul 778
 
2.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
E 9113
47.7%
A 8961
46.9%
L 244
 
1.3%
T 199
 
1.0%
S 166
 
0.9%
R 124
 
0.6%
O 122
 
0.6%
I 52
 
0.3%
K 38
 
0.2%
C 27
 
0.1%
Other values (12) 62
 
0.3%
Common
ValueCountFrequency (%)
1 9769
98.0%
2 47
 
0.5%
- 45
 
0.5%
0 27
 
0.3%
20
 
0.2%
7 11
 
0.1%
5 11
 
0.1%
3 8
 
0.1%
/ 7
 
0.1%
4 7
 
0.1%
Other values (6) 18
 
0.2%
Hangul
ValueCountFrequency (%)
339
43.6%
111
 
14.3%
111
 
14.3%
98
 
12.6%
62
 
8.0%
13
 
1.7%
9
 
1.2%
8
 
1.0%
6
 
0.8%
6
 
0.8%
Other values (4) 15
 
1.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 29078
97.4%
Hangul 778
 
2.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 9769
33.6%
E 9113
31.3%
A 8961
30.8%
L 244
 
0.8%
T 199
 
0.7%
S 166
 
0.6%
R 124
 
0.4%
O 122
 
0.4%
I 52
 
0.2%
2 47
 
0.2%
Other values (28) 281
 
1.0%
Hangul
ValueCountFrequency (%)
339
43.6%
111
 
14.3%
111
 
14.3%
98
 
12.6%
62
 
8.0%
13
 
1.7%
9
 
1.2%
8
 
1.0%
6
 
0.8%
6
 
0.8%
Other values (4) 15
 
1.9%

상한금액
Real number (ℝ)

ZEROS 

Distinct1499
Distinct (%)15.0%
Missing2
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean193143.18
Minimum0
Maximum22230140
Zeros4528
Zeros (%)45.3%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-20T21:37:13.283360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median648
Q375450
95-th percentile867140
Maximum22230140
Range22230140
Interquartile range (IQR)75450

Descriptive statistics

Standard deviation966606.31
Coefficient of variation (CV)5.00461
Kurtosis285.91707
Mean193143.18
Median Absolute Deviation (MAD)648
Skewness15.159057
Sum1.9310456 × 109
Variance9.3432777 × 1011
MonotonicityNot monotonic
2024-04-20T21:37:13.546656image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 4528
45.3%
27580 69
 
0.7%
1470 64
 
0.6%
1235780 53
 
0.5%
44560 47
 
0.5%
81300 44
 
0.4%
5170 38
 
0.4%
4400 37
 
0.4%
98970 36
 
0.4%
307 33
 
0.3%
Other values (1489) 5049
50.5%
ValueCountFrequency (%)
0 4528
45.3%
9 16
 
0.2%
14 8
 
0.1%
18 3
 
< 0.1%
20 15
 
0.1%
21 7
 
0.1%
23 2
 
< 0.1%
28 5
 
0.1%
29 4
 
< 0.1%
30 8
 
0.1%
ValueCountFrequency (%)
22230140 4
< 0.1%
20971820 3
< 0.1%
20209220 1
 
< 0.1%
19868040 2
< 0.1%
19009430 1
 
< 0.1%
17286320 1
 
< 0.1%
15682230 1
 
< 0.1%
13265100 1
 
< 0.1%
13031300 1
 
< 0.1%
12347370 4
< 0.1%
Distinct2758
Distinct (%)27.6%
Missing2
Missing (%)< 0.1%
Memory size156.2 KiB
2024-04-20T21:37:14.544598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length141
Median length59
Mean length17.224245
Min length1

Characters and Unicode

Total characters172208
Distinct characters202
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1428 ?
Unique (%)14.3%

Sample

1st rowCOMMUNITY BLOOD CENTER/COMMUNITY TISSUE SERVICES
2nd rowYOUNG CHEMICAL
3rd rowREGENERATION TECHNOLOGY INC
4th rowHANGZHOU SHANYOU MEDICAL EQUIPMENT CO. LTD
5th rowOLYMPUS MEDICAL SYSTEMS CORPORATION
ValueCountFrequency (%)
medical 1614
 
6.7%
ltd 904
 
3.7%
co 773
 
3.2%
inc 743
 
3.1%
co.,ltd 560
 
2.3%
gmbh 494
 
2.0%
corporation 333
 
1.4%
227
 
0.9%
tissue 211
 
0.9%
surgical 202
 
0.8%
Other values (2683) 18178
75.0%
2024-04-20T21:37:16.094920image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
14425
 
8.4%
E 14209
 
8.3%
I 12961
 
7.5%
O 11920
 
6.9%
C 10973
 
6.4%
N 10713
 
6.2%
A 10445
 
6.1%
T 9075
 
5.3%
L 8317
 
4.8%
S 7988
 
4.6%
Other values (192) 61182
35.5%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 146407
85.0%
Space Separator 14425
 
8.4%
Other Punctuation 6605
 
3.8%
Other Letter 3777
 
2.2%
Dash Punctuation 366
 
0.2%
Close Punctuation 197
 
0.1%
Open Punctuation 196
 
0.1%
Lowercase Letter 144
 
0.1%
Decimal Number 77
 
< 0.1%
Other Symbol 8
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
181
 
4.8%
177
 
4.7%
177
 
4.7%
177
 
4.7%
174
 
4.6%
151
 
4.0%
132
 
3.5%
132
 
3.5%
130
 
3.4%
128
 
3.4%
Other values (126) 2218
58.7%
Uppercase Letter
ValueCountFrequency (%)
E 14209
 
9.7%
I 12961
 
8.9%
O 11920
 
8.1%
C 10973
 
7.5%
N 10713
 
7.3%
A 10445
 
7.1%
T 9075
 
6.2%
L 8317
 
5.7%
S 7988
 
5.5%
D 7863
 
5.4%
Other values (16) 41943
28.6%
Lowercase Letter
ValueCountFrequency (%)
o 24
16.7%
n 14
9.7%
r 13
9.0%
a 12
8.3%
c 12
8.3%
i 12
8.3%
e 11
7.6%
d 10
6.9%
t 9
 
6.2%
u 6
 
4.2%
Other values (6) 21
14.6%
Other Punctuation
ValueCountFrequency (%)
. 3794
57.4%
, 2023
30.6%
& 574
 
8.7%
/ 196
 
3.0%
9
 
0.1%
' 6
 
0.1%
: 1
 
< 0.1%
1
 
< 0.1%
· 1
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
3 53
68.8%
0 8
 
10.4%
1 6
 
7.8%
2 3
 
3.9%
6 3
 
3.9%
5 1
 
1.3%
9 1
 
1.3%
8 1
 
1.3%
7 1
 
1.3%
Space Separator
ValueCountFrequency (%)
14425
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 366
100.0%
Close Punctuation
ValueCountFrequency (%)
) 197
100.0%
Open Punctuation
ValueCountFrequency (%)
( 196
100.0%
Other Symbol
ValueCountFrequency (%)
8
100.0%
Math Symbol
ValueCountFrequency (%)
+ 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 146551
85.1%
Common 21872
 
12.7%
Hangul 3785
 
2.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
181
 
4.8%
177
 
4.7%
177
 
4.7%
177
 
4.7%
174
 
4.6%
151
 
4.0%
132
 
3.5%
132
 
3.5%
130
 
3.4%
128
 
3.4%
Other values (127) 2226
58.8%
Latin
ValueCountFrequency (%)
E 14209
 
9.7%
I 12961
 
8.8%
O 11920
 
8.1%
C 10973
 
7.5%
N 10713
 
7.3%
A 10445
 
7.1%
T 9075
 
6.2%
L 8317
 
5.7%
S 7988
 
5.5%
D 7863
 
5.4%
Other values (32) 42087
28.7%
Common
ValueCountFrequency (%)
14425
66.0%
. 3794
 
17.3%
, 2023
 
9.2%
& 574
 
2.6%
- 366
 
1.7%
) 197
 
0.9%
/ 196
 
0.9%
( 196
 
0.9%
3 53
 
0.2%
9
 
< 0.1%
Other values (13) 39
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 168412
97.8%
Hangul 3777
 
2.2%
None 19
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
14425
 
8.6%
E 14209
 
8.4%
I 12961
 
7.7%
O 11920
 
7.1%
C 10973
 
6.5%
N 10713
 
6.4%
A 10445
 
6.2%
T 9075
 
5.4%
L 8317
 
4.9%
S 7988
 
4.7%
Other values (52) 57386
34.1%
Hangul
ValueCountFrequency (%)
181
 
4.8%
177
 
4.7%
177
 
4.7%
177
 
4.7%
174
 
4.6%
151
 
4.0%
132
 
3.5%
132
 
3.5%
130
 
3.4%
128
 
3.4%
Other values (126) 2218
58.7%
None
ValueCountFrequency (%)
9
47.4%
8
42.1%
1
 
5.3%
· 1
 
5.3%

재질
Text

MISSING 

Distinct3118
Distinct (%)31.5%
Missing105
Missing (%)1.1%
Memory size156.2 KiB
2024-04-20T21:37:17.225432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length156
Median length84
Mean length15.4762
Min length1

Characters and Unicode

Total characters153137
Distinct characters424
Distinct categories14 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2063 ?
Unique (%)20.8%

Sample

1st rowFASCIA LATA
2nd row폴리우레탄필름/폴리아크릴알킬에스텔에멀전/부직포/폴리에칠렌망
3rd rowHUMAN
4th rowPC100%, SILICONE 100%
5th rowSTAINLESS STEEL 304 등
ValueCountFrequency (%)
2003
 
9.7%
titanium 1280
 
6.2%
stainless 743
 
3.6%
steel 707
 
3.4%
579
 
2.8%
alloy 546
 
2.6%
cotton 464
 
2.2%
polyurethane 309
 
1.5%
bone 293
 
1.4%
면사 252
 
1.2%
Other values (2606) 13569
65.4%
2024-04-20T21:37:18.889148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
11115
 
7.3%
E 10991
 
7.2%
L 10483
 
6.8%
T 10015
 
6.5%
I 8694
 
5.7%
A 8685
 
5.7%
O 8297
 
5.4%
N 7322
 
4.8%
S 5536
 
3.6%
P 4933
 
3.2%
Other values (414) 67066
43.8%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 104460
68.2%
Other Letter 26461
 
17.3%
Space Separator 11115
 
7.3%
Other Punctuation 4014
 
2.6%
Decimal Number 2450
 
1.6%
Dash Punctuation 1748
 
1.1%
Math Symbol 1394
 
0.9%
Close Punctuation 558
 
0.4%
Open Punctuation 551
 
0.4%
Lowercase Letter 367
 
0.2%
Other values (4) 19
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2243
 
8.5%
1964
 
7.4%
1597
 
6.0%
1069
 
4.0%
926
 
3.5%
914
 
3.5%
618
 
2.3%
614
 
2.3%
611
 
2.3%
611
 
2.3%
Other values (328) 15294
57.8%
Uppercase Letter
ValueCountFrequency (%)
E 10991
10.5%
L 10483
10.0%
T 10015
9.6%
I 8694
 
8.3%
A 8685
 
8.3%
O 8297
 
7.9%
N 7322
 
7.0%
S 5536
 
5.3%
P 4933
 
4.7%
C 4705
 
4.5%
Other values (18) 24799
23.7%
Lowercase Letter
ValueCountFrequency (%)
β 126
34.3%
e 46
 
12.5%
l 22
 
6.0%
t 20
 
5.4%
a 19
 
5.2%
i 16
 
4.4%
o 16
 
4.4%
n 14
 
3.8%
y 13
 
3.5%
c 13
 
3.5%
Other values (14) 62
16.9%
Decimal Number
ValueCountFrequency (%)
4 537
21.9%
6 532
21.7%
0 492
20.1%
1 250
10.2%
2 189
 
7.7%
5 135
 
5.5%
3 116
 
4.7%
7 75
 
3.1%
8 70
 
2.9%
9 54
 
2.2%
Other Punctuation
ValueCountFrequency (%)
, 2932
73.0%
% 426
 
10.6%
. 251
 
6.3%
/ 243
 
6.1%
: 104
 
2.6%
& 50
 
1.2%
; 7
 
0.2%
' 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
+ 1363
97.8%
~ 12
 
0.9%
11
 
0.8%
± 8
 
0.6%
Other Number
ValueCountFrequency (%)
2
40.0%
2
40.0%
1
20.0%
Close Punctuation
ValueCountFrequency (%)
) 554
99.3%
] 4
 
0.7%
Open Punctuation
ValueCountFrequency (%)
( 550
99.8%
[ 1
 
0.2%
Space Separator
ValueCountFrequency (%)
11115
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1748
100.0%
Letter Number
ValueCountFrequency (%)
9
100.0%
Format
ValueCountFrequency (%)
­ 3
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 104695
68.4%
Hangul 26461
 
17.3%
Common 21840
 
14.3%
Greek 141
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2243
 
8.5%
1964
 
7.4%
1597
 
6.0%
1069
 
4.0%
926
 
3.5%
914
 
3.5%
618
 
2.3%
614
 
2.3%
611
 
2.3%
611
 
2.3%
Other values (328) 15294
57.8%
Latin
ValueCountFrequency (%)
E 10991
10.5%
L 10483
10.0%
T 10015
9.6%
I 8694
 
8.3%
A 8685
 
8.3%
O 8297
 
7.9%
N 7322
 
7.0%
S 5536
 
5.3%
P 4933
 
4.7%
C 4705
 
4.5%
Other values (37) 25034
23.9%
Common
ValueCountFrequency (%)
11115
50.9%
, 2932
 
13.4%
- 1748
 
8.0%
+ 1363
 
6.2%
) 554
 
2.5%
( 550
 
2.5%
4 537
 
2.5%
6 532
 
2.4%
0 492
 
2.3%
% 426
 
2.0%
Other values (23) 1591
 
7.3%
Greek
ValueCountFrequency (%)
β 126
89.4%
Β 9
 
6.4%
α 3
 
2.1%
σ 1
 
0.7%
Ι 1
 
0.7%
ω 1
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 126492
82.6%
Hangul 26461
 
17.3%
None 175
 
0.1%
Number Forms 9
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
11115
 
8.8%
E 10991
 
8.7%
L 10483
 
8.3%
T 10015
 
7.9%
I 8694
 
6.9%
A 8685
 
6.9%
O 8297
 
6.6%
N 7322
 
5.8%
S 5536
 
4.4%
P 4933
 
3.9%
Other values (62) 40421
32.0%
Hangul
ValueCountFrequency (%)
2243
 
8.5%
1964
 
7.4%
1597
 
6.0%
1069
 
4.0%
926
 
3.5%
914
 
3.5%
618
 
2.3%
614
 
2.3%
611
 
2.3%
611
 
2.3%
Other values (328) 15294
57.8%
None
ValueCountFrequency (%)
β 126
72.0%
11
 
6.3%
Β 9
 
5.1%
± 8
 
4.6%
ß 7
 
4.0%
­ 3
 
1.7%
α 3
 
1.7%
2
 
1.1%
2
 
1.1%
σ 1
 
0.6%
Other values (3) 3
 
1.7%
Number Forms
ValueCountFrequency (%)
9
100.0%
Distinct1787
Distinct (%)17.9%
Missing6
Missing (%)0.1%
Memory size156.2 KiB
2024-04-20T21:37:19.911166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length19
Mean length7.0257154
Min length1

Characters and Unicode

Total characters70215
Distinct characters459
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique709 ?
Unique (%)7.1%

Sample

1st row주식회사 제이비엠그룹
2nd row영케미칼
3rd row코리아본뱅크
4th row오메드
5th row올림푸스한국㈜
ValueCountFrequency (%)
한국존슨앤드존슨메디칼 372
 
3.5%
주식회사 292
 
2.8%
메드트로닉코리아 228
 
2.2%
한국스트라이커 215
 
2.0%
짐머바이오메트코리아 182
 
1.7%
비브라운코리아 135
 
1.3%
스미스앤드네퓨 112
 
1.1%
원바이오젠 111
 
1.1%
보스톤사이언티픽코리아 96
 
0.9%
시지바이오 92
 
0.9%
Other values (1774) 8678
82.5%
2024-04-20T21:37:21.011358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4066
 
5.8%
3153
 
4.5%
3010
 
4.3%
2775
 
4.0%
2460
 
3.5%
2252
 
3.2%
2181
 
3.1%
2181
 
3.1%
2040
 
2.9%
) 1841
 
2.6%
Other values (449) 44256
63.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 65366
93.1%
Close Punctuation 1841
 
2.6%
Open Punctuation 1840
 
2.6%
Space Separator 577
 
0.8%
Other Symbol 434
 
0.6%
Uppercase Letter 144
 
0.2%
Dash Punctuation 8
 
< 0.1%
Other Punctuation 4
 
< 0.1%
Decimal Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4066
 
6.2%
3153
 
4.8%
3010
 
4.6%
2775
 
4.2%
2460
 
3.8%
2252
 
3.4%
2181
 
3.3%
2181
 
3.3%
2040
 
3.1%
1522
 
2.3%
Other values (425) 39726
60.8%
Uppercase Letter
ValueCountFrequency (%)
K 28
19.4%
B 19
13.2%
E 18
12.5%
H 10
 
6.9%
A 10
 
6.9%
M 9
 
6.2%
I 9
 
6.2%
S 9
 
6.2%
C 7
 
4.9%
L 5
 
3.5%
Other values (7) 20
13.9%
Close Punctuation
ValueCountFrequency (%)
) 1841
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1840
100.0%
Space Separator
ValueCountFrequency (%)
577
100.0%
Other Symbol
ValueCountFrequency (%)
434
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%
Other Punctuation
ValueCountFrequency (%)
. 4
100.0%
Decimal Number
ValueCountFrequency (%)
2 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 65800
93.7%
Common 4271
 
6.1%
Latin 144
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4066
 
6.2%
3153
 
4.8%
3010
 
4.6%
2775
 
4.2%
2460
 
3.7%
2252
 
3.4%
2181
 
3.3%
2181
 
3.3%
2040
 
3.1%
1522
 
2.3%
Other values (426) 40160
61.0%
Latin
ValueCountFrequency (%)
K 28
19.4%
B 19
13.2%
E 18
12.5%
H 10
 
6.9%
A 10
 
6.9%
M 9
 
6.2%
I 9
 
6.2%
S 9
 
6.2%
C 7
 
4.9%
L 5
 
3.5%
Other values (7) 20
13.9%
Common
ValueCountFrequency (%)
) 1841
43.1%
( 1840
43.1%
577
 
13.5%
- 8
 
0.2%
. 4
 
0.1%
2 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 65366
93.1%
ASCII 4415
 
6.3%
None 434
 
0.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4066
 
6.2%
3153
 
4.8%
3010
 
4.6%
2775
 
4.2%
2460
 
3.8%
2252
 
3.4%
2181
 
3.3%
2181
 
3.3%
2040
 
3.1%
1522
 
2.3%
Other values (425) 39726
60.8%
ASCII
ValueCountFrequency (%)
) 1841
41.7%
( 1840
41.7%
577
 
13.1%
K 28
 
0.6%
B 19
 
0.4%
E 18
 
0.4%
H 10
 
0.2%
A 10
 
0.2%
M 9
 
0.2%
I 9
 
0.2%
Other values (13) 54
 
1.2%
None
ValueCountFrequency (%)
434
100.0%

비고1
Text

MISSING 

Distinct123
Distinct (%)4.6%
Missing7354
Missing (%)73.5%
Memory size156.2 KiB
2024-04-20T21:37:21.569778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length149
Median length131
Mean length43.805745
Min length9

Characters and Unicode

Total characters115910
Distinct characters139
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique59 ?
Unique (%)2.2%

Sample

1st row본인부담률 80% / 고시 제2022-175호(2022.8.1.적용) 관련 본인부담률 90%
2nd row고시 제2019-80호(2019.5.1.적용) 관련 선별급여 본인부담률 80% 적용/ 내시경적 점막하 박리절제술 (ENDOSCOPIC SUBMUCOSAL DISSECTION,ESD)용 KNIFE→내시경적 시술용 KNIFE 중분류명 변경
3rd row고시 제2018-281호(2019.1.1.적용) 관련 선별급여 본인부담률 80% 적용
4th row고시 제2019-80호(2019.5.1.적용) 관련 선별급여 본인부담률 80% 적용
5th row본인부담률 80%
ValueCountFrequency (%)
본인부담률 2663
15.5%
80 2366
13.7%
관련 2094
12.2%
고시 2092
12.1%
적용 2062
12.0%
선별급여 2055
11.9%
제2019-80호(2019.5.1.적용 1126
6.5%
527
 
3.1%
제2018-281호(2019.1.1.적용 434
 
2.5%
제2021-48호(2021.7.1.적용 329
 
1.9%
Other values (152) 1481
8.6%
2024-04-20T21:37:22.516552image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
14597
 
12.6%
0 9401
 
8.1%
1 8115
 
7.0%
2 7557
 
6.5%
. 7421
 
6.4%
8 5158
 
4.5%
4553
 
3.9%
4547
 
3.9%
9 3187
 
2.7%
% 2824
 
2.4%
Other values (129) 48550
41.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 45163
39.0%
Decimal Number 36586
31.6%
Space Separator 14597
 
12.6%
Other Punctuation 10911
 
9.4%
Open Punctuation 2548
 
2.2%
Close Punctuation 2548
 
2.2%
Dash Punctuation 2479
 
2.1%
Uppercase Letter 984
 
0.8%
Math Symbol 94
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4553
 
10.1%
4547
 
10.1%
2685
 
5.9%
2685
 
5.9%
2664
 
5.9%
2663
 
5.9%
2663
 
5.9%
2483
 
5.5%
2473
 
5.5%
2201
 
4.9%
Other values (81) 15546
34.4%
Uppercase Letter
ValueCountFrequency (%)
S 103
10.5%
B 90
 
9.1%
E 90
 
9.1%
I 82
 
8.3%
O 72
 
7.3%
N 68
 
6.9%
C 66
 
6.7%
M 61
 
6.2%
D 52
 
5.3%
K 49
 
5.0%
Other values (16) 251
25.5%
Decimal Number
ValueCountFrequency (%)
0 9401
25.7%
1 8115
22.2%
2 7557
20.7%
8 5158
14.1%
9 3187
 
8.7%
5 1886
 
5.2%
7 570
 
1.6%
4 507
 
1.4%
3 132
 
0.4%
6 73
 
0.2%
Other Punctuation
ValueCountFrequency (%)
. 7421
68.0%
% 2824
 
25.9%
/ 538
 
4.9%
, 126
 
1.2%
& 2
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
79
84.0%
~ 14
 
14.9%
+ 1
 
1.1%
Space Separator
ValueCountFrequency (%)
14597
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2548
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2548
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2479
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 69763
60.2%
Hangul 45163
39.0%
Latin 984
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4553
 
10.1%
4547
 
10.1%
2685
 
5.9%
2685
 
5.9%
2664
 
5.9%
2663
 
5.9%
2663
 
5.9%
2483
 
5.5%
2473
 
5.5%
2201
 
4.9%
Other values (81) 15546
34.4%
Latin
ValueCountFrequency (%)
S 103
10.5%
B 90
 
9.1%
E 90
 
9.1%
I 82
 
8.3%
O 72
 
7.3%
N 68
 
6.9%
C 66
 
6.7%
M 61
 
6.2%
D 52
 
5.3%
K 49
 
5.0%
Other values (16) 251
25.5%
Common
ValueCountFrequency (%)
14597
20.9%
0 9401
13.5%
1 8115
11.6%
2 7557
10.8%
. 7421
10.6%
8 5158
 
7.4%
9 3187
 
4.6%
% 2824
 
4.0%
( 2548
 
3.7%
) 2548
 
3.7%
Other values (12) 6407
9.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 70668
61.0%
Hangul 45163
39.0%
Arrows 79
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
14597
20.7%
0 9401
13.3%
1 8115
11.5%
2 7557
10.7%
. 7421
10.5%
8 5158
 
7.3%
9 3187
 
4.5%
% 2824
 
4.0%
( 2548
 
3.6%
) 2548
 
3.6%
Other values (37) 7312
10.3%
Hangul
ValueCountFrequency (%)
4553
 
10.1%
4547
 
10.1%
2685
 
5.9%
2685
 
5.9%
2664
 
5.9%
2663
 
5.9%
2663
 
5.9%
2483
 
5.5%
2473
 
5.5%
2201
 
4.9%
Other values (81) 15546
34.4%
Arrows
ValueCountFrequency (%)
79
100.0%

비고2
Categorical

IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
7992 
중복인정여부 Y
2007 
중복인정여부 Y
 
1

Length

Max length9
Median length4
Mean length4.8033
Min length4

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row중복인정여부 Y

Common Values

ValueCountFrequency (%)
<NA> 7992
79.9%
중복인정여부 Y 2007
 
20.1%
중복인정여부 Y 1
 
< 0.1%

Length

2024-04-20T21:37:22.735951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-20T21:37:22.907073image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 7992
66.6%
중복인정여부 2008
 
16.7%
y 2008
 
16.7%

Interactions

2024-04-20T21:36:56.865744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-20T21:37:23.020705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
단위상한금액비고2
단위1.0000.0000.000
상한금액0.0001.0000.000
비고20.0000.0001.000
2024-04-20T21:37:23.189645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
상한금액비고2
상한금액1.0000.000
비고20.0001.000

Missing values

2024-04-20T21:36:57.242714image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-20T21:36:57.807738image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-04-20T21:36:58.241512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

구분코드적용일자품명규격단위상한금액제조회사재질수입업소비고1비고2
20335급여 품목(인체조직 포함)TFF030952023-04-01FASCIA100CM² 이상1EA499200COMMUNITY BLOOD CENTER/COMMUNITY TISSUE SERVICESFASCIA LATA주식회사 제이비엠그룹<NA><NA>
43582(비급여 품목)삭제 및 삭제예정 품목BM5018CU2021-07-01ADFLEX BANDAGE9.0CMX15.0CM(5.0CMX10.0CM)1EA0YOUNG CHEMICAL폴리우레탄필름/폴리아크릴알킬에스텔에멀전/부직포/폴리에칠렌망영케미칼<NA><NA>
32127삭제 및 삭제 예정 품목C21000042006-08-01TIBIALIS POSTERIOR TENDON전규격1EA0REGENERATION TECHNOLOGY INCHUMAN코리아본뱅크<NA><NA>
21763100분의 852미만 본인부담 품목K92030302023-04-01THE ARTERY COMPRESSION TOURNIQUET (FOR RADIAL ARTERY)전규격1EA13910HANGZHOU SHANYOU MEDICAL EQUIPMENT CO. LTDPC100%, SILICONE 100%오메드본인부담률 80% / 고시 제2022-175호(2022.8.1.적용) 관련 본인부담률 90%<NA>
7976급여 품목(인체조직 포함)J24018012023-04-01TRIANGLE TIP KNIFE J전규격1EA215680OLYMPUS MEDICAL SYSTEMS CORPORATIONSTAINLESS STEEL 304 등올림푸스한국㈜고시 제2019-80호(2019.5.1.적용) 관련 선별급여 본인부담률 80% 적용/ 내시경적 점막하 박리절제술 (ENDOSCOPIC SUBMUCOSAL DISSECTION,ESD)용 KNIFE→내시경적 시술용 KNIFE 중분류명 변경중복인정여부 Y
4965급여 품목(인체조직 포함)E20021042023-04-01NEXGEN LCCK FEMORAL COMPONENT전규격1EA1875640ZIMMERCO.CR.MO.ALLOY짐머바이오메트코리아<NA><NA>
27132급여중지 및 급여중지 예정 품목G82014042022-01-01ENTICOS S전규격1EA0BIOTRONIK SE&CO.KGTITANIUM 등바이오트로닉코리아㈜<NA><NA>
40622비급여 품목(인체조직 포함)BM5119HF2022-07-01가드픽스전규격1EA0EVERAID폴리우레탄필름, 부직포 등에버레이드(주)<NA><NA>
19780급여 품목(인체조직 포함)TBE622012023-04-01DISTAL FEMUR HEMI, MEDIALW/O CARTILAGE1EA1507020셀루메드FEMUR셀루메드고시 제2018-281호(2019.1.1.적용) 관련 선별급여 본인부담률 80% 적용중복인정여부 Y
10940급여 품목(인체조직 포함)K50112042023-04-01SILICONE FOLEY CATHTER(T)3WAY1EA4700SE-WOONSILICONE세운메디칼고시 제2019-80호(2019.5.1.적용) 관련 선별급여 본인부담률 80% 적용중복인정여부 Y
구분코드적용일자품명규격단위상한금액제조회사재질수입업소비고1비고2
4512급여 품목(인체조직 포함)D12135192023-04-01TREU DISPOSABLE CANNULA전규격1EA43710SMG INC.폴리카보네이트+니티놀+스테인레스스틸(주)에스엠지<NA><NA>
16844급여 품목(인체조직 포함)M21340082023-04-01G-PLATE전규격1EA2020HUREV알루미늄박, 겔 등(주)휴레브<NA><NA>
36907삭제 및 삭제 예정 품목TTA020052014-05-01ACHILLES TENDON W/BONE HEMIHEMI1EA0COMMUNITY TISSUE SERVICESACHILLES TENDON W/BONE셀루메드<NA><NA>
12090급여 품목(인체조직 포함)K72010392023-04-01탄력붕대15CM X 215CM1EA650SHAOXING HOSMED MEDICAL PRODUCTS CO.LTDCOTTON하우스메디칼고시 제2019-80호(2019.5.1.적용) 관련 선별급여 본인부담률 80% 적용중복인정여부 Y
5481급여 품목(인체조직 포함)F00010422023-04-014CIS ACP SYSTEM전규격1EA441930SOLCO BIOMEDICALTITANIUM ALLOY(주)솔고바이오메디칼<NA><NA>
43346(비급여 품목)삭제 및 삭제예정 품목BM5002BF2021-07-01비씨플라스터 멸균반창고전규격1EA0LIBATAPE PHARMACEUTICAL CO.,LTD면과 폴리에스테르 혼합물, 합성고무접착제, 부직포(주)나음케어<NA><NA>
25138급여중지 및 급여중지 예정 품목C54787242021-01-01CLAVICLE PLATE전규격1EA0TRAUSON MEDICAL INSTRUMENT CO., LTDTITANIUM한국스트라이커<NA><NA>
30200급여중지 및 급여중지 예정 품목L30110072014-11-01CHECKCLEAN ONE PIECE COLOSTOMY전규격1EA0C&C MEDICAL-씨앤씨메디칼<NA><NA>
43006(비급여 품목)삭제 및 삭제예정 품목BM1303XE2020-07-01DUAL FILTER SYRINGE전규격1EA0IMT KOREA스테인레스강, 폴리프로필렌, 다이메틸 폴리실록세인, 에폭시(주)아이엠티코리아<NA><NA>
6404급여 품목(인체조직 포함)F14120982023-04-01GUARDIAN전규격1EA575950BM KOREATHERMOPLASTIC POLYURETHANE등비엠코리아<NA><NA>