Overview

Dataset statistics

Number of variables: 11
Number of observations: 10000
Missing cells: 68
Missing cells (%): 0.1%
Duplicate rows: 1
Duplicate rows (%): < 0.1%
Total size in memory: 937.5 KiB
Average record size in memory: 96.0 B
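
The headline figures above can be cross-checked with simple arithmetic. The sketch below uses only numbers copied from this report; the variable names are illustrative.

```python
# Cross-check of the dataset statistics reported above.
# All inputs are copied from the report; the variable names are illustrative.
n_variables = 11
n_observations = 10_000
missing_cells = 68
total_size_kib = 937.5

total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells            # share of empty cells
avg_record_bytes = total_size_kib * 1024 / n_observations  # KiB -> bytes per row

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

The 68 missing cells are also the sum of the per-variable Missing counts reported further down (20 and 48); every other variable reports 0.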

Variable types

Categorical: 4
Text: 7

Dataset

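Description: Current approval status of veterinary pharmaceuticals and related products, classified into veterinary drugs (동물용의약품), veterinary quasi-drugs (동물용의약외품), and veterinary medical devices (동물용의료기기). Provides the approval number, business type, product name, ingredient name (Korean and English), approval date, and company name.
Author: Animal and Plant Quarantine Agency, Ministry of Agriculture, Food and Rural Affairs (농림축산식품부 농림축산검역본부)
URL: https://www.data.go.kr/data/15112723/fileData.do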

Alerts

Duplicates: Dataset has 1 (< 0.1%) duplicate rows
Imbalance: 허가유형 is highly imbalanced (80.6%)
Imbalance: 품목형태 is highly imbalanced (77.8%)
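
The two imbalance percentages are consistent with a normalized-entropy score: one minus the Shannon entropy of the class distribution divided by its maximum (log2 of the number of classes). The sketch below applies that formula to the value counts this report lists for 허가유형 and 품목형태. The function name `imbalance_score` is illustrative and is not taken from the ydata-profiling API.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / H_max for a list of class counts.

    0 means perfectly balanced; values near 1 mean one class dominates.
    """
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    max_entropy = math.log2(len(counts))
    # With a single class there is no maximum entropy to compare against;
    # returning 0.0 here is a convention chosen for this sketch.
    return 1 - entropy / max_entropy if max_entropy > 0 else 0.0

approval_type = imbalance_score([9701, 299])       # 허가유형: A, D
product_form = imbalance_score([9470, 282, 248])   # 품목형태: C, M, O
print(f"{approval_type:.1%}, {product_form:.1%}")
```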

Reproduction

Analysis started: 2023-12-12 09:08:32.757231
Analysis finished: 2023-12-12 09:08:35.800540
Duration: 3.04 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

용도 (use category)
Categorical

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

동물용의약품: 5915
동물용의료기기: 3211
동물용의약외품: 874

Length

Max length: 7
Median length: 6
Mean length: 6.4085
Min length: 6
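
The length statistics follow directly from the value counts of this variable. A small sketch that recomputes the mean, minimum, and maximum length from the three category values and their counts as given in this report:

```python
# Value counts for 용도, copied from this report.
value_counts = {
    "동물용의약품": 5915,
    "동물용의료기기": 3211,
    "동물용의약외품": 874,
}

total = sum(value_counts.values())
# len() counts Unicode code points, so each precomposed Hangul syllable is one character.
mean_length = sum(len(value) * n for value, n in value_counts.items()) / total
lengths = sorted(len(value) for value in value_counts)

print(mean_length, lengths[0], lengths[-1])
```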

Unique

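Unique: 0
Unique (%): 0.0%

Sample

1st row: 동물용의약품
2nd row: 동물용의약품
3rd row: 동물용의약품
4th row: 동물용의약외품
5th row: 동물용의료기기

Common Values

Value | Count | Frequency (%)
동물용의약품 (veterinary drugs) | 5915 | 59.2%
동물용의료기기 (veterinary medical devices) | 3211 | 32.1%
동물용의약외품 (veterinary quasi-drugs) | 874 | 8.7%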

Length

Histogram of lengths of the category

Common Values (Plot)

[Figure: plot of the value counts listed under Common Values]

허가번호 (approval number)
Text

Distinct: 8900
Distinct (%): 89.0%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 7
Median length: 7
Mean length: 7
Min length: 7

Characters and Unicode

Total characters: 70000
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
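
As an illustration of these character properties, the sketch below tallies the Unicode general category of every character in five approval-number values of the form shown in this report (for example 283-072). It relies only on Python's standard `unicodedata` module; the variable names are illustrative.

```python
import unicodedata
from collections import Counter

# Five approval-number values of the form shown in this report.
samples = ["283-072", "020-239", "033-212", "016-161", "049-006"]

# Tally the general category of every character,
# e.g. "Nd" (Decimal Number) or "Pd" (Dash Punctuation).
categories = Counter(
    unicodedata.category(ch) for value in samples for ch in value
)

for code, count in categories.most_common():
    print(code, count)
```

Each value contributes six digits and one dash, which matches the 60000 Decimal Number and 10000 Dash Punctuation characters the report counts across all 10000 rows.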

Unique

Unique: 7937
Unique (%): 79.4%
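
Distinct counts the different values that occur at all, whereas Unique counts the values that occur exactly once, which is why 8900 distinct approval numbers include only 7937 unique ones. A small sketch of the difference, using a made-up list that is not taken from the dataset:

```python
from collections import Counter

# Hypothetical approval numbers, for illustration only.
values = ["037-007", "037-007", "381-001", "140-001", "140-001", "212-001"]

counts = Counter(values)
n_distinct = len(counts)                               # different values present
n_unique = sum(1 for c in counts.values() if c == 1)   # values seen exactly once

print(n_distinct, n_unique)
```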

Sample

1st row: 283-072
2nd row: 020-239
3rd row: 033-212
4th row: 016-161
5th row: 049-006

Value | Count | Frequency (%)
037-007 | 5 | < 0.1%
381-001 | 4 | < 0.1%
140-001 | 4 | < 0.1%
212-001 | 4 | < 0.1%
128-005 | 4 | < 0.1%
104-001 | 4 | < 0.1%
054-002 | 4 | < 0.1%
244-001 | 3 | < 0.1%
121-001 | 3 | < 0.1%
301-001 | 3 | < 0.1%
Other values (8890) | 9962 | 99.6%

Most occurring characters

ValueCountFrequency (%)
0 19092
27.3%
1 10232
14.6%
- 10000
14.3%
2 7048
 
10.1%
3 5480
 
7.8%
4 3980
 
5.7%
6 3356
 
4.8%
5 2880
 
4.1%
8 2753
 
3.9%
9 2751
 
3.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 60000
85.7%
Dash Punctuation 10000
 
14.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 19092
31.8%
1 10232
17.1%
2 7048
 
11.7%
3 5480
 
9.1%
4 3980
 
6.6%
6 3356
 
5.6%
5 2880
 
4.8%
8 2753
 
4.6%
9 2751
 
4.6%
7 2428
 
4.0%
Dash Punctuation
ValueCountFrequency (%)
- 10000
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 70000
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 19092
27.3%
1 10232
14.6%
- 10000
14.3%
2 7048
 
10.1%
3 5480
 
7.8%
4 3980
 
5.7%
6 3356
 
4.8%
5 2880
 
4.1%
8 2753
 
3.9%
9 2751
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 70000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 19092
27.3%
1 10232
14.6%
- 10000
14.3%
2 7048
 
10.1%
3 5480
 
7.8%
4 3980
 
5.7%
6 3356
 
4.8%
5 2880
 
4.1%
8 2753
 
3.9%
9 2751
 
3.9%

업종 (business type)
Categorical

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

제조: 6984
수입: 3012
위탁제조판매: 4

Length

Max length: 6
Median length: 2
Mean length: 2.0016
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 수입
2nd row: 제조
3rd row: 제조
4th row: 제조
5th row: 수입

Common Values

Value | Count | Frequency (%)
제조 (manufacturing) | 6984 | 69.8%
수입 (import) | 3012 | 30.1%
위탁제조판매 (contract manufacturing and sales) | 4 | < 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

[Figure: plot of the value counts listed under Common Values]

품목명 (product name)
Text

Distinct: 9936
Distinct (%): 99.4%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 147
Median length: 83
Mean length: 26.4942
Min length: 2

Characters and Unicode

Total characters: 264942
Distinct characters: 840
Distinct categories: 18
Distinct scripts: 4
Distinct blocks: 8
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 9896
Unique (%): 99.0%

Sample

1st row: 테르메딘 정(테르비나핀염산염)(Termedin tablet)
2nd row: 마보맥스 10 주(Marbomax 10 Inj.)
3rd row: 안티콕시(수출용)(ANTI-COCCI)
4th row: 글루타-에프(GLUTA-F)
5th row: 제각기[1](180150외 5종)
ValueCountFrequency (%)
428
 
1.4%
kit 418
 
1.4%
inj 374
 
1.2%
수출용 329
 
1.1%
test 299
 
1.0%
ag 204
 
0.7%
powder 186
 
0.6%
대성 183
 
0.6%
ab 178
 
0.6%
solution 172
 
0.6%
Other values (14998) 27343
90.8%

Most occurring characters

ValueCountFrequency (%)
20114
 
7.6%
( 10164
 
3.8%
) 10152
 
3.8%
e 6809
 
2.6%
i 6128
 
2.3%
0 5137
 
1.9%
o 5033
 
1.9%
a 4656
 
1.8%
n 4656
 
1.8%
A 4523
 
1.7%
Other values (830) 187570
70.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 83667
31.6%
Lowercase Letter 58681
22.1%
Uppercase Letter 51519
19.4%
Space Separator 20114
 
7.6%
Decimal Number 17235
 
6.5%
Open Punctuation 13336
 
5.0%
Close Punctuation 13325
 
5.0%
Dash Punctuation 4198
 
1.6%
Other Punctuation 2611
 
1.0%
Math Symbol 112
 
< 0.1%
Other values (8) 144
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3503
 
4.2%
2547
 
3.0%
2159
 
2.6%
2024
 
2.4%
2014
 
2.4%
1840
 
2.2%
1407
 
1.7%
1338
 
1.6%
1317
 
1.6%
1225
 
1.5%
Other values (725) 64293
76.8%
Lowercase Letter
ValueCountFrequency (%)
e 6809
11.6%
i 6128
10.4%
o 5033
 
8.6%
a 4656
 
7.9%
n 4656
 
7.9%
t 4183
 
7.1%
l 3714
 
6.3%
r 3603
 
6.1%
c 2568
 
4.4%
s 2560
 
4.4%
Other values (19) 14771
25.2%
Uppercase Letter
ValueCountFrequency (%)
A 4523
 
8.8%
I 3715
 
7.2%
E 3707
 
7.2%
C 3649
 
7.1%
S 3436
 
6.7%
T 3273
 
6.4%
P 3133
 
6.1%
R 2753
 
5.3%
O 2607
 
5.1%
N 2559
 
5.0%
Other values (17) 18164
35.3%
Other Punctuation
ValueCountFrequency (%)
. 1245
47.7%
, 712
27.3%
/ 296
 
11.3%
120
 
4.6%
% 72
 
2.8%
· 48
 
1.8%
? 43
 
1.6%
& 42
 
1.6%
: 20
 
0.8%
* 3
 
0.1%
Other values (4) 10
 
0.4%
Decimal Number
ValueCountFrequency (%)
0 5137
29.8%
2 3488
20.2%
1 3333
19.3%
3 1649
 
9.6%
5 1408
 
8.2%
4 801
 
4.6%
6 458
 
2.7%
7 362
 
2.1%
8 311
 
1.8%
9 288
 
1.7%
Other Symbol
ValueCountFrequency (%)
90
83.3%
8
 
7.4%
5
 
4.6%
4
 
3.7%
1
 
0.9%
Open Punctuation
ValueCountFrequency (%)
( 10164
76.2%
[ 3154
 
23.7%
18
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 10152
76.2%
] 3137
 
23.5%
36
 
0.3%
Letter Number
ValueCountFrequency (%)
4
40.0%
3
30.0%
3
30.0%
Dash Punctuation
ValueCountFrequency (%)
- 4195
99.9%
3
 
0.1%
Math Symbol
ValueCountFrequency (%)
+ 106
94.6%
6
 
5.4%
Space Separator
ValueCountFrequency (%)
20114
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 18
100.0%
Final Punctuation
ValueCountFrequency (%)
3
100.0%
Currency Symbol
ValueCountFrequency (%)
¤ 2
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%
Other Number
ValueCountFrequency (%)
1
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 110129
41.6%
Hangul 83672
31.6%
Common 71060
26.8%
Greek 81
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3503
 
4.2%
2547
 
3.0%
2159
 
2.6%
2024
 
2.4%
2014
 
2.4%
1840
 
2.2%
1407
 
1.7%
1338
 
1.6%
1317
 
1.6%
1225
 
1.5%
Other values (726) 64298
76.8%
Latin
ValueCountFrequency (%)
e 6809
 
6.2%
i 6128
 
5.6%
o 5033
 
4.6%
a 4656
 
4.2%
n 4656
 
4.2%
A 4523
 
4.1%
t 4183
 
3.8%
I 3715
 
3.4%
l 3714
 
3.4%
E 3707
 
3.4%
Other values (47) 63005
57.2%
Common
ValueCountFrequency (%)
20114
28.3%
( 10164
14.3%
) 10152
14.3%
0 5137
 
7.2%
- 4195
 
5.9%
2 3488
 
4.9%
1 3333
 
4.7%
[ 3154
 
4.4%
] 3137
 
4.4%
3 1649
 
2.3%
Other values (35) 6537
 
9.2%
Greek
ValueCountFrequency (%)
μ 79
97.5%
α 2
 
2.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 180832
68.3%
Hangul 83666
31.6%
None 323
 
0.1%
Letterlike Symbols 90
 
< 0.1%
CJK Compat 13
 
< 0.1%
Number Forms 10
 
< 0.1%
Punctuation 7
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
20114
 
11.1%
( 10164
 
5.6%
) 10152
 
5.6%
e 6809
 
3.8%
i 6128
 
3.4%
0 5137
 
2.8%
o 5033
 
2.8%
a 4656
 
2.6%
n 4656
 
2.6%
A 4523
 
2.5%
Other values (71) 103460
57.2%
Hangul
ValueCountFrequency (%)
3503
 
4.2%
2547
 
3.0%
2159
 
2.6%
2024
 
2.4%
2014
 
2.4%
1840
 
2.2%
1407
 
1.7%
1338
 
1.6%
1317
 
1.6%
1225
 
1.5%
Other values (724) 64292
76.8%
None
ValueCountFrequency (%)
120
37.2%
μ 79
24.5%
· 48
 
14.9%
36
 
11.1%
18
 
5.6%
6
 
1.9%
5
 
1.5%
¡ 2
 
0.6%
α 2
 
0.6%
¤ 2
 
0.6%
Other values (4) 5
 
1.5%
Letterlike Symbols
ValueCountFrequency (%)
90
100.0%
CJK Compat
ValueCountFrequency (%)
8
61.5%
4
30.8%
1
 
7.7%
Number Forms
ValueCountFrequency (%)
4
40.0%
3
30.0%
3
30.0%
Punctuation
ValueCountFrequency (%)
3
42.9%
3
42.9%
1
 
14.3%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

허가유형 (approval type)
Categorical

IMBALANCE

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

A: 9701
D: 299

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: A
2nd row: A
3rd row: A
4th row: A
5th row: D

Common Values

Value | Count | Frequency (%)
A | 9701 | 97.0%
D | 299 | 3.0%

Length

Histogram of lengths of the category

Common Values (Plot)

[Figure: plot of the value counts listed under Common Values]

품목형태 (product form)
Categorical

IMBALANCE

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

C: 9470
M: 282
O: 248

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: C
2nd row: C
3rd row: C
4th row: M
5th row: C

Common Values

Value | Count | Frequency (%)
C | 9470 | 94.7%
M | 282 | 2.8%
O | 248 | 2.5%

Length

Histogram of lengths of the category

Common Values (Plot)

[Figure: plot of the value counts listed under Common Values]

성분코드 (ingredient code)
Text

Distinct: 1709
Distinct (%): 17.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 9
Median length: 9
Mean length: 9
Min length: 9

Characters and Unicode

Total characters: 90000
Distinct characters: 28
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 745
Unique (%): 7.4%

Sample

1st row: IE1010104
2nd row: IA5010127
3rd row: IX8030258
4th row: KA5010140
5th row: LD0200400
ValueCountFrequency (%)
hm2210301 232
 
2.3%
lc0401400 219
 
2.2%
io1810107 202
 
2.0%
lc0400100 175
 
1.8%
ia5010124 147
 
1.5%
lc0401200 145
 
1.5%
io1010104 122
 
1.2%
lc0401300 121
 
1.2%
pa1000106 120
 
1.2%
kc2000100 111
 
1.1%
Other values (1699) 8406
84.1%

Most occurring characters

ValueCountFrequency (%)
0 30004
33.3%
1 16118
17.9%
2 6864
 
7.6%
3 5163
 
5.7%
A 4176
 
4.6%
4 3234
 
3.6%
L 3208
 
3.6%
I 3044
 
3.4%
8 2700
 
3.0%
C 2008
 
2.2%
Other values (18) 13481
15.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 69580
77.3%
Uppercase Letter 20420
 
22.7%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A 4176
20.5%
L 3208
15.7%
I 3044
14.9%
C 2008
9.8%
O 1720
8.4%
H 1174
 
5.7%
B 1000
 
4.9%
K 746
 
3.7%
N 716
 
3.5%
Z 694
 
3.4%
Other values (8) 1934
9.5%
Decimal Number
ValueCountFrequency (%)
0 30004
43.1%
1 16118
23.2%
2 6864
 
9.9%
3 5163
 
7.4%
4 3234
 
4.6%
8 2700
 
3.9%
6 1826
 
2.6%
5 1779
 
2.6%
7 1098
 
1.6%
9 794
 
1.1%

Most occurring scripts

ValueCountFrequency (%)
Common 69580
77.3%
Latin 20420
 
22.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
A 4176
20.5%
L 3208
15.7%
I 3044
14.9%
C 2008
9.8%
O 1720
8.4%
H 1174
 
5.7%
B 1000
 
4.9%
K 746
 
3.7%
N 716
 
3.5%
Z 694
 
3.4%
Other values (8) 1934
9.5%
Common
ValueCountFrequency (%)
0 30004
43.1%
1 16118
23.2%
2 6864
 
9.9%
3 5163
 
7.4%
4 3234
 
4.6%
8 2700
 
3.9%
6 1826
 
2.6%
5 1779
 
2.6%
7 1098
 
1.6%
9 794
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 90000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 30004
33.3%
1 16118
17.9%
2 6864
 
7.6%
3 5163
 
5.7%
A 4176
 
4.6%
4 3234
 
3.6%
L 3208
 
3.6%
I 3044
 
3.4%
8 2700
 
3.0%
C 2008
 
2.2%
Other values (18) 13481
15.0%

성분명(국문) (ingredient name, Korean)
Text

Distinct: 1640
Distinct (%): 16.4%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 61
Median length: 41
Mean length: 10.0888
Min length: 2

Characters and Unicode

Total characters: 100888
Distinct characters: 584
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 715
Unique (%): 7.1%

Sample

1st row: 염산테르비나핀
2nd row: 마보플록사신
3rd row: 암프롤리움+설파퀴녹살린
4th row: 글루타알데하이드
5th row: 제각기[1]
ValueCountFrequency (%)
기타영양공급약 300
 
2.7%
비타민 230
 
2.1%
저위험성동물전염병면역검사시약[2 219
 
2.0%
플로르페니콜 204
 
1.9%
면역화학검사시약[2 175
 
1.6%
엔로플록사신 167
 
1.5%
인수공통전염병면역검사시약[3 145
 
1.3%
아목시실린 122
 
1.1%
고위험성동물전염병면역검사시약[3 121
 
1.1%
물티슈 120
 
1.1%
Other values (1755) 9206
83.6%

Most occurring characters

ValueCountFrequency (%)
] 3208
 
3.2%
[ 3208
 
3.2%
+ 2404
 
2.4%
2280
 
2.3%
2181
 
2.2%
2079
 
2.1%
1834
 
1.8%
1787
 
1.8%
1671
 
1.7%
2 1614
 
1.6%
Other values (574) 78622
77.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 85617
84.9%
Decimal Number 3584
 
3.6%
Close Punctuation 3469
 
3.4%
Open Punctuation 3469
 
3.4%
Math Symbol 2456
 
2.4%
Space Separator 1009
 
1.0%
Uppercase Letter 763
 
0.8%
Other Punctuation 333
 
0.3%
Lowercase Letter 120
 
0.1%
Dash Punctuation 64
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2280
 
2.7%
2181
 
2.5%
2079
 
2.4%
1834
 
2.1%
1787
 
2.1%
1671
 
2.0%
1595
 
1.9%
1570
 
1.8%
1467
 
1.7%
1416
 
1.7%
Other values (507) 67737
79.1%
Lowercase Letter
ValueCountFrequency (%)
e 13
 
10.8%
l 13
 
10.8%
r 12
 
10.0%
m 10
 
8.3%
i 9
 
7.5%
n 8
 
6.7%
a 7
 
5.8%
u 5
 
4.2%
y 5
 
4.2%
o 5
 
4.2%
Other values (11) 33
27.5%
Uppercase Letter
ValueCountFrequency (%)
E 174
22.8%
A 118
15.5%
C 107
14.0%
D 105
13.8%
B 75
9.8%
F 57
 
7.5%
H 18
 
2.4%
V 18
 
2.4%
L 11
 
1.4%
I 10
 
1.3%
Other values (9) 70
9.2%
Decimal Number
ValueCountFrequency (%)
2 1614
45.0%
1 979
27.3%
3 818
22.8%
4 115
 
3.2%
0 19
 
0.5%
8 14
 
0.4%
6 11
 
0.3%
9 8
 
0.2%
5 5
 
0.1%
7 1
 
< 0.1%
Other Punctuation
ValueCountFrequency (%)
/ 234
70.3%
, 43
 
12.9%
. 25
 
7.5%
? 22
 
6.6%
· 6
 
1.8%
% 2
 
0.6%
: 1
 
0.3%
Math Symbol
ValueCountFrequency (%)
+ 2404
97.9%
~ 47
 
1.9%
5
 
0.2%
Close Punctuation
ValueCountFrequency (%)
] 3208
92.5%
) 261
 
7.5%
Open Punctuation
ValueCountFrequency (%)
[ 3208
92.5%
( 261
 
7.5%
Space Separator
ValueCountFrequency (%)
1009
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 64
100.0%
Letter Number
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 85617
84.9%
Common 14384
 
14.3%
Latin 887
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2280
 
2.7%
2181
 
2.5%
2079
 
2.4%
1834
 
2.1%
1787
 
2.1%
1671
 
2.0%
1595
 
1.9%
1570
 
1.8%
1467
 
1.7%
1416
 
1.7%
Other values (507) 67737
79.1%
Latin
ValueCountFrequency (%)
E 174
19.6%
A 118
13.3%
C 107
12.1%
D 105
11.8%
B 75
 
8.5%
F 57
 
6.4%
H 18
 
2.0%
V 18
 
2.0%
e 13
 
1.5%
l 13
 
1.5%
Other values (31) 189
21.3%
Common
ValueCountFrequency (%)
] 3208
22.3%
[ 3208
22.3%
+ 2404
16.7%
2 1614
11.2%
1009
 
7.0%
1 979
 
6.8%
3 818
 
5.7%
) 261
 
1.8%
( 261
 
1.8%
/ 234
 
1.6%
Other values (16) 388
 
2.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 85617
84.9%
ASCII 15256
 
15.1%
None 11
 
< 0.1%
Number Forms 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
] 3208
21.0%
[ 3208
21.0%
+ 2404
15.8%
2 1614
10.6%
1009
 
6.6%
1 979
 
6.4%
3 818
 
5.4%
) 261
 
1.7%
( 261
 
1.7%
/ 234
 
1.5%
Other values (54) 1260
 
8.3%
Hangul
ValueCountFrequency (%)
2280
 
2.7%
2181
 
2.5%
2079
 
2.4%
1834
 
2.1%
1787
 
2.1%
1671
 
2.0%
1595
 
1.9%
1570
 
1.8%
1467
 
1.7%
1416
 
1.7%
Other values (507) 67737
79.1%
None
ValueCountFrequency (%)
· 6
54.5%
5
45.5%
Number Forms
ValueCountFrequency (%)
4
100.0%

성분명(영문) (ingredient name, English)
Text

Distinct: 1635
Distinct (%): 16.4%
Missing: 20
Missing (%): 0.2%
Memory size: 156.2 KiB

Length

Max length: 100
Median length: 73
Mean length: 28.039379
Min length: 2

Characters and Unicode

Total characters: 279833
Distinct characters: 125
Distinct categories: 12
Distinct scripts: 3
Distinct blocks: 5
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 703
Unique (%): 7.0%

Sample

1st row: Terbinafine Hydrochloride
2nd row: Marbofloxacin
3rd row: Amprolium HCl+Sulfaquinoxaline
4th row: Glutaraldehyde
5th row: Dehorner
ValueCountFrequency (%)
for 1381
 
4.6%
ivd 1216
 
4.0%
reagents 1205
 
4.0%
of 757
 
2.5%
infectious 526
 
1.7%
immunological 492
 
1.6%
method 485
 
1.6%
pathogens 470
 
1.6%
by 470
 
1.6%
oie 470
 
1.6%
Other values (2153) 22597
75.2%

Most occurring characters

ValueCountFrequency (%)
e 24871
 
8.9%
i 22443
 
8.0%
20089
 
7.2%
o 19056
 
6.8%
a 18821
 
6.7%
n 18605
 
6.6%
l 16066
 
5.7%
t 15401
 
5.5%
r 14792
 
5.3%
s 12865
 
4.6%
Other values (115) 96824
34.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 231127
82.6%
Space Separator 20089
 
7.2%
Uppercase Letter 20014
 
7.2%
Other Punctuation 3911
 
1.4%
Math Symbol 2271
 
0.8%
Dash Punctuation 1256
 
0.4%
Decimal Number 469
 
0.2%
Other Letter 260
 
0.1%
Open Punctuation 218
 
0.1%
Close Punctuation 213
 
0.1%
Other values (2) 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
17
 
6.5%
14
 
5.4%
14
 
5.4%
12
 
4.6%
10
 
3.8%
7
 
2.7%
7
 
2.7%
7
 
2.7%
7
 
2.7%
7
 
2.7%
Other values (41) 158
60.8%
Lowercase Letter
ValueCountFrequency (%)
e 24871
10.8%
i 22443
 
9.7%
o 19056
 
8.2%
a 18821
 
8.1%
n 18605
 
8.0%
l 16066
 
7.0%
t 15401
 
6.7%
r 14792
 
6.4%
s 12865
 
5.6%
c 10496
 
4.5%
Other values (16) 57711
25.0%
Uppercase Letter
ValueCountFrequency (%)
I 2711
13.5%
D 2104
10.5%
C 1973
9.9%
V 1708
 
8.5%
E 1406
 
7.0%
A 1363
 
6.8%
T 997
 
5.0%
S 994
 
5.0%
P 961
 
4.8%
O 912
 
4.6%
Other values (16) 4885
24.4%
Decimal Number
ValueCountFrequency (%)
2 105
22.4%
3 99
21.1%
1 83
17.7%
6 49
10.4%
7 46
9.8%
5 32
 
6.8%
9 21
 
4.5%
0 18
 
3.8%
8 16
 
3.4%
Other Punctuation
ValueCountFrequency (%)
, 3002
76.8%
. 535
 
13.7%
/ 370
 
9.5%
: 3
 
0.1%
% 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
+ 2266
99.8%
5
 
0.2%
Space Separator
ValueCountFrequency (%)
20089
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1256
100.0%
Open Punctuation
ValueCountFrequency (%)
( 218
100.0%
Close Punctuation
ValueCountFrequency (%)
) 213
100.0%
Letter Number
ValueCountFrequency (%)
4
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 251145
89.7%
Common 28428
 
10.2%
Hangul 260
 
0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 24871
 
9.9%
i 22443
 
8.9%
o 19056
 
7.6%
a 18821
 
7.5%
n 18605
 
7.4%
l 16066
 
6.4%
t 15401
 
6.1%
r 14792
 
5.9%
s 12865
 
5.1%
c 10496
 
4.2%
Other values (43) 77729
30.9%
Hangul
ValueCountFrequency (%)
17
 
6.5%
14
 
5.4%
14
 
5.4%
12
 
4.6%
10
 
3.8%
7
 
2.7%
7
 
2.7%
7
 
2.7%
7
 
2.7%
7
 
2.7%
Other values (41) 158
60.8%
Common
ValueCountFrequency (%)
20089
70.7%
, 3002
 
10.6%
+ 2266
 
8.0%
- 1256
 
4.4%
. 535
 
1.9%
/ 370
 
1.3%
( 218
 
0.8%
) 213
 
0.7%
2 105
 
0.4%
3 99
 
0.3%
Other values (11) 275
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 279563
99.9%
Hangul 260
 
0.1%
None 5
 
< 0.1%
Number Forms 4
 
< 0.1%
Punctuation 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 24871
 
8.9%
i 22443
 
8.0%
20089
 
7.2%
o 19056
 
6.8%
a 18821
 
6.7%
n 18605
 
6.7%
l 16066
 
5.7%
t 15401
 
5.5%
r 14792
 
5.3%
s 12865
 
4.6%
Other values (61) 96554
34.5%
Hangul
ValueCountFrequency (%)
17
 
6.5%
14
 
5.4%
14
 
5.4%
12
 
4.6%
10
 
3.8%
7
 
2.7%
7
 
2.7%
7
 
2.7%
7
 
2.7%
7
 
2.7%
Other values (41) 158
60.8%
None
ValueCountFrequency (%)
5
100.0%
Number Forms
ValueCountFrequency (%)
4
100.0%
Punctuation
ValueCountFrequency (%)
1
100.0%

허가일 (approval date)
Text

Distinct: 3681
Distinct (%): 37.0%
Missing: 48
Missing (%): 0.5%
Memory size: 156.2 KiB

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Characters and Unicode

Total characters: 99520
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1474
Unique (%): 14.8%

Sample

1st row: 2022-04-14
2nd row: 2015-02-12
3rd row: 2000-12-04
4th row: 2000-04-15
5th row: 2013-10-16
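
The approval dates are profiled as text, but the sample values are 10-character strings in YYYY-MM-DD form, so they can be parsed into dates for further analysis. A minimal sketch using the five sample values above plus the `<NA>` marker that the report's Sample section shows for a missing date; the helper name `parse_approval_date` is illustrative.

```python
from datetime import date

def parse_approval_date(text):
    """Return a date for an ISO 'YYYY-MM-DD' string, or None for a missing value."""
    if text is None or text == "<NA>":
        return None
    return date.fromisoformat(text)

samples = ["2022-04-14", "2015-02-12", "2000-12-04", "2000-04-15", "2013-10-16", "<NA>"]
parsed = [parse_approval_date(s) for s in samples]

# Ignore missing values when looking for the earliest approval.
earliest = min(d for d in parsed if d is not None)
print(earliest.isoformat())
```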
ValueCountFrequency (%)
2020-01-30 41
 
0.4%
2019-11-05 31
 
0.3%
1987-12-18 27
 
0.3%
2019-08-26 25
 
0.3%
2017-07-06 24
 
0.2%
2016-02-11 24
 
0.2%
2016-03-29 21
 
0.2%
1988-12-07 21
 
0.2%
1997-06-28 20
 
0.2%
2022-05-04 20
 
0.2%
Other values (3671) 9698
97.4%

Most occurring characters

ValueCountFrequency (%)
0 22904
23.0%
- 19904
20.0%
2 16369
16.4%
1 15262
15.3%
9 6346
 
6.4%
3 3810
 
3.8%
8 3649
 
3.7%
7 3215
 
3.2%
4 2811
 
2.8%
6 2730
 
2.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 79616
80.0%
Dash Punctuation 19904
 
20.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 22904
28.8%
2 16369
20.6%
1 15262
19.2%
9 6346
 
8.0%
3 3810
 
4.8%
8 3649
 
4.6%
7 3215
 
4.0%
4 2811
 
3.5%
6 2730
 
3.4%
5 2520
 
3.2%
Dash Punctuation
ValueCountFrequency (%)
- 19904
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 99520
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 22904
23.0%
- 19904
20.0%
2 16369
16.4%
1 15262
15.3%
9 6346
 
6.4%
3 3810
 
3.8%
8 3649
 
3.7%
7 3215
 
3.2%
4 2811
 
2.8%
6 2730
 
2.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 99520
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 22904
23.0%
- 19904
20.0%
2 16369
16.4%
1 15262
15.3%
9 6346
 
6.4%
3 3810
 
3.8%
8 3649
 
3.7%
7 3215
 
3.2%
4 2811
 
2.8%
6 2730
 
2.7%

업체명 (company name)
Text

Distinct: 762
Distinct (%): 7.6%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 24
Median length: 18
Mean length: 8.1974
Min length: 2

Characters and Unicode

Total characters: 81974
Distinct characters: 375
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 295
Unique (%): 2.9%

Sample

1st row: (주)제이에스케이
2nd row: (주)이글벳
3rd row: (주)코미팜
4th row: (주)삼우메디안
5th row: 주식회사 오창
ValueCountFrequency (%)
주식회사 600
 
5.5%
주)고려비엔피 295
 
2.7%
주)코미팜 275
 
2.5%
주)대성미생물연구소 262
 
2.4%
주)한동 208
 
1.9%
한국엘랑코동물약품(주 198
 
1.8%
주)이글벳 191
 
1.8%
주)제일바이오 187
 
1.7%
에스비신일(주 185
 
1.7%
주)삼양애니팜 181
 
1.7%
Other values (776) 8313
76.3%

Most occurring characters

ValueCountFrequency (%)
9115
 
11.1%
) 8536
 
10.4%
( 8536
 
10.4%
3345
 
4.1%
2169
 
2.6%
1812
 
2.2%
1663
 
2.0%
1476
 
1.8%
1210
 
1.5%
1197
 
1.5%
Other values (365) 42915
52.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 63881
77.9%
Close Punctuation 8536
 
10.4%
Open Punctuation 8536
 
10.4%
Space Separator 895
 
1.1%
Uppercase Letter 102
 
0.1%
Lowercase Letter 14
 
< 0.1%
Other Punctuation 8
 
< 0.1%
Other Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9115
 
14.3%
3345
 
5.2%
2169
 
3.4%
1812
 
2.8%
1663
 
2.6%
1476
 
2.3%
1210
 
1.9%
1197
 
1.9%
1154
 
1.8%
1131
 
1.8%
Other values (345) 39609
62.0%
Uppercase Letter
ValueCountFrequency (%)
A 21
20.6%
B 18
17.6%
C 18
17.6%
K 13
12.7%
O 6
 
5.9%
I 6
 
5.9%
V 6
 
5.9%
N 4
 
3.9%
S 3
 
2.9%
E 2
 
2.0%
Other values (4) 5
 
4.9%
Close Punctuation
ValueCountFrequency (%)
) 8536
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8536
100.0%
Space Separator
ValueCountFrequency (%)
895
100.0%
Lowercase Letter
ValueCountFrequency (%)
n 14
100.0%
Other Punctuation
ValueCountFrequency (%)
. 8
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 63883
77.9%
Common 17975
 
21.9%
Latin 116
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9115
 
14.3%
3345
 
5.2%
2169
 
3.4%
1812
 
2.8%
1663
 
2.6%
1476
 
2.3%
1210
 
1.9%
1197
 
1.9%
1154
 
1.8%
1131
 
1.8%
Other values (346) 39611
62.0%
Latin
ValueCountFrequency (%)
A 21
18.1%
B 18
15.5%
C 18
15.5%
n 14
12.1%
K 13
11.2%
O 6
 
5.2%
I 6
 
5.2%
V 6
 
5.2%
N 4
 
3.4%
S 3
 
2.6%
Other values (5) 7
 
6.0%
Common
ValueCountFrequency (%)
) 8536
47.5%
( 8536
47.5%
895
 
5.0%
. 8
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 63881
77.9%
ASCII 18091
 
22.1%
None 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
9115
 
14.3%
3345
 
5.2%
2169
 
3.4%
1812
 
2.8%
1663
 
2.6%
1476
 
2.3%
1210
 
1.9%
1197
 
1.9%
1154
 
1.8%
1131
 
1.8%
Other values (345) 39609
62.0%
ASCII
ValueCountFrequency (%)
) 8536
47.2%
( 8536
47.2%
895
 
4.9%
A 21
 
0.1%
B 18
 
0.1%
C 18
 
0.1%
n 14
 
0.1%
K 13
 
0.1%
. 8
 
< 0.1%
O 6
 
< 0.1%
Other values (9) 26
 
0.1%
None
ValueCountFrequency (%)
2
100.0%

Correlations

Variable | 용도 | 업종 | 허가유형 | 품목형태
용도 | 1.000 | 0.539 | 0.128 | 0.416
업종 | 0.539 | 1.000 | 0.101 | 0.294
허가유형 | 0.128 | 0.101 | 1.000 | 0.024
품목형태 | 0.416 | 0.294 | 0.024 | 1.000

Variable | 품목형태 | 허가유형 | 업종 | 용도
품목형태 | 1.000 | 0.039 | 0.098 | 0.155
허가유형 | 0.039 | 1.000 | 0.167 | 0.212
업종 | 0.098 | 0.167 | 1.000 | 0.228
용도 | 0.155 | 0.212 | 0.228 | 1.000

Variable | 용도 | 업종 | 허가유형 | 품목형태
용도 | 1.000 | 0.228 | 0.212 | 0.155
업종 | 0.228 | 1.000 | 0.167 | 0.098
허가유형 | 0.212 | 0.167 | 1.000 | 0.039
품목형태 | 0.155 | 0.098 | 0.039 | 1.000
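
The text of this report does not say which association measure each matrix uses. For pairs of categorical variables, one common choice is Cramér's V, derived from the chi-squared statistic of a contingency table. The following is a minimal sketch of the uncorrected statistic on hypothetical 2×2 tables; it is not claimed to reproduce the values above, and the function name `cramers_v` is illustrative.

```python
import math

def cramers_v(table):
    """Uncorrected Cramér's V for a contingency table given as a list of rows.

    Assumes every row and every column has a non-zero total.
    """
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k))

# Hypothetical counts: rows are two categories of one variable,
# columns are two categories of another.
independent = [[30, 70], [30, 70]]   # same column split in every row
associated = [[50, 0], [0, 50]]      # each row maps to exactly one column
print(cramers_v(independent), cramers_v(associated))
```

A value of 0 indicates no association and 1 indicates that one variable fully determines the other, so entries such as 0.539 between 용도 and 업종 would read as moderate association under this measure.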

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
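
Nullity correlation can be read as the Pearson correlation between the "is missing" indicators of two columns. The sketch below computes it for two short hypothetical columns, using `None` for a missing cell; the function name and the data are illustrative and are not drawn from the dataset.

```python
import math

def nullity_correlation(col_a, col_b):
    """Pearson correlation between the missing-value indicators of two columns.

    Assumes each column has at least one missing and one present value.
    """
    a = [1.0 if v is None else 0.0 for v in col_a]
    b = [1.0 if v is None else 0.0 for v in col_b]
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)

# Hypothetical columns with missing cells marked as None.
name_en = ["Oxytocin", None, "Tylosin", None]
approved = ["1993-08-03", None, "1978-12-07", "2019-07-22"]
r = nullity_correlation(name_en, approved)
print(round(r, 3))
```

A value near 1 means the two columns tend to be missing in the same rows, and a value near -1 means one tends to be present exactly when the other is missing.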

Sample

Index | 용도 | 허가번호 | 업종 | 품목명 | 허가유형 | 품목형태 | 성분코드 | 성분명(국문) | 성분명(영문) | 허가일 | 업체명
6765 | 동물용의약품 | 283-072 | 수입 | 테르메딘 정(테르비나핀염산염)(Termedin tablet) | A | C | IE1010104 | 염산테르비나핀 | Terbinafine Hydrochloride | 2022-04-14 | (주)제이에스케이
2498 | 동물용의약품 | 020-239 | 제조 | 마보맥스 10 주(Marbomax 10 Inj.) | A | C | IA5010127 | 마보플록사신 | Marbofloxacin | 2015-02-12 | (주)이글벳
4085 | 동물용의약품 | 033-212 | 제조 | 안티콕시(수출용)(ANTI-COCCI) | A | C | IX8030258 | 암프롤리움+설파퀴녹살린 | Amprolium HCl+Sulfaquinoxaline | 2000-12-04 | (주)코미팜
7347 | 동물용의약외품 | 016-161 | 제조 | 글루타-에프(GLUTA-F) | A | M | KA5010140 | 글루타알데하이드 | Glutaraldehyde | 2000-04-15 | (주)삼우메디안
9015 | 동물용의료기기 | 049-006 | 수입 | 제각기[1](180150외 5종) | D | C | LD0200400 | 제각기[1] | Dehorner | 2013-10-16 | 주식회사 오창
6870 | 동물용의약품 | 337-004 | 수입 | 한펜졸 산(Hanfenzol 4 percent) | A | C | IC2010104 | 펜벤다졸 | Fenbendazole | 2018-08-10 | 한산에프앤피
10576 | 동물용의료기기 | 140-011 | 수입 | 수동식재사용가능의료용핸드피스[1](AZ023105외 1종) | A | C | LA5900700 | 수동식재사용가능의료용핸드피스[1] | Surgical drill handpiece, manual | 2017-07-06 | 에이블 주식회사
7561 | 동물용의약외품 | 034-038 | 제조 | 슈퍼바라살-이씨 | A | C | KB3010120 | 비피엠씨 | BPMC(2-Sec-butylphenyl-N-methylcarbamate) | 1981-10-17 | 한국썸벧(주)
3294 | 동물용의약품 | 025-253 | 제조 | 한동 포도당 20% 주사(HD Glucose 20% Inj.) | A | C | HM1210303 | 포도당 | Glucose | 2013-07-24 | (주)한동
2360 | 동물용의약품 | 020-078 | 제조 | 비내용 가나마이신용액 | A | C | IO1410124 | 카나마이신 | Kanamycin | 1986-01-18 | (주)이글벳

Index | 용도 | 허가번호 | 업종 | 품목명 | 허가유형 | 품목형태 | 성분코드 | 성분명(국문) | 성분명(영문) | 허가일 | 업체명
10460 | 동물용의료기기 | 135-012 | 제조 | 저위험성동물전염병면역검사시약[2](Bionet PEDV Ag Rapid kit, 수출용) | A | C | LC0401400 | 저위험성동물전염병면역검사시약[2] | IVD reagents of immunological method for non-legally designate infectious pathogens by OIE. | 2019-05-20 | 주식회사 바이오넷
5119 | 동물용의약품 | 051-040 | 수입 | 옥시토신 주사(OXYTOCIN INJ.) | A | C | EA3010101 | 옥시토신 | Oxytocin | 1993-08-03 | (주)버박코리아
11631 | 동물용의료기기 | 247-025 | 제조 | 저위험성동물전염병유전검사용시약[2](CareDX™ Canine Gastroenteritis Disease Real-time PCR Kit) | A | C | LC0600600 | 저위험성동물전염병유전검사용시약[2] | IVD reagents of molecular genetics for non-legally designated infectious pathogens by OIE. | 2023-10-17 | (주)케어벳
1372 | 동물용의약품 | 011-055 | 제조 | 대성 타이신 50 수용산 | A | C | IO2010108 | 타이로신 | Tylosin | 1978-12-07 | (주)대성미생물연구소
2006 | 동물용의약품 | 016-038 | 제조 | 비타톤 마린(수산용) | A | C | HM2210301 | 기타영양공급약 | Mineral supplemental preparations | 1975-08-07 | (주)삼우메디안
3535 | 동물용의약품 | 027-066 | 제조 | 플로킬(FLOKILL) | A | C | IO1810107 | 플로르페니콜 | Florfenicol | 2014-07-18 | (주)남전물산
8163 | 동물용의약외품 | 382-001 | 수입 | 폴미첼 존폴펫 바디 앤 포 펫 물티(John Paul Pet - Body & Paw Pet Wipes) | D | C | PA1000106 | 물티슈 | wet wipes | 2019-07-22 | (주)폴미첼코리아
6753 | 동물용의약품 | 283-053 | 수입 | 리펠로액(피프로닐)(Repello Spot On) | A | C | IH1010185 | 피프로닐 | Fipronil | 2017-01-23 | (주)제이에스케이
7891 | 동물용의약외품 | 122-050 | 제조 | 도그라인 이어 클리너(Dogline Ear Cleaner) | A | C | KC6000100 | 귀세척제 | Ear cleaner | <NA> | 우신화장품
6168 | 동물용의약품 | 125-032 | 제조 | 프루너스 닥터 미코클로딘 샴푸액(Prunus Dr.MicoChlodine Shampoo) | A | C | GB1030113 | 글루콘산 클로르헥시딘액+미코나졸 니트레이트 | Chlorhexidine gluconate+Miconazole Nitrate | 2016-04-21 | (주)나우코스

Duplicate rows

Most frequently occurring

Index | 용도 | 허가번호 | 업종 | 품목명 | 허가유형 | 품목형태 | 성분코드 | 성분명(국문) | 성분명(영문) | 허가일 | 업체명 | # duplicates
0 | 동물용의료기기 | 037-007 | 수입 | 동물 전신용전산화단층엑스선촬영장치[2](SOMATOM Emotion 6-slice configuration) | A | C | LA1800100 | 전신용전산화단층엑스선촬영장치[2] | CT system, full-body | 2011-09-30 | 지멘스헬시니어스(주) | 2
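
The "# duplicates" column gives how many times an identical row occurs. A minimal sketch of how such rows can be found by counting whole rows; the rows below are a hypothetical three-column excerpt (용도, 허가번호, 업종) used only for illustration.

```python
from collections import Counter

# Hypothetical rows of (용도, 허가번호, 업종), trimmed to three fields for brevity.
rows = [
    ("동물용의료기기", "037-007", "수입"),
    ("동물용의약품", "020-239", "제조"),
    ("동물용의료기기", "037-007", "수입"),
]

# Count each complete row; any row seen more than once is a duplicate.
occurrences = Counter(rows)
duplicates = {row: n for row, n in occurrences.items() if n > 1}

for row, n in duplicates.items():
    print(row, n)
```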