Overview

Dataset statistics

Number of variables8
Number of observations10000
Missing cells10
Missing cells (%)< 0.1%
Duplicate rows1
Duplicate rows (%)< 0.1%
Total size in memory703.1 KiB
Average record size in memory72.0 B

Variable types

Categorical2
Text5
DateTime1

Dataset

Description동물용의품 허가(신고) 신청에 따른 허가(신고) 결과 정보에 대한 데이터로 업체명, 제품명, 허가일자 등을 제공합니다.
Author농림축산식품부
URLhttps://www.data.go.kr/data/3037261/fileData.do

Alerts

Dataset has 1 (< 0.1%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 01:43:18.190809
Analysis finished2023-12-12 01:43:20.087340
Duration1.9 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

용도
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
동물용의약품
6858 
동물용의료기기
2361 
동물용의약외품
781 

Length

Max length7
Median length6
Mean length6.3142
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row동물용의약품
2nd row동물용의료기기
3rd row동물용의약품
4th row동물용의약품
5th row동물용의료기기

Common Values

ValueCountFrequency (%)
동물용의약품 6858
68.6%
동물용의료기기 2361
 
23.6%
동물용의약외품 781
 
7.8%

Length

2023-12-12T10:43:20.182730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:43:20.311939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
동물용의약품 6858
68.6%
동물용의료기기 2361
 
23.6%
동물용의약외품 781
 
7.8%
Distinct8817
Distinct (%)88.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T10:43:20.695151image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length7
Mean length7
Min length7

Characters and Unicode

Total characters70000
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7805 ?
Unique (%)78.0%

Sample

1st row054-082
2nd row036-015
3rd row042-045
4th row024-174
5th row059-011
ValueCountFrequency (%)
037-007 5
 
< 0.1%
052-003 4
 
< 0.1%
013-002 4
 
< 0.1%
054-001 4
 
< 0.1%
052-004 4
 
< 0.1%
161-001 4
 
< 0.1%
054-002 4
 
< 0.1%
037-005 4
 
< 0.1%
111-001 4
 
< 0.1%
054-003 4
 
< 0.1%
Other values (8807) 9959
99.6%
2023-12-12T10:43:21.175072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 19669
28.1%
1 10431
14.9%
- 10000
14.3%
2 6650
 
9.5%
3 5169
 
7.4%
4 3724
 
5.3%
6 3290
 
4.7%
5 3134
 
4.5%
9 2860
 
4.1%
8 2646
 
3.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 60000
85.7%
Dash Punctuation 10000
 
14.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 19669
32.8%
1 10431
17.4%
2 6650
 
11.1%
3 5169
 
8.6%
4 3724
 
6.2%
6 3290
 
5.5%
5 3134
 
5.2%
9 2860
 
4.8%
8 2646
 
4.4%
7 2427
 
4.0%
Dash Punctuation
ValueCountFrequency (%)
- 10000
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 70000
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 19669
28.1%
1 10431
14.9%
- 10000
14.3%
2 6650
 
9.5%
3 5169
 
7.4%
4 3724
 
5.3%
6 3290
 
4.7%
5 3134
 
4.5%
9 2860
 
4.1%
8 2646
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 70000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 19669
28.1%
1 10431
14.9%
- 10000
14.3%
2 6650
 
9.5%
3 5169
 
7.4%
4 3724
 
5.3%
6 3290
 
4.7%
5 3134
 
4.5%
9 2860
 
4.1%
8 2646
 
3.8%

업종
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
제조
7133 
수입
2867 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row수입
2nd row수입
3rd row제조
4th row제조
5th row수입

Common Values

ValueCountFrequency (%)
제조 7133
71.3%
수입 2867
28.7%

Length

2023-12-12T10:43:21.322466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:43:21.427455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
제조 7133
71.3%
수입 2867
28.7%
Distinct9930
Distinct (%)99.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T10:43:21.676128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length111
Median length79
Mean length23.3415
Min length2

Characters and Unicode

Total characters233415
Distinct characters805
Distinct categories18 ?
Distinct scripts4 ?
Distinct blocks8 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9888 ?
Unique (%)98.9%

Sample

1st row다이펜 주사제(Dipen)
2nd row환축감시장치[2](G3 vet patient monitor)
3rd row서울-살리노60(SALINO-60)
4th row다원린스펙
5th row재사용가능동물안과용큐렛[1](Acrivet Arlt lens loop)
ValueCountFrequency (%)
inj 366
 
1.4%
kit 271
 
1.0%
test 242
 
0.9%
214
 
0.8%
대성 203
 
0.8%
injection 184
 
0.7%
ag 157
 
0.6%
solution 151
 
0.6%
powder 145
 
0.5%
vaccine 143
 
0.5%
Other values (14304) 24907
92.3%
2023-12-12T10:43:22.114592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
16983
 
7.3%
( 9085
 
3.9%
) 9079
 
3.9%
e 5368
 
2.3%
i 4944
 
2.1%
0 4511
 
1.9%
A 4373
 
1.9%
o 4226
 
1.8%
- 4058
 
1.7%
I 3947
 
1.7%
Other values (795) 166841
71.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 75240
32.2%
Uppercase Letter 49853
21.4%
Lowercase Letter 47868
20.5%
Space Separator 16983
 
7.3%
Decimal Number 14332
 
6.1%
Open Punctuation 11394
 
4.9%
Close Punctuation 11392
 
4.9%
Dash Punctuation 4066
 
1.7%
Other Punctuation 2094
 
0.9%
Math Symbol 99
 
< 0.1%
Other values (8) 94
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3001
 
4.0%
2205
 
2.9%
2138
 
2.8%
2066
 
2.7%
1815
 
2.4%
1471
 
2.0%
1294
 
1.7%
1161
 
1.5%
1065
 
1.4%
1046
 
1.4%
Other values (692) 57978
77.1%
Lowercase Letter
ValueCountFrequency (%)
e 5368
11.2%
i 4944
10.3%
o 4226
 
8.8%
a 3837
 
8.0%
n 3801
 
7.9%
t 3281
 
6.9%
l 3063
 
6.4%
r 2896
 
6.0%
c 2211
 
4.6%
s 2038
 
4.3%
Other values (18) 12203
25.5%
Uppercase Letter
ValueCountFrequency (%)
A 4373
 
8.8%
I 3947
 
7.9%
E 3748
 
7.5%
C 3404
 
6.8%
S 3152
 
6.3%
T 3097
 
6.2%
O 2854
 
5.7%
N 2797
 
5.6%
P 2797
 
5.6%
R 2694
 
5.4%
Other values (17) 16990
34.1%
Other Punctuation
ValueCountFrequency (%)
. 1214
58.0%
, 360
 
17.2%
/ 194
 
9.3%
155
 
7.4%
% 83
 
4.0%
· 43
 
2.1%
& 22
 
1.1%
: 11
 
0.5%
# 4
 
0.2%
; 4
 
0.2%
Other values (3) 4
 
0.2%
Decimal Number
ValueCountFrequency (%)
0 4511
31.5%
2 2906
20.3%
1 2635
18.4%
3 1277
 
8.9%
5 1228
 
8.6%
4 629
 
4.4%
6 383
 
2.7%
7 271
 
1.9%
8 252
 
1.8%
9 240
 
1.7%
Other Symbol
ValueCountFrequency (%)
51
69.9%
10
 
13.7%
7
 
9.6%
4
 
5.5%
1
 
1.4%
Open Punctuation
ValueCountFrequency (%)
( 9085
79.7%
[ 2284
 
20.0%
25
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 9079
79.7%
] 2274
 
20.0%
39
 
0.3%
Dash Punctuation
ValueCountFrequency (%)
- 4058
99.8%
8
 
0.2%
Math Symbol
ValueCountFrequency (%)
+ 93
93.9%
6
 
6.1%
Letter Number
ValueCountFrequency (%)
3
50.0%
3
50.0%
Other Number
ValueCountFrequency (%)
1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
16983
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 7
100.0%
Format
ValueCountFrequency (%)
­ 2
100.0%
Currency Symbol
ValueCountFrequency (%)
¤ 2
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 97675
41.8%
Hangul 75247
32.2%
Common 60441
25.9%
Greek 52
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3001
 
4.0%
2205
 
2.9%
2138
 
2.8%
2066
 
2.7%
1815
 
2.4%
1471
 
2.0%
1294
 
1.7%
1161
 
1.5%
1065
 
1.4%
1046
 
1.4%
Other values (693) 57985
77.1%
Latin
ValueCountFrequency (%)
e 5368
 
5.5%
i 4944
 
5.1%
A 4373
 
4.5%
o 4226
 
4.3%
I 3947
 
4.0%
a 3837
 
3.9%
n 3801
 
3.9%
E 3748
 
3.8%
C 3404
 
3.5%
t 3281
 
3.4%
Other values (45) 56746
58.1%
Common
ValueCountFrequency (%)
16983
28.1%
( 9085
15.0%
) 9079
15.0%
0 4511
 
7.5%
- 4058
 
6.7%
2 2906
 
4.8%
1 2635
 
4.4%
[ 2284
 
3.8%
] 2274
 
3.8%
3 1277
 
2.1%
Other values (35) 5349
 
8.8%
Greek
ValueCountFrequency (%)
μ 51
98.1%
α 1
 
1.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 157757
67.6%
Hangul 75235
32.2%
None 337
 
0.1%
Letterlike Symbols 51
 
< 0.1%
CJK Compat 15
 
< 0.1%
Punctuation 9
 
< 0.1%
Number Forms 6
 
< 0.1%
Compat Jamo 5
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
16983
 
10.8%
( 9085
 
5.8%
) 9079
 
5.8%
e 5368
 
3.4%
i 4944
 
3.1%
0 4511
 
2.9%
A 4373
 
2.8%
o 4226
 
2.7%
- 4058
 
2.6%
I 3947
 
2.5%
Other values (70) 91183
57.8%
Hangul
ValueCountFrequency (%)
3001
 
4.0%
2205
 
2.9%
2138
 
2.8%
2066
 
2.7%
1815
 
2.4%
1471
 
2.0%
1294
 
1.7%
1161
 
1.5%
1065
 
1.4%
1046
 
1.4%
Other values (691) 57973
77.1%
None
ValueCountFrequency (%)
155
46.0%
μ 51
 
15.1%
· 43
 
12.8%
39
 
11.6%
25
 
7.4%
7
 
2.1%
6
 
1.8%
­ 2
 
0.6%
¡ 2
 
0.6%
¤ 2
 
0.6%
Other values (5) 5
 
1.5%
Letterlike Symbols
ValueCountFrequency (%)
51
100.0%
CJK Compat
ValueCountFrequency (%)
10
66.7%
4
 
26.7%
1
 
6.7%
Punctuation
ValueCountFrequency (%)
8
88.9%
1
 
11.1%
Compat Jamo
ValueCountFrequency (%)
5
100.0%
Number Forms
ValueCountFrequency (%)
3
50.0%
3
50.0%
Distinct1670
Distinct (%)16.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T10:43:22.367539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length62
Median length52
Mean length9.7078
Min length2

Characters and Unicode

Total characters97078
Distinct characters575
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique726 ?
Unique (%)7.3%

Sample

1st row페니실린지프로카인+디하이드로스트렙토마이신
2nd row환축감시장치[2]
3rd row살리노마이신나트륨
4th row린코마이신+스펙티노마이신
5th row재사용가능안과용큐렛[1]
ValueCountFrequency (%)
기타영양공급약 374
 
3.4%
비타민 278
 
2.6%
플로르페니콜 233
 
2.1%
저위험성동물전염병면역검사시약[2 179
 
1.6%
엔로플록사신 177
 
1.6%
아목시실린 154
 
1.4%
면역화학검사시약[2 131
 
1.2%
이버멕틴 129
 
1.2%
인수공통전염병면역검사시약[3 123
 
1.1%
암피실린 105
 
1.0%
Other values (1781) 9003
82.7%
2023-12-12T10:43:22.787884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
+ 2950
 
3.0%
2525
 
2.6%
[ 2358
 
2.4%
] 2358
 
2.4%
1891
 
1.9%
1856
 
1.9%
1796
 
1.9%
1748
 
1.8%
1628
 
1.7%
1585
 
1.6%
Other values (565) 76383
78.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 83886
86.4%
Math Symbol 3044
 
3.1%
Decimal Number 2847
 
2.9%
Open Punctuation 2480
 
2.6%
Close Punctuation 2480
 
2.6%
Space Separator 886
 
0.9%
Uppercase Letter 858
 
0.9%
Other Punctuation 349
 
0.4%
Lowercase Letter 168
 
0.2%
Dash Punctuation 79
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2525
 
3.0%
1891
 
2.3%
1856
 
2.2%
1796
 
2.1%
1748
 
2.1%
1628
 
1.9%
1585
 
1.9%
1578
 
1.9%
1548
 
1.8%
1505
 
1.8%
Other values (500) 66226
78.9%
Lowercase Letter
ValueCountFrequency (%)
e 19
11.3%
r 17
10.1%
i 16
9.5%
t 15
8.9%
a 14
 
8.3%
n 12
 
7.1%
o 11
 
6.5%
l 11
 
6.5%
m 9
 
5.4%
p 9
 
5.4%
Other values (11) 35
20.8%
Uppercase Letter
ValueCountFrequency (%)
E 203
23.7%
A 141
16.4%
C 125
14.6%
D 124
14.5%
B 88
10.3%
F 57
 
6.6%
H 26
 
3.0%
V 20
 
2.3%
T 14
 
1.6%
L 10
 
1.2%
Other values (9) 50
 
5.8%
Decimal Number
ValueCountFrequency (%)
2 1251
43.9%
1 742
26.1%
3 656
23.0%
4 123
 
4.3%
0 29
 
1.0%
6 16
 
0.6%
8 15
 
0.5%
9 13
 
0.5%
5 1
 
< 0.1%
7 1
 
< 0.1%
Other Punctuation
ValueCountFrequency (%)
/ 283
81.1%
. 36
 
10.3%
, 21
 
6.0%
· 8
 
2.3%
: 1
 
0.3%
Math Symbol
ValueCountFrequency (%)
+ 2950
96.9%
~ 90
 
3.0%
4
 
0.1%
Open Punctuation
ValueCountFrequency (%)
[ 2358
95.1%
( 122
 
4.9%
Close Punctuation
ValueCountFrequency (%)
] 2358
95.1%
) 122
 
4.9%
Space Separator
ValueCountFrequency (%)
886
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 79
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 83886
86.4%
Common 12165
 
12.5%
Latin 1027
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2525
 
3.0%
1891
 
2.3%
1856
 
2.2%
1796
 
2.1%
1748
 
2.1%
1628
 
1.9%
1585
 
1.9%
1578
 
1.9%
1548
 
1.8%
1505
 
1.8%
Other values (500) 66226
78.9%
Latin
ValueCountFrequency (%)
E 203
19.8%
A 141
13.7%
C 125
12.2%
D 124
12.1%
B 88
8.6%
F 57
 
5.6%
H 26
 
2.5%
V 20
 
1.9%
e 19
 
1.9%
r 17
 
1.7%
Other values (31) 207
20.2%
Common
ValueCountFrequency (%)
+ 2950
24.2%
[ 2358
19.4%
] 2358
19.4%
2 1251
10.3%
886
 
7.3%
1 742
 
6.1%
3 656
 
5.4%
/ 283
 
2.3%
4 123
 
1.0%
( 122
 
1.0%
Other values (14) 436
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 83886
86.4%
ASCII 13179
 
13.6%
None 12
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
+ 2950
22.4%
[ 2358
17.9%
] 2358
17.9%
2 1251
9.5%
886
 
6.7%
1 742
 
5.6%
3 656
 
5.0%
/ 283
 
2.1%
E 203
 
1.5%
A 141
 
1.1%
Other values (52) 1351
10.3%
Hangul
ValueCountFrequency (%)
2525
 
3.0%
1891
 
2.3%
1856
 
2.2%
1796
 
2.1%
1748
 
2.1%
1628
 
1.9%
1585
 
1.9%
1578
 
1.9%
1548
 
1.8%
1505
 
1.8%
Other values (500) 66226
78.9%
None
ValueCountFrequency (%)
· 8
66.7%
4
33.3%
Number Forms
ValueCountFrequency (%)
1
100.0%
Distinct1685
Distinct (%)16.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T10:43:23.116798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length100
Median length78
Mean length26.1279
Min length2

Characters and Unicode

Total characters261279
Distinct characters138
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique736 ?
Unique (%)7.4%

Sample

1st rowPenicillin G procaine+Dihydrostreptomycin sulfate
2nd rowAnimal patient monitor
3rd rowSalinomysin sodium
4th rowLincomycin+Spectinomycin
5th rowCurette, ophthalmic, reusable
ValueCountFrequency (%)
for 1032
 
3.8%
ivd 895
 
3.3%
reagents 881
 
3.3%
of 566
 
2.1%
supplemental 554
 
2.1%
preparations 554
 
2.1%
immunological 404
 
1.5%
infectious 401
 
1.5%
method 399
 
1.5%
vitamin 355
 
1.3%
Other values (2194) 20773
77.5%
2023-12-12T10:43:23.631517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 22932
 
8.8%
i 21652
 
8.3%
a 17969
 
6.9%
o 17754
 
6.8%
n 17674
 
6.8%
16814
 
6.4%
l 15599
 
6.0%
t 14364
 
5.5%
r 13844
 
5.3%
s 11469
 
4.4%
Other values (128) 91208
34.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 216973
83.0%
Uppercase Letter 19620
 
7.5%
Space Separator 16814
 
6.4%
Other Punctuation 2990
 
1.1%
Math Symbol 2782
 
1.1%
Dash Punctuation 1041
 
0.4%
Decimal Number 484
 
0.2%
Other Letter 304
 
0.1%
Open Punctuation 133
 
0.1%
Close Punctuation 132
 
0.1%
Other values (2) 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
17
 
5.6%
16
 
5.3%
13
 
4.3%
10
 
3.3%
9
 
3.0%
9
 
3.0%
9
 
3.0%
8
 
2.6%
8
 
2.6%
8
 
2.6%
Other values (55) 197
64.8%
Lowercase Letter
ValueCountFrequency (%)
e 22932
10.6%
i 21652
10.0%
a 17969
 
8.3%
o 17754
 
8.2%
n 17674
 
8.1%
l 15599
 
7.2%
t 14364
 
6.6%
r 13844
 
6.4%
s 11469
 
5.3%
c 9949
 
4.6%
Other values (16) 53767
24.8%
Uppercase Letter
ValueCountFrequency (%)
I 2263
11.5%
C 2037
10.4%
D 1811
 
9.2%
V 1481
 
7.5%
A 1388
 
7.1%
E 1360
 
6.9%
P 1130
 
5.8%
S 1090
 
5.6%
T 1049
 
5.3%
M 980
 
5.0%
Other values (16) 5031
25.6%
Decimal Number
ValueCountFrequency (%)
3 114
23.6%
2 102
21.1%
1 92
19.0%
6 50
10.3%
7 45
 
9.3%
5 30
 
6.2%
0 18
 
3.7%
9 17
 
3.5%
8 16
 
3.3%
Other Punctuation
ValueCountFrequency (%)
, 2175
72.7%
. 430
 
14.4%
/ 384
 
12.8%
: 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
+ 2778
99.9%
4
 
0.1%
Space Separator
ValueCountFrequency (%)
16814
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1041
100.0%
Open Punctuation
ValueCountFrequency (%)
( 133
100.0%
Close Punctuation
ValueCountFrequency (%)
) 132
100.0%
Final Punctuation
ValueCountFrequency (%)
5
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 236594
90.6%
Common 24381
 
9.3%
Hangul 304
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
17
 
5.6%
16
 
5.3%
13
 
4.3%
10
 
3.3%
9
 
3.0%
9
 
3.0%
9
 
3.0%
8
 
2.6%
8
 
2.6%
8
 
2.6%
Other values (55) 197
64.8%
Latin
ValueCountFrequency (%)
e 22932
 
9.7%
i 21652
 
9.2%
a 17969
 
7.6%
o 17754
 
7.5%
n 17674
 
7.5%
l 15599
 
6.6%
t 14364
 
6.1%
r 13844
 
5.9%
s 11469
 
4.8%
c 9949
 
4.2%
Other values (43) 73388
31.0%
Common
ValueCountFrequency (%)
16814
69.0%
+ 2778
 
11.4%
, 2175
 
8.9%
- 1041
 
4.3%
. 430
 
1.8%
/ 384
 
1.6%
( 133
 
0.5%
) 132
 
0.5%
3 114
 
0.5%
2 102
 
0.4%
Other values (10) 278
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 260965
99.9%
Hangul 304
 
0.1%
Punctuation 5
 
< 0.1%
None 4
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 22932
 
8.8%
i 21652
 
8.3%
a 17969
 
6.9%
o 17754
 
6.8%
n 17674
 
6.8%
16814
 
6.4%
l 15599
 
6.0%
t 14364
 
5.5%
r 13844
 
5.3%
s 11469
 
4.4%
Other values (60) 90894
34.8%
Hangul
ValueCountFrequency (%)
17
 
5.6%
16
 
5.3%
13
 
4.3%
10
 
3.3%
9
 
3.0%
9
 
3.0%
9
 
3.0%
8
 
2.6%
8
 
2.6%
8
 
2.6%
Other values (55) 197
64.8%
Punctuation
ValueCountFrequency (%)
5
100.0%
None
ValueCountFrequency (%)
4
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%
Distinct3328
Distinct (%)33.3%
Missing5
Missing (%)< 0.1%
Memory size156.2 KiB
Minimum1963-05-13 00:00:00
Maximum2019-09-25 00:00:00
2023-12-12T10:43:23.821037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:43:23.983229image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct529
Distinct (%)5.3%
Missing5
Missing (%)< 0.1%
Memory size156.2 KiB
2023-12-12T10:43:24.245590image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length16
Mean length8.0678339
Min length2

Characters and Unicode

Total characters80638
Distinct characters326
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique161 ?
Unique (%)1.6%

Sample

1st row화성동물약품(주)
2nd row한국동물약품공업협동조합
3rd row(주)서울신약
4th row(주)다원케미칼
5th row하이퍼메딕스
ValueCountFrequency (%)
주식회사 552
 
5.2%
주)고려비엔피 320
 
3.0%
주)코미팜 315
 
2.9%
주)대성미생물연구소 305
 
2.8%
녹십자수의약품(주 256
 
2.4%
바이엘코리아(주 242
 
2.3%
주)한동 236
 
2.2%
주)제일바이오 222
 
2.1%
주)삼양애니팜 215
 
2.0%
주)유니바이오테크 212
 
2.0%
Other values (543) 7828
73.1%
2023-12-12T10:43:24.664829image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9291
 
11.5%
) 8745
 
10.8%
( 8745
 
10.8%
3249
 
4.0%
1846
 
2.3%
1767
 
2.2%
1725
 
2.1%
1352
 
1.7%
1167
 
1.4%
1143
 
1.4%
Other values (316) 41608
51.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 62380
77.4%
Close Punctuation 8745
 
10.8%
Open Punctuation 8745
 
10.8%
Space Separator 708
 
0.9%
Uppercase Letter 52
 
0.1%
Other Punctuation 6
 
< 0.1%
Other Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9291
 
14.9%
3249
 
5.2%
1846
 
3.0%
1767
 
2.8%
1725
 
2.8%
1352
 
2.2%
1167
 
1.9%
1143
 
1.8%
1128
 
1.8%
1093
 
1.8%
Other values (296) 38619
61.9%
Uppercase Letter
ValueCountFrequency (%)
B 10
19.2%
K 9
17.3%
N 4
 
7.7%
C 4
 
7.7%
V 4
 
7.7%
A 4
 
7.7%
I 3
 
5.8%
S 3
 
5.8%
O 3
 
5.8%
E 2
 
3.8%
Other values (5) 6
11.5%
Close Punctuation
ValueCountFrequency (%)
) 8745
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8745
100.0%
Space Separator
ValueCountFrequency (%)
708
100.0%
Other Punctuation
ValueCountFrequency (%)
. 6
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 62382
77.4%
Common 18204
 
22.6%
Latin 52
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9291
 
14.9%
3249
 
5.2%
1846
 
3.0%
1767
 
2.8%
1725
 
2.8%
1352
 
2.2%
1167
 
1.9%
1143
 
1.8%
1128
 
1.8%
1093
 
1.8%
Other values (297) 38621
61.9%
Latin
ValueCountFrequency (%)
B 10
19.2%
K 9
17.3%
N 4
 
7.7%
C 4
 
7.7%
V 4
 
7.7%
A 4
 
7.7%
I 3
 
5.8%
S 3
 
5.8%
O 3
 
5.8%
E 2
 
3.8%
Other values (5) 6
11.5%
Common
ValueCountFrequency (%)
) 8745
48.0%
( 8745
48.0%
708
 
3.9%
. 6
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 62380
77.4%
ASCII 18256
 
22.6%
None 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
9291
 
14.9%
3249
 
5.2%
1846
 
3.0%
1767
 
2.8%
1725
 
2.8%
1352
 
2.2%
1167
 
1.9%
1143
 
1.8%
1128
 
1.8%
1093
 
1.8%
Other values (296) 38619
61.9%
ASCII
ValueCountFrequency (%)
) 8745
47.9%
( 8745
47.9%
708
 
3.9%
B 10
 
0.1%
K 9
 
< 0.1%
. 6
 
< 0.1%
N 4
 
< 0.1%
C 4
 
< 0.1%
V 4
 
< 0.1%
A 4
 
< 0.1%
Other values (9) 17
 
0.1%
None
ValueCountFrequency (%)
2
100.0%

Correlations

2023-12-12T10:43:24.800799image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
용도업종
용도1.0000.186
업종0.1861.000
2023-12-12T10:43:24.893304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
용도업종
용도1.0000.306
업종0.3061.000
2023-12-12T10:43:25.275685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
용도업종
용도1.0000.306
업종0.3061.000

Missing values

2023-12-12T10:43:19.671908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T10:43:19.853950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T10:43:20.003486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

용도허가번호업종품목명성분명(국문)성분명(영문)허가일업체명
5381동물용의약품054-082수입다이펜 주사제(Dipen)페니실린지프로카인+디하이드로스트렙토마이신Penicillin G procaine+Dihydrostreptomycin sulfate2015-12-18화성동물약품(주)
8213동물용의료기기036-015수입환축감시장치[2](G3 vet patient monitor)환축감시장치[2]Animal patient monitor2013-12-09한국동물약품공업협동조합
5005동물용의약품042-045제조서울-살리노60(SALINO-60)살리노마이신나트륨Salinomysin sodium1996-12-14(주)서울신약
3246동물용의약품024-174제조다원린스펙린코마이신+스펙티노마이신Lincomycin+Spectinomycin2008-01-04(주)다원케미칼
8669동물용의료기기059-011수입재사용가능동물안과용큐렛[1](Acrivet Arlt lens loop)재사용가능안과용큐렛[1]Curette, ophthalmic, reusable2015-07-06하이퍼메딕스
4457동물용의약품034-130제조프로펜콜 2.3액플로르페니콜Florfenicol2004-06-11한국썸벧(주)
6983동물용의약품385-001제조포비맘 액(포비돈요오드)(POVIMAM)포비돈+요오드Povidone+Iodine2019-08-05K.V 바이오젠
3810동물용의약품029-173제조에스에프네오산110 첨가제(SF Neosan 110 Feed Additive)네오마이신Neomycin sulfate2009-07-29(주)에스에프
1978동물용의약품015-135제조비-비 박스 3(B-B VAX 3)전염성비기관염+바이러스성설사증+파라인플루엔자IBR+BVD+Parainfluenza1986-11-17녹십자수의약품(주)
5296동물용의약품052-003제조에톡시퀸-에이-50(ETHOXYQUIN-A-50)에톡시퀸Ethoxyquine1982-04-10동선산업(주)
용도허가번호업종품목명성분명(국문)성분명(영문)허가일업체명
2139동물용의약품015-329제조수출용 녹수테라10-주(GC TERA 10-Inj.)옥시테트라싸이클린염산염Oxytetracycline HCl2013-09-16녹십자수의약품(주)
4689동물용의약품037-011제조맘마소마기타영양공급약Mineral supplemental preparations1974-04-18(주)유니바이오테크
3285동물용의약품024-222제조다원 디크라주릴 2.5 액 (디크라주릴)(DaOne Diclazuril 2.5 Solution)디클라주릴Diclazuril2017-11-29(주)다원케미칼
1495동물용의약품011-256제조대성 카라실 주(CARASIL)부타포스판Butaphosphan2004-01-10(주)대성미생물연구소
4433동물용의약품034-092제조아쿠아 치암콜치암페니콜Thiamphenicol1996-04-19한국썸벧(주)
7699동물용의약외품181-001제조뉴어스텍(NEW EARTHTEC)구리염화물Copper salt2010-04-14(주)소프트아쿠아
5911동물용의약품102-003제조싸이로마진10%(CYROMAZINE10%)사이로마진Cyromazine2000-06-21대정화금(주)
4892동물용의약품037-045수입피테이즈노보 CT(PHYTASE NOVO CT)효소유기물+파테이즈Enzymes+Phytase1997-08-25(주)동방
3775동물용의약품029-132제조SF 카토판-S부타포스판Butaphosphan2003-01-14(주)에스에프
9768동물용의료기기146-004수입폴리글리콜산봉합사[4](152403 외 2종)폴리글리콜산봉합사[4]Polyglycolic acid suture2017-11-22비엘엔에이치(주)

Duplicate rows

Most frequently occurring

용도허가번호업종품목명성분명(국문)성분명(영문)허가일업체명# duplicates
0동물용의료기기037-007수입동물 전신용전산화단층엑스선촬영장치[2](SOMATOM Emotion 6-slice configuration)전신용전산화단층엑스선촬영장치[2]CT system, full-body2011-09-30지멘스헬시니어스(주)2