Overview

Dataset statistics

Number of variables: 13
Number of observations: 4022
Missing cells: 3109
Missing cells (%): 5.9%
Duplicate rows: 1
Duplicate rows (%): < 0.1%
Total size in memory: 408.6 KiB
Average record size in memory: 104.0 B
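
The derived figures in this table follow from the raw counts. A minimal sketch of the arithmetic, assuming the missing percentage is taken over all cells (variables × observations) and that 1 KiB is 1024 bytes; the variable names are illustrative only:

```python
# Raw figures copied from the dataset statistics.
n_variables = 13
n_observations = 4022
missing_cells = 3109
total_size_kib = 408.6

# Share of missing cells over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Average in-memory size of one record, in bytes (assumes 1 KiB = 1024 B).
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```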

Variable types

Text: 8
Categorical: 4
DateTime: 1

Dataset

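Description: Information on household chemical products that violated the safety and labeling standards under the Act on the Safety Management of Household Chemical Products and Biocidal Products (생활화학제품 및 살생물제의 안전관리에 관한 법률). Fields include the violation details and the actions taken.
Author: Ministry of Environment (환경부)
URL: https://www.data.go.kr/data/15118357/fileData.do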

Alerts

Dataset has 1 (< 0.1%) duplicate rows [Duplicates]
생산지 is highly imbalanced (52.0%) [Imbalance]
출처 is highly imbalanced (80.9%) [Imbalance]
제품군 소분류 has 806 (20.0%) missing values [Missing]
업체주소 has 270 (6.7%) missing values [Missing]
기타 has 2033 (50.5%) missing values [Missing]

Reproduction

Analysis started: 2023-12-12 23:08:45.722729
Analysis finished: 2023-12-12 23:08:48.384791
Duration: 2.66 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
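
The reported duration is the difference between the two timestamps. A small sketch of that check, assuming both timestamps are in the same time zone:

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
fmt = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2023-12-12 23:08:45.722729", fmt)
finished = datetime.strptime("2023-12-12 23:08:48.384791", fmt)

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"Duration: {duration:.2f} seconds")
```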

Variables

Distinct: 3616
Distinct (%): 89.9%
Missing: 0
Missing (%): 0.0%
Memory size: 31.6 KiB

Length

Max length: 79
Median length: 48
Mean length: 14.299354
Min length: 1

Characters and Unicode

Total characters: 57512
Distinct characters: 921
Distinct categories: 16
Distinct scripts: 4
Distinct blocks: 8
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
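
As an illustration of how such per-category counts can be obtained, the sketch below uses Python's standard `unicodedata` module on three of this variable's sample values. It is not the profiler's own implementation; the `CATEGORY_NAMES` mapping and the `category_counts` helper are hypothetical names introduced here, and the mapping covers only a few category codes.

```python
import unicodedata
from collections import Counter

# Long names for a few general-category codes, matching the labels in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Nd": "Decimal Number",
}

def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            # Fall back to the two-letter code when no long name is listed.
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

sample = ["희드라(가루세제)", "썬플라워캔들", "향기 리스타블렛(페어프리지아향)"]
counts = category_counts(sample)
for name, count in counts.most_common():
    print(name, count)
```

Precomposed Hangul syllables fall under Other Letter, which is why that category dominates the tables for the Korean text variables.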

Unique

Unique: 3304
Unique (%): 82.1%

Sample

1st row: 희드라(가루세제)
2nd row: 썬플라워캔들
3rd row: 디퓨저(페어프리지아향)
4th row: 트리캔들(우드앤솔트향)
5th row: 향기 리스타블렛(페어프리지아향)

Value: Count (Frequency %)
(value not recoverable): 287 (2.8%)
글루: 110 (1.1%)
차량용: 106 (1.0%)
wet: 100 (1.0%)
캔들: 94 (0.9%)
1: 74 (0.7%)
wipes: 65 (0.6%)
방향제: 61 (0.6%)
디퓨저: 56 (0.5%)
석고방향제: 55 (0.5%)
Other values (5375): 9302 (90.2%)

Most occurring characters

ValueCountFrequency (%)
6395
 
11.1%
( 1631
 
2.8%
) 1630
 
2.8%
1204
 
2.1%
e 1167
 
2.0%
1114
 
1.9%
1089
 
1.9%
856
 
1.5%
a 738
 
1.3%
723
 
1.3%
Other values (911) 40965
71.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 31950
55.6%
Lowercase Letter 9416
 
16.4%
Space Separator 6395
 
11.1%
Uppercase Letter 4588
 
8.0%
Open Punctuation 1639
 
2.8%
Close Punctuation 1638
 
2.8%
Decimal Number 1349
 
2.3%
Dash Punctuation 255
 
0.4%
Other Punctuation 208
 
0.4%
Math Symbol 37
 
0.1%
Other values (6) 37
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1204
 
3.8%
1114
 
3.5%
1089
 
3.4%
856
 
2.7%
723
 
2.3%
615
 
1.9%
584
 
1.8%
574
 
1.8%
528
 
1.7%
461
 
1.4%
Other values (822) 24202
75.7%
Lowercase Letter
ValueCountFrequency (%)
e 1167
12.4%
a 738
 
7.8%
i 687
 
7.3%
o 681
 
7.2%
l 643
 
6.8%
r 611
 
6.5%
t 602
 
6.4%
s 593
 
6.3%
n 575
 
6.1%
c 435
 
4.6%
Other values (16) 2684
28.5%
Uppercase Letter
ValueCountFrequency (%)
E 416
 
9.1%
A 410
 
8.9%
I 301
 
6.6%
L 301
 
6.6%
R 287
 
6.3%
S 270
 
5.9%
T 259
 
5.6%
C 254
 
5.5%
N 253
 
5.5%
O 236
 
5.1%
Other values (16) 1601
34.9%
Other Punctuation
ValueCountFrequency (%)
& 77
37.0%
, 39
18.8%
. 30
 
14.4%
# 16
 
7.7%
/ 16
 
7.7%
% 12
 
5.8%
· 6
 
2.9%
' 5
 
2.4%
: 4
 
1.9%
* 2
 
1.0%
Decimal Number
ValueCountFrequency (%)
0 364
27.0%
1 327
24.2%
2 177
13.1%
5 120
 
8.9%
3 108
 
8.0%
4 68
 
5.0%
6 58
 
4.3%
7 53
 
3.9%
8 39
 
2.9%
9 35
 
2.6%
Other Number
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Open Punctuation
ValueCountFrequency (%)
( 1631
99.5%
[ 8
 
0.5%
Close Punctuation
ValueCountFrequency (%)
) 1630
99.5%
] 8
 
0.5%
Math Symbol
ValueCountFrequency (%)
+ 35
94.6%
~ 2
 
5.4%
Space Separator
ValueCountFrequency (%)
6395
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 255
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 21
100.0%
Final Punctuation
ValueCountFrequency (%)
9
100.0%
Control
ValueCountFrequency (%)
2
100.0%
Modifier Symbol
ValueCountFrequency (%)
´ 1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 31948
55.6%
Latin 14004
24.3%
Common 11558
 
20.1%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1204
 
3.8%
1114
 
3.5%
1089
 
3.4%
856
 
2.7%
723
 
2.3%
615
 
1.9%
584
 
1.8%
574
 
1.8%
528
 
1.7%
461
 
1.4%
Other values (820) 24200
75.7%
Latin
ValueCountFrequency (%)
e 1167
 
8.3%
a 738
 
5.3%
i 687
 
4.9%
o 681
 
4.9%
l 643
 
4.6%
r 611
 
4.4%
t 602
 
4.3%
s 593
 
4.2%
n 575
 
4.1%
c 435
 
3.1%
Other values (42) 7272
51.9%
Common
ValueCountFrequency (%)
6395
55.3%
( 1631
 
14.1%
) 1630
 
14.1%
0 364
 
3.1%
1 327
 
2.8%
- 255
 
2.2%
2 177
 
1.5%
5 120
 
1.0%
3 108
 
0.9%
& 77
 
0.7%
Other values (27) 474
 
4.1%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 31946
55.5%
ASCII 25542
44.4%
Punctuation 9
 
< 0.1%
None 7
 
< 0.1%
Enclosed Alphanum 3
 
< 0.1%
Compat Jamo 2
 
< 0.1%
CJK 2
 
< 0.1%
CJK Compat 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6395
25.0%
( 1631
 
6.4%
) 1630
 
6.4%
e 1167
 
4.6%
a 738
 
2.9%
i 687
 
2.7%
o 681
 
2.7%
l 643
 
2.5%
r 611
 
2.4%
t 602
 
2.4%
Other values (72) 10757
42.1%
Hangul
ValueCountFrequency (%)
1204
 
3.8%
1114
 
3.5%
1089
 
3.4%
856
 
2.7%
723
 
2.3%
615
 
1.9%
584
 
1.8%
574
 
1.8%
528
 
1.7%
461
 
1.4%
Other values (819) 24198
75.7%
Punctuation
ValueCountFrequency (%)
9
100.0%
None
ValueCountFrequency (%)
· 6
85.7%
´ 1
 
14.3%
Compat Jamo
ValueCountFrequency (%)
2
100.0%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%
Enclosed Alphanum
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
CJK Compat
ValueCountFrequency (%)
1
100.0%

Distinct: 10
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 31.6 KiB

방향제류: 1345
기타류: 989
코팅/접착제류: 555
세제류: 434
미용제품: 427
Other values (5): 272

Length

Max length: 12
Median length: 9
Mean length: 4.2178021
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 세제류
2nd row: 기타류
3rd row: 방향제류
4th row: 기타류
5th row: 방향제류

Common Values

Value: Count (Frequency %)
방향제류: 1345 (33.4%)
기타류: 989 (24.6%)
코팅/접착제류: 555 (13.8%)
세제류: 434 (10.8%)
미용제품: 427 (10.6%)
살생물제류: 147 (3.7%)
염료/염색류: 78 (1.9%)
인쇄 및 문서관련 제품: 32 (0.8%)
자동차 전용 제품: 12 (0.3%)
보존/보존처리제품: 3 (0.1%)

Length

Histogram of lengths of the category

Common Values (Plot)

Value: Count (Frequency %)
방향제류: 1345 (32.5%)
기타류: 989 (23.9%)
코팅/접착제류: 555 (13.4%)
세제류: 434 (10.5%)
미용제품: 427 (10.3%)
살생물제류: 147 (3.5%)
염료/염색류: 78 (1.9%)
제품: 44 (1.1%)
인쇄: 32 (0.8%)
(value not recoverable): 32 (0.8%)
Other values (4): 59 (1.4%)

Distinct: 74
Distinct (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 31.6 KiB

Length

Max length22
Median length3
Mean length4.0932372
Min length1

Characters and Unicode

Total characters16463
Distinct characters103
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10 ?
Unique (%)0.2%

Sample

1st row세탁세제
2nd row
3rd row방향제
4th row
5th row방향제
ValueCountFrequency (%)
방향제 1217
23.9%
688
13.5%
접착제 436
 
8.6%
탈취제 351
 
6.9%
세정제 313
 
6.1%
살균제 243
 
4.8%
코팅제 225
 
4.4%
미용 223
 
4.4%
문신용 204
 
4.0%
염료 204
 
4.0%
Other values (60) 992
19.5%

Most occurring characters

ValueCountFrequency (%)
3663
22.2%
1302
 
7.9%
1232
 
7.5%
1074
 
6.5%
722
 
4.4%
592
 
3.6%
492
 
3.0%
481
 
2.9%
436
 
2.6%
422
 
2.6%
Other values (93) 6047
36.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 14836
90.1%
Space Separator 1074
 
6.5%
Other Punctuation 553
 
3.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3663
24.7%
1302
 
8.8%
1232
 
8.3%
722
 
4.9%
592
 
4.0%
492
 
3.3%
481
 
3.2%
436
 
2.9%
422
 
2.8%
408
 
2.8%
Other values (89) 5086
34.3%
Other Punctuation
ValueCountFrequency (%)
, 349
63.1%
/ 149
26.9%
· 55
 
9.9%
Space Separator
ValueCountFrequency (%)
1074
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 14836
90.1%
Common 1627
 
9.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3663
24.7%
1302
 
8.8%
1232
 
8.3%
722
 
4.9%
592
 
4.0%
492
 
3.3%
481
 
3.2%
436
 
2.9%
422
 
2.8%
408
 
2.8%
Other values (89) 5086
34.3%
Common
ValueCountFrequency (%)
1074
66.0%
, 349
 
21.5%
/ 149
 
9.2%
· 55
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 14836
90.1%
ASCII 1572
 
9.5%
None 55
 
0.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3663
24.7%
1302
 
8.8%
1232
 
8.3%
722
 
4.9%
592
 
4.0%
492
 
3.3%
481
 
3.2%
436
 
2.9%
422
 
2.8%
408
 
2.8%
Other values (89) 5086
34.3%
ASCII
ValueCountFrequency (%)
1074
68.3%
, 349
 
22.2%
/ 149
 
9.5%
None
ValueCountFrequency (%)
· 55
100.0%

제품군 소분류
Text

MISSING 

Distinct: 196
Distinct (%): 6.1%
Missing: 806
Missing (%): 20.0%
Memory size: 31.6 KiB

Length

Max length28
Median length27
Mean length4.2468905
Min length1

Characters and Unicode

Total characters13658
Distinct characters128
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique84 ?
Unique (%)2.6%

Sample

1st row세탁세제
2nd row
3rd row방향제
4th row
5th row방향제
ValueCountFrequency (%)
방향제 977
25.2%
643
16.6%
접착제 277
 
7.1%
세정제 174
 
4.5%
탈취제 156
 
4.0%
미용 155
 
4.0%
문신용 135
 
3.5%
염료 135
 
3.5%
살균제 119
 
3.1%
제거제 91
 
2.3%
Other values (171) 1018
26.2%

Most occurring characters

ValueCountFrequency (%)
2938
21.5%
1119
 
8.2%
1078
 
7.9%
671
 
4.9%
652
 
4.8%
590
 
4.3%
408
 
3.0%
351
 
2.6%
310
 
2.3%
304
 
2.2%
Other values (118) 5237
38.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 12387
90.7%
Space Separator 671
 
4.9%
Other Punctuation 467
 
3.4%
Close Punctuation 51
 
0.4%
Open Punctuation 51
 
0.4%
Dash Punctuation 31
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2938
23.7%
1119
 
9.0%
1078
 
8.7%
652
 
5.3%
590
 
4.8%
408
 
3.3%
351
 
2.8%
310
 
2.5%
304
 
2.5%
250
 
2.0%
Other values (109) 4387
35.4%
Other Punctuation
ValueCountFrequency (%)
, 179
38.3%
/ 147
31.5%
· 137
29.3%
. 3
 
0.6%
' 1
 
0.2%
Space Separator
ValueCountFrequency (%)
671
100.0%
Close Punctuation
ValueCountFrequency (%)
) 51
100.0%
Open Punctuation
ValueCountFrequency (%)
( 51
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 31
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 12387
90.7%
Common 1271
 
9.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2938
23.7%
1119
 
9.0%
1078
 
8.7%
652
 
5.3%
590
 
4.8%
408
 
3.3%
351
 
2.8%
310
 
2.5%
304
 
2.5%
250
 
2.0%
Other values (109) 4387
35.4%
Common
ValueCountFrequency (%)
671
52.8%
, 179
 
14.1%
/ 147
 
11.6%
· 137
 
10.8%
) 51
 
4.0%
( 51
 
4.0%
- 31
 
2.4%
. 3
 
0.2%
' 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 12386
90.7%
ASCII 1134
 
8.3%
None 137
 
1.0%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2938
23.7%
1119
 
9.0%
1078
 
8.7%
652
 
5.3%
590
 
4.8%
408
 
3.3%
351
 
2.8%
310
 
2.5%
304
 
2.5%
250
 
2.0%
Other values (108) 4386
35.4%
ASCII
ValueCountFrequency (%)
671
59.2%
, 179
 
15.8%
/ 147
 
13.0%
) 51
 
4.5%
( 51
 
4.5%
- 31
 
2.7%
. 3
 
0.3%
' 1
 
0.1%
None
ValueCountFrequency (%)
· 137
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

Distinct: 2500
Distinct (%): 62.2%
Missing: 0
Missing (%): 0.0%
Memory size: 31.6 KiB

Length

Max length60
Median length43
Mean length20.327449
Min length2

Characters and Unicode

Total characters81757
Distinct characters757
Distinct categories14 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1829 ?
Unique (%)45.5%

Sample

1st row[제조/판매] 천광(055-337-3501)
2nd row[제조/판매] 썬플라워
3rd row[제조/판매] Amour Christine
4th row[제조/판매] Amour Christine
5th row[제조/판매] Amour Christine
ValueCountFrequency (%)
제조/판매 917
 
10.6%
판매 461
 
5.3%
제조 446
 
5.2%
수입 295
 
3.4%
주식회사 274
 
3.2%
수입/판매 265
 
3.1%
제조,판매 87
 
1.0%
제조자 85
 
1.0%
판매자 82
 
1.0%
대진케미칼(031-435-8746 74
 
0.9%
Other values (3109) 5633
65.4%

Most occurring characters

ValueCountFrequency (%)
4929
 
6.0%
0 4795
 
5.9%
- 4523
 
5.5%
] 3323
 
4.1%
[ 3322
 
4.1%
( 2761
 
3.4%
) 2758
 
3.4%
7 2675
 
3.3%
2 2509
 
3.1%
1 2410
 
2.9%
Other values (747) 47752
58.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 30811
37.7%
Decimal Number 23937
29.3%
Open Punctuation 6085
 
7.4%
Close Punctuation 6082
 
7.4%
Space Separator 4929
 
6.0%
Dash Punctuation 4523
 
5.5%
Other Punctuation 1804
 
2.2%
Uppercase Letter 1463
 
1.8%
Lowercase Letter 1333
 
1.6%
Other Symbol 702
 
0.9%
Other values (4) 88
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2394
 
7.8%
2372
 
7.7%
2096
 
6.8%
1966
 
6.4%
1080
 
3.5%
802
 
2.6%
761
 
2.5%
679
 
2.2%
596
 
1.9%
538
 
1.7%
Other values (665) 17527
56.9%
Lowercase Letter
ValueCountFrequency (%)
e 145
10.9%
o 138
 
10.4%
r 105
 
7.9%
a 102
 
7.7%
i 101
 
7.6%
n 93
 
7.0%
l 85
 
6.4%
t 80
 
6.0%
s 74
 
5.6%
c 63
 
4.7%
Other values (15) 347
26.0%
Uppercase Letter
ValueCountFrequency (%)
A 206
14.1%
R 165
11.3%
E 127
 
8.7%
O 118
 
8.1%
N 108
 
7.4%
I 103
 
7.0%
K 101
 
6.9%
S 57
 
3.9%
T 56
 
3.8%
C 48
 
3.3%
Other values (14) 374
25.6%
Decimal Number
ValueCountFrequency (%)
0 4795
20.0%
7 2675
11.2%
2 2509
10.5%
1 2410
10.1%
3 2403
10.0%
5 2112
8.8%
4 2010
8.4%
6 1884
 
7.9%
8 1753
 
7.3%
9 1386
 
5.8%
Other Punctuation
ValueCountFrequency (%)
/ 1488
82.5%
, 233
 
12.9%
. 56
 
3.1%
& 15
 
0.8%
: 9
 
0.5%
· 2
 
0.1%
* 1
 
0.1%
Math Symbol
ValueCountFrequency (%)
~ 33
71.7%
> 6
 
13.0%
< 6
 
13.0%
× 1
 
2.2%
Close Punctuation
ValueCountFrequency (%)
] 3323
54.6%
) 2758
45.3%
} 1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
[ 3322
54.6%
( 2761
45.4%
{ 2
 
< 0.1%
Space Separator
ValueCountFrequency (%)
4929
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4523
100.0%
Other Symbol
ValueCountFrequency (%)
702
100.0%
Control
ValueCountFrequency (%)
34
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 6
100.0%
Final Punctuation
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 47448
58.0%
Hangul 31508
38.5%
Latin 2796
 
3.4%
Han 5
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2394
 
7.6%
2372
 
7.5%
2096
 
6.7%
1966
 
6.2%
1080
 
3.4%
802
 
2.5%
761
 
2.4%
702
 
2.2%
679
 
2.2%
596
 
1.9%
Other values (663) 18060
57.3%
Latin
ValueCountFrequency (%)
A 206
 
7.4%
R 165
 
5.9%
e 145
 
5.2%
o 138
 
4.9%
E 127
 
4.5%
O 118
 
4.2%
N 108
 
3.9%
r 105
 
3.8%
I 103
 
3.7%
a 102
 
3.6%
Other values (39) 1479
52.9%
Common
ValueCountFrequency (%)
4929
 
10.4%
0 4795
 
10.1%
- 4523
 
9.5%
] 3323
 
7.0%
[ 3322
 
7.0%
( 2761
 
5.8%
) 2758
 
5.8%
7 2675
 
5.6%
2 2509
 
5.3%
1 2410
 
5.1%
Other values (22) 13443
28.3%
Han
ValueCountFrequency (%)
2
40.0%
2
40.0%
1
20.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 50239
61.4%
Hangul 30801
37.7%
None 705
 
0.9%
Compat Jamo 5
 
< 0.1%
CJK 5
 
< 0.1%
Punctuation 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4929
 
9.8%
0 4795
 
9.5%
- 4523
 
9.0%
] 3323
 
6.6%
[ 3322
 
6.6%
( 2761
 
5.5%
) 2758
 
5.5%
7 2675
 
5.3%
2 2509
 
5.0%
1 2410
 
4.8%
Other values (68) 16234
32.3%
Hangul
ValueCountFrequency (%)
2394
 
7.8%
2372
 
7.7%
2096
 
6.8%
1966
 
6.4%
1080
 
3.5%
802
 
2.6%
761
 
2.5%
679
 
2.2%
596
 
1.9%
538
 
1.7%
Other values (661) 17517
56.9%
None
ValueCountFrequency (%)
702
99.6%
· 2
 
0.3%
× 1
 
0.1%
Compat Jamo
ValueCountFrequency (%)
5
100.0%
Punctuation
ValueCountFrequency (%)
2
100.0%
CJK
ValueCountFrequency (%)
2
40.0%
2
40.0%
1
20.0%

업체주소
Text

MISSING 

Distinct: 2245
Distinct (%): 59.8%
Missing: 270
Missing (%): 6.7%
Memory size: 31.6 KiB

Length

Max length137
Median length58
Mean length28.074627
Min length5

Characters and Unicode

Total characters105336
Distinct characters592
Distinct categories13 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1601 ?
Unique (%)42.7%

Sample

1st row경상남도 함안군 군북면 함안산단6길 52
2nd row부산광역시 기장군 정관읍 정관5로, 112동 1205호
3rd row부산광역시 동래구 사직북로28번길 9
4th row부산광역시 동래구 사직북로28번길 9
5th row부산광역시 동래구 사직북로28번길 9
ValueCountFrequency (%)
경기도 1266
 
5.9%
서울특별시 577
 
2.7%
1층 291
 
1.4%
서울시 231
 
1.1%
인천광역시 210
 
1.0%
2층 188
 
0.9%
고양시 147
 
0.7%
부천시 119
 
0.6%
마포구 117
 
0.5%
시흥시 116
 
0.5%
Other values (5030) 18125
84.7%

Most occurring characters

ValueCountFrequency (%)
17722
 
16.8%
1 5115
 
4.9%
3540
 
3.4%
3312
 
3.1%
2 3110
 
3.0%
2775
 
2.6%
0 2724
 
2.6%
2715
 
2.6%
3 2355
 
2.2%
, 2337
 
2.2%
Other values (582) 59631
56.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 58922
55.9%
Decimal Number 21952
 
20.8%
Space Separator 17722
 
16.8%
Other Punctuation 2451
 
2.3%
Close Punctuation 1490
 
1.4%
Open Punctuation 1490
 
1.4%
Dash Punctuation 949
 
0.9%
Uppercase Letter 287
 
0.3%
Lowercase Letter 46
 
< 0.1%
Math Symbol 18
 
< 0.1%
Other values (3) 9
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3540
 
6.0%
3312
 
5.6%
2775
 
4.7%
2715
 
4.6%
1894
 
3.2%
1860
 
3.2%
1844
 
3.1%
1692
 
2.9%
1492
 
2.5%
1458
 
2.5%
Other values (525) 36340
61.7%
Uppercase Letter
ValueCountFrequency (%)
B 86
30.0%
A 62
21.6%
C 28
 
9.8%
D 19
 
6.6%
S 14
 
4.9%
T 11
 
3.8%
E 10
 
3.5%
F 9
 
3.1%
K 8
 
2.8%
R 8
 
2.8%
Other values (10) 32
 
11.1%
Lowercase Letter
ValueCountFrequency (%)
b 13
28.3%
r 12
26.1%
o 4
 
8.7%
s 4
 
8.7%
e 3
 
6.5%
u 3
 
6.5%
w 2
 
4.3%
h 1
 
2.2%
t 1
 
2.2%
j 1
 
2.2%
Other values (2) 2
 
4.3%
Decimal Number
ValueCountFrequency (%)
1 5115
23.3%
2 3110
14.2%
0 2724
12.4%
3 2355
10.7%
4 1814
 
8.3%
6 1666
 
7.6%
5 1506
 
6.9%
7 1332
 
6.1%
8 1199
 
5.5%
9 1131
 
5.2%
Other Punctuation
ValueCountFrequency (%)
, 2337
95.3%
/ 81
 
3.3%
. 33
 
1.3%
Math Symbol
ValueCountFrequency (%)
< 8
44.4%
> 8
44.4%
~ 2
 
11.1%
Close Punctuation
ValueCountFrequency (%)
) 1430
96.0%
] 60
 
4.0%
Open Punctuation
ValueCountFrequency (%)
( 1430
96.0%
[ 60
 
4.0%
Space Separator
ValueCountFrequency (%)
17722
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 949
100.0%
Control
ValueCountFrequency (%)
5
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 58925
55.9%
Common 46077
43.7%
Latin 334
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3540
 
6.0%
3312
 
5.6%
2775
 
4.7%
2715
 
4.6%
1894
 
3.2%
1860
 
3.2%
1844
 
3.1%
1692
 
2.9%
1492
 
2.5%
1458
 
2.5%
Other values (526) 36343
61.7%
Latin
ValueCountFrequency (%)
B 86
25.7%
A 62
18.6%
C 28
 
8.4%
D 19
 
5.7%
S 14
 
4.2%
b 13
 
3.9%
r 12
 
3.6%
T 11
 
3.3%
E 10
 
3.0%
F 9
 
2.7%
Other values (23) 70
21.0%
Common
ValueCountFrequency (%)
17722
38.5%
1 5115
 
11.1%
2 3110
 
6.7%
0 2724
 
5.9%
3 2355
 
5.1%
, 2337
 
5.1%
4 1814
 
3.9%
6 1666
 
3.6%
5 1506
 
3.3%
) 1430
 
3.1%
Other values (13) 6298
 
13.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 58922
55.9%
ASCII 46410
44.1%
None 3
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
17722
38.2%
1 5115
 
11.0%
2 3110
 
6.7%
0 2724
 
5.9%
3 2355
 
5.1%
, 2337
 
5.0%
4 1814
 
3.9%
6 1666
 
3.6%
5 1506
 
3.2%
) 1430
 
3.1%
Other values (45) 6631
 
14.3%
Hangul
ValueCountFrequency (%)
3540
 
6.0%
3312
 
5.6%
2775
 
4.7%
2715
 
4.6%
1894
 
3.2%
1860
 
3.2%
1844
 
3.1%
1692
 
2.9%
1492
 
2.5%
1458
 
2.5%
Other values (525) 36340
61.7%
None
ValueCountFrequency (%)
3
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%

생산지
Categorical

IMBALANCE 

Distinct: 33
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Memory size: 31.6 KiB

대한민국: 1759
<NA>: 1140
중국: 314
-: 198
미국: 189
Other values (28): 422

Length

Max length: 8
Median length: 4
Mean length: 3.4428145
Min length: 1

Unique

Unique: 10
Unique (%): 0.2%

Sample

1st row: 대한민국
2nd row: 대한민국
3rd row: 대한민국
4th row: 대한민국
5th row: 대한민국

Common Values

Value: Count (Frequency %)
대한민국: 1759 (43.7%)
<NA>: 1140 (28.3%)
중국: 314 (7.8%)
-: 198 (4.9%)
미국: 189 (4.7%)
한국: 154 (3.8%)
일본: 82 (2.0%)
독일: 30 (0.7%)
중국(미국): 29 (0.7%)
영국: 25 (0.6%)
Other values (23): 102 (2.5%)
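
The report lists Missing as 0 for 생산지, yet the labels `<NA>` and `-` cover a large share of the rows, and 대한민국 and 한국 appear as separate values. The sketch below estimates the effect of cleaning these up. It rests on two assumptions that the report itself does not confirm: that `<NA>` and `-` are placeholders rather than real origins, and that 한국 is a shorter spelling of 대한민국 (Republic of Korea). The names `PLACEHOLDERS` and `ALIASES` are hypothetical.

```python
# Counts copied from the Common Values table for 생산지 (top six values only).
n_observations = 4022
counts = {"대한민국": 1759, "<NA>": 1140, "중국": 314, "-": 198, "미국": 189, "한국": 154}

# Assumption: these labels are placeholders, not real places of origin.
PLACEHOLDERS = {"<NA>", "-"}
# Assumption: "한국" is a shorter spelling of "대한민국".
ALIASES = {"한국": "대한민국"}

# Rows that would count as missing if placeholders were treated as such.
placeholder_rows = sum(n for label, n in counts.items() if label in PLACEHOLDERS)
placeholder_pct = 100 * placeholder_rows / n_observations

# Merge aliases into their canonical label and drop placeholders.
merged = {}
for label, n in counts.items():
    if label in PLACEHOLDERS:
        continue
    canonical = ALIASES.get(label, label)
    merged[canonical] = merged.get(canonical, 0) + n

print(f"Placeholder rows: {placeholder_rows} ({placeholder_pct:.1f}%)")
print(merged)
```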

Length

Histogram of lengths of the category
ValueCountFrequency (%)
대한민국 1759
43.7%
na 1140
28.3%
중국 316
 
7.9%
198
 
4.9%
미국 189
 
4.7%
한국 154
 
3.8%
일본 82
 
2.0%
독일 30
 
0.7%
중국(미국 29
 
0.7%
영국 25
 
0.6%
Other values (22) 102
 
2.5%

출처
Categorical

IMBALANCE 

Distinct: 22
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 31.6 KiB

생활화학제품 및 살생물제의 안전관리에 관한 법률: 3390
<NA>: 487
화학제품안전법 제10조 및 제35조: 41
종전 화학물질의 등록 및 평가 등에 관한 법률, 생활화학제품 및 살생물제의 안전관리에 관한 법률: 36
자진신고: 33
Other values (17): 35

Length

Max length: 66
Median length: 26
Mean length: 23.268274
Min length: 4

Unique

Unique: 14
Unique (%): 0.3%

Sample

1st row: 생활화학제품 및 살생물제의 안전관리에 관한 법률
2nd row: 생활화학제품 및 살생물제의 안전관리에 관한 법률
3rd row: 생활화학제품 및 살생물제의 안전관리에 관한 법률
4th row: 생활화학제품 및 살생물제의 안전관리에 관한 법률
5th row: 생활화학제품 및 살생물제의 안전관리에 관한 법률

Common Values

Value: Count (Frequency %)
생활화학제품 및 살생물제의 안전관리에 관한 법률: 3390 (84.3%)
<NA>: 487 (12.1%)
화학제품안전법 제10조 및 제35조: 41 (1.0%)
종전 화학물질의 등록 및 평가 등에 관한 법률, 생활화학제품 및 살생물제의 안전관리에 관한 법률: 36 (0.9%)
자진신고: 33 (0.8%)
화학제품안전법 제10조 및 제36조: 13 (0.3%)
종전 화학물질의 등록 및 평가 등에 관한 법률: 4 (0.1%)
화학물질의 등록 및 평가 등에 관한 법률: 4 (0.1%)
뷰랩 버블 세정제: 1 (< 0.1%)
화학제품안전법 제10조 및 제37조: 1 (< 0.1%)
Other values (12): 12 (0.3%)
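
The 80.9% imbalance figure reported for 출처 can be reproduced from these counts. The sketch below assumes the score is one minus the Shannon entropy of the value counts, normalised by the base-2 logarithm of the number of distinct values. The report does not state how the score is computed, so treat this as a plausible reconstruction; `imbalance_score` is a hypothetical helper name.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k): 0 for a uniform distribution, near 1 when one value dominates."""
    k = len(counts)
    if k <= 1:
        return 0.0
    total = sum(counts)
    # Shannon entropy in bits of the empirical value distribution.
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)

# Counts from the Common Values table for 출처.
# The 12 "Other values" sum to 12, so each of them occurs exactly once.
counts = [3390, 487, 41, 36, 33, 13, 4, 4, 1, 1] + [1] * 12
score = imbalance_score(counts)
print(f"Imbalance: {100 * score:.1f}%")
```

Under this definition, a variable with evenly spread values scores 0, and the score rises toward 1 as a single value takes over.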

Length

Histogram of lengths of the category
ValueCountFrequency (%)
3530
16.3%
관한 3474
16.0%
법률 3473
16.0%
생활화학제품 3427
15.8%
살생물제의 3427
15.8%
안전관리에 3427
15.8%
na 487
 
2.2%
화학제품안전법 56
 
0.3%
제10조 56
 
0.3%
화학물질의 46
 
0.2%
Other values (36) 295
 
1.4%

처분권자
Categorical

Distinct: 16
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 31.6 KiB

한강유역환경청: 2427
낙동강유역환경청: 326
<NA>: 252
영산강유역환경청: 212
금강유역환경청: 210
Other values (11): 595

Length

Max length: 21
Median length: 7
Mean length: 6.7220288
Min length: 3

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 낙동강유역환경청
2nd row: 낙동강유역환경청
3rd row: 낙동강유역환경청
4th row: 낙동강유역환경청
5th row: 낙동강유역환경청

Common Values

Value: Count (Frequency %)
한강유역환경청: 2427 (60.3%)
낙동강유역환경청: 326 (8.1%)
<NA>: 252 (6.3%)
영산강유역환경청: 212 (5.3%)
금강유역환경청: 210 (5.2%)
대구지방환경청: 210 (5.2%)
한강청: 174 (4.3%)
원주지방환경청: 92 (2.3%)
전북지방환경청: 59 (1.5%)
대구청: 20 (0.5%)
Other values (6): 40 (1.0%)

Length

Histogram of lengths of the category
ValueCountFrequency (%)
한강유역환경청 2427
60.3%
낙동강유역환경청 326
 
8.1%
na 252
 
6.3%
영산강유역환경청 212
 
5.3%
금강유역환경청 210
 
5.2%
대구지방환경청 210
 
5.2%
한강청 174
 
4.3%
원주지방환경청 92
 
2.3%
전북지방환경청 59
 
1.5%
대구청 20
 
0.5%
Other values (7) 42
 
1.0%

Distinct: 711
Distinct (%): 17.7%
Missing: 0
Missing (%): 0.0%
Memory size: 31.6 KiB
Minimum: 2016-05-18 00:00:00
Maximum: 2023-06-30 00:00:00
Histogram with fixed size bins (bins=50)
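
For a sense of scale, the sketch below works out the date span and the width of each histogram bin. It assumes the 50 bins are of equal width and cover exactly the range from the minimum to the maximum date.

```python
from datetime import datetime

# Minimum and maximum copied from this variable's statistics.
minimum = datetime(2016, 5, 18)
maximum = datetime(2023, 6, 30)
bins = 50

# Total span of the data and the width of one equal-sized bin.
span = maximum - minimum
bin_width_days = span.days / bins

print(f"Span: {span.days} days, about {bin_width_days:.1f} days per bin")
```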

Distinct: 116
Distinct (%): 2.9%
Missing: 0
Missing (%): 0.0%
Memory size: 31.6 KiB

Length

Max length51
Median length16
Mean length14.44356
Min length4

Characters and Unicode

Total characters58092
Distinct characters82
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique46 ?
Unique (%)1.1%

Sample

1st row제조금지,판매금지,회수명령
2nd row제조금지,판매금지,회수명령
3rd row판매금지,회수명령
4th row판매금지,회수명령
5th row제조금지,판매금지,회수명령
ValueCountFrequency (%)
회수명령 3115
31.5%
판매금지 3078
31.1%
제조금지 1661
16.8%
수입금지 684
 
6.9%
제조금지,판매금지,회수명령 270
 
2.7%
판매금지,회수명령,개선명령 185
 
1.9%
개선명령 113
 
1.1%
수입금지,판매금지,회수명령 89
 
0.9%
판매금지,회수명령 56
 
0.6%
회수/명령 55
 
0.6%
Other values (83) 576
 
5.8%

Most occurring characters

ValueCountFrequency (%)
, 6828
11.8%
6686
11.5%
6685
11.5%
5908
10.2%
4795
8.3%
4358
7.5%
4358
7.5%
3999
6.9%
3869
6.7%
3869
6.7%
Other values (72) 6737
11.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 44901
77.3%
Other Punctuation 7081
 
12.2%
Space Separator 5908
 
10.2%
Open Punctuation 83
 
0.1%
Close Punctuation 83
 
0.1%
Lowercase Letter 14
 
< 0.1%
Math Symbol 14
 
< 0.1%
Decimal Number 5
 
< 0.1%
Control 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6686
14.9%
6685
14.9%
4795
10.7%
4358
9.7%
4358
9.7%
3999
8.9%
3869
8.6%
3869
8.6%
2110
 
4.7%
2093
 
4.7%
Other values (54) 2079
 
4.6%
Other Punctuation
ValueCountFrequency (%)
, 6828
96.4%
/ 128
 
1.8%
· 121
 
1.7%
. 3
 
< 0.1%
1
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
0 2
40.0%
3 2
40.0%
2 1
20.0%
Open Punctuation
ValueCountFrequency (%)
( 53
63.9%
[ 30
36.1%
Close Punctuation
ValueCountFrequency (%)
) 53
63.9%
] 30
36.1%
Lowercase Letter
ValueCountFrequency (%)
b 7
50.0%
r 7
50.0%
Math Symbol
ValueCountFrequency (%)
< 7
50.0%
> 7
50.0%
Space Separator
ValueCountFrequency (%)
5908
100.0%
Control
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 44901
77.3%
Common 13177
 
22.7%
Latin 14
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6686
14.9%
6685
14.9%
4795
10.7%
4358
9.7%
4358
9.7%
3999
8.9%
3869
8.6%
3869
8.6%
2110
 
4.7%
2093
 
4.7%
Other values (54) 2079
 
4.6%
Common
ValueCountFrequency (%)
, 6828
51.8%
5908
44.8%
/ 128
 
1.0%
· 121
 
0.9%
( 53
 
0.4%
) 53
 
0.4%
[ 30
 
0.2%
] 30
 
0.2%
< 7
 
0.1%
> 7
 
0.1%
Other values (6) 12
 
0.1%
Latin
ValueCountFrequency (%)
b 7
50.0%
r 7
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 44901
77.3%
ASCII 13069
 
22.5%
None 121
 
0.2%
Punctuation 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
, 6828
52.2%
5908
45.2%
/ 128
 
1.0%
( 53
 
0.4%
) 53
 
0.4%
[ 30
 
0.2%
] 30
 
0.2%
b 7
 
0.1%
< 7
 
0.1%
r 7
 
0.1%
Other values (6) 18
 
0.1%
Hangul
ValueCountFrequency (%)
6686
14.9%
6685
14.9%
4795
10.7%
4358
9.7%
4358
9.7%
3999
8.9%
3869
8.6%
3869
8.6%
2110
 
4.7%
2093
 
4.7%
Other values (54) 2079
 
4.6%
None
ValueCountFrequency (%)
· 121
100.0%
Punctuation
ValueCountFrequency (%)
1
100.0%

Distinct: 753
Distinct (%): 18.7%
Missing: 0
Missing (%): 0.0%
Memory size: 31.6 KiB

Length

Max length159
Median length108
Mean length31.203879
Min length5

Characters and Unicode

Total characters125502
Distinct characters303
Distinct categories14 ?
Distinct scripts4 ?
Distinct blocks8 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique442 ?
Unique (%)11.0%

Sample

1st row확인을 받지 아니한 제품 제조·판매
2nd row확인을 받지 아니한 제품 제조·판매
3rd row확인을 받지 아니한 제품 제조·판매
4th row확인을 받지 아니한 제품 제조·판매
5th row확인을 받지 아니한 제품 제조·판매
ValueCountFrequency (%)
미실시 2167
 
8.3%
2158
 
8.3%
표시사항 2050
 
7.8%
미표기 1730
 
6.6%
위반 1647
 
6.3%
안전기준 1614
 
6.2%
안전·표시기준 1410
 
5.4%
확인 969
 
3.7%
적합확인 828
 
3.2%
제품 583
 
2.2%
Other values (776) 10967
42.0%

Most occurring characters

ValueCountFrequency (%)
22006
 
17.5%
7082
 
5.6%
6787
 
5.4%
6722
 
5.4%
5153
 
4.1%
4956
 
3.9%
4874
 
3.9%
4263
 
3.4%
2863
 
2.3%
2777
 
2.2%
Other values (293) 58019
46.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 88826
70.8%
Space Separator 22006
 
17.5%
Other Punctuation 3778
 
3.0%
Close Punctuation 2987
 
2.4%
Open Punctuation 2978
 
2.4%
Decimal Number 2428
 
1.9%
Uppercase Letter 849
 
0.7%
Control 554
 
0.4%
Lowercase Letter 532
 
0.4%
Dash Punctuation 491
 
0.4%
Other values (4) 73
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7082
 
8.0%
6787
 
7.6%
6722
 
7.6%
5153
 
5.8%
4956
 
5.6%
4874
 
5.5%
4263
 
4.8%
2863
 
3.2%
2777
 
3.1%
2753
 
3.1%
Other values (235) 40596
45.7%
Uppercase Letter
ValueCountFrequency (%)
B 212
25.0%
M 149
17.6%
C 126
14.8%
T 86
10.1%
I 83
 
9.8%
D 69
 
8.1%
A 44
 
5.2%
H 28
 
3.3%
F 13
 
1.5%
G 10
 
1.2%
Other values (6) 29
 
3.4%
Other Punctuation
ValueCountFrequency (%)
· 2291
60.6%
, 819
 
21.7%
* 194
 
5.1%
. 155
 
4.1%
/ 123
 
3.3%
96
 
2.5%
: 47
 
1.2%
' 30
 
0.8%
% 18
 
0.5%
" 5
 
0.1%
Decimal Number
ValueCountFrequency (%)
0 774
31.9%
1 555
22.9%
2 458
18.9%
9 239
 
9.8%
3 142
 
5.8%
5 84
 
3.5%
4 63
 
2.6%
8 46
 
1.9%
6 44
 
1.8%
7 23
 
0.9%
Lowercase Letter
ValueCountFrequency (%)
g 236
44.4%
m 118
22.2%
k 118
22.2%
a 53
 
10.0%
o 7
 
1.3%
Close Punctuation
ValueCountFrequency (%)
) 2017
67.5%
] 936
31.3%
34
 
1.1%
Open Punctuation
ValueCountFrequency (%)
( 2009
67.5%
[ 935
31.4%
34
 
1.1%
Other Symbol
ValueCountFrequency (%)
50
92.6%
2
 
3.7%
2
 
3.7%
Other Number
ValueCountFrequency (%)
6
35.3%
6
35.3%
5
29.4%
Space Separator
ValueCountFrequency (%)
22006
100.0%
Control
ValueCountFrequency (%)
554
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 491
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 88823
70.8%
Common 35295
 
28.1%
Latin 1381
 
1.1%
Han 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7082
 
8.0%
6787
 
7.6%
6722
 
7.6%
5153
 
5.8%
4956
 
5.6%
4874
 
5.5%
4263
 
4.8%
2863
 
3.2%
2777
 
3.1%
2753
 
3.1%
Other values (234) 40593
45.7%
Common
ValueCountFrequency (%)
22006
62.3%
· 2291
 
6.5%
) 2017
 
5.7%
( 2009
 
5.7%
] 936
 
2.7%
[ 935
 
2.6%
, 819
 
2.3%
0 774
 
2.2%
1 555
 
1.6%
554
 
1.6%
Other values (27) 2399
 
6.8%
Latin
ValueCountFrequency (%)
g 236
17.1%
B 212
15.4%
M 149
10.8%
C 126
9.1%
m 118
8.5%
k 118
8.5%
T 86
 
6.2%
I 83
 
6.0%
D 69
 
5.0%
a 53
 
3.8%
Other values (11) 131
9.5%
Han
ValueCountFrequency (%)
3
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 88823
70.8%
ASCII 34150
 
27.2%
None 2359
 
1.9%
Punctuation 96
 
0.1%
Geometric Shapes 50
 
< 0.1%
Enclosed Alphanum 17
 
< 0.1%
CJK Compat 4
 
< 0.1%
CJK 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
22006
64.4%
) 2017
 
5.9%
( 2009
 
5.9%
] 936
 
2.7%
[ 935
 
2.7%
, 819
 
2.4%
0 774
 
2.3%
1 555
 
1.6%
554
 
1.6%
- 491
 
1.4%
Other values (38) 3054
 
8.9%
Hangul
ValueCountFrequency (%)
7082
 
8.0%
6787
 
7.6%
6722
 
7.6%
5153
 
5.8%
4956
 
5.6%
4874
 
5.5%
4263
 
4.8%
2863
 
3.2%
2777
 
3.1%
2753
 
3.1%
Other values (234) 40593
45.7%
None
ValueCountFrequency (%)
· 2291
97.1%
34
 
1.4%
34
 
1.4%
Punctuation
ValueCountFrequency (%)
96
100.0%
Geometric Shapes
ValueCountFrequency (%)
50
100.0%
Enclosed Alphanum
ValueCountFrequency (%)
6
35.3%
6
35.3%
5
29.4%
CJK
ValueCountFrequency (%)
3
100.0%
CJK Compat
ValueCountFrequency (%)
2
50.0%
2
50.0%

기타
Text

MISSING 

Distinct: 1211
Distinct (%): 60.9%
Missing: 2033
Missing (%): 50.5%
Memory size: 31.6 KiB

Length

Max length178
Median length118
Mean length40.020111
Min length1

Characters and Unicode

Total characters | 79600
Distinct characters | 554
Distinct categories | 17
Distinct scripts | 3
Distinct blocks | 9
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
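
As a rough illustration of how such per-category counts can be derived, the sketch below tallies Unicode general categories for one string using Python's standard `unicodedata` module. The helper name `category_counts` and the sample string are illustrative assumptions, not part of the report. The two-letter codes map onto the names used in this report (for example `Lo` is Other Letter and `Nd` is Decimal Number).

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count characters in `text` by Unicode general category code (hypothetical helper)."""
    return Counter(unicodedata.category(ch) for ch in text)


# Sample value modelled on the 기타 column: Hangul text plus a notification number.
sample = "신고번호(GB22-12-2057)"
counts = category_counts(sample)

for code, n in counts.most_common():
    print(code, n)
```

Summing such counters over every non-missing value of a column would give a table of the same shape as "Most occurring categories".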

Unique

Unique | 1014
Unique (%) | 51.0%

Sample

1st row | 유효기간 만료일(21.11.19, 자가검사번호:F-A02B-M00020002-A151) 이후 부터 신고일(22.11.15, 신고번호 : FB22-03-0252)이전까지 제조된 제품 유통 불가
2nd row | 자가검사번호(C-A09B-B128001-A160) 유효기간이 만료된 제품으로 2019.6.30 이후 제조된 제품 유통금지
3rd row | 신고증명서 발급(2023.6.18)이전 제조·판매 제품 유통불가, 신고번호(GB22-12-2057) 표시제품 유통 가능
4th row | 신고증명서 발급(2023.6.15)이전 제조·판매 제품 유통불가, 신고번호(GB21-26-2054) 표시제품 유통 가능
5th row | 향균/살균 문구 표기한 제품은 유통 금지, 신고증명서를 발급받은 (CB21-12-1018) 방향제 품목은 유통가능
Value | Count | Frequency (%)
제품 | 827 | 6.1%
신고번호 | 489 | 3.6%
 | 486 | 3.6%
사용 | 435 | 3.2%
판매금지 | 416 | 3.1%
회수 | 415 | 3.1%
모두 | 391 | 2.9%
가능 | 388 | 2.9%
유통 | 378 | 2.8%
제품은 | 357 | 2.6%
Other values (2393) | 9008 | 66.3%

Most occurring characters

ValueCountFrequency (%)
11722
 
14.7%
0 4284
 
5.4%
2 3754
 
4.7%
1 3395
 
4.3%
- 2956
 
3.7%
2725
 
3.4%
2095
 
2.6%
. 1652
 
2.1%
1590
 
2.0%
1570
 
2.0%
Other values (544) 43857
55.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 40608 | 51.0%
Decimal Number | 15506 | 19.5%
Space Separator | 11722 | 14.7%
Uppercase Letter | 3273 | 4.1%
Dash Punctuation | 2956 | 3.7%
Other Punctuation | 2561 | 3.2%
Open Punctuation | 1300 | 1.6%
Close Punctuation | 1299 | 1.6%
Other Symbol | 144 | 0.2%
Lowercase Letter | 87 | 0.1%
Other values (7) | 144 | 0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2725
 
6.7%
2095
 
5.2%
1590
 
3.9%
1570
 
3.9%
1510
 
3.7%
1175
 
2.9%
1117
 
2.8%
1067
 
2.6%
906
 
2.2%
901
 
2.2%
Other values (455) 25952
63.9%
Uppercase Letter
Value | Count | Frequency (%)
B | 1326 | 40.5%
C | 641 | 19.6%
A | 357 | 10.9%
F | 244 | 7.5%
H | 221 | 6.8%
D | 186 | 5.7%
E | 96 | 2.9%
G | 88 | 2.7%
T | 19 | 0.6%
M | 16 | 0.5%
Other values (13) | 79 | 2.4%
Lowercase Letter
ValueCountFrequency (%)
l 17
19.5%
m 11
12.6%
g 9
10.3%
e 8
9.2%
a 6
 
6.9%
r 5
 
5.7%
n 4
 
4.6%
i 4
 
4.6%
o 4
 
4.6%
p 3
 
3.4%
Other values (9) 16
18.4%
Other Punctuation
ValueCountFrequency (%)
. 1652
64.5%
, 338
 
13.2%
· 212
 
8.3%
137
 
5.3%
' 76
 
3.0%
: 59
 
2.3%
/ 45
 
1.8%
" 30
 
1.2%
* 7
 
0.3%
& 3
 
0.1%
Decimal Number
Value | Count | Frequency (%)
0 | 4284 | 27.6%
2 | 3754 | 24.2%
1 | 3395 | 21.9%
9 | 866 | 5.6%
3 | 728 | 4.7%
6 | 624 | 4.0%
8 | 519 | 3.3%
5 | 488 | 3.1%
7 | 440 | 2.8%
4 | 408 | 2.6%
Other Number
ValueCountFrequency (%)
5
20.0%
5
20.0%
5
20.0%
5
20.0%
5
20.0%
Other Symbol
ValueCountFrequency (%)
81
56.2%
61
42.4%
1
 
0.7%
1
 
0.7%
Math Symbol
ValueCountFrequency (%)
~ 13
86.7%
< 1
 
6.7%
> 1
 
6.7%
Open Punctuation
ValueCountFrequency (%)
( 1281
98.5%
[ 19
 
1.5%
Close Punctuation
ValueCountFrequency (%)
) 1280
98.5%
] 19
 
1.5%
Modifier Symbol
ValueCountFrequency (%)
` 71
98.6%
˙ 1
 
1.4%
Initial Punctuation
ValueCountFrequency (%)
12
63.2%
7
36.8%
Final Punctuation
ValueCountFrequency (%)
7
77.8%
2
 
22.2%
Space Separator
ValueCountFrequency (%)
11722
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2956
100.0%
Control
ValueCountFrequency (%)
3
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 40689 | 51.1%
Common | 35551 | 44.7%
Latin | 3360 | 4.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2725
 
6.7%
2095
 
5.1%
1590
 
3.9%
1570
 
3.9%
1510
 
3.7%
1175
 
2.9%
1117
 
2.7%
1067
 
2.6%
906
 
2.2%
901
 
2.2%
Other values (456) 26033
64.0%
Common
ValueCountFrequency (%)
11722
33.0%
0 4284
 
12.1%
2 3754
 
10.6%
1 3395
 
9.5%
- 2956
 
8.3%
. 1652
 
4.6%
( 1281
 
3.6%
) 1280
 
3.6%
9 866
 
2.4%
3 728
 
2.0%
Other values (36) 3633
 
10.2%
Latin
Value | Count | Frequency (%)
B | 1326 | 39.5%
C | 641 | 19.1%
A | 357 | 10.6%
F | 244 | 7.3%
H | 221 | 6.6%
D | 186 | 5.5%
E | 96 | 2.9%
G | 88 | 2.6%
T | 19 | 0.6%
l | 17 | 0.5%
Other values (32) | 165 | 4.9%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 40579 | 51.0%
ASCII | 38445 | 48.3%
None | 293 | 0.4%
Punctuation | 165 | 0.2%
Geometric Shapes | 61 | 0.1%
Compat Jamo | 29 | < 0.1%
Enclosed Alphanum | 25 | < 0.1%
CJK Compat | 2 | < 0.1%
Modifier Letters | 1 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
11722
30.5%
0 4284
 
11.1%
2 3754
 
9.8%
1 3395
 
8.8%
- 2956
 
7.7%
. 1652
 
4.3%
B 1326
 
3.4%
( 1281
 
3.3%
) 1280
 
3.3%
9 866
 
2.3%
Other values (63) 5929
15.4%
Hangul
ValueCountFrequency (%)
2725
 
6.7%
2095
 
5.2%
1590
 
3.9%
1570
 
3.9%
1510
 
3.7%
1175
 
2.9%
1117
 
2.8%
1067
 
2.6%
906
 
2.2%
901
 
2.2%
Other values (453) 25923
63.9%
None
ValueCountFrequency (%)
· 212
72.4%
81
 
27.6%
Punctuation
ValueCountFrequency (%)
137
83.0%
12
 
7.3%
7
 
4.2%
7
 
4.2%
2
 
1.2%
Geometric Shapes
ValueCountFrequency (%)
61
100.0%
Compat Jamo
ValueCountFrequency (%)
25
86.2%
4
 
13.8%
Enclosed Alphanum
ValueCountFrequency (%)
5
20.0%
5
20.0%
5
20.0%
5
20.0%
5
20.0%
CJK Compat
ValueCountFrequency (%)
1
50.0%
1
50.0%
Modifier Letters
ValueCountFrequency (%)
˙ 1
100.0%

Correlations

variable | 제품군 대분류 | 제품군 중분류 | 생산지 | 출처 | 처분권자
제품군 대분류 | 1.000 | 1.000 | 0.542 | 0.175 | 0.300
제품군 중분류 | 1.000 | 1.000 | 0.674 | 0.431 | 0.591
생산지 | 0.542 | 0.674 | 1.000 | 0.519 | 0.415
출처 | 0.175 | 0.431 | 0.519 | 1.000 | 0.408
처분권자 | 0.300 | 0.591 | 0.415 | 0.408 | 1.000

variable | 처분권자 | 제품군 대분류 | 생산지 | 출처
처분권자 | 1.000 | 0.116 | 0.137 | 0.157
제품군 대분류 | 0.116 | 1.000 | 0.223 | 0.065
생산지 | 0.137 | 0.223 | 1.000 | 0.157
출처 | 0.157 | 0.065 | 0.157 | 1.000

variable | 제품군 대분류 | 생산지 | 출처 | 처분권자
제품군 대분류 | 1.000 | 0.223 | 0.065 | 0.116
생산지 | 0.223 | 1.000 | 0.157 | 0.137
출처 | 0.065 | 0.157 | 1.000 | 0.157
처분권자 | 0.116 | 0.137 | 0.157 | 1.000
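
The report does not name the association measures behind these matrices. Purely as an illustration of how a score between 0 and 1 can be computed for two categorical columns, the sketch below implements the plain (uncorrected) Cramér's V. Choosing this measure is an assumption, the function name `cramers_v` is hypothetical, and the toy values are not taken from the dataset.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys):
    """Uncorrected Cramér's V for two equal-length sequences of category labels."""
    n = len(xs)
    joint = Counter(zip(xs, ys))   # observed cell counts
    rows = Counter(xs)             # marginal counts of the first variable
    cols = Counter(ys)             # marginal counts of the second variable

    chi2 = 0.0
    for r, n_r in rows.items():
        for c, n_c in cols.items():
            expected = n_r * n_c / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(rows), len(cols)) - 1
    if k == 0:
        return 0.0  # a constant column carries no association
    return sqrt(chi2 / (n * k))


# Toy data: the second column is fully determined by the first.
major = ["세제류", "세제류", "방향제류", "방향제류"]
middle = ["세정제", "세정제", "방향제", "방향제"]
v_dependent = cramers_v(major, middle)

# Toy data: the two columns are independent.
v_independent = cramers_v(["a", "a", "b", "b"], ["x", "y", "x", "y"])
```

A value of 1.000, such as the one reported between 제품군 대분류 and 제품군 중분류 in the first matrix, is what a measure of this kind returns when the two columns' categories map one-to-one.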

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
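
A minimal sketch of the idea, assuming nullity correlation means the Pearson correlation between per-column present/missing indicators. The function name `nullity_correlation`, the use of `None` for missing cells, and the toy rows are all illustrative assumptions.

```python
from math import sqrt


def nullity_correlation(rows, col_a, col_b):
    """Pearson correlation between the presence indicators of two columns.

    `rows` is a list of dicts; a missing cell is represented by None.
    Returns None when either column is fully present or fully missing,
    because the correlation is undefined without variation.
    """
    a = [0.0 if row.get(col_a) is None else 1.0 for row in rows]
    b = [0.0 if row.get(col_b) is None else 1.0 for row in rows]
    n = len(rows)
    mean_a, mean_b = sum(a) / n, sum(b) / n

    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return None
    return cov / sqrt(var_a * var_b)


# Toy rows: 업체주소 and 생산지 are missing together, 제품명 is always present.
toy = [
    {"제품명": "A", "업체주소": "addr", "생산지": "KR"},
    {"제품명": "B", "업체주소": None, "생산지": None},
    {"제품명": "C", "업체주소": "addr", "생산지": "KR"},
    {"제품명": "D", "업체주소": None, "생산지": None},
]
together = nullity_correlation(toy, "업체주소", "생산지")
undefined = nullity_correlation(toy, "제품명", "생산지")
```

A value near 1 means two columns tend to be missing in the same rows, and a value near -1 means one tends to be present when the other is missing.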

Sample

index | 제품명 | 제품군 대분류 | 제품군 중분류 | 제품군 소분류 | 업체명 | 업체주소 | 생산지 | 출처 | 처분권자 | 조치일자 | 조치내용 | 위반내용 | 기타
0 | 희드라(가루세제) | 세제류 | 세탁세제 | 세탁세제 | [제조/판매] 천광(055-337-3501) | 경상남도 함안군 군북면 함안산단6길 52 | 대한민국 | 생활화학제품 및 살생물제의 안전관리에 관한 법률 | 낙동강유역환경청 | 2023-02-21 | 제조금지,판매금지,회수명령 | 확인을 받지 아니한 제품 제조·판매 | 유효기간 만료일(21.11.19, 자가검사번호:F-A02B-M00020002-A151) 이후 부터 신고일(22.11.15, 신고번호 : FB22-03-0252)이전까지 제조된 제품 유통 불가
1 | 썬플라워캔들 | 기타류 |  |  | [제조/판매] 썬플라워 | 부산광역시 기장군 정관읍 정관5로, 112동 1205호 | 대한민국 | 생활화학제품 및 살생물제의 안전관리에 관한 법률 | 낙동강유역환경청 | 2023-02-16 | 제조금지,판매금지,회수명령 | 확인을 받지 아니한 제품 제조·판매 | 자가검사번호(C-A09B-B128001-A160) 유효기간이 만료된 제품으로 2019.6.30 이후 제조된 제품 유통금지
2 | 디퓨저(페어프리지아향) | 방향제류 | 방향제 | 방향제 | [제조/판매] Amour Christine | 부산광역시 동래구 사직북로28번길 9 | 대한민국 | 생활화학제품 및 살생물제의 안전관리에 관한 법률 | 낙동강유역환경청 | 2023-03-29 | 판매금지,회수명령 | 확인을 받지 아니한 제품 제조·판매 | <NA>
3 | 트리캔들(우드앤솔트향) | 기타류 |  |  | [제조/판매] Amour Christine | 부산광역시 동래구 사직북로28번길 9 | 대한민국 | 생활화학제품 및 살생물제의 안전관리에 관한 법률 | 낙동강유역환경청 | 2023-03-29 | 판매금지,회수명령 | 확인을 받지 아니한 제품 제조·판매 | <NA>
4 | 향기 리스타블렛(페어프리지아향) | 방향제류 | 방향제 | 방향제 | [제조/판매] Amour Christine | 부산광역시 동래구 사직북로28번길 9 | 대한민국 | 생활화학제품 및 살생물제의 안전관리에 관한 법률 | 낙동강유역환경청 | 2023-03-29 | 제조금지,판매금지,회수명령 | 확인을 받지 아니한 제품 제조·판매 | <NA>
5 | 투빈페이퍼 친환경 디퓨저(러브스펠) | 방향제류 | 방향제 | 방향제 | [제조/판매] 투빈페이퍼 | 경상남도 창원시 마산회원구 구암서2길 110-1 천사유치원 1층 | 대한민국 | 생활화학제품 및 살생물제의 안전관리에 관한 법률 | 낙동강유역환경청 | 2023-06-15 | 제조금지,판매금지,회수명령 | 확인을 받지 아니한 제품 제조·판매 | <NA>
6 | 코지모지 룸 스프레이 포기(Foggy) | 방향제류 | 방향제 | 방향제 | [제조/판매] 코지모지 | 부산광역시 수영구 수영로 624번길 38-1, 1층 | 대한민국 | 생활화학제품 및 살생물제의 안전관리에 관한 법률 | 낙동강유역환경청 | 2023-06-27 | 제조금지,판매금지,회수명령 | 확인을 받지 아니한 제품 제조·판매 | 신고증명서 발급(2023.6.18)이전 제조·판매 제품 유통불가, 신고번호(GB22-12-2057) 표시제품 유통 가능
7 | 코지모지 컨테이너 캔들 포기(Foggy) | 기타류 |  |  | [제조/판매] 코지모지 | 부산광역시 수영구 수영로 624번길 38-1, 1층 | 대한민국 | 생활화학제품 및 살생물제의 안전관리에 관한 법률 | 낙동강유역환경청 | 2023-06-27 | 제조금지,판매금지,회수명령 | 확인을 받지 아니한 제품 제조·판매 | 신고증명서 발급(2023.6.15)이전 제조·판매 제품 유통불가, 신고번호(GB21-26-2054) 표시제품 유통 가능
8 | 그린N클린 허브향 패치 | 방향제류 | 방향제, 살균제 | 방향제(마스크용), 살균제 | [제조] G&C KOREA(032-507-3229) | 인천 남동구 용천로87번길 23, 3103-104호 | 대한민국 | 생활화학제품 및 살생물제의 안전관리에 관한 법률 | 한강유역환경청 | 2023-06-30 | 제조금지,판매금지,회수명령 | 안전·표시기준 위반[안전기준 적합확인 미실시 및 표시사항 미표기] | 향균/살균 문구 표기한 제품은 유통 금지, 신고증명서를 발급받은 (CB21-12-1018) 방향제 품목은 유통가능
9 | HERCULINER | 코팅/접착제류 | 특수목적 코팅제 | 특수목적코팅제/녹 방지제 | [수입/판매]GSTRADE | 충청북도 청주시 서원구 2순환로 1530번길 45(성화동, 주1동) | 남아프리카공화국 | 화학제품안전법 제10조 및 제36조 | 금강유역환경청 | 2023-06-19 | 수입금지,판매금지,회수명령 | 안전기준 적합 확인·신고 미실시 | <NA>

index | 제품명 | 제품군 대분류 | 제품군 중분류 | 제품군 소분류 | 업체명 | 업체주소 | 생산지 | 출처 | 처분권자 | 조치일자 | 조치내용 | 위반내용 | 기타
4012 | HYBRID COAT | 코팅/접착제류 | 코팅제 | <NA> | THE CLASS(1522-5355) | <NA> | <NA> | <NA> | <NA> | 2018-03-09 | 회수명령 | 폼알데하이드 기준 초과 | 전 제품
4013 | CC워터골드 | 코팅/접착제류 | 코팅제 | <NA> | 메이칸 | <NA> | <NA> | <NA> | <NA> | 2018-03-09 | 회수명령 | 자가검사 미실시 | 전 제품
4014 | 퍼실 겔 컬러(Persil-GEL COLOR) - 병행수입 | 세제류 | 합성세제 | <NA> | ㈜뉴스토아(031-940-4816) | <NA> | <NA> | <NA> | <NA> | 2018-03-09 | 회수명령 | 자가검사 미실시 | 전 제품
4015 | 샹떼클레어 다목적 세정제 마르실리아 | 세제류 | 세정제 | <NA> | ㈜쉬즈하우스(070-4070-1200) | <NA> | <NA> | <NA> | <NA> | 2018-03-09 | 회수명령 | PHMB 검출 | 전 제품
4016 | 샹떼클레어 다목적 세정제 라벤더 | 세제류 | 세정제 | <NA> | ㈜쉬즈하우스(070-4070-1200) | <NA> | <NA> | <NA> | <NA> | 2018-03-09 | 회수명령 | PHMB 검출 | 전 제품
4017 | 사니스틱 | 세제류 | 세정제 | <NA> | 플라잉 피그코리아(031-990-4005) | <NA> | <NA> | <NA> | <NA> | 2018-03-09 | 회수명령 | 자가검사 미실시 | 전 제품
4018 | 곰팡이세정제 | 세제류 | 세정제 | <NA> | 성진켐(1522-4400) | <NA> | <NA> | <NA> | <NA> | 2018-03-09 | 회수명령 | PHMB 검출 | 전 제품
4019 | 곰팡이OUT | 세제류 | 세정제 | <NA> | ㈜한국미라클피플사(080-900-7878) | <NA> | <NA> | <NA> | <NA> | 2018-03-09 | 회수명령 | PHMB 검출 | 전 제품
4020 | Motul 모튤 체인 클린 | 세제류 | 세정제 | <NA> | 리오오일 | <NA> | <NA> | <NA> | <NA> | 2018-03-09 | 회수명령 | 자가검사 미실시 | 전 제품
4021 | BRI114 | 세제류 | 세정제 | <NA> | ㈜그레이스인터내셔날(02-578-1550) | <NA> | <NA> | <NA> | <NA> | 2018-03-09 | 회수명령 | PHMB 검출 | 전 제품

Duplicate rows

Most frequently occurring

index | 제품명 | 제품군 대분류 | 제품군 중분류 | 제품군 소분류 | 업체명 | 업체주소 | 생산지 | 출처 | 처분권자 | 조치일자 | 조치내용 | 위반내용 | 기타 | # duplicates
0 | 렉솔 레더 클리너 | 세제류 | 세정제 | <NA> | ㈜케토시인터내셔널 (070-4412-4982) | <NA> | <NA> | <NA> | <NA> | 2016-07-28 | 회수명령 | 폼알데하이드 기준 초과 | 전제품 | 2
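
The "# duplicates" column counts how many times a fully identical row occurs. A small sketch of that counting, assuming rows are compared across all columns at once. The function name `duplicate_rows` and the shortened toy rows are illustrative, not the report's implementation.

```python
from collections import Counter


def duplicate_rows(rows):
    """Return {row: occurrences} for every row that appears more than once."""
    counts = Counter(tuple(row) for row in rows)
    return {row: n for row, n in counts.items() if n > 1}


# Toy rows with only three of the thirteen columns, for brevity.
toy_rows = [
    ("렉솔 레더 클리너", "2016-07-28", "회수명령"),
    ("HYBRID COAT", "2018-03-09", "회수명령"),
    ("렉솔 레더 클리너", "2016-07-28", "회수명령"),
]
dupes = duplicate_rows(toy_rows)

for row, n in dupes.items():
    print(row, n)
```

Under this reading, one distinct row occurring twice is consistent with the overview figure of 1 duplicate row and the count of 2 shown above.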