Overview

Dataset statistics

Number of variables: 20
Number of observations: 100
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 15.9 KiB
Average record size in memory: 162.3 B

Variable types

Numeric: 1
Text: 5
Categorical: 14

Alerts

함량정보_금지물질 has constant value "-" (Constant)
GHS주소4 has constant value "-" (Constant)
GHS주소5 has constant value "-" (Constant)
GHS주소6 has constant value "-" (Constant)
물질분류 is highly overall correlated with 함량정보_유독물질 and 5 other fields (High correlation)
함량정보_사고대비물질 is highly overall correlated with 함량정보_유독물질 and 2 other fields (High correlation)
유해성분류 is highly overall correlated with 물질분류 and 5 other fields (High correlation)
MSDS주소 is highly overall correlated with 함량정보_유독물질 and 2 other fields (High correlation)
GHS주소1 is highly overall correlated with 물질분류 and 4 other fields (High correlation)
GHS주소2 is highly overall correlated with 물질분류 and 7 other fields (High correlation)
함량정보_제한물질 is highly overall correlated with 물질분류 and 3 other fields (High correlation)
GHS주소3 is highly overall correlated with 물질분류 and 4 other fields (High correlation)
함량정보_유독물질 is highly overall correlated with 물질분류 and 7 other fields (High correlation)
유해위험문구주소 is highly overall correlated with 함량정보_사고대비물질 and 1 other field (High correlation)
물질분류 is highly imbalanced (67.0%) (Imbalance)
함량정보_유독물질 is highly imbalanced (67.1%) (Imbalance)
함량정보_제한물질 is highly imbalanced (91.9%) (Imbalance)
함량정보_사고대비물질 is highly imbalanced (91.9%) (Imbalance)
유해성분류 is highly imbalanced (67.0%) (Imbalance)
GHS주소1 is highly imbalanced (57.5%) (Imbalance)
GHS주소2 is highly imbalanced (59.2%) (Imbalance)
GHS주소3 is highly imbalanced (69.8%) (Imbalance)
MSDS주소 is highly imbalanced (91.9%) (Imbalance)
유해위험문구주소 is highly imbalanced (50.4%) (Imbalance)
연번 has unique values (Unique)
CAS번호 has unique values (Unique)
모바일MSDS주소 has unique values (Unique)
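
The imbalance percentages in the alerts above appear consistent with one minus the normalized Shannon entropy of each column's value counts. The sketch below rests on that assumption and is not a statement about ydata-profiling's internals. The function name `imbalance_score` is hypothetical, and the counts are read from the Common Values tables later in this report.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k) for a list of class counts.

    H is the Shannon entropy (in bits) of the class frequencies and k is the
    number of classes. Assumption: this mirrors the report's imbalance figure.
    0.0 means perfectly balanced; values near 1.0 mean one class dominates.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # a single class has no balance to measure
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)

# Value counts read from this report
ghs3_counts = [92, 5, 3]       # GHS주소3
restricted_counts = [99, 1]    # 함량정보_제한물질

print(f"{imbalance_score(ghs3_counts):.1%}")        # prints 69.8%
print(f"{imbalance_score(restricted_counts):.1%}")  # prints 91.9%
```

Both results match the percentages reported for GHS주소3 and 함량정보_제한물질, which is why the entropy-based reading seems plausible here.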

Reproduction

Analysis started: 2023-12-10 10:16:28.865313
Analysis finished: 2023-12-10 10:16:31.286476
Duration: 2.42 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
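
As a quick check, the reported duration can be recomputed from the two timestamps above. This is a minimal sketch using only the standard library; the variable names are illustrative.

```python
from datetime import datetime

# Timestamps copied from the Reproduction block of this report
started = datetime.fromisoformat("2023-12-10 10:16:28.865313")
finished = datetime.fromisoformat("2023-12-10 10:16:31.286476")

duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")  # prints "2.42 seconds"
```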

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct: 100
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 50.5
Minimum: 1
Maximum: 100
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 1
5th percentile: 5.95
Q1: 25.75
Median: 50.5
Q3: 75.25
95th percentile: 95.05
Maximum: 100
Range: 99
Interquartile range (IQR): 49.5
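
The quantiles above are what linear interpolation between neighbouring ranks gives for the sequence 1..100. The sketch below assumes 연번 is exactly that sequence (consistent with the minimum, maximum, distinct count, and sum reported for it); `linear_quantile` is a hypothetical helper, not part of any library.

```python
def linear_quantile(sorted_values, q):
    """Quantile q (0..1) by linear interpolation between the two closest ranks."""
    position = (len(sorted_values) - 1) * q
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])

values = list(range(1, 101))  # assumption: 연번 is the sequence 1..100

q1 = linear_quantile(values, 0.25)
q3 = linear_quantile(values, 0.75)
print(q1, q3, q3 - q1)  # prints "25.75 75.25 49.5"
```

With 100 values the rank positions fall between integers (for example 24.75 for Q1), which is why the quartiles are not whole numbers even though every value is.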

Descriptive statistics

Standard deviation: 29.011492
Coefficient of variation (CV): 0.57448499
Kurtosis: -1.2
Mean: 50.5
Median Absolute Deviation (MAD): 25
Skewness: 0
Sum: 5050
Variance: 841.66667
Monotonicity: Strictly increasing
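
Several of the descriptive statistics above can be reproduced with the standard library, again assuming 연번 is the sequence 1..100. The reported variance matches the sample variance (n - 1 denominator), and the coefficient of variation is the standard deviation divided by the mean.

```python
import statistics

values = list(range(1, 101))  # assumption: 연번 is the sequence 1..100

mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance, n - 1 denominator
std_dev = statistics.stdev(values)       # square root of the sample variance
cv = std_dev / mean                      # coefficient of variation
median = statistics.median(values)
# Median absolute deviation: median of the distances from the median
mad = statistics.median(abs(v - median) for v in values)

print(f"mean={mean} variance={variance:.5f} std={std_dev:.6f} cv={cv:.8f} mad={mad}")
```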
Histogram with fixed size bins (bins=50)
Value              Count  Frequency (%)
1                  1      1.0%
65                 1      1.0%
75                 1      1.0%
74                 1      1.0%
73                 1      1.0%
72                 1      1.0%
71                 1      1.0%
70                 1      1.0%
69                 1      1.0%
68                 1      1.0%
Other values (90)  90     90.0%

Value  Count  Frequency (%)
1      1      1.0%
2      1      1.0%
3      1      1.0%
4      1      1.0%
5      1      1.0%
6      1      1.0%
7      1      1.0%
8      1      1.0%
9      1      1.0%
10     1      1.0%

Value  Count  Frequency (%)
100    1      1.0%
99     1      1.0%
98     1      1.0%
97     1      1.0%
96     1      1.0%
95     1      1.0%
94     1      1.0%
93     1      1.0%
92     1      1.0%
91     1      1.0%
Distinct: 90
Distinct (%): 90.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 149
Median length: 50
Mean length: 21.51
Min length: 1

Characters and Unicode

Total characters: 2151
Distinct characters: 82
Distinct categories: 10
Distinct scripts: 4
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
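
As an illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories for one of the sample values listed for this variable; `category_counts` is a hypothetical helper, and the report itself may compute its tables differently.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count the characters of `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)

sample = "Acetic acid uranium(4+) zinc salt"  # a sample value of this variable
counts = category_counts(sample)
for code, count in counts.most_common():
    print(code, count)
```

The two-letter codes map onto the category names used in this report: `Ll` is Lowercase Letter, `Lu` Uppercase Letter, `Nd` Decimal Number, `Zs` Space Separator, `Pd` Dash Punctuation, `Ps` and `Pe` Open and Close Punctuation, `Po` Other Punctuation, `Sm` Math Symbol, and `Lo` Other Letter (which covers Hangul syllables).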

Unique

Unique: 89
Unique (%): 89.0%

Sample

1st row: Selenium oxide
2nd row: Iron vanadium tetraoxide
3rd row: Acetic acid uranium(4+) zinc salt
4th row: Antimony, compd. with thallium (1:1)
5th row: Zirconium, acetate lactate oxo ammonium complexes
ValueCountFrequency (%)
12
 
6.8%
acid 12
 
6.8%
alcohol 4
 
2.3%
1:1 2
 
1.1%
oxide 2
 
1.1%
n-octyl 1
 
0.6%
xylose 1
 
0.6%
d-lyxose 1
 
0.6%
1-pyrenol 1
 
0.6%
selenium 1
 
0.6%
Other values (139) 139
79.0%

Most occurring characters

ValueCountFrequency (%)
e 204
 
9.5%
n 173
 
8.0%
- 171
 
7.9%
i 133
 
6.2%
l 126
 
5.9%
o 124
 
5.8%
a 92
 
4.3%
t 90
 
4.2%
y 90
 
4.2%
76
 
3.5%
Other values (72) 872
40.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1481
68.9%
Dash Punctuation 171
 
7.9%
Uppercase Letter 166
 
7.7%
Decimal Number 123
 
5.7%
Space Separator 76
 
3.5%
Other Punctuation 68
 
3.2%
Open Punctuation 25
 
1.2%
Close Punctuation 25
 
1.2%
Other Letter 14
 
0.7%
Math Symbol 2
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 204
13.8%
n 173
11.7%
i 133
9.0%
l 126
8.5%
o 124
8.4%
a 92
 
6.2%
t 90
 
6.1%
y 90
 
6.1%
h 75
 
5.1%
c 68
 
4.6%
Other values (17) 306
20.7%
Uppercase Letter
ValueCountFrequency (%)
D 21
12.7%
P 16
9.6%
L 14
 
8.4%
M 14
 
8.4%
H 13
 
7.8%
N 12
 
7.2%
O 12
 
7.2%
A 10
 
6.0%
B 9
 
5.4%
E 8
 
4.8%
Other values (11) 37
22.3%
Other Letter
ValueCountFrequency (%)
2
14.3%
2
14.3%
1
7.1%
1
7.1%
1
7.1%
1
7.1%
1
7.1%
1
7.1%
1
7.1%
1
7.1%
Other values (2) 2
14.3%
Decimal Number
ValueCountFrequency (%)
2 29
23.6%
3 28
22.8%
1 28
22.8%
4 22
17.9%
6 6
 
4.9%
5 4
 
3.3%
0 3
 
2.4%
8 2
 
1.6%
9 1
 
0.8%
Other Punctuation
ValueCountFrequency (%)
, 29
42.6%
; 27
39.7%
. 8
 
11.8%
' 2
 
2.9%
: 2
 
2.9%
Open Punctuation
ValueCountFrequency (%)
( 17
68.0%
[ 8
32.0%
Close Punctuation
ValueCountFrequency (%)
) 17
68.0%
] 8
32.0%
Math Symbol
ValueCountFrequency (%)
± 1
50.0%
+ 1
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 171
100.0%
Space Separator
ValueCountFrequency (%)
76
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1639
76.2%
Common 490
 
22.8%
Hangul 14
 
0.7%
Greek 8
 
0.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 204
12.4%
n 173
 
10.6%
i 133
 
8.1%
l 126
 
7.7%
o 124
 
7.6%
a 92
 
5.6%
t 90
 
5.5%
y 90
 
5.5%
h 75
 
4.6%
c 68
 
4.1%
Other values (36) 464
28.3%
Common
ValueCountFrequency (%)
- 171
34.9%
76
15.5%
, 29
 
5.9%
2 29
 
5.9%
3 28
 
5.7%
1 28
 
5.7%
; 27
 
5.5%
4 22
 
4.5%
( 17
 
3.5%
) 17
 
3.5%
Other values (12) 46
 
9.4%
Hangul
ValueCountFrequency (%)
2
14.3%
2
14.3%
1
7.1%
1
7.1%
1
7.1%
1
7.1%
1
7.1%
1
7.1%
1
7.1%
1
7.1%
Other values (2) 2
14.3%
Greek
ValueCountFrequency (%)
α 6
75.0%
β 2
 
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2128
98.9%
Hangul 14
 
0.7%
None 9
 
0.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 204
 
9.6%
n 173
 
8.1%
- 171
 
8.0%
i 133
 
6.2%
l 126
 
5.9%
o 124
 
5.8%
a 92
 
4.3%
t 90
 
4.2%
y 90
 
4.2%
76
 
3.6%
Other values (57) 849
39.9%
None
ValueCountFrequency (%)
α 6
66.7%
β 2
 
22.2%
± 1
 
11.1%
Hangul
ValueCountFrequency (%)
2
14.3%
2
14.3%
1
7.1%
1
7.1%
1
7.1%
1
7.1%
1
7.1%
1
7.1%
1
7.1%
1
7.1%
Other values (2) 2
14.3%
Distinct97
Distinct (%)97.0%
Missing0
Missing (%)0.0%
Memory size932.0 B

Length

Max length86
Median length31
Mean length6.65
Min length1

Characters and Unicode

Total characters665
Distinct characters145
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique96 ?
Unique (%)96.0%

Sample

1st row셀레늄화합물질
2nd row-
3rd row-
4th row무기안티몬 화합물질
5th row-
ValueCountFrequency (%)
4
 
3.7%
디메톤-o 1
 
0.9%
3-메틸펜탄 1
 
0.9%
p-크실리딘 1
 
0.9%
m-크실리딘 1
 
0.9%
p-테르페닐 1
 
0.9%
4-페닐페놀 1
 
0.9%
m-테르페닐 1
 
0.9%
2-에틸페놀 1
 
0.9%
o-크실리딘 1
 
0.9%
Other values (94) 94
87.9%

Most occurring characters

ValueCountFrequency (%)
- 115
 
17.3%
1 20
 
3.0%
16
 
2.4%
15
 
2.3%
4 14
 
2.1%
13
 
2.0%
12
 
1.8%
2 12
 
1.8%
10
 
1.5%
3 10
 
1.5%
Other values (135) 428
64.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 407
61.2%
Dash Punctuation 115
 
17.3%
Decimal Number 59
 
8.9%
Uppercase Letter 41
 
6.2%
Close Punctuation 10
 
1.5%
Open Punctuation 10
 
1.5%
Lowercase Letter 9
 
1.4%
Space Separator 7
 
1.1%
Other Punctuation 7
 
1.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
16
 
3.9%
15
 
3.7%
13
 
3.2%
12
 
2.9%
10
 
2.5%
9
 
2.2%
9
 
2.2%
9
 
2.2%
9
 
2.2%
9
 
2.2%
Other values (105) 296
72.7%
Uppercase Letter
ValueCountFrequency (%)
L 9
22.0%
D 8
19.5%
P 7
17.1%
O 6
14.6%
N 5
12.2%
S 2
 
4.9%
M 2
 
4.9%
T 1
 
2.4%
R 1
 
2.4%
Decimal Number
ValueCountFrequency (%)
1 20
33.9%
4 14
23.7%
2 12
20.3%
3 10
16.9%
5 2
 
3.4%
9 1
 
1.7%
Lowercase Letter
ValueCountFrequency (%)
m 3
33.3%
n 2
22.2%
t 2
22.2%
e 1
 
11.1%
r 1
 
11.1%
Other Punctuation
ValueCountFrequency (%)
, 4
57.1%
' 1
 
14.3%
: 1
 
14.3%
; 1
 
14.3%
Close Punctuation
ValueCountFrequency (%)
] 5
50.0%
) 5
50.0%
Open Punctuation
ValueCountFrequency (%)
[ 5
50.0%
( 5
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 115
100.0%
Space Separator
ValueCountFrequency (%)
7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 407
61.2%
Common 208
31.3%
Latin 50
 
7.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
16
 
3.9%
15
 
3.7%
13
 
3.2%
12
 
2.9%
10
 
2.5%
9
 
2.2%
9
 
2.2%
9
 
2.2%
9
 
2.2%
9
 
2.2%
Other values (105) 296
72.7%
Common
ValueCountFrequency (%)
- 115
55.3%
1 20
 
9.6%
4 14
 
6.7%
2 12
 
5.8%
3 10
 
4.8%
7
 
3.4%
] 5
 
2.4%
[ 5
 
2.4%
( 5
 
2.4%
) 5
 
2.4%
Other values (6) 10
 
4.8%
Latin
ValueCountFrequency (%)
L 9
18.0%
D 8
16.0%
P 7
14.0%
O 6
12.0%
N 5
10.0%
m 3
 
6.0%
n 2
 
4.0%
S 2
 
4.0%
M 2
 
4.0%
t 2
 
4.0%
Other values (4) 4
8.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 407
61.2%
ASCII 258
38.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 115
44.6%
1 20
 
7.8%
4 14
 
5.4%
2 12
 
4.7%
3 10
 
3.9%
L 9
 
3.5%
D 8
 
3.1%
P 7
 
2.7%
7
 
2.7%
O 6
 
2.3%
Other values (20) 50
19.4%
Hangul
ValueCountFrequency (%)
16
 
3.9%
15
 
3.7%
13
 
3.2%
12
 
2.9%
10
 
2.5%
9
 
2.2%
9
 
2.2%
9
 
2.2%
9
 
2.2%
9
 
2.2%
Other values (105) 296
72.7%

CAS번호
Text

UNIQUE 

Distinct: 100
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 14
Median length: 10
Mean length: 10.12
Min length: 9

Characters and Unicode

Total characters: 1012
Distinct characters: 13
Distinct categories: 4
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 100
Unique (%): 100.0%

Sample

1st row: (12640-89-0)
2nd row: (13977-56-5)
3rd row: (20596-91-2)
4th row: (29095-38-3)
5th row: (68909-34-2)
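
The sample values above are CAS Registry Numbers wrapped in parentheses. A CAS number ends in a check digit: the remaining digits are weighted 1, 2, 3, ... from right to left, and the sum modulo 10 must equal that last digit. The sketch below shows how such values could be validated during cleaning; `is_valid_cas` and `CAS_PATTERN` are hypothetical names, and accepting optional parentheses is an assumption based on these samples.

```python
import re

# Two to seven digits, two digits, one check digit; parentheses are optional.
CAS_PATTERN = re.compile(r"^\(?(\d{2,7})-(\d{2})-(\d)\)?$")

def is_valid_cas(raw):
    """Return True if `raw` looks like a CAS number with a correct check digit."""
    match = CAS_PATTERN.match(raw.strip())
    if not match:
        return False
    body = match.group(1) + match.group(2)
    check_digit = int(match.group(3))
    # Weight the digits 1, 2, 3, ... from right to left and sum them.
    total = sum(weight * int(digit)
                for weight, digit in enumerate(reversed(body), start=1))
    return total % 10 == check_digit

print(is_valid_cas("(12640-89-0)"))  # prints True
print(is_valid_cas("(12640-89-1)"))  # prints False
```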
ValueCountFrequency (%)
12640-89-0 1
 
1.0%
52-90-4 1
 
1.0%
99-94-5 1
 
1.0%
96-14-0 1
 
1.0%
95-78-3 1
 
1.0%
95-68-1 1
 
1.0%
92-94-4 1
 
1.0%
92-69-3 1
 
1.0%
92-06-8 1
 
1.0%
90-00-6 1
 
1.0%
Other values (90) 90
90.0%

Most occurring characters

ValueCountFrequency (%)
- 200
19.8%
( 100
9.9%
1 100
9.9%
) 100
9.9%
8 72
 
7.1%
9 70
 
6.9%
0 65
 
6.4%
5 58
 
5.7%
3 57
 
5.6%
6 52
 
5.1%
Other values (3) 138
13.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 612
60.5%
Dash Punctuation 200
 
19.8%
Open Punctuation 100
 
9.9%
Close Punctuation 100
 
9.9%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 100
16.3%
8 72
11.8%
9 70
11.4%
0 65
10.6%
5 58
9.5%
3 57
9.3%
6 52
8.5%
7 50
8.2%
2 46
7.5%
4 42
6.9%
Dash Punctuation
ValueCountFrequency (%)
- 200
100.0%
Open Punctuation
ValueCountFrequency (%)
( 100
100.0%
Close Punctuation
ValueCountFrequency (%)
) 100
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1012
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 200
19.8%
( 100
9.9%
1 100
9.9%
) 100
9.9%
8 72
 
7.1%
9 70
 
6.9%
0 65
 
6.4%
5 58
 
5.7%
3 57
 
5.6%
6 52
 
5.1%
Other values (3) 138
13.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1012
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 200
19.8%
( 100
9.9%
1 100
9.9%
) 100
9.9%
8 72
 
7.1%
9 70
 
6.9%
0 65
 
6.4%
5 58
 
5.7%
3 57
 
5.6%
6 52
 
5.1%
Other values (3) 138
13.6%
Distinct82
Distinct (%)82.0%
Missing0
Missing (%)0.0%
Memory size932.0 B

Length

Max length12
Median length10
Mean length8.34
Min length1

Characters and Unicode

Total characters834
Distinct characters15
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique81 ?
Unique (%)81.0%

Sample

1st row(KE-30928)
2nd row(KE-21143)
3rd row(KE-00062)
4th row(KE-01844)
5th row(KE-35609)
ValueCountFrequency (%)
19
 
19.0%
ke-30933 1
 
1.0%
ke-24700 1
 
1.0%
ke-11201 1
 
1.0%
ke-34733 1
 
1.0%
ke-02871 1
 
1.0%
ke-34732 1
 
1.0%
ke-14028 1
 
1.0%
ke-11207 1
 
1.0%
ke-35439 1
 
1.0%
Other values (72) 72
72.0%

Most occurring characters

ValueCountFrequency (%)
- 104
12.5%
( 80
9.6%
) 80
9.6%
K 78
9.4%
E 78
9.4%
2 73
8.8%
0 55
 
6.6%
1 46
 
5.5%
3 46
 
5.5%
4 38
 
4.6%
Other values (5) 156
18.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 414
49.6%
Uppercase Letter 156
 
18.7%
Dash Punctuation 104
 
12.5%
Open Punctuation 80
 
9.6%
Close Punctuation 80
 
9.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 73
17.6%
0 55
13.3%
1 46
11.1%
3 46
11.1%
4 38
9.2%
8 37
8.9%
6 34
8.2%
9 30
7.2%
7 29
 
7.0%
5 26
 
6.3%
Uppercase Letter
ValueCountFrequency (%)
K 78
50.0%
E 78
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 104
100.0%
Open Punctuation
ValueCountFrequency (%)
( 80
100.0%
Close Punctuation
ValueCountFrequency (%)
) 80
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 678
81.3%
Latin 156
 
18.7%

Most frequent character per script

Common
ValueCountFrequency (%)
- 104
15.3%
( 80
11.8%
) 80
11.8%
2 73
10.8%
0 55
8.1%
1 46
6.8%
3 46
6.8%
4 38
 
5.6%
8 37
 
5.5%
6 34
 
5.0%
Other values (3) 85
12.5%
Latin
ValueCountFrequency (%)
K 78
50.0%
E 78
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 834
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 104
12.5%
( 80
9.6%
) 80
9.6%
K 78
9.4%
E 78
9.4%
2 73
8.8%
0 55
 
6.6%
1 46
 
5.5%
3 46
 
5.5%
4 38
 
4.6%
Other values (5) 156
18.7%

물질분류
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 14
Distinct (%): 14.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B
-
83 
유독물질: 97-1-275
 
3
유독물질: 97-1-268
 
3
유독물질: 97-1-134
 
1
유독물질: 97-1-176
 
1
Other values (9)

Length

Max length: 28
Median length: 1
Mean length: 3.91
Min length: 1

Unique

Unique: 11
Unique (%): 11.0%
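
"Distinct" and "Unique" measure different things here: distinct is the number of different values, while unique is the number of values that occur exactly once. The sketch below rebuilds a column with the same value counts as 물질분류 to show the difference. The labels `v0` to `v10` are hypothetical stand-ins for the eleven values that appear once.

```python
from collections import Counter

# Counts follow this variable's Common Values table; "v0".."v10" are
# placeholder labels for the eleven values that occur only once.
column = ["-"] * 83 + ["유독물질: 97-1-275"] * 3 + ["유독물질: 97-1-268"] * 3
column += [f"v{i}" for i in range(11)]

value_counts = Counter(column)
distinct = len(value_counts)                                # different values
unique = sum(1 for c in value_counts.values() if c == 1)    # values seen once
print(distinct, unique)  # prints "14 11"
```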

Sample

1st row유독물질: 97-1-134
2nd row-
3rd row-
4th row유독물질: 97-1-176
5th row-

Common Values

Value  Count  Frequency (%)
-  83  83.0%
유독물질: 97-1-275  3  3.0%
유독물질: 97-1-268  3  3.0%
유독물질: 97-1-134  1  1.0%
유독물질: 97-1-176  1  1.0%
유독물질: 2020-1-1001(신규화학물질)  1  1.0%
유독물질: 2020-1-974(신규화학물질)  1  1.0%
유독물질: 2020-1-973(신규화학물질)  1  1.0%
유독물질: 2020-1-981(신규화학물질)  1  1.0%
유독물질: 2020-1-970(신규화학물질  1  1.0%
Other values (4)  4  4.0%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
83
69.7%
유독물질 17
 
14.3%
97-1-275 3
 
2.5%
97-1-268 3
 
2.5%
2020-1-970(신규화학물질 1
 
0.8%
금지물질 1
 
0.8%
97-1-260 1
 
0.8%
2021-1-1026 1
 
0.8%
2021-1-1024 1
 
0.8%
97-1-413 1
 
0.8%
Other values (7) 7
 
5.9%

함량정보_유독물질
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct15
Distinct (%)15.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
-
83 
유독물질: 크실렌 및 이를 85% 이상 함유한 혼합물
 
3
유독물질: 크레졸 및 이를 5% 이상 함유한 혼합물
 
2
유독물질: 셀레늄[Selenium; 7782-49-2] 또는 그 화합물과 셀렌화합물을 1% 이상 함유한 혼합물. 다만, 카드뮴 설포셀레나이드 오렌지[Cadmium sulfoselenide orange; 12656-57-4]의 경우는 이를 25% 미만 함유한 것은 제외
 
1
유독물질: 무기 안티몬 화합물 및 이를 1% 이상 함유한 혼합물. 다만, 산화 안티몬(Antimony(Ⅴ) pentoxide, Antimony(Ⅳ) tetroxide), 황화안티몬(Antimony(Ⅴ) pentasulfide, Antimony(Ⅲ) trisulfide), 안티몬산(HSbO3) 염류(Antimonic(Ⅴ) acid, salts), 피그먼트 브라운 24(C.I. Pigment Brown 24), 피그먼트 옐로우 53 (C.I. Pigment Yellow 53) 및 이를 함유한 혼합물과 산화안티몬(Antimony(Ⅲ) trioxide)을 함유한 혼합물은 제외
 
1
Other values (10)
10 

Length

Max length321
Median length1
Mean length15.28
Min length1

Unique

Unique12 ?
Unique (%)12.0%

Sample

1st row유독물질: 셀레늄[Selenium; 7782-49-2] 또는 그 화합물과 셀렌화합물을 1% 이상 함유한 혼합물. 다만, 카드뮴 설포셀레나이드 오렌지[Cadmium sulfoselenide orange; 12656-57-4]의 경우는 이를 25% 미만 함유한 것은 제외
2nd row-
3rd row-
4th row유독물질: 무기 안티몬 화합물 및 이를 1% 이상 함유한 혼합물. 다만, 산화 안티몬(Antimony(Ⅴ) pentoxide, Antimony(Ⅳ) tetroxide), 황화안티몬(Antimony(Ⅴ) pentasulfide, Antimony(Ⅲ) trisulfide), 안티몬산(HSbO3) 염류(Antimonic(Ⅴ) acid, salts), 피그먼트 브라운 24(C.I. Pigment Brown 24), 피그먼트 옐로우 53 (C.I. Pigment Yellow 53) 및 이를 함유한 혼합물과 산화안티몬(Antimony(Ⅲ) trioxide)을 함유한 혼합물은 제외
5th row-

Common Values

ValueCountFrequency (%)
- 83
83.0%
유독물질: 크실렌 및 이를 85% 이상 함유한 혼합물 3
 
3.0%
유독물질: 크레졸 및 이를 5% 이상 함유한 혼합물 2
 
2.0%
유독물질: 셀레늄[Selenium; 7782-49-2] 또는 그 화합물과 셀렌화합물을 1% 이상 함유한 혼합물. 다만, 카드뮴 설포셀레나이드 오렌지[Cadmium sulfoselenide orange; 12656-57-4]의 경우는 이를 25% 미만 함유한 것은 제외 1
 
1.0%
유독물질: 무기 안티몬 화합물 및 이를 1% 이상 함유한 혼합물. 다만, 산화 안티몬(Antimony(Ⅴ) pentoxide, Antimony(Ⅳ) tetroxide), 황화안티몬(Antimony(Ⅴ) pentasulfide, Antimony(Ⅲ) trisulfide), 안티몬산(HSbO3) 염류(Antimonic(Ⅴ) acid, salts), 피그먼트 브라운 24(C.I. Pigment Brown 24), 피그먼트 옐로우 53 (C.I. Pigment Yellow 53) 및 이를 함유한 혼합물과 산화안티몬(Antimony(Ⅲ) trioxide)을 함유한 혼합물은 제외 1
 
1.0%
유독물질: 4-tert-부틸피리딘[4-tert-Butylpyridine; 3978-81-2] 및 이를 25% 이상 함유한 혼합물 1
 
1.0%
유독물질: 5,9-디메틸-4-데세날[5,9-Dimethyl-4-decenal; 689-65-6] 및 이를 25% 이상 함유한 혼합물 1
 
1.0%
유독물질: (4R)-4-(1-메틸에틸)-1-사이클로헥센-1-프로판알[(4R)-4-(1-Methylethyl)-1-cyclohexene-1-propanal; 1378867-81-2] 및 이를 25% 이상 함유한 혼합물 1
 
1.0%
유독물질: 3-[[4-(페닐아미노)페닐]아조]벤젠술폰산 N-[4-[비스[4-(디메틸아미노)페닐]메틸렌]-2,5-시클로헥사디엔-1-일리딘]-N-메틸메탄아미늄 (1:1)[N-[4-[Bis[4-(dimethylamino)phenyl]methylene]-2,5-cyclohexadien-1-ylidene]-N-methylmethanaminium, 3-[[4-(phenylamino)phenyl]azo]benzenesulfonate (1:1); 65113-55-5] 및 이를 1% 이상 함유한 혼합물 1
 
1.0%
유독물질: (클로로메틸)에테닐벤젠[(Chloromethyl)ethenylbenzene; 30030-25-2] 및 이를 25% 이상 함유한 혼합물 1
 
1.0%
Other values (5) 5
 
5.0%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
83
29.0%
함유한 20
 
7.0%
이를 18
 
6.3%
17
 
5.9%
이상 17
 
5.9%
혼합물 17
 
5.9%
유독물질 17
 
5.9%
25 7
 
2.4%
크실렌 3
 
1.0%
85 3
 
1.0%
Other values (73) 84
29.4%

함량정보_금지물질
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
-
100 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row-
2nd row-
3rd row-
4th row-
5th row-

Common Values

ValueCountFrequency (%)
- 100
100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
100
100.0%

함량정보_제한물질
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
-
99 
금지물질: 켑타폴 및 이를 0.1% 이상 함유한 혼합물
 
1

Length

Max length30
Median length1
Mean length1.29
Min length1

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row-
2nd row-
3rd row-
4th row-
5th row-

Common Values

ValueCountFrequency (%)
- 99
99.0%
금지물질: 켑타폴 및 이를 0.1% 이상 함유한 혼합물 1
 
1.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
99
92.5%
금지물질 1
 
0.9%
켑타폴 1
 
0.9%
1
 
0.9%
이를 1
 
0.9%
0.1 1
 
0.9%
이상 1
 
0.9%
함유한 1
 
0.9%
혼합물 1
 
0.9%

함량정보_사고대비물질
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
-
99 
사고대비물질: m-크레졸 및 이를 5% 이상 함유한 혼합물
 
1

Length

Max length32
Median length1
Mean length1.31
Min length1

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row-
2nd row-
3rd row-
4th row-
5th row-

Common Values

ValueCountFrequency (%)
- 99
99.0%
사고대비물질: m-크레졸 및 이를 5% 이상 함유한 혼합물 1
 
1.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
99
92.5%
사고대비물질 1
 
0.9%
m-크레졸 1
 
0.9%
1
 
0.9%
이를 1
 
0.9%
5 1
 
0.9%
이상 1
 
0.9%
함유한 1
 
0.9%
혼합물 1
 
0.9%

유해성분류
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct14
Distinct (%)14.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
-
83 
인화성 액체 급성독성-경피 급성독성-흡입 피부 부식성/자극성 심한 눈 손상/눈 자극성 특정 표적장기 독성-1회 노출 특정 표적장기 독성-반복 노출
 
3
급성독성-경구 급성독성-경피 피부 부식성/자극성
 
3
급성독성-경구 급성독성-흡입 특정 표적장기 독성-반복 노출 수생환경 유해성 급성 수생환경 유해성 만성
 
1
급성독성-경구 급성독성-흡입 피부 부식성/자극성 수생환경 유해성 만성
 
1
Other values (9)

Length

Max length81
Median length1
Mean length8.46
Min length1

Unique

Unique11 ?
Unique (%)11.0%

Sample

1st row급성독성-경구 급성독성-흡입 특정 표적장기 독성-반복 노출 수생환경 유해성 급성 수생환경 유해성 만성
2nd row-
3rd row-
4th row급성독성-경구 급성독성-흡입 피부 부식성/자극성 수생환경 유해성 만성
5th row-

Common Values

ValueCountFrequency (%)
- 83
83.0%
인화성 액체 급성독성-경피 급성독성-흡입 피부 부식성/자극성 심한 눈 손상/눈 자극성 특정 표적장기 독성-1회 노출 특정 표적장기 독성-반복 노출 3
 
3.0%
급성독성-경구 급성독성-경피 피부 부식성/자극성 3
 
3.0%
급성독성-경구 급성독성-흡입 특정 표적장기 독성-반복 노출 수생환경 유해성 급성 수생환경 유해성 만성 1
 
1.0%
급성독성-경구 급성독성-흡입 피부 부식성/자극성 수생환경 유해성 만성 1
 
1.0%
급성독성-경구 1
 
1.0%
급성독성-경구 급성독성-흡입 피부 부식성/자극성 피부 과민성 수생환경 유해성 급성 1
 
1.0%
피부 부식성/자극성 피부 과민성 수생환경 유해성 급성 수생환경 유해성 만성 1
 
1.0%
심한 눈 손상/눈 자극성 피부 과민성 수생환경 유해성 급성 수생환경 유해성 만성 1
 
1.0%
급성독성-경구 피부 과민성 수생환경 유해성 급성 수생환경 유해성 만성 1
 
1.0%
Other values (4) 4
 
4.0%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
83
33.7%
피부 17
 
6.9%
수생환경 16
 
6.5%
유해성 10
 
4.1%
급성독성-경구 10
 
4.1%
부식성/자극성 9
 
3.7%
특정 8
 
3.3%
과민성 8
 
3.3%
노출 8
 
3.3%
표적장기 8
 
3.3%
Other values (15) 69
28.0%

GHS주소1
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct5
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
-
83 
http://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS07.gif
 
6
http://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS05.gif
 
5
http://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS06.gif
 
3
http://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS02.gif
 
3

Length

Max length68
Median length1
Mean length12.39
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowhttp://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS06.gif
2nd row-
3rd row-
4th rowhttp://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS05.gif
5th row-

Common Values

ValueCountFrequency (%)
- 83
83.0%
http://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS07.gif 6
 
6.0%
http://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS05.gif 5
 
5.0%
http://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS06.gif 3
 
3.0%
http://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS02.gif 3
 
3.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
83
83.0%
http://msds.kosha.or.kr/msdsinfo/msds/web/shared/image/ghs/ghs07.gif 6
 
6.0%
http://msds.kosha.or.kr/msdsinfo/msds/web/shared/image/ghs/ghs05.gif 5
 
5.0%
http://msds.kosha.or.kr/msdsinfo/msds/web/shared/image/ghs/ghs06.gif 3
 
3.0%
http://msds.kosha.or.kr/msdsinfo/msds/web/shared/image/ghs/ghs02.gif 3
 
3.0%

GHS주소2
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct5
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
-
84 
http://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS07.gif
 
5
http://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS09.gif
 
5
http://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS08.gif
 
3
http://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS06.gif
 
3

Length

Max length68
Median length1
Mean length11.72
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowhttp://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS08.gif
2nd row-
3rd row-
4th rowhttp://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS07.gif
5th row-

Common Values

ValueCountFrequency (%)
- 84
84.0%
http://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS07.gif 5
 
5.0%
http://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS09.gif 5
 
5.0%
http://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS08.gif 3
 
3.0%
http://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS06.gif 3
 
3.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
84
84.0%
http://msds.kosha.or.kr/msdsinfo/msds/web/shared/image/ghs/ghs07.gif 5
 
5.0%
http://msds.kosha.or.kr/msdsinfo/msds/web/shared/image/ghs/ghs09.gif 5
 
5.0%
http://msds.kosha.or.kr/msdsinfo/msds/web/shared/image/ghs/ghs08.gif 3
 
3.0%
http://msds.kosha.or.kr/msdsinfo/msds/web/shared/image/ghs/ghs06.gif 3
 
3.0%

GHS주소3
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
-
92 
http://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS09.gif
 
5
http://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS08.gif
 
3

Length

Max length68
Median length1
Mean length6.36
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowhttp://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS09.gif
2nd row-
3rd row-
4th rowhttp://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS09.gif
5th row-

Common Values

ValueCountFrequency (%)
- 92
92.0%
http://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS09.gif 5
 
5.0%
http://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS08.gif 3
 
3.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
92
92.0%
http://msds.kosha.or.kr/msdsinfo/msds/web/shared/image/ghs/ghs09.gif 5
 
5.0%
http://msds.kosha.or.kr/msdsinfo/msds/web/shared/image/ghs/ghs08.gif 3
 
3.0%

GHS주소4
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
-
100 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row-
2nd row-
3rd row-
4th row-
5th row-

Common Values

ValueCountFrequency (%)
- 100
100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
100
100.0%

GHS주소5
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
-
100 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row-
2nd row-
3rd row-
4th row-
5th row-

Common Values

ValueCountFrequency (%)
- 100
100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
100
100.0%

GHS주소6
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
-
100 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row-
2nd row-
3rd row-
4th row-
5th row-

Common Values

ValueCountFrequency (%)
- 100
100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
100
100.0%

모바일MSDS주소
Text

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size932.0 B

Length

Max length68
Median length68
Mean length68
Min length68

Characters and Unicode

Total characters6800
Distinct characters39
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique100 ?
Unique (%)100.0%

Sample

1st rowhttps://msds.kosha.or.kr/MSDSInfo/m/msds/qrDirect.mdo?CHEM_ID=041603
2nd rowhttps://msds.kosha.or.kr/MSDSInfo/m/msds/qrDirect.mdo?CHEM_ID=034374
3rd rowhttps://msds.kosha.or.kr/MSDSInfo/m/msds/qrDirect.mdo?CHEM_ID=018642
4th rowhttps://msds.kosha.or.kr/MSDSInfo/m/msds/qrDirect.mdo?CHEM_ID=019815
5th rowhttps://msds.kosha.or.kr/MSDSInfo/m/msds/qrDirect.mdo?CHEM_ID=044937
ValueCountFrequency (%)
https://msds.kosha.or.kr/msdsinfo/m/msds/qrdirect.mdo?chem_id=041603 1
 
1.0%
https://msds.kosha.or.kr/msdsinfo/m/msds/qrdirect.mdo?chem_id=010602 1
 
1.0%
https://msds.kosha.or.kr/msdsinfo/m/msds/qrdirect.mdo?chem_id=018598 1
 
1.0%
https://msds.kosha.or.kr/msdsinfo/m/msds/qrdirect.mdo?chem_id=018318 1
 
1.0%
https://msds.kosha.or.kr/msdsinfo/m/msds/qrdirect.mdo?chem_id=018282 1
 
1.0%
https://msds.kosha.or.kr/msdsinfo/m/msds/qrdirect.mdo?chem_id=018270 1
 
1.0%
https://msds.kosha.or.kr/msdsinfo/m/msds/qrdirect.mdo?chem_id=018041 1
 
1.0%
https://msds.kosha.or.kr/msdsinfo/m/msds/qrdirect.mdo?chem_id=018009 1
 
1.0%
https://msds.kosha.or.kr/msdsinfo/m/msds/qrdirect.mdo?chem_id=017951 1
 
1.0%
https://msds.kosha.or.kr/msdsinfo/m/msds/qrdirect.mdo?chem_id=017445 1
 
1.0%
Other values (90) 90
90.0%

Most occurring characters

ValueCountFrequency (%)
s 600
 
8.8%
/ 600
 
8.8%
. 400
 
5.9%
m 400
 
5.9%
r 400
 
5.9%
o 400
 
5.9%
t 300
 
4.4%
D 300
 
4.4%
d 300
 
4.4%
S 200
 
2.9%
Other values (29) 2900
42.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 3600
52.9%
Other Punctuation 1200
 
17.6%
Uppercase Letter 1200
 
17.6%
Decimal Number 600
 
8.8%
Connector Punctuation 100
 
1.5%
Math Symbol 100
 
1.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
s 600
16.7%
m 400
11.1%
r 400
11.1%
o 400
11.1%
t 300
8.3%
d 300
8.3%
h 200
 
5.6%
k 200
 
5.6%
c 100
 
2.8%
q 100
 
2.8%
Other values (6) 600
16.7%
Decimal Number
ValueCountFrequency (%)
0 190
31.7%
1 77
12.8%
2 57
 
9.5%
3 51
 
8.5%
4 49
 
8.2%
8 43
 
7.2%
6 37
 
6.2%
7 36
 
6.0%
5 35
 
5.8%
9 25
 
4.2%
Uppercase Letter
ValueCountFrequency (%)
D 300
25.0%
S 200
16.7%
M 200
16.7%
I 200
16.7%
H 100
 
8.3%
C 100
 
8.3%
E 100
 
8.3%
Other Punctuation
ValueCountFrequency (%)
/ 600
50.0%
. 400
33.3%
? 100
 
8.3%
: 100
 
8.3%
Connector Punctuation
ValueCountFrequency (%)
_ 100
100.0%
Math Symbol
ValueCountFrequency (%)
= 100
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 4800
70.6%
Common 2000
29.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
s 600
 
12.5%
m 400
 
8.3%
r 400
 
8.3%
o 400
 
8.3%
t 300
 
6.2%
D 300
 
6.2%
d 300
 
6.2%
S 200
 
4.2%
M 200
 
4.2%
h 200
 
4.2%
Other values (13) 1500
31.2%
Common
ValueCountFrequency (%)
/ 600
30.0%
. 400
20.0%
0 190
 
9.5%
_ 100
 
5.0%
= 100
 
5.0%
? 100
 
5.0%
: 100
 
5.0%
1 77
 
3.9%
2 57
 
2.9%
3 51
 
2.5%
Other values (6) 225
 
11.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 6800
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
s 600
 
8.8%
/ 600
 
8.8%
. 400
 
5.9%
m 400
 
5.9%
r 400
 
5.9%
o 400
 
5.9%
t 300
 
4.4%
D 300
 
4.4%
d 300
 
4.4%
S 200
 
2.9%
Other values (29) 2900
42.6%

MSDS주소
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
-
99 
http://210.206.33.133:8080/upFile/108-39-4.pdf
 
1

Length

Max length46
Median length1
Mean length1.45
Min length1
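
The length statistics follow directly from the two distinct values listed above: 99 rows hold the one-character placeholder `-` and one row holds a 46-character URL. A minimal sketch that recomputes them with the standard library (the variable names are illustrative):

```python
from statistics import mean, median

# The two distinct values reported for MSDS주소 and their counts.
values = ["-"] * 99 + ["http://210.206.33.133:8080/upFile/108-39-4.pdf"]
lengths = [len(v) for v in values]

summary = {
    "max": max(lengths),        # length of the single URL
    "median": median(lengths),  # dominated by the placeholder
    "mean": mean(lengths),      # (99 * 1 + 46) / 100
    "min": min(lengths),
}
print(summary)
```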

Unique

Unique1
Unique (%)1.0%

Sample

1st row-
2nd row-
3rd row-
4th row-
5th row-

Common Values

ValueCountFrequency (%)
- 99
99.0%
http://210.206.33.133:8080/upFile/108-39-4.pdf 1
 
1.0%
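
The IMBALANCE flag on this variable is reported in the Alerts section as 91.9%. That figure appears to be consistent with one minus the normalised Shannon entropy of the value counts. Whether ydata-profiling computes the score exactly this way is an assumption; the function name `imbalance_score` is hypothetical and the sketch only shows that the formula matches the reported numbers.

```python
from math import log2

def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (base 2)
    of the class frequencies and k is the number of distinct classes."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # convention chosen for this sketch only
    entropy = -sum((c / total) * log2(c / total) for c in counts if c > 0)
    return 1 - entropy / log2(k)

msds_counts = [99, 1]            # MSDS주소: "-" x 99, one URL
hazard_counts = [68] + [1] * 32  # 유해위험문구주소: "-" x 68, 32 unique URLs

print(round(100 * imbalance_score(msds_counts), 1))
print(round(100 * imbalance_score(hazard_counts), 1))
```

With these counts the sketch gives 91.9 for MSDS주소 and 50.4 for 유해위험문구주소, matching the percentages in the Alerts section.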

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
99
99.0%
http://210.206.33.133:8080/upfile/108-39-4.pdf 1
 
1.0%

유해위험문구주소
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct33
Distinct (%)33.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
-
68 
https://ecolife.me.go.kr/ecolife/chmstryMttr/chmstryMttrShow/689-65-6
 
1
https://ecolife.me.go.kr/ecolife/chmstryMttr/chmstryMttrShow/1378867-81-2
 
1
https://ecolife.me.go.kr/ecolife/chmstryMttr/chmstryMttrShow/65113-55-5
 
1
https://ecolife.me.go.kr/ecolife/chmstryMttr/chmstryMttrShow/30030-25-2
 
1
Other values (28)
28 

Length

Max length73
Median length1
Mean length22.85
Min length1

Unique

Unique32
Unique (%)32.0%

Sample

1st row-
2nd row-
3rd row-
4th row-
5th row-

Common Values

ValueCountFrequency (%)
- 68
68.0%
https://ecolife.me.go.kr/ecolife/chmstryMttr/chmstryMttrShow/689-65-6 1
 
1.0%
https://ecolife.me.go.kr/ecolife/chmstryMttr/chmstryMttrShow/1378867-81-2 1
 
1.0%
https://ecolife.me.go.kr/ecolife/chmstryMttr/chmstryMttrShow/65113-55-5 1
 
1.0%
https://ecolife.me.go.kr/ecolife/chmstryMttr/chmstryMttrShow/30030-25-2 1
 
1.0%
https://ecolife.me.go.kr/ecolife/chmstryMttr/chmstryMttrShow/57-48-7 1
 
1.0%
https://ecolife.me.go.kr/ecolife/chmstryMttr/chmstryMttrShow/124-18-5 1
 
1.0%
https://ecolife.me.go.kr/ecolife/chmstryMttr/chmstryMttrShow/130885-09-5 1
 
1.0%
https://ecolife.me.go.kr/ecolife/chmstryMttr/chmstryMttrShow/50-70-4 1
 
1.0%
https://ecolife.me.go.kr/ecolife/chmstryMttr/chmstryMttrShow/56-41-7 1
 
1.0%
Other values (23) 23
 
23.0%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
68
68.0%
https://ecolife.me.go.kr/ecolife/chmstrymttr/chmstrymttrshow/111-87-5 1
 
1.0%
https://ecolife.me.go.kr/ecolife/chmstrymttr/chmstrymttrshow/464-45-9 1
 
1.0%
https://ecolife.me.go.kr/ecolife/chmstrymttr/chmstrymttrshow/120-71-8 1
 
1.0%
https://ecolife.me.go.kr/ecolife/chmstrymttr/chmstrymttrshow/119-93-7 1
 
1.0%
https://ecolife.me.go.kr/ecolife/chmstrymttr/chmstrymttrshow/107-83-5 1
 
1.0%
https://ecolife.me.go.kr/ecolife/chmstrymttr/chmstrymttrshow/104-94-9 1
 
1.0%
https://ecolife.me.go.kr/ecolife/chmstrymttr/chmstrymttrshow/58-86-6 1
 
1.0%
https://ecolife.me.go.kr/ecolife/chmstrymttr/chmstrymttrshow/10380-28-6 1
 
1.0%
https://ecolife.me.go.kr/ecolife/chmstrymttr/chmstrymttrshow/5315-79-7 1
 
1.0%
Other values (23) 23
 
23.0%

Interactions


Correlations

[Figure: correlation heatmap]
Variable | 연번 | 영문물질명 | 국문물질명 | CAS번호 | KE번호 | 물질분류 | 함량정보_유독물질 | 함량정보_제한물질 | 함량정보_사고대비물질 | 유해성분류 | GHS주소1 | GHS주소2 | GHS주소3 | 모바일MSDS주소 | MSDS주소 | 유해위험문구주소
연번 | 1.000 | 0.678 | 1.000 | 1.000 | 0.634 | 0.513 | 0.499 | 0.041 | 0.041 | 0.513 | 0.659 | 0.623 | 0.549 | 1.000 | 0.041 | 0.233
영문물질명 | 0.678 | 1.000 | 0.972 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.948
국문물질명 | 1.000 | 0.972 | 1.000 | 1.000 | 0.990 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
CAS번호 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
KE번호 | 0.634 | 1.000 | 0.990 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.174 | 1.000 | 1.000 | 0.000
물질분류 | 0.513 | 1.000 | 1.000 | 1.000 | 0.000 | 1.000 | 1.000 | 1.000 | 0.602 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.602 | 0.879
함량정보_유독물질 | 0.499 | 1.000 | 1.000 | 1.000 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.903
함량정보_제한물질 | 0.041 | 1.000 | 1.000 | 1.000 | 0.000 | 1.000 | 1.000 | 1.000 | 0.000 | 1.000 | 0.287 | 0.449 | 0.257 | 1.000 | 0.000 | 0.000
함량정보_사고대비물질 | 0.041 | 1.000 | 1.000 | 1.000 | 1.000 | 0.602 | 1.000 | 0.000 | 1.000 | 0.602 | 0.326 | 0.449 | 0.000 | 1.000 | 0.693 | 1.000
유해성분류 | 0.513 | 1.000 | 1.000 | 1.000 | 0.000 | 1.000 | 1.000 | 1.000 | 0.602 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.602 | 0.879
GHS주소1 | 0.659 | 1.000 | 1.000 | 1.000 | 0.000 | 1.000 | 1.000 | 0.287 | 0.326 | 1.000 | 1.000 | 0.983 | 0.813 | 1.000 | 0.326 | 0.760
GHS주소2 | 0.623 | 1.000 | 1.000 | 1.000 | 0.000 | 1.000 | 1.000 | 0.449 | 0.449 | 1.000 | 0.983 | 1.000 | 0.797 | 1.000 | 0.449 | 0.801
GHS주소3 | 0.549 | 1.000 | 1.000 | 1.000 | 0.174 | 1.000 | 1.000 | 0.257 | 0.000 | 1.000 | 0.813 | 0.797 | 1.000 | 1.000 | 0.000 | 0.672
모바일MSDS주소 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
MSDS주소 | 0.041 | 1.000 | 1.000 | 1.000 | 1.000 | 0.602 | 1.000 | 0.000 | 0.693 | 0.602 | 0.326 | 0.449 | 0.000 | 1.000 | 1.000 | 1.000
유해위험문구주소 | 0.233 | 0.948 | 1.000 | 1.000 | 0.000 | 0.879 | 0.903 | 0.000 | 1.000 | 0.879 | 0.760 | 0.801 | 0.672 | 1.000 | 1.000 | 1.000
[Figure: correlation heatmap]
Variable | 물질분류 | 함량정보_사고대비물질 | 유해성분류 | MSDS주소 | GHS주소1 | GHS주소2 | 유해위험문구주소 | 함량정보_제한물질 | GHS주소3 | 함량정보_유독물질
물질분류 | 1.000 | 0.444 | 1.000 | 0.444 | 0.951 | 0.951 | 0.436 | 0.937 | 0.942 | 0.994
함량정보_사고대비물질 | 0.444 | 1.000 | 0.444 | 0.487 | 0.391 | 0.538 | 0.827 | 0.000 | 0.000 | 0.931
유해성분류 | 1.000 | 0.444 | 1.000 | 0.444 | 0.951 | 0.951 | 0.436 | 0.937 | 0.942 | 0.994
MSDS주소 | 0.444 | 0.487 | 0.444 | 1.000 | 0.391 | 0.538 | 0.827 | 0.000 | 0.000 | 0.931
GHS주소1 | 0.951 | 0.391 | 0.951 | 0.391 | 1.000 | 0.811 | 0.404 | 0.345 | 0.830 | 0.946
GHS주소2 | 0.951 | 0.538 | 0.951 | 0.538 | 0.811 | 1.000 | 0.446 | 0.538 | 0.808 | 0.946
유해위험문구주소 | 0.436 | 0.827 | 0.436 | 0.827 | 0.404 | 0.446 | 1.000 | 0.000 | 0.336 | 0.470
함량정보_제한물질 | 0.937 | 0.000 | 0.937 | 0.000 | 0.345 | 0.538 | 0.000 | 1.000 | 0.416 | 0.931
GHS주소3 | 0.942 | 0.000 | 0.942 | 0.000 | 0.830 | 0.808 | 0.336 | 0.416 | 1.000 | 0.936
함량정보_유독물질 | 0.994 | 0.931 | 0.994 | 0.931 | 0.946 | 0.946 | 0.470 | 0.931 | 0.936 | 1.000
[Figure: correlation heatmap]
Variable | 연번 | 물질분류 | 함량정보_유독물질 | 함량정보_제한물질 | 함량정보_사고대비물질 | 유해성분류 | GHS주소1 | GHS주소2 | GHS주소3 | MSDS주소 | 유해위험문구주소
연번 | 1.000 | 0.227 | 0.201 | 0.000 | 0.000 | 0.227 | 0.322 | 0.297 | 0.378 | 0.000 | 0.042
물질분류 | 0.227 | 1.000 | 0.994 | 0.937 | 0.444 | 1.000 | 0.951 | 0.951 | 0.942 | 0.444 | 0.436
함량정보_유독물질 | 0.201 | 0.994 | 1.000 | 0.931 | 0.931 | 0.994 | 0.946 | 0.946 | 0.936 | 0.931 | 0.470
함량정보_제한물질 | 0.000 | 0.937 | 0.931 | 1.000 | 0.000 | 0.937 | 0.345 | 0.538 | 0.416 | 0.000 | 0.000
함량정보_사고대비물질 | 0.000 | 0.444 | 0.931 | 0.000 | 1.000 | 0.444 | 0.391 | 0.538 | 0.000 | 0.487 | 0.827
유해성분류 | 0.227 | 1.000 | 0.994 | 0.937 | 0.444 | 1.000 | 0.951 | 0.951 | 0.942 | 0.444 | 0.436
GHS주소1 | 0.322 | 0.951 | 0.946 | 0.345 | 0.391 | 0.951 | 1.000 | 0.811 | 0.830 | 0.391 | 0.404
GHS주소2 | 0.297 | 0.951 | 0.946 | 0.538 | 0.538 | 0.951 | 0.811 | 1.000 | 0.808 | 0.538 | 0.446
GHS주소3 | 0.378 | 0.942 | 0.936 | 0.416 | 0.000 | 0.942 | 0.830 | 0.808 | 1.000 | 0.000 | 0.336
MSDS주소 | 0.000 | 0.444 | 0.931 | 0.000 | 0.487 | 0.444 | 0.391 | 0.538 | 0.000 | 1.000 | 0.827
유해위험문구주소 | 0.042 | 0.436 | 0.470 | 0.000 | 0.827 | 0.436 | 0.404 | 0.446 | 0.336 | 0.827 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번영문물질명국문물질명CAS번호KE번호물질분류함량정보_유독물질함량정보_금지물질함량정보_제한물질함량정보_사고대비물질유해성분류GHS주소1GHS주소2GHS주소3GHS주소4GHS주소5GHS주소6모바일MSDS주소MSDS주소유해위험문구주소
01Selenium oxide셀레늄화합물질(12640-89-0)(KE-30928)유독물질: 97-1-134유독물질: 셀레늄[Selenium; 7782-49-2] 또는 그 화합물과 셀렌화합물을 1% 이상 함유한 혼합물. 다만, 카드뮴 설포셀레나이드 오렌지[Cadmium sulfoselenide orange; 12656-57-4]의 경우는 이를 25% 미만 함유한 것은 제외---급성독성-경구 급성독성-흡입 특정 표적장기 독성-반복 노출 수생환경 유해성 급성 수생환경 유해성 만성http://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS06.gifhttp://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS08.gifhttp://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS09.gif---https://msds.kosha.or.kr/MSDSInfo/m/msds/qrDirect.mdo?CHEM_ID=041603--
12Iron vanadium tetraoxide-(13977-56-5)(KE-21143)------------https://msds.kosha.or.kr/MSDSInfo/m/msds/qrDirect.mdo?CHEM_ID=034374--
23Acetic acid uranium(4+) zinc salt-(20596-91-2)(KE-00062)------------https://msds.kosha.or.kr/MSDSInfo/m/msds/qrDirect.mdo?CHEM_ID=018642--
34Antimony, compd. with thallium (1:1)무기안티몬 화합물질(29095-38-3)(KE-01844)유독물질: 97-1-176유독물질: 무기 안티몬 화합물 및 이를 1% 이상 함유한 혼합물. 다만, 산화 안티몬(Antimony(Ⅴ) pentoxide, Antimony(Ⅳ) tetroxide), 황화안티몬(Antimony(Ⅴ) pentasulfide, Antimony(Ⅲ) trisulfide), 안티몬산(HSbO3) 염류(Antimonic(Ⅴ) acid, salts), 피그먼트 브라운 24(C.I. Pigment Brown 24), 피그먼트 옐로우 53 (C.I. Pigment Yellow 53) 및 이를 함유한 혼합물과 산화안티몬(Antimony(Ⅲ) trioxide)을 함유한 혼합물은 제외---급성독성-경구 급성독성-흡입 피부 부식성/자극성 수생환경 유해성 만성http://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS05.gifhttp://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS07.gifhttp://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS09.gif---https://msds.kosha.or.kr/MSDSInfo/m/msds/qrDirect.mdo?CHEM_ID=019815--
45Zirconium, acetate lactate oxo ammonium complexes-(68909-34-2)(KE-35609)------------https://msds.kosha.or.kr/MSDSInfo/m/msds/qrDirect.mdo?CHEM_ID=044937--
56Cobalt lithium manganese nickel oxide (Co0.33LiMn0.33Ni0.33O2)-(346417-97-8)-------------https://msds.kosha.or.kr/MSDSInfo/m/msds/qrDirect.mdo?CHEM_ID=456100--
674-tert-Butylpyridine4-tert-부틸피리딘(3978-81-2)-유독물질: 2020-1-1001(신규화학물질)유독물질: 4-tert-부틸피리딘[4-tert-Butylpyridine; 3978-81-2] 및 이를 25% 이상 함유한 혼합물---급성독성-경구http://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS06.gif-----https://msds.kosha.or.kr/MSDSInfo/m/msds/qrDirect.mdo?CHEM_ID=455687--
785,9-Dimethyl-4-decenal;5,9-디메틸-4-데세날(689-65-6)-유독물질: 2020-1-974(신규화학물질)유독물질: 5,9-디메틸-4-데세날[5,9-Dimethyl-4-decenal; 689-65-6] 및 이를 25% 이상 함유한 혼합물---급성독성-경구 급성독성-흡입 피부 부식성/자극성 피부 과민성 수생환경 유해성 급성http://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS07.gifhttp://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS09.gif----https://msds.kosha.or.kr/MSDSInfo/m/msds/qrDirect.mdo?CHEM_ID=455682-https://ecolife.me.go.kr/ecolife/chmstryMttr/chmstryMttrShow/689-65-6
89(4R)-4-(1-메틸에틸)-1-사이클로헥센-1-프로판알(4R)-4-(1-메틸에틸)-1-사이클로헥센-1-프로판알(1378867-81-2)-유독물질: 2020-1-973(신규화학물질)유독물질: (4R)-4-(1-메틸에틸)-1-사이클로헥센-1-프로판알[(4R)-4-(1-Methylethyl)-1-cyclohexene-1-propanal; 1378867-81-2] 및 이를 25% 이상 함유한 혼합물---피부 부식성/자극성 피부 과민성 수생환경 유해성 급성 수생환경 유해성 만성http://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS07.gifhttp://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS09.gif----https://msds.kosha.or.kr/MSDSInfo/m/msds/qrDirect.mdo?CHEM_ID=455681-https://ecolife.me.go.kr/ecolife/chmstryMttr/chmstryMttrShow/1378867-81-2
910N-[4-[Bis[4-(dimethylamino)phenyl]methylene]-2,5-cyclohexadien-1-ylidene]-N-methylmethanaminium, 3-[[4-(phenylamino)phenyl]azo]benzenesulfonate (1:1)3-[[4-(페닐아미노)페닐]아조]벤젠술폰산 N-[4-[비스[4-(디메틸아미노)페닐]메틸렌]-2,5-시클로헥사디엔-1-일리딘]-N-메틸메탄아미늄 (1:1)(65113-55-5)-유독물질: 2020-1-981(신규화학물질)유독물질: 3-[[4-(페닐아미노)페닐]아조]벤젠술폰산 N-[4-[비스[4-(디메틸아미노)페닐]메틸렌]-2,5-시클로헥사디엔-1-일리딘]-N-메틸메탄아미늄 (1:1)[N-[4-[Bis[4-(dimethylamino)phenyl]methylene]-2,5-cyclohexadien-1-ylidene]-N-methylmethanaminium, 3-[[4-(phenylamino)phenyl]azo]benzenesulfonate (1:1); 65113-55-5] 및 이를 1% 이상 함유한 혼합물---심한 눈 손상/눈 자극성 피부 과민성 수생환경 유해성 급성 수생환경 유해성 만성http://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS05.gifhttp://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS07.gifhttp://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS09.gif---https://msds.kosha.or.kr/MSDSInfo/m/msds/qrDirect.mdo?CHEM_ID=455683-https://ecolife.me.go.kr/ecolife/chmstryMttr/chmstryMttrShow/65113-55-5
연번영문물질명국문물질명CAS번호KE번호물질분류함량정보_유독물질함량정보_금지물질함량정보_제한물질함량정보_사고대비물질유해성분류GHS주소1GHS주소2GHS주소3GHS주소4GHS주소5GHS주소6모바일MSDS주소MSDS주소유해위험문구주소
9091Aniline, 3-methoxy-m-아니시딘(536-90-3)-------------https://msds.kosha.or.kr/MSDSInfo/m/msds/qrDirect.mdo?CHEM_ID=316438--
9192-n-펜틸벤젠(538-68-1)-------------https://msds.kosha.or.kr/MSDSInfo/m/msds/qrDirect.mdo?CHEM_ID=010803--
92934-Methylheptane4-메틸헵탄(589-53-7)(KE-24152)------------https://msds.kosha.or.kr/MSDSInfo/m/msds/qrDirect.mdo?CHEM_ID=011840--
93944-(α,α-Dimethylbenzyl)phenol4-쿠밀페놀(599-64-4)(99-3-1158)------------https://msds.kosha.or.kr/MSDSInfo/m/msds/qrDirect.mdo?CHEM_ID=012088--
94951-Hexadecene1-헥사데센(629-73-2)(KE-18475)------------https://msds.kosha.or.kr/MSDSInfo/m/msds/qrDirect.mdo?CHEM_ID=012899--
95961-Docosanol; Behenyl alcohol1-도코사놀(661-19-8)(KE-12789)------------https://msds.kosha.or.kr/MSDSInfo/m/msds/qrDirect.mdo?CHEM_ID=013560-https://ecolife.me.go.kr/ecolife/chmstryMttr/chmstryMttrShow/661-19-8
9697-부텐-3-인(689-97-4)-------------https://msds.kosha.or.kr/MSDSInfo/m/msds/qrDirect.mdo?CHEM_ID=014576--
9798Dodecan-3-one3-도데칸온(1534-27-6)(KE-12894)------------https://msds.kosha.or.kr/MSDSInfo/m/msds/qrDirect.mdo?CHEM_ID=004540--
9899Octylbenzene1-페닐옥탄(2189-60-8)(KE-26726)------------https://msds.kosha.or.kr/MSDSInfo/m/msds/qrDirect.mdo?CHEM_ID=006000--
99100cis-Captafol시스-켑타폴(2939-80-2)-유독물질: 97-1-260 금지물질: 06-4-33유독물질: 켑타폴 및 이를 0.1% 이상 함유한 혼합물-금지물질: 켑타폴 및 이를 0.1% 이상 함유한 혼합물-피부 과민성 발암성 수생환경 유해성-급성 수생환경 유해성-만성http://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS07.gifhttp://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS08.gifhttp://msds.kosha.or.kr/MSDSInfo/msds/web/shared/image/ghs/GHS09.gif---https://msds.kosha.or.kr/MSDSInfo/m/msds/qrDirect.mdo?CHEM_ID=049413--