Dataset statistics
Number of variables | 17 |
---|---|
Number of observations | 10000 |
Missing cells | 64349 |
Missing cells (%) | 37.9% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.5 MiB |
Average record size in memory | 155.0 B |
Variable types
Text | 4 |
---|---|
Categorical | 8 |
Numeric | 1 |
Unsupported | 4 |
Dataset
Description | 화학물질 사고 및 유해 위험 지표 데이터 입니다. |
---|---|
Author | 충청남도 |
URL | https://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=64&beforeMenuCd=DOM_000000201001001000&publicdatapk=15110061 |
유독물질 is highly imbalanced (76.1%) | Imbalance |
제한물질 is highly imbalanced (79.5%) | Imbalance |
금지물질 is highly imbalanced (79.2%) | Imbalance |
사고대비물질 is highly imbalanced (78.9%) | Imbalance |
중점관리대상물질 is highly imbalanced (76.4%) | Imbalance |
끝점농도 기준 is highly imbalanced (79.8%) | Imbalance |
사고노출위험구분수 is highly imbalanced (70.7%) | Imbalance |
국문 has 8256 (82.6%) missing values | Missing |
끝점농도 has 9504 (95.0%) missing values | Missing |
유해위험코드분류수 has 6589 (65.9%) missing values | Missing |
Unnamed: 13 has 10000 (100.0%) missing values | Missing |
Unnamed: 14 has 10000 (100.0%) missing values | Missing |
Unnamed: 15 has 10000 (100.0%) missing values | Missing |
Unnamed: 16 has 10000 (100.0%) missing values | Missing |
고유(CAS)번호 has unique values | Unique |
Unnamed: 13 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 14 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 15 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 16 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
Analysis started | 2024-01-09 22:31:54.267909 |
---|---|
Analysis finished | 2024-01-09 22:31:56.009135 |
Duration | 1.74 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
고유(CAS)번호
Text
UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 36 |
---|---|
Median length | 11 |
Mean length | 10.7223 |
Min length | 8 |
Characters and Unicode
Total characters | 107223 |
---|---|
Distinct characters | 13 |
Distinct categories | 4 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 10000 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 68002-98-2 |
---|---|
2nd row | 73049-43-1 |
3rd row | 1323-65-5 |
4th row | 18373-31-4 |
5th row | 98-79-3 |
Value | Count | Frequency (%) |
68002-98-2 | 1 | < 0.1% |
69011-69-4 | 1 | < 0.1% |
68909-79-5 | 1 | < 0.1% |
220767-20-4 | 1 | < 0.1% |
80939-62-4 | 1 | < 0.1% |
822-06-0 | 1 | < 0.1% |
12220-10-9 | 1 | < 0.1% |
51-28-5 | 1 | < 0.1% |
75300-68-4 | 1 | < 0.1% |
124-25-4 | 1 | < 0.1% |
Other values (9992) | 9992 |
Most occurring characters
Value | Count | Frequency (%) |
- | 20014 | |
10002 | ||
1 | 9752 | |
6 | 8722 | |
8 | 8185 | |
2 | 7704 | 7.2% |
0 | 7544 | 7.0% |
7 | 7209 | 6.7% |
9 | 7158 | 6.7% |
3 | 7135 | 6.7% |
Other values (3) | 13798 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 77200 | |
Dash Punctuation | 20014 | 18.7% |
Space Separator | 10002 | 9.3% |
Other Punctuation | 7 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 9752 | |
6 | 8722 | |
8 | 8185 | |
2 | 7704 | |
0 | 7544 | |
7 | 7209 | |
9 | 7158 | |
3 | 7135 | |
5 | 7127 | |
4 | 6664 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 20014 |
Space Separator
Value | Count | Frequency (%) |
10002 |
Other Punctuation
Value | Count | Frequency (%) |
, | 7 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 107223 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
- | 20014 | |
10002 | ||
1 | 9752 | |
6 | 8722 | |
8 | 8185 | |
2 | 7704 | 7.2% |
0 | 7544 | 7.0% |
7 | 7209 | 6.7% |
9 | 7158 | 6.7% |
3 | 7135 | 6.7% |
Other values (3) | 13798 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 107223 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 20014 | |
10002 | ||
1 | 9752 | |
6 | 8722 | |
8 | 8185 | |
2 | 7704 | 7.2% |
0 | 7544 | 7.0% |
7 | 7209 | 6.7% |
9 | 7158 | 6.7% |
3 | 7135 | 6.7% |
Other values (3) | 13798 |
영문
Text
Distinct | 9997 |
---|---|
Distinct (%) | > 99.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 724 |
---|---|
Median length | 282 |
Mean length | 60.5119 |
Min length | 2 |
Characters and Unicode
Total characters | 605119 |
---|---|
Distinct characters | 108 |
Distinct categories | 13 ? |
Distinct scripts | 3 ? |
Distinct blocks | 6 ? |
Unique
Unique | 9994 ? |
---|---|
Unique (%) | 99.9% |
Sample
1st row | Tall oil polymer with phthalic anhydride and trimethylolpropane |
---|---|
2nd row | Fatty acids, (C=16-22), (6-phenyl-1,3,5-triazine-2,4-diyl)bis[[(methoxymethyl)imino]methylene] esters |
3rd row | Phenol, dinonyl- |
4th row | Glycerol 1,3-dipropionate |
5th row | 2-Pyrrolidon-5-carboxylic acid |
Value | Count | Frequency (%) |
with | 2825 | 6.1% |
acid | 2682 | 5.8% |
and | 2254 | 4.9% |
polymer | 1772 | 3.9% |
oil | 1066 | 2.3% |
acids | 869 | 1.9% |
fatty | 768 | 1.7% |
anhydride | 658 | 1.4% |
salt | 553 | 1.2% |
ester | 504 | 1.1% |
Other values (9276) | 31997 |
Most occurring characters
Value | Count | Frequency (%) |
e | 50539 | 8.4% |
o | 38757 | 6.4% |
i | 37640 | 6.2% |
35979 | 5.9% | |
l | 34466 | 5.7% |
a | 33220 | 5.5% |
t | 32783 | 5.4% |
n | 30352 | 5.0% |
- | 28893 | 4.8% |
y | 28149 | 4.7% |
Other values (98) | 254341 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 454593 | |
Space Separator | 35979 | 5.9% |
Decimal Number | 30330 | 5.0% |
Dash Punctuation | 28893 | 4.8% |
Other Punctuation | 22110 | 3.7% |
Uppercase Letter | 15541 | 2.6% |
Open Punctuation | 8342 | 1.4% |
Close Punctuation | 8315 | 1.4% |
Math Symbol | 934 | 0.2% |
Modifier Symbol | 72 | < 0.1% |
Other values (3) | 10 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 50539 | |
o | 38757 | 8.5% |
i | 37640 | 8.3% |
l | 34466 | 7.6% |
a | 33220 | 7.3% |
t | 32783 | 7.2% |
n | 30352 | 6.7% |
y | 28149 | 6.2% |
h | 24849 | 5.5% |
d | 23948 | 5.3% |
Other values (26) | 119890 |
Uppercase Letter
Value | Count | Frequency (%) |
C | 1927 | |
D | 1366 | 8.8% |
N | 1320 | 8.5% |
H | 1281 | 8.2% |
M | 1183 | 7.6% |
B | 972 | 6.3% |
F | 885 | 5.7% |
T | 879 | 5.7% |
P | 829 | 5.3% |
S | 796 | 5.1% |
Other values (17) | 4103 |
Other Punctuation
Value | Count | Frequency (%) |
, | 18358 | |
. | 1445 | 6.5% |
' | 1386 | 6.3% |
; | 558 | 2.5% |
: | 291 | 1.3% |
* | 21 | 0.1% |
′ | 19 | 0.1% |
" | 16 | 0.1% |
″ | 7 | < 0.1% |
/ | 6 | < 0.1% |
Other values (2) | 3 | < 0.1% |
Decimal Number
Value | Count | Frequency (%) |
2 | 8730 | |
1 | 8099 | |
3 | 4155 | |
4 | 3716 | |
5 | 1803 | 5.9% |
6 | 1403 | 4.6% |
8 | 743 | 2.4% |
7 | 641 | 2.1% |
0 | 585 | 1.9% |
9 | 455 | 1.5% |
Math Symbol
Value | Count | Frequency (%) |
= | 789 | |
+ | 74 | 7.9% |
→ | 24 | 2.6% |
~ | 24 | 2.6% |
> | 10 | 1.1% |
± | 5 | 0.5% |
∼ | 5 | 0.5% |
< | 2 | 0.2% |
≥ | 1 | 0.1% |
Open Punctuation
Value | Count | Frequency (%) |
( | 5616 | |
[ | 2723 | |
{ | 3 | < 0.1% |
Close Punctuation
Value | Count | Frequency (%) |
) | 5615 | |
] | 2696 | |
} | 4 | < 0.1% |
Other Number
Value | Count | Frequency (%) |
¹ | 2 | |
² | 1 | |
³ | 1 |
Space Separator
Value | Count | Frequency (%) |
35979 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 28893 |
Modifier Symbol
Value | Count | Frequency (%) |
` | 72 |
Final Punctuation
Value | Count | Frequency (%) |
’ | 5 |
Letter Number
Value | Count | Frequency (%) |
Ⅱ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 469259 | |
Common | 134984 | 22.3% |
Greek | 876 | 0.1% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 50539 | 10.8% |
o | 38757 | 8.3% |
i | 37640 | 8.0% |
l | 34466 | 7.3% |
a | 33220 | 7.1% |
t | 32783 | 7.0% |
n | 30352 | 6.5% |
y | 28149 | 6.0% |
h | 24849 | 5.3% |
d | 23948 | 5.1% |
Other values (43) | 134556 |
Common
Value | Count | Frequency (%) |
35979 | ||
- | 28893 | |
, | 18358 | |
2 | 8730 | 6.5% |
1 | 8099 | 6.0% |
( | 5616 | 4.2% |
) | 5615 | 4.2% |
3 | 4155 | 3.1% |
4 | 3716 | 2.8% |
[ | 2723 | 2.0% |
Other values (34) | 13100 | 9.7% |
Greek
Value | Count | Frequency (%) |
α | 467 | |
ω | 241 | |
β | 100 | 11.4% |
κ | 25 | 2.9% |
γ | 14 | 1.6% |
μ | 13 | 1.5% |
ε | 6 | 0.7% |
η | 5 | 0.6% |
δ | 3 | 0.3% |
Ο | 1 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 604172 | |
None | 885 | 0.1% |
Punctuation | 31 | < 0.1% |
Arrows | 24 | < 0.1% |
Math Operators | 6 | < 0.1% |
Number Forms | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
e | 50539 | 8.4% |
o | 38757 | 6.4% |
i | 37640 | 6.2% |
35979 | 6.0% | |
l | 34466 | 5.7% |
a | 33220 | 5.5% |
t | 32783 | 5.4% |
n | 30352 | 5.0% |
- | 28893 | 4.8% |
y | 28149 | 4.7% |
Other values (76) | 253394 |
None
Value | Count | Frequency (%) |
α | 467 | |
ω | 241 | |
β | 100 | 11.3% |
κ | 25 | 2.8% |
γ | 14 | 1.6% |
μ | 13 | 1.5% |
ε | 6 | 0.7% |
± | 5 | 0.6% |
η | 5 | 0.6% |
δ | 3 | 0.3% |
Other values (5) | 6 | 0.7% |
Arrows
Value | Count | Frequency (%) |
→ | 24 |
Punctuation
Value | Count | Frequency (%) |
′ | 19 | |
″ | 7 | 22.6% |
’ | 5 | 16.1% |
Math Operators
Value | Count | Frequency (%) |
∼ | 5 | |
≥ | 1 | 16.7% |
Number Forms
Value | Count | Frequency (%) |
Ⅱ | 1 |
국문
Text
MISSING
 
Distinct | 1741 |
---|---|
Distinct (%) | 99.8% |
Missing | 8256 |
Missing (%) | 82.6% |
Memory size | 156.2 KiB |
Length
Max length | 183 |
---|---|
Median length | 122 |
Mean length | 16.376147 |
Min length | 1 |
Characters and Unicode
Total characters | 28560 |
---|---|
Distinct characters | 413 |
Distinct categories | 13 ? |
Distinct scripts | 4 ? |
Distinct blocks | 5 ? |
Unique
Unique | 1738 ? |
---|---|
Unique (%) | 99.7% |
Sample
1st row | 2-부타논 옥심과 디이소시안산이소포론의 중합체 |
---|---|
2nd row | 아비산,은 |
3rd row | 1-(2-옥소-2-페닐에틸)피리디늄,브로마이드 |
4th row | 아이소데실,2-메틸-2-프로펜산염과,결합한,2-메틸-2-프로펜산,헥사데실,에스터,중합체와,옥타데실,2-메틸-2-프로펜산염 |
5th row | 메타아크릴산 3-클로로-2-히드록시프로필 |
Value | Count | Frequency (%) |
1:1 | 11 | 0.6% |
염 | 11 | 0.6% |
c.i | 5 | 0.3% |
나트륨 | 5 | 0.3% |
글리시딜 | 4 | 0.2% |
반응생성물 | 4 | 0.2% |
메틸 | 4 | 0.2% |
디스퍼스 | 3 | 0.2% |
알코올 | 3 | 0.2% |
에스터 | 3 | 0.2% |
Other values (1878) | 1907 |
Most occurring characters
Value | Count | Frequency (%) |
- | 2332 | 8.2% |
, | 2201 | 7.7% |
이 | 1711 | 6.0% |
로 | 980 | 3.4% |
트 | 768 | 2.7% |
틸 | 747 | 2.6% |
아 | 720 | 2.5% |
2 | 671 | 2.3% |
1 | 582 | 2.0% |
메 | 543 | 1.9% |
Other values (403) | 17305 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 19494 | |
Other Punctuation | 2431 | 8.5% |
Decimal Number | 2399 | 8.4% |
Dash Punctuation | 2332 | 8.2% |
Open Punctuation | 522 | 1.8% |
Close Punctuation | 517 | 1.8% |
Uppercase Letter | 438 | 1.5% |
Space Separator | 217 | 0.8% |
Lowercase Letter | 184 | 0.6% |
Math Symbol | 17 | 0.1% |
Other values (3) | 9 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
이 | 1711 | 8.8% |
로 | 980 | 5.0% |
트 | 768 | 3.9% |
틸 | 747 | 3.8% |
아 | 720 | 3.7% |
메 | 543 | 2.8% |
다 | 495 | 2.5% |
드 | 478 | 2.5% |
라 | 427 | 2.2% |
산 | 404 | 2.1% |
Other values (316) | 12221 |
Lowercase Letter
Value | Count | Frequency (%) |
t | 33 | |
e | 22 | |
r | 16 | 8.7% |
p | 13 | 7.1% |
a | 12 | 6.5% |
o | 12 | 6.5% |
c | 9 | 4.9% |
n | 8 | 4.3% |
α | 8 | 4.3% |
i | 8 | 4.3% |
Other values (16) | 43 |
Uppercase Letter
Value | Count | Frequency (%) |
N | 146 | |
I | 68 | |
O | 41 | 9.4% |
C | 34 | 7.8% |
H | 26 | 5.9% |
D | 18 | 4.1% |
S | 14 | 3.2% |
L | 13 | 3.0% |
T | 13 | 3.0% |
R | 12 | 2.7% |
Other values (11) | 53 | 12.1% |
Other Punctuation
Value | Count | Frequency (%) |
, | 2201 | |
' | 105 | 4.3% |
: | 48 | 2.0% |
. | 41 | 1.7% |
′ | 9 | 0.4% |
/ | 6 | 0.2% |
" | 6 | 0.2% |
; | 4 | 0.2% |
· | 4 | 0.2% |
# | 2 | 0.1% |
Other values (4) | 5 | 0.2% |
Decimal Number
Value | Count | Frequency (%) |
2 | 671 | |
1 | 582 | |
4 | 364 | |
3 | 361 | |
5 | 159 | 6.6% |
6 | 126 | 5.3% |
7 | 43 | 1.8% |
9 | 35 | 1.5% |
8 | 32 | 1.3% |
0 | 26 | 1.1% |
Math Symbol
Value | Count | Frequency (%) |
= | 7 | |
+ | 6 | |
~ | 2 | 11.8% |
> | 1 | 5.9% |
< | 1 | 5.9% |
Open Punctuation
Value | Count | Frequency (%) |
( | 374 | |
[ | 146 | 28.0% |
{ | 2 | 0.4% |
Close Punctuation
Value | Count | Frequency (%) |
) | 374 | |
] | 140 | 27.1% |
} | 3 | 0.6% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 2332 |
Space Separator
Value | Count | Frequency (%) |
217 |
Modifier Symbol
Value | Count | Frequency (%) |
` | 7 |
Final Punctuation
Value | Count | Frequency (%) |
’ | 1 |
Letter Number
Value | Count | Frequency (%) |
Ⅱ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 19494 | |
Common | 8443 | |
Latin | 605 | 2.1% |
Greek | 18 | 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
이 | 1711 | 8.8% |
로 | 980 | 5.0% |
트 | 768 | 3.9% |
틸 | 747 | 3.8% |
아 | 720 | 3.7% |
메 | 543 | 2.8% |
다 | 495 | 2.5% |
드 | 478 | 2.5% |
라 | 427 | 2.2% |
산 | 404 | 2.1% |
Other values (316) | 12221 |
Latin
Value | Count | Frequency (%) |
N | 146 | |
I | 68 | 11.2% |
O | 41 | 6.8% |
C | 34 | 5.6% |
t | 33 | 5.5% |
H | 26 | 4.3% |
e | 22 | 3.6% |
D | 18 | 3.0% |
r | 16 | 2.6% |
S | 14 | 2.3% |
Other values (31) | 187 |
Common
Value | Count | Frequency (%) |
- | 2332 | |
, | 2201 | |
2 | 671 | 7.9% |
1 | 582 | 6.9% |
( | 374 | 4.4% |
) | 374 | 4.4% |
4 | 364 | 4.3% |
3 | 361 | 4.3% |
217 | 2.6% | |
5 | 159 | 1.9% |
Other values (29) | 808 | 9.6% |
Greek
Value | Count | Frequency (%) |
α | 8 | |
κ | 4 | |
ω | 2 | 11.1% |
η | 1 | 5.6% |
β | 1 | 5.6% |
ε | 1 | 5.6% |
Ο | 1 | 5.6% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 19494 | |
ASCII | 9032 | |
None | 23 | 0.1% |
Punctuation | 10 | < 0.1% |
Number Forms | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 2332 | |
, | 2201 | |
2 | 671 | 7.4% |
1 | 582 | 6.4% |
( | 374 | 4.1% |
) | 374 | 4.1% |
4 | 364 | 4.0% |
3 | 361 | 4.0% |
217 | 2.4% | |
5 | 159 | 1.8% |
Other values (65) | 1397 |
Hangul
Value | Count | Frequency (%) |
이 | 1711 | 8.8% |
로 | 980 | 5.0% |
트 | 768 | 3.9% |
틸 | 747 | 3.8% |
아 | 720 | 3.7% |
메 | 543 | 2.8% |
다 | 495 | 2.5% |
드 | 478 | 2.5% |
라 | 427 | 2.2% |
산 | 404 | 2.1% |
Other values (316) | 12221 |
Punctuation
Value | Count | Frequency (%) |
′ | 9 | |
’ | 1 | 10.0% |
None
Value | Count | Frequency (%) |
α | 8 | |
· | 4 | |
κ | 4 | |
ω | 2 | 8.7% |
η | 1 | 4.3% |
β | 1 | 4.3% |
\ | 1 | 4.3% |
ε | 1 | 4.3% |
Ο | 1 | 4.3% |
Number Forms
Value | Count | Frequency (%) |
Ⅱ | 1 |
유독물질
Categorical
IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
<NA> | |
---|---|
0 | 297 |
1 | 285 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.8254 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 9418 | |
0 | 297 | 3.0% |
1 | 285 | 2.9% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 9418 | |
0 | 297 | 3.0% |
1 | 285 | 2.9% |
제한물질
Categorical
IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
<NA> | |
---|---|
0 | 576 |
1 | 6 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.8254 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 9418 | |
0 | 576 | 5.8% |
1 | 6 | 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 9418 | |
0 | 576 | 5.8% |
1 | 6 | 0.1% |
금지물질
Categorical
IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
<NA> | |
---|---|
0 | 569 |
1 | 13 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.8254 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 9418 | |
0 | 569 | 5.7% |
1 | 13 | 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 9418 | |
0 | 569 | 5.7% |
1 | 13 | 0.1% |
사고대비물질
Categorical
IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
<NA> | |
---|---|
0 | 557 |
1 | 24 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.8257 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 9419 | |
0 | 557 | 5.6% |
1 | 24 | 0.2% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 9419 | |
0 | 557 | 5.6% |
1 | 24 | 0.2% |
중점관리대상물질
Categorical
IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
<NA> | |
---|---|
0 | 381 |
1 | 201 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.8254 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 9418 | |
0 | 381 | 3.8% |
1 | 201 | 2.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 9418 | |
0 | 381 | 3.8% |
1 | 201 | 2.0% |
물질상태
Categorical
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
<NA> | |
---|---|
액체 | |
고체 | |
기체 | 25 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.453 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | 액체 |
4th row | <NA> |
5th row | 고체 |
Common Values
Value | Count | Frequency (%) |
<NA> | 7265 | |
액체 | 1396 | 14.0% |
고체 | 1314 | 13.1% |
기체 | 25 | 0.2% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 7265 | |
액체 | 1396 | 14.0% |
고체 | 1314 | 13.1% |
기체 | 25 | 0.2% |
끝점농도
Text
MISSING
 
Distinct | 324 |
---|---|
Distinct (%) | 65.3% |
Missing | 9504 |
Missing (%) | 95.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
ppm | 496 | |
0 | 33 | 3.3% |
0.01 | 17 | 1.7% |
100 | 9 | 0.9% |
0.02 | 9 | 0.9% |
2 | 8 | 0.8% |
75 | 7 | 0.7% |
1 | 6 | 0.6% |
0.05 | 6 | 0.6% |
0.03 | 5 | 0.5% |
Other values (315) | 396 |
Most occurring characters
Value | Count | Frequency (%) |
p | 992 | |
496 | ||
m | 496 | |
. | 352 | 9.3% |
0 | 350 | 9.3% |
1 | 199 | 5.3% |
2 | 179 | 4.8% |
3 | 139 | 3.7% |
5 | 127 | 3.4% |
7 | 108 | 2.9% |
Other values (4) | 328 | 8.7% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 1488 | |
Decimal Number | 1430 | |
Space Separator | 496 | 13.2% |
Other Punctuation | 352 | 9.3% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 350 | |
1 | 199 | |
2 | 179 | |
3 | 139 | 9.7% |
5 | 127 | 8.9% |
7 | 108 | 7.6% |
4 | 98 | 6.9% |
6 | 90 | 6.3% |
8 | 73 | 5.1% |
9 | 67 | 4.7% |
Lowercase Letter
Value | Count | Frequency (%) |
p | 992 | |
m | 496 |
Space Separator
Value | Count | Frequency (%) |
496 |
Other Punctuation
Value | Count | Frequency (%) |
. | 352 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 2278 | |
Latin | 1488 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
496 | ||
. | 352 | |
0 | 350 | |
1 | 199 | |
2 | 179 | 7.9% |
3 | 139 | 6.1% |
5 | 127 | 5.6% |
7 | 108 | 4.7% |
4 | 98 | 4.3% |
6 | 90 | 4.0% |
Other values (2) | 140 | 6.1% |
Latin
Value | Count | Frequency (%) |
p | 992 | |
m | 496 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 3766 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
p | 992 | |
496 | ||
m | 496 | |
. | 352 | 9.3% |
0 | 350 | 9.3% |
1 | 199 | 5.3% |
2 | 179 | 4.8% |
3 | 139 | 3.7% |
5 | 127 | 3.4% |
7 | 108 | 2.9% |
Other values (4) | 328 | 8.7% |
끝점농도 기준
Categorical
IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
<NA> | |
---|---|
PAC-2 | 401 |
IDLH*0.1 | 96 |
Length
Max length | 8 |
---|---|
Median length | 4 |
Mean length | 4.0785 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 9503 | |
PAC-2 | 401 | 4.0% |
IDLH*0.1 | 96 | 1.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 9503 | |
pac-2 | 401 | 4.0% |
idlh*0.1 | 96 | 1.0% |
유해위험코드분류수
Real number (ℝ)
MISSING
 
Distinct | 23 |
---|---|
Distinct (%) | 0.7% |
Missing | 6589 |
Missing (%) | 65.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.2099091 |
Minimum | 1 |
---|---|
Maximum | 25 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 2 |
median | 3 |
Q3 | 5 |
95-th percentile | 10 |
Maximum | 25 |
Range | 24 |
Interquartile range (IQR) | 3 |
Descriptive statistics
Standard deviation | 2.9806662 |
---|---|
Coefficient of variation (CV) | 0.70801201 |
Kurtosis | 5.4043103 |
Mean | 4.2099091 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 1.8571005 |
Sum | 14360 |
Variance | 8.884371 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
3 | 704 | 7.0% |
1 | 521 | 5.2% |
4 | 499 | 5.0% |
2 | 481 | 4.8% |
5 | 378 | 3.8% |
6 | 286 | 2.9% |
7 | 170 | 1.7% |
8 | 108 | 1.1% |
9 | 64 | 0.6% |
10 | 50 | 0.5% |
Other values (13) | 150 | 1.5% |
(Missing) | 6589 |
Value | Count | Frequency (%) |
1 | 521 | |
2 | 481 | |
3 | 704 | |
4 | 499 | |
5 | 378 | |
6 | 286 | |
7 | 170 | 1.7% |
8 | 108 | 1.1% |
9 | 64 | 0.6% |
10 | 50 | 0.5% |
Value | Count | Frequency (%) |
25 | 2 | < 0.1% |
23 | 1 | < 0.1% |
21 | 1 | < 0.1% |
20 | 1 | < 0.1% |
19 | 3 | < 0.1% |
18 | 11 | |
17 | 4 | < 0.1% |
16 | 11 | |
15 | 10 | |
14 | 13 |
사고노출위험구분수
Categorical
IMBALANCE
 
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
<NA> | |
---|---|
4 | 830 |
5 | 460 |
3 | 46 |
2 | 24 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.5872 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | 4 |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 8624 | |
4 | 830 | 8.3% |
5 | 460 | 4.6% |
3 | 46 | 0.5% |
2 | 24 | 0.2% |
1 | 16 | 0.2% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 8624 | |
4 | 830 | 8.3% |
5 | 460 | 4.6% |
3 | 46 | 0.5% |
2 | 24 | 0.2% |
1 | 16 | 0.2% |
Unnamed: 13
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 10000 |
---|---|
Missing (%) | 100.0% |
Memory size | 166.0 KiB |
Unnamed: 14
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 10000 |
---|---|
Missing (%) | 100.0% |
Memory size | 166.0 KiB |
Unnamed: 15
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 10000 |
---|---|
Missing (%) | 100.0% |
Memory size | 166.0 KiB |
Unnamed: 16
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 10000 |
---|---|
Missing (%) | 100.0% |
Memory size | 166.0 KiB |
유독물질 | 제한물질 | 금지물질 | 사고대비물질 | 중점관리대상물질 | 물질상태 | 끝점농도 기준 | 유해위험코드분류수 | 사고노출위험구분수 | |
---|---|---|---|---|---|---|---|---|---|
유독물질 | 1.000 | 0.000 | 0.136 | 0.079 | 0.582 | 0.061 | 0.230 | 0.635 | 0.000 |
제한물질 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.032 | 0.000 | 0.000 | 0.000 |
금지물질 | 0.136 | 0.000 | 1.000 | 0.000 | 0.138 | 0.068 | 0.000 | 0.000 | 0.000 |
사고대비물질 | 0.079 | 0.000 | 0.000 | 1.000 | 0.000 | 0.174 | 0.000 | 0.000 | 0.071 |
중점관리대상물질 | 0.582 | 0.000 | 0.138 | 0.000 | 1.000 | 0.080 | 0.202 | 0.430 | 0.127 |
물질상태 | 0.061 | 0.032 | 0.068 | 0.174 | 0.080 | 1.000 | 0.069 | 0.128 | 0.104 |
끝점농도 기준 | 0.230 | 0.000 | 0.000 | 0.000 | 0.202 | 0.069 | 1.000 | 0.058 | 0.134 |
유해위험코드분류수 | 0.635 | 0.000 | 0.000 | 0.000 | 0.430 | 0.128 | 0.058 | 1.000 | 0.279 |
사고노출위험구분수 | 0.000 | 0.000 | 0.000 | 0.071 | 0.127 | 0.104 | 0.134 | 0.279 | 1.000 |
사고노출위험구분수 | 끝점농도 기준 | 유독물질 | 물질상태 | 중점관리대상물질 | 제한물질 | 금지물질 | 사고대비물질 | |
---|---|---|---|---|---|---|---|---|
사고노출위험구분수 | 1.000 | 0.163 | 0.000 | 0.078 | 0.155 | 0.000 | 0.000 | 0.087 |
끝점농도 기준 | 0.163 | 1.000 | 0.147 | 0.114 | 0.130 | 0.000 | 0.000 | 0.000 |
유독물질 | 0.000 | 0.147 | 1.000 | 0.102 | 0.395 | 0.000 | 0.087 | 0.050 |
물질상태 | 0.078 | 0.114 | 0.102 | 1.000 | 0.133 | 0.052 | 0.112 | 0.286 |
중점관리대상물질 | 0.155 | 0.130 | 0.395 | 0.133 | 1.000 | 0.000 | 0.088 | 0.000 |
제한물질 | 0.000 | 0.000 | 0.000 | 0.052 | 0.000 | 1.000 | 0.000 | 0.000 |
금지물질 | 0.000 | 0.000 | 0.087 | 0.112 | 0.088 | 0.000 | 1.000 | 0.000 |
사고대비물질 | 0.087 | 0.000 | 0.050 | 0.286 | 0.000 | 0.000 | 0.000 | 1.000 |
유해위험코드분류수 | 유독물질 | 제한물질 | 금지물질 | 사고대비물질 | 중점관리대상물질 | 물질상태 | 끝점농도 기준 | 사고노출위험구분수 | |
---|---|---|---|---|---|---|---|---|---|
유해위험코드분류수 | 1.000 | 0.479 | 0.000 | 0.000 | 0.000 | 0.320 | 0.074 | 0.070 | 0.166 |
유독물질 | 0.479 | 1.000 | 0.000 | 0.087 | 0.050 | 0.395 | 0.102 | 0.147 | 0.000 |
제한물질 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.052 | 0.000 | 0.000 |
금지물질 | 0.000 | 0.087 | 0.000 | 1.000 | 0.000 | 0.088 | 0.112 | 0.000 | 0.000 |
사고대비물질 | 0.000 | 0.050 | 0.000 | 0.000 | 1.000 | 0.000 | 0.286 | 0.000 | 0.087 |
중점관리대상물질 | 0.320 | 0.395 | 0.000 | 0.088 | 0.000 | 1.000 | 0.133 | 0.130 | 0.155 |
물질상태 | 0.074 | 0.102 | 0.052 | 0.112 | 0.286 | 0.133 | 1.000 | 0.114 | 0.078 |
끝점농도 기준 | 0.070 | 0.147 | 0.000 | 0.000 | 0.000 | 0.130 | 0.114 | 1.000 | 0.163 |
사고노출위험구분수 | 0.166 | 0.000 | 0.000 | 0.000 | 0.087 | 0.155 | 0.078 | 0.163 | 1.000 |
고유(CAS)번호 | 영문 | 국문 | 유독물질 | 제한물질 | 금지물질 | 사고대비물질 | 중점관리대상물질 | 물질상태 | 끝점농도 | 끝점농도 기준 | 유해위험코드분류수 | 사고노출위험구분수 | Unnamed: 13 | Unnamed: 14 | Unnamed: 15 | Unnamed: 16 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
26000 | 68002-98-2 | Tall oil polymer with phthalic anhydride and trimethylolpropane | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
34922 | 73049-43-1 | Fatty acids, (C=16-22), (6-phenyl-1,3,5-triazine-2,4-diyl)bis[[(methoxymethyl)imino]methylene] esters | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
6052 | 1323-65-5 | Phenol, dinonyl- | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 액체 | <NA> | <NA> | 8 | 4 | <NA> | <NA> | <NA> | <NA> |
10999 | 18373-31-4 | Glycerol 1,3-dipropionate | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
42505 | 98-79-3 | 2-Pyrrolidon-5-carboxylic acid | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 고체 | <NA> | <NA> | 4 | <NA> | <NA> | <NA> | <NA> | <NA> |
36138 | 7757-86-0 | Magnesium hydrogenorthophosphate | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
4712 | 12270-00-7 | C.I. acid red 227 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
4805 | 123236-78-2 | Formaldehyde polymer with 3-methylphenol, 4-methylphenol and 2,3,5-trimethylphenol | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
12832 | 232938-10-2 | 2-Butanone oxime polymer with isophoronediisocyanate | 2-부타논 옥심과 디이소시안산이소포론의 중합체 | 1 | 0 | 0 | 0 | 0 | <NA> | <NA> | <NA> | 8 | <NA> | <NA> | <NA> | <NA> | <NA> |
13528 | 25134-01-4 | 2,6-Dimethylphenol homopolymer; Poly (2,6-dimethyl-1,4-phenylene oxide) | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 액체 | <NA> | <NA> | 12 | <NA> | <NA> | <NA> | <NA> | <NA> |
고유(CAS)번호 | 영문 | 국문 | 유독물질 | 제한물질 | 금지물질 | 사고대비물질 | 중점관리대상물질 | 물질상태 | 끝점농도 | 끝점농도 기준 | 유해위험코드분류수 | 사고노출위험구분수 | Unnamed: 13 | Unnamed: 14 | Unnamed: 15 | Unnamed: 16 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
6875 | 136210-30-5 | Tetraethyl N,N'-(methylenedi-4,1-cyclohexanediyl)bis(aspartate) | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 액체 | <NA> | <NA> | 2 | <NA> | <NA> | <NA> | <NA> | <NA> |
30018 | 68585-48-8 | Sulfuric acid nickel(2+) salt (1:1), reaction products with nickel and nickel oxide (NiO) | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
23720 | 63221-88-5 | 1-(4-Methoxyphenyl)-2-(4-ethylphenyl)ethyne | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 1 | <NA> | <NA> | <NA> | <NA> | <NA> |
8693 | 15451-00-0 | 3-Sulfinobenzoic,acid | 3-설피노벤조산 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 2 | <NA> | <NA> | <NA> | <NA> | <NA> |
32127 | 68952-55-6 | Fatty acids, cottonseed-oil polymers with benzoic acid, isophthalic acid, pentaerythritol and phthalic anhydride | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
37404 | 819862-31-2 | 7,8-Difluoro-3,4-dihydro-2-pentyl-6-[[(trans,trans)-4'-propyl[1,1'-bicyclohexyl]-4-yl]methoxy]-2H-1-benzopyran | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
1370 | 104351-92-0 | Poly(oxy-1,2-ethanediyl), α-sulfo-ω-[2-(methyloctadecylamino)ethoxy]ammonium salt | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
19841 | 51999-23-6 | Methylenebutanedioic acid polymer with butyl 2-propenoate, ethyl 2-propenoate and N-(hydroxymethyl)-2-propenamide | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
7609 | 14167-15-8 | [2,2'-Ethylenebis(nitrilomethylidene)diphenolate]copper(II) | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 액체 | <NA> | <NA> | 1 | <NA> | <NA> | <NA> | <NA> | <NA> |
26007 | 68003-16-7 | Hexanedioic acid polymer with 1,3-benzenedicarboxylic acid, 2,2-dimethyl-1,3-propanediol, 1,3-isobenzofurandione, (Z)-9-octadecenoic acid dimer and 1,2-propanediol | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |