Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 30 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.3 KiB |
Average record size in memory | 44.4 B |
Variable types
Text | 4 |
---|---|
Categorical | 1 |
Dataset
Description | 샘플 데이터 |
---|---|
Author | 한국생산기술연구원 |
URL | https://bigdata-region.kr/#/dataset/bb99ea99-1800-4848-b50a-6c827d7a84be |
Reproduction
Analysis started | 2023-12-10 13:48:38.227093 |
---|---|
Analysis finished | 2023-12-10 13:48:39.127732 |
Duration | 0.9 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
화학물질요약서비스번호
Text
Distinct | 28 |
---|---|
Distinct (%) | 93.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Value | Count | Frequency (%) |
287-92-3 | 2 | 6.7% |
26658-19-5 | 2 | 6.7% |
75-04-7 | 1 | 3.3% |
68-11-1 | 1 | 3.3% |
281-23-2 | 1 | 3.3% |
27458-94-2 | 1 | 3.3% |
27193-86-8 | 1 | 3.3% |
2695-37-6 | 1 | 3.3% |
26761-40-0 | 1 | 3.3% |
26523-78-4 | 1 | 3.3% |
Other values (18) | 18 |
Most occurring characters
Value | Count | Frequency (%) |
- | 60 | |
2 | 36 | |
1 | 27 | |
6 | 25 | |
8 | 21 | 7.7% |
5 | 19 | 7.0% |
7 | 18 | 6.6% |
3 | 18 | 6.6% |
0 | 18 | 6.6% |
4 | 17 | 6.2% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 213 | |
Dash Punctuation | 60 | 22.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
2 | 36 | |
1 | 27 | |
6 | 25 | |
8 | 21 | |
5 | 19 | |
7 | 18 | |
3 | 18 | |
0 | 18 | |
4 | 17 | |
9 | 14 | 6.6% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 60 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 273 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
- | 60 | |
2 | 36 | |
1 | 27 | |
6 | 25 | |
8 | 21 | 7.7% |
5 | 19 | 7.0% |
7 | 18 | 6.6% |
3 | 18 | 6.6% |
0 | 18 | 6.6% |
4 | 17 | 6.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 273 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 60 | |
2 | 36 | |
1 | 27 | |
6 | 25 | |
8 | 21 | 7.7% |
5 | 19 | 7.0% |
7 | 18 | 6.6% |
3 | 18 | 6.6% |
0 | 18 | 6.6% |
4 | 17 | 6.2% |
화학물질요약서비스물질명
Text
Distinct | 28 |
---|---|
Distinct (%) | 93.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Length
Max length | 37 |
---|---|
Median length | 27 |
Mean length | 21.066667 |
Min length | 10 |
Characters and Unicode
Total characters | 632 |
---|---|
Distinct characters | 52 |
Distinct categories | 9 ? |
Distinct scripts | 3 ? |
Distinct blocks | 3 ? |
Unique
Unique | 26 ? |
---|---|
Unique (%) | 86.7% |
Sample
1st row | Ethylamine |
---|---|
2nd row | Thioglycolic acid derivatives |
3rd row | p-Nitrophenylhydrazine |
4th row | Pentachloroniobium |
5th row | Cobalt (Ⅱ) sulfate |
Value | Count | Frequency (%) |
cyclopentane | 2 | 3.3% |
비이온성 | 2 | 3.3% |
sodium | 2 | 3.3% |
1 | 2 | 3.3% |
유기surfactant | 2 | 3.3% |
1-bis | 1 | 1.7% |
plastics | 1 | 1.7% |
silicon | 1 | 1.7% |
tetrafluoride | 1 | 1.7% |
hydrazine | 1 | 1.7% |
Other values (45) | 45 |
Most occurring characters
Value | Count | Frequency (%) |
e | 55 | 8.7% |
o | 48 | 7.6% |
n | 44 | 7.0% |
a | 42 | 6.6% |
i | 42 | 6.6% |
t | 41 | 6.5% |
l | 38 | 6.0% |
31 | 4.9% | |
r | 27 | 4.3% |
h | 26 | 4.1% |
Other values (42) | 238 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 516 | |
Uppercase Letter | 39 | 6.2% |
Space Separator | 31 | 4.9% |
Dash Punctuation | 13 | 2.1% |
Other Letter | 12 | 1.9% |
Decimal Number | 10 | 1.6% |
Close Punctuation | 5 | 0.8% |
Open Punctuation | 5 | 0.8% |
Letter Number | 1 | 0.2% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 55 | |
o | 48 | 9.3% |
n | 44 | 8.5% |
a | 42 | 8.1% |
i | 42 | 8.1% |
t | 41 | 7.9% |
l | 38 | 7.4% |
r | 27 | 5.2% |
h | 26 | 5.0% |
y | 22 | 4.3% |
Other values (12) | 131 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 6 | |
T | 6 | |
D | 5 | |
C | 5 | |
A | 4 | |
B | 3 | |
P | 2 | 5.1% |
L | 2 | 5.1% |
H | 1 | 2.6% |
I | 1 | 2.6% |
Other values (4) | 4 |
Other Letter
Value | Count | Frequency (%) |
기 | 2 | |
유 | 2 | |
성 | 2 | |
온 | 2 | |
이 | 2 | |
비 | 2 |
Decimal Number
Value | Count | Frequency (%) |
1 | 4 | |
4 | 2 | |
7 | 2 | |
3 | 1 | 10.0% |
2 | 1 | 10.0% |
Space Separator
Value | Count | Frequency (%) |
31 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 13 |
Close Punctuation
Value | Count | Frequency (%) |
) | 5 |
Open Punctuation
Value | Count | Frequency (%) |
( | 5 |
Letter Number
Value | Count | Frequency (%) |
Ⅱ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 556 | |
Common | 64 | 10.1% |
Hangul | 12 | 1.9% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 55 | 9.9% |
o | 48 | 8.6% |
n | 44 | 7.9% |
a | 42 | 7.6% |
i | 42 | 7.6% |
t | 41 | 7.4% |
l | 38 | 6.8% |
r | 27 | 4.9% |
h | 26 | 4.7% |
y | 22 | 4.0% |
Other values (27) | 171 |
Common
Value | Count | Frequency (%) |
31 | ||
- | 13 | |
) | 5 | 7.8% |
( | 5 | 7.8% |
1 | 4 | 6.2% |
4 | 2 | 3.1% |
7 | 2 | 3.1% |
3 | 1 | 1.6% |
2 | 1 | 1.6% |
Hangul
Value | Count | Frequency (%) |
기 | 2 | |
유 | 2 | |
성 | 2 | |
온 | 2 | |
이 | 2 | |
비 | 2 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 619 | |
Hangul | 12 | 1.9% |
Number Forms | 1 | 0.2% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
e | 55 | 8.9% |
o | 48 | 7.8% |
n | 44 | 7.1% |
a | 42 | 6.8% |
i | 42 | 6.8% |
t | 41 | 6.6% |
l | 38 | 6.1% |
31 | 5.0% | |
r | 27 | 4.4% |
h | 26 | 4.2% |
Other values (35) | 225 |
Hangul
Value | Count | Frequency (%) |
기 | 2 | |
유 | 2 | |
성 | 2 | |
온 | 2 | |
이 | 2 | |
비 | 2 |
Number Forms
Value | Count | Frequency (%) |
Ⅱ | 1 |
물질용도분류명
Categorical
Distinct | 2 |
---|---|
Distinct (%) | 6.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Chem | |
---|---|
Elec |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Chem |
---|---|
2nd row | Chem |
3rd row | Chem |
4th row | Chem |
5th row | Chem |
Common Values
Value | Count | Frequency (%) |
Chem | 22 | |
Elec | 8 | 26.7% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
chem | 22 | |
elec | 8 | 26.7% |
물질용도영어명
Text
Distinct | 29 |
---|---|
Distinct (%) | 96.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Length
Max length | 40 |
---|---|
Median length | 28 |
Mean length | 15.466667 |
Min length | 7 |
Characters and Unicode
Total characters | 464 |
---|---|
Distinct characters | 35 |
Distinct categories | 3 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 28 ? |
---|---|
Unique (%) | 93.3% |
Sample
1st row | Insecticid |
---|---|
2nd row | Hair dye |
3rd row | Cosmetic cleanser |
4th row | Catalyst |
5th row | Fertilizer |
Value | Count | Frequency (%) |
electrolyte | 4 | 7.3% |
battery | 4 | 7.3% |
agent | 3 | 5.5% |
emulsifier | 2 | 3.6% |
ion | 2 | 3.6% |
lithium | 2 | 3.6% |
electric | 2 | 3.6% |
swelling | 1 | 1.8% |
fertilizer | 1 | 1.8% |
aircraft | 1 | 1.8% |
Other values (33) | 33 |
Most occurring characters
Value | Count | Frequency (%) |
e | 56 | |
t | 53 | |
i | 46 | 9.9% |
r | 37 | 8.0% |
a | 34 | 7.3% |
l | 28 | 6.0% |
25 | 5.4% | |
c | 24 | 5.2% |
n | 23 | 5.0% |
o | 19 | 4.1% |
Other values (25) | 119 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 409 | |
Uppercase Letter | 30 | 6.5% |
Space Separator | 25 | 5.4% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 56 | |
t | 53 | |
i | 46 | |
r | 37 | |
a | 34 | |
l | 28 | 6.8% |
c | 24 | 5.9% |
n | 23 | 5.6% |
o | 19 | 4.6% |
s | 13 | 3.2% |
Other values (12) | 76 |
Uppercase Letter
Value | Count | Frequency (%) |
E | 7 | |
A | 5 | |
H | 4 | |
C | 3 | |
S | 2 | 6.7% |
F | 2 | 6.7% |
L | 2 | 6.7% |
R | 1 | 3.3% |
B | 1 | 3.3% |
I | 1 | 3.3% |
Other values (2) | 2 | 6.7% |
Space Separator
Value | Count | Frequency (%) |
25 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 439 | |
Common | 25 | 5.4% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 56 | |
t | 53 | |
i | 46 | |
r | 37 | 8.4% |
a | 34 | 7.7% |
l | 28 | 6.4% |
c | 24 | 5.5% |
n | 23 | 5.2% |
o | 19 | 4.3% |
s | 13 | 3.0% |
Other values (24) | 106 |
Common
Value | Count | Frequency (%) |
25 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 464 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
e | 56 | |
t | 53 | |
i | 46 | 9.9% |
r | 37 | 8.0% |
a | 34 | 7.3% |
l | 28 | 6.0% |
25 | 5.4% | |
c | 24 | 5.2% |
n | 23 | 5.0% |
o | 19 | 4.1% |
Other values (25) | 119 |
물질용도한글명
Text
Distinct | 29 |
---|---|
Distinct (%) | 96.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Value | Count | Frequency (%) |
전지 | 3 | 6.1% |
유화제 | 2 | 4.1% |
전해질 | 2 | 4.1% |
전해액 | 2 | 4.1% |
이온 | 2 | 4.1% |
리튬 | 2 | 4.1% |
방지제 | 1 | 2.0% |
제초제 | 1 | 2.0% |
항공기 | 1 | 2.0% |
코팅 | 1 | 2.0% |
Other values (32) | 32 |
Most occurring characters
Value | Count | Frequency (%) |
19 | 12.8% | |
제 | 19 | 12.8% |
전 | 11 | 7.4% |
화 | 6 | 4.1% |
지 | 5 | 3.4% |
해 | 5 | 3.4% |
리 | 4 | 2.7% |
연 | 4 | 2.7% |
기 | 3 | 2.0% |
이 | 3 | 2.0% |
Other values (56) | 69 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 129 | |
Space Separator | 19 | 12.8% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
제 | 19 | 14.7% |
전 | 11 | 8.5% |
화 | 6 | 4.7% |
지 | 5 | 3.9% |
해 | 5 | 3.9% |
리 | 4 | 3.1% |
연 | 4 | 3.1% |
기 | 3 | 2.3% |
이 | 3 | 2.3% |
질 | 3 | 2.3% |
Other values (55) | 66 |
Space Separator
Value | Count | Frequency (%) |
19 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 129 | |
Common | 19 | 12.8% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
제 | 19 | 14.7% |
전 | 11 | 8.5% |
화 | 6 | 4.7% |
지 | 5 | 3.9% |
해 | 5 | 3.9% |
리 | 4 | 3.1% |
연 | 4 | 3.1% |
기 | 3 | 2.3% |
이 | 3 | 2.3% |
질 | 3 | 2.3% |
Other values (55) | 66 |
Common
Value | Count | Frequency (%) |
19 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 129 | |
ASCII | 19 | 12.8% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
19 |
Hangul
Value | Count | Frequency (%) |
제 | 19 | 14.7% |
전 | 11 | 8.5% |
화 | 6 | 4.7% |
지 | 5 | 3.9% |
해 | 5 | 3.9% |
리 | 4 | 3.1% |
연 | 4 | 3.1% |
기 | 3 | 2.3% |
이 | 3 | 2.3% |
질 | 3 | 2.3% |
Other values (55) | 66 |
화학물질요약서비스번호 | 화학물질요약서비스물질명 | 물질용도분류명 | 물질용도영어명 | 물질용도한글명 | |
---|---|---|---|---|---|
화학물질요약서비스번호 | 1.000 | 1.000 | 1.000 | 0.984 | 0.984 |
화학물질요약서비스물질명 | 1.000 | 1.000 | 1.000 | 0.984 | 0.984 |
물질용도분류명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
물질용도영어명 | 0.984 | 0.984 | 1.000 | 1.000 | 1.000 |
물질용도한글명 | 0.984 | 0.984 | 1.000 | 1.000 | 1.000 |
화학물질요약서비스번호 | 화학물질요약서비스물질명 | 물질용도분류명 | 물질용도영어명 | 물질용도한글명 | |
---|---|---|---|---|---|
0 | 75-04-7 | Ethylamine | Chem | Insecticid | 살충제 |
1 | 68-11-1 | Thioglycolic acid derivatives | Chem | Hair dye | 머리 염색제 |
2 | 100-16-3 | p-Nitrophenylhydrazine | Chem | Cosmetic cleanser | 화장품 클렌저 |
3 | 10026-12-7 | Pentachloroniobium | Chem | Catalyst | 촉매 |
4 | 10026-24-1 | Cobalt (Ⅱ) sulfate | Chem | Fertilizer | 비료 |
5 | 12002-48-1 | Trichlorobenzene (mixture of isomers) | Chem | Swelling agent | 팽창제 |
6 | 120-54-7 | Dipentamethylenethiuram tetrasulfide | Chem | Heat stabilizer | 열 안정제 |
7 | 12058-66-1 | Sodium stannate | Chem | Electrolytic surface treatment agent | 전해 표면 처리제 |
8 | 12069-32-8 | Boron carbide | Chem | Abrasive | 연마제 |
9 | 253-52-1 | Phthalazine | Elec | Rechargeable lithium ion battery cathode | 리튬 이온 전지 양극 |
화학물질요약서비스번호 | 화학물질요약서비스물질명 | 물질용도분류명 | 물질용도영어명 | 물질용도한글명 | |
---|---|---|---|---|---|
20 | 26658-19-5 | 비이온성 유기Surfactant | Chem | Corrosion inhibitor | 부식억제제 |
21 | 26658-19-5 | 비이온성 유기Surfactant | Chem | Emulsifier | 유화제 |
22 | 26761-40-0 | Diisodecyl phthalate | Chem | Artificial leather | 인조 가죽 |
23 | 2695-37-6 | Sodium styrenesulfonate | Chem | Emulsifier | 유화제 |
24 | 27193-86-8 | p-Dodecylphenol | Chem | Fuel additive | 연료 첨가제 |
25 | 27458-94-2 | Isononyl alcohol | Chem | Detergent | 세제 |
26 | 281-23-2 | Adamantane | Chem | Hardener | 경화제 |
27 | 287-92-3 | Cyclopentane | Chem | Adhesive | 접착제 |
28 | 287-92-3 | Cyclopentane | Chem | Lubricant | 윤활유 |
29 | 3006-86-8 | 1 1-Bis (t-butylperoxy) cyclohexane | Chem | Oxidation agent | 산화제 |