Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 4.8 KiB |
Average record size in memory | 49.3 B |
Variable types
Text | 3 |
---|---|
Categorical | 2 |
DateTime | 1 |
Dataset
Description | 샘플 데이터 |
---|---|
Author | 그린에코스 |
URL | https://www.bigdata-environment.kr/user/data_market/detail.do?id=c8526260-c0d9-11ea-8930-811a19579074 |
Reproduction
Analysis started | 2023-12-10 13:23:02.037327 |
---|---|
Analysis finished | 2023-12-10 13:23:02.831532 |
Duration | 0.79 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
CHRIP등록번호
Text
UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 12 |
---|---|
Median length | 12 |
Mean length | 12 |
Min length | 12 |
Characters and Unicode
Total characters | 1200 |
---|---|
Distinct characters | 13 |
Distinct categories | 3 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 100 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | C004-685-91A |
---|---|
2nd row | C005-480-87A |
3rd row | C004-690-50A |
4th row | C004-741-06A |
5th row | C004-660-17A |
Value | Count | Frequency (%) |
c004-685-91a | 1 | 1.0% |
c004-785-95a | 1 | 1.0% |
c004-660-62a | 1 | 1.0% |
c004-666-31a | 1 | 1.0% |
c004-685-24a | 1 | 1.0% |
c004-691-41a | 1 | 1.0% |
c004-737-38a | 1 | 1.0% |
c004-665-51a | 1 | 1.0% |
c004-726-55a | 1 | 1.0% |
c004-675-98a | 1 | 1.0% |
Other values (90) | 90 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 245 | |
- | 200 | |
4 | 123 | |
6 | 101 | |
C | 100 | |
A | 100 | |
7 | 79 | 6.6% |
8 | 47 | 3.9% |
2 | 47 | 3.9% |
5 | 46 | 3.8% |
Other values (3) | 112 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 800 | |
Dash Punctuation | 200 | 16.7% |
Uppercase Letter | 200 | 16.7% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 245 | |
4 | 123 | |
6 | 101 | |
7 | 79 | 9.9% |
8 | 47 | 5.9% |
2 | 47 | 5.9% |
5 | 46 | 5.8% |
1 | 41 | 5.1% |
3 | 37 | 4.6% |
9 | 34 | 4.2% |
Uppercase Letter
Value | Count | Frequency (%) |
C | 100 | |
A | 100 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 200 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 1000 | |
Latin | 200 | 16.7% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 245 | |
- | 200 | |
4 | 123 | |
6 | 101 | |
7 | 79 | 7.9% |
8 | 47 | 4.7% |
2 | 47 | 4.7% |
5 | 46 | 4.6% |
1 | 41 | 4.1% |
3 | 37 | 3.7% |
Latin
Value | Count | Frequency (%) |
C | 100 | |
A | 100 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1200 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 245 | |
- | 200 | |
4 | 123 | |
6 | 101 | |
C | 100 | |
A | 100 | |
7 | 79 | 6.6% |
8 | 47 | 3.9% |
2 | 47 | 3.9% |
5 | 46 | 3.8% |
Other values (3) | 112 |
CAS등록번호
Text
UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
50-00-0 | 1 | 1.0% |
95-63-6 | 1 | 1.0% |
100-44-7 | 1 | 1.0% |
100-42-5 | 1 | 1.0% |
100-41-4 | 1 | 1.0% |
100-21-0 | 1 | 1.0% |
100-00-5 | 1 | 1.0% |
98-95-3 | 1 | 1.0% |
98-83-9 | 1 | 1.0% |
98-82-8 | 1 | 1.0% |
Other values (90) | 90 |
Most occurring characters
Value | Count | Frequency (%) |
- | 200 | |
0 | 85 | |
7 | 73 | 10.0% |
1 | 68 | 9.3% |
5 | 59 | 8.1% |
8 | 51 | 7.0% |
6 | 48 | 6.6% |
9 | 42 | 5.7% |
4 | 37 | 5.1% |
3 | 35 | 4.8% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 531 | |
Dash Punctuation | 200 | 27.4% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 85 | |
7 | 73 | |
1 | 68 | |
5 | 59 | |
8 | 51 | |
6 | 48 | |
9 | 42 | |
4 | 37 | |
3 | 35 | |
2 | 33 | 6.2% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 200 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 731 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
- | 200 | |
0 | 85 | |
7 | 73 | 10.0% |
1 | 68 | 9.3% |
5 | 59 | 8.1% |
8 | 51 | 7.0% |
6 | 48 | 6.6% |
9 | 42 | 5.7% |
4 | 37 | 5.1% |
3 | 35 | 4.8% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 731 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 200 | |
0 | 85 | |
7 | 73 | 10.0% |
1 | 68 | 9.3% |
5 | 59 | 8.1% |
8 | 51 | 7.0% |
6 | 48 | 6.6% |
9 | 42 | 5.7% |
4 | 37 | 5.1% |
3 | 35 | 4.8% |
비고
Categorical
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
<NA> | |
---|---|
※2 : CSCL 간주 물질 (substance which is regarded as CSCL) | 5 |
Length
Max length | 53 |
---|---|
Median length | 4 |
Mean length | 6.45 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | ※2 : CSCL 간주 물질 (substance which is regarded as CSCL) |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 95 | |
※2 : CSCL 간주 물질 (substance which is regarded as CSCL) | 5 | 5.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 95 | |
cscl | 10 | 6.7% |
※2 | 5 | 3.3% |
5 | 3.3% | |
간주 | 5 | 3.3% |
물질 | 5 | 3.3% |
substance | 5 | 3.3% |
which | 5 | 3.3% |
is | 5 | 3.3% |
regarded | 5 | 3.3% |
화학물질영문
Text
UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 55 |
---|---|
Median length | 33 |
Mean length | 19.26 |
Min length | 6 |
Characters and Unicode
Total characters | 1926 |
---|---|
Distinct characters | 57 |
Distinct categories | 8 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 100 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | Formaldehyde |
---|---|
2nd row | Cetrimonium bromide |
3rd row | Propane-1,2-diol |
4th row | Ethylenediaminetetraacetic acid |
5th row | Aniline |
Value | Count | Frequency (%) |
acid | 8 | 5.9% |
chloride | 4 | 3.0% |
methyl | 4 | 3.0% |
acetate | 3 | 2.2% |
ketone | 2 | 1.5% |
tetramethylammonium | 2 | 1.5% |
n,n,n-trimethylmethanaminium | 2 | 1.5% |
bromide | 2 | 1.5% |
ethyl | 2 | 1.5% |
phosphate | 2 | 1.5% |
Other values (103) | 104 |
Most occurring characters
Value | Count | Frequency (%) |
e | 219 | 11.4% |
o | 130 | 6.7% |
l | 125 | 6.5% |
n | 113 | 5.9% |
i | 109 | 5.7% |
t | 108 | 5.6% |
a | 103 | 5.3% |
- | 101 | 5.2% |
h | 99 | 5.1% |
y | 92 | 4.8% |
Other values (47) | 727 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 1511 | |
Uppercase Letter | 117 | 6.1% |
Dash Punctuation | 101 | 5.2% |
Decimal Number | 89 | 4.6% |
Other Punctuation | 39 | 2.0% |
Space Separator | 35 | 1.8% |
Close Punctuation | 17 | 0.9% |
Open Punctuation | 17 | 0.9% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 219 | |
o | 130 | 8.6% |
l | 125 | 8.3% |
n | 113 | 7.5% |
i | 109 | 7.2% |
t | 108 | 7.1% |
a | 103 | 6.8% |
h | 99 | 6.6% |
y | 92 | 6.1% |
r | 88 | 5.8% |
Other values (12) | 325 |
Uppercase Letter
Value | Count | Frequency (%) |
N | 22 | |
T | 16 | |
C | 12 | |
B | 12 | |
A | 9 | |
D | 9 | |
E | 9 | |
P | 7 | 6.0% |
M | 6 | 5.1% |
I | 3 | 2.6% |
Other values (7) | 12 |
Decimal Number
Value | Count | Frequency (%) |
2 | 34 | |
1 | 20 | |
4 | 11 | 12.4% |
3 | 9 | 10.1% |
5 | 7 | 7.9% |
6 | 4 | 4.5% |
7 | 2 | 2.2% |
0 | 1 | 1.1% |
8 | 1 | 1.1% |
Other Punctuation
Value | Count | Frequency (%) |
, | 33 | |
. | 5 | 12.8% |
' | 1 | 2.6% |
Close Punctuation
Value | Count | Frequency (%) |
) | 15 | |
] | 2 | 11.8% |
Open Punctuation
Value | Count | Frequency (%) |
( | 15 | |
[ | 2 | 11.8% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 101 |
Space Separator
Value | Count | Frequency (%) |
35 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 1628 | |
Common | 298 | 15.5% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 219 | |
o | 130 | 8.0% |
l | 125 | 7.7% |
n | 113 | 6.9% |
i | 109 | 6.7% |
t | 108 | 6.6% |
a | 103 | 6.3% |
h | 99 | 6.1% |
y | 92 | 5.7% |
r | 88 | 5.4% |
Other values (29) | 442 |
Common
Value | Count | Frequency (%) |
- | 101 | |
35 | 11.7% | |
2 | 34 | 11.4% |
, | 33 | 11.1% |
1 | 20 | 6.7% |
) | 15 | 5.0% |
( | 15 | 5.0% |
4 | 11 | 3.7% |
3 | 9 | 3.0% |
5 | 7 | 2.3% |
Other values (8) | 18 | 6.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1926 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
e | 219 | 11.4% |
o | 130 | 6.7% |
l | 125 | 6.5% |
n | 113 | 5.9% |
i | 109 | 5.7% |
t | 108 | 5.6% |
a | 103 | 5.3% |
- | 101 | 5.2% |
h | 99 | 5.1% |
y | 92 | 4.8% |
Other values (47) | 727 |
인용 출처
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
NITE-CHRIP |
---|
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | NITE-CHRIP |
---|---|
2nd row | NITE-CHRIP |
3rd row | NITE-CHRIP |
4th row | NITE-CHRIP |
5th row | NITE-CHRIP |
Common Values
Value | Count | Frequency (%) |
NITE-CHRIP | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
nite-chrip | 100 |
갱신일자
Date
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Minimum | 2020-05-27 00:00:00 |
---|---|
Maximum | 2020-05-27 00:00:00 |
CHRIP등록번호 | CAS등록번호 | 화학물질영문 | |
---|---|---|---|
CHRIP등록번호 | 1.000 | 1.000 | 1.000 |
CAS등록번호 | 1.000 | 1.000 | 1.000 |
화학물질영문 | 1.000 | 1.000 | 1.000 |
CHRIP등록번호 | CAS등록번호 | 비고 | 화학물질영문 | 인용 출처 | 갱신일자 | |
---|---|---|---|---|---|---|
0 | C004-685-91A | 50-00-0 | <NA> | Formaldehyde | NITE-CHRIP | 2020.05.27 |
1 | C005-480-87A | 57-09-0 | ※2 : CSCL 간주 물질 (substance which is regarded as CSCL) | Cetrimonium bromide | NITE-CHRIP | 2020.05.27 |
2 | C004-690-50A | 57-55-6 | <NA> | Propane-1,2-diol | NITE-CHRIP | 2020.05.27 |
3 | C004-741-06A | 60-00-4 | <NA> | Ethylenediaminetetraacetic acid | NITE-CHRIP | 2020.05.27 |
4 | C004-660-17A | 62-53-3 | <NA> | Aniline | NITE-CHRIP | 2020.05.27 |
5 | C004-721-99A | 62-56-6 | <NA> | Thiourea | NITE-CHRIP | 2020.05.27 |
6 | C004-704-99A | 64-18-6 | <NA> | Formic acid | NITE-CHRIP | 2020.05.27 |
7 | C006-326-60A | 64-20-0 | ※2 : CSCL 간주 물질 (substance which is regarded as CSCL) | N,N,N-Trimethylmethanaminium bromide | NITE-CHRIP | 2020.05.27 |
8 | C004-664-71A | 67-56-1 | <NA> | Methanol | NITE-CHRIP | 2020.05.27 |
9 | C004-711-07A | 67-63-0 | <NA> | Propan-2-ol | NITE-CHRIP | 2020.05.27 |
CHRIP등록번호 | CAS등록번호 | 비고 | 화학물질영문 | 인용 출처 | 갱신일자 | |
---|---|---|---|---|---|---|
90 | C004-660-06A | 107-05-1 | <NA> | 3-Chloroprop-1-ene | NITE-CHRIP | 2020.05.27 |
91 | C004-683-42A | 107-06-2 | <NA> | 1,2-Dichloroethane | NITE-CHRIP | 2020.05.27 |
92 | C004-668-24A | 107-13-1 | <NA> | Acrylonitrile | NITE-CHRIP | 2020.05.27 |
93 | C004-685-46A | 107-21-1 | <NA> | Ethylene glycol | NITE-CHRIP | 2020.05.27 |
94 | C004-762-15A | 107-22-2 | <NA> | Oxalaldehyde | NITE-CHRIP | 2020.05.27 |
95 | C005-489-85A | 107-46-0 | <NA> | Disiloxane, hexamethyl- | NITE-CHRIP | 2020.05.27 |
96 | C004-794-96A | 107-64-2 | ※2 : CSCL 간주 물질 (substance which is regarded as CSCL) | N,N-Dimethyl-N,N-dioctadecan-1-ylammonium chloride | NITE-CHRIP | 2020.05.27 |
97 | C004-692-98A | 108-05-4 | <NA> | Vinyl acetate | NITE-CHRIP | 2020.05.27 |
98 | C004-707-40A | 108-10-1 | <NA> | Methyl isobutyl ketone | NITE-CHRIP | 2020.05.27 |
99 | C004-679-52A | 108-24-7 | <NA> | Acetic anhydride | NITE-CHRIP | 2020.05.27 |