Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 100 |
Missing cells | 200 |
Missing cells (%) | 25.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 6.7 KiB |
Average record size in memory | 68.3 B |
Variable types
Numeric | 1 |
---|---|
Categorical | 5 |
Unsupported | 2 |
Dataset
Description | 샘플 데이터 |
---|---|
Author | 그린에코스 |
URL | https://www.bigdata-environment.kr/user/data_market/detail.do?id=504b5190-2e31-11ea-ab8d-f5a73bd88f42 |
인용출처 has constant value "" | Constant |
화학물질영문 is highly overall correlated with 연번 and 2 other fields | High correlation |
CAS등록번호 is highly overall correlated with 연번 and 2 other fields | High correlation |
화학물질국문 is highly overall correlated with 연번 and 2 other fields | High correlation |
연번 is highly overall correlated with CAS등록번호 and 2 other fields | High correlation |
비고 has 100 (100.0%) missing values | Missing |
출처 has 100 (100.0%) missing values | Missing |
연번 has unique values | Unique |
비고 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
출처 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
Analysis started | 2023-12-10 12:09:51.022055 |
---|---|
Analysis finished | 2023-12-10 12:09:52.019615 |
Duration | 1 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
연번
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 50.5 |
Minimum | 1 |
---|---|
Maximum | 100 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 5.95 |
Q1 | 25.75 |
median | 50.5 |
Q3 | 75.25 |
95-th percentile | 95.05 |
Maximum | 100 |
Range | 99 |
Interquartile range (IQR) | 49.5 |
Descriptive statistics
Standard deviation | 29.011492 |
---|---|
Coefficient of variation (CV) | 0.57448499 |
Kurtosis | -1.2 |
Mean | 50.5 |
Median Absolute Deviation (MAD) | 25 |
Skewness | 0 |
Sum | 5050 |
Variance | 841.66667 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | 1.0% |
65 | 1 | 1.0% |
75 | 1 | 1.0% |
74 | 1 | 1.0% |
73 | 1 | 1.0% |
72 | 1 | 1.0% |
71 | 1 | 1.0% |
70 | 1 | 1.0% |
69 | 1 | 1.0% |
68 | 1 | 1.0% |
Other values (90) | 90 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
100 | 1 | |
99 | 1 | |
98 | 1 | |
97 | 1 | |
96 | 1 | |
95 | 1 | |
94 | 1 | |
93 | 1 | |
92 | 1 | |
91 | 1 |
CAS등록번호
Categorical
HIGH CORRELATION
 
Distinct | 42 |
---|---|
Distinct (%) | 42.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
4109-96-0 | 4 |
---|---|
75-79-6 | 3 |
531-86-2 | 3 |
19750-95-9 | 3 |
297-99-4 | 3 |
Other values (37) |
Length
Max length | 11 |
---|---|
Median length | 10 |
Mean length | 8.71 |
Min length | 7 |
Unique
Unique | 5 ? |
---|---|
Unique (%) | 5.0% |
Sample
1st row | 531-86-2 |
---|---|
2nd row | 531-86-2 |
3rd row | 531-86-2 |
4th row | 60-41-3 |
5th row | 60-41-3 |
Common Values
Value | Count | Frequency (%) |
4109-96-0 | 4 | 4.0% |
75-79-6 | 3 | 3.0% |
531-86-2 | 3 | 3.0% |
19750-95-9 | 3 | 3.0% |
297-99-4 | 3 | 3.0% |
10294-34-5 | 3 | 3.0% |
7778-73-6 | 3 | 3.0% |
11113-75-0 | 3 | 3.0% |
115-21-9 | 3 | 3.0% |
75-94-5 | 3 | 3.0% |
Other values (32) | 69 |
Length
Value | Count | Frequency (%) |
4109-96-0 | 4 | 4.0% |
75-54-7 | 3 | 3.0% |
75-79-6 | 3 | 3.0% |
10025-78-2 | 3 | 3.0% |
7803-62-5 | 3 | 3.0% |
13463-40-6 | 3 | 3.0% |
116-14-3 | 3 | 3.0% |
78-79-5 | 3 | 3.0% |
7782-65-2 | 3 | 3.0% |
75-35-4 | 3 | 3.0% |
Other values (32) | 69 |
물질표지세계조화시스템
Categorical
Distinct | 8 |
---|---|
Distinct (%) | 8.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
GHS06 | |
---|---|
GHS09 | |
GHS08 | |
GHS02 | |
GHS05 | |
Other values (3) |
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 5 |
Min length | 5 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 1.0% |
Sample
1st row | GHS07 |
---|---|
2nd row | GHS08 |
3rd row | GHS09 |
4th row | GHS06 |
5th row | GHS09 |
Common Values
Value | Count | Frequency (%) |
GHS06 | 26 | |
GHS09 | 18 | |
GHS08 | 16 | |
GHS02 | 16 | |
GHS05 | 11 | |
GHS04 | 7 | 7.0% |
GHS07 | 5 | 5.0% |
GHS03 | 1 | 1.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
ghs06 | 26 | |
ghs09 | 18 | |
ghs08 | 16 | |
ghs02 | 16 | |
ghs05 | 11 | |
ghs04 | 7 | 7.0% |
ghs07 | 5 | 5.0% |
ghs03 | 1 | 1.0% |
비고
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 100 |
---|---|
Missing (%) | 100.0% |
Memory size | 1.0 KiB |
출처
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 100 |
---|---|
Missing (%) | 100.0% |
Memory size | 1.0 KiB |
인용출처
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
NCIS |
---|
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | NCIS |
---|---|
2nd row | NCIS |
3rd row | NCIS |
4th row | NCIS |
5th row | NCIS |
Common Values
Value | Count | Frequency (%) |
NCIS | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
ncis | 100 |
화학물질국문
Categorical
HIGH CORRELATION
 
Distinct | 39 |
---|---|
Distinct (%) | 39.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
<NA> | |
---|---|
1,1-다이클로로에틸렌 | 3 |
황산 벤지딘 | 3 |
테트라플루오르화 규소 | 3 |
클로르디메폼 염화수소산염 | 3 |
Other values (34) |
Length
Max length | 62 |
---|---|
Median length | 22.5 |
Mean length | 10.71 |
Min length | 3 |
Unique
Unique | 5 ? |
---|---|
Unique (%) | 5.0% |
Sample
1st row | 황산 벤지딘 |
---|---|
2nd row | 황산 벤지딘 |
3rd row | 황산 벤지딘 |
4th row | 스트리크닌 황산 |
5th row | 스트리크닌 황산 |
Common Values
Value | Count | Frequency (%) |
<NA> | 12 | 12.0% |
1,1-다이클로로에틸렌 | 3 | 3.0% |
황산 벤지딘 | 3 | 3.0% |
테트라플루오르화 규소 | 3 | 3.0% |
클로르디메폼 염화수소산염 | 3 | 3.0% |
저메인 | 3 | 3.0% |
트라이뷰틸(펜타클로로페녹시)스태낸) | 3 | 3.0% |
삼염화 붕소 | 3 | 3.0% |
이소프렌 | 3 | 3.0% |
칼륨 펜타클로로펜에이트 | 3 | 3.0% |
Other values (29) | 61 |
Length
Value | Count | Frequency (%) |
na | 12 | 7.4% |
스트리크닌 | 10 | 6.2% |
황산 | 5 | 3.1% |
펜타클로로펜에이트 | 3 | 1.9% |
1,1-다이클로로에틸렌 | 3 | 1.9% |
에테르 | 3 | 1.9% |
트랜스-포스파미돈 | 3 | 1.9% |
트라이클로로에틸실레인 | 3 | 1.9% |
트라이클로로비닐실레인 | 3 | 1.9% |
메틸다이클로로실레인 | 3 | 1.9% |
Other values (52) | 114 |
화학물질영문
Categorical
HIGH CORRELATION
 
Distinct | 42 |
---|---|
Distinct (%) | 42.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Dichlorosilane | 4 |
---|---|
Trichloromethylsilane | 3 |
Benzidine sulfate | 3 |
Chlordimeform hydrochloride | 3 |
trans-Phosphamidon | 3 |
Other values (37) |
Length
Max length | 83 |
---|---|
Median length | 36 |
Mean length | 22 |
Min length | 6 |
Unique
Unique | 5 ? |
---|---|
Unique (%) | 5.0% |
Sample
1st row | Benzidine sulfate |
---|---|
2nd row | Benzidine sulfate |
3rd row | Benzidine sulfate |
4th row | Strychnine sulfate |
5th row | Strychnine sulfate |
Common Values
Value | Count | Frequency (%) |
Dichlorosilane | 4 | 4.0% |
Trichloromethylsilane | 3 | 3.0% |
Benzidine sulfate | 3 | 3.0% |
Chlordimeform hydrochloride | 3 | 3.0% |
trans-Phosphamidon | 3 | 3.0% |
Boron trichloride | 3 | 3.0% |
Potassium pentachlorophenate | 3 | 3.0% |
Nickel sulfide | 3 | 3.0% |
Silane, trichloroethyl- | 3 | 3.0% |
Trichloroethenylsilane | 3 | 3.0% |
Other values (32) | 69 |
Length
Value | Count | Frequency (%) |
strychnine | 14 | 8.1% |
silane | 9 | 5.2% |
sulfate | 7 | 4.0% |
hydrochloride | 5 | 2.9% |
silicon | 5 | 2.9% |
dichlorosilane | 4 | 2.3% |
phosphate | 4 | 2.3% |
tetrachloride | 4 | 2.3% |
strychnidin-10-one | 4 | 2.3% |
potassium | 3 | 1.7% |
Other values (50) | 114 |
연번 | CAS등록번호 | 물질표지세계조화시스템 | 화학물질국문 | 화학물질영문 | |
---|---|---|---|---|---|
연번 | 1.000 | 0.992 | 0.419 | 0.992 | 0.992 |
CAS등록번호 | 0.992 | 1.000 | 0.000 | 1.000 | 1.000 |
물질표지세계조화시스템 | 0.419 | 0.000 | 1.000 | 0.000 | 0.000 |
화학물질국문 | 0.992 | 1.000 | 0.000 | 1.000 | 1.000 |
화학물질영문 | 0.992 | 1.000 | 0.000 | 1.000 | 1.000 |
물질표지세계조화시스템 | 화학물질영문 | CAS등록번호 | 화학물질국문 | |
---|---|---|---|---|
물질표지세계조화시스템 | 1.000 | 0.000 | 0.000 | 0.000 |
화학물질영문 | 0.000 | 1.000 | 1.000 | 1.000 |
CAS등록번호 | 0.000 | 1.000 | 1.000 | 1.000 |
화학물질국문 | 0.000 | 1.000 | 1.000 | 1.000 |
연번 | CAS등록번호 | 물질표지세계조화시스템 | 화학물질국문 | 화학물질영문 | |
---|---|---|---|---|---|
연번 | 1.000 | 0.743 | 0.210 | 0.748 | 0.743 |
CAS등록번호 | 0.743 | 1.000 | 0.000 | 1.000 | 1.000 |
물질표지세계조화시스템 | 0.210 | 0.000 | 1.000 | 0.000 | 0.000 |
화학물질국문 | 0.748 | 1.000 | 0.000 | 1.000 | 1.000 |
화학물질영문 | 0.743 | 1.000 | 0.000 | 1.000 | 1.000 |
연번 | CAS등록번호 | 물질표지세계조화시스템 | 비고 | 출처 | 인용출처 | 화학물질국문 | 화학물질영문 | |
---|---|---|---|---|---|---|---|---|
0 | 1 | 531-86-2 | GHS07 | <NA> | <NA> | NCIS | 황산 벤지딘 | Benzidine sulfate |
1 | 2 | 531-86-2 | GHS08 | <NA> | <NA> | NCIS | 황산 벤지딘 | Benzidine sulfate |
2 | 3 | 531-86-2 | GHS09 | <NA> | <NA> | NCIS | 황산 벤지딘 | Benzidine sulfate |
3 | 4 | 60-41-3 | GHS06 | <NA> | <NA> | NCIS | 스트리크닌 황산 | Strychnine sulfate |
4 | 5 | 60-41-3 | GHS09 | <NA> | <NA> | NCIS | 스트리크닌 황산 | Strychnine sulfate |
5 | 6 | 60491-10-3 | GHS06 | <NA> | <NA> | NCIS | 스트리시닌-10-온, 황산염(2:1), 5수화물 | Strychnidin-10-one, sulfate (2:1), pentahydrate |
6 | 7 | 60491-10-3 | GHS09 | <NA> | <NA> | NCIS | 스트리시닌-10-온, 황산염(2:1), 5수화물 | Strychnidin-10-one, sulfate (2:1), pentahydrate |
7 | 8 | 10476-87-6 | GHS06 | <NA> | <NA> | NCIS | 스트리크닌 다이메틸아르신산 | Strychnine dimethylarsinate |
8 | 9 | 10476-87-6 | GHS09 | <NA> | <NA> | NCIS | 스트리크닌 다이메틸아르신산 | Strychnine dimethylarsinate |
9 | 10 | 10476-82-1 | GHS06 | <NA> | <NA> | NCIS | 스트리시닌 비산염 | Strychnine arsenate |
연번 | CAS등록번호 | 물질표지세계조화시스템 | 비고 | 출처 | 인용출처 | 화학물질국문 | 화학물질영문 | |
---|---|---|---|---|---|---|---|---|
90 | 91 | 10026-04-7 | GHS05 | <NA> | <NA> | NCIS | 실리콘 테트라염화물 | Silicon tetrachloride |
91 | 92 | 10026-04-7 | GHS06 | <NA> | <NA> | NCIS | 실리콘 테트라염화물 | Silicon tetrachloride |
92 | 93 | 7783-61-1 | GHS04 | <NA> | <NA> | NCIS | 테트라플루오르화 규소 | Silicon tetrafluoride |
93 | 94 | 7783-61-1 | GHS05 | <NA> | <NA> | NCIS | 테트라플루오르화 규소 | Silicon tetrafluoride |
94 | 95 | 7783-61-1 | GHS06 | <NA> | <NA> | NCIS | 테트라플루오르화 규소 | Silicon tetrafluoride |
95 | 96 | 11113-75-0 | GHS06 | <NA> | <NA> | NCIS | <NA> | Nickel sulfide |
96 | 97 | 11113-75-0 | GHS08 | <NA> | <NA> | NCIS | <NA> | Nickel sulfide |
97 | 98 | 11113-75-0 | GHS09 | <NA> | <NA> | NCIS | <NA> | Nickel sulfide |
98 | 99 | 1314-06-3 | GHS07 | <NA> | <NA> | NCIS | <NA> | Dinickel trioxide |
99 | 100 | 1314-06-3 | GHS08 | <NA> | <NA> | NCIS | <NA> | Dinickel trioxide |