Overview

Dataset statistics

Number of variables: 14
Number of observations: 100
Missing cells: 97
Missing cells (%): 6.9%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 11.1 KiB
Average record size in memory: 113.3 B
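
The missing-cell percentage above can be checked by hand: 97 missing cells out of 14 x 100 cells. A minimal sketch of that arithmetic follows; the function name `missing_cell_percentage` is a hypothetical helper, not part of the profiling tool.

```python
def missing_cell_percentage(missing_cells: int, n_rows: int, n_cols: int) -> float:
    """Return missing cells as a percentage of all cells in the table."""
    total_cells = n_rows * n_cols
    if total_cells == 0:
        raise ValueError("table has no cells")
    return 100.0 * missing_cells / total_cells


# Figures taken from the dataset statistics: 97 missing, 100 rows, 14 columns.
pct = missing_cell_percentage(97, n_rows=100, n_cols=14)
print(f"{pct:.1f}%")  # prints 6.9%
```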

Variable types

Text: 6
Categorical: 8

Alerts

인용출처 has constant value "KISChem" [Constant]
생물종 is highly overall correlated with 생물종상세유형 [High correlation]
생물종상세유형 is highly overall correlated with 생물종 and 1 other field [High correlation]
단위 is highly overall correlated with 출처 [High correlation]
출처 is highly overall correlated with 생물종상세유형 and 1 other field [High correlation]
생물종 is highly imbalanced (54.0%) [Imbalance]
계통 has 89 (89.0%) missing values [Missing]
조건 has 8 (8.0%) missing values [Missing]
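
The 54.0% imbalance figure for 생물종 is consistent with a score of one minus the normalized Shannon entropy of the category counts. The sketch below assumes that definition (the report itself does not state the formula) and uses the eight category counts listed for 생물종 under Common Values; `imbalance_score` is a hypothetical name.

```python
from math import log2


def imbalance_score(counts):
    """1 - H/log2(k): 0 for a uniform distribution, 1 for a single dominant class.

    Assumed definition; it reproduces the reported figure but is not
    confirmed by the report.
    """
    k = len(counts)
    if k < 2:
        return 0.0  # a single category has no distribution to compare against
    total = sum(counts)
    entropy = -sum((c / total) * log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / log2(k)


# Category counts of 생물종 from its Common Values table (8 distinct values, n = 100).
species_counts = [74, 14, 3, 3, 2, 2, 1, 1]
print(f"{imbalance_score(species_counts):.1%}")  # prints 54.0%
```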

Reproduction

Analysis started: 2023-12-10 13:15:43.113300
Analysis finished: 2023-12-10 13:15:45.751837
Duration: 2.64 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
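
The reported duration follows directly from the two timestamps. A small sketch using only the standard library; the format string is an assumption about how the timestamps are laid out, based on the values shown.

```python
from datetime import datetime

# Assumed timestamp layout, inferred from the two values in the report.
TIMESTAMP_FORMAT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-12-10 13:15:43.113300", TIMESTAMP_FORMAT)
finished = datetime.strptime("2023-12-10 13:15:45.751837", TIMESTAMP_FORMAT)

duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")  # prints 2.64 seconds
```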

Variables

CAS등록번호
Text

Distinct: 57
Distinct (%): 57.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 10
Median length: 9
Mean length: 8.06
Min length: 7

Characters and Unicode

Total characters: 806
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
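
As an illustration of how character properties can be tallied, the sketch below counts Unicode general categories with Python's standard `unicodedata` module over the five sample CAS numbers shown in this section. The helper name `category_counts` and the short-code-to-name mapping are illustrative choices, not the profiler's own implementation.

```python
import unicodedata
from collections import Counter

# Readable names for the two general categories that occur in CAS numbers.
CATEGORY_NAMES = {"Nd": "Decimal Number", "Pd": "Dash Punctuation"}


def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)  # e.g. "Nd" for digits, "Pd" for "-"
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


sample = ["108-90-7", "108-80-5", "108-10-1", "19044-88-3", "108-24-7"]
counts = category_counts(sample)
print(counts["Decimal Number"], counts["Dash Punctuation"])  # prints 32 10
```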

Unique

Unique: 34
Unique (%): 34.0%

Sample

1st row: 108-90-7
2nd row: 108-80-5
3rd row: 108-10-1
4th row: 19044-88-3
5th row: 108-24-7

Value | Count | Frequency (%)
100-00-5 | 5 | 5.0%
1071-83-6 | 5 | 5.0%
999-81-5 | 5 | 5.0%
87-62-7 | 4 | 4.0%
123-77-3 | 4 | 4.0%
78-78-4 | 4 | 4.0%
18691-97-9 | 4 | 4.0%
103-23-1 | 3 | 3.0%
7440-62-2 | 3 | 3.0%
4685-14-7 | 3 | 3.0%
Other values (47) | 60 | 60.0%

Most occurring characters

ValueCountFrequency (%)
- 200
24.8%
1 101
12.5%
0 79
 
9.8%
7 77
 
9.6%
8 70
 
8.7%
9 58
 
7.2%
3 48
 
6.0%
2 47
 
5.8%
6 46
 
5.7%
4 43
 
5.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 606
75.2%
Dash Punctuation 200
 
24.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 101
16.7%
0 79
13.0%
7 77
12.7%
8 70
11.6%
9 58
9.6%
3 48
7.9%
2 47
7.8%
6 46
7.6%
4 43
7.1%
5 37
 
6.1%
Dash Punctuation
ValueCountFrequency (%)
- 200
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 806
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 200
24.8%
1 101
12.5%
0 79
 
9.8%
7 77
 
9.6%
8 70
 
8.7%
9 58
 
7.2%
3 48
 
6.0%
2 47
 
5.8%
6 46
 
5.7%
4 43
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 806
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 200
24.8%
1 101
12.5%
0 79
 
9.8%
7 77
 
9.6%
8 70
 
8.7%
9 58
 
7.2%
3 48
 
6.0%
2 47
 
5.8%
6 46
 
5.7%
4 43
 
5.3%

항목
Categorical

Distinct: 7
Distinct (%): 7.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Value | Count
NOAEL | 73
NOEL | 12
NOAEC | 6
LOAEL | 3
LOAEC | 3
Other values (2) | 3

Length

Max length: 5
Median length: 5
Mean length: 4.85
Min length: 4

Unique

Unique: 1
Unique (%): 1.0%

Sample

1st row: NOAEL
2nd row: NOAEL
3rd row: NOAEL
4th row: NOEL
5th row: NOEL

Common Values

Value | Count | Frequency (%)
NOAEL | 73 | 73.0%
NOEL | 12 | 12.0%
NOAEC | 6 | 6.0%
LOAEL | 3 | 3.0%
LOAEC | 3 | 3.0%
<NA> | 2 | 2.0%
LOEL | 1 | 1.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
noael | 73 | 73.0%
noel | 12 | 12.0%
noaec | 6 | 6.0%
loael | 3 | 3.0%
loaec | 3 | 3.0%
na | 2 | 2.0%
loel | 1 | 1.0%

생물종
Categorical

Alerts: High correlation, Imbalance

Distinct: 8
Distinct (%): 8.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Value | Count
랫드 | 74
마우스 | 14
 | 3
래빗 | 3
<NA> | 2
Other values (3) | 4

Length

Max length: 7
Median length: 2
Mean length: 2.28
Min length: 1

Unique

Unique: 2
Unique (%): 2.0%

Sample

1st row:
2nd row: 랫드
3rd row: 랫드
4th row:
5th row: <NA>

Common Values

Value | Count | Frequency (%)
랫드 | 74 | 74.0%
마우스 | 14 | 14.0%
 | 3 | 3.0%
래빗 | 3 | 3.0%
<NA> | 2 | 2.0%
랫드, 마우스 | 2 | 2.0%
기니아피그 | 1 | 1.0%
사람 | 1 | 1.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
랫드 | 76 | 74.5%
마우스 | 16 | 15.7%
 | 3 | 2.9%
래빗 | 3 | 2.9%
na | 2 | 2.0%
기니아피그 | 1 | 1.0%
사람 | 1 | 1.0%

생물종상세유형
Categorical

Alerts: High correlation

Distinct: 15
Distinct (%): 15.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Value | Count
<NA> | 59
Sprague-Dawley | 8
Wistar | 7
F344 | 6
B6C3F1 | 6
Other values (10) | 14

Length

Max length: 17
Median length: 4
Mean length: 5.44
Min length: 2

Unique

Unique: 8
Unique (%): 8.0%

Sample

1st row: Beagle
2nd row: <NA>
3rd row: Sprague-Dawley
4th row: Beagle
5th row: <NA>

Common Values

Value | Count | Frequency (%)
<NA> | 59 | 59.0%
Sprague-Dawley | 8 | 8.0%
Wistar | 7 | 7.0%
F344 | 6 | 6.0%
B6C3F1 | 6 | 6.0%
Beagle | 3 | 3.0%
CD-1 | 3 | 3.0%
New Zealand white | 1 | 1.0%
CD | 1 | 1.0%
Nelsom albino | 1 | 1.0%
Other values (5) | 5 | 5.0%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
na | 59 | 57.3%
sprague-dawley | 8 | 7.8%
wistar | 7 | 6.8%
f344 | 6 | 5.8%
b6c3f1 | 6 | 5.8%
beagle | 3 | 2.9%
cd-1 | 3 | 2.9%
albino | 2 | 1.9%
f344/n | 1 | 1.0%
수컷 | 1 | 1.0%
Other values (7) | 7 | 6.8%

성별내용
Categorical

Distinct: 4
Distinct (%): 4.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Value | Count
<NA> | 62
암컷, 수컷 | 25
암컷 | 7
수컷 | 6

Length

Max length: 6
Median length: 4
Mean length: 4.24
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: <NA>
2nd row: 암컷, 수컷
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
<NA> | 62 | 62.0%
암컷, 수컷 | 25 | 25.0%
암컷 | 7 | 7.0%
수컷 | 6 | 6.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
na | 62 | 49.6%
암컷 | 32 | 25.6%
수컷 | 31 | 24.8%

계통
Text

Alerts: Missing

Distinct: 7
Distinct (%): 63.6%
Missing: 89
Missing (%): 89.0%
Memory size: 932.0 B

Length

Max length: 5
Median length: 4
Mean length: 2.9090909
Min length: 1

Characters and Unicode

Total characters: 32
Distinct characters: 19
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 5
Unique (%): 45.5%

Sample

1st row: 생식/발달
2nd row: 간,신장
3rd row: 간,신장
4th row: 모계/태아
5th row: 전신

Value | Count | Frequency (%)
전신 | 4 | 36.4%
간,신장 | 2 | 18.2%
생식/발달 | 1 | 9.1%
모계/태아 | 1 | 9.1%
기관지 | 1 | 9.1%
국소 | 1 | 9.1%
 | 1 | 9.1%

Most occurring characters

ValueCountFrequency (%)
6
18.8%
4
12.5%
3
 
9.4%
, 2
 
6.2%
2
 
6.2%
/ 2
 
6.2%
1
 
3.1%
1
 
3.1%
1
 
3.1%
1
 
3.1%
Other values (9) 9
28.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 28
87.5%
Other Punctuation 4
 
12.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6
21.4%
4
14.3%
3
10.7%
2
 
7.1%
1
 
3.6%
1
 
3.6%
1
 
3.6%
1
 
3.6%
1
 
3.6%
1
 
3.6%
Other values (7) 7
25.0%
Other Punctuation
ValueCountFrequency (%)
, 2
50.0%
/ 2
50.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 28
87.5%
Common 4
 
12.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6
21.4%
4
14.3%
3
10.7%
2
 
7.1%
1
 
3.6%
1
 
3.6%
1
 
3.6%
1
 
3.6%
1
 
3.6%
1
 
3.6%
Other values (7) 7
25.0%
Common
ValueCountFrequency (%)
, 2
50.0%
/ 2
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 28
87.5%
ASCII 4
 
12.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
6
21.4%
4
14.3%
3
10.7%
2
 
7.1%
1
 
3.6%
1
 
3.6%
1
 
3.6%
1
 
3.6%
1
 
3.6%
1
 
3.6%
Other values (7) 7
25.0%
ASCII
ValueCountFrequency (%)
, 2
50.0%
/ 2
50.0%

노출경로
Categorical

Distinct: 7
Distinct (%): 7.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Value | Count
경구 | 57
흡입 | 28
경피 | 6
- | 6
<NA> | 1
Other values (2) | 2

Length

Max length: 8
Median length: 2
Mean length: 2.07
Min length: 1

Unique

Unique: 3
Unique (%): 3.0%

Sample

1st row: 흡입
2nd row: 경구
3rd row: 경구
4th row: 경구
5th row: 경피

Common Values

Value | Count | Frequency (%)
경구 | 57 | 57.0%
흡입 | 28 | 28.0%
경피 | 6 | 6.0%
- | 6 | 6.0%
<NA> | 1 | 1.0%
경구(음용수) | 1 | 1.0%
경구(식이섭취) | 1 | 1.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
경구 | 57 | 57.0%
흡입 | 28 | 28.0%
경피 | 6 | 6.0%
 | 6 | 6.0%
na | 1 | 1.0%
경구(음용수 | 1 | 1.0%
경구(식이섭취 | 1 | 1.0%

독성
Text

Distinct: 73
Distinct (%): 73.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 10
Median length: 7
Mean length: 3.24
Min length: 1

Characters and Unicode

Total characters: 324
Distinct characters: 17
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 56
Unique (%): 56.0%

Sample

1st row: 2.06
2nd row: 150
3rd row: 50
4th row: 5
5th row: 1

Value | Count | Frequency (%)
1000 | 5 | 5.0%
50 | 5 | 5.0%
500 | 4 | 4.0%
150 | 4 | 4.0%
5 | 3 | 3.0%
3 | 3 | 3.0%
100 | 3 | 3.0%
50000 | 2 | 2.0%
10 | 2 | 2.0%
0.1 | 2 | 2.0%
Other values (59) | 67 | 67.0%

Most occurring characters

ValueCountFrequency (%)
0 114
35.2%
1 50
15.4%
5 42
 
13.0%
2 29
 
9.0%
. 19
 
5.9%
4 17
 
5.2%
6 13
 
4.0%
3 13
 
4.0%
8 7
 
2.2%
9 5
 
1.5%
Other values (7) 15
 
4.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 292
90.1%
Other Punctuation 20
 
6.2%
Math Symbol 10
 
3.1%
Other Letter 1
 
0.3%
Dash Punctuation 1
 
0.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 114
39.0%
1 50
17.1%
5 42
 
14.4%
2 29
 
9.9%
4 17
 
5.8%
6 13
 
4.5%
3 13
 
4.5%
8 7
 
2.4%
9 5
 
1.7%
7 2
 
0.7%
Math Symbol
ValueCountFrequency (%)
< 4
40.0%
> 4
40.0%
~ 2
20.0%
Other Punctuation
ValueCountFrequency (%)
. 19
95.0%
, 1
 
5.0%
Other Letter
ValueCountFrequency (%)
1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 323
99.7%
Hangul 1
 
0.3%

Most frequent character per script

Common
ValueCountFrequency (%)
0 114
35.3%
1 50
15.5%
5 42
 
13.0%
2 29
 
9.0%
. 19
 
5.9%
4 17
 
5.3%
6 13
 
4.0%
3 13
 
4.0%
8 7
 
2.2%
9 5
 
1.5%
Other values (6) 14
 
4.3%
Hangul
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 323
99.7%
Hangul 1
 
0.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 114
35.3%
1 50
15.5%
5 42
 
13.0%
2 29
 
9.0%
. 19
 
5.9%
4 17
 
5.3%
6 13
 
4.0%
3 13
 
4.0%
8 7
 
2.2%
9 5
 
1.5%
Other values (6) 14
 
4.3%
Hangul
ValueCountFrequency (%)
1
100.0%
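
The character breakdown above shows that 독성 is stored as text because some values carry qualifiers (`<`, `>`, `~`), a thousands separator, or a Hangul prefix such as 약 ("approximately", as in the sample value 약1,000). A minimal sketch of splitting such strings into a qualifier and a number follows; the function name `parse_dose` and the accepted qualifier set are assumptions drawn from the characters listed in this report, and anything else (for example a range written with a dash) is returned as unparsed.

```python
import re

# Optional qualifier, then digits with optional thousands commas and decimals.
_DOSE_PATTERN = re.compile(r"^\s*(약|[<>~])?\s*(\d[\d,]*(?:\.\d+)?)\s*$")


def parse_dose(text):
    """Return (qualifier, value) for strings like '<3' or '약1,000', else None."""
    match = _DOSE_PATTERN.match(text)
    if match is None:
        return None  # leave unusual values for manual review
    qualifier = match.group(1) or ""
    value = float(match.group(2).replace(",", ""))
    return qualifier, value


for raw in ["2.06", "<3", "약1,000", "5-10"]:
    print(raw, "->", parse_dose(raw))
```

Keeping the qualifier alongside the number avoids silently treating "<3" as exactly 3 when the values are later compared.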

단위
Categorical

Alerts: High correlation

Distinct: 10
Distinct (%): 10.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Value | Count
ppm | 26
mg/kg | 18
mg/kg/day | 15
mg/kg bw | 12
mg/L | 10
Other values (5) | 19

Length

Max length: 12
Median length: 9
Mean length: 5.92
Min length: 1

Unique

Unique: 3
Unique (%): 3.0%

Sample

1st row: mg/L
2nd row: mg/kg/day
3rd row: mg/kg
4th row: mg/kg/day
5th row: ppm

Common Values

Value | Count | Frequency (%)
ppm | 26 | 26.0%
mg/kg | 18 | 18.0%
mg/kg/day | 15 | 15.0%
mg/kg bw | 12 | 12.0%
mg/L | 10 | 10.0%
mg/kg bw/day | 9 | 9.0%
mg/㎥ | 7 | 7.0%
mol/L | 1 | 1.0%
mg/L bw/day | 1 | 1.0%
% | 1 | 1.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
mg/kg | 39 | 32.0%
ppm | 26 | 21.3%
mg/kg/day | 15 | 12.3%
bw | 12 | 9.8%
mg/l | 11 | 9.0%
bw/day | 10 | 8.2%
mg/㎥ | 7 | 5.7%
mol/l | 1 | 0.8%
 | 1 | 0.8%
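
The plot table differs from Common Values because it counts lowercase whitespace-separated tokens rather than whole values, so "mg/kg bw" contributes to both "mg/kg" and "bw". The sketch below reproduces those token counts from the Common Values figures for 단위; it assumes simple lowercasing and whitespace splitting, and does not model the punctuation stripping that appears to blank out the "%" token in the report.

```python
from collections import Counter

# Whole-value counts for 단위, copied from its Common Values table.
unit_counts = {
    "ppm": 26, "mg/kg": 18, "mg/kg/day": 15, "mg/kg bw": 12, "mg/L": 10,
    "mg/kg bw/day": 9, "mg/㎥": 7, "mol/L": 1, "mg/L bw/day": 1, "%": 1,
}


def token_frequencies(value_counts):
    """Split each value on whitespace, lowercase it, and sum counts per token."""
    tokens = Counter()
    for value, count in value_counts.items():
        for token in value.lower().split():
            tokens[token] += count
    return tokens


tokens = token_frequencies(unit_counts)
print(tokens["mg/kg"], tokens["bw"], tokens["mg/l"], tokens["bw/day"])  # prints 39 12 11 10
```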

조건
Text

Alerts: Missing

Distinct: 79
Distinct (%): 85.9%
Missing: 8
Missing (%): 8.0%
Memory size: 932.0 B

Length

Max length: 100
Median length: 59.5
Mean length: 36.956522
Min length: 2

Characters and Unicode

Total characters: 3400
Distinct characters: 183
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 67
Unique (%): 72.8%

Sample

1st row: 0, 0.79, 1.59, 2.06 mg/L의 용량으로 6개월간 흡입 노출, (GLP : yes)
2nd row: 수컷에 45일동안 10, 40, 150, 600 mg/kg/day의 용량으로, 암컷에 교배 전 14일부터 수유 3일 까지 경구 섭취
3rd row: 13주간 경구투여(0, 50, 250, 1000 mg/kg)
4th row: 경구 1년 독성
5th row: 국소 부위 접촉으로 자극이 나타남

Value | Count | Frequency (%)
mg/kg | 22 | 3.0%
노출 | 20 | 2.7%
용량 | 17 | 2.3%
독성 | 17 | 2.3%
ppm | 16 | 2.2%
0 | 15 | 2.1%
90일 | 15 | 2.1%
연구 | 15 | 2.1%
노출기간 | 14 | 1.9%
매일 | 12 | 1.6%
Other values (268) | 565 | 77.6%

Most occurring characters

ValueCountFrequency (%)
636
 
18.7%
0 303
 
8.9%
, 186
 
5.5%
1 122
 
3.6%
5 106
 
3.1%
85
 
2.5%
/ 74
 
2.2%
g 72
 
2.1%
m 63
 
1.9%
: 63
 
1.9%
Other values (173) 1690
49.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1016
29.9%
Decimal Number 809
23.8%
Space Separator 636
18.7%
Lowercase Letter 384
 
11.3%
Other Punctuation 351
 
10.3%
Uppercase Letter 93
 
2.7%
Open Punctuation 53
 
1.6%
Close Punctuation 50
 
1.5%
Dash Punctuation 6
 
0.2%
Math Symbol 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
85
 
8.4%
44
 
4.3%
39
 
3.8%
38
 
3.7%
38
 
3.7%
37
 
3.6%
31
 
3.1%
27
 
2.7%
27
 
2.7%
25
 
2.5%
Other values (120) 625
61.5%
Lowercase Letter
ValueCountFrequency (%)
g 72
18.8%
m 63
16.4%
p 38
9.9%
k 30
7.8%
e 25
 
6.5%
i 21
 
5.5%
d 21
 
5.5%
y 17
 
4.4%
b 14
 
3.6%
w 14
 
3.6%
Other values (11) 69
18.0%
Decimal Number
ValueCountFrequency (%)
0 303
37.5%
1 122
15.1%
5 106
 
13.1%
2 62
 
7.7%
3 53
 
6.6%
4 52
 
6.4%
6 36
 
4.4%
9 33
 
4.1%
7 21
 
2.6%
8 21
 
2.6%
Uppercase Letter
ValueCountFrequency (%)
G 16
17.2%
E 15
16.1%
L 14
15.1%
C 14
15.1%
D 13
14.0%
O 11
11.8%
P 6
 
6.5%
T 3
 
3.2%
S 1
 
1.1%
Other Punctuation
ValueCountFrequency (%)
, 186
53.0%
/ 74
 
21.1%
: 63
 
17.9%
. 21
 
6.0%
; 3
 
0.9%
" 2
 
0.6%
% 1
 
0.3%
? 1
 
0.3%
Space Separator
ValueCountFrequency (%)
636
100.0%
Open Punctuation
ValueCountFrequency (%)
( 53
100.0%
Close Punctuation
ValueCountFrequency (%)
) 50
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%
Math Symbol
ValueCountFrequency (%)
= 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1907
56.1%
Hangul 1016
29.9%
Latin 477
 
14.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
85
 
8.4%
44
 
4.3%
39
 
3.8%
38
 
3.7%
38
 
3.7%
37
 
3.6%
31
 
3.1%
27
 
2.7%
27
 
2.7%
25
 
2.5%
Other values (120) 625
61.5%
Latin
ValueCountFrequency (%)
g 72
15.1%
m 63
 
13.2%
p 38
 
8.0%
k 30
 
6.3%
e 25
 
5.2%
i 21
 
4.4%
d 21
 
4.4%
y 17
 
3.6%
G 16
 
3.4%
E 15
 
3.1%
Other values (20) 159
33.3%
Common
ValueCountFrequency (%)
636
33.4%
0 303
15.9%
, 186
 
9.8%
1 122
 
6.4%
5 106
 
5.6%
/ 74
 
3.9%
: 63
 
3.3%
2 62
 
3.3%
( 53
 
2.8%
3 53
 
2.8%
Other values (13) 249
 
13.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2384
70.1%
Hangul 1016
29.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
636
26.7%
0 303
12.7%
, 186
 
7.8%
1 122
 
5.1%
5 106
 
4.4%
/ 74
 
3.1%
g 72
 
3.0%
m 63
 
2.6%
: 63
 
2.6%
2 62
 
2.6%
Other values (43) 697
29.2%
Hangul
ValueCountFrequency (%)
85
 
8.4%
44
 
4.3%
39
 
3.8%
38
 
3.7%
38
 
3.7%
37
 
3.6%
31
 
3.1%
27
 
2.7%
27
 
2.7%
25
 
2.5%
Other values (120) 625
61.5%

출처
Categorical

Alerts: High correlation

Distinct: 7
Distinct (%): 7.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Value | Count
<NA> | 47
ECB IUCLID | 33
OECD SIDS | 14
IPCS EHC | 2
ATSDR | 2
Other values (2) | 2

Length

Max length: 10
Median length: 9
Mean length: 6.86
Min length: 4

Unique

Unique: 2
Unique (%): 2.0%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
<NA> | 47 | 47.0%
ECB IUCLID | 33 | 33.0%
OECD SIDS | 14 | 14.0%
IPCS EHC | 2 | 2.0%
ATSDR | 2 | 2.0%
ECB IULCID | 1 | 1.0%
EU RAR | 1 | 1.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
na | 47 | 31.1%
ecb | 34 | 22.5%
iuclid | 33 | 21.9%
oecd | 14 | 9.3%
sids | 14 | 9.3%
ipcs | 2 | 1.3%
ehc | 2 | 1.3%
atsdr | 2 | 1.3%
iulcid | 1 | 0.7%
eu | 1 | 0.7%
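
The single "ECB IULCID" entry looks like a transposition of "ECB IUCLID", which inflates the distinct count for 출처. One way to surface such near-duplicates is fuzzy matching against a list of accepted labels. The sketch below uses the standard-library `difflib`; the canonical list, the cutoff of 0.85, and the helper name `suggest_canonical` are all illustrative assumptions, and any suggestion should be reviewed before the data is changed.

```python
from difflib import get_close_matches


def suggest_canonical(label, canonical_labels, cutoff=0.85):
    """Return the closest accepted label, or None if nothing is similar enough."""
    matches = get_close_matches(label, canonical_labels, n=1, cutoff=cutoff)
    return matches[0] if matches else None


# Assumed list of accepted source names, taken from the frequent values above.
canonical_sources = ["ECB IUCLID", "OECD SIDS", "IPCS EHC", "ATSDR", "EU RAR"]

print(suggest_canonical("ECB IULCID", canonical_sources))  # prints ECB IUCLID
```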

인용출처
Categorical

Alerts: Constant

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Value | Count
KISChem | 100

Length

Max length: 7
Median length: 7
Mean length: 7
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: KISChem
2nd row: KISChem
3rd row: KISChem
4th row: KISChem
5th row: KISChem

Common Values

Value | Count | Frequency (%)
KISChem | 100 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
kischem | 100 | 100.0%

화학물질국문
Text

Distinct: 57
Distinct (%): 57.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 24
Median length: 17
Mean length: 9.79
Min length: 3

Characters and Unicode

Total characters: 979
Distinct characters: 125
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 34
Unique (%): 34.0%

Sample

1st row: 클로로벤젠
2nd row: 아이소시아누르산
3rd row: 메틸 아이소부틸 케톤
4th row: 오라이잘린
5th row: 무수아세트산

Value | Count | Frequency (%)
1-클로로-4-나이트로벤젠 | 5 | 3.8%
클로라이드 | 5 | 3.8%
글리포스페이트 | 5 | 3.8%
클로르메쿼트 | 5 | 3.8%
2,6-다이메틸아닐린 | 4 | 3.0%
아조다이카본아미드 | 4 | 3.0%
아이소펜테인 | 4 | 3.0%
메타벤즈티아주론 | 4 | 3.0%
파라콰트 | 3 | 2.3%
다이메틸 | 3 | 2.3%
Other values (64) | 90 | 68.2%

Most occurring characters

ValueCountFrequency (%)
71
 
7.3%
- 71
 
7.3%
59
 
6.0%
56
 
5.7%
37
 
3.8%
32
 
3.3%
29
 
3.0%
2 29
 
3.0%
, 24
 
2.5%
24
 
2.5%
Other values (115) 547
55.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 766
78.2%
Dash Punctuation 71
 
7.3%
Decimal Number 64
 
6.5%
Space Separator 32
 
3.3%
Other Punctuation 24
 
2.5%
Lowercase Letter 8
 
0.8%
Uppercase Letter 5
 
0.5%
Open Punctuation 4
 
0.4%
Close Punctuation 4
 
0.4%
Final Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
71
 
9.3%
59
 
7.7%
56
 
7.3%
37
 
4.8%
29
 
3.8%
24
 
3.1%
24
 
3.1%
21
 
2.7%
21
 
2.7%
16
 
2.1%
Other values (99) 408
53.3%
Decimal Number
ValueCountFrequency (%)
2 29
45.3%
1 15
23.4%
4 13
20.3%
6 4
 
6.2%
3 3
 
4.7%
Lowercase Letter
ValueCountFrequency (%)
p 5
62.5%
t 2
 
25.0%
m 1
 
12.5%
Uppercase Letter
ValueCountFrequency (%)
N 4
80.0%
O 1
 
20.0%
Dash Punctuation
ValueCountFrequency (%)
- 71
100.0%
Space Separator
ValueCountFrequency (%)
32
100.0%
Other Punctuation
ValueCountFrequency (%)
, 24
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 766
78.2%
Common 200
 
20.4%
Latin 13
 
1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
71
 
9.3%
59
 
7.7%
56
 
7.3%
37
 
4.8%
29
 
3.8%
24
 
3.1%
24
 
3.1%
21
 
2.7%
21
 
2.7%
16
 
2.1%
Other values (99) 408
53.3%
Common
ValueCountFrequency (%)
- 71
35.5%
32
16.0%
2 29
14.5%
, 24
 
12.0%
1 15
 
7.5%
4 13
 
6.5%
6 4
 
2.0%
( 4
 
2.0%
) 4
 
2.0%
3 3
 
1.5%
Latin
ValueCountFrequency (%)
p 5
38.5%
N 4
30.8%
t 2
 
15.4%
m 1
 
7.7%
O 1
 
7.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 766
78.2%
ASCII 212
 
21.7%
Punctuation 1
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
71
 
9.3%
59
 
7.7%
56
 
7.3%
37
 
4.8%
29
 
3.8%
24
 
3.1%
24
 
3.1%
21
 
2.7%
21
 
2.7%
16
 
2.1%
Other values (99) 408
53.3%
ASCII
ValueCountFrequency (%)
- 71
33.5%
32
15.1%
2 29
13.7%
, 24
 
11.3%
1 15
 
7.1%
4 13
 
6.1%
p 5
 
2.4%
6 4
 
1.9%
N 4
 
1.9%
( 4
 
1.9%
Other values (5) 11
 
5.2%
Punctuation
ValueCountFrequency (%)
1
100.0%

화학물질영문
Text

Distinct: 57
Distinct (%): 57.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 96
Median length: 49
Mean length: 37.05
Min length: 8

Characters and Unicode

Total characters: 3705
Distinct characters: 59
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 34
Unique (%): 34.0%

Sample

1st row: Chlorobenzene
2nd row: Isocyanuric acid
3rd row: 4-Methyl-2-pentanone; Methylisobutyl ketone, MIBK
4th row: 3,5-Dinitro-N4,N4-dipropylsulfanilamide
5th row: Acetic anhydride

Value | Count | Frequency (%)
chloride | 15 | 6.5%
acid | 12 | 5.2%
chlormequat | 5 | 2.2%
p-nitrochlorobenzene | 5 | 2.2%
glyphosate | 5 | 2.2%
n-(phosphonomethyl)glycine | 5 | 2.2%
1-chloro-4-nitrobenzene | 5 | 2.2%
chlorocholine | 5 | 2.2%
2-chloro-n,n,n-trimethylethanaminium | 5 | 2.2%
p-chloronitrobenzene | 5 | 2.2%
Other values (100) | 163 | 70.9%

Most occurring characters

ValueCountFrequency (%)
e 338
 
9.1%
o 260
 
7.0%
i 247
 
6.7%
l 223
 
6.0%
t 216
 
5.8%
n 213
 
5.7%
- 212
 
5.7%
h 199
 
5.4%
a 190
 
5.1%
r 154
 
4.2%
Other values (49) 1453
39.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 2772
74.8%
Uppercase Letter 234
 
6.3%
Dash Punctuation 212
 
5.7%
Decimal Number 163
 
4.4%
Other Punctuation 144
 
3.9%
Space Separator 130
 
3.5%
Open Punctuation 25
 
0.7%
Close Punctuation 25
 
0.7%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 338
12.2%
o 260
9.4%
i 247
8.9%
l 223
 
8.0%
t 216
 
7.8%
n 213
 
7.7%
h 199
 
7.2%
a 190
 
6.9%
r 154
 
5.6%
y 135
 
4.9%
Other values (15) 597
21.5%
Uppercase Letter
ValueCountFrequency (%)
N 55
23.5%
C 40
17.1%
D 28
12.0%
M 18
 
7.7%
A 16
 
6.8%
P 15
 
6.4%
B 14
 
6.0%
E 11
 
4.7%
H 7
 
3.0%
G 5
 
2.1%
Other values (9) 25
10.7%
Decimal Number
ValueCountFrequency (%)
2 65
39.9%
1 42
25.8%
4 29
17.8%
6 12
 
7.4%
3 12
 
7.4%
5 3
 
1.8%
Other Punctuation
ValueCountFrequency (%)
, 84
58.3%
; 47
32.6%
' 13
 
9.0%
Open Punctuation
ValueCountFrequency (%)
( 24
96.0%
[ 1
 
4.0%
Close Punctuation
ValueCountFrequency (%)
) 24
96.0%
] 1
 
4.0%
Dash Punctuation
ValueCountFrequency (%)
- 212
100.0%
Space Separator
ValueCountFrequency (%)
130
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 3002
81.0%
Common 699
 
18.9%
Greek 4
 
0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 338
11.3%
o 260
 
8.7%
i 247
 
8.2%
l 223
 
7.4%
t 216
 
7.2%
n 213
 
7.1%
h 199
 
6.6%
a 190
 
6.3%
r 154
 
5.1%
y 135
 
4.5%
Other values (32) 827
27.5%
Common
ValueCountFrequency (%)
- 212
30.3%
130
18.6%
, 84
 
12.0%
2 65
 
9.3%
; 47
 
6.7%
1 42
 
6.0%
4 29
 
4.1%
( 24
 
3.4%
) 24
 
3.4%
' 13
 
1.9%
Other values (5) 29
 
4.1%
Greek
ValueCountFrequency (%)
γ 2
50.0%
α 2
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3701
99.9%
None 4
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 338
 
9.1%
o 260
 
7.0%
i 247
 
6.7%
l 223
 
6.0%
t 216
 
5.8%
n 213
 
5.8%
- 212
 
5.7%
h 199
 
5.4%
a 190
 
5.1%
r 154
 
4.2%
Other values (47) 1449
39.2%
None
ValueCountFrequency (%)
γ 2
50.0%
α 2
50.0%

Correlations

Variable | CAS등록번호 | 항목 | 생물종 | 생물종상세유형 | 성별내용 | 계통 | 노출경로 | 독성 | 단위 | 조건 | 출처 | 화학물질국문 | 화학물질영문
CAS등록번호 | 1.000 | 0.912 | 0.912 | 0.564 | 0.689 | 0.716 | 0.846 | 0.928 | 0.917 | 0.999 | 0.989 | 1.000 | 1.000
항목 | 0.912 | 1.000 | 0.621 | 0.599 | 0.333 | 0.000 | 0.476 | 0.823 | 0.489 | 0.896 | 0.470 | 0.912 | 0.912
생물종 | 0.912 | 0.621 | 1.000 | 1.000 | 0.000 | 0.000 | 0.347 | 0.783 | 0.000 | 0.983 | 0.000 | 0.912 | 0.912
생물종상세유형 | 0.564 | 0.599 | 1.000 | 1.000 | 0.000 | NaN | 0.617 | 0.000 | 0.340 | 0.962 | 1.000 | 0.564 | 0.564
성별내용 | 0.689 | 0.333 | 0.000 | 0.000 | 1.000 | 0.598 | 0.000 | 0.399 | 0.209 | 0.000 | 0.081 | 0.689 | 0.689
계통 | 0.716 | 0.000 | 0.000 | NaN | 0.598 | 1.000 | 0.000 | 0.789 | 0.594 | 0.816 | NaN | 0.716 | 0.716
노출경로 | 0.846 | 0.476 | 0.347 | 0.617 | 0.000 | 0.000 | 1.000 | 0.000 | 0.467 | 0.923 | 0.825 | 0.846 | 0.846
독성 | 0.928 | 0.823 | 0.783 | 0.000 | 0.399 | 0.789 | 0.000 | 1.000 | 0.000 | 0.969 | 0.994 | 0.928 | 0.928
단위 | 0.917 | 0.489 | 0.000 | 0.340 | 0.209 | 0.594 | 0.467 | 0.000 | 1.000 | 0.998 | 0.818 | 0.917 | 0.917
조건 | 0.999 | 0.896 | 0.983 | 0.962 | 0.000 | 0.816 | 0.923 | 0.969 | 0.998 | 1.000 | 0.992 | 0.999 | 0.999
출처 | 0.989 | 0.470 | 0.000 | 1.000 | 0.081 | NaN | 0.825 | 0.994 | 0.818 | 0.992 | 1.000 | 0.989 | 0.989
화학물질국문 | 1.000 | 0.912 | 0.912 | 0.564 | 0.689 | 0.716 | 0.846 | 0.928 | 0.917 | 0.999 | 0.989 | 1.000 | 1.000
화학물질영문 | 1.000 | 0.912 | 0.912 | 0.564 | 0.689 | 0.716 | 0.846 | 0.928 | 0.917 | 0.999 | 0.989 | 1.000 | 1.000

Variable | 생물종상세유형 | 단위 | 출처 | 항목 | 노출경로 | 생물종 | 성별내용
생물종상세유형 | 1.000 | 0.121 | 0.795 | 0.316 | 0.354 | 0.854 | 0.000
단위 | 0.121 | 1.000 | 0.555 | 0.275 | 0.261 | 0.000 | 0.058
출처 | 0.795 | 0.555 | 1.000 | 0.336 | 0.436 | 0.000 | 0.000
항목 | 0.316 | 0.275 | 0.336 | 1.000 | 0.187 | 0.429 | 0.314
노출경로 | 0.354 | 0.261 | 0.436 | 0.187 | 1.000 | 0.212 | 0.000
생물종 | 0.854 | 0.000 | 0.000 | 0.429 | 0.212 | 1.000 | 0.000
성별내용 | 0.000 | 0.058 | 0.000 | 0.314 | 0.000 | 0.000 | 1.000

Variable | 항목 | 생물종 | 생물종상세유형 | 성별내용 | 노출경로 | 단위 | 출처
항목 | 1.000 | 0.429 | 0.316 | 0.314 | 0.187 | 0.275 | 0.336
생물종 | 0.429 | 1.000 | 0.854 | 0.000 | 0.212 | 0.000 | 0.000
생물종상세유형 | 0.316 | 0.854 | 1.000 | 0.000 | 0.354 | 0.121 | 0.795
성별내용 | 0.314 | 0.000 | 0.000 | 1.000 | 0.000 | 0.058 | 0.000
노출경로 | 0.187 | 0.212 | 0.354 | 0.000 | 1.000 | 0.261 | 0.436
단위 | 0.275 | 0.000 | 0.121 | 0.058 | 0.261 | 1.000 | 0.555
출처 | 0.336 | 0.000 | 0.795 | 0.000 | 0.436 | 0.555 | 1.000
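
The report does not name the association measure behind these matrices. For pairs of categorical variables, one common choice is Cramér's V, which rescales the chi-squared statistic of the contingency table to the range 0 to 1. The sketch below implements the plain (not bias-corrected) form as an illustration only; whether the figures above use this form or a corrected variant is an assumption, so the output is not expected to match the tables exactly.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys):
    """Plain Cramér's V between two equally long sequences of category labels."""
    n = len(xs)
    if n == 0 or n != len(ys):
        raise ValueError("inputs must be non-empty and of equal length")
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    joint = Counter(zip(xs, ys))
    k = min(len(row_totals), len(col_totals)) - 1
    if k == 0:
        return 0.0  # a constant variable carries no association
    chi2 = 0.0
    for r, r_count in row_totals.items():
        for c, c_count in col_totals.items():
            expected = r_count * c_count / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected
    return sqrt(chi2 / (n * k))


# Toy data: species fully determined by strain, versus unrelated labels.
print(cramers_v(["a", "a", "b", "b"], ["x", "x", "y", "y"]))  # prints 1.0
print(cramers_v(["a", "a", "b", "b"], ["x", "y", "x", "y"]))  # prints 0.0
```

With only 100 rows and many sparse categories, values near 1.000 (such as those involving CAS등록번호) mostly reflect near-unique keys rather than a meaningful relationship.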

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
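
Several categorical columns report zero missing values while listing "<NA>" or "-" among their categories, which suggests those placeholders are stored as ordinary strings and so escape the nullity counts. The sketch below counts nullity per column while treating such placeholders as missing. The placeholder set, the helper name `nullity_by_column`, and the three example rows are illustrative assumptions, not data copied from the report.

```python
# Strings assumed to stand in for missing data in this dataset.
PLACEHOLDERS = frozenset({"<NA>", "-", ""})


def nullity_by_column(rows, placeholders=PLACEHOLDERS):
    """Count, per column, values that are None or a known placeholder string."""
    counts = {}
    for row in rows:
        for column, value in row.items():
            is_missing = value is None or str(value).strip() in placeholders
            counts[column] = counts.get(column, 0) + int(is_missing)
    return counts


# Made-up rows using column names from this report.
rows = [
    {"생물종": "랫드", "계통": None, "노출경로": "경구"},
    {"생물종": "<NA>", "계통": None, "노출경로": "경피"},
    {"생물종": "래빗", "계통": "전신", "노출경로": "-"},
]
print(nullity_by_column(rows))
```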

Sample

First rows

Index | CAS등록번호 | 항목 | 생물종 | 생물종상세유형 | 성별내용 | 계통 | 노출경로 | 독성 | 단위 | 조건 | 출처 | 인용출처 | 화학물질국문 | 화학물질영문
0 | 108-90-7 | NOAEL |  | Beagle | <NA> | <NA> | 흡입 | 2.06 | mg/L | 0, 0.79, 1.59, 2.06 mg/L의 용량으로 6개월간 흡입 노출, (GLP : yes) | <NA> | KISChem | 클로로벤젠 | Chlorobenzene
1 | 108-80-5 | NOAEL | 랫드 | <NA> | 암컷, 수컷 | 생식/발달 | 경구 | 150 | mg/kg/day | 수컷에 45일동안 10, 40, 150, 600 mg/kg/day의 용량으로, 암컷에 교배 전 14일부터 수유 3일 까지 경구 섭취 | <NA> | KISChem | 아이소시아누르산 | Isocyanuric acid
2 | 108-10-1 | NOAEL | 랫드 | Sprague-Dawley | <NA> | 간,신장 | 경구 | 50 | mg/kg | 13주간 경구투여(0, 50, 250, 1000 mg/kg) | <NA> | KISChem | 메틸 아이소부틸 케톤 | 4-Methyl-2-pentanone; Methylisobutyl ketone, MIBK
3 | 19044-88-3 | NOEL |  | Beagle | <NA> | 간,신장 | 경구 | 5 | mg/kg/day | 경구 1년 독성 | <NA> | KISChem | 오라이잘린 | 3,5-Dinitro-N4,N4-dipropylsulfanilamide
4 | 108-24-7 | NOEL | <NA> | <NA> | <NA> | <NA> | 경피 | 1 | ppm | 국소 부위 접촉으로 자극이 나타남 | <NA> | KISChem | 무수아세트산 | Acetic anhydride
5 | 1918-00-9 | NOEL | 래빗 | <NA> | 암컷 | 모계/태아 | <NA> | 3 | mg/kg/day | 0, 1.0, 3.0, or 10.0 mg/kg/day 노출 | <NA> | KISChem | 다이캄바 | 3,6-Dichloro-2-methoxybenzoic acid; 3,6-Dichloro-o-anisic acid, Dicamba
6 | 112-57-2 | NOEL | 래빗 | <NA> | <NA> | 전신 | 경피 | 200 | mg/kg/day | 4주 반복 독성 연구(고용량 시험) | <NA> | KISChem | 테트라에틸렌펜타민 | Tetraethylenepentamine
7 | 506-68-3 | NOAEL | 랫드 | <NA> | <NA> | <NA> | 경구 | 10.8 | mg/kg/day | 사이아나이드 | <NA> | KISChem | 사이아노젠 브로마이드 | Cyanogen bromide
8 | 506-68-3 | NOAEL | 랫드 | <NA> | <NA> | <NA> | 경구 | 44 | mg/kg/day | 사이아노겐 | <NA> | KISChem | 사이아노젠 브로마이드 | Cyanogen bromide
9 | 1071-83-6 | NOAEL | 랫드 | Sprague-Dawley | 암컷, 수컷 | <NA> | 경구 | 약1,000 | mg/kg bw | 90일(7일/주) 연구, Directive 87/302/EEC에 따라 0, 50, 250, 1,000 mg/kg bw/day에 노출 | <NA> | KISChem | 글리포스페이트 | N-(Phosphonomethyl)glycine; Glyphosate

Last rows

Index | CAS등록번호 | 항목 | 생물종 | 생물종상세유형 | 성별내용 | 계통 | 노출경로 | 독성 | 단위 | 조건 | 출처 | 인용출처 | 화학물질국문 | 화학물질영문
90 | 87-62-7 | NOAEL | 랫드 | F344 | <NA> | <NA> | 경구 | 160 | mg/kg | 12일(매일 1번) | ECB IUCLID | KISChem | 2,6-다이메틸아닐린 | 2,6-Dimethylaniline; 2,6-Xylidine
91 | 87-62-7 | NOAEL | 랫드 | Sprague-Dawley | <NA> | <NA> | 경구 | 20 | mg/kg | 4주(매일 1번) | ECB IUCLID | KISChem | 2,6-다이메틸아닐린 | 2,6-Dimethylaniline; 2,6-Xylidine
92 | 87-62-7 | NOAEL | 마우스 | B6C3F1 | <NA> | <NA> | 경구 | 160 | mg/kg | 12일(매일 1번) | ECB IUCLID | KISChem | 2,6-다이메틸아닐린 | 2,6-Dimethylaniline; 2,6-Xylidine
93 | 100-00-5 | LOAEC | 랫드 | <NA> | <NA> | <NA> | 흡입 | 5 | mg/㎥ | 4주 | OECD SIDS | KISChem | 1-클로로-4-나이트로벤젠 | 1-Chloro-4-nitrobenzene; p-Nitrochlorobenzene, p-Chloronitrobenzene
94 | 100-00-5 | LOAEC | 랫드 | <NA> | <NA> | <NA> | 흡입 | 9.81 | mg/㎥ | 13주 | OECD SIDS | KISChem | 1-클로로-4-나이트로벤젠 | 1-Chloro-4-nitrobenzene; p-Nitrochlorobenzene, p-Chloronitrobenzene
95 | 100-00-5 | NOAEC | 마우스 | <NA> | <NA> | <NA> | 흡입 | 32.94 | mg/㎥ | 13주 | OECD SIDS | KISChem | 1-클로로-4-나이트로벤젠 | 1-Chloro-4-nitrobenzene; p-Nitrochlorobenzene, p-Chloronitrobenzene
96 | 100-00-5 | LOAEC | 랫드 | <NA> | <NA> | <NA> | 경구 | 5 | mg/㎥ | OECD TG 408 | OECD SIDS | KISChem | 1-클로로-4-나이트로벤젠 | 1-Chloro-4-nitrobenzene; p-Nitrochlorobenzene, p-Chloronitrobenzene
97 | 100-00-5 | LOAEL | 랫드 | <NA> | <NA> | <NA> | 경구 | 3 | mg/kg bw/day | OECD TG 453 | OECD SIDS | KISChem | 1-클로로-4-나이트로벤젠 | 1-Chloro-4-nitrobenzene; p-Nitrochlorobenzene, p-Chloronitrobenzene
98 | 100-01-6 | NOAEL | 랫드 | Sprague-Dawley | <NA> | <NA> | 경구 | <3 | mg/kg | 90일 독성 연구 | ECB IUCLID | KISChem | 4-나이트로아닐린 | 4-Nitroaniline
99 | 100-01-6 | NOAEL | 마우스 | B6C3F1 | <NA> | <NA> | 경구 | <10 | mg/kg | 2주 독성 연구 | ECB IUCLID | KISChem | 4-나이트로아닐린 | 4-Nitroaniline