Overview

Dataset statistics

Number of variables7
Number of observations2250
Missing cells3124
Missing cells (%)19.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory125.4 KiB
Average record size in memory57.1 B

Variable types

Numeric1
Text4
Boolean1
DateTime1

Dataset

Description부산광역시 시도긴급구조표준시스템의 화학물질기본정보데이터로 물질(ID), 화학물질번호(CAS), 유독물번호, 한글명, 영문명, 모델실행, 데이타접수일 등 정보를 제공합니다.
URLhttps://www.data.go.kr/data/15121178/fileData.do

Alerts

데이타접수일 has constant value ""Constant
물질(ID) is highly overall correlated with 모델실행High correlation
모델실행 is highly overall correlated with 물질(ID)High correlation
유독물번호 has 1643 (73.0%) missing valuesMissing
한글명 has 1481 (65.8%) missing valuesMissing
물질(ID) has unique valuesUnique

Reproduction

Analysis started2023-12-12 11:04:08.303519
Analysis finished2023-12-12 11:04:10.052093
Duration1.75 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

물질(ID)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct2250
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3964.5204
Minimum1
Maximum10329
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size19.9 KiB
2023-12-12T20:04:10.172227image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile113.45
Q1570.25
median4240.5
Q35341.5
95-th percentile10007.55
Maximum10329
Range10328
Interquartile range (IQR)4771.25

Descriptive statistics

Standard deviation2736.032
Coefficient of variation (CV)0.69012936
Kurtosis-0.14897488
Mean3964.5204
Median Absolute Deviation (MAD)1221
Skewness0.34613585
Sum8920171
Variance7485871
MonotonicityNot monotonic
2023-12-12T20:04:10.380622image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4853 1
 
< 0.1%
4927 1
 
< 0.1%
4454 1
 
< 0.1%
4458 1
 
< 0.1%
4462 1
 
< 0.1%
4882 1
 
< 0.1%
4888 1
 
< 0.1%
4915 1
 
< 0.1%
4933 1
 
< 0.1%
4422 1
 
< 0.1%
Other values (2240) 2240
99.6%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
10329 1
< 0.1%
10328 1
< 0.1%
10327 1
< 0.1%
10326 1
< 0.1%
10325 1
< 0.1%
10324 1
< 0.1%
10323 1
< 0.1%
10322 1
< 0.1%
10321 1
< 0.1%
10320 1
< 0.1%
Distinct2247
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Memory size17.7 KiB
2023-12-12T20:04:10.890940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length34
Mean length8.4066667
Min length5

Characters and Unicode

Total characters18915
Distinct characters16
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2244 ?
Unique (%)99.7%

Sample

1st row79-04-9
2nd row260-94-6
3rd row2807-30-9
4th row60-00-4
5th row75-87-6
ValueCountFrequency (%)
78-00-2 2
 
0.1%
12001-28-4 2
 
0.1%
80-46-6 2
 
0.1%
35184-08-8 2
 
0.1%
7158-25-0 2
 
0.1%
1313-99-1 2
 
0.1%
77536-67-5 2
 
0.1%
12035-72-2 2
 
0.1%
12172-73-5 2
 
0.1%
16812-54-7 2
 
0.1%
Other values (2240) 2242
99.1%
2023-12-12T20:04:11.650706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 4466
23.6%
1 2163
11.4%
7 1482
 
7.8%
0 1474
 
7.8%
2 1433
 
7.6%
5 1412
 
7.5%
6 1350
 
7.1%
4 1286
 
6.8%
9 1282
 
6.8%
3 1268
 
6.7%
Other values (6) 1299
 
6.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 14344
75.8%
Dash Punctuation 4466
 
23.6%
Uppercase Letter 54
 
0.3%
Connector Punctuation 27
 
0.1%
Other Punctuation 12
 
0.1%
Space Separator 12
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 2163
15.1%
7 1482
10.3%
0 1474
10.3%
2 1433
10.0%
5 1412
9.8%
6 1350
9.4%
4 1286
9.0%
9 1282
8.9%
3 1268
8.8%
8 1194
8.3%
Uppercase Letter
ValueCountFrequency (%)
N 27
50.0%
A 27
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 4466
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 27
100.0%
Other Punctuation
ValueCountFrequency (%)
, 12
100.0%
Space Separator
ValueCountFrequency (%)
12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 18861
99.7%
Latin 54
 
0.3%

Most frequent character per script

Common
ValueCountFrequency (%)
- 4466
23.7%
1 2163
11.5%
7 1482
 
7.9%
0 1474
 
7.8%
2 1433
 
7.6%
5 1412
 
7.5%
6 1350
 
7.2%
4 1286
 
6.8%
9 1282
 
6.8%
3 1268
 
6.7%
Other values (4) 1245
 
6.6%
Latin
ValueCountFrequency (%)
N 27
50.0%
A 27
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 18915
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 4466
23.6%
1 2163
11.4%
7 1482
 
7.8%
0 1474
 
7.8%
2 1433
 
7.6%
5 1412
 
7.5%
6 1350
 
7.1%
4 1286
 
6.8%
9 1282
 
6.8%
3 1268
 
6.7%
Other values (6) 1299
 
6.9%

유독물번호
Text

MISSING 

Distinct587
Distinct (%)96.7%
Missing1643
Missing (%)73.0%
Memory size17.7 KiB
2023-12-12T20:04:12.160807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length8
Mean length9.3031301
Min length2

Characters and Unicode

Total characters5647
Distinct characters18
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique579 ?
Unique (%)95.4%

Sample

1st row97-1-113(유), 06-5-2(제)
2nd row97-1-391
3rd row97-1-362
4th row97-1-4
5th row97-1-342(유), 06-4-50(금)
ValueCountFrequency (%)
2001-1-515 7
 
1.1%
97-1-90 7
 
1.1%
97-1-296 3
 
0.5%
2004-1-546 3
 
0.5%
97-1-9 2
 
0.3%
97-1-119 2
 
0.3%
97-1-268 2
 
0.3%
97-1-138 2
 
0.3%
97-1-39 1
 
0.2%
06-4-12(금 1
 
0.2%
Other values (632) 632
95.5%
2023-12-12T20:04:12.960845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 1320
23.4%
1 851
15.1%
9 670
11.9%
7 603
10.7%
2 339
 
6.0%
0 326
 
5.8%
4 308
 
5.5%
3 248
 
4.4%
5 216
 
3.8%
6 182
 
3.2%
Other values (8) 584
10.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3878
68.7%
Dash Punctuation 1320
 
23.4%
Open Punctuation 113
 
2.0%
Close Punctuation 113
 
2.0%
Other Letter 113
 
2.0%
Other Punctuation 55
 
1.0%
Space Separator 55
 
1.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 851
21.9%
9 670
17.3%
7 603
15.5%
2 339
 
8.7%
0 326
 
8.4%
4 308
 
7.9%
3 248
 
6.4%
5 216
 
5.6%
6 182
 
4.7%
8 135
 
3.5%
Other Letter
ValueCountFrequency (%)
55
48.7%
55
48.7%
3
 
2.7%
Dash Punctuation
ValueCountFrequency (%)
- 1320
100.0%
Open Punctuation
ValueCountFrequency (%)
( 113
100.0%
Close Punctuation
ValueCountFrequency (%)
) 113
100.0%
Other Punctuation
ValueCountFrequency (%)
, 55
100.0%
Space Separator
ValueCountFrequency (%)
55
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5534
98.0%
Hangul 113
 
2.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 1320
23.9%
1 851
15.4%
9 670
12.1%
7 603
10.9%
2 339
 
6.1%
0 326
 
5.9%
4 308
 
5.6%
3 248
 
4.5%
5 216
 
3.9%
6 182
 
3.3%
Other values (5) 471
 
8.5%
Hangul
ValueCountFrequency (%)
55
48.7%
55
48.7%
3
 
2.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5534
98.0%
Hangul 113
 
2.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 1320
23.9%
1 851
15.4%
9 670
12.1%
7 603
10.9%
2 339
 
6.1%
0 326
 
5.9%
4 308
 
5.6%
3 248
 
4.5%
5 216
 
3.9%
6 182
 
3.3%
Other values (5) 471
 
8.5%
Hangul
ValueCountFrequency (%)
55
48.7%
55
48.7%
3
 
2.7%

한글명
Text

MISSING 

Distinct766
Distinct (%)99.6%
Missing1481
Missing (%)65.8%
Memory size17.7 KiB
2023-12-12T20:04:13.485286image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length129
Median length67
Mean length10.091027
Min length2

Characters and Unicode

Total characters7760
Distinct characters299
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique764 ?
Unique (%)99.3%

Sample

1st row염소2
2nd row시아누르 염화물
3rd row프로필 벤젠
4th row디이소프로필벤젠
5th row브롬화 메틸
ValueCountFrequency (%)
17
 
1.4%
메틸 17
 
1.4%
염화 16
 
1.4%
나트륨 16
 
1.4%
염류 14
 
1.2%
아세트산 13
 
1.1%
포함한 10
 
0.8%
화합물질 10
 
0.8%
카드뮴 9
 
0.8%
에틸 9
 
0.8%
Other values (846) 1051
88.9%
2023-12-12T20:04:14.261991image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 473
 
6.1%
413
 
5.3%
294
 
3.8%
217
 
2.8%
200
 
2.6%
199
 
2.6%
, 175
 
2.3%
173
 
2.2%
173
 
2.2%
169
 
2.2%
Other values (289) 5274
68.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5847
75.3%
Dash Punctuation 473
 
6.1%
Decimal Number 451
 
5.8%
Space Separator 413
 
5.3%
Other Punctuation 186
 
2.4%
Close Punctuation 138
 
1.8%
Open Punctuation 138
 
1.8%
Uppercase Letter 67
 
0.9%
Lowercase Letter 43
 
0.6%
Math Symbol 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
294
 
5.0%
217
 
3.7%
200
 
3.4%
199
 
3.4%
173
 
3.0%
173
 
3.0%
169
 
2.9%
167
 
2.9%
121
 
2.1%
117
 
2.0%
Other values (242) 4017
68.7%
Lowercase Letter
ValueCountFrequency (%)
a 14
32.6%
p 8
18.6%
n 6
14.0%
f 2
 
4.7%
b 2
 
4.7%
o 2
 
4.7%
β 1
 
2.3%
d 1
 
2.3%
e 1
 
2.3%
r 1
 
2.3%
Other values (5) 5
 
11.6%
Decimal Number
ValueCountFrequency (%)
2 122
27.1%
1 108
23.9%
4 87
19.3%
3 55
12.2%
5 27
 
6.0%
8 18
 
4.0%
6 14
 
3.1%
9 12
 
2.7%
7 4
 
0.9%
0 4
 
0.9%
Uppercase Letter
ValueCountFrequency (%)
N 39
58.2%
H 10
 
14.9%
S 5
 
7.5%
I 4
 
6.0%
O 3
 
4.5%
C 2
 
3.0%
V 2
 
3.0%
A 1
 
1.5%
E 1
 
1.5%
Other Punctuation
ValueCountFrequency (%)
, 175
94.1%
: 7
 
3.8%
. 3
 
1.6%
/ 1
 
0.5%
Math Symbol
ValueCountFrequency (%)
= 2
50.0%
+ 1
25.0%
1
25.0%
Close Punctuation
ValueCountFrequency (%)
) 91
65.9%
] 47
34.1%
Open Punctuation
ValueCountFrequency (%)
( 91
65.9%
[ 47
34.1%
Dash Punctuation
ValueCountFrequency (%)
- 473
100.0%
Space Separator
ValueCountFrequency (%)
413
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5847
75.3%
Common 1803
 
23.2%
Latin 108
 
1.4%
Greek 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
294
 
5.0%
217
 
3.7%
200
 
3.4%
199
 
3.4%
173
 
3.0%
173
 
3.0%
169
 
2.9%
167
 
2.9%
121
 
2.1%
117
 
2.0%
Other values (242) 4017
68.7%
Common
ValueCountFrequency (%)
- 473
26.2%
413
22.9%
, 175
 
9.7%
2 122
 
6.8%
1 108
 
6.0%
) 91
 
5.0%
( 91
 
5.0%
4 87
 
4.8%
3 55
 
3.1%
[ 47
 
2.6%
Other values (13) 141
 
7.8%
Latin
ValueCountFrequency (%)
N 39
36.1%
a 14
 
13.0%
H 10
 
9.3%
p 8
 
7.4%
n 6
 
5.6%
S 5
 
4.6%
I 4
 
3.7%
O 3
 
2.8%
C 2
 
1.9%
f 2
 
1.9%
Other values (12) 15
 
13.9%
Greek
ValueCountFrequency (%)
β 1
50.0%
α 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5847
75.3%
ASCII 1910
 
24.6%
None 2
 
< 0.1%
Math Operators 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 473
24.8%
413
21.6%
, 175
 
9.2%
2 122
 
6.4%
1 108
 
5.7%
) 91
 
4.8%
( 91
 
4.8%
4 87
 
4.6%
3 55
 
2.9%
[ 47
 
2.5%
Other values (34) 248
13.0%
Hangul
ValueCountFrequency (%)
294
 
5.0%
217
 
3.7%
200
 
3.4%
199
 
3.4%
173
 
3.0%
173
 
3.0%
169
 
2.9%
167
 
2.9%
121
 
2.1%
117
 
2.0%
Other values (242) 4017
68.7%
None
ValueCountFrequency (%)
β 1
50.0%
α 1
50.0%
Math Operators
ValueCountFrequency (%)
1
100.0%
Distinct2246
Distinct (%)99.8%
Missing0
Missing (%)0.0%
Memory size17.7 KiB
2023-12-12T20:04:14.645950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length164
Median length96
Mean length18.115111
Min length3

Characters and Unicode

Total characters40759
Distinct characters85
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2243 ?
Unique (%)99.7%

Sample

1st rowCHLOROACETYL CHLORIDE
2nd rowACRIDINE
3rd rowETHYLENE GLYCOL MONOPROPYL ETHER
4th rowETHYLENEDIAMINETETRAACETIC ACID
5th rowTRICHLOROACETALDEHYDE
ValueCountFrequency (%)
acid 129
 
3.8%
ether 61
 
1.8%
methyl 57
 
1.7%
chloride 53
 
1.6%
ethyl 42
 
1.3%
acetate 38
 
1.1%
glycol 37
 
1.1%
sodium 35
 
1.0%
oxide 24
 
0.7%
mercaptan 22
 
0.7%
Other values (2034) 2859
85.2%
2023-12-12T20:04:15.293057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
E 3390
 
8.3%
N 2099
 
5.1%
T 2088
 
5.1%
L 1849
 
4.5%
O 1787
 
4.4%
- 1685
 
4.1%
A 1673
 
4.1%
I 1613
 
4.0%
H 1392
 
3.4%
e 1245
 
3.1%
Other values (75) 21938
53.8%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 24365
59.8%
Lowercase Letter 11300
27.7%
Dash Punctuation 1685
 
4.1%
Decimal Number 1429
 
3.5%
Space Separator 1107
 
2.7%
Other Punctuation 553
 
1.4%
Open Punctuation 155
 
0.4%
Close Punctuation 153
 
0.4%
Other Letter 7
 
< 0.1%
Math Symbol 4
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 1245
11.0%
o 1149
10.2%
i 985
 
8.7%
l 867
 
7.7%
n 848
 
7.5%
a 764
 
6.8%
t 744
 
6.6%
h 702
 
6.2%
r 688
 
6.1%
y 579
 
5.1%
Other values (18) 2729
24.2%
Uppercase Letter
ValueCountFrequency (%)
E 3390
13.9%
N 2099
 
8.6%
T 2088
 
8.6%
L 1849
 
7.6%
O 1787
 
7.3%
A 1673
 
6.9%
I 1613
 
6.6%
H 1392
 
5.7%
Y 1221
 
5.0%
R 1179
 
4.8%
Other values (15) 6074
24.9%
Decimal Number
ValueCountFrequency (%)
2 463
32.4%
1 424
29.7%
3 215
15.0%
4 184
 
12.9%
5 61
 
4.3%
6 35
 
2.4%
8 23
 
1.6%
9 13
 
0.9%
7 7
 
0.5%
0 4
 
0.3%
Other Letter
ValueCountFrequency (%)
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
Other Punctuation
ValueCountFrequency (%)
, 535
96.7%
: 8
 
1.4%
' 5
 
0.9%
. 4
 
0.7%
/ 1
 
0.2%
Math Symbol
ValueCountFrequency (%)
= 2
50.0%
1
25.0%
+ 1
25.0%
Open Punctuation
ValueCountFrequency (%)
( 108
69.7%
[ 47
30.3%
Close Punctuation
ValueCountFrequency (%)
) 106
69.3%
] 47
30.7%
Dash Punctuation
ValueCountFrequency (%)
- 1685
100.0%
Space Separator
ValueCountFrequency (%)
1107
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 35663
87.5%
Common 5087
 
12.5%
Hangul 7
 
< 0.1%
Greek 2
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
E 3390
 
9.5%
N 2099
 
5.9%
T 2088
 
5.9%
L 1849
 
5.2%
O 1787
 
5.0%
A 1673
 
4.7%
I 1613
 
4.5%
H 1392
 
3.9%
e 1245
 
3.5%
Y 1221
 
3.4%
Other values (41) 17306
48.5%
Common
ValueCountFrequency (%)
- 1685
33.1%
1107
21.8%
, 535
 
10.5%
2 463
 
9.1%
1 424
 
8.3%
3 215
 
4.2%
4 184
 
3.6%
( 108
 
2.1%
) 106
 
2.1%
5 61
 
1.2%
Other values (15) 199
 
3.9%
Hangul
ValueCountFrequency (%)
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
Greek
ValueCountFrequency (%)
α 1
50.0%
β 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 40748
> 99.9%
Hangul 7
 
< 0.1%
None 2
 
< 0.1%
Math Operators 1
 
< 0.1%
Punctuation 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
E 3390
 
8.3%
N 2099
 
5.2%
T 2088
 
5.1%
L 1849
 
4.5%
O 1787
 
4.4%
- 1685
 
4.1%
A 1673
 
4.1%
I 1613
 
4.0%
H 1392
 
3.4%
e 1245
 
3.1%
Other values (64) 21927
53.8%
Hangul
ValueCountFrequency (%)
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
Math Operators
ValueCountFrequency (%)
1
100.0%
Punctuation
ValueCountFrequency (%)
1
100.0%
None
ValueCountFrequency (%)
α 1
50.0%
β 1
50.0%

모델실행
Boolean

HIGH CORRELATION 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
True
1335 
False
915 
ValueCountFrequency (%)
True 1335
59.3%
False 915
40.7%
2023-12-12T20:04:15.518628image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

데이타접수일
Date

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size17.7 KiB
Minimum2007-03-23 00:00:00
Maximum2007-03-23 00:00:00
2023-12-12T20:04:15.649794image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:04:15.809842image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-12T20:04:09.320734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T20:04:15.990010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
물질(ID)모델실행
물질(ID)1.0000.641
모델실행0.6411.000
2023-12-12T20:04:16.154900image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
물질(ID)모델실행
물질(ID)1.0000.647
모델실행0.6471.000

Missing values

2023-12-12T20:04:09.549214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T20:04:09.805019image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T20:04:09.974773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

물질(ID)화학물질번호(CAS)유독물번호한글명영문명모델실행데이타접수일
0485379-04-9<NA><NA>CHLOROACETYL CHLORIDEY2007-03-23
16740260-94-6<NA><NA>ACRIDINEN2007-03-23
278552807-30-9<NA><NA>ETHYLENE GLYCOL MONOPROPYL ETHERY2007-03-23
3786160-00-4<NA><NA>ETHYLENEDIAMINETETRAACETIC ACIDY2007-03-23
4786575-87-6<NA><NA>TRICHLOROACETALDEHYDEY2007-03-23
59883818-61-1<NA><NA>2-HYDROXYETHYL ACRYLATEY2007-03-23
6989812-11<NA>염소2AAAN2007-03-23
79990108-77-0<NA>시아누르 염화물Cyanuric chlorideN2007-03-23
89993165-65-1<NA>프로필 벤젠PROPYL BENZENEN2007-03-23
9999825321-09-9<NA>디이소프로필벤젠DiisopropylbenzeneN2007-03-23
물질(ID)화학물질번호(CAS)유독물번호한글명영문명모델실행데이타접수일
2240985860-24-2<NA><NA>2-MERCAPTOETHANOLY2007-03-23
22419872121-32-4<NA><NA>ETHYL VANILLINY2007-03-23
22429884923-26-2<NA><NA>2-HYDROXYPROPYL METHACRYLATEN2007-03-23
22439885763-69-9<NA><NA>ETHYL-3-ETHOXYPROPIONATEY2007-03-23
22449886959-26-2<NA><NA>BIS-(2-HYDROXYETHYL) TEREPHTHALATEY2007-03-23
2245992897-44487-44평산가스PYEONGSANN2007-03-23
2246993097-111144평산PYEONGSANN2007-03-23
2247993297-77779-14평산암모늄PYAMN2007-03-23
224899911321-12-6<NA>니트로톨루엔NitrotoluenesN2007-03-23
22499992138-86-3<NA>디펜텐DipenteneN2007-03-23