Overview

Dataset statistics

Number of variables14
Number of observations10000
Missing cells1820
Missing cells (%)1.3%
Duplicate rows1
Duplicate rows (%)< 0.1%
Total size in memory1.2 MiB
Average record size in memory123.0 B

Variable types

Categorical6
Text5
Numeric3

Dataset

Description환경오염물질배출업소 현황
Author경기도
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=GQ1EAI6RYZ3UV4DZ7JD718975142&infSeq=1

Alerts

Dataset has 1 (< 0.1%) duplicate rowsDuplicates
대기관리등급 is highly overall correlated with 폐수관리등급High correlation
관할기관명 is highly overall correlated with 소재지우편번호 and 3 other fieldsHigh correlation
시군명 is highly overall correlated with 소재지우편번호 and 3 other fieldsHigh correlation
폐수관리등급 is highly overall correlated with 대기관리등급High correlation
소재지우편번호 is highly overall correlated with WGS84위도 and 2 other fieldsHigh correlation
WGS84위도 is highly overall correlated with 소재지우편번호 and 2 other fieldsHigh correlation
WGS84경도 is highly overall correlated with 시군명 and 1 other fieldsHigh correlation
폐수관리등급 is highly imbalanced (58.7%)Imbalance
대기관리등급 is highly imbalanced (56.4%)Imbalance
폐수종별구분명 is highly imbalanced (55.5%)Imbalance
업종명 has 585 (5.9%) missing valuesMissing
소재지우편번호 has 259 (2.6%) missing valuesMissing
WGS84위도 has 449 (4.5%) missing valuesMissing
WGS84경도 has 449 (4.5%) missing valuesMissing

Reproduction

Analysis started2024-03-23 01:49:09.156953
Analysis finished2024-03-23 01:49:20.354878
Duration11.2 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군명
Categorical

HIGH CORRELATION 

Distinct31
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
화성시
1596 
김포시
999 
안산시
801 
포천시
779 
평택시
590 
Other values (26)
5235 

Length

Max length4
Median length3
Mean length3.0415
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row안산시
2nd row화성시
3rd row김포시
4th row안산시
5th row안성시

Common Values

ValueCountFrequency (%)
화성시 1596
16.0%
김포시 999
 
10.0%
안산시 801
 
8.0%
포천시 779
 
7.8%
평택시 590
 
5.9%
파주시 579
 
5.8%
시흥시 482
 
4.8%
안성시 479
 
4.8%
용인시 458
 
4.6%
양주시 435
 
4.3%
Other values (21) 2802
28.0%

Length

2024-03-23T01:49:20.609063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
화성시 1596
16.0%
김포시 999
 
10.0%
안산시 801
 
8.0%
포천시 779
 
7.8%
평택시 590
 
5.9%
파주시 579
 
5.8%
시흥시 482
 
4.8%
안성시 479
 
4.8%
용인시 458
 
4.6%
양주시 435
 
4.3%
Other values (21) 2802
28.0%
Distinct9733
Distinct (%)97.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-23T01:49:21.299070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length60
Median length42
Mean length7.2342
Min length1

Characters and Unicode

Total characters72342
Distinct characters827
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9504 ?
Unique (%)95.0%

Sample

1st row㈜아이앤에이테크
2nd row(주)E1(임차:화성융릉LPG충전소이성진(1981.02.13)
3rd row은성정밀
4th row주식회사 에드캠투
5th row원진상사
ValueCountFrequency (%)
주식회사 253
 
2.3%
제2공장 27
 
0.2%
농업회사법인 26
 
0.2%
2공장 23
 
0.2%
현대오일뱅크㈜직영 22
 
0.2%
육군 19
 
0.2%
한국가스공사 16
 
0.1%
의료법인 10
 
0.1%
유한회사 10
 
0.1%
8
 
0.1%
Other values (10220) 10799
96.3%
2024-03-23T01:49:22.604530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3396
 
4.7%
( 2822
 
3.9%
) 2814
 
3.9%
2776
 
3.8%
1728
 
2.4%
1706
 
2.4%
1446
 
2.0%
1215
 
1.7%
1114
 
1.5%
1046
 
1.4%
Other values (817) 52279
72.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 59776
82.6%
Open Punctuation 2857
 
3.9%
Close Punctuation 2849
 
3.9%
Other Symbol 2776
 
3.8%
Decimal Number 1406
 
1.9%
Space Separator 1215
 
1.7%
Uppercase Letter 995
 
1.4%
Other Punctuation 239
 
0.3%
Lowercase Letter 134
 
0.2%
Dash Punctuation 84
 
0.1%
Other values (2) 11
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3396
 
5.7%
1728
 
2.9%
1706
 
2.9%
1446
 
2.4%
1114
 
1.9%
1046
 
1.7%
1003
 
1.7%
982
 
1.6%
948
 
1.6%
941
 
1.6%
Other values (739) 45466
76.1%
Uppercase Letter
ValueCountFrequency (%)
S 120
 
12.1%
C 103
 
10.4%
K 87
 
8.7%
E 64
 
6.4%
G 61
 
6.1%
P 53
 
5.3%
T 51
 
5.1%
N 49
 
4.9%
A 48
 
4.8%
I 43
 
4.3%
Other values (16) 316
31.8%
Lowercase Letter
ValueCountFrequency (%)
s 15
11.2%
e 14
10.4%
o 13
 
9.7%
a 12
 
9.0%
t 9
 
6.7%
n 9
 
6.7%
r 9
 
6.7%
i 7
 
5.2%
l 7
 
5.2%
f 5
 
3.7%
Other values (10) 34
25.4%
Decimal Number
ValueCountFrequency (%)
1 321
22.8%
2 270
19.2%
3 142
10.1%
0 141
10.0%
7 119
 
8.5%
8 113
 
8.0%
6 93
 
6.6%
5 90
 
6.4%
9 72
 
5.1%
4 45
 
3.2%
Other Punctuation
ValueCountFrequency (%)
. 98
41.0%
: 67
28.0%
& 35
 
14.6%
, 15
 
6.3%
# 11
 
4.6%
/ 5
 
2.1%
? 3
 
1.3%
' 2
 
0.8%
· 2
 
0.8%
1
 
0.4%
Math Symbol
ValueCountFrequency (%)
+ 4
57.1%
< 1
 
14.3%
> 1
 
14.3%
~ 1
 
14.3%
Open Punctuation
ValueCountFrequency (%)
( 2822
98.8%
[ 35
 
1.2%
Close Punctuation
ValueCountFrequency (%)
) 2814
98.8%
] 35
 
1.2%
Other Symbol
ValueCountFrequency (%)
2776
100.0%
Space Separator
ValueCountFrequency (%)
1215
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 84
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 62552
86.5%
Common 8661
 
12.0%
Latin 1129
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3396
 
5.4%
2776
 
4.4%
1728
 
2.8%
1706
 
2.7%
1446
 
2.3%
1114
 
1.8%
1046
 
1.7%
1003
 
1.6%
982
 
1.6%
948
 
1.5%
Other values (740) 46407
74.2%
Latin
ValueCountFrequency (%)
S 120
 
10.6%
C 103
 
9.1%
K 87
 
7.7%
E 64
 
5.7%
G 61
 
5.4%
P 53
 
4.7%
T 51
 
4.5%
N 49
 
4.3%
A 48
 
4.3%
I 43
 
3.8%
Other values (36) 450
39.9%
Common
ValueCountFrequency (%)
( 2822
32.6%
) 2814
32.5%
1215
14.0%
1 321
 
3.7%
2 270
 
3.1%
3 142
 
1.6%
0 141
 
1.6%
7 119
 
1.4%
8 113
 
1.3%
. 98
 
1.1%
Other values (21) 606
 
7.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 59776
82.6%
ASCII 9787
 
13.5%
None 2779
 
3.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3396
 
5.7%
1728
 
2.9%
1706
 
2.9%
1446
 
2.4%
1114
 
1.9%
1046
 
1.7%
1003
 
1.7%
982
 
1.6%
948
 
1.6%
941
 
1.6%
Other values (739) 45466
76.1%
ASCII
ValueCountFrequency (%)
( 2822
28.8%
) 2814
28.8%
1215
12.4%
1 321
 
3.3%
2 270
 
2.8%
3 142
 
1.5%
0 141
 
1.4%
S 120
 
1.2%
7 119
 
1.2%
8 113
 
1.2%
Other values (65) 1710
17.5%
None
ValueCountFrequency (%)
2776
99.9%
· 2
 
0.1%
1
 
< 0.1%

업종명
Text

MISSING 

Distinct3134
Distinct (%)33.3%
Missing585
Missing (%)5.9%
Memory size156.2 KiB
2024-03-23T01:49:23.724685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length96
Median length70
Mean length14.169304
Min length2

Characters and Unicode

Total characters133404
Distinct characters437
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2165 ?
Unique (%)23.0%

Sample

1st row인쇄회로기판용 적층판 제조업, 경성 인쇄회로기판 제조업, 연성 및 기타 인쇄회로기판제조업(
2nd row자동차세차업(95213)
3rd row주형 및 금형 제조업
4th row기타기초무기화학물질제조업
5th row기타(폐기물처리업)(38210)
ValueCountFrequency (%)
1736
 
7.8%
제조업 1718
 
7.7%
기타 1572
 
7.0%
자동차 639
 
2.9%
373
 
1.7%
목재가구 291
 
1.3%
플라스틱 257
 
1.2%
248
 
1.1%
자동차세차업(95213 227
 
1.0%
세차업(95213 227
 
1.0%
Other values (3354) 15044
67.4%
2024-03-23T01:49:24.922777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
13098
 
9.8%
8378
 
6.3%
6662
 
5.0%
2 6307
 
4.7%
5337
 
4.0%
( 4583
 
3.4%
) 4581
 
3.4%
4409
 
3.3%
1 4263
 
3.2%
9 3229
 
2.4%
Other values (427) 72557
54.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 88167
66.1%
Decimal Number 21875
 
16.4%
Space Separator 13098
 
9.8%
Open Punctuation 4583
 
3.4%
Close Punctuation 4581
 
3.4%
Other Punctuation 1085
 
0.8%
Uppercase Letter 7
 
< 0.1%
Dash Punctuation 4
 
< 0.1%
Lowercase Letter 3
 
< 0.1%
Connector Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8378
 
9.5%
6662
 
7.6%
5337
 
6.1%
4409
 
5.0%
2955
 
3.4%
2800
 
3.2%
2510
 
2.8%
2333
 
2.6%
1851
 
2.1%
1515
 
1.7%
Other values (397) 49417
56.0%
Decimal Number
ValueCountFrequency (%)
2 6307
28.8%
1 4263
19.5%
9 3229
14.8%
3 2457
 
11.2%
0 1755
 
8.0%
5 1444
 
6.6%
4 936
 
4.3%
7 589
 
2.7%
8 574
 
2.6%
6 321
 
1.5%
Other Punctuation
ValueCountFrequency (%)
, 1001
92.3%
/ 40
 
3.7%
· 22
 
2.0%
. 10
 
0.9%
? 6
 
0.6%
: 4
 
0.4%
; 2
 
0.2%
Uppercase Letter
ValueCountFrequency (%)
X 3
42.9%
R 1
 
14.3%
L 1
 
14.3%
N 1
 
14.3%
G 1
 
14.3%
Lowercase Letter
ValueCountFrequency (%)
r 1
33.3%
a 1
33.3%
y 1
33.3%
Space Separator
ValueCountFrequency (%)
13098
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4583
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4581
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 88167
66.1%
Common 45227
33.9%
Latin 10
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8378
 
9.5%
6662
 
7.6%
5337
 
6.1%
4409
 
5.0%
2955
 
3.4%
2800
 
3.2%
2510
 
2.8%
2333
 
2.6%
1851
 
2.1%
1515
 
1.7%
Other values (397) 49417
56.0%
Common
ValueCountFrequency (%)
13098
29.0%
2 6307
13.9%
( 4583
 
10.1%
) 4581
 
10.1%
1 4263
 
9.4%
9 3229
 
7.1%
3 2457
 
5.4%
0 1755
 
3.9%
5 1444
 
3.2%
, 1001
 
2.2%
Other values (12) 2509
 
5.5%
Latin
ValueCountFrequency (%)
X 3
30.0%
r 1
 
10.0%
R 1
 
10.0%
a 1
 
10.0%
y 1
 
10.0%
L 1
 
10.0%
N 1
 
10.0%
G 1
 
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 88157
66.1%
ASCII 45215
33.9%
None 22
 
< 0.1%
Compat Jamo 10
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
13098
29.0%
2 6307
13.9%
( 4583
 
10.1%
) 4581
 
10.1%
1 4263
 
9.4%
9 3229
 
7.1%
3 2457
 
5.4%
0 1755
 
3.9%
5 1444
 
3.2%
, 1001
 
2.2%
Other values (19) 2497
 
5.5%
Hangul
ValueCountFrequency (%)
8378
 
9.5%
6662
 
7.6%
5337
 
6.1%
4409
 
5.0%
2955
 
3.4%
2800
 
3.2%
2510
 
2.8%
2333
 
2.6%
1851
 
2.1%
1515
 
1.7%
Other values (395) 49407
56.0%
None
ValueCountFrequency (%)
· 22
100.0%
Compat Jamo
ValueCountFrequency (%)
9
90.0%
1
 
10.0%
Distinct5264
Distinct (%)52.7%
Missing3
Missing (%)< 0.1%
Memory size156.2 KiB
2024-03-23T01:49:25.631975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length3
Mean length3.6684005
Min length2

Characters and Unicode

Total characters36673
Distinct characters367
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4806 ?
Unique (%)48.1%

Sample

1st row대표이사
2nd row구자룡
3rd row이용하
4th row대표이사
5th row김태윤
ValueCountFrequency (%)
대표이사 3969
38.8%
부대장 101
 
1.0%
대표자 58
 
0.6%
26
 
0.3%
총장 16
 
0.2%
이사장 12
 
0.1%
1명 12
 
0.1%
조합장 11
 
0.1%
1 10
 
0.1%
병원장 9
 
0.1%
Other values (5331) 6005
58.7%
2024-03-23T01:49:26.850778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5018
 
13.7%
4403
 
12.0%
4202
 
11.5%
4159
 
11.3%
1200
 
3.3%
517
 
1.4%
485
 
1.3%
475
 
1.3%
475
 
1.3%
364
 
1.0%
Other values (357) 15375
41.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 35665
97.3%
Decimal Number 260
 
0.7%
Space Separator 253
 
0.7%
Open Punctuation 171
 
0.5%
Close Punctuation 170
 
0.5%
Other Punctuation 87
 
0.2%
Uppercase Letter 47
 
0.1%
Dash Punctuation 14
 
< 0.1%
Other Symbol 4
 
< 0.1%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5018
 
14.1%
4403
 
12.3%
4202
 
11.8%
4159
 
11.7%
1200
 
3.4%
517
 
1.4%
485
 
1.4%
475
 
1.3%
475
 
1.3%
364
 
1.0%
Other values (319) 14367
40.3%
Uppercase Letter
ValueCountFrequency (%)
N 10
21.3%
I 7
14.9%
S 4
 
8.5%
A 4
 
8.5%
H 3
 
6.4%
G 3
 
6.4%
O 3
 
6.4%
L 2
 
4.3%
J 2
 
4.3%
Y 2
 
4.3%
Other values (7) 7
14.9%
Decimal Number
ValueCountFrequency (%)
1 120
46.2%
2 34
 
13.1%
3 24
 
9.2%
7 16
 
6.2%
8 14
 
5.4%
0 14
 
5.4%
6 12
 
4.6%
5 12
 
4.6%
9 9
 
3.5%
4 5
 
1.9%
Other Punctuation
ValueCountFrequency (%)
, 71
81.6%
: 11
 
12.6%
. 5
 
5.7%
Open Punctuation
ValueCountFrequency (%)
( 162
94.7%
[ 9
 
5.3%
Close Punctuation
ValueCountFrequency (%)
) 161
94.7%
] 9
 
5.3%
Space Separator
ValueCountFrequency (%)
253
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 14
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%
Math Symbol
ValueCountFrequency (%)
> 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 35668
97.3%
Common 957
 
2.6%
Latin 47
 
0.1%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5018
 
14.1%
4403
 
12.3%
4202
 
11.8%
4159
 
11.7%
1200
 
3.4%
517
 
1.4%
485
 
1.4%
475
 
1.3%
475
 
1.3%
364
 
1.0%
Other values (319) 14370
40.3%
Common
ValueCountFrequency (%)
253
26.4%
( 162
16.9%
) 161
16.8%
1 120
12.5%
, 71
 
7.4%
2 34
 
3.6%
3 24
 
2.5%
7 16
 
1.7%
8 14
 
1.5%
0 14
 
1.5%
Other values (10) 88
 
9.2%
Latin
ValueCountFrequency (%)
N 10
21.3%
I 7
14.9%
S 4
 
8.5%
A 4
 
8.5%
H 3
 
6.4%
G 3
 
6.4%
O 3
 
6.4%
L 2
 
4.3%
J 2
 
4.3%
Y 2
 
4.3%
Other values (7) 7
14.9%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 35664
97.2%
ASCII 1004
 
2.7%
None 4
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
5018
 
14.1%
4403
 
12.3%
4202
 
11.8%
4159
 
11.7%
1200
 
3.4%
517
 
1.4%
485
 
1.4%
475
 
1.3%
475
 
1.3%
364
 
1.0%
Other values (318) 14366
40.3%
ASCII
ValueCountFrequency (%)
253
25.2%
( 162
16.1%
) 161
16.0%
1 120
12.0%
, 71
 
7.1%
2 34
 
3.4%
3 24
 
2.4%
7 16
 
1.6%
8 14
 
1.4%
0 14
 
1.4%
Other values (27) 135
13.4%
None
ValueCountFrequency (%)
4
100.0%
CJK
ValueCountFrequency (%)
1
100.0%

관할기관명
Categorical

HIGH CORRELATION 

Distinct34
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
광역환경관리사업소
1416 
화성시
1375 
김포시
825 
포천시
719 
파주시
 
496
Other values (29)
5169 

Length

Max length9
Median length3
Mean length4.0302
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row광역환경관리사업소
2nd row화성시
3rd row광역환경관리사업소
4th row광역환경관리사업소
5th row안성시

Common Values

ValueCountFrequency (%)
광역환경관리사업소 1416
14.2%
화성시 1375
13.8%
김포시 825
 
8.2%
포천시 719
 
7.2%
파주시 496
 
5.0%
안산시 477
 
4.8%
평택시 458
 
4.6%
용인시 433
 
4.3%
안성시 428
 
4.3%
광주시 402
 
4.0%
Other values (24) 2971
29.7%

Length

2024-03-23T01:49:27.234626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
광역환경관리사업소 1416
14.2%
화성시 1375
13.8%
김포시 825
 
8.2%
포천시 719
 
7.2%
파주시 496
 
5.0%
안산시 477
 
4.8%
평택시 458
 
4.6%
용인시 433
 
4.3%
안성시 428
 
4.3%
광주시 402
 
4.0%
Other values (24) 2971
29.7%

폐수관리등급
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct16
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
4342 
우수
3053 
일반
2480 
중점
 
73
일반(신규)
 
13
Other values (11)
 
39

Length

Max length9
Median length2
Mean length2.8899
Min length2

Unique

Unique5 ?
Unique (%)< 0.1%

Sample

1st row우수
2nd row일반
3rd row우수
4th row우수
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 4342
43.4%
우수 3053
30.5%
일반 2480
24.8%
중점 73
 
0.7%
일반(신규) 13
 
0.1%
일반(21) 10
 
0.1%
우수(자율) 9
 
0.1%
일반(22) 8
 
0.1%
일반(19) 3
 
< 0.1%
일반(20) 2
 
< 0.1%
Other values (6) 7
 
0.1%

Length

2024-03-23T01:49:27.729329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 4342
43.4%
우수 3053
30.5%
일반 2480
24.8%
중점 73
 
0.7%
일반(신규 13
 
0.1%
일반(21 10
 
0.1%
우수(자율 10
 
0.1%
일반(22 8
 
0.1%
일반(19 3
 
< 0.1%
일반(20 2
 
< 0.1%
Other values (5) 6
 
0.1%

대기관리등급
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct14
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
4093 
우수
2911 
일반
2890 
중점
 
46
우수(자율)
 
19
Other values (9)
 
41

Length

Max length9
Median length2
Mean length2.8435
Min length2

Unique

Unique4 ?
Unique (%)< 0.1%

Sample

1st row우수
2nd row<NA>
3rd row<NA>
4th row우수
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 4093
40.9%
우수 2911
29.1%
일반 2890
28.9%
중점 46
 
0.5%
우수(자율) 19
 
0.2%
일반(21) 10
 
0.1%
일반(신규) 9
 
0.1%
일반(19) 8
 
0.1%
일반(22) 8
 
0.1%
중점(20) 2
 
< 0.1%
Other values (4) 4
 
< 0.1%

Length

2024-03-23T01:49:28.115321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 4093
40.9%
우수 2911
29.1%
일반 2890
28.9%
중점 46
 
0.5%
우수(자율 19
 
0.2%
일반(21 10
 
0.1%
일반(신규 9
 
0.1%
일반(19 8
 
0.1%
일반(22 8
 
0.1%
중점(20 2
 
< 0.1%
Other values (4) 4
 
< 0.1%

폐수종별구분명
Categorical

IMBALANCE 

Distinct9
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
5
5386 
<NA>
3939 
5종
 
245
4
 
233
3
 
108
Other values (4)
 
89

Length

Max length4
Median length1
Mean length2.2069
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3
2nd row5
3rd row5
4th row5
5th row5

Common Values

ValueCountFrequency (%)
5 5386
53.9%
<NA> 3939
39.4%
5종 245
 
2.5%
4 233
 
2.3%
3 108
 
1.1%
2 46
 
0.5%
1 36
 
0.4%
3종 4
 
< 0.1%
4종 3
 
< 0.1%

Length

2024-03-23T01:49:28.633041image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-23T01:49:28.984892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
5 5386
53.9%
na 3939
39.4%
5종 245
 
2.5%
4 233
 
2.3%
3 108
 
1.1%
2 46
 
0.5%
1 36
 
0.4%
3종 4
 
< 0.1%
4종 3
 
< 0.1%
Distinct9
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
5
3966 
<NA>
3592 
4
1877 
5종
 
243
3
 
136
Other values (4)
 
186

Length

Max length4
Median length1
Mean length2.1087
Min length1

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row5
2nd row<NA>
3rd row<NA>
4th row5
5th row5

Common Values

ValueCountFrequency (%)
5 3966
39.7%
<NA> 3592
35.9%
4 1877
18.8%
5종 243
 
2.4%
3 136
 
1.4%
4종 67
 
0.7%
1 59
 
0.6%
2 59
 
0.6%
3종 1
 
< 0.1%

Length

2024-03-23T01:49:29.366962image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-23T01:49:29.739369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
5 3966
39.7%
na 3592
35.9%
4 1877
18.8%
5종 243
 
2.4%
3 136
 
1.4%
4종 67
 
0.7%
1 59
 
0.6%
2 59
 
0.6%
3종 1
 
< 0.1%

소재지우편번호
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct2146
Distinct (%)22.0%
Missing259
Missing (%)2.6%
Infinite0
Infinite (%)0.0%
Mean14426.776
Minimum10001
Maximum18635
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-23T01:49:30.288897image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10001
5-th percentile10037
Q111168
median15074
Q317540
95-th percentile18577
Maximum18635
Range8634
Interquartile range (IQR)6372

Descriptive statistics

Standard deviation3081.988
Coefficient of variation (CV)0.21362971
Kurtosis-1.5459954
Mean14426.776
Median Absolute Deviation (MAD)3028
Skewness-0.040190929
Sum1.4053122 × 108
Variance9498650.2
MonotonicityNot monotonic
2024-03-23T01:49:30.800792image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10049 105
 
1.1%
11167 88
 
0.9%
10040 56
 
0.6%
18574 55
 
0.5%
10028 55
 
0.5%
18623 53
 
0.5%
10048 50
 
0.5%
18583 48
 
0.5%
10029 48
 
0.5%
15429 43
 
0.4%
Other values (2136) 9140
91.4%
(Missing) 259
 
2.6%
ValueCountFrequency (%)
10001 1
 
< 0.1%
10003 24
0.2%
10004 1
 
< 0.1%
10005 3
 
< 0.1%
10007 1
 
< 0.1%
10008 18
0.2%
10009 17
0.2%
10010 42
0.4%
10011 42
0.4%
10012 17
0.2%
ValueCountFrequency (%)
18635 8
 
0.1%
18634 6
 
0.1%
18633 20
0.2%
18632 12
0.1%
18631 13
0.1%
18630 18
0.2%
18629 2
 
< 0.1%
18628 21
0.2%
18627 24
0.2%
18626 16
0.2%
Distinct9649
Distinct (%)97.2%
Missing70
Missing (%)0.7%
Memory size156.2 KiB
2024-03-23T01:49:31.397104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length83
Median length72
Mean length23.301712
Min length14

Characters and Unicode

Total characters231386
Distinct characters474
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9434 ?
Unique (%)95.0%

Sample

1st row경기도 안산시 단원구 성곡동 600-4번지
2nd row경기도 화성시 안녕동 188-594, -576, -510
3rd row경기도 김포시 양촌읍 학운리 2979번지
4th row경기도 안산시 단원구 성곡동 778-9번지 시화공단4바 903호
5th row경기도 안성시 양성면 석화리 203-1번지
ValueCountFrequency (%)
경기도 9930
 
19.4%
화성시 1592
 
3.1%
김포시 994
 
1.9%
안산시 794
 
1.5%
포천시 778
 
1.5%
단원구 713
 
1.4%
평택시 580
 
1.1%
파주시 579
 
1.1%
안성시 474
 
0.9%
시흥시 465
 
0.9%
Other values (10674) 34366
67.0%
2024-03-23T01:49:32.565596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
41335
 
17.9%
10631
 
4.6%
10439
 
4.5%
10160
 
4.4%
9968
 
4.3%
- 9191
 
4.0%
1 8938
 
3.9%
6772
 
2.9%
2 6308
 
2.7%
6102
 
2.6%
Other values (464) 111542
48.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 132947
57.5%
Decimal Number 46023
 
19.9%
Space Separator 41335
 
17.9%
Dash Punctuation 9191
 
4.0%
Other Punctuation 1182
 
0.5%
Uppercase Letter 287
 
0.1%
Open Punctuation 184
 
0.1%
Close Punctuation 181
 
0.1%
Math Symbol 28
 
< 0.1%
Lowercase Letter 25
 
< 0.1%
Other values (2) 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10631
 
8.0%
10439
 
7.9%
10160
 
7.6%
9968
 
7.5%
6772
 
5.1%
6102
 
4.6%
6090
 
4.6%
5027
 
3.8%
3754
 
2.8%
2929
 
2.2%
Other values (410) 61075
45.9%
Uppercase Letter
ValueCountFrequency (%)
B 72
25.1%
A 45
15.7%
T 20
 
7.0%
L 19
 
6.6%
C 19
 
6.6%
V 18
 
6.3%
M 17
 
5.9%
I 16
 
5.6%
D 12
 
4.2%
K 11
 
3.8%
Other values (10) 38
13.2%
Decimal Number
ValueCountFrequency (%)
1 8938
19.4%
2 6308
13.7%
3 5186
11.3%
4 4665
10.1%
5 4150
9.0%
6 3995
8.7%
7 3598
7.8%
0 3165
 
6.9%
8 3144
 
6.8%
9 2874
 
6.2%
Lowercase Letter
ValueCountFrequency (%)
b 5
20.0%
e 4
16.0%
o 3
12.0%
t 3
12.0%
r 3
12.0%
n 2
 
8.0%
c 2
 
8.0%
i 1
 
4.0%
m 1
 
4.0%
s 1
 
4.0%
Other Punctuation
ValueCountFrequency (%)
, 1155
97.7%
. 17
 
1.4%
/ 9
 
0.8%
* 1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 183
99.5%
[ 1
 
0.5%
Close Punctuation
ValueCountFrequency (%)
) 180
99.4%
] 1
 
0.6%
Other Symbol
ValueCountFrequency (%)
1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
41335
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9191
100.0%
Math Symbol
ValueCountFrequency (%)
~ 28
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 132947
57.5%
Common 98126
42.4%
Latin 313
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10631
 
8.0%
10439
 
7.9%
10160
 
7.6%
9968
 
7.5%
6772
 
5.1%
6102
 
4.6%
6090
 
4.6%
5027
 
3.8%
3754
 
2.8%
2929
 
2.2%
Other values (410) 61075
45.9%
Latin
ValueCountFrequency (%)
B 72
23.0%
A 45
14.4%
T 20
 
6.4%
L 19
 
6.1%
C 19
 
6.1%
V 18
 
5.8%
M 17
 
5.4%
I 16
 
5.1%
D 12
 
3.8%
K 11
 
3.5%
Other values (21) 64
20.4%
Common
ValueCountFrequency (%)
41335
42.1%
- 9191
 
9.4%
1 8938
 
9.1%
2 6308
 
6.4%
3 5186
 
5.3%
4 4665
 
4.8%
5 4150
 
4.2%
6 3995
 
4.1%
7 3598
 
3.7%
0 3165
 
3.2%
Other values (13) 7595
 
7.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 132946
57.5%
ASCII 98436
42.5%
CJK Compat 2
 
< 0.1%
Compat Jamo 1
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
41335
42.0%
- 9191
 
9.3%
1 8938
 
9.1%
2 6308
 
6.4%
3 5186
 
5.3%
4 4665
 
4.7%
5 4150
 
4.2%
6 3995
 
4.1%
7 3598
 
3.7%
0 3165
 
3.2%
Other values (41) 7905
 
8.0%
Hangul
ValueCountFrequency (%)
10631
 
8.0%
10439
 
7.9%
10160
 
7.6%
9968
 
7.5%
6772
 
5.1%
6102
 
4.6%
6090
 
4.6%
5027
 
3.8%
3754
 
2.8%
2929
 
2.2%
Other values (409) 61074
45.9%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%
CJK Compat
ValueCountFrequency (%)
1
50.0%
1
50.0%
Distinct9516
Distinct (%)95.2%
Missing5
Missing (%)< 0.1%
Memory size156.2 KiB
2024-03-23T01:49:33.474508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length79
Median length62
Mean length21.48044
Min length13

Characters and Unicode

Total characters214697
Distinct characters484
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9163 ?
Unique (%)91.7%

Sample

1st row경기도 안산시 단원구 강촌로 149
2nd row경기도 화성시 안녕동 188-594, -576, -510
3rd row경기도 김포시 양촌읍 황금로 117
4th row경기도 안산시 단원구 첨단로 23
5th row경기도 안성시 양성면 한내로 55
ValueCountFrequency (%)
경기도 9995
 
20.3%
화성시 1596
 
3.2%
김포시 999
 
2.0%
안산시 804
 
1.6%
포천시 778
 
1.6%
단원구 722
 
1.5%
평택시 586
 
1.2%
파주시 579
 
1.2%
안성시 479
 
1.0%
시흥시 479
 
1.0%
Other values (8582) 32112
65.4%
2024-03-23T01:49:34.886642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
39134
 
18.2%
10501
 
4.9%
10488
 
4.9%
10386
 
4.8%
10262
 
4.8%
1 8411
 
3.9%
7538
 
3.5%
2 5990
 
2.8%
3 4816
 
2.2%
4473
 
2.1%
Other values (474) 102698
47.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 127814
59.5%
Decimal Number 42537
 
19.8%
Space Separator 39134
 
18.2%
Dash Punctuation 4287
 
2.0%
Other Punctuation 686
 
0.3%
Close Punctuation 87
 
< 0.1%
Open Punctuation 86
 
< 0.1%
Uppercase Letter 63
 
< 0.1%
Other Symbol 2
 
< 0.1%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10501
 
8.2%
10488
 
8.2%
10386
 
8.1%
10262
 
8.0%
7538
 
5.9%
4473
 
3.5%
3752
 
2.9%
3006
 
2.4%
2860
 
2.2%
2569
 
2.0%
Other values (440) 61979
48.5%
Uppercase Letter
ValueCountFrequency (%)
B 16
25.4%
L 13
20.6%
A 10
15.9%
V 4
 
6.3%
F 4
 
6.3%
T 4
 
6.3%
M 4
 
6.3%
E 2
 
3.2%
D 2
 
3.2%
C 2
 
3.2%
Decimal Number
ValueCountFrequency (%)
1 8411
19.8%
2 5990
14.1%
3 4816
11.3%
4 4037
9.5%
5 3792
8.9%
6 3484
8.2%
7 3254
 
7.6%
8 2989
 
7.0%
0 2914
 
6.9%
9 2850
 
6.7%
Other Punctuation
ValueCountFrequency (%)
, 642
93.6%
. 34
 
5.0%
/ 9
 
1.3%
? 1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 86
98.9%
] 1
 
1.1%
Open Punctuation
ValueCountFrequency (%)
( 85
98.8%
[ 1
 
1.2%
Other Symbol
ValueCountFrequency (%)
1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
39134
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4287
100.0%
Lowercase Letter
ValueCountFrequency (%)
b 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 127814
59.5%
Common 86819
40.4%
Latin 64
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10501
 
8.2%
10488
 
8.2%
10386
 
8.1%
10262
 
8.0%
7538
 
5.9%
4473
 
3.5%
3752
 
2.9%
3006
 
2.4%
2860
 
2.2%
2569
 
2.0%
Other values (440) 61979
48.5%
Common
ValueCountFrequency (%)
39134
45.1%
1 8411
 
9.7%
2 5990
 
6.9%
3 4816
 
5.5%
- 4287
 
4.9%
4 4037
 
4.6%
5 3792
 
4.4%
6 3484
 
4.0%
7 3254
 
3.7%
8 2989
 
3.4%
Other values (12) 6625
 
7.6%
Latin
ValueCountFrequency (%)
B 16
25.0%
L 13
20.3%
A 10
15.6%
V 4
 
6.2%
F 4
 
6.2%
T 4
 
6.2%
M 4
 
6.2%
E 2
 
3.1%
D 2
 
3.1%
C 2
 
3.1%
Other values (2) 3
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 127814
59.5%
ASCII 86881
40.5%
CJK Compat 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
39134
45.0%
1 8411
 
9.7%
2 5990
 
6.9%
3 4816
 
5.5%
- 4287
 
4.9%
4 4037
 
4.6%
5 3792
 
4.4%
6 3484
 
4.0%
7 3254
 
3.7%
8 2989
 
3.4%
Other values (22) 6687
 
7.7%
Hangul
ValueCountFrequency (%)
10501
 
8.2%
10488
 
8.2%
10386
 
8.1%
10262
 
8.0%
7538
 
5.9%
4473
 
3.5%
3752
 
2.9%
3006
 
2.4%
2860
 
2.2%
2569
 
2.0%
Other values (440) 61979
48.5%
CJK Compat
ValueCountFrequency (%)
1
50.0%
1
50.0%

WGS84위도
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct9066
Distinct (%)94.9%
Missing449
Missing (%)4.5%
Infinite0
Infinite (%)0.0%
Mean37.42983
Minimum36.918257
Maximum38.227873
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-23T01:49:35.354976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum36.918257
5-th percentile37.032174
Q137.183309
median37.34365
Q337.693491
95-th percentile37.887675
Maximum38.227873
Range1.3096157
Interquartile range (IQR)0.51018199

Descriptive statistics

Standard deviation0.28841537
Coefficient of variation (CV)0.0077054951
Kurtosis-1.1041319
Mean37.42983
Median Absolute Deviation (MAD)0.23082542
Skewness0.29899292
Sum357492.31
Variance0.083183428
MonotonicityNot monotonic
2024-03-23T01:49:36.071423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.438418567 15
 
0.1%
37.6176068263 13
 
0.1%
37.3201262611 8
 
0.1%
37.3195085191 8
 
0.1%
37.4035768384 7
 
0.1%
37.3112130404 7
 
0.1%
37.3081330042 6
 
0.1%
37.3213219086 6
 
0.1%
37.0844123125 6
 
0.1%
37.3166812357 5
 
0.1%
Other values (9056) 9470
94.7%
(Missing) 449
 
4.5%
ValueCountFrequency (%)
36.9182573487 1
< 0.1%
36.9193893861 1
< 0.1%
36.9217084057 1
< 0.1%
36.9217612592 1
< 0.1%
36.9219690535 1
< 0.1%
36.9265535251 1
< 0.1%
36.9271287059 1
< 0.1%
36.9276418087 1
< 0.1%
36.9284056491 1
< 0.1%
36.9288024308 1
< 0.1%
ValueCountFrequency (%)
38.2278730314 1
< 0.1%
38.2120939582 1
< 0.1%
38.2039946313 1
< 0.1%
38.1824337522 1
< 0.1%
38.1814072251 1
< 0.1%
38.1799326363 1
< 0.1%
38.1588693516 1
< 0.1%
38.15338299 1
< 0.1%
38.1402500882 1
< 0.1%
38.1373008591 1
< 0.1%

WGS84경도
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct9066
Distinct (%)94.9%
Missing449
Missing (%)4.5%
Infinite0
Infinite (%)0.0%
Mean126.97047
Minimum126.5317
Maximum127.7474
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-23T01:49:36.716151image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum126.5317
5-th percentile126.59079
Q1126.77917
median126.95265
Q3127.1672
95-th percentile127.39416
Maximum127.7474
Range1.2156951
Interquartile range (IQR)0.38802422

Descriptive statistics

Standard deviation0.24482478
Coefficient of variation (CV)0.0019282026
Kurtosis-0.43424604
Mean126.97047
Median Absolute Deviation (MAD)0.1829425
Skewness0.40545576
Sum1212694.9
Variance0.059939175
MonotonicityNot monotonic
2024-03-23T01:49:37.449644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
127.1780452538 15
 
0.1%
126.6201461202 13
 
0.1%
126.7904132393 8
 
0.1%
126.73747582 8
 
0.1%
127.1054632657 7
 
0.1%
126.7728248686 7
 
0.1%
126.7880885246 6
 
0.1%
126.7898828225 6
 
0.1%
126.8980854718 6
 
0.1%
126.7886173391 5
 
0.1%
Other values (9056) 9470
94.7%
(Missing) 449
 
4.5%
ValueCountFrequency (%)
126.531700025 1
< 0.1%
126.5336196447 1
< 0.1%
126.5366084503 1
< 0.1%
126.5366558283 1
< 0.1%
126.5395409071 1
< 0.1%
126.5405308431 1
< 0.1%
126.5405511106 1
< 0.1%
126.5407714161 1
< 0.1%
126.541096841 1
< 0.1%
126.5415804641 1
< 0.1%
ValueCountFrequency (%)
127.7473951497 1
< 0.1%
127.7434122913 1
< 0.1%
127.7393866828 1
< 0.1%
127.7390928435 1
< 0.1%
127.7296823258 1
< 0.1%
127.7227070207 1
< 0.1%
127.7168978736 1
< 0.1%
127.7167070153 1
< 0.1%
127.7151481804 1
< 0.1%
127.7119935392 1
< 0.1%

Interactions

2024-03-23T01:49:17.339538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T01:49:15.450258image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T01:49:16.354517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T01:49:17.653352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T01:49:15.703154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T01:49:16.651798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T01:49:17.946143image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T01:49:16.055681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T01:49:16.985818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-23T01:49:37.888791image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군명관할기관명폐수관리등급대기관리등급폐수종별구분명대기종별구분명소재지우편번호WGS84위도WGS84경도
시군명1.0000.9980.5410.5330.6870.6930.9940.9450.932
관할기관명0.9981.0000.5850.5770.7620.7770.9830.9300.917
폐수관리등급0.5410.5851.0000.9880.1300.1090.4390.4320.239
대기관리등급0.5330.5770.9881.0000.1890.0380.3990.3760.261
폐수종별구분명0.6870.7620.1300.1891.0000.8590.2940.3620.198
대기종별구분명0.6930.7770.1090.0380.8591.0000.2680.3780.237
소재지우편번호0.9940.9830.4390.3990.2940.2681.0000.9180.867
WGS84위도0.9450.9300.4320.3760.3620.3780.9181.0000.717
WGS84경도0.9320.9170.2390.2610.1980.2370.8670.7171.000
2024-03-23T01:49:38.440033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
대기관리등급관할기관명대기종별구분명폐수종별구분명시군명폐수관리등급
대기관리등급1.0000.2180.0170.0810.1970.934
관할기관명0.2181.0000.4370.4200.9350.210
대기종별구분명0.0170.4371.0000.4570.3520.049
폐수종별구분명0.0810.4200.4571.0000.3480.056
시군명0.1970.9350.3520.3481.0000.189
폐수관리등급0.9340.2100.0490.0560.1891.000
2024-03-23T01:49:38.834010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
소재지우편번호WGS84위도WGS84경도시군명관할기관명폐수관리등급대기관리등급폐수종별구분명대기종별구분명
소재지우편번호1.000-0.8880.2010.9450.8790.1790.1770.1450.131
WGS84위도-0.8881.000-0.1070.7190.6730.1760.1650.1820.191
WGS84경도0.201-0.1071.0000.6780.6370.0910.1110.0950.115
시군명0.9450.7190.6781.0000.9350.1890.1970.3480.352
관할기관명0.8790.6730.6370.9351.0000.2100.2180.4200.437
폐수관리등급0.1790.1760.0910.1890.2101.0000.9340.0560.049
대기관리등급0.1770.1650.1110.1970.2180.9341.0000.0810.017
폐수종별구분명0.1450.1820.0950.3480.4200.0560.0811.0000.457
대기종별구분명0.1310.1910.1150.3520.4370.0490.0170.4571.000

Missing values

2024-03-23T01:49:18.508110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-23T01:49:19.154002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-23T01:49:19.994807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

시군명사업장명업종명대표자명관할기관명폐수관리등급대기관리등급폐수종별구분명대기종별구분명소재지우편번호소재지지번주소소재지도로명주소WGS84위도WGS84경도
12220안산시㈜아이앤에이테크인쇄회로기판용 적층판 제조업, 경성 인쇄회로기판 제조업, 연성 및 기타 인쇄회로기판제조업(대표이사광역환경관리사업소우수우수3515428경기도 안산시 단원구 성곡동 600-4번지경기도 안산시 단원구 강촌로 14937.317064126.765715
28035화성시(주)E1(임차:화성융릉LPG충전소이성진(1981.02.13)자동차세차업(95213)구자룡화성시일반<NA>5<NA>18352경기도 화성시 안녕동 188-594, -576, -510경기도 화성시 안녕동 188-594, -576, -510<NA><NA>
5094김포시은성정밀주형 및 금형 제조업이용하광역환경관리사업소우수<NA>5<NA>10048경기도 김포시 양촌읍 학운리 2979번지경기도 김포시 양촌읍 황금로 11737.617607126.620146
12850안산시주식회사 에드캠투기타기초무기화학물질제조업대표이사광역환경관리사업소우수우수5515616경기도 안산시 단원구 성곡동 778-9번지 시화공단4바 903호경기도 안산시 단원구 첨단로 2337.313929126.726371
14119안성시원진상사기타(폐기물처리업)(38210)김태윤안성시<NA><NA>5517502경기도 안성시 양성면 석화리 203-1번지경기도 안성시 양성면 한내로 5537.060767127.201941
20694파주시현대교구기타 목재가구 제조업손영윤파주시<NA>우수<NA>4종10935경기도 파주시 조리읍 능안리 1530경기도 파주시 조리읍 탑삭골길 205-1137.734955126.793906
26173화성시(주)에스엘육류포장육및냉동육가공업(10122)이경희화성시중점일반5518608경기도 화성시 향남읍 화리현리 65-7경기도 화성시 향남읍 서봉로 395-8337.116263126.935586
26507화성시진흥테크놀러지주형및금형제조업(29294)김시제화성시일반<NA>5<NA>18522경기도 화성시 정남면 문학리 269-3경기도 화성시 정남면 문학로30번길 45-1137.152158126.966896
19176이천시(주)그린켐플라스틱 시트 및 판 제조업(22213)대표이사이천시<NA>우수<NA>517408경기도 이천시 모가면 양평리 17-6번지경기도 이천시 모가면 군들로 121-7137.176686127.498682
7413부천시엠앤테크도장 및 기타 피막처리업박형민부천시우수우수5514520경기도 부천시 원미구 도당동 24번지경기도 부천시 원미구 부천로357번길 3137.51555126.782909
시군명사업장명업종명대표자명관할기관명폐수관리등급대기관리등급폐수종별구분명대기종별구분명소재지우편번호소재지지번주소소재지도로명주소WGS84위도WGS84경도
29358화성시원플러스기타목재가구제조업(32029)박대성화성시<NA>일반<NA>418533경기도 화성시 팔탄면 가재리 140-1경기도 화성시 팔탄면 도이3길 39-6537.145792126.92593
3625김포시선광산업도장 및 기타 피막처리업임기욱김포시<NA>우수<NA>410046경기도 김포시 대곶면 대능리 285-2경기도 김포시 대곶면 대곶남로501번길 5937.63429126.597498
17755용인시㈜소모에너지 수지제일주유소세차업(95213)최병준용인시우수<NA>5<NA>16832경기도 용인시 수지구 풍덕천동 33-2번지경기도 용인시 수지구 수지로 35037.328413127.095416
23225포천시태성공예표면가공목재 및특정목적용제재목제조업고점덕포천시일반일반<NA>511192경기도 포천시 내촌면 음현리 813-6경기도 포천시 내촌면 금강로2047번길 4137.764664127.221483
28833화성시(주)대성아이앤티속도계및적산계기제조업(27214)대표이사화성시일반<NA>5<NA>18255경기도 화성시 남양읍 북양리 408경기도 화성시 남양읍 주석로80번길 26-1037.215088126.840222
1702광주시(주)풍산목재일반 제재업대표이사광주시<NA>우수<NA>412790경기도 광주시 추자동 291-7번지경기도 광주시 오포로 77037.360354127.226044
20490파주시(주)삼성특수브레이크자동차용 기타 신품 부품 제조업대표이사파주시<NA>우수<NA>4종10857경기도 파주시 탄현면 금승리 73-4경기도 파주시 탄현면 엘지로 45637.820857126.751344
21843평택시엔앤에프코스메틱㈜<NA>대표이사평택시우수우수5517745경기도 평택시 모곡동 441-10번지경기도 평택시 산단로16번길 8937.035108127.078649
2256광주시중부주유소자동차세차업(95213)전규상광주시일반<NA>5<NA>12812경기도 광주시 곤지암읍 곤지암리 221-6번지경기도 광주시 곤지암읍 경충대로 48737.34365127.351503
19344이천시(주)태한기타 판유리 가공품 제조업(23119)대표이사이천시<NA>일반<NA>517412경기도 이천시 설성면 수산리 71-3번지경기도 이천시 설성면 진상미로 697-8237.134157127.510009

Duplicate rows

Most frequently occurring

시군명사업장명업종명대표자명관할기관명폐수관리등급대기관리등급폐수종별구분명대기종별구분명소재지우편번호소재지지번주소소재지도로명주소WGS84위도WGS84경도# duplicates
0이천시이천시청<NA>이천시장환경안전관리과<NA><NA><NA>517379경기도 이천시 중리동 490번지경기도 이천시 부악로 4037.272641127.4351322