Overview

Dataset statistics

Number of variables4
Number of observations2735
Missing cells0
Missing cells (%)0.0%
Duplicate rows572
Duplicate rows (%)20.9%
Total size in memory85.6 KiB
Average record size in memory32.0 B

Variable types

Categorical1
Text3

Dataset

Description김해시 사업장 폐기물 배출자 신고현황에 대한 데이터로 폐기물 구분, 업체명, 주소, 폐기물 종류의 항목을 제공합니다.
Author경상남도 김해시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15060327

Alerts

Dataset has 572 (20.9%) duplicate rowsDuplicates

Reproduction

Analysis started2024-03-13 00:11:28.148684
Analysis finished2024-03-13 00:11:28.618020
Duration0.47 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

폐기물구분
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size21.5 KiB
사업장일반폐기물
1772 
지정폐기물
963 

Length

Max length8
Median length8
Mean length6.9436929
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row사업장일반폐기물
2nd row사업장일반폐기물
3rd row사업장일반폐기물
4th row사업장일반폐기물
5th row사업장일반폐기물

Common Values

ValueCountFrequency (%)
사업장일반폐기물 1772
64.8%
지정폐기물 963
35.2%

Length

2024-03-13T09:11:28.680842image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-13T09:11:28.791078image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사업장일반폐기물 1772
64.8%
지정폐기물 963
35.2%

상호
Text

Distinct1033
Distinct (%)37.8%
Missing0
Missing (%)0.0%
Memory size21.5 KiB
2024-03-13T09:11:28.988602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length22
Mean length9.0084095
Min length2

Characters and Unicode

Total characters24638
Distinct characters451
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique492 ?
Unique (%)18.0%

Sample

1st row(주)제이비글로벌테크
2nd row진양오토모티브(주)김해공장
3rd row진양오토모티브(주)김해2공장
4th row주식회사 비알피
5th row주식회사 비알피
ValueCountFrequency (%)
주식회사 156
 
4.5%
의료법인 38
 
1.1%
김해지점 36
 
1.0%
김해공장 28
 
0.8%
인제대학교 19
 
0.5%
김해사업소 18
 
0.5%
삼협환경협회 16
 
0.5%
자원화처리시설 16
 
0.5%
영남자동차전문정비사업조합 15
 
0.4%
주)굴렁쇠자동차해체재활용산업 15
 
0.4%
Other values (1096) 3100
89.7%
2024-03-13T09:11:29.316530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1707
 
6.9%
( 1623
 
6.6%
) 1623
 
6.6%
722
 
2.9%
503
 
2.0%
495
 
2.0%
484
 
2.0%
478
 
1.9%
474
 
1.9%
438
 
1.8%
Other values (441) 16091
65.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 20235
82.1%
Open Punctuation 1623
 
6.6%
Close Punctuation 1623
 
6.6%
Space Separator 722
 
2.9%
Uppercase Letter 275
 
1.1%
Decimal Number 86
 
0.3%
Lowercase Letter 35
 
0.1%
Other Punctuation 21
 
0.1%
Dash Punctuation 16
 
0.1%
Connector Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1707
 
8.4%
503
 
2.5%
495
 
2.4%
484
 
2.4%
478
 
2.4%
474
 
2.3%
438
 
2.2%
367
 
1.8%
321
 
1.6%
319
 
1.6%
Other values (397) 14649
72.4%
Uppercase Letter
ValueCountFrequency (%)
C 33
12.0%
E 26
 
9.5%
M 24
 
8.7%
T 24
 
8.7%
K 22
 
8.0%
H 20
 
7.3%
S 17
 
6.2%
N 12
 
4.4%
P 11
 
4.0%
R 11
 
4.0%
Other values (14) 75
27.3%
Decimal Number
ValueCountFrequency (%)
2 29
33.7%
1 18
20.9%
5 12
14.0%
3 10
 
11.6%
7 9
 
10.5%
6 8
 
9.3%
Lowercase Letter
ValueCountFrequency (%)
o 10
28.6%
a 5
14.3%
p 5
14.3%
m 5
14.3%
y 5
14.3%
n 5
14.3%
Other Punctuation
ValueCountFrequency (%)
. 17
81.0%
& 3
 
14.3%
/ 1
 
4.8%
Open Punctuation
ValueCountFrequency (%)
( 1623
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1623
100.0%
Space Separator
ValueCountFrequency (%)
722
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 16
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 20235
82.1%
Common 4093
 
16.6%
Latin 310
 
1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1707
 
8.4%
503
 
2.5%
495
 
2.4%
484
 
2.4%
478
 
2.4%
474
 
2.3%
438
 
2.2%
367
 
1.8%
321
 
1.6%
319
 
1.6%
Other values (397) 14649
72.4%
Latin
ValueCountFrequency (%)
C 33
 
10.6%
E 26
 
8.4%
M 24
 
7.7%
T 24
 
7.7%
K 22
 
7.1%
H 20
 
6.5%
S 17
 
5.5%
N 12
 
3.9%
P 11
 
3.5%
R 11
 
3.5%
Other values (20) 110
35.5%
Common
ValueCountFrequency (%)
( 1623
39.7%
) 1623
39.7%
722
17.6%
2 29
 
0.7%
1 18
 
0.4%
. 17
 
0.4%
- 16
 
0.4%
5 12
 
0.3%
3 10
 
0.2%
7 9
 
0.2%
Other values (4) 14
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 20235
82.1%
ASCII 4403
 
17.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1707
 
8.4%
503
 
2.5%
495
 
2.4%
484
 
2.4%
478
 
2.4%
474
 
2.3%
438
 
2.2%
367
 
1.8%
321
 
1.6%
319
 
1.6%
Other values (397) 14649
72.4%
ASCII
ValueCountFrequency (%)
( 1623
36.9%
) 1623
36.9%
722
16.4%
C 33
 
0.7%
2 29
 
0.7%
E 26
 
0.6%
M 24
 
0.5%
T 24
 
0.5%
K 22
 
0.5%
H 20
 
0.5%
Other values (34) 257
 
5.8%
Distinct1021
Distinct (%)37.3%
Missing0
Missing (%)0.0%
Memory size21.5 KiB
2024-03-13T09:11:29.521739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length34
Mean length23.644607
Min length17

Characters and Unicode

Total characters64668
Distinct characters188
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique476 ?
Unique (%)17.4%

Sample

1st row경상남도 김해시 진례면 고모로180번길 96
2nd row경상남도 김해시 진영읍 서부로179번길 38
3rd row경상남도 김해시 진례면 테크노밸리로 108
4th row경상남도 김해시 주촌면 서부로1541번안길 10-1
5th row경상남도 김해시 주촌면 서부로1541번안길 10-1
ValueCountFrequency (%)
경상남도 2735
20.3%
김해시 2735
20.3%
한림면 555
 
4.1%
주촌면 379
 
2.8%
진영읍 353
 
2.6%
진례면 276
 
2.0%
생림면 274
 
2.0%
상동면 208
 
1.5%
김해대로 176
 
1.3%
안동 90
 
0.7%
Other values (1110) 5717
42.4%
2024-03-13T09:11:29.843147image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
11636
18.0%
3184
 
4.9%
3171
 
4.9%
3014
 
4.7%
1 2830
 
4.4%
2755
 
4.3%
2745
 
4.2%
2735
 
4.2%
2735
 
4.2%
1855
 
2.9%
Other values (178) 28008
43.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 37843
58.5%
Decimal Number 13127
 
20.3%
Space Separator 11636
 
18.0%
Dash Punctuation 1461
 
2.3%
Open Punctuation 271
 
0.4%
Close Punctuation 271
 
0.4%
Connector Punctuation 56
 
0.1%
Uppercase Letter 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3184
 
8.4%
3171
 
8.4%
3014
 
8.0%
2755
 
7.3%
2745
 
7.3%
2735
 
7.2%
2735
 
7.2%
1855
 
4.9%
1717
 
4.5%
1149
 
3.0%
Other values (161) 12783
33.8%
Decimal Number
ValueCountFrequency (%)
1 2830
21.6%
2 1790
13.6%
3 1553
11.8%
5 1101
 
8.4%
9 1079
 
8.2%
6 1071
 
8.2%
4 1006
 
7.7%
0 981
 
7.5%
7 951
 
7.2%
8 765
 
5.8%
Uppercase Letter
ValueCountFrequency (%)
A 2
66.7%
F 1
33.3%
Space Separator
ValueCountFrequency (%)
11636
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1461
100.0%
Open Punctuation
ValueCountFrequency (%)
( 271
100.0%
Close Punctuation
ValueCountFrequency (%)
) 271
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 56
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 37843
58.5%
Common 26822
41.5%
Latin 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3184
 
8.4%
3171
 
8.4%
3014
 
8.0%
2755
 
7.3%
2745
 
7.3%
2735
 
7.2%
2735
 
7.2%
1855
 
4.9%
1717
 
4.5%
1149
 
3.0%
Other values (161) 12783
33.8%
Common
ValueCountFrequency (%)
11636
43.4%
1 2830
 
10.6%
2 1790
 
6.7%
3 1553
 
5.8%
- 1461
 
5.4%
5 1101
 
4.1%
9 1079
 
4.0%
6 1071
 
4.0%
4 1006
 
3.8%
0 981
 
3.7%
Other values (5) 2314
 
8.6%
Latin
ValueCountFrequency (%)
A 2
66.7%
F 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 37843
58.5%
ASCII 26825
41.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
11636
43.4%
1 2830
 
10.5%
2 1790
 
6.7%
3 1553
 
5.8%
- 1461
 
5.4%
5 1101
 
4.1%
9 1079
 
4.0%
6 1071
 
4.0%
4 1006
 
3.8%
0 981
 
3.7%
Other values (7) 2317
 
8.6%
Hangul
ValueCountFrequency (%)
3184
 
8.4%
3171
 
8.4%
3014
 
8.0%
2755
 
7.3%
2745
 
7.3%
2735
 
7.2%
2735
 
7.2%
1855
 
4.9%
1717
 
4.5%
1149
 
3.0%
Other values (161) 12783
33.8%
Distinct152
Distinct (%)5.6%
Missing0
Missing (%)0.0%
Memory size21.5 KiB
2024-03-13T09:11:30.104526image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length84
Median length66
Mean length18.783547
Min length1

Characters and Unicode

Total characters51373
Distinct characters248
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique39 ?
Unique (%)1.4%

Sample

1st row폐합성수지류(폐염화비닐수지류는 제외한다)
2nd row폐합성수지류(폐염화비닐수지류는 제외한다)
3rd row폐합성수지류(폐염화비닐수지류는 제외한다)
4th row폐합성수지류(폐염화비닐수지류는 제외한다)
5th row폐합성수지류(폐염화비닐수지류는 제외한다)
ValueCountFrequency (%)
757
 
8.9%
밖의 755
 
8.8%
제외한다 648
 
7.6%
폐합성수지류(폐염화비닐수지류는 542
 
6.3%
폐유 355
 
4.2%
말한다 316
 
3.7%
242
 
2.8%
등을 197
 
2.3%
분진 177
 
2.1%
수용성절삭유 153
 
1.8%
Other values (239) 4406
51.5%
2024-03-13T09:11:30.479139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5878
 
11.4%
3485
 
6.8%
1933
 
3.8%
1540
 
3.0%
1470
 
2.9%
1359
 
2.6%
1302
 
2.5%
1286
 
2.5%
1095
 
2.1%
1048
 
2.0%
Other values (238) 30977
60.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 41274
80.3%
Space Separator 5878
 
11.4%
Open Punctuation 1189
 
2.3%
Close Punctuation 1189
 
2.3%
Lowercase Letter 918
 
1.8%
Decimal Number 468
 
0.9%
Connector Punctuation 447
 
0.9%
Other Punctuation 10
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3485
 
8.4%
1933
 
4.7%
1540
 
3.7%
1470
 
3.6%
1359
 
3.3%
1302
 
3.2%
1286
 
3.1%
1095
 
2.7%
1048
 
2.5%
1000
 
2.4%
Other values (218) 25756
62.4%
Decimal Number
ValueCountFrequency (%)
2 238
50.9%
0 153
32.7%
1 41
 
8.8%
8 34
 
7.3%
4 1
 
0.2%
7 1
 
0.2%
Lowercase Letter
ValueCountFrequency (%)
e 306
33.3%
g 153
16.7%
r 153
16.7%
a 153
16.7%
s 153
16.7%
Open Punctuation
ValueCountFrequency (%)
( 1002
84.3%
[ 153
 
12.9%
34
 
2.9%
Close Punctuation
ValueCountFrequency (%)
) 1002
84.3%
] 153
 
12.9%
34
 
2.9%
Space Separator
ValueCountFrequency (%)
5878
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 447
100.0%
Other Punctuation
ValueCountFrequency (%)
· 10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 41274
80.3%
Common 9181
 
17.9%
Latin 918
 
1.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3485
 
8.4%
1933
 
4.7%
1540
 
3.7%
1470
 
3.6%
1359
 
3.3%
1302
 
3.2%
1286
 
3.1%
1095
 
2.7%
1048
 
2.5%
1000
 
2.4%
Other values (218) 25756
62.4%
Common
ValueCountFrequency (%)
5878
64.0%
( 1002
 
10.9%
) 1002
 
10.9%
_ 447
 
4.9%
2 238
 
2.6%
[ 153
 
1.7%
] 153
 
1.7%
0 153
 
1.7%
1 41
 
0.4%
34
 
0.4%
Other values (5) 80
 
0.9%
Latin
ValueCountFrequency (%)
e 306
33.3%
g 153
16.7%
r 153
16.7%
a 153
16.7%
s 153
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 40439
78.7%
ASCII 10021
 
19.5%
Compat Jamo 835
 
1.6%
None 78
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5878
58.7%
( 1002
 
10.0%
) 1002
 
10.0%
_ 447
 
4.5%
e 306
 
3.1%
2 238
 
2.4%
[ 153
 
1.5%
g 153
 
1.5%
r 153
 
1.5%
a 153
 
1.5%
Other values (7) 536
 
5.3%
Hangul
ValueCountFrequency (%)
3485
 
8.6%
1933
 
4.8%
1540
 
3.8%
1470
 
3.6%
1359
 
3.4%
1302
 
3.2%
1286
 
3.2%
1095
 
2.7%
1048
 
2.6%
1000
 
2.5%
Other values (217) 24921
61.6%
Compat Jamo
ValueCountFrequency (%)
835
100.0%
None
ValueCountFrequency (%)
34
43.6%
34
43.6%
· 10
 
12.8%

Missing values

2024-03-13T09:11:28.522929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-13T09:11:28.588627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

폐기물구분상호사업장도로명주소폐기물 종류
0사업장일반폐기물(주)제이비글로벌테크경상남도 김해시 진례면 고모로180번길 96폐합성수지류(폐염화비닐수지류는 제외한다)
1사업장일반폐기물진양오토모티브(주)김해공장경상남도 김해시 진영읍 서부로179번길 38폐합성수지류(폐염화비닐수지류는 제외한다)
2사업장일반폐기물진양오토모티브(주)김해2공장경상남도 김해시 진례면 테크노밸리로 108폐합성수지류(폐염화비닐수지류는 제외한다)
3사업장일반폐기물주식회사 비알피경상남도 김해시 주촌면 서부로1541번안길 10-1폐합성수지류(폐염화비닐수지류는 제외한다)
4사업장일반폐기물주식회사 비알피경상남도 김해시 주촌면 서부로1541번안길 10-1폐합성수지류(폐염화비닐수지류는 제외한다)
5사업장일반폐기물주식회사 비알피경상남도 김해시 주촌면 서부로1541번안길 10-1폐합성수지류(폐염화비닐수지류는 제외한다)
6사업장일반폐기물주식회사 더가림경상남도 김해시 한림면 한림로 204그 밖의 폐목재류
7사업장일반폐기물시대희망복지재단 김해사업소경상남도 김해시 한림면 장방로 232-25폐합성수지류(폐염화비닐수지류는 제외한다)
8사업장일반폐기물(주)유노그린텍 (한림지점)경상남도 김해시 한림면 장방로 157-3그 밖의 폐기물
9사업장일반폐기물(주)유노그린텍 (한림지점)경상남도 김해시 한림면 장방로 157-3그 밖의 폐기물
폐기물구분상호사업장도로명주소폐기물 종류
2725지정폐기물한국전력공사부산울산지역본부경상남도 김해시 한림면 명동리 272-3폐절연유(폴리클로리네이티드비페닐 함유 폐기물을 제외한다)
2726지정폐기물한국전력공사부산울산지역본부경상남도 김해시 한림면 명동리 272-3그 밖의 폐알칼리
2727지정폐기물한국전력공사부산울산지역본부경상남도 김해시 한림면 명동리 272-3폐황산이 포함된 2차폐축전지
2728지정폐기물(주)동남공영개발경상남도 김해시 생림면 봉림리 산 196그 밖의 폐광물유[아스팔트유ㆍ그리스(grease)ㆍ방청유 및 수용성절삭유_ 20퍼센트 이상의 이물질이 함유된 폐유_ 고체상태의 폐유 등을 말한다]
2729지정폐기물(주)동남공영개발경상남도 김해시 생림면 봉림리 산 196폐윤활유(「자원의 절약과 재활용촉진에 관한 법률 시행령」 제18조에 따른 재활용의무 대상 제품ㆍ포장재인 기어유 및 내연기관용 윤활유를 말한다)
2730지정폐기물동부엔텍(주) 김해소각사업소경상남도 김해시 부곡동 490 김해시폐기물소각시설비금속성폐촉매
2731지정폐기물동부엔텍(주) 김해소각사업소경상남도 김해시 부곡동 490 김해시폐기물소각시설생활폐기물 소각시설 비산재
2732지정폐기물동부엔텍(주) 김해소각사업소경상남도 김해시 부곡동 490 김해시폐기물소각시설폐절연유(폴리클로리네이티드비페닐 함유 폐기물을 제외한다)
2733지정폐기물동부엔텍(주) 김해소각사업소경상남도 김해시 부곡동 490 김해시폐기물소각시설폐기계유ㆍ폐작동유(공업용 기계유ㆍ냉동기유ㆍ터어빈유ㆍ베어링윤활유ㆍ압축기유ㆍ유압작동유ㆍ열매체유 및 프로세스유 등을 말한다)
2734지정폐기물(주)김해환경경상남도 김해시 흥동 38-5폐윤활유(「자원의 절약과 재활용촉진에 관한 법률 시행령」 제18조에 따른 재활용의무 대상 제품ㆍ포장재인 기어유 및 내연기관용 윤활유를 말한다)

Duplicate rows

Most frequently occurring

폐기물구분상호사업장도로명주소폐기물 종류# duplicates
340지정폐기물(주)해반디에이치 김해지점경상남도 김해시 삼계동 1007-2그 밖의 폐유12
2사업장일반폐기물(주)HKM경상남도 김해시 상동면 묵방로120번길 20그 밖의 분진8
115사업장일반폐기물(주)자연(김해시 음식물류폐기물 자원화처리시설)경상남도 김해시 진영읍 김해대로 832-68중간가공음식물류폐기물8
5사업장일반폐기물(주)거산경상남도 김해시 한림면 김해대로 1102-171폐합성수지류(폐염화비닐수지류는 제외한다)7
254사업장일반폐기물주식회사 명송경상남도 김해시 한림면 김해대로1538번길 101폐합성수지류(폐염화비닐수지류는 제외한다)7
282사업장일반폐기물한국전력공사 부산울산지역본부경상남도 김해시 한림면 한림로 63그 밖의 폐합성고분자화합물(합성수지류로 피복된 폐전선을 포함한다)7
286사업장일반폐기물한성기업(주)김해공장경상남도 김해시 삼안로 51 (안동)그 밖의 폐수처리오니7
24사업장일반폐기물(주)나경 알켄즈김해지사경상남도 김해시 진영읍 가산로 91폐합성수지류(폐염화비닐수지류는 제외한다)6
45사업장일반폐기물(주)대하에코텍경상남도 김해시 상동면 묵방로 197폐합성수지류(폐염화비닐수지류는 제외한다)6
145사업장일반폐기물(주)호경경상남도 김해시 생림면 나전로 87-13그 밖의 분진6