Overview

Dataset statistics

Number of variables8
Number of observations2347
Missing cells297
Missing cells (%)1.6%
Duplicate rows460
Duplicate rows (%)19.6%
Total size in memory149.1 KiB
Average record size in memory65.1 B

Variable types

Categorical3
Text5

Dataset

Description부산광역시금정구_사업장폐기물배출자신고현황_20220520
Author부산광역시 금정구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15060240

Alerts

데이터기준일 has constant value ""Constant
Dataset has 460 (19.6%) duplicate rowsDuplicates
폐기물구분 is highly overall correlated with 처리방법High correlation
처리방법 is highly overall correlated with 폐기물구분High correlation
처리방법 is highly imbalanced (59.0%)Imbalance
연락처 has 293 (12.5%) missing valuesMissing

Reproduction

Analysis started2023-12-10 16:35:15.755488
Analysis finished2023-12-10 16:35:17.060148
Duration1.3 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

폐기물구분
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size18.5 KiB
의료폐기물
1395 
지정폐기물
564 
사업장비배출시설계
303 
사업장배출시설계
 
85

Length

Max length9
Median length5
Mean length5.6250533
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row사업장비배출시설계
2nd row사업장비배출시설계
3rd row사업장비배출시설계
4th row사업장비배출시설계
5th row사업장비배출시설계

Common Values

ValueCountFrequency (%)
의료폐기물 1395
59.4%
지정폐기물 564
24.0%
사업장비배출시설계 303
 
12.9%
사업장배출시설계 85
 
3.6%

Length

2023-12-11T01:35:17.164811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:35:17.354897image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
의료폐기물 1395
59.4%
지정폐기물 564
24.0%
사업장비배출시설계 303
 
12.9%
사업장배출시설계 85
 
3.6%
Distinct876
Distinct (%)37.3%
Missing0
Missing (%)0.0%
Memory size18.5 KiB
2023-12-11T01:35:17.699248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length17
Mean length7.9680443
Min length3

Characters and Unicode

Total characters18701
Distinct characters432
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique641 ?
Unique (%)27.3%

Sample

1st row(사)부산컨트리클럽
2nd row(사)부산컨트리클럽
3rd row(사)부산컨트리클럽
4th row(사)부산컨트리클럽
5th row(사)부산컨트리클럽
ValueCountFrequency (%)
이륜자동차환경협회 30
 
1.3%
삼성물산(주)동래베네스트골프클럽 25
 
1.1%
부산대학교 24
 
1.0%
메드윌병원 21
 
0.9%
부산교통공사 19
 
0.8%
다움병원 18
 
0.8%
의료법인영파의료재단규림요양병원 18
 
0.8%
세웅병원 17
 
0.7%
신망애치매전문요양원 17
 
0.7%
의료법인영파의료재단마음향기병원 15
 
0.6%
Other values (866) 2143
91.3%
2023-12-11T01:35:18.282813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1177
 
6.3%
946
 
5.1%
707
 
3.8%
451
 
2.4%
430
 
2.3%
416
 
2.2%
358
 
1.9%
345
 
1.8%
343
 
1.8%
329
 
1.8%
Other values (422) 13199
70.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 17899
95.7%
Close Punctuation 322
 
1.7%
Open Punctuation 321
 
1.7%
Uppercase Letter 128
 
0.7%
Decimal Number 18
 
0.1%
Other Punctuation 9
 
< 0.1%
Space Separator 2
 
< 0.1%
Other Symbol 1
 
< 0.1%
Connector Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1177
 
6.6%
946
 
5.3%
707
 
3.9%
451
 
2.5%
430
 
2.4%
416
 
2.3%
358
 
2.0%
345
 
1.9%
343
 
1.9%
329
 
1.8%
Other values (392) 12397
69.3%
Uppercase Letter
ValueCountFrequency (%)
S 26
20.3%
H 19
14.8%
B 19
14.8%
M 12
9.4%
G 10
 
7.8%
K 8
 
6.2%
V 7
 
5.5%
L 7
 
5.5%
O 5
 
3.9%
J 3
 
2.3%
Other values (8) 12
9.4%
Decimal Number
ValueCountFrequency (%)
3 6
33.3%
1 4
22.2%
2 3
16.7%
5 2
 
11.1%
6 2
 
11.1%
0 1
 
5.6%
Close Punctuation
ValueCountFrequency (%)
) 322
100.0%
Open Punctuation
ValueCountFrequency (%)
( 321
100.0%
Other Punctuation
ValueCountFrequency (%)
. 9
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 17899
95.7%
Common 673
 
3.6%
Latin 128
 
0.7%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1177
 
6.6%
946
 
5.3%
707
 
3.9%
451
 
2.5%
430
 
2.4%
416
 
2.3%
358
 
2.0%
345
 
1.9%
343
 
1.9%
329
 
1.8%
Other values (392) 12397
69.3%
Latin
ValueCountFrequency (%)
S 26
20.3%
H 19
14.8%
B 19
14.8%
M 12
9.4%
G 10
 
7.8%
K 8
 
6.2%
V 7
 
5.5%
L 7
 
5.5%
O 5
 
3.9%
J 3
 
2.3%
Other values (8) 12
9.4%
Common
ValueCountFrequency (%)
) 322
47.8%
( 321
47.7%
. 9
 
1.3%
3 6
 
0.9%
1 4
 
0.6%
2 3
 
0.4%
5 2
 
0.3%
6 2
 
0.3%
2
 
0.3%
0 1
 
0.1%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 17898
95.7%
ASCII 801
 
4.3%
None 1
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1177
 
6.6%
946
 
5.3%
707
 
4.0%
451
 
2.5%
430
 
2.4%
416
 
2.3%
358
 
2.0%
345
 
1.9%
343
 
1.9%
329
 
1.8%
Other values (391) 12396
69.3%
ASCII
ValueCountFrequency (%)
) 322
40.2%
( 321
40.1%
S 26
 
3.2%
H 19
 
2.4%
B 19
 
2.4%
M 12
 
1.5%
G 10
 
1.2%
. 9
 
1.1%
K 8
 
1.0%
V 7
 
0.9%
Other values (19) 48
 
6.0%
None
ValueCountFrequency (%)
1
100.0%
CJK
ValueCountFrequency (%)
1
100.0%
Distinct826
Distinct (%)35.3%
Missing4
Missing (%)0.2%
Memory size18.5 KiB
2023-12-11T01:35:18.745959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length3
Mean length3.2210841
Min length2

Characters and Unicode

Total characters7547
Distinct characters209
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique616 ?
Unique (%)26.3%

Sample

1st row서정의
2nd row서정의
3rd row서정의
4th row서정의
5th row서정의
ValueCountFrequency (%)
학교장 82
 
3.5%
대학총장 57
 
2.4%
대표이사 46
 
2.0%
한대영 35
 
1.5%
김종천 33
 
1.4%
이옥렬 30
 
1.3%
이지연 27
 
1.2%
김은희 25
 
1.1%
박재훈 21
 
0.9%
총장 20
 
0.9%
Other values (816) 1967
84.0%
2023-12-11T01:35:19.372612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
460
 
6.1%
401
 
5.3%
292
 
3.9%
211
 
2.8%
189
 
2.5%
185
 
2.5%
174
 
2.3%
147
 
1.9%
140
 
1.9%
138
 
1.8%
Other values (199) 5210
69.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7478
99.1%
Decimal Number 37
 
0.5%
Connector Punctuation 23
 
0.3%
Other Punctuation 9
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
460
 
6.2%
401
 
5.4%
292
 
3.9%
211
 
2.8%
189
 
2.5%
185
 
2.5%
174
 
2.3%
147
 
2.0%
140
 
1.9%
138
 
1.8%
Other values (195) 5141
68.7%
Decimal Number
ValueCountFrequency (%)
1 25
67.6%
2 12
32.4%
Connector Punctuation
ValueCountFrequency (%)
_ 23
100.0%
Other Punctuation
ValueCountFrequency (%)
, 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7478
99.1%
Common 69
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
460
 
6.2%
401
 
5.4%
292
 
3.9%
211
 
2.8%
189
 
2.5%
185
 
2.5%
174
 
2.3%
147
 
2.0%
140
 
1.9%
138
 
1.8%
Other values (195) 5141
68.7%
Common
ValueCountFrequency (%)
1 25
36.2%
_ 23
33.3%
2 12
17.4%
, 9
 
13.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7478
99.1%
ASCII 69
 
0.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
460
 
6.2%
401
 
5.4%
292
 
3.9%
211
 
2.8%
189
 
2.5%
185
 
2.5%
174
 
2.3%
147
 
2.0%
140
 
1.9%
138
 
1.8%
Other values (195) 5141
68.7%
ASCII
ValueCountFrequency (%)
1 25
36.2%
_ 23
33.3%
2 12
17.4%
, 9
 
13.0%

연락처
Text

MISSING 

Distinct807
Distinct (%)39.3%
Missing293
Missing (%)12.5%
Memory size18.5 KiB
2023-12-11T01:35:19.752613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.002921
Min length12

Characters and Unicode

Total characters24654
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique599 ?
Unique (%)29.2%

Sample

1st row051-509-0707
2nd row051-509-0707
3rd row051-509-0707
4th row051-509-0707
5th row051-509-0707
ValueCountFrequency (%)
051-531-3020 30
 
1.5%
051-582-1234 28
 
1.4%
051-580-0342 22
 
1.1%
051-582-1664 18
 
0.9%
051-509-3000 18
 
0.9%
051-508-0011 18
 
0.9%
051-510-1169 17
 
0.8%
051-500-9700 17
 
0.8%
051-523-8680 16
 
0.8%
051-514-7737 16
 
0.8%
Other values (797) 1854
90.3%
2023-12-11T01:35:20.304991image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 4915
19.9%
1 4192
17.0%
- 4108
16.7%
0 3826
15.5%
2 1500
 
6.1%
7 1374
 
5.6%
8 1354
 
5.5%
3 1104
 
4.5%
4 826
 
3.4%
9 756
 
3.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 20546
83.3%
Dash Punctuation 4108
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 4915
23.9%
1 4192
20.4%
0 3826
18.6%
2 1500
 
7.3%
7 1374
 
6.7%
8 1354
 
6.6%
3 1104
 
5.4%
4 826
 
4.0%
9 756
 
3.7%
6 699
 
3.4%
Dash Punctuation
ValueCountFrequency (%)
- 4108
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 24654
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 4915
19.9%
1 4192
17.0%
- 4108
16.7%
0 3826
15.5%
2 1500
 
6.1%
7 1374
 
5.6%
8 1354
 
5.5%
3 1104
 
4.5%
4 826
 
3.4%
9 756
 
3.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 24654
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 4915
19.9%
1 4192
17.0%
- 4108
16.7%
0 3826
15.5%
2 1500
 
6.1%
7 1374
 
5.6%
8 1354
 
5.5%
3 1104
 
4.5%
4 826
 
3.4%
9 756
 
3.1%
Distinct62
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Memory size18.5 KiB
2023-12-11T01:35:20.595021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length69
Median length68
Mean length8.7959097
Min length2

Characters and Unicode

Total characters20644
Distinct characters189
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique19 ?
Unique (%)0.8%

Sample

1st row초본류
2nd row그밖의폐기물
3rd row임목폐목재(건설공사_산지개간등의과정에서발생된나무뿌리_가지_줄기등을말한다)
4th row그밖의폐기물
5th row그밖의폐기물
ValueCountFrequency (%)
지정폐기물 483
20.6%
의료폐기물 286
12.2%
그밖의폐기물 207
8.8%
일반의료폐기물 191
 
8.1%
병리계폐기물 187
 
8.0%
조직물류폐기물(태반을재활용하는경우는제외한다 182
 
7.8%
생물ㆍ화학폐기물 179
 
7.6%
혈액오염폐기물 160
 
6.8%
손상성폐기물 137
 
5.8%
격리의료폐기물 72
 
3.1%
Other values (52) 263
11.2%
2023-12-11T01:35:21.028885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2619
 
12.7%
2445
 
11.8%
2196
 
10.6%
877
 
4.2%
607
 
2.9%
555
 
2.7%
490
 
2.4%
413
 
2.0%
374
 
1.8%
357
 
1.7%
Other values (179) 9711
47.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 19794
95.9%
Close Punctuation 300
 
1.5%
Open Punctuation 300
 
1.5%
Lowercase Letter 102
 
0.5%
Decimal Number 78
 
0.4%
Connector Punctuation 70
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2619
 
13.2%
2445
 
12.4%
2196
 
11.1%
877
 
4.4%
607
 
3.1%
555
 
2.8%
490
 
2.5%
413
 
2.1%
374
 
1.9%
357
 
1.8%
Other values (163) 8861
44.8%
Lowercase Letter
ValueCountFrequency (%)
e 34
33.3%
g 17
16.7%
a 17
16.7%
s 17
16.7%
r 17
16.7%
Decimal Number
ValueCountFrequency (%)
2 33
42.3%
0 17
21.8%
8 14
17.9%
1 14
17.9%
Close Punctuation
ValueCountFrequency (%)
) 269
89.7%
] 17
 
5.7%
14
 
4.7%
Open Punctuation
ValueCountFrequency (%)
( 269
89.7%
[ 17
 
5.7%
14
 
4.7%
Connector Punctuation
ValueCountFrequency (%)
_ 70
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 19794
95.9%
Common 748
 
3.6%
Latin 102
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2619
 
13.2%
2445
 
12.4%
2196
 
11.1%
877
 
4.4%
607
 
3.1%
555
 
2.8%
490
 
2.5%
413
 
2.1%
374
 
1.9%
357
 
1.8%
Other values (163) 8861
44.8%
Common
ValueCountFrequency (%)
) 269
36.0%
( 269
36.0%
_ 70
 
9.4%
2 33
 
4.4%
[ 17
 
2.3%
] 17
 
2.3%
0 17
 
2.3%
8 14
 
1.9%
1 14
 
1.9%
14
 
1.9%
Latin
ValueCountFrequency (%)
e 34
33.3%
g 17
16.7%
a 17
16.7%
s 17
16.7%
r 17
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 19556
94.7%
ASCII 822
 
4.0%
Compat Jamo 238
 
1.2%
None 28
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2619
 
13.4%
2445
 
12.5%
2196
 
11.2%
877
 
4.5%
607
 
3.1%
555
 
2.8%
490
 
2.5%
413
 
2.1%
374
 
1.9%
357
 
1.8%
Other values (162) 8623
44.1%
ASCII
ValueCountFrequency (%)
) 269
32.7%
( 269
32.7%
_ 70
 
8.5%
e 34
 
4.1%
2 33
 
4.0%
g 17
 
2.1%
[ 17
 
2.1%
a 17
 
2.1%
s 17
 
2.1%
r 17
 
2.1%
Other values (4) 62
 
7.5%
Compat Jamo
ValueCountFrequency (%)
238
100.0%
None
ValueCountFrequency (%)
14
50.0%
14
50.0%

처리방법
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct26
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size18.5 KiB
중간처분(일반소각)
1447 
공동처리
483 
재활용(연료·고형연료제품제조)
 
84
재활용(원료제조)
 
77
중간처분(지방자치단체소각)
 
60
Other values (21)
196 

Length

Max length17
Median length10
Mean length9.2846187
Min length4

Unique

Unique7 ?
Unique (%)0.3%

Sample

1st row재활용(토질개선에사용)
2nd row중간처분(일반소각)
3rd row재활용(중간가공폐기물제조)
4th row중간처분(지방자치단체소각)
5th row매립(지방자치단체매립시설)

Common Values

ValueCountFrequency (%)
중간처분(일반소각) 1447
61.7%
공동처리 483
 
20.6%
재활용(연료·고형연료제품제조) 84
 
3.6%
재활용(원료제조) 77
 
3.3%
중간처분(지방자치단체소각) 60
 
2.6%
재활용(농업생산활동에사용) 54
 
2.3%
매립(지방자치단체매립시설) 45
 
1.9%
재활용(중간가공폐기물제조) 30
 
1.3%
중간처분(고온소각) 20
 
0.9%
매립(민간관리형매립시설) 11
 
0.5%
Other values (16) 36
 
1.5%

Length

2023-12-11T01:35:21.235377image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
중간처분(일반소각 1447
61.7%
공동처리 483
 
20.6%
재활용(연료·고형연료제품제조 84
 
3.6%
재활용(원료제조 77
 
3.3%
중간처분(지방자치단체소각 60
 
2.6%
재활용(농업생산활동에사용 54
 
2.3%
매립(지방자치단체매립시설 45
 
1.9%
재활용(중간가공폐기물제조 30
 
1.3%
중간처분(고온소각 20
 
0.9%
매립(민간관리형매립시설 11
 
0.5%
Other values (16) 36
 
1.5%

주소
Text

Distinct807
Distinct (%)34.4%
Missing0
Missing (%)0.0%
Memory size18.5 KiB
2023-12-11T01:35:21.558695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length46
Median length39
Mean length22.400085
Min length14

Characters and Unicode

Total characters52573
Distinct characters253
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique566 ?
Unique (%)24.1%

Sample

1st row부산광역시금정구중앙대로2327번길112(노포동)
2nd row부산광역시금정구중앙대로2327번길112(노포동)
3rd row부산광역시금정구중앙대로2327번길112(노포동)
4th row부산광역시금정구중앙대로2327번길112(노포동)
5th row부산광역시금정구중앙대로2327번길112(노포동)
ValueCountFrequency (%)
부산광역시금정구부산대학로63번길2(장전동 63
 
2.7%
부산광역시금정구중앙대로2238(노포동 35
 
1.5%
부산광역시금정구금샘로66(장전동 31
 
1.3%
부산광역시금정구금사로145-2(회동동 30
 
1.3%
부산광역시금정구금샘로17번안길45(장전동 27
 
1.2%
부산광역시금정구부산대학로63번길2-1(장전동 23
 
1.0%
부산광역시금정구하정로66(선동 22
 
0.9%
부산광역시금정구중앙대로1951(구서동 21
 
0.9%
부산광역시금정구부산대학로63번길2_부산대학교(장전동 20
 
0.9%
부산광역시금정구중앙대로1721(부곡동 20
 
0.9%
Other values (797) 2055
87.6%
2023-12-11T01:35:22.193922image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3247
 
6.2%
3101
 
5.9%
2961
 
5.6%
2930
 
5.6%
2699
 
5.1%
2526
 
4.8%
2399
 
4.6%
2376
 
4.5%
2364
 
4.5%
( 2271
 
4.3%
Other values (243) 25699
48.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 38314
72.9%
Decimal Number 8519
 
16.2%
Open Punctuation 2271
 
4.3%
Close Punctuation 2270
 
4.3%
Connector Punctuation 569
 
1.1%
Dash Punctuation 386
 
0.7%
Other Punctuation 201
 
0.4%
Uppercase Letter 39
 
0.1%
Math Symbol 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3247
 
8.5%
3101
 
8.1%
2961
 
7.7%
2930
 
7.6%
2699
 
7.0%
2526
 
6.6%
2399
 
6.3%
2376
 
6.2%
2364
 
6.2%
2264
 
5.9%
Other values (214) 11447
29.9%
Uppercase Letter
ValueCountFrequency (%)
B 13
33.3%
J 13
33.3%
F 4
 
10.3%
K 2
 
5.1%
D 1
 
2.6%
S 1
 
2.6%
V 1
 
2.6%
I 1
 
2.6%
A 1
 
2.6%
E 1
 
2.6%
Decimal Number
ValueCountFrequency (%)
1 1856
21.8%
2 1392
16.3%
3 792
9.3%
6 785
9.2%
4 760
8.9%
5 671
 
7.9%
0 621
 
7.3%
7 604
 
7.1%
9 544
 
6.4%
8 494
 
5.8%
Other Punctuation
ValueCountFrequency (%)
, 199
99.0%
/ 2
 
1.0%
Math Symbol
ValueCountFrequency (%)
~ 3
75.0%
1
 
25.0%
Open Punctuation
ValueCountFrequency (%)
( 2271
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2270
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 569
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 386
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 38314
72.9%
Common 14220
 
27.0%
Latin 39
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3247
 
8.5%
3101
 
8.1%
2961
 
7.7%
2930
 
7.6%
2699
 
7.0%
2526
 
6.6%
2399
 
6.3%
2376
 
6.2%
2364
 
6.2%
2264
 
5.9%
Other values (214) 11447
29.9%
Common
ValueCountFrequency (%)
( 2271
16.0%
) 2270
16.0%
1 1856
13.1%
2 1392
9.8%
3 792
 
5.6%
6 785
 
5.5%
4 760
 
5.3%
5 671
 
4.7%
0 621
 
4.4%
7 604
 
4.2%
Other values (8) 2198
15.5%
Latin
ValueCountFrequency (%)
B 13
33.3%
J 13
33.3%
F 4
 
10.3%
K 2
 
5.1%
D 1
 
2.6%
S 1
 
2.6%
V 1
 
2.6%
I 1
 
2.6%
A 1
 
2.6%
E 1
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 38314
72.9%
ASCII 14258
 
27.1%
Arrows 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3247
 
8.5%
3101
 
8.1%
2961
 
7.7%
2930
 
7.6%
2699
 
7.0%
2526
 
6.6%
2399
 
6.3%
2376
 
6.2%
2364
 
6.2%
2264
 
5.9%
Other values (214) 11447
29.9%
ASCII
ValueCountFrequency (%)
( 2271
15.9%
) 2270
15.9%
1 1856
13.0%
2 1392
9.8%
3 792
 
5.6%
6 785
 
5.5%
4 760
 
5.3%
5 671
 
4.7%
0 621
 
4.4%
7 604
 
4.2%
Other values (18) 2236
15.7%
Arrows
ValueCountFrequency (%)
1
100.0%

데이터기준일
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size18.5 KiB
20220520
2347 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row20220520
2nd row20220520
3rd row20220520
4th row20220520
5th row20220520

Common Values

ValueCountFrequency (%)
20220520 2347
100.0%

Length

2023-12-11T01:35:22.414665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:35:22.558698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
20220520 2347
100.0%

Correlations

2023-12-11T01:35:22.666847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
폐기물구분폐기물종류처리방법
폐기물구분1.0000.9940.948
폐기물종류0.9941.0000.980
처리방법0.9480.9801.000
2023-12-11T01:35:22.803369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
폐기물구분처리방법
폐기물구분1.0000.819
처리방법0.8191.000
2023-12-11T01:35:22.928607image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
폐기물구분처리방법
폐기물구분1.0000.819
처리방법0.8191.000

Missing values

2023-12-11T01:35:16.688221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:35:16.859241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T01:35:16.989222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

폐기물구분상호명대표자연락처폐기물종류처리방법주소데이터기준일
0사업장비배출시설계(사)부산컨트리클럽서정의051-509-0707초본류재활용(토질개선에사용)부산광역시금정구중앙대로2327번길112(노포동)20220520
1사업장비배출시설계(사)부산컨트리클럽서정의051-509-0707그밖의폐기물중간처분(일반소각)부산광역시금정구중앙대로2327번길112(노포동)20220520
2사업장비배출시설계(사)부산컨트리클럽서정의051-509-0707임목폐목재(건설공사_산지개간등의과정에서발생된나무뿌리_가지_줄기등을말한다)재활용(중간가공폐기물제조)부산광역시금정구중앙대로2327번길112(노포동)20220520
3사업장비배출시설계(사)부산컨트리클럽서정의051-509-0707그밖의폐기물중간처분(지방자치단체소각)부산광역시금정구중앙대로2327번길112(노포동)20220520
4사업장비배출시설계(사)부산컨트리클럽서정의051-509-0707그밖의폐기물매립(지방자치단체매립시설)부산광역시금정구중앙대로2327번길112(노포동)20220520
5사업장비배출시설계(사)부산컨트리클럽서정의051-509-0707그밖의폐기물재활용(연료·고형연료제품제조)부산광역시금정구중앙대로2327번길112(노포동)20220520
6사업장비배출시설계(사)부산컨트리클럽서정의051-509-0707음식물류폐기물재활용(농업생산활동에사용)부산광역시금정구중앙대로2327번길112(노포동)20220520
7사업장비배출시설계(사)부산컨트리클럽서정의051-509-0707음식물류폐기물재활용(농업생산활동에사용)부산광역시금정구중앙대로2327번길112(노포동)20220520
8사업장비배출시설계(사)부산컨트리클럽서정의051-509-0707음식물류폐기물재활용(농업생산활동에사용)부산광역시금정구중앙대로2327번길112(노포동)20220520
9사업장비배출시설계(주)서원유통탑마트금사점이원길_김기민051-527-2222그밖의폐기물매립(지방자치단체매립시설)부산광역시금정구공단서로22(금사동)20220520
폐기물구분상호명대표자연락처폐기물종류처리방법주소데이터기준일
2337의료폐기물제일한방병원탁의수051-526-7551의료폐기물중간처분(일반소각)부산광역시금정구서동로172,제일한방병원(서동)20220520
2338의료폐기물남광노인전문요양시설하회원박도영051-508-2894의료폐기물중간처분(일반소각)부산광역시금정구중앙대로2349번길3,남광사회복지회(노포동)20220520
2339의료폐기물남광노인복지센터박해영051-508-0380의료폐기물중간처분(일반소각)부산광역시금정구중앙대로2349번길3,남광사회복지회(노포동)20220520
2340의료폐기물항사랑외과정재한051-517-7582의료폐기물중간처분(일반소각)부산광역시금정구중앙대로1841번길23,현대타운(구서동)20220520
2341의료폐기물프라임치과설영훈051-585-7582의료폐기물중간처분(일반소각)부산광역시금정구중앙대로1941,태일빌딩(구서동)20220520
2342의료폐기물심홍택치과의원심홍택051-514-3301의료폐기물중간처분(일반소각)부산광역시금정구금강로565(구서동)20220520
2343의료폐기물우송한의원은상두051-583-0633의료폐기물중간처분(일반소각)부산광역시금정구중앙대로2095(남산동)20220520
2344의료폐기물보광한의원배정후051-513-8787의료폐기물중간처분(일반소각)부산광역시금정구금샘로585,다솔빌딩2층(남산동)20220520
2345의료폐기물풀잎향기한의원손성식051-532-1835의료폐기물중간처분(일반소각)부산광역시금정구중앙대로1641(부곡동)20220520
2346의료폐기물금정노인요양원김영051-508-8822의료폐기물중간처분(일반소각)부산광역시금정구청룡예전로43번길25(청룡동)20220520

Duplicate rows

Most frequently occurring

폐기물구분상호명대표자연락처폐기물종류처리방법주소데이터기준일# duplicates
456지정폐기물이륜자동차환경협회이옥렬051-531-3020폐윤활유(「자원의절약과재활용촉진에관한법률시행령」제18조에따른재활용의무대상제품ㆍ포장재인기어유및내연기관용윤활유를말한다)재활용(연료·고형연료제품제조)부산광역시금정구금사로145-2(회동동)2022052011
428지정폐기물(주)일월운수홍정길051-523-8680지정폐기물공동처리부산광역시금정구반송로406-7(금사동)202205208
429지정폐기물(주)태화산업홍정길051-523-8680지정폐기물공동처리부산광역시금정구반송로406-7(금사동)202205208
430지정폐기물GM대우바로정비코너금정점심윤태051-513-3391지정폐기물공동처리부산광역시금정구금정로216(구서동)202205208
431지정폐기물금호고속주식회사고속사업부박삼구외051-508-8885지정폐기물공동처리부산광역시금정구중앙대로2238(노포동)202205208
445지정폐기물삼화피티에스주식회사(서동)현영희051-523-7211지정폐기물공동처리부산광역시금정구동현로121(서동)202205208
446지정폐기물삼화피티에스주식회사(회동동)현영희051-522-5996지정폐기물공동처리부산광역시금정구수원지로22번길46(회동동)202205208
447지정폐기물신한교통주형구051-522-7755지정폐기물공동처리부산광역시금정구반송로400(서동)202205208
448지정폐기물신한택시주형구051-522-7755지정폐기물공동처리부산광역시금정구반송로406-10(서동)202205208
451지정폐기물용진상사백용래051-581-1762지정폐기물공동처리부산광역시금정구무학송로69(부곡동)202205208