Overview

Dataset statistics

Number of variables21
Number of observations3153
Missing cells4616
Missing cells (%)7.0%
Duplicate rows4
Duplicate rows (%)0.1%
Total size in memory526.7 KiB
Average record size in memory171.0 B

Variable types

Text9
DateTime3
Categorical6
Numeric3

Dataset

Description충청남도 논산시 사업장폐기물배출자 신고현황(인허가관리번호, 신고번호, 상호, 대표자, 전화번호, 신고일, 생활계구분, 폐기물종류, 배출량, 처리구분, 처리업소명, 처리방법, 연처리량, 도로명주소, 지번주소, 업무구분, 월별배출량, 성상구분, 폐기물자가처리방법, 공사시작일자, 공사종료일자)
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=382&beforeMenuCd=DOM_000000201001001000&publicdatapk=15060384

Alerts

Dataset has 4 (0.1%) duplicate rowsDuplicates
생활계구분 is highly imbalanced (69.1%)Imbalance
처리구분 is highly imbalanced (58.5%)Imbalance
처리방법 is highly imbalanced (66.6%)Imbalance
업무구분 is highly imbalanced (85.3%)Imbalance
성상구분 is highly imbalanced (69.5%)Imbalance
폐기물자가처리방법 is highly imbalanced (66.6%)Imbalance
공사시작일자 has 2318 (73.5%) missing valuesMissing
공사종료일자 has 2290 (72.6%) missing valuesMissing
배출량(톤) is highly skewed (γ1 = 27.17163153)Skewed
처리량(톤/연) is highly skewed (γ1 = 36.43663438)Skewed
월별배출량 is highly skewed (γ1 = 37.57294778)Skewed
배출량(톤) has 2464 (78.1%) zerosZeros
처리량(톤/연) has 2358 (74.8%) zerosZeros
월별배출량 has 2919 (92.6%) zerosZeros

Reproduction

Analysis started2024-01-09 21:10:08.377098
Analysis finished2024-01-09 21:10:09.463020
Duration1.09 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct2989
Distinct (%)94.8%
Missing0
Missing (%)0.0%
Memory size24.8 KiB
2024-01-10T06:10:09.589574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length21
Mean length21
Min length21

Characters and Unicode

Total characters66213
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2867 ?
Unique (%)90.9%

Sample

1st row4540000-32-1994-00001
2nd row4540000-32-2000-00001
3rd row4540000-32-2000-00002
4th row4540000-32-2000-00003
5th row4540000-32-2000-00004
ValueCountFrequency (%)
4540000-32-2005-00149 5
 
0.2%
4540000-32-2012-00024 4
 
0.1%
4540000-32-2012-00012 4
 
0.1%
4540000-32-2012-00011 4
 
0.1%
4540000-32-2014-00011 4
 
0.1%
4540000-32-2000-00147 4
 
0.1%
4540000-32-2009-00016 4
 
0.1%
4540000-32-2015-00025 3
 
0.1%
4540000-32-2000-00043 3
 
0.1%
4540000-32-2000-00063 3
 
0.1%
Other values (2979) 3115
98.8%
2024-01-10T06:10:09.985239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 27053
40.9%
- 9459
 
14.3%
2 7955
 
12.0%
4 7659
 
11.6%
3 4938
 
7.5%
5 4150
 
6.3%
1 2490
 
3.8%
8 642
 
1.0%
9 633
 
1.0%
7 618
 
0.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 56754
85.7%
Dash Punctuation 9459
 
14.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 27053
47.7%
2 7955
 
14.0%
4 7659
 
13.5%
3 4938
 
8.7%
5 4150
 
7.3%
1 2490
 
4.4%
8 642
 
1.1%
9 633
 
1.1%
7 618
 
1.1%
6 616
 
1.1%
Dash Punctuation
ValueCountFrequency (%)
- 9459
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 66213
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 27053
40.9%
- 9459
 
14.3%
2 7955
 
12.0%
4 7659
 
11.6%
3 4938
 
7.5%
5 4150
 
6.3%
1 2490
 
3.8%
8 642
 
1.0%
9 633
 
1.0%
7 618
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 66213
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 27053
40.9%
- 9459
 
14.3%
2 7955
 
12.0%
4 7659
 
11.6%
3 4938
 
7.5%
5 4150
 
6.3%
1 2490
 
3.8%
8 642
 
1.0%
9 633
 
1.0%
7 618
 
0.9%
Distinct2874
Distinct (%)91.2%
Missing0
Missing (%)0.0%
Memory size24.8 KiB
2024-01-10T06:10:10.283808image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length8.8667935
Min length1

Characters and Unicode

Total characters27957
Distinct characters19
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2647 ?
Unique (%)84.0%

Sample

1st row1994-00001
2nd row1
3rd row2
4th row3
5th row4
ValueCountFrequency (%)
147 5
 
0.2%
2005-00149 5
 
0.2%
386 4
 
0.1%
사업-524 4
 
0.1%
사업-510 4
 
0.1%
2014-00011 4
 
0.1%
2012-00012 4
 
0.1%
146 4
 
0.1%
145 3
 
0.1%
사업2020-16 3
 
0.1%
Other values (2864) 3113
98.7%
2024-01-10T06:10:10.709861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 10801
38.6%
2 4328
15.5%
- 2799
 
10.0%
1 2201
 
7.9%
3 1639
 
5.9%
4 1462
 
5.2%
5 1125
 
4.0%
6 686
 
2.5%
8 661
 
2.4%
7 629
 
2.2%
Other values (9) 1626
 
5.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 24155
86.4%
Dash Punctuation 2799
 
10.0%
Other Letter 1003
 
3.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 10801
44.7%
2 4328
17.9%
1 2201
 
9.1%
3 1639
 
6.8%
4 1462
 
6.1%
5 1125
 
4.7%
6 686
 
2.8%
8 661
 
2.7%
7 629
 
2.6%
9 623
 
2.6%
Other Letter
ValueCountFrequency (%)
483
48.2%
457
45.6%
22
 
2.2%
22
 
2.2%
12
 
1.2%
3
 
0.3%
3
 
0.3%
1
 
0.1%
Dash Punctuation
ValueCountFrequency (%)
- 2799
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 26954
96.4%
Hangul 1003
 
3.6%

Most frequent character per script

Common
ValueCountFrequency (%)
0 10801
40.1%
2 4328
16.1%
- 2799
 
10.4%
1 2201
 
8.2%
3 1639
 
6.1%
4 1462
 
5.4%
5 1125
 
4.2%
6 686
 
2.5%
8 661
 
2.5%
7 629
 
2.3%
Hangul
ValueCountFrequency (%)
483
48.2%
457
45.6%
22
 
2.2%
22
 
2.2%
12
 
1.2%
3
 
0.3%
3
 
0.3%
1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 26954
96.4%
Hangul 1003
 
3.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 10801
40.1%
2 4328
16.1%
- 2799
 
10.4%
1 2201
 
8.2%
3 1639
 
6.1%
4 1462
 
5.4%
5 1125
 
4.2%
6 686
 
2.5%
8 661
 
2.5%
7 629
 
2.3%
Hangul
ValueCountFrequency (%)
483
48.2%
457
45.6%
22
 
2.2%
22
 
2.2%
12
 
1.2%
3
 
0.3%
3
 
0.3%
1
 
0.1%

상호
Text

Distinct1205
Distinct (%)38.2%
Missing1
Missing (%)< 0.1%
Memory size24.8 KiB
2024-01-10T06:10:10.945952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length21
Mean length5.6059645
Min length1

Characters and Unicode

Total characters17670
Distinct characters413
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique863 ?
Unique (%)27.4%

Sample

1st row놀뫼환경(주)
2nd row(주)동원주택건설
3rd row대석건설(주)
4th row대석건설(주)
5th row대석건설(주)
ValueCountFrequency (%)
개인 110
 
4.5%
논산시청 89
 
3.6%
주)와이엠종합건설 43
 
1.8%
놀뫼환경(주 39
 
1.6%
육군훈련소 29
 
1.2%
유)하얀건설 25
 
1.0%
성일종합건설(주 24
 
1.0%
한국도로공사논산지사 24
 
1.0%
덕수건설(주 22
 
0.9%
고려환경개발(주 22
 
0.9%
Other values (1214) 2023
82.6%
2024-01-10T06:10:11.293523image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1557
 
8.8%
) 1537
 
8.7%
( 1508
 
8.5%
1080
 
6.1%
968
 
5.5%
866
 
4.9%
457
 
2.6%
347
 
2.0%
333
 
1.9%
277
 
1.6%
Other values (403) 8740
49.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 13664
77.3%
Close Punctuation 1537
 
8.7%
Open Punctuation 1508
 
8.5%
Space Separator 866
 
4.9%
Decimal Number 39
 
0.2%
Other Symbol 28
 
0.2%
Uppercase Letter 23
 
0.1%
Dash Punctuation 3
 
< 0.1%
Lowercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1557
 
11.4%
1080
 
7.9%
968
 
7.1%
457
 
3.3%
347
 
2.5%
333
 
2.4%
277
 
2.0%
275
 
2.0%
248
 
1.8%
235
 
1.7%
Other values (376) 7887
57.7%
Uppercase Letter
ValueCountFrequency (%)
T 5
21.7%
A 4
17.4%
S 3
13.0%
P 2
 
8.7%
B 2
 
8.7%
D 1
 
4.3%
Q 1
 
4.3%
C 1
 
4.3%
E 1
 
4.3%
R 1
 
4.3%
Other values (2) 2
 
8.7%
Decimal Number
ValueCountFrequency (%)
1 12
30.8%
2 9
23.1%
0 6
15.4%
6 3
 
7.7%
3 3
 
7.7%
5 3
 
7.7%
7 2
 
5.1%
8 1
 
2.6%
Lowercase Letter
ValueCountFrequency (%)
i 1
50.0%
s 1
50.0%
Close Punctuation
ValueCountFrequency (%)
) 1537
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1508
100.0%
Space Separator
ValueCountFrequency (%)
866
100.0%
Other Symbol
ValueCountFrequency (%)
28
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 13692
77.5%
Common 3953
 
22.4%
Latin 25
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1557
 
11.4%
1080
 
7.9%
968
 
7.1%
457
 
3.3%
347
 
2.5%
333
 
2.4%
277
 
2.0%
275
 
2.0%
248
 
1.8%
235
 
1.7%
Other values (377) 7915
57.8%
Latin
ValueCountFrequency (%)
T 5
20.0%
A 4
16.0%
S 3
12.0%
P 2
 
8.0%
B 2
 
8.0%
D 1
 
4.0%
Q 1
 
4.0%
C 1
 
4.0%
E 1
 
4.0%
R 1
 
4.0%
Other values (4) 4
16.0%
Common
ValueCountFrequency (%)
) 1537
38.9%
( 1508
38.1%
866
21.9%
1 12
 
0.3%
2 9
 
0.2%
0 6
 
0.2%
6 3
 
0.1%
3 3
 
0.1%
5 3
 
0.1%
- 3
 
0.1%
Other values (2) 3
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 13664
77.3%
ASCII 3978
 
22.5%
None 28
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1557
 
11.4%
1080
 
7.9%
968
 
7.1%
457
 
3.3%
347
 
2.5%
333
 
2.4%
277
 
2.0%
275
 
2.0%
248
 
1.8%
235
 
1.7%
Other values (376) 7887
57.7%
ASCII
ValueCountFrequency (%)
) 1537
38.6%
( 1508
37.9%
866
21.8%
1 12
 
0.3%
2 9
 
0.2%
0 6
 
0.2%
T 5
 
0.1%
A 4
 
0.1%
6 3
 
0.1%
3 3
 
0.1%
Other values (16) 25
 
0.6%
None
ValueCountFrequency (%)
28
100.0%
Distinct1926
Distinct (%)61.1%
Missing0
Missing (%)0.0%
Memory size24.8 KiB
2024-01-10T06:10:11.571482image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length3
Mean length4.8655249
Min length1

Characters and Unicode

Total characters15341
Distinct characters366
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1473 ?
Unique (%)46.7%

Sample

1st row서주원
2nd row이봉재
3rd row김홍석
4th row김홍석
5th row김홍석
ValueCountFrequency (%)
김복렬 91
 
2.7%
논산시장 80
 
2.4%
문장환 44
 
1.3%
놀뫼환경(주 33
 
1.0%
박희재 25
 
0.7%
육군훈련소 25
 
0.7%
장일용 20
 
0.6%
고려환경개발(주 19
 
0.6%
김복열 18
 
0.5%
백우현 18
 
0.5%
Other values (1974) 2979
88.9%
2024-01-10T06:10:11.970438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
854
 
5.6%
) 823
 
5.4%
( 779
 
5.1%
572
 
3.7%
553
 
3.6%
522
 
3.4%
347
 
2.3%
269
 
1.8%
265
 
1.7%
264
 
1.7%
Other values (356) 10093
65.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 13426
87.5%
Close Punctuation 824
 
5.4%
Open Punctuation 779
 
5.1%
Space Separator 209
 
1.4%
Other Punctuation 41
 
0.3%
Decimal Number 28
 
0.2%
Connector Punctuation 16
 
0.1%
Uppercase Letter 14
 
0.1%
Dash Punctuation 3
 
< 0.1%
Other Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
854
 
6.4%
572
 
4.3%
553
 
4.1%
522
 
3.9%
347
 
2.6%
269
 
2.0%
265
 
2.0%
264
 
2.0%
215
 
1.6%
206
 
1.5%
Other values (334) 9359
69.7%
Decimal Number
ValueCountFrequency (%)
1 16
57.1%
6 4
 
14.3%
5 3
 
10.7%
7 2
 
7.1%
2 1
 
3.6%
8 1
 
3.6%
4 1
 
3.6%
Uppercase Letter
ValueCountFrequency (%)
A 4
28.6%
T 3
21.4%
P 2
14.3%
S 2
14.3%
R 1
 
7.1%
F 1
 
7.1%
K 1
 
7.1%
Close Punctuation
ValueCountFrequency (%)
) 823
99.9%
] 1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 779
100.0%
Space Separator
ValueCountFrequency (%)
209
100.0%
Other Punctuation
ValueCountFrequency (%)
: 41
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 16
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 13427
87.5%
Common 1900
 
12.4%
Latin 14
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
854
 
6.4%
572
 
4.3%
553
 
4.1%
522
 
3.9%
347
 
2.6%
269
 
2.0%
265
 
2.0%
264
 
2.0%
215
 
1.6%
206
 
1.5%
Other values (335) 9360
69.7%
Common
ValueCountFrequency (%)
) 823
43.3%
( 779
41.0%
209
 
11.0%
: 41
 
2.2%
_ 16
 
0.8%
1 16
 
0.8%
6 4
 
0.2%
- 3
 
0.2%
5 3
 
0.2%
7 2
 
0.1%
Other values (4) 4
 
0.2%
Latin
ValueCountFrequency (%)
A 4
28.6%
T 3
21.4%
P 2
14.3%
S 2
14.3%
R 1
 
7.1%
F 1
 
7.1%
K 1
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 13426
87.5%
ASCII 1914
 
12.5%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
854
 
6.4%
572
 
4.3%
553
 
4.1%
522
 
3.9%
347
 
2.6%
269
 
2.0%
265
 
2.0%
264
 
2.0%
215
 
1.6%
206
 
1.5%
Other values (334) 9359
69.7%
ASCII
ValueCountFrequency (%)
) 823
43.0%
( 779
40.7%
209
 
10.9%
: 41
 
2.1%
_ 16
 
0.8%
1 16
 
0.8%
6 4
 
0.2%
A 4
 
0.2%
T 3
 
0.2%
- 3
 
0.2%
Other values (11) 16
 
0.8%
None
ValueCountFrequency (%)
1
100.0%
Distinct193
Distinct (%)6.1%
Missing0
Missing (%)0.0%
Memory size24.8 KiB
2024-01-10T06:10:12.346601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.000634
Min length11

Characters and Unicode

Total characters37838
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique141 ?
Unique (%)4.5%

Sample

1st row000-000-0000
2nd row000-000-0000
3rd row000-000-0000
4th row000-000-0000
5th row000-000-0000
ValueCountFrequency (%)
000-000-0000 2820
89.4%
041-853-9523 14
 
0.4%
042-535-1080 13
 
0.4%
042-484-9702 12
 
0.4%
041-730-3274 11
 
0.3%
041-746-5542 8
 
0.3%
041-730-1273 8
 
0.3%
042-639-0033 8
 
0.3%
041-746-5534 6
 
0.2%
041-735-8420 5
 
0.2%
Other values (183) 248
 
7.9%
2024-01-10T06:10:12.843927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 28777
76.1%
- 6306
 
16.7%
4 608
 
1.6%
1 382
 
1.0%
3 377
 
1.0%
7 373
 
1.0%
2 315
 
0.8%
5 231
 
0.6%
6 204
 
0.5%
8 155
 
0.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 31532
83.3%
Dash Punctuation 6306
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 28777
91.3%
4 608
 
1.9%
1 382
 
1.2%
3 377
 
1.2%
7 373
 
1.2%
2 315
 
1.0%
5 231
 
0.7%
6 204
 
0.6%
8 155
 
0.5%
9 110
 
0.3%
Dash Punctuation
ValueCountFrequency (%)
- 6306
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 37838
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 28777
76.1%
- 6306
 
16.7%
4 608
 
1.6%
1 382
 
1.0%
3 377
 
1.0%
7 373
 
1.0%
2 315
 
0.8%
5 231
 
0.6%
6 204
 
0.5%
8 155
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 37838
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 28777
76.1%
- 6306
 
16.7%
4 608
 
1.6%
1 382
 
1.0%
3 377
 
1.0%
7 373
 
1.0%
2 315
 
0.8%
5 231
 
0.6%
6 204
 
0.5%
8 155
 
0.4%
Distinct1102
Distinct (%)35.0%
Missing0
Missing (%)0.0%
Memory size24.8 KiB
Minimum2000-01-03 00:00:00
Maximum2020-05-27 00:00:00
2024-01-10T06:10:12.980157image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:10:13.108931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

생활계구분
Categorical

IMBALANCE 

Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size24.8 KiB
2802 
생활계
 
193
배출시설계
 
157
<NA>
 
1

Length

Max length5
Median length1
Mean length1.32255
Min length1

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
2802
88.9%
생활계 193
 
6.1%
배출시설계 157
 
5.0%
<NA> 1
 
< 0.1%

Length

2024-01-10T06:10:13.242495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T06:10:13.354036image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
생활계 193
55.0%
배출시설계 157
44.7%
na 1
 
0.3%
Distinct62
Distinct (%)2.0%
Missing1
Missing (%)< 0.1%
Memory size24.8 KiB
2024-01-10T06:10:13.558357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length76
Median length1
Mean length3.8388325
Min length1

Characters and Unicode

Total characters12100
Distinct characters158
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)0.8%

Sample

1st row
2nd row건설폐기물
3rd row건설폐기물
4th row건설폐기물
5th row건설폐기물
ValueCountFrequency (%)
건설폐기물 218
 
12.1%
제외한다 97
 
5.4%
폐콘크리트 91
 
5.1%
등을 74
 
4.1%
폐합성수지류(폐염화비닐수지류는 73
 
4.1%
말한다 67
 
3.7%
폐합성수지류 63
 
3.5%
폐목재류 60
 
3.3%
등의 55
 
3.1%
가지 53
 
2.9%
Other values (104) 948
52.7%
2024-01-10T06:10:13.907843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3279
27.1%
867
 
7.2%
368
 
3.0%
357
 
3.0%
343
 
2.8%
313
 
2.6%
311
 
2.6%
302
 
2.5%
260
 
2.1%
259
 
2.1%
Other values (148) 5441
45.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8217
67.9%
Space Separator 3279
 
27.1%
Connector Punctuation 189
 
1.6%
Close Punctuation 169
 
1.4%
Open Punctuation 169
 
1.4%
Decimal Number 56
 
0.5%
Other Punctuation 21
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
867
 
10.6%
368
 
4.5%
357
 
4.3%
343
 
4.2%
313
 
3.8%
311
 
3.8%
302
 
3.7%
260
 
3.2%
259
 
3.2%
206
 
2.5%
Other values (138) 4631
56.4%
Decimal Number
ValueCountFrequency (%)
1 49
87.5%
2 6
 
10.7%
3 1
 
1.8%
Close Punctuation
ValueCountFrequency (%)
) 167
98.8%
2
 
1.2%
Open Punctuation
ValueCountFrequency (%)
( 167
98.8%
2
 
1.2%
Space Separator
ValueCountFrequency (%)
3279
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 189
100.0%
Other Punctuation
ValueCountFrequency (%)
. 21
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8217
67.9%
Common 3883
32.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
867
 
10.6%
368
 
4.5%
357
 
4.3%
343
 
4.2%
313
 
3.8%
311
 
3.8%
302
 
3.7%
260
 
3.2%
259
 
3.2%
206
 
2.5%
Other values (138) 4631
56.4%
Common
ValueCountFrequency (%)
3279
84.4%
_ 189
 
4.9%
) 167
 
4.3%
( 167
 
4.3%
1 49
 
1.3%
. 21
 
0.5%
2 6
 
0.2%
2
 
0.1%
2
 
0.1%
3 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8195
67.7%
ASCII 3879
32.1%
Compat Jamo 22
 
0.2%
None 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3279
84.5%
_ 189
 
4.9%
) 167
 
4.3%
( 167
 
4.3%
1 49
 
1.3%
. 21
 
0.5%
2 6
 
0.2%
3 1
 
< 0.1%
Hangul
ValueCountFrequency (%)
867
 
10.6%
368
 
4.5%
357
 
4.4%
343
 
4.2%
313
 
3.8%
311
 
3.8%
302
 
3.7%
260
 
3.2%
259
 
3.2%
206
 
2.5%
Other values (137) 4609
56.2%
Compat Jamo
ValueCountFrequency (%)
22
100.0%
None
ValueCountFrequency (%)
2
50.0%
2
50.0%

배출량(톤)
Real number (ℝ)

SKEWED  ZEROS 

Distinct229
Distinct (%)7.3%
Missing1
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean63.578008
Minimum0
Maximum24000
Zeros2464
Zeros (%)78.1%
Negative0
Negative (%)0.0%
Memory size27.8 KiB
2024-01-10T06:10:14.038604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile200
Maximum24000
Range24000
Interquartile range (IQR)0

Descriptive statistics

Standard deviation588.3603
Coefficient of variation (CV)9.2541481
Kurtosis958.56854
Mean63.578008
Median Absolute Deviation (MAD)0
Skewness27.171632
Sum200397.88
Variance346167.84
MonotonicityNot monotonic
2024-01-10T06:10:14.161721image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 2464
78.1%
20.0 43
 
1.4%
50.0 38
 
1.2%
10.0 37
 
1.2%
100.0 31
 
1.0%
30.0 29
 
0.9%
200.0 19
 
0.6%
15.0 17
 
0.5%
8.0 15
 
0.5%
60.0 14
 
0.4%
Other values (219) 445
 
14.1%
ValueCountFrequency (%)
0.0 2464
78.1%
0.5 1
 
< 0.1%
1.0 4
 
0.1%
1.2 1
 
< 0.1%
2.0 8
 
0.3%
2.2 1
 
< 0.1%
2.7 1
 
< 0.1%
3.0 3
 
0.1%
3.5 1
 
< 0.1%
4.0 2
 
0.1%
ValueCountFrequency (%)
24000.0 1
< 0.1%
12000.0 1
< 0.1%
8208.0 1
< 0.1%
7680.0 1
< 0.1%
7500.0 1
< 0.1%
6855.96 1
< 0.1%
4152.0 1
< 0.1%
3516.0 1
< 0.1%
3422.0 1
< 0.1%
3084.0 1
< 0.1%

처리구분
Categorical

IMBALANCE 

Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size24.8 KiB
2358 
위탁
790 
자가
 
4
<NA>
 
1

Length

Max length4
Median length1
Mean length1.2527751
Min length1

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row
2nd row위탁
3rd row위탁
4th row위탁
5th row위탁

Common Values

ValueCountFrequency (%)
2358
74.8%
위탁 790
 
25.1%
자가 4
 
0.1%
<NA> 1
 
< 0.1%

Length

2024-01-10T06:10:14.280684image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T06:10:14.373170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
위탁 790
99.4%
자가 4
 
0.5%
na 1
 
0.1%
Distinct172
Distinct (%)5.5%
Missing1
Missing (%)< 0.1%
Memory size24.8 KiB
2024-01-10T06:10:14.548667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length1
Mean length2.6986041
Min length1

Characters and Unicode

Total characters8506
Distinct characters174
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique107 ?
Unique (%)3.4%

Sample

1st row
2nd row놀뫼환경(주)
3rd row놀뫼환경(주)
4th row놀뫼환경(주)
5th row놀뫼환경(주)
ValueCountFrequency (%)
놀뫼환경(주 152
18.9%
계룡우드(주 88
 
10.9%
신화환경개발(주 84
 
10.4%
대형환경(주 48
 
6.0%
그린이엔티(주 35
 
4.3%
금산환경재생산업(주 27
 
3.3%
주)동양알디 24
 
3.0%
주)동양환경 24
 
3.0%
새한환경(주 18
 
2.2%
주)엔아이티 17
 
2.1%
Other values (162) 289
35.9%
2024-01-10T06:10:14.921338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2375
27.9%
745
 
8.8%
( 737
 
8.7%
) 736
 
8.7%
423
 
5.0%
417
 
4.9%
159
 
1.9%
157
 
1.8%
138
 
1.6%
126
 
1.5%
Other values (164) 2493
29.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4644
54.6%
Space Separator 2375
27.9%
Open Punctuation 737
 
8.7%
Close Punctuation 736
 
8.7%
Uppercase Letter 10
 
0.1%
Lowercase Letter 2
 
< 0.1%
Other Symbol 1
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
745
 
16.0%
423
 
9.1%
417
 
9.0%
159
 
3.4%
157
 
3.4%
138
 
3.0%
126
 
2.7%
105
 
2.3%
105
 
2.3%
104
 
2.2%
Other values (153) 2165
46.6%
Uppercase Letter
ValueCountFrequency (%)
J 3
30.0%
H 3
30.0%
G 2
20.0%
R 2
20.0%
Lowercase Letter
ValueCountFrequency (%)
r 1
50.0%
g 1
50.0%
Space Separator
ValueCountFrequency (%)
2375
100.0%
Open Punctuation
ValueCountFrequency (%)
( 737
100.0%
Close Punctuation
ValueCountFrequency (%)
) 736
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4645
54.6%
Common 3849
45.3%
Latin 12
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
745
 
16.0%
423
 
9.1%
417
 
9.0%
159
 
3.4%
157
 
3.4%
138
 
3.0%
126
 
2.7%
105
 
2.3%
105
 
2.3%
104
 
2.2%
Other values (154) 2166
46.6%
Latin
ValueCountFrequency (%)
J 3
25.0%
H 3
25.0%
G 2
16.7%
R 2
16.7%
r 1
 
8.3%
g 1
 
8.3%
Common
ValueCountFrequency (%)
2375
61.7%
( 737
 
19.1%
) 736
 
19.1%
- 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4644
54.6%
ASCII 3861
45.4%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2375
61.5%
( 737
 
19.1%
) 736
 
19.1%
J 3
 
0.1%
H 3
 
0.1%
G 2
 
0.1%
R 2
 
0.1%
- 1
 
< 0.1%
r 1
 
< 0.1%
g 1
 
< 0.1%
Hangul
ValueCountFrequency (%)
745
 
16.0%
423
 
9.1%
417
 
9.0%
159
 
3.4%
157
 
3.4%
138
 
3.0%
126
 
2.7%
105
 
2.3%
105
 
2.3%
104
 
2.2%
Other values (153) 2165
46.6%
None
ValueCountFrequency (%)
1
100.0%

처리방법
Categorical

IMBALANCE 

Distinct28
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size24.8 KiB
2369 
중간처분(파쇄.분쇄)
 
221
파쇄.절단
 
174
중간처분(일반소각)
 
103
재활용(파쇄.분쇄)
 
72
Other values (23)
 
214

Length

Max length19
Median length1
Mean length3.2305741
Min length1

Unique

Unique4 ?
Unique (%)0.1%

Sample

1st row
2nd row파쇄.절단
3rd row파쇄.절단
4th row파쇄.절단
5th row파쇄.절단

Common Values

ValueCountFrequency (%)
2369
75.1%
중간처분(파쇄.분쇄) 221
 
7.0%
파쇄.절단 174
 
5.5%
중간처분(일반소각) 103
 
3.3%
재활용(파쇄.분쇄) 72
 
2.3%
재활용(연료·고형연료제품 제조) 49
 
1.6%
재활용(기타) 28
 
0.9%
재활용(중간가공폐기물 제조) 24
 
0.8%
매립(민간관리형매립시설) 22
 
0.7%
재활용(성토재·복토재 등으로 사용) 17
 
0.5%
Other values (18) 74
 
2.3%

Length

2024-01-10T06:10:15.093781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
중간처분(파쇄.분쇄 221
23.9%
파쇄.절단 174
18.8%
중간처분(일반소각 103
11.1%
제조 89
9.6%
재활용(파쇄.분쇄 72
 
7.8%
재활용(연료·고형연료제품 49
 
5.3%
재활용(기타 28
 
3.0%
사용 27
 
2.9%
재활용(중간가공폐기물 24
 
2.6%
매립(민간관리형매립시설 22
 
2.4%
Other values (22) 115
12.4%

처리량(톤/연)
Real number (ℝ)

SKEWED  ZEROS 

Distinct234
Distinct (%)7.4%
Missing1
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean52.216869
Minimum0
Maximum24000
Zeros2358
Zeros (%)74.8%
Negative0
Negative (%)0.0%
Memory size27.8 KiB
2024-01-10T06:10:15.227562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31.4
95-th percentile180
Maximum24000
Range24000
Interquartile range (IQR)1.4

Descriptive statistics

Standard deviation505.15762
Coefficient of variation (CV)9.6742228
Kurtosis1635.7927
Mean52.216869
Median Absolute Deviation (MAD)0
Skewness36.436634
Sum164587.57
Variance255184.22
MonotonicityNot monotonic
2024-01-10T06:10:15.385597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 2358
74.8%
20.0 46
 
1.5%
50.0 41
 
1.3%
10.0 40
 
1.3%
30.0 37
 
1.2%
100.0 35
 
1.1%
15.0 26
 
0.8%
90.0 25
 
0.8%
200.0 21
 
0.7%
60.0 16
 
0.5%
Other values (224) 507
 
16.1%
ValueCountFrequency (%)
0.0 2358
74.8%
0.5 1
 
< 0.1%
1.0 4
 
0.1%
1.2 1
 
< 0.1%
2.0 9
 
0.3%
2.2 1
 
< 0.1%
2.7 1
 
< 0.1%
3.0 3
 
0.1%
3.5 1
 
< 0.1%
4.0 2
 
0.1%
ValueCountFrequency (%)
24000.0 1
 
< 0.1%
7500.0 1
 
< 0.1%
7488.0 1
 
< 0.1%
3422.0 1
 
< 0.1%
3216.0 1
 
< 0.1%
3022.0 1
 
< 0.1%
3000.0 2
0.1%
2063.0 1
 
< 0.1%
2000.0 3
0.1%
1959.0 1
 
< 0.1%
Distinct374
Distinct (%)11.9%
Missing1
Missing (%)< 0.1%
Memory size24.8 KiB
2024-01-10T06:10:15.672007image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length1
Mean length5.4152919
Min length1

Characters and Unicode

Total characters17069
Distinct characters184
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique251 ?
Unique (%)8.0%

Sample

1st row
2nd row
3rd row충청남도 공주시 반포면 공암장터길 24
4th row충청남도 공주시 반포면 공암장터길 24
5th row충청남도 공주시 반포면 공암장터길 24
ValueCountFrequency (%)
충청남도 648
20.2%
논산시 579
 
18.1%
시민로210번길 85
 
2.7%
9 73
 
2.3%
은진면 63
 
2.0%
연무읍 49
 
1.5%
연산면 47
 
1.5%
계백로 43
 
1.3%
강경읍 38
 
1.2%
대전광역시 33
 
1.0%
Other values (543) 1549
48.3%
2024-01-10T06:10:16.047169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5115
30.0%
808
 
4.7%
712
 
4.2%
659
 
3.9%
659
 
3.9%
654
 
3.8%
653
 
3.8%
594
 
3.5%
588
 
3.4%
1 581
 
3.4%
Other values (174) 6046
35.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9006
52.8%
Space Separator 5115
30.0%
Decimal Number 2708
 
15.9%
Dash Punctuation 239
 
1.4%
Connector Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
808
 
9.0%
712
 
7.9%
659
 
7.3%
659
 
7.3%
654
 
7.3%
653
 
7.3%
594
 
6.6%
588
 
6.5%
386
 
4.3%
299
 
3.3%
Other values (161) 2994
33.2%
Decimal Number
ValueCountFrequency (%)
1 581
21.5%
2 480
17.7%
3 273
10.1%
0 257
9.5%
9 247
9.1%
5 200
 
7.4%
4 189
 
7.0%
6 187
 
6.9%
7 147
 
5.4%
8 147
 
5.4%
Space Separator
ValueCountFrequency (%)
5115
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 239
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9006
52.8%
Common 8063
47.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
808
 
9.0%
712
 
7.9%
659
 
7.3%
659
 
7.3%
654
 
7.3%
653
 
7.3%
594
 
6.6%
588
 
6.5%
386
 
4.3%
299
 
3.3%
Other values (161) 2994
33.2%
Common
ValueCountFrequency (%)
5115
63.4%
1 581
 
7.2%
2 480
 
6.0%
3 273
 
3.4%
0 257
 
3.2%
9 247
 
3.1%
- 239
 
3.0%
5 200
 
2.5%
4 189
 
2.3%
6 187
 
2.3%
Other values (3) 295
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9006
52.8%
ASCII 8063
47.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5115
63.4%
1 581
 
7.2%
2 480
 
6.0%
3 273
 
3.4%
0 257
 
3.2%
9 247
 
3.1%
- 239
 
3.0%
5 200
 
2.5%
4 189
 
2.3%
6 187
 
2.3%
Other values (3) 295
 
3.7%
Hangul
ValueCountFrequency (%)
808
 
9.0%
712
 
7.9%
659
 
7.3%
659
 
7.3%
654
 
7.3%
653
 
7.3%
594
 
6.6%
588
 
6.5%
386
 
4.3%
299
 
3.3%
Other values (161) 2994
33.2%
Distinct1347
Distinct (%)42.7%
Missing1
Missing (%)< 0.1%
Memory size24.8 KiB
2024-01-10T06:10:16.363259image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length24
Mean length17.57297
Min length1

Characters and Unicode

Total characters55390
Distinct characters230
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique902 ?
Unique (%)28.6%

Sample

1st row충청남도 논산시 벌곡면 한삼천리 334-2
2nd row서울특별시 강서구 화곡동 902-1
3rd row충청남도 공주시 반포면 공암리 370
4th row충청남도 공주시 반포면 공암리 370
5th row충청남도 공주시 반포면 공암리 370
ValueCountFrequency (%)
충청남도 2941
22.7%
논산시 2808
21.7%
연무읍 407
 
3.1%
취암동 284
 
2.2%
내동 256
 
2.0%
연산면 211
 
1.6%
강경읍 204
 
1.6%
은진면 154
 
1.2%
벌곡면 142
 
1.1%
양촌면 112
 
0.9%
Other values (1427) 5439
42.0%
2024-01-10T06:10:16.803144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
11287
20.4%
3524
 
6.4%
3059
 
5.5%
3040
 
5.5%
3003
 
5.4%
2996
 
5.4%
2966
 
5.4%
2821
 
5.1%
1942
 
3.5%
1394
 
2.5%
Other values (220) 19358
34.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 36801
66.4%
Space Separator 11287
 
20.4%
Decimal Number 6167
 
11.1%
Dash Punctuation 1134
 
2.0%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3524
 
9.6%
3059
 
8.3%
3040
 
8.3%
3003
 
8.2%
2996
 
8.1%
2966
 
8.1%
2821
 
7.7%
1942
 
5.3%
1394
 
3.8%
1295
 
3.5%
Other values (207) 10761
29.2%
Decimal Number
ValueCountFrequency (%)
1 1134
18.4%
2 787
12.8%
4 701
11.4%
3 670
10.9%
5 575
9.3%
6 516
8.4%
8 486
7.9%
7 474
7.7%
0 449
 
7.3%
9 375
 
6.1%
Space Separator
ValueCountFrequency (%)
11287
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1134
100.0%
Math Symbol
ValueCountFrequency (%)
= 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 36801
66.4%
Common 18589
33.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3524
 
9.6%
3059
 
8.3%
3040
 
8.3%
3003
 
8.2%
2996
 
8.1%
2966
 
8.1%
2821
 
7.7%
1942
 
5.3%
1394
 
3.8%
1295
 
3.5%
Other values (207) 10761
29.2%
Common
ValueCountFrequency (%)
11287
60.7%
- 1134
 
6.1%
1 1134
 
6.1%
2 787
 
4.2%
4 701
 
3.8%
3 670
 
3.6%
5 575
 
3.1%
6 516
 
2.8%
8 486
 
2.6%
7 474
 
2.5%
Other values (3) 825
 
4.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 36801
66.4%
ASCII 18589
33.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
11287
60.7%
- 1134
 
6.1%
1 1134
 
6.1%
2 787
 
4.2%
4 701
 
3.8%
3 670
 
3.6%
5 575
 
3.1%
6 516
 
2.8%
8 486
 
2.6%
7 474
 
2.5%
Other values (3) 825
 
4.4%
Hangul
ValueCountFrequency (%)
3524
 
9.6%
3059
 
8.3%
3040
 
8.3%
3003
 
8.2%
2996
 
8.1%
2966
 
8.1%
2821
 
7.7%
1942
 
5.3%
1394
 
3.8%
1295
 
3.5%
Other values (207) 10761
29.2%

업무구분
Categorical

IMBALANCE 

Distinct3
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size24.8 KiB
사업장폐기물배출자(2-2호)
3035 
사업장폐기물배출자(2-3호)
 
117
<NA>
 
1

Length

Max length15
Median length15
Mean length14.996511
Min length4

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row사업장폐기물배출자(2-2호)
2nd row사업장폐기물배출자(2-2호)
3rd row사업장폐기물배출자(2-2호)
4th row사업장폐기물배출자(2-2호)
5th row사업장폐기물배출자(2-2호)

Common Values

ValueCountFrequency (%)
사업장폐기물배출자(2-2호) 3035
96.3%
사업장폐기물배출자(2-3호) 117
 
3.7%
<NA> 1
 
< 0.1%

Length

2024-01-10T06:10:16.941368image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T06:10:17.049543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사업장폐기물배출자(2-2호 3035
96.3%
사업장폐기물배출자(2-3호 117
 
3.7%
na 1
 
< 0.1%

월별배출량
Real number (ℝ)

SKEWED  ZEROS 

Distinct109
Distinct (%)3.5%
Missing1
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean21.527763
Minimum0
Maximum15000
Zeros2919
Zeros (%)92.6%
Negative0
Negative (%)0.0%
Memory size27.8 KiB
2024-01-10T06:10:17.163632image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile30
Maximum15000
Range15000
Interquartile range (IQR)0

Descriptive statistics

Standard deviation311.51951
Coefficient of variation (CV)14.470594
Kurtosis1719.5145
Mean21.527763
Median Absolute Deviation (MAD)0
Skewness37.572948
Sum67855.51
Variance97044.407
MonotonicityNot monotonic
2024-01-10T06:10:17.306917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 2919
92.6%
10.0 14
 
0.4%
90.0 13
 
0.4%
30.0 11
 
0.3%
15.0 10
 
0.3%
20.0 9
 
0.3%
25.0 8
 
0.3%
35.0 7
 
0.2%
50.0 5
 
0.2%
200.0 5
 
0.2%
Other values (99) 151
 
4.8%
ValueCountFrequency (%)
0.0 2919
92.6%
1.0 5
 
0.2%
1.4 1
 
< 0.1%
2.0 1
 
< 0.1%
4.0 2
 
0.1%
5.0 2
 
0.1%
5.8 1
 
< 0.1%
6.0 1
 
< 0.1%
7.0 2
 
0.1%
8.0 1
 
< 0.1%
ValueCountFrequency (%)
15000.0 1
< 0.1%
4147.0 1
< 0.1%
3341.0 1
< 0.1%
3216.0 1
< 0.1%
3000.0 1
< 0.1%
2063.0 1
< 0.1%
2000.0 2
0.1%
1959.0 1
< 0.1%
1888.0 1
< 0.1%
1700.0 1
< 0.1%

성상구분
Categorical

IMBALANCE 

Distinct5
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size24.8 KiB
2570 
고상
577 
기타
 
3
액상
 
2
<NA>
 
1

Length

Max length4
Median length1
Mean length1.1855376
Min length1

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
2570
81.5%
고상 577
 
18.3%
기타 3
 
0.1%
액상 2
 
0.1%
<NA> 1
 
< 0.1%

Length

2024-01-10T06:10:17.445158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T06:10:17.558336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
고상 577
99.0%
기타 3
 
0.5%
액상 2
 
0.3%
na 1
 
0.2%

폐기물자가처리방법
Categorical

IMBALANCE 

Distinct28
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size24.8 KiB
2369 
중간처분(파쇄.분쇄)
 
221
파쇄.절단
 
174
중간처분(일반소각)
 
103
재활용(파쇄.분쇄)
 
72
Other values (23)
 
214

Length

Max length19
Median length1
Mean length3.2305741
Min length1

Unique

Unique4 ?
Unique (%)0.1%

Sample

1st row
2nd row파쇄.절단
3rd row파쇄.절단
4th row파쇄.절단
5th row파쇄.절단

Common Values

ValueCountFrequency (%)
2369
75.1%
중간처분(파쇄.분쇄) 221
 
7.0%
파쇄.절단 174
 
5.5%
중간처분(일반소각) 103
 
3.3%
재활용(파쇄.분쇄) 72
 
2.3%
재활용(연료·고형연료제품 제조) 49
 
1.6%
재활용(기타) 28
 
0.9%
재활용(중간가공폐기물 제조) 24
 
0.8%
매립(민간관리형매립시설) 22
 
0.7%
재활용(성토재·복토재 등으로 사용) 17
 
0.5%
Other values (18) 74
 
2.3%

Length

2024-01-10T06:10:17.672509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
중간처분(파쇄.분쇄 221
23.9%
파쇄.절단 174
18.8%
중간처분(일반소각 103
11.1%
제조 89
9.6%
재활용(파쇄.분쇄 72
 
7.8%
재활용(연료·고형연료제품 49
 
5.3%
재활용(기타 28
 
3.0%
사용 27
 
2.9%
재활용(중간가공폐기물 24
 
2.6%
매립(민간관리형매립시설 22
 
2.4%
Other values (22) 115
12.4%

공사시작일자
Date

MISSING 

Distinct542
Distinct (%)64.9%
Missing2318
Missing (%)73.5%
Memory size24.8 KiB
Minimum1999-05-01 00:00:00
Maximum2020-05-27 00:00:00
2024-01-10T06:10:17.784224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:10:18.135230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

공사종료일자
Date

MISSING 

Distinct523
Distinct (%)60.6%
Missing2290
Missing (%)72.6%
Memory size24.8 KiB
Minimum2000-01-30 00:00:00
Maximum2021-04-23 00:00:00
2024-01-10T06:10:18.254945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:10:18.379328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Sample

인허가관리번호신고번호상호대표자전화번호신고일생활계구분폐기물종류배출량(톤)처리구분처리업소명처리방법처리량(톤/연)사업장도로명주소사업장지번주소업무구분월별배출량성상구분폐기물자가처리방법공사시작일자공사종료일자
04540000-32-1994-000011994-00001놀뫼환경(주)서주원000-000-00002000-08-280.00.0충청남도 논산시 벌곡면 한삼천리 334-2사업장폐기물배출자(2-2호)0.02019-03-212019-04-09
14540000-32-2000-000011(주)동원주택건설이봉재000-000-00002000-01-03건설폐기물60.0위탁놀뫼환경(주)파쇄.절단60.0서울특별시 강서구 화곡동 902-1사업장폐기물배출자(2-2호)0.0파쇄.절단2014-02-252015-01-17
24540000-32-2000-000022대석건설(주)김홍석000-000-00002000-01-03건설폐기물90.0위탁놀뫼환경(주)파쇄.절단90.0충청남도 공주시 반포면 공암장터길 24충청남도 공주시 반포면 공암리 370사업장폐기물배출자(2-2호)0.0파쇄.절단2014-10-012014-10-20
34540000-32-2000-000033대석건설(주)김홍석000-000-00002000-01-03건설폐기물90.0위탁놀뫼환경(주)파쇄.절단90.0충청남도 공주시 반포면 공암장터길 24충청남도 공주시 반포면 공암리 370사업장폐기물배출자(2-2호)0.0파쇄.절단2014-04-242014-09-23
44540000-32-2000-000044대석건설(주)김홍석000-000-00002000-01-03건설폐기물90.0위탁놀뫼환경(주)파쇄.절단90.0충청남도 공주시 반포면 공암장터길 24충청남도 공주시 반포면 공암리 370사업장폐기물배출자(2-2호)0.0파쇄.절단2011-06-012013-03-20
54540000-32-2000-000055대석건설(주)김홍석000-000-00002000-01-03건설폐기물90.0위탁놀뫼환경(주)파쇄.절단90.0충청남도 공주시 반포면 공암장터길 24충청남도 공주시 반포면 공암리 370사업장폐기물배출자(2-2호)0.0파쇄.절단2014-04-242014-09-16
64540000-32-2000-000066대석건설(주)김홍석000-000-00002000-01-03건설폐기물90.0위탁놀뫼환경(주)파쇄.절단90.0충청남도 공주시 반포면 공암장터길 24충청남도 공주시 반포면 공암리 370사업장폐기물배출자(2-2호)0.0파쇄.절단<NA><NA>
74540000-32-2000-000077대석건설(주)김홍석000-000-00002000-01-03건설폐기물0.0위탁놀뫼환경(주)파쇄.절단90.0충청남도 공주시 반포면 공암장터길 24충청남도 공주시 반포면 공암리 370사업장폐기물배출자(2-2호)90.0파쇄.절단2008-04-072009-09-01
84540000-32-2000-000088대석건설(주)김홍석000-000-00002000-01-03건설폐기물1080.0위탁놀뫼환경(주)파쇄.절단90.0충청남도 공주시 반포면 공암장터길 24충청남도 공주시 반포면 공암리 370사업장폐기물배출자(2-2호)90.0파쇄.절단2009-02-062009-12-31
94540000-32-2000-000099대석건설(주)김홍석000-000-00002000-01-03건설폐기물1080.0위탁놀뫼환경(주)파쇄.절단90.0충청남도 공주시 반포면 공암장터길 24충청남도 공주시 반포면 공암리 370사업장폐기물배출자(2-2호)90.0파쇄.절단2009-06-032010-12-31
인허가관리번호신고번호상호대표자전화번호신고일생활계구분폐기물종류배출량(톤)처리구분처리업소명처리방법처리량(톤/연)사업장도로명주소사업장지번주소업무구분월별배출량성상구분폐기물자가처리방법공사시작일자공사종료일자
31434540000-33-2009-00005376대한주택공사대전충남지역본부본부장041-734-70002009-09-070.00.0충청남도 논산시 중앙로260번길 59-3충청남도 논산시 취암동 80사업장폐기물배출자(2-3호)0.0<NA><NA>
31444540000-33-2009-00006377놀뫼건설(주)빈병권000-000-00002009-09-160.00.0충청남도 논산시 취암동사업장폐기물배출자(2-3호)0.0<NA><NA>
31454540000-33-2009-00007379육군훈련소육군훈련소000-000-00002009-09-210.00.0충청남도 논산시 연무읍 죽평리 761-1사업장폐기물배출자(2-3호)0.0<NA><NA>
31464540000-33-2009-00008379육군훈련소장육군훈련소000-000-00002009-10-010.00.0충청남도 논산시 연무읍 죽평길 15-1충청남도 논산시 연무읍 죽평리 76-1사업장폐기물배출자(2-3호)0.0<NA><NA>
31474540000-33-2009-00009380정한종합건설(주)정원웅000-000-00002009-10-150.00.0충청남도 논산시 연무읍 양지리사업장폐기물배출자(2-3호)0.02000-06-052000-08-05
31484540000-33-2009-00010382동남건설주식회사최상배000-000-00002009-10-280.00.0충청남도 논산시 채운면 삼거리사업장폐기물배출자(2-3호)0.0<NA><NA>
31494540000-33-2009-00011381삼부산업(주)정순영000-000-00002009-10-280.00.0충청남도 논산시 연산면 선비로1001번길 8-23충청남도 논산시 연산면 어은리 446-2사업장폐기물배출자(2-3호)0.02018-10-052018-12-18
31504540000-33-2009-000122009-00012(유)하얀건설박희재000-000-00002009-10-300.00.0충청남도 논산시 강경읍 남교리사업장폐기물배출자(2-3호)0.0<NA><NA>
31514540000-33-2009-00013385논산시청논산시장041-730-32742009-12-010.00.0충청남도 논산시 시민로210번길 9충청남도 논산시 내동 824사업장폐기물배출자(2-3호)0.02016-12-282017-02-05
31524540000-33-2010-00001428계룡건설산업(주)이시구000-000-00002010-09-130.00.0충청남도 논산시 취암동사업장폐기물배출자(2-3호)0.0<NA><NA>

Duplicate rows

Most frequently occurring

인허가관리번호신고번호상호대표자전화번호신고일생활계구분폐기물종류배출량(톤)처리구분처리업소명처리방법처리량(톤/연)사업장도로명주소사업장지번주소업무구분월별배출량성상구분폐기물자가처리방법공사시작일자공사종료일자# duplicates
04540000-32-2013-00007사업-555대성산업윤철중000-000-00002013-04-18생활계폐합성수지류50.0위탁(주)이에스세종중간처분(일반소각)50.0충청남도 논산시 광석면 천동리 589-14사업장폐기물배출자(2-2호)0.0고상중간처분(일반소각)<NA><NA>2
14540000-32-2019-00059사업2019-95논산시청논산시장000-000-00002019-11-29폐가구류_ 폐도장목_ 폐목재포장재_ 폐전선드럼(원목상태의 깨끗한 목재를 말한다)250.0위탁주식회사이한산업재활용(중간가공폐기물 제조)250.0충청남도 논산시 시민로210번길 9충청남도 논산시 내동 824사업장폐기물배출자(2-2호)0.0고상재활용(중간가공폐기물 제조)<NA><NA>2
24540000-32-2020-00013사업2020-16중앙통신(주)한의동042-639-00332020-03-25폐콘크리트100.0위탁대형환경(주)중간처분(파쇄.분쇄)100.0대전광역시 유성구 테크노2로 310-12대전광역시 유성구 탑립동 903사업장폐기물배출자(2-2호)0.0고상중간처분(파쇄.분쇄)<NA><NA>2
34540000-32-2020-00019사업2020-22논산시청 환경자원화센터논산시장041-746-55422020-04-08생활계폐가구류_ 폐도장목_ 폐목재포장재_ 폐전선드럼(원목상태의 깨끗한 목재를 말한다)150.0위탁(유)대한청정환경재활용(연료·고형연료제품 제조)150.0충청남도 논산시 은진면 버들길 137충청남도 논산시 은진면 시묘리 385-1사업장폐기물배출자(2-2호)0.0고상재활용(연료·고형연료제품 제조)<NA><NA>2