Overview

Dataset statistics

Number of variables: 10
Number of observations: 1882
Missing cells: 30
Missing cells (%): 0.2%
Duplicate rows: 46
Duplicate rows (%): 2.4%
Total size in memory: 147.2 KiB
Average record size in memory: 80.1 B
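
The derived figures above can be re-checked from the raw counts. The sketch below does that arithmetic, assuming the missing-cell percentage is taken over all cells (variables × observations) and that 1 KiB is 1024 bytes; the variable names are illustrative and not part of the report.

```python
# Counts copied from the dataset statistics above.
n_variables = 10
n_observations = 1882
missing_cells = 30
duplicate_rows = 46
total_size_kib = 147.2

# Assumption: missing cells are measured against every cell in the table.
missing_pct = 100 * missing_cells / (n_variables * n_observations)

# Duplicate rows are measured against the number of observations.
duplicate_pct = 100 * duplicate_rows / n_observations

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"missing cells: {missing_pct:.1f}%")
print(f"duplicate rows: {duplicate_pct:.1f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

Under these assumptions the three values round to the reported 0.2%, 2.4%, and 80.1 B.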

Variable types

Categorical: 4
Text: 6

Dataset

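Description: Waste-generator declaration records by business site, including waste type, business registration number, contact information, treatment facility name, and business road-name address.
URL: https://www.data.go.kr/data/15060283/fileData.do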

Alerts

데이터기준일자 has constant value "2023-08-31" (Constant)
Dataset has 46 (2.4%) duplicate rows (Duplicates)
신고기준년도 is highly overall correlated with 폐기물구분 (High correlation)
폐기물구분 is highly overall correlated with 처리방법 and 1 other field (High correlation)
처리방법 is highly overall correlated with 폐기물구분 (High correlation)
사업장도로명주소 has 22 (1.2%) missing values (Missing)

Reproduction

Analysis started: 2023-12-12 20:45:45.195074
Analysis finished: 2023-12-12 20:45:46.624197
Duration: 1.43 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

폐기물구분
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 14.8 KiB

Value | Count
일반폐기물 | 1488
지정폐기물 | 394

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 일반폐기물
2nd row: 일반폐기물
3rd row: 일반폐기물
4th row: 일반폐기물
5th row: 일반폐기물

Common Values

Value | Count | Frequency (%)
일반폐기물 | 1488 | 79.1%
지정폐기물 | 394 | 20.9%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values listed above]

상호명
Text

Distinct: 573
Distinct (%): 30.4%
Missing: 0
Missing (%): 0.0%
Memory size: 14.8 KiB

Length

Max length: 21
Median length: 16
Mean length: 9.7577046
Min length: 2

Characters and Unicode

Total characters: 18364
Distinct characters: 409
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
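
As an illustration of how such per-category counts can be obtained, the sketch below tallies Unicode general categories with Python's standard `unicodedata` module. It is not the profiler's own code: `category_counts` and `CATEGORY_NAMES` are hypothetical names, and the name mapping covers only the categories that appear in this report. The standard library does not expose script or block properties, so those counts are not reproduced here.

```python
import unicodedata
from collections import Counter

# Two-letter general-category codes mapped to the long names used in the report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Pd": "Dash Punctuation",
    "Pc": "Connector Punctuation",
    "Po": "Other Punctuation",
    "So": "Other Symbol",
}


def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            # Fall back to the raw code for categories not in the mapping.
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


# Two values taken from the sample rows of this report.
counts = category_counts(["대동건설(주)", "경북건설 주식회사"])
print(counts)
```

Hangul syllables fall under "Other Letter", which is why that category dominates the tables for the Korean text columns.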

Unique

Unique: 224
Unique (%): 11.9%

Sample

1st row: 대동건설(주)
2nd row: 경북건설 주식회사
3rd row: (주)에코비트(충주2반입장)
4th row: (주)에코비트(충주2반입장)
5th row: (주)에코비트(충주2반입장)

Words

Value | Count | Frequency (%)
주식회사 | 178 | 7.1%
충주공장 | 78 | 3.1%
공군제19전투비행단 | 47 | 1.9%
금광배출인협회 | 34 | 1.4%
충주지점 | 32 | 1.3%
농업회사법인 | 28 | 1.1%
의료법인 | 24 | 1.0%
충주사업소 | 24 | 1.0%
그린에코사이클(주 | 24 | 1.0%
롯데칠성음료(주 | 21 | 0.8%
Other values (605) | 2010 | 80.4%

Most occurring characters

ValueCountFrequency (%)
1850
 
10.1%
( 1261
 
6.9%
) 1261
 
6.9%
618
 
3.4%
521
 
2.8%
452
 
2.5%
367
 
2.0%
342
 
1.9%
337
 
1.8%
337
 
1.8%
Other values (399) 11018
60.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 14888
81.1%
Open Punctuation 1261
 
6.9%
Close Punctuation 1261
 
6.9%
Space Separator 618
 
3.4%
Decimal Number 235
 
1.3%
Uppercase Letter 58
 
0.3%
Other Punctuation 27
 
0.1%
Dash Punctuation 9
 
< 0.1%
Lowercase Letter 4
 
< 0.1%
Other Symbol 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1850
 
12.4%
521
 
3.5%
452
 
3.0%
367
 
2.5%
342
 
2.3%
337
 
2.3%
337
 
2.3%
304
 
2.0%
296
 
2.0%
263
 
1.8%
Other values (366) 9819
66.0%
Uppercase Letter
ValueCountFrequency (%)
C 13
22.4%
T 8
13.8%
S 7
12.1%
D 4
 
6.9%
G 4
 
6.9%
P 4
 
6.9%
N 4
 
6.9%
B 3
 
5.2%
A 3
 
5.2%
H 2
 
3.4%
Other values (6) 6
10.3%
Decimal Number
ValueCountFrequency (%)
1 73
31.1%
2 54
23.0%
9 48
20.4%
3 20
 
8.5%
0 19
 
8.1%
8 18
 
7.7%
4 2
 
0.9%
5 1
 
0.4%
Other Punctuation
ValueCountFrequency (%)
. 26
96.3%
& 1
 
3.7%
Lowercase Letter
ValueCountFrequency (%)
h 2
50.0%
e 2
50.0%
Open Punctuation
ValueCountFrequency (%)
( 1261
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1261
100.0%
Space Separator
ValueCountFrequency (%)
618
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 14891
81.1%
Common 3411
 
18.6%
Latin 62
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1850
 
12.4%
521
 
3.5%
452
 
3.0%
367
 
2.5%
342
 
2.3%
337
 
2.3%
337
 
2.3%
304
 
2.0%
296
 
2.0%
263
 
1.8%
Other values (367) 9822
66.0%
Latin
ValueCountFrequency (%)
C 13
21.0%
T 8
12.9%
S 7
11.3%
D 4
 
6.5%
G 4
 
6.5%
P 4
 
6.5%
N 4
 
6.5%
B 3
 
4.8%
A 3
 
4.8%
h 2
 
3.2%
Other values (8) 10
16.1%
Common
ValueCountFrequency (%)
( 1261
37.0%
) 1261
37.0%
618
18.1%
1 73
 
2.1%
2 54
 
1.6%
9 48
 
1.4%
. 26
 
0.8%
3 20
 
0.6%
0 19
 
0.6%
8 18
 
0.5%
Other values (4) 13
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 14888
81.1%
ASCII 3473
 
18.9%
None 3
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1850
 
12.4%
521
 
3.5%
452
 
3.0%
367
 
2.5%
342
 
2.3%
337
 
2.3%
337
 
2.3%
304
 
2.0%
296
 
2.0%
263
 
1.8%
Other values (366) 9819
66.0%
ASCII
ValueCountFrequency (%)
( 1261
36.3%
) 1261
36.3%
618
17.8%
1 73
 
2.1%
2 54
 
1.6%
9 48
 
1.4%
. 26
 
0.7%
3 20
 
0.6%
0 19
 
0.5%
8 18
 
0.5%
Other values (22) 75
 
2.2%
None
ValueCountFrequency (%)
3
100.0%

폐기물종류
Text

Distinct: 162
Distinct (%): 8.6%
Missing: 1
Missing (%): 0.1%
Memory size: 14.8 KiB

Length

Max length: 84
Median length: 66
Mean length: 16.757044
Min length: 1

Characters and Unicode

Total characters: 31520
Distinct characters: 254
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 54
Unique (%): 2.9%

Sample

1st row: 그 밖의 폐목재류
2nd row: 폐합성수지류(폐염화비닐수지류는 제외한다)
3rd row: 그 밖의 무기성오니
4th row: 그 밖의 무기성오니
5th row: 그 밖의 무기성오니

Words

Value | Count | Frequency (%)
제외한다 | 545 | 10.3%
(label missing in source) | 513 | 9.7%
밖의 | 513 | 9.7%
폐합성수지류(폐염화비닐수지류는 | 488 | 9.2%
말한다 | 150 | 2.8%
폐수처리오니 | 142 | 2.7%
폐유 | 95 | 1.8%
식물성잔재물 | 90 | 1.7%
(label missing in source) | 88 | 1.7%
폐합성수지류 | 72 | 1.4%
Other values (260) | 2616 | 49.2%

Most occurring characters

ValueCountFrequency (%)
3445
 
10.9%
2204
 
7.0%
1295
 
4.1%
1288
 
4.1%
1176
 
3.7%
888
 
2.8%
777
 
2.5%
758
 
2.4%
750
 
2.4%
723
 
2.3%
Other values (244) 18216
57.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 25823
81.9%
Space Separator 3445
 
10.9%
Close Punctuation 782
 
2.5%
Open Punctuation 782
 
2.5%
Connector Punctuation 248
 
0.8%
Lowercase Letter 246
 
0.8%
Decimal Number 189
 
0.6%
Other Punctuation 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2204
 
8.5%
1295
 
5.0%
1288
 
5.0%
1176
 
4.6%
888
 
3.4%
777
 
3.0%
758
 
2.9%
750
 
2.9%
723
 
2.8%
703
 
2.7%
Other values (225) 15261
59.1%
Lowercase Letter
ValueCountFrequency (%)
e 82
33.3%
r 41
16.7%
g 41
16.7%
a 41
16.7%
s 41
16.7%
Decimal Number
ValueCountFrequency (%)
2 64
33.9%
1 47
24.9%
0 41
21.7%
8 37
19.6%
Close Punctuation
ValueCountFrequency (%)
) 704
90.0%
] 41
 
5.2%
37
 
4.7%
Open Punctuation
ValueCountFrequency (%)
( 704
90.0%
[ 41
 
5.2%
37
 
4.7%
Other Punctuation
ValueCountFrequency (%)
· 3
60.0%
. 2
40.0%
Space Separator
ValueCountFrequency (%)
3445
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 248
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 25823
81.9%
Common 5451
 
17.3%
Latin 246
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2204
 
8.5%
1295
 
5.0%
1288
 
5.0%
1176
 
4.6%
888
 
3.4%
777
 
3.0%
758
 
2.9%
750
 
2.9%
723
 
2.8%
703
 
2.7%
Other values (225) 15261
59.1%
Common
ValueCountFrequency (%)
3445
63.2%
) 704
 
12.9%
( 704
 
12.9%
_ 248
 
4.5%
2 64
 
1.2%
1 47
 
0.9%
] 41
 
0.8%
0 41
 
0.8%
[ 41
 
0.8%
37
 
0.7%
Other values (4) 79
 
1.4%
Latin
ValueCountFrequency (%)
e 82
33.3%
r 41
16.7%
g 41
16.7%
a 41
16.7%
s 41
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 25599
81.2%
ASCII 5620
 
17.8%
Compat Jamo 224
 
0.7%
None 77
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3445
61.3%
) 704
 
12.5%
( 704
 
12.5%
_ 248
 
4.4%
e 82
 
1.5%
2 64
 
1.1%
1 47
 
0.8%
r 41
 
0.7%
g 41
 
0.7%
a 41
 
0.7%
Other values (6) 203
 
3.6%
Hangul
ValueCountFrequency (%)
2204
 
8.6%
1295
 
5.1%
1288
 
5.0%
1176
 
4.6%
888
 
3.5%
777
 
3.0%
758
 
3.0%
750
 
2.9%
723
 
2.8%
703
 
2.7%
Other values (224) 15037
58.7%
Compat Jamo
ValueCountFrequency (%)
224
100.0%
None
ValueCountFrequency (%)
37
48.1%
37
48.1%
· 3
 
3.9%

사업자등록번호
Text

Distinct: 539
Distinct (%): 28.7%
Missing: 7
Missing (%): 0.4%
Memory size: 14.8 KiB

Length

Max length: 12
Median length: 12
Mean length: 11.998933
Min length: 11

Characters and Unicode

Total characters: 22498
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
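
A minimum length of 11 against a maximum of 12 suggests that a few values deviate from the 12-character shape of values such as 303-81-03787. The sketch below flags such values, assuming the intended format is three digits, two digits, and five digits separated by hyphens. That format is inferred from the values shown in this report, not stated by it, and `find_malformed` is a hypothetical helper.

```python
import re

# Assumed format: NNN-NN-NNNNN (inferred from the values in this report).
REGISTRATION_PATTERN = re.compile(r"\d{3}-\d{2}-\d{5}")


def find_malformed(values):
    """Return the non-missing values that do not match the assumed format."""
    return [
        value
        for value in values
        if value is not None and not REGISTRATION_PATTERN.fullmatch(value)
    ]


# The second value is a made-up example with one digit missing.
malformed = find_malformed(["303-81-03787", "303-81-0378", None])
print(malformed)
```

Missing values are skipped rather than flagged, since the report already counts them separately.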

Unique

Unique: 201
Unique (%): 10.7%

Sample

1st row: 303-81-03787
2nd row: 632-81-00752
3rd row: 128-81-98738
4th row: 128-81-98738
5th row: 128-81-98738

Words

Value | Count | Frequency (%)
303-83-03743 | 47 | 2.5%
712-39-00855 | 34 | 1.8%
866-85-01691 | 24 | 1.3%
677-86-01176 | 21 | 1.1%
303-81-36503 | 20 | 1.1%
612-85-21580 | 19 | 1.0%
268-87-00567 | 19 | 1.0%
511-82-07042 | 18 | 1.0%
303-85-15748 | 16 | 0.9%
303-81-40195 | 15 | 0.8%
Other values (529) | 1642 | 87.6%

Most occurring characters

ValueCountFrequency (%)
- 3750
16.7%
0 3055
13.6%
3 2954
13.1%
8 2580
11.5%
1 2515
11.2%
2 1739
7.7%
5 1579
7.0%
7 1216
 
5.4%
6 1187
 
5.3%
4 1126
 
5.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 18748
83.3%
Dash Punctuation 3750
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 3055
16.3%
3 2954
15.8%
8 2580
13.8%
1 2515
13.4%
2 1739
9.3%
5 1579
8.4%
7 1216
 
6.5%
6 1187
 
6.3%
4 1126
 
6.0%
9 797
 
4.3%
Dash Punctuation
ValueCountFrequency (%)
- 3750
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 22498
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 3750
16.7%
0 3055
13.6%
3 2954
13.1%
8 2580
11.5%
1 2515
11.2%
2 1739
7.7%
5 1579
7.0%
7 1216
 
5.4%
6 1187
 
5.3%
4 1126
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 22498
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 3750
16.7%
0 3055
13.6%
3 2954
13.1%
8 2580
11.5%
1 2515
11.2%
2 1739
7.7%
5 1579
7.0%
7 1216
 
5.4%
6 1187
 
5.3%
4 1126
 
5.0%

운반자명
Text

Distinct: 494
Distinct (%): 26.2%
Missing: 0
Missing (%): 0.0%
Memory size: 14.8 KiB

Length

Max length: 22
Median length: 16
Mean length: 6.7959617
Min length: 1

Characters and Unicode

Total characters: 12790
Distinct characters: 280
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 269
Unique (%): 14.3%

Sample

1st row: 보람환경
2nd row: 보람환경
3rd row: (주)대한물류
4th row: (주)동서로직스
5th row: (주)대진물류

Words

Value | Count | Frequency (%)
케이그린(주 | 143 | 7.5%
지구환경자원 | 95 | 5.0%
주)제일상사 | 93 | 4.9%
성문종합환경 | 71 | 3.7%
주)중원인더스트리 | 52 | 2.7%
선일산업 | 48 | 2.5%
멘토환경 | 43 | 2.3%
주)바이오그린 | 35 | 1.8%
우리산업 | 30 | 1.6%
주)에코리더 | 30 | 1.6%
Other values (484) | 1258 | 66.3%

Most occurring characters

ValueCountFrequency (%)
1277
 
10.0%
( 1263
 
9.9%
) 1263
 
9.9%
617
 
4.8%
615
 
4.8%
500
 
3.9%
279
 
2.2%
275
 
2.2%
254
 
2.0%
245
 
1.9%
Other values (270) 6202
48.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10171
79.5%
Open Punctuation 1263
 
9.9%
Close Punctuation 1263
 
9.9%
Space Separator 31
 
0.2%
Other Punctuation 19
 
0.1%
Connector Punctuation 14
 
0.1%
Uppercase Letter 10
 
0.1%
Decimal Number 9
 
0.1%
Other Symbol 7
 
0.1%
Lowercase Letter 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1277
 
12.6%
617
 
6.1%
615
 
6.0%
500
 
4.9%
279
 
2.7%
275
 
2.7%
254
 
2.5%
245
 
2.4%
242
 
2.4%
233
 
2.3%
Other values (254) 5634
55.4%
Uppercase Letter
ValueCountFrequency (%)
C 6
60.0%
S 1
 
10.0%
E 1
 
10.0%
T 1
 
10.0%
N 1
 
10.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
33.3%
o 1
33.3%
c 1
33.3%
Decimal Number
ValueCountFrequency (%)
2 5
55.6%
1 4
44.4%
Open Punctuation
ValueCountFrequency (%)
( 1263
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1263
100.0%
Space Separator
ValueCountFrequency (%)
31
100.0%
Other Punctuation
ValueCountFrequency (%)
. 19
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 14
100.0%
Other Symbol
ValueCountFrequency (%)
7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 10178
79.6%
Common 2599
 
20.3%
Latin 13
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1277
 
12.5%
617
 
6.1%
615
 
6.0%
500
 
4.9%
279
 
2.7%
275
 
2.7%
254
 
2.5%
245
 
2.4%
242
 
2.4%
233
 
2.3%
Other values (255) 5641
55.4%
Latin
ValueCountFrequency (%)
C 6
46.2%
e 1
 
7.7%
o 1
 
7.7%
c 1
 
7.7%
S 1
 
7.7%
E 1
 
7.7%
T 1
 
7.7%
N 1
 
7.7%
Common
ValueCountFrequency (%)
( 1263
48.6%
) 1263
48.6%
31
 
1.2%
. 19
 
0.7%
_ 14
 
0.5%
2 5
 
0.2%
1 4
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10171
79.5%
ASCII 2612
 
20.4%
None 7
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1277
 
12.6%
617
 
6.1%
615
 
6.0%
500
 
4.9%
279
 
2.7%
275
 
2.7%
254
 
2.5%
245
 
2.4%
242
 
2.4%
233
 
2.3%
Other values (254) 5634
55.4%
ASCII
ValueCountFrequency (%)
( 1263
48.4%
) 1263
48.4%
31
 
1.2%
. 19
 
0.7%
_ 14
 
0.5%
C 6
 
0.2%
2 5
 
0.2%
1 4
 
0.2%
e 1
 
< 0.1%
o 1
 
< 0.1%
Other values (5) 5
 
0.2%
None
ValueCountFrequency (%)
7
100.0%

처리업소명
Text

Distinct: 628
Distinct (%): 33.4%
Missing: 0
Missing (%): 0.0%
Memory size: 14.8 KiB

Length

Max length: 19
Median length: 17
Mean length: 7.837407
Min length: 1

Characters and Unicode

Total characters: 14750
Distinct characters: 327
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 357
Unique (%): 19.0%

Sample

1st row: (주)엘에스에너지
2nd row: (주)해창
3rd row: 쌍용씨엔이(주)영월공장
4th row: 쌍용씨엔이(주)영월공장
5th row: 쌍용씨엔이(주)영월공장

Words

Value | Count | Frequency (%)
주)다나에너지솔루션 | 84 | 4.3%
주)클렌코 | 48 | 2.5%
주)청풍산업 | 44 | 2.3%
주)삼우그린 | 42 | 2.1%
주)중원환경산업 | 41 | 2.1%
주)천지화학 | 32 | 1.6%
자연환경(주 | 32 | 1.6%
진주산업(주 | 27 | 1.4%
창광실업(주 | 26 | 1.3%
노은환경개발(주 | 24 | 1.2%
Other values (628) | 1554 | 79.5%

Most occurring characters

ValueCountFrequency (%)
1638
 
11.1%
( 1550
 
10.5%
) 1550
 
10.5%
531
 
3.6%
470
 
3.2%
395
 
2.7%
385
 
2.6%
357
 
2.4%
300
 
2.0%
292
 
2.0%
Other values (317) 7282
49.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 11496
77.9%
Open Punctuation 1551
 
10.5%
Close Punctuation 1551
 
10.5%
Space Separator 83
 
0.6%
Uppercase Letter 18
 
0.1%
Decimal Number 17
 
0.1%
Lowercase Letter 14
 
0.1%
Other Symbol 11
 
0.1%
Other Punctuation 5
 
< 0.1%
Dash Punctuation 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1638
 
14.2%
531
 
4.6%
470
 
4.1%
395
 
3.4%
385
 
3.3%
357
 
3.1%
300
 
2.6%
292
 
2.5%
229
 
2.0%
201
 
1.7%
Other values (285) 6698
58.3%
Lowercase Letter
ValueCountFrequency (%)
a 2
14.3%
g 2
14.3%
i 2
14.3%
m 1
7.1%
k 1
7.1%
p 1
7.1%
s 1
7.1%
o 1
7.1%
c 1
7.1%
r 1
7.1%
Uppercase Letter
ValueCountFrequency (%)
C 6
33.3%
R 2
 
11.1%
S 2
 
11.1%
N 2
 
11.1%
G 2
 
11.1%
O 1
 
5.6%
I 1
 
5.6%
E 1
 
5.6%
F 1
 
5.6%
Open Punctuation
ValueCountFrequency (%)
( 1550
99.9%
[ 1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 1550
99.9%
] 1
 
0.1%
Decimal Number
ValueCountFrequency (%)
2 10
58.8%
1 7
41.2%
Other Punctuation
ValueCountFrequency (%)
. 3
60.0%
& 2
40.0%
Space Separator
ValueCountFrequency (%)
83
100.0%
Other Symbol
ValueCountFrequency (%)
11
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 11507
78.0%
Common 3211
 
21.8%
Latin 32
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1638
 
14.2%
531
 
4.6%
470
 
4.1%
395
 
3.4%
385
 
3.3%
357
 
3.1%
300
 
2.6%
292
 
2.5%
229
 
2.0%
201
 
1.7%
Other values (286) 6709
58.3%
Latin
ValueCountFrequency (%)
C 6
18.8%
R 2
 
6.2%
a 2
 
6.2%
g 2
 
6.2%
i 2
 
6.2%
S 2
 
6.2%
N 2
 
6.2%
G 2
 
6.2%
m 1
 
3.1%
k 1
 
3.1%
Other values (10) 10
31.2%
Common
ValueCountFrequency (%)
( 1550
48.3%
) 1550
48.3%
83
 
2.6%
2 10
 
0.3%
1 7
 
0.2%
- 3
 
0.1%
. 3
 
0.1%
& 2
 
0.1%
_ 1
 
< 0.1%
[ 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 11496
77.9%
ASCII 3243
 
22.0%
None 11
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1638
 
14.2%
531
 
4.6%
470
 
4.1%
395
 
3.4%
385
 
3.3%
357
 
3.1%
300
 
2.6%
292
 
2.5%
229
 
2.0%
201
 
1.7%
Other values (285) 6698
58.3%
ASCII
ValueCountFrequency (%)
( 1550
47.8%
) 1550
47.8%
83
 
2.6%
2 10
 
0.3%
1 7
 
0.2%
C 6
 
0.2%
- 3
 
0.1%
. 3
 
0.1%
R 2
 
0.1%
a 2
 
0.1%
Other values (21) 27
 
0.8%
None
ValueCountFrequency (%)
11
100.0%

처리방법
Categorical

HIGH CORRELATION 

Distinct: 38
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Memory size: 14.8 KiB

Value | Count
중간처분(일반소각) | 411
재활용(중간가공폐기물 제조) | 389
재활용(농업생산활동에 사용) | 184
재활용(연료·고형연료제품 제조) | 169
매립(민간관리형매립시설) | 154
Other values (33) | 575

Length

Max length: 19
Median length: 17
Mean length: 12.659936
Min length: 1

Unique

Unique: 4
Unique (%): 0.2%

Sample

1st row: 재활용(연료·고형연료제품 제조)
2nd row: 재활용(중간가공폐기물 제조)
3rd row: 재활용(직접 제품제조)
4th row: 재활용(직접 제품제조)
5th row: 재활용(직접 제품제조)

Common Values

Value | Count | Frequency (%)
중간처분(일반소각) | 411 | 21.8%
재활용(중간가공폐기물 제조) | 389 | 20.7%
재활용(농업생산활동에 사용) | 184 | 9.8%
재활용(연료·고형연료제품 제조) | 169 | 9.0%
매립(민간관리형매립시설) | 154 | 8.2%
재활용(원료 제조) | 127 | 6.7%
재활용(직접 제품제조) | 101 | 5.4%
중간처분(파쇄.분쇄) | 67 | 3.6%
재활용(토질개선에 사용) | 56 | 3.0%
중간처분(고온소각) | 46 | 2.4%
Other values (28) | 178 | 9.5%

Length

[Figure: Histogram of lengths of the category]

Words

Value | Count | Frequency (%)
제조 | 685 | 23.0%
중간처분(일반소각 | 411 | 13.8%
재활용(중간가공폐기물 | 389 | 13.1%
사용 | 264 | 8.9%
재활용(농업생산활동에 | 184 | 6.2%
재활용(연료·고형연료제품 | 169 | 5.7%
매립(민간관리형매립시설 | 154 | 5.2%
재활용(원료 | 127 | 4.3%
재활용(직접 | 113 | 3.8%
제품제조 | 101 | 3.4%
Other values (32) | 382 | 12.8%

사업장도로명주소
Text

Distinct: 524
Distinct (%): 28.2%
Missing: 22
Missing (%): 1.2%
Memory size: 14.8 KiB

Length

Max length: 54
Median length: 45
Mean length: 24.156452
Min length: 1

Characters and Unicode

Total characters: 44931
Distinct characters: 294
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 180
Unique (%): 9.7%

Sample

1st row: 충청북도 충주시 용정4길 20 (용산동)
2nd row: 충청북도 충주시 남산1길 6 (용산동)
3rd row: 충청북도 충주시 소태면 동막강현길 137
4th row: 충청북도 충주시 소태면 동막강현길 137
5th row: 충청북도 충주시 소태면 동막강현길 137

Words

Value | Count | Frequency (%)
충청북도 | 1851 | 19.1%
충주시 | 1846 | 19.0%
대소원면 | 391 | 4.0%
용탄동 | 228 | 2.4%
주덕읍 | 226 | 2.3%
신니면 | 116 | 1.2%
목행동 | 104 | 1.1%
중앙탑면 | 101 | 1.0%
금가면 | 92 | 0.9%
충주호수로 | 77 | 0.8%
Other values (698) | 4660 | 48.1%

Most occurring characters

ValueCountFrequency (%)
7875
17.5%
4080
 
9.1%
2561
 
5.7%
2006
 
4.5%
1966
 
4.4%
1882
 
4.2%
1877
 
4.2%
1 1412
 
3.1%
1197
 
2.7%
1033
 
2.3%
Other values (284) 19042
42.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 28800
64.1%
Space Separator 7875
 
17.5%
Decimal Number 6148
 
13.7%
Close Punctuation 769
 
1.7%
Open Punctuation 769
 
1.7%
Dash Punctuation 368
 
0.8%
Connector Punctuation 160
 
0.4%
Uppercase Letter 30
 
0.1%
Other Punctuation 9
 
< 0.1%
Other Symbol 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4080
 
14.2%
2561
 
8.9%
2006
 
7.0%
1966
 
6.8%
1882
 
6.5%
1877
 
6.5%
1197
 
4.2%
1033
 
3.6%
703
 
2.4%
632
 
2.2%
Other values (260) 10863
37.7%
Decimal Number
ValueCountFrequency (%)
1 1412
23.0%
2 959
15.6%
3 784
12.8%
4 509
 
8.3%
6 499
 
8.1%
5 484
 
7.9%
7 426
 
6.9%
8 408
 
6.6%
0 395
 
6.4%
9 272
 
4.4%
Uppercase Letter
ValueCountFrequency (%)
G 10
33.3%
C 9
30.0%
S 3
 
10.0%
D 3
 
10.0%
B 2
 
6.7%
E 2
 
6.7%
A 1
 
3.3%
Space Separator
ValueCountFrequency (%)
7875
100.0%
Close Punctuation
ValueCountFrequency (%)
) 769
100.0%
Open Punctuation
ValueCountFrequency (%)
( 769
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 368
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 160
100.0%
Other Punctuation
ValueCountFrequency (%)
. 9
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 28803
64.1%
Common 16098
35.8%
Latin 30
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4080
 
14.2%
2561
 
8.9%
2006
 
7.0%
1966
 
6.8%
1882
 
6.5%
1877
 
6.5%
1197
 
4.2%
1033
 
3.6%
703
 
2.4%
632
 
2.2%
Other values (261) 10866
37.7%
Common
ValueCountFrequency (%)
7875
48.9%
1 1412
 
8.8%
2 959
 
6.0%
3 784
 
4.9%
) 769
 
4.8%
( 769
 
4.8%
4 509
 
3.2%
6 499
 
3.1%
5 484
 
3.0%
7 426
 
2.6%
Other values (6) 1612
 
10.0%
Latin
ValueCountFrequency (%)
G 10
33.3%
C 9
30.0%
S 3
 
10.0%
D 3
 
10.0%
B 2
 
6.7%
E 2
 
6.7%
A 1
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 28800
64.1%
ASCII 16128
35.9%
None 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
7875
48.8%
1 1412
 
8.8%
2 959
 
5.9%
3 784
 
4.9%
) 769
 
4.8%
( 769
 
4.8%
4 509
 
3.2%
6 499
 
3.1%
5 484
 
3.0%
7 426
 
2.6%
Other values (13) 1642
 
10.2%
Hangul
ValueCountFrequency (%)
4080
 
14.2%
2561
 
8.9%
2006
 
7.0%
1966
 
6.8%
1882
 
6.5%
1877
 
6.5%
1197
 
4.2%
1033
 
3.6%
703
 
2.4%
632
 
2.2%
Other values (260) 10863
37.7%
None
ValueCountFrequency (%)
3
100.0%

신고기준년도
Categorical

HIGH CORRELATION 

Distinct: 50
Distinct (%): 2.7%
Missing: 0
Missing (%): 0.0%
Memory size: 14.8 KiB

Value | Count
2017년 | 122
2018년 | 120
2011년 | 104
2006년 | 82
2021년 | 79
Other values (45) | 1375

Length

Max length: 8
Median length: 5
Mean length: 5.1907545
Min length: 2

Unique

Unique: 2
Unique (%): 0.1%

Sample

1st row: 2023년
2nd row: 2023년
3rd row: 2023년
4th row: 2023년
5th row: 2023년

Common Values

Value | Count | Frequency (%)
2017년 | 122 | 6.5%
2018년 | 120 | 6.4%
2011년 | 104 | 5.5%
2006년 | 82 | 4.4%
2021년 | 79 | 4.2%
2022년 | 77 | 4.1%
2020년 | 76 | 4.0%
2015년 | 74 | 3.9%
2020 년 | 73 | 3.9%
2023년 | 70 | 3.7%
Other values (40) | 1005 | 53.4%
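
The common values include both "2020년" and "2020 년", so the same year is counted under two spellings. The sketch below merges such variants, assuming every valid label is a four-digit year followed by optional whitespace and the character 년. `normalize_year` and `merge_year_counts` are hypothetical helpers, and labels that do not fit the assumed shape are returned as `None` rather than guessed.

```python
import re
from collections import Counter

# Assumed label shape: four digits, optional whitespace, then 년.
YEAR_LABEL = re.compile(r"(\d{4})\s*년")


def normalize_year(label):
    """Return the year as an int, or None if the label has another shape."""
    match = YEAR_LABEL.fullmatch(label.strip())
    return int(match.group(1)) if match else None


def merge_year_counts(label_counts):
    """Sum counts of labels that normalize to the same year."""
    merged = Counter()
    for label, count in label_counts.items():
        merged[normalize_year(label)] += count
    return merged


# Counts for the two spellings of 2020 listed in the common values.
merged = merge_year_counts({"2020년": 76, "2020 년": 73})
print(merged)
```

With the two counts above, the year 2020 totals 149 rows after merging.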

Length

[Figure: Histogram of lengths of the category]

Words

Value | Count | Frequency (%)
(label missing in source) | 353 | 15.8%
2017년 | 122 | 5.5%
2018년 | 120 | 5.4%
2011년 | 104 | 4.7%
2006년 | 82 | 3.7%
2021년 | 79 | 3.5%
2022년 | 77 | 3.4%
2020년 | 76 | 3.4%
2015년 | 74 | 3.3%
2020 | 73 | 3.3%
Other values (41) | 1075 | 48.1%

데이터기준일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 14.8 KiB

Value | Count
2023-08-31 | 1882

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023-08-31
2nd row: 2023-08-31
3rd row: 2023-08-31
4th row: 2023-08-31
5th row: 2023-08-31

Common Values

Value | Count | Frequency (%)
2023-08-31 | 1882 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values listed above]

Correlations

Variable | 폐기물구분 | 처리방법 | 신고기준년도
폐기물구분 | 1.000 | 0.689 | 0.996
처리방법 | 0.689 | 1.000 | 0.827
신고기준년도 | 0.996 | 0.827 | 1.000

Variable | 처리방법 | 신고기준년도 | 폐기물구분
처리방법 | 1.000 | 0.271 | 0.554
신고기준년도 | 0.271 | 1.000 | 0.940
폐기물구분 | 0.554 | 0.940 | 1.000

Variable | 폐기물구분 | 처리방법 | 신고기준년도
폐기물구분 | 1.000 | 0.554 | 0.940
처리방법 | 0.554 | 1.000 | 0.271
신고기준년도 | 0.940 | 0.271 | 1.000
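
The extracted text does not say which association measure each matrix uses. For pairs of categorical columns, Cramér's V is one common choice, and the sketch below computes its plain, uncorrected form from two value lists. It is an illustration under that assumption, so its output need not match any matrix above; `cramers_v` is a hypothetical helper name.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long categorical sequences."""
    n = len(xs)
    if n == 0 or n != len(ys):
        raise ValueError("sequences must be non-empty and of equal length")

    joint = Counter(zip(xs, ys))  # observed counts per (x, y) pair
    row_totals = Counter(xs)
    col_totals = Counter(ys)

    # With a single category on either side, association is undefined; use 0.
    k = min(len(row_totals), len(col_totals)) - 1
    if k == 0:
        return 0.0

    # Pearson chi-square statistic over the full contingency table.
    chi2 = 0.0
    for r, n_r in row_totals.items():
        for c, n_c in col_totals.items():
            expected = n_r * n_c / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected

    return sqrt(chi2 / (n * k))


# Toy data: the first pair is perfectly associated, the second is independent.
perfect = cramers_v(["a", "a", "b", "b"], ["x", "x", "y", "y"])
independent = cramers_v(["a", "a", "b", "b"], ["x", "y", "x", "y"])
print(perfect, independent)
```

The statistic ranges from 0 (no association) to 1 (one column fully determines the other), which is the scale the matrices above appear to share.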

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
[Figure] The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

Index | 폐기물구분 | 상호명 | 폐기물종류 | 사업자등록번호 | 운반자명 | 처리업소명 | 처리방법 | 사업장도로명주소 | 신고기준년도 | 데이터기준일자
0 | 일반폐기물 | 대동건설(주) | 그 밖의 폐목재류 | 303-81-03787 | 보람환경 | (주)엘에스에너지 | 재활용(연료·고형연료제품 제조) | 충청북도 충주시 용정4길 20 (용산동) | 2023년 | 2023-08-31
1 | 일반폐기물 | 경북건설 주식회사 | 폐합성수지류(폐염화비닐수지류는 제외한다) | 632-81-00752 | 보람환경 | (주)해창 | 재활용(중간가공폐기물 제조) | 충청북도 충주시 남산1길 6 (용산동) | 2023년 | 2023-08-31
2 | 일반폐기물 | (주)에코비트(충주2반입장) | 그 밖의 무기성오니 | 128-81-98738 | (주)대한물류 | 쌍용씨엔이(주)영월공장 | 재활용(직접 제품제조) | 충청북도 충주시 소태면 동막강현길 137 | 2023년 | 2023-08-31
3 | 일반폐기물 | (주)에코비트(충주2반입장) | 그 밖의 무기성오니 | 128-81-98738 | (주)동서로직스 | 쌍용씨엔이(주)영월공장 | 재활용(직접 제품제조) | 충청북도 충주시 소태면 동막강현길 137 | 2023년 | 2023-08-31
4 | 일반폐기물 | (주)에코비트(충주2반입장) | 그 밖의 무기성오니 | 128-81-98738 | (주)대진물류 | 쌍용씨엔이(주)영월공장 | 재활용(직접 제품제조) | 충청북도 충주시 소태면 동막강현길 137 | 2023년 | 2023-08-31
5 | 일반폐기물 | 농업회사법인(주)자연알로 | <NA> | 558-81-03092 | 농업회사법인더클린(주) | (주)보성씨엔알 | 재활용(농업생산활동에 사용) | 충청북도 충주시 소태면 구룡로 692-16 | 2023년 | 2023-08-31
6 | 일반폐기물 | (주)일신테크 | 하수준설토 | 303-81-49366 | 성진산업개발(주) | 성진산업개발(주) | 재활용(직접 제품제조) | 충청북도 충주시 절골길 44 (용탄동) | 2023년 | 2023-08-31
7 | 일반폐기물 | 나무로집 | 그 밖의 폐목재류 | 878-43-00448 | (주)중원환경산업 | (주)중원환경산업 | 재활용(중간가공폐기물 제조) | 충청북도 충주시 동량면 충원대로 1530-4 | 2023년 | 2023-08-31
8 | 일반폐기물 | 나무로집 | 폐합성수지류(폐염화비닐수지류는 제외한다) | 878-43-00448 | (주)중원환경산업 | (주)중원환경산업 | 재활용(중간가공폐기물 제조) | 충청북도 충주시 동량면 충원대로 1530-4 | 2023년 | 2023-08-31
9 | 일반폐기물 | 주식회사 티케이와이 | 폐합성수지류(폐염화비닐수지류는 제외한다) | 790-81-02245 | 지구환경자원 | (주)다나에너지솔루션 | 중간처분(일반소각) | 충청북도 충주시 주덕읍 중원산업로 57-32 | 2023년 | 2023-08-31
폐기물구분상호명폐기물종류사업자등록번호운반자명처리업소명처리방법사업장도로명주소신고기준년도데이터기준일자
1872지정폐기물충북자동차전문정비충주시지회그 밖의 폐유기용제511-82-07042상원기공상원기공재활용(원료 제조)충청북도 충주시 상방4길 56 (봉방동)2007 년2023-08-31
1873지정폐기물충북자동차전문정비충주시지회그 밖의 폐광물유[아스팔트유ㆍ그리스(grease)ㆍ방청유 및 수용성절삭유_ 20퍼센트 이상의 이물질이 함유된 폐유_ 고체상태의 폐유 등을 말한다]511-82-07042케이그린(주)(주)국인산업중간처분(일반소각)충청북도 충주시 상방4길 56 (봉방동)2007 년2023-08-31
1874지정폐기물충북자동차전문정비충주시지회폐오일필터511-82-07042케이그린(주)(주)국인산업중간처분(일반소각)충청북도 충주시 상방4길 56 (봉방동)2007 년2023-08-31
1875지정폐기물알바니인터내셔날코리아(주)그 밖의 폐유기용제303-81-14829케이그린(주)(주)천지화학재활용(연료·고형연료제품 제조)충청북도 충주시 충주호수로 308 (용탄동)2002 년2023-08-31
1876지정폐기물알바니인터내셔날코리아(주)폐유성페인트303-81-14829케이그린(주)케이지이티에스(주)중간처분(고온소각)충청북도 충주시 충주호수로 308 (용탄동)2002 년2023-08-31
1877지정폐기물알바니인터내셔날코리아(주)폐기계유ㆍ폐작동유(공업용 기계유ㆍ냉동기유ㆍ터어빈유ㆍ베어링윤활유ㆍ압축기유ㆍ유압작동유ㆍ열매체유 및 프로세스유 등을 말한다)303-81-14829케이그린(주)(주)천지화학재활용(연료·고형연료제품 제조)충청북도 충주시 충주호수로 308 (용탄동)2002 년2023-08-31
1878지정폐기물사회복지법인모두사랑재단하나의원손상성폐기물303-82-06253청주덕원위생공사(주)삼우그린소각충청북도 충주시 앙성면 앙암로 2 (하나의원)2001 년2023-08-31
1879지정폐기물사회복지법인모두사랑재단하나의원폐합성수지류303-82-06253청주덕원위생공사(주)삼우그린소각충청북도 충주시 앙성면 앙암로 2 (하나의원)2001 년2023-08-31
1880지정폐기물사회복지법인모두사랑재단하나의원탈지면류303-82-06253청주덕원위생공사(주)삼우그린소각충청북도 충주시 앙성면 앙암로 2 (하나의원)2001 년2023-08-31
1881지정폐기물충주종합폐차장폐황산이 포함된 2차폐축전지303-11-60975케이메탈(주)국제금속재활용(원료 제조)충청북도 충주시 목수1길 23 (목행동)2006 년2023-08-31

Duplicate rows

Most frequently occurring

폐기물구분상호명폐기물종류사업자등록번호운반자명처리업소명처리방법사업장도로명주소신고기준년도데이터기준일자# duplicates
0일반폐기물(주)오브이룸스 수안보 연수원폐합성수지류(폐염화비닐수지류는 제외한다)303-85-27512성문종합환경(주)에스제이환경산업재활용(중간가공폐기물 제조)충청북도 충주시 수안보면 동진이1길 992013년2023-08-312
1일반폐기물(주)천보신소재그 밖의 공정오니268-87-00567케이그린(주)(주)케이디환경매립(민간관리형매립시설)충청북도 충주시 주덕읍 중원산업로 1632018년2023-08-312
2일반폐기물(주)파리크라상 천등산(제천방향)휴게소폐식용유(식용을 목적으로 식품 재료와 원료를 제조ㆍ조리ㆍ가공하거나 식용유를 유통ㆍ사용 또는 음식물류 폐기물을 처리하는 과정에서 발생하는 기름을 말한다)605-85-46093신흥물산(주)신흥물산(주)재활용(농업생산활동에 사용)충청북도 충주시 산척면 평택제천고속도로 1062015년2023-08-312
3일반폐기물(주)퍼시스충주공장목재가공공장 부산물(접착제_ 페인트_ 기름_ 콘크리트 등의 물질이 사용된 목재부산물 및 분진을 말한다)215-81-20534(주)중앙통운(주)이현에너지재활용(연료·고형연료제품 제조)충청북도 충주시 중앙탑면 가금농공길 462017년2023-08-312
4일반폐기물강남태양열(주)충주폐합성수지303-85-10613자가처리자가중간처분(일반소각)충청북도 충주시 신니면 수월3길 212000년2023-08-312
5일반폐기물노은환경개발(주)폐합성수지류(폐염화비닐수지류는 제외한다)303-81-37991유림환경(합)(주)엔이티재활용(연료·고형연료제품 제조)충청북도 충주시 노은면 안락숭선길 772006년2023-08-312
6일반폐기물보성갈바텍(주)그 밖의 광재류303-81-47107풍전비철풍전비철재활용(원료 제조)충청북도 충주시 주덕읍 주덕농공길 402008년2023-08-312
7일반폐기물사빅코리아 유한회사폐합성수지류303-85-02641(주)제일상사자연환경(주)재활용(중간가공폐기물 제조)충청북도 충주시 국원대로 488 (목행동)2001년2023-08-312
8지정폐기물(주)케이지씨예본병리계폐기물303-81-65274(주)삼우(주)삼우그린중간처분(일반소각)충청북도 충주시 가주농공2길 27 (가주동)2016 년2023-08-312
9지정폐기물(주)태진정공 충주지점폐연마유ㆍ비수용성폐절삭유ㆍ폐열처리유(금속가공과정에서 발생된 것을 말한다)303-85-28321케이그린(주)(주)남양에너지재활용(연료·고형연료제품 제조)충청북도 충주시 충주산단3로 55 (용탄동)2015 년2023-08-312
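
Rows like those listed above can be found by counting identical records. The sketch below does this with the standard library on toy data; `duplicate_groups` and `surplus_rows` are hypothetical helpers. The report does not state whether its figure of 46 counts every repeated row or only the surplus copies, so both quantities are shown.

```python
from collections import Counter


def duplicate_groups(rows):
    """Map each row that occurs more than once to its number of occurrences."""
    counts = Counter(tuple(row) for row in rows)
    return {row: n for row, n in counts.items() if n > 1}


def surplus_rows(groups):
    """Number of rows that would be dropped if one copy per group were kept."""
    return sum(n - 1 for n in groups.values())


# Toy records (waste class, business name, year); not taken from the dataset.
rows = [
    ("일반폐기물", "A", "2013년"),
    ("일반폐기물", "A", "2013년"),
    ("지정폐기물", "B", "2016 년"),
    ("지정폐기물", "C", "2015 년"),
]
groups = duplicate_groups(rows)
print(groups)
print(sum(groups.values()), surplus_rows(groups))
```

Only fully identical rows are grouped, which is also why the two spellings of a year ("2016년" versus "2016 년") would keep otherwise equal records apart.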