Overview

Dataset statistics

Number of variables11
Number of observations354
Missing cells399
Missing cells (%)10.2%
Duplicate rows16
Duplicate rows (%)4.5%
Total size in memory30.6 KiB
Average record size in memory88.4 B

Variable types

Categorical2
Text7
DateTime2

Dataset

Description폐기물구분, 상호명, 폐기물종류, 사업자등록번호, 연락처, 운반자명, 처리업소명, 처리방법, 사업장도로명주소, 신고기준년도를 포함하고 있습니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=383&beforeMenuCd=DOM_000000201001001000&publicdatapk=15060184

Alerts

데이터기준일자 has constant value ""Constant
Dataset has 16 (4.5%) duplicate rowsDuplicates
처리방법 is highly overall correlated with 폐기물구분High correlation
폐기물구분 is highly overall correlated with 처리방법High correlation
폐기물구분 is highly imbalanced (69.8%)Imbalance
상호명 has 19 (5.4%) missing valuesMissing
폐기물종류 has 19 (5.4%) missing valuesMissing
사업자등록번호 has 89 (25.1%) missing valuesMissing
연락처 has 175 (49.4%) missing valuesMissing
운반자명 has 20 (5.6%) missing valuesMissing
처리업소명 has 20 (5.6%) missing valuesMissing
사업장도로명주소 has 19 (5.4%) missing valuesMissing
신고기준년도 has 19 (5.4%) missing valuesMissing
데이터기준일자 has 19 (5.4%) missing valuesMissing

Reproduction

Analysis started2024-01-09 22:41:04.414355
Analysis finished2024-01-09 22:41:05.355604
Duration0.94 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

폐기물구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
사업장폐기물
335 
<NA>
 
19

Length

Max length6
Median length6
Mean length5.8926554
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row사업장폐기물
2nd row사업장폐기물
3rd row사업장폐기물
4th row사업장폐기물
5th row사업장폐기물

Common Values

ValueCountFrequency (%)
사업장폐기물 335
94.6%
<NA> 19
 
5.4%

Length

2024-01-10T07:41:05.642201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:41:05.724936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사업장폐기물 335
94.6%
na 19
 
5.4%

상호명
Text

MISSING 

Distinct141
Distinct (%)42.1%
Missing19
Missing (%)5.4%
Memory size2.9 KiB
2024-01-10T07:41:05.879043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length22
Mean length9.5313433
Min length2

Characters and Unicode

Total characters3193
Distinct characters210
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique65 ?
Unique (%)19.4%

Sample

1st row한국농어촌공사 부여지사
2nd row리니어 주식회사
3rd row수지산업
4th row우리레미콘
5th row주식회사 서원코리아
ValueCountFrequency (%)
주)건양기술공사 23
 
5.1%
한국조폐공사 23
 
5.1%
제지본부 23
 
5.1%
건축사사무소 23
 
5.1%
주)한국인삼공사 14
 
3.1%
고려인삼창(부여공장 14
 
3.1%
주)대오 11
 
2.4%
태극제약(주 8
 
1.8%
고려개발(주 7
 
1.6%
주)동방아그로 6
 
1.3%
Other values (154) 298
66.2%
2024-01-10T07:41:06.162644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
) 222
 
7.0%
( 219
 
6.9%
204
 
6.4%
152
 
4.8%
116
 
3.6%
115
 
3.6%
98
 
3.1%
72
 
2.3%
67
 
2.1%
61
 
1.9%
Other values (200) 1867
58.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2621
82.1%
Close Punctuation 222
 
7.0%
Open Punctuation 219
 
6.9%
Space Separator 115
 
3.6%
Uppercase Letter 15
 
0.5%
Connector Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
204
 
7.8%
152
 
5.8%
116
 
4.4%
98
 
3.7%
72
 
2.7%
67
 
2.6%
61
 
2.3%
56
 
2.1%
54
 
2.1%
49
 
1.9%
Other values (192) 1692
64.6%
Uppercase Letter
ValueCountFrequency (%)
C 12
80.0%
E 1
 
6.7%
L 1
 
6.7%
S 1
 
6.7%
Close Punctuation
ValueCountFrequency (%)
) 222
100.0%
Open Punctuation
ValueCountFrequency (%)
( 219
100.0%
Space Separator
ValueCountFrequency (%)
115
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2621
82.1%
Common 557
 
17.4%
Latin 15
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
204
 
7.8%
152
 
5.8%
116
 
4.4%
98
 
3.7%
72
 
2.7%
67
 
2.6%
61
 
2.3%
56
 
2.1%
54
 
2.1%
49
 
1.9%
Other values (192) 1692
64.6%
Common
ValueCountFrequency (%)
) 222
39.9%
( 219
39.3%
115
20.6%
_ 1
 
0.2%
Latin
ValueCountFrequency (%)
C 12
80.0%
E 1
 
6.7%
L 1
 
6.7%
S 1
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2621
82.1%
ASCII 572
 
17.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
) 222
38.8%
( 219
38.3%
115
20.1%
C 12
 
2.1%
_ 1
 
0.2%
E 1
 
0.2%
L 1
 
0.2%
S 1
 
0.2%
Hangul
ValueCountFrequency (%)
204
 
7.8%
152
 
5.8%
116
 
4.4%
98
 
3.7%
72
 
2.7%
67
 
2.6%
61
 
2.3%
56
 
2.1%
54
 
2.1%
49
 
1.9%
Other values (192) 1692
64.6%

폐기물종류
Text

MISSING 

Distinct67
Distinct (%)20.0%
Missing19
Missing (%)5.4%
Memory size2.9 KiB
2024-01-10T07:41:06.342713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length54
Median length49
Mean length10.755224
Min length2

Characters and Unicode

Total characters3603
Distinct characters151
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique34 ?
Unique (%)10.1%

Sample

1st row폐합성수지류(폐염화비닐수지류는 제외한다)
2nd row폐합성수지류(폐염화비닐수지류는 제외한다)
3rd row폐합성수지류(폐염화비닐수지류는 제외한다)
4th row폐콘크리트
5th row그 밖의 폐기물
ValueCountFrequency (%)
제외한다 59
 
9.9%
사업장폐기물 55
 
9.2%
폐합성수지류(폐염화비닐수지류는 54
 
9.0%
밖의 37
 
6.2%
37
 
6.2%
하수처리오니 28
 
4.7%
폐합성수지류 26
 
4.4%
폐콘크리트 23
 
3.9%
폐수처리오니 18
 
3.0%
폐흡착제 12
 
2.0%
Other values (110) 248
41.5%
2024-01-10T07:41:06.635076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
333
 
9.2%
262
 
7.3%
213
 
5.9%
159
 
4.4%
155
 
4.3%
117
 
3.2%
108
 
3.0%
99
 
2.7%
96
 
2.7%
93
 
2.6%
Other values (141) 1968
54.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3194
88.6%
Space Separator 262
 
7.3%
Open Punctuation 70
 
1.9%
Close Punctuation 70
 
1.9%
Connector Punctuation 6
 
0.2%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
333
 
10.4%
213
 
6.7%
159
 
5.0%
155
 
4.9%
117
 
3.7%
108
 
3.4%
99
 
3.1%
96
 
3.0%
93
 
2.9%
82
 
2.6%
Other values (136) 1739
54.4%
Space Separator
ValueCountFrequency (%)
262
100.0%
Open Punctuation
ValueCountFrequency (%)
( 70
100.0%
Close Punctuation
ValueCountFrequency (%)
) 70
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 6
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3194
88.6%
Common 409
 
11.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
333
 
10.4%
213
 
6.7%
159
 
5.0%
155
 
4.9%
117
 
3.7%
108
 
3.4%
99
 
3.1%
96
 
3.0%
93
 
2.9%
82
 
2.6%
Other values (136) 1739
54.4%
Common
ValueCountFrequency (%)
262
64.1%
( 70
 
17.1%
) 70
 
17.1%
_ 6
 
1.5%
. 1
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3176
88.1%
ASCII 409
 
11.4%
Compat Jamo 18
 
0.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
333
 
10.5%
213
 
6.7%
159
 
5.0%
155
 
4.9%
117
 
3.7%
108
 
3.4%
99
 
3.1%
96
 
3.0%
93
 
2.9%
82
 
2.6%
Other values (135) 1721
54.2%
ASCII
ValueCountFrequency (%)
262
64.1%
( 70
 
17.1%
) 70
 
17.1%
_ 6
 
1.5%
. 1
 
0.2%
Compat Jamo
ValueCountFrequency (%)
18
100.0%

사업자등록번호
Text

MISSING 

Distinct103
Distinct (%)38.9%
Missing89
Missing (%)25.1%
Memory size2.9 KiB
2024-01-10T07:41:06.851696image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters3180
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique53 ?
Unique (%)20.0%

Sample

1st row308-82-05074
2nd row748-87-00554
3rd row308-03-42263
4th row308-81-12243
5th row705-87-00047
ValueCountFrequency (%)
308-81-07508 28
 
10.6%
308-82-01947 23
 
8.7%
308-83-00697 11
 
4.2%
124-81-34163 9
 
3.4%
308-85-06375 7
 
2.6%
308-85-13196 6
 
2.3%
308-03-46935 6
 
2.3%
308-05-70138 6
 
2.3%
307-81-06923 5
 
1.9%
135-81-25887 5
 
1.9%
Other values (93) 159
60.0%
2024-01-10T07:41:07.165405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8 535
16.8%
- 530
16.7%
0 505
15.9%
3 387
12.2%
1 330
10.4%
7 191
 
6.0%
5 165
 
5.2%
2 150
 
4.7%
6 142
 
4.5%
4 132
 
4.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2650
83.3%
Dash Punctuation 530
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
8 535
20.2%
0 505
19.1%
3 387
14.6%
1 330
12.5%
7 191
 
7.2%
5 165
 
6.2%
2 150
 
5.7%
6 142
 
5.4%
4 132
 
5.0%
9 113
 
4.3%
Dash Punctuation
ValueCountFrequency (%)
- 530
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3180
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
8 535
16.8%
- 530
16.7%
0 505
15.9%
3 387
12.2%
1 330
10.4%
7 191
 
6.0%
5 165
 
5.2%
2 150
 
4.7%
6 142
 
4.5%
4 132
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3180
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8 535
16.8%
- 530
16.7%
0 505
15.9%
3 387
12.2%
1 330
10.4%
7 191
 
6.0%
5 165
 
5.2%
2 150
 
4.7%
6 142
 
4.5%
4 132
 
4.2%

연락처
Text

MISSING 

Distinct69
Distinct (%)38.5%
Missing175
Missing (%)49.4%
Memory size2.9 KiB
2024-01-10T07:41:07.384690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters2148
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique36 ?
Unique (%)20.1%

Sample

1st row041-837-9512
2nd row041-834-5822
3rd row041-832-9375
4th row041-835-8555
5th row041-835-0369
ValueCountFrequency (%)
041-359-7366 23
 
12.8%
041-836-7172 23
 
12.8%
041-837-5531 9
 
5.0%
041-939-1726 6
 
3.4%
041-837-7933 5
 
2.8%
041-836-7171 5
 
2.8%
041-837-0151 4
 
2.2%
041-835-9070 4
 
2.2%
041-939-1012 4
 
2.2%
041-835-1566 4
 
2.2%
Other values (59) 92
51.4%
2024-01-10T07:41:07.702400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 358
16.7%
1 280
13.0%
0 265
12.3%
3 263
12.2%
4 228
10.6%
7 183
8.5%
8 171
8.0%
5 122
 
5.7%
6 117
 
5.4%
9 86
 
4.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1790
83.3%
Dash Punctuation 358
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 280
15.6%
0 265
14.8%
3 263
14.7%
4 228
12.7%
7 183
10.2%
8 171
9.6%
5 122
6.8%
6 117
6.5%
9 86
 
4.8%
2 75
 
4.2%
Dash Punctuation
ValueCountFrequency (%)
- 358
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2148
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 358
16.7%
1 280
13.0%
0 265
12.3%
3 263
12.2%
4 228
10.6%
7 183
8.5%
8 171
8.0%
5 122
 
5.7%
6 117
 
5.4%
9 86
 
4.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2148
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 358
16.7%
1 280
13.0%
0 265
12.3%
3 263
12.2%
4 228
10.6%
7 183
8.5%
8 171
8.0%
5 122
 
5.7%
6 117
 
5.4%
9 86
 
4.0%

운반자명
Text

MISSING 

Distinct168
Distinct (%)50.3%
Missing20
Missing (%)5.6%
Memory size2.9 KiB
2024-01-10T07:41:07.944458image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length10
Mean length6.2215569
Min length1

Characters and Unicode

Total characters2078
Distinct characters196
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique114 ?
Unique (%)34.1%

Sample

1st row한빛환경
2nd row(유)계룡환경산업
3rd row햇살환경
4th row신화환경개발(주)
5th row조은환경
ValueCountFrequency (%)
공주환경 28
 
8.5%
극동산업사 16
 
4.8%
현무환경(주 14
 
4.2%
유림환경(합 11
 
3.3%
자가처리 8
 
2.4%
자연보호환경(주 7
 
2.1%
녹색건설(주 7
 
2.1%
주)동서 7
 
2.1%
주)삼엽 6
 
1.8%
신화환경개발(주 6
 
1.8%
Other values (153) 220
66.7%
2024-01-10T07:41:08.282502image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
185
 
8.9%
( 183
 
8.8%
) 183
 
8.8%
151
 
7.3%
142
 
6.8%
66
 
3.2%
64
 
3.1%
40
 
1.9%
36
 
1.7%
35
 
1.7%
Other values (186) 993
47.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1662
80.0%
Open Punctuation 183
 
8.8%
Close Punctuation 183
 
8.8%
Space Separator 35
 
1.7%
Other Symbol 5
 
0.2%
Decimal Number 5
 
0.2%
Uppercase Letter 3
 
0.1%
Other Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
185
 
11.1%
151
 
9.1%
142
 
8.5%
66
 
4.0%
64
 
3.9%
40
 
2.4%
36
 
2.2%
33
 
2.0%
28
 
1.7%
28
 
1.7%
Other values (174) 889
53.5%
Decimal Number
ValueCountFrequency (%)
0 2
40.0%
2 1
20.0%
1 1
20.0%
5 1
20.0%
Uppercase Letter
ValueCountFrequency (%)
D 1
33.3%
F 1
33.3%
R 1
33.3%
Open Punctuation
ValueCountFrequency (%)
( 183
100.0%
Close Punctuation
ValueCountFrequency (%)
) 183
100.0%
Space Separator
ValueCountFrequency (%)
35
100.0%
Other Symbol
ValueCountFrequency (%)
5
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1667
80.2%
Common 408
 
19.6%
Latin 3
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
185
 
11.1%
151
 
9.1%
142
 
8.5%
66
 
4.0%
64
 
3.8%
40
 
2.4%
36
 
2.2%
33
 
2.0%
28
 
1.7%
28
 
1.7%
Other values (175) 894
53.6%
Common
ValueCountFrequency (%)
( 183
44.9%
) 183
44.9%
35
 
8.6%
. 2
 
0.5%
0 2
 
0.5%
2 1
 
0.2%
1 1
 
0.2%
5 1
 
0.2%
Latin
ValueCountFrequency (%)
D 1
33.3%
F 1
33.3%
R 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1662
80.0%
ASCII 411
 
19.8%
None 5
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
185
 
11.1%
151
 
9.1%
142
 
8.5%
66
 
4.0%
64
 
3.9%
40
 
2.4%
36
 
2.2%
33
 
2.0%
28
 
1.7%
28
 
1.7%
Other values (174) 889
53.5%
ASCII
ValueCountFrequency (%)
( 183
44.5%
) 183
44.5%
35
 
8.5%
. 2
 
0.5%
0 2
 
0.5%
2 1
 
0.2%
1 1
 
0.2%
5 1
 
0.2%
D 1
 
0.2%
F 1
 
0.2%
None
ValueCountFrequency (%)
5
100.0%

처리업소명
Text

MISSING 

Distinct185
Distinct (%)55.4%
Missing20
Missing (%)5.6%
Memory size2.9 KiB
2024-01-10T07:41:08.479077image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length14
Mean length6.742515
Min length1

Characters and Unicode

Total characters2252
Distinct characters208
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique134 ?
Unique (%)40.1%

Sample

1st row(주)중앙이엔비
2nd row(유)계룡환경산업
3rd row(주)중앙이엔비
4th row신화환경개발(주)
5th row(주)엔아이티
ValueCountFrequency (%)
극동산업사 16
 
4.7%
신화환경개발(주 15
 
4.4%
주)이에스세종 11
 
3.2%
세명기업사 7
 
2.1%
자가 7
 
2.1%
주)인영 6
 
1.8%
극동한국산업사 6
 
1.8%
주)중부에너지공사 6
 
1.8%
두제에너지산업(주 5
 
1.5%
주)중앙이엔비 5
 
1.5%
Other values (183) 256
75.3%
2024-01-10T07:41:08.801917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
) 201
 
8.9%
( 200
 
8.9%
189
 
8.4%
96
 
4.3%
89
 
4.0%
70
 
3.1%
65
 
2.9%
57
 
2.5%
56
 
2.5%
49
 
2.2%
Other values (198) 1180
52.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1818
80.7%
Close Punctuation 201
 
8.9%
Open Punctuation 200
 
8.9%
Space Separator 12
 
0.5%
Decimal Number 11
 
0.5%
Other Symbol 4
 
0.2%
Uppercase Letter 3
 
0.1%
Other Punctuation 2
 
0.1%
Connector Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
189
 
10.4%
96
 
5.3%
89
 
4.9%
70
 
3.9%
65
 
3.6%
57
 
3.1%
56
 
3.1%
49
 
2.7%
31
 
1.7%
31
 
1.7%
Other values (183) 1085
59.7%
Decimal Number
ValueCountFrequency (%)
0 4
36.4%
2 2
18.2%
1 2
18.2%
5 1
 
9.1%
9 1
 
9.1%
4 1
 
9.1%
Uppercase Letter
ValueCountFrequency (%)
R 1
33.3%
D 1
33.3%
F 1
33.3%
Close Punctuation
ValueCountFrequency (%)
) 201
100.0%
Open Punctuation
ValueCountFrequency (%)
( 200
100.0%
Space Separator
ValueCountFrequency (%)
12
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1822
80.9%
Common 427
 
19.0%
Latin 3
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
189
 
10.4%
96
 
5.3%
89
 
4.9%
70
 
3.8%
65
 
3.6%
57
 
3.1%
56
 
3.1%
49
 
2.7%
31
 
1.7%
31
 
1.7%
Other values (184) 1089
59.8%
Common
ValueCountFrequency (%)
) 201
47.1%
( 200
46.8%
12
 
2.8%
0 4
 
0.9%
. 2
 
0.5%
2 2
 
0.5%
1 2
 
0.5%
5 1
 
0.2%
_ 1
 
0.2%
9 1
 
0.2%
Latin
ValueCountFrequency (%)
R 1
33.3%
D 1
33.3%
F 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1818
80.7%
ASCII 430
 
19.1%
None 4
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
) 201
46.7%
( 200
46.5%
12
 
2.8%
0 4
 
0.9%
. 2
 
0.5%
2 2
 
0.5%
1 2
 
0.5%
5 1
 
0.2%
_ 1
 
0.2%
9 1
 
0.2%
Other values (4) 4
 
0.9%
Hangul
ValueCountFrequency (%)
189
 
10.4%
96
 
5.3%
89
 
4.9%
70
 
3.9%
65
 
3.6%
57
 
3.1%
56
 
3.1%
49
 
2.7%
31
 
1.7%
31
 
1.7%
Other values (183) 1085
59.7%
None
ValueCountFrequency (%)
4
100.0%

처리방법
Categorical

HIGH CORRELATION 

Distinct31
Distinct (%)8.8%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
중간처분(일반소각)
53 
기타재활용
49 
중간처분(파쇄.분쇄)
32 
재활용(토질개선에 사용)
25 
매립(민간관리형매립시설)
21 
Other values (26)
174 

Length

Max length19
Median length15
Mean length10
Min length1

Unique

Unique7 ?
Unique (%)2.0%

Sample

1st row중간처분(파쇄.분쇄)
2nd row재활용(원료 제조)
3rd row중간처분(파쇄.분쇄)
4th row중간처분(파쇄.분쇄)
5th row중간처분(일반소각)

Common Values

ValueCountFrequency (%)
중간처분(일반소각) 53
15.0%
기타재활용 49
13.8%
중간처분(파쇄.분쇄) 32
 
9.0%
재활용(토질개선에 사용) 25
 
7.1%
매립(민간관리형매립시설) 21
 
5.9%
재활용(기타) 20
 
5.6%
<NA> 19
 
5.4%
재활용(중간가공폐기물 제조) 18
 
5.1%
재활용(농업생산활동에 사용) 16
 
4.5%
재활용(연료·고형연료제품 제조) 15
 
4.2%
Other values (21) 86
24.3%

Length

2024-01-10T07:41:08.935310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
중간처분(일반소각 53
 
11.4%
기타재활용 49
 
10.5%
사용 48
 
10.3%
제조 45
 
9.7%
중간처분(파쇄.분쇄 32
 
6.9%
재활용(토질개선에 25
 
5.4%
매립(민간관리형매립시설 21
 
4.5%
재활용(기타 20
 
4.3%
na 19
 
4.1%
재활용(중간가공폐기물 18
 
3.9%
Other values (25) 135
29.0%
Distinct128
Distinct (%)38.2%
Missing19
Missing (%)5.4%
Memory size2.9 KiB
2024-01-10T07:41:09.134223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length30
Mean length22.208955
Min length1

Characters and Unicode

Total characters7440
Distinct characters141
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique62 ?
Unique (%)18.5%

Sample

1st row충청남도 부여군 부여읍 성왕로 90 (한국농어촌공사)
2nd row충청남도 부여군 임천면 만가로 20-6
3rd row충청남도 부여군 석성면 증산천길 140
4th row충청남도 부여군 석성면 금백로 166-13
5th row충청남도 부여군 임천면 부흥로171번길 27
ValueCountFrequency (%)
충청남도 332
19.7%
부여군 331
19.7%
초촌면 75
 
4.5%
부여읍 62
 
3.7%
규암면 40
 
2.4%
석성면 38
 
2.3%
장암면 38
 
2.3%
신암리 31
 
1.8%
임천면 27
 
1.6%
금백로 25
 
1.5%
Other values (208) 683
40.6%
2024-01-10T07:41:09.454589image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1432
19.2%
411
 
5.5%
401
 
5.4%
374
 
5.0%
338
 
4.5%
335
 
4.5%
334
 
4.5%
332
 
4.5%
270
 
3.6%
221
 
3.0%
Other values (131) 2992
40.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4762
64.0%
Space Separator 1432
 
19.2%
Decimal Number 1145
 
15.4%
Dash Punctuation 87
 
1.2%
Open Punctuation 7
 
0.1%
Close Punctuation 7
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
411
 
8.6%
401
 
8.4%
374
 
7.9%
338
 
7.1%
335
 
7.0%
334
 
7.0%
332
 
7.0%
270
 
5.7%
221
 
4.6%
129
 
2.7%
Other values (117) 1617
34.0%
Decimal Number
ValueCountFrequency (%)
1 200
17.5%
2 174
15.2%
0 130
11.4%
7 121
10.6%
6 106
9.3%
4 102
8.9%
3 100
8.7%
8 95
8.3%
9 60
 
5.2%
5 57
 
5.0%
Space Separator
ValueCountFrequency (%)
1432
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 87
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4762
64.0%
Common 2678
36.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
411
 
8.6%
401
 
8.4%
374
 
7.9%
338
 
7.1%
335
 
7.0%
334
 
7.0%
332
 
7.0%
270
 
5.7%
221
 
4.6%
129
 
2.7%
Other values (117) 1617
34.0%
Common
ValueCountFrequency (%)
1432
53.5%
1 200
 
7.5%
2 174
 
6.5%
0 130
 
4.9%
7 121
 
4.5%
6 106
 
4.0%
4 102
 
3.8%
3 100
 
3.7%
8 95
 
3.5%
- 87
 
3.2%
Other values (4) 131
 
4.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4762
64.0%
ASCII 2678
36.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1432
53.5%
1 200
 
7.5%
2 174
 
6.5%
0 130
 
4.9%
7 121
 
4.5%
6 106
 
4.0%
4 102
 
3.8%
3 100
 
3.7%
8 95
 
3.5%
- 87
 
3.2%
Other values (4) 131
 
4.9%
Hangul
ValueCountFrequency (%)
411
 
8.6%
401
 
8.4%
374
 
7.9%
338
 
7.1%
335
 
7.0%
334
 
7.0%
332
 
7.0%
270
 
5.7%
221
 
4.6%
129
 
2.7%
Other values (117) 1617
34.0%

신고기준년도
Date

MISSING 

Distinct136
Distinct (%)40.6%
Missing19
Missing (%)5.4%
Memory size2.9 KiB
Minimum1993-01-14 00:00:00
Maximum2021-04-21 00:00:00
2024-01-10T07:41:09.569170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:41:09.688904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

데이터기준일자
Date

CONSTANT  MISSING 

Distinct1
Distinct (%)0.3%
Missing19
Missing (%)5.4%
Memory size2.9 KiB
Minimum2021-04-29 00:00:00
Maximum2021-04-29 00:00:00
2024-01-10T07:41:09.771097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:41:09.844424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Correlations

2024-01-10T07:41:09.903932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
폐기물종류연락처처리방법
폐기물종류1.0000.0000.943
연락처0.0001.0000.837
처리방법0.9430.8371.000
2024-01-10T07:41:09.981232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
처리방법폐기물구분
처리방법1.0001.000
폐기물구분1.0001.000
2024-01-10T07:41:10.059096image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
폐기물구분처리방법
폐기물구분1.0001.000
처리방법1.0001.000

Missing values

2024-01-10T07:41:05.013418image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T07:41:05.132740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-01-10T07:41:05.254404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

폐기물구분상호명폐기물종류사업자등록번호연락처운반자명처리업소명처리방법사업장도로명주소신고기준년도데이터기준일자
0사업장폐기물한국농어촌공사 부여지사폐합성수지류(폐염화비닐수지류는 제외한다)308-82-05074041-837-9512한빛환경(주)중앙이엔비중간처분(파쇄.분쇄)충청남도 부여군 부여읍 성왕로 90 (한국농어촌공사)2021-04-212021-04-29
1사업장폐기물리니어 주식회사폐합성수지류(폐염화비닐수지류는 제외한다)748-87-00554041-834-5822(유)계룡환경산업(유)계룡환경산업재활용(원료 제조)충청남도 부여군 임천면 만가로 20-62021-04-122021-04-29
2사업장폐기물수지산업폐합성수지류(폐염화비닐수지류는 제외한다)308-03-42263041-832-9375햇살환경(주)중앙이엔비중간처분(파쇄.분쇄)충청남도 부여군 석성면 증산천길 1402021-03-242021-04-29
3사업장폐기물우리레미콘폐콘크리트308-81-12243041-835-8555신화환경개발(주)신화환경개발(주)중간처분(파쇄.분쇄)충청남도 부여군 석성면 금백로 166-132021-02-232021-04-29
4사업장폐기물주식회사 서원코리아그 밖의 폐기물705-87-00047041-835-0369조은환경(주)엔아이티중간처분(일반소각)충청남도 부여군 임천면 부흥로171번길 272021-01-072021-04-29
5사업장폐기물진흥기업(주)폐수처리오니106-81-32769041-931-8441구항산업(주)제이에이그린매립(민간관리형매립시설)충청남도 보령시 미산면 만수로 1170-72020-11-062021-04-29
6사업장폐기물(주)크린환경폐합성수지류(폐염화비닐수지류는 제외한다)722-85-01071041-832-5598(주)중앙이엔비(주)중앙이엔비중간처분(파쇄.분쇄)충청남도 부여군 초촌면 신암로 4102020-10-162021-04-29
7사업장폐기물(주)크린환경폐합성수지류(폐염화비닐수지류는 제외한다)722-85-01071041-832-5598(주)크린환경(주)중앙이엔비중간처분(파쇄.분쇄)충청남도 부여군 초촌면 신암로 4102020-10-162021-04-29
8사업장폐기물대한폴리텍(주)폐합성수지류(폐염화비닐수지류는 제외한다)137-81-46966031-989-2350녹색건설(주)새한환경(주)중간처분(일반소각)충청남도 부여군 임천면 부흥로171번길 272020-05-182021-04-29
9사업장폐기물충남아스콘(주)폐콘크리트137-81-46966031-989-2350신화환경개발(주)신화환경개발(주)중간처분(파쇄.분쇄)충청남도 부여군 석성면 증산리 1316번지 외3필지2020-03-202021-04-29
폐기물구분상호명폐기물종류사업자등록번호연락처운반자명처리업소명처리방법사업장도로명주소신고기준년도데이터기준일자
344<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
345<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
346<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
347<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
348<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
349<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
350<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
351<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
352<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
353<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>

Duplicate rows

Most frequently occurring

폐기물구분상호명폐기물종류사업자등록번호연락처운반자명처리업소명처리방법사업장도로명주소신고기준년도데이터기준일자# duplicates
15<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>19
0사업장폐기물(주)건양기술공사 건축사사무소하수처리오니308-83-00697<NA>공주환경미륵농장재활용(토질개선에 사용)충청남도 부여군 부여읍 백마강길 2392004-05-252021-04-292
1사업장폐기물(주)건양기술공사 건축사사무소하수처리오니<NA><NA>공주환경토룡농장재활용(토질개선에 사용)충청남도 부여군 부여읍 백마강길 2392004-05-252021-04-292
2사업장폐기물(주)대성석재사업장폐기물<NA><NA>극동산업사극동산업사기타재활용충청남도 부여군 초촌면 신암리 2-23번지1996-12-202021-04-292
3사업장폐기물(주)비봉이앤지폐합성수지류308-05-46680041-837-7474(주)제이엔텍삼우산업중간처분(절단)충청남도 부여군 석성면 증산로 952011-03-182021-04-292
4사업장폐기물다산대리석(합)사업장폐기물<NA><NA>금강환경(주)(주)인영기타재활용충청남도 부여군 초촌면 신암리 1-4번지2000-01-312021-04-292
5사업장폐기물대창석재사업장폐기물308-81-07508041-836-7172극동산업사극동산업사기타재활용충청남도 부여군 초촌면 신암리 2-40번지1995-01-062021-04-292
6사업장폐기물동광석재사업장폐기물308-81-07508041-836-7172인영인영기타재활용충청남도 부여군 초촌면 신암리 2-14번지1995-01-062021-04-292
7사업장폐기물동산석재사업장폐기물<NA><NA>극동산업사극동산업사기타재활용충청남도 부여군 초촌면 신암리 2-24번지1994-12-312021-04-292
8사업장폐기물동양석재석재ㆍ골재폐수처리오니(석재ㆍ골재 생산 시 발생한 폐수를 처리하는 과정에서 발생한 오니로 한정한다)<NA><NA>삼홍조경건설삼홍조경건설재활용(성토재·복토재 등으로 사용)충청남도 부여군 외산면 무량로 51997-05-282021-04-292