Overview

Dataset statistics

Number of variables10
Number of observations457
Missing cells0
Missing cells (%)0.0%
Duplicate rows56
Duplicate rows (%)12.3%
Total size in memory36.3 KiB
Average record size in memory81.3 B

Variable types

Categorical2
Text6
Numeric1
DateTime1

Dataset

Description충청북도 증평군의 사업장 폐기물 배출자 신고현황(사업장명, 주소, 폐기물종류, 운반업체, 처리업체 등)입니다.
URLhttps://www.data.go.kr/data/15060381/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
Dataset has 56 (12.3%) duplicate rowsDuplicates
신고년도 is highly overall correlated with 구분High correlation
구분 is highly overall correlated with 신고년도 and 1 other fieldsHigh correlation
처리방법 is highly overall correlated with 구분High correlation

Reproduction

Analysis started2023-12-12 03:02:21.274274
Analysis finished2023-12-12 03:02:23.131482
Duration1.86 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size3.7 KiB
사업장일반
273 
지정
184 

Length

Max length5
Median length5
Mean length3.7921225
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row사업장일반
2nd row사업장일반
3rd row사업장일반
4th row사업장일반
5th row사업장일반

Common Values

ValueCountFrequency (%)
사업장일반 273
59.7%
지정 184
40.3%

Length

2023-12-12T12:02:23.217918image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:02:23.367115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사업장일반 273
59.7%
지정 184
40.3%

상호
Text

Distinct104
Distinct (%)22.8%
Missing0
Missing (%)0.0%
Memory size3.7 KiB
2023-12-12T12:02:23.723252image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length13
Mean length10.225383
Min length3

Characters and Unicode

Total characters4673
Distinct characters203
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique22 ?
Unique (%)4.8%

Sample

1st row풍남레미콘(주)
2nd row풍남레미콘(주)
3rd row성산정밀(주)
4th row성산정밀(주)
5th row(주)한국알미늄
ValueCountFrequency (%)
주식회사 29
 
4.6%
의료법인 28
 
4.4%
증평연세병원 24
 
3.8%
청진의료재단 24
 
3.8%
의)여덕의료재단 24
 
3.8%
농업회사법인 19
 
3.0%
증평요양병원 16
 
2.5%
증평공장 16
 
2.5%
에스케이아이이테크놀로지(주 15
 
2.4%
기아오토큐 14
 
2.2%
Other values (107) 427
67.1%
2023-12-12T12:02:24.340804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
270
 
5.8%
( 249
 
5.3%
) 249
 
5.3%
191
 
4.1%
159
 
3.4%
159
 
3.4%
150
 
3.2%
137
 
2.9%
108
 
2.3%
85
 
1.8%
Other values (193) 2916
62.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3968
84.9%
Open Punctuation 249
 
5.3%
Close Punctuation 249
 
5.3%
Space Separator 191
 
4.1%
Decimal Number 10
 
0.2%
Uppercase Letter 6
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
270
 
6.8%
159
 
4.0%
159
 
4.0%
150
 
3.8%
137
 
3.5%
108
 
2.7%
85
 
2.1%
82
 
2.1%
81
 
2.0%
78
 
2.0%
Other values (186) 2659
67.0%
Uppercase Letter
ValueCountFrequency (%)
G 2
33.3%
N 2
33.3%
E 2
33.3%
Open Punctuation
ValueCountFrequency (%)
( 249
100.0%
Close Punctuation
ValueCountFrequency (%)
) 249
100.0%
Space Separator
ValueCountFrequency (%)
191
100.0%
Decimal Number
ValueCountFrequency (%)
2 10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3968
84.9%
Common 699
 
15.0%
Latin 6
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
270
 
6.8%
159
 
4.0%
159
 
4.0%
150
 
3.8%
137
 
3.5%
108
 
2.7%
85
 
2.1%
82
 
2.1%
81
 
2.0%
78
 
2.0%
Other values (186) 2659
67.0%
Common
ValueCountFrequency (%)
( 249
35.6%
) 249
35.6%
191
27.3%
2 10
 
1.4%
Latin
ValueCountFrequency (%)
G 2
33.3%
N 2
33.3%
E 2
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3968
84.9%
ASCII 705
 
15.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
270
 
6.8%
159
 
4.0%
159
 
4.0%
150
 
3.8%
137
 
3.5%
108
 
2.7%
85
 
2.1%
82
 
2.1%
81
 
2.0%
78
 
2.0%
Other values (186) 2659
67.0%
ASCII
ValueCountFrequency (%)
( 249
35.3%
) 249
35.3%
191
27.1%
2 10
 
1.4%
G 2
 
0.3%
N 2
 
0.3%
E 2
 
0.3%
Distinct76
Distinct (%)16.6%
Missing0
Missing (%)0.0%
Memory size3.7 KiB
2023-12-12T12:02:24.707914image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length84
Median length66
Mean length16.157549
Min length2

Characters and Unicode

Total characters7384
Distinct characters202
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique32 ?
Unique (%)7.0%

Sample

1st row폐콘크리트
2nd row폐수처리오니
3rd row폐합성수지류
4th row분진
5th row폐가구류_ 폐도장목_ 폐목재포장재_ 폐전선드럼(접착제_ 페인트_ 기름_ 콘크리트 등의 물질이 사용된 목재를 말한다)
ValueCountFrequency (%)
127
 
10.3%
밖의 127
 
10.3%
제외한다 120
 
9.7%
폐합성수지류(폐염화비닐수지류는 90
 
7.3%
폐수처리오니 31
 
2.5%
폐유 30
 
2.4%
말한다 29
 
2.3%
경우는 27
 
2.2%
재활용하는 26
 
2.1%
조직물류폐기물(태반을 26
 
2.1%
Other values (139) 604
48.8%
2023-12-12T12:02:25.245980image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
781
 
10.6%
525
 
7.1%
329
 
4.5%
253
 
3.4%
245
 
3.3%
203
 
2.7%
202
 
2.7%
194
 
2.6%
193
 
2.6%
164
 
2.2%
Other values (192) 4295
58.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6066
82.2%
Space Separator 781
 
10.6%
Open Punctuation 173
 
2.3%
Close Punctuation 173
 
2.3%
Lowercase Letter 90
 
1.2%
Connector Punctuation 55
 
0.7%
Decimal Number 46
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
525
 
8.7%
329
 
5.4%
253
 
4.2%
245
 
4.0%
203
 
3.3%
202
 
3.3%
194
 
3.2%
193
 
3.2%
164
 
2.7%
160
 
2.6%
Other values (175) 3598
59.3%
Lowercase Letter
ValueCountFrequency (%)
e 30
33.3%
a 15
16.7%
g 15
16.7%
r 15
16.7%
s 15
16.7%
Decimal Number
ValueCountFrequency (%)
2 18
39.1%
0 15
32.6%
1 7
 
15.2%
8 6
 
13.0%
Open Punctuation
ValueCountFrequency (%)
( 151
87.3%
[ 15
 
8.7%
7
 
4.0%
Close Punctuation
ValueCountFrequency (%)
) 151
87.3%
] 15
 
8.7%
7
 
4.0%
Space Separator
ValueCountFrequency (%)
781
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 55
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6066
82.2%
Common 1228
 
16.6%
Latin 90
 
1.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
525
 
8.7%
329
 
5.4%
253
 
4.2%
245
 
4.0%
203
 
3.3%
202
 
3.3%
194
 
3.2%
193
 
3.2%
164
 
2.7%
160
 
2.6%
Other values (175) 3598
59.3%
Common
ValueCountFrequency (%)
781
63.6%
( 151
 
12.3%
) 151
 
12.3%
_ 55
 
4.5%
2 18
 
1.5%
] 15
 
1.2%
[ 15
 
1.2%
0 15
 
1.2%
1 7
 
0.6%
7
 
0.6%
Other values (2) 13
 
1.1%
Latin
ValueCountFrequency (%)
e 30
33.3%
a 15
16.7%
g 15
16.7%
r 15
16.7%
s 15
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5993
81.2%
ASCII 1304
 
17.7%
Compat Jamo 73
 
1.0%
None 14
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
781
59.9%
( 151
 
11.6%
) 151
 
11.6%
_ 55
 
4.2%
e 30
 
2.3%
2 18
 
1.4%
a 15
 
1.2%
g 15
 
1.2%
r 15
 
1.2%
] 15
 
1.2%
Other values (5) 58
 
4.4%
Hangul
ValueCountFrequency (%)
525
 
8.8%
329
 
5.5%
253
 
4.2%
245
 
4.1%
203
 
3.4%
202
 
3.4%
194
 
3.2%
193
 
3.2%
164
 
2.7%
160
 
2.7%
Other values (174) 3525
58.8%
Compat Jamo
ValueCountFrequency (%)
73
100.0%
None
ValueCountFrequency (%)
7
50.0%
7
50.0%
Distinct94
Distinct (%)20.6%
Missing0
Missing (%)0.0%
Memory size3.7 KiB
2023-12-12T12:02:25.556202image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length12
Mean length11.875274
Min length6

Characters and Unicode

Total characters5427
Distinct characters19
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique17 ?
Unique (%)3.7%

Sample

1st row043-836-7805
2nd row043-836-7805
3rd row043-836-0750
4th row043-836-0750
5th row043-836-8801
ValueCountFrequency (%)
043-760-8885 28
 
6.1%
043-909-9000 24
 
5.2%
043-838-9771 24
 
5.2%
043-820-1797 15
 
3.3%
043-838-8584 14
 
3.0%
043-820-8225 13
 
2.8%
043-836-0025 13
 
2.8%
043-836-9771 12
 
2.6%
개인정보포함 11
 
2.4%
043-838-9112 11
 
2.4%
Other values (85) 295
64.1%
2023-12-12T12:02:26.065110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 937
17.3%
- 892
16.4%
3 800
14.7%
8 735
13.5%
4 522
9.6%
7 299
 
5.5%
2 257
 
4.7%
6 253
 
4.7%
1 231
 
4.3%
5 222
 
4.1%
Other values (9) 279
 
5.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4463
82.2%
Dash Punctuation 892
 
16.4%
Other Letter 66
 
1.2%
Connector Punctuation 3
 
0.1%
Space Separator 3
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 937
21.0%
3 800
17.9%
8 735
16.5%
4 522
11.7%
7 299
 
6.7%
2 257
 
5.8%
6 253
 
5.7%
1 231
 
5.2%
5 222
 
5.0%
9 207
 
4.6%
Other Letter
ValueCountFrequency (%)
11
16.7%
11
16.7%
11
16.7%
11
16.7%
11
16.7%
11
16.7%
Dash Punctuation
ValueCountFrequency (%)
- 892
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3
100.0%
Space Separator
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5361
98.8%
Hangul 66
 
1.2%

Most frequent character per script

Common
ValueCountFrequency (%)
0 937
17.5%
- 892
16.6%
3 800
14.9%
8 735
13.7%
4 522
9.7%
7 299
 
5.6%
2 257
 
4.8%
6 253
 
4.7%
1 231
 
4.3%
5 222
 
4.1%
Other values (3) 213
 
4.0%
Hangul
ValueCountFrequency (%)
11
16.7%
11
16.7%
11
16.7%
11
16.7%
11
16.7%
11
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5361
98.8%
Hangul 66
 
1.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 937
17.5%
- 892
16.6%
3 800
14.9%
8 735
13.7%
4 522
9.7%
7 299
 
5.6%
2 257
 
4.8%
6 253
 
4.7%
1 231
 
4.3%
5 222
 
4.1%
Other values (3) 213
 
4.0%
Hangul
ValueCountFrequency (%)
11
16.7%
11
16.7%
11
16.7%
11
16.7%
11
16.7%
11
16.7%
Distinct154
Distinct (%)33.7%
Missing0
Missing (%)0.0%
Memory size3.7 KiB
2023-12-12T12:02:26.383347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length10
Mean length7.1356674
Min length1

Characters and Unicode

Total characters3261
Distinct characters186
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique82 ?
Unique (%)17.9%

Sample

1st row우진환경
2nd row자가처리
3rd row백로환경
4th row백로환경
5th row금성
ValueCountFrequency (%)
주식회사 56
 
10.9%
메디코청호 37
 
7.2%
지파트너스 36
 
7.0%
주)그린환경산업 23
 
4.5%
주훈이알씨(주 15
 
2.9%
현무환경(주 13
 
2.5%
주식회사메덱스더블유 12
 
2.3%
지파트너 12
 
2.3%
주)지파트너스 12
 
2.3%
주식회사메덱더블유 12
 
2.3%
Other values (145) 285
55.6%
2023-12-12T12:02:26.818307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
330
 
10.1%
( 226
 
6.9%
) 226
 
6.9%
111
 
3.4%
104
 
3.2%
104
 
3.2%
91
 
2.8%
87
 
2.7%
85
 
2.6%
82
 
2.5%
Other values (176) 1815
55.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2722
83.5%
Open Punctuation 226
 
6.9%
Close Punctuation 226
 
6.9%
Space Separator 61
 
1.9%
Uppercase Letter 14
 
0.4%
Other Punctuation 9
 
0.3%
Lowercase Letter 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
330
 
12.1%
111
 
4.1%
104
 
3.8%
104
 
3.8%
91
 
3.3%
87
 
3.2%
85
 
3.1%
82
 
3.0%
81
 
3.0%
72
 
2.6%
Other values (163) 1575
57.9%
Uppercase Letter
ValueCountFrequency (%)
S 4
28.6%
E 4
28.6%
R 2
14.3%
N 2
14.3%
C 2
14.3%
Lowercase Letter
ValueCountFrequency (%)
c 1
33.3%
e 1
33.3%
o 1
33.3%
Other Punctuation
ValueCountFrequency (%)
. 8
88.9%
· 1
 
11.1%
Open Punctuation
ValueCountFrequency (%)
( 226
100.0%
Close Punctuation
ValueCountFrequency (%)
) 226
100.0%
Space Separator
ValueCountFrequency (%)
61
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2722
83.5%
Common 522
 
16.0%
Latin 17
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
330
 
12.1%
111
 
4.1%
104
 
3.8%
104
 
3.8%
91
 
3.3%
87
 
3.2%
85
 
3.1%
82
 
3.0%
81
 
3.0%
72
 
2.6%
Other values (163) 1575
57.9%
Latin
ValueCountFrequency (%)
S 4
23.5%
E 4
23.5%
R 2
11.8%
N 2
11.8%
C 2
11.8%
c 1
 
5.9%
e 1
 
5.9%
o 1
 
5.9%
Common
ValueCountFrequency (%)
( 226
43.3%
) 226
43.3%
61
 
11.7%
. 8
 
1.5%
· 1
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2722
83.5%
ASCII 538
 
16.5%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
330
 
12.1%
111
 
4.1%
104
 
3.8%
104
 
3.8%
91
 
3.3%
87
 
3.2%
85
 
3.1%
82
 
3.0%
81
 
3.0%
72
 
2.6%
Other values (163) 1575
57.9%
ASCII
ValueCountFrequency (%)
( 226
42.0%
) 226
42.0%
61
 
11.3%
. 8
 
1.5%
S 4
 
0.7%
E 4
 
0.7%
R 2
 
0.4%
N 2
 
0.4%
C 2
 
0.4%
c 1
 
0.2%
Other values (2) 2
 
0.4%
None
ValueCountFrequency (%)
· 1
100.0%
Distinct184
Distinct (%)40.3%
Missing0
Missing (%)0.0%
Memory size3.7 KiB
2023-12-12T12:02:27.208605image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length12
Mean length8.0940919
Min length1

Characters and Unicode

Total characters3699
Distinct characters206
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique121 ?
Unique (%)26.5%

Sample

1st row우진환경
2nd row자가
3rd row아남환경
4th row아남환경
5th row충북자원 재생산업
ValueCountFrequency (%)
주)스테리싸이클코리아 55
 
11.6%
주)메디코 44
 
9.3%
주)이메디원 26
 
5.5%
주)다나에너지솔루션 18
 
3.8%
우진환경개발(주 14
 
2.9%
주)그린환경산업 13
 
2.7%
케이지이티에스(주 9
 
1.9%
자)정풍 9
 
1.9%
주훈리사이클링 7
 
1.5%
주)에코비트에너지세종 6
 
1.3%
Other values (182) 274
57.7%
2023-12-12T12:02:27.668897image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
373
 
10.1%
) 365
 
9.9%
( 365
 
9.9%
185
 
5.0%
151
 
4.1%
140
 
3.8%
100
 
2.7%
91
 
2.5%
91
 
2.5%
75
 
2.0%
Other values (196) 1763
47.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2942
79.5%
Close Punctuation 365
 
9.9%
Open Punctuation 365
 
9.9%
Space Separator 23
 
0.6%
Uppercase Letter 2
 
0.1%
Dash Punctuation 1
 
< 0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
373
 
12.7%
185
 
6.3%
151
 
5.1%
140
 
4.8%
100
 
3.4%
91
 
3.1%
91
 
3.1%
75
 
2.5%
74
 
2.5%
72
 
2.4%
Other values (189) 1590
54.0%
Uppercase Letter
ValueCountFrequency (%)
G 1
50.0%
R 1
50.0%
Close Punctuation
ValueCountFrequency (%)
) 365
100.0%
Open Punctuation
ValueCountFrequency (%)
( 365
100.0%
Space Separator
ValueCountFrequency (%)
23
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2942
79.5%
Common 755
 
20.4%
Latin 2
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
373
 
12.7%
185
 
6.3%
151
 
5.1%
140
 
4.8%
100
 
3.4%
91
 
3.1%
91
 
3.1%
75
 
2.5%
74
 
2.5%
72
 
2.4%
Other values (189) 1590
54.0%
Common
ValueCountFrequency (%)
) 365
48.3%
( 365
48.3%
23
 
3.0%
- 1
 
0.1%
& 1
 
0.1%
Latin
ValueCountFrequency (%)
G 1
50.0%
R 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2942
79.5%
ASCII 757
 
20.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
373
 
12.7%
185
 
6.3%
151
 
5.1%
140
 
4.8%
100
 
3.4%
91
 
3.1%
91
 
3.1%
75
 
2.5%
74
 
2.5%
72
 
2.4%
Other values (189) 1590
54.0%
ASCII
ValueCountFrequency (%)
) 365
48.2%
( 365
48.2%
23
 
3.0%
- 1
 
0.1%
G 1
 
0.1%
& 1
 
0.1%
R 1
 
0.1%

처리방법
Categorical

HIGH CORRELATION 

Distinct22
Distinct (%)4.8%
Missing0
Missing (%)0.0%
Memory size3.7 KiB
중간처분(일반소각)
202 
재활용(중간가공폐기물 제조)
74 
재활용(농업생산활동에 사용)
41 
재활용(원료 제조)
40 
재활용(연료·고형연료제품 제조)
31 
Other values (17)
69 

Length

Max length19
Median length10
Mean length11.991247
Min length1

Unique

Unique8 ?
Unique (%)1.8%

Sample

1st row재활용(파쇄.분쇄)
2nd row재활용(원료가공)
3rd row중간처분(일반소각)
4th row매립(민간관리형매립시설)
5th row재활용(연료·고형연료제품 제조)

Common Values

ValueCountFrequency (%)
중간처분(일반소각) 202
44.2%
재활용(중간가공폐기물 제조) 74
 
16.2%
재활용(농업생산활동에 사용) 41
 
9.0%
재활용(원료 제조) 40
 
8.8%
재활용(연료·고형연료제품 제조) 31
 
6.8%
매립(민간관리형매립시설) 20
 
4.4%
재활용(직접 제품제조) 12
 
2.6%
중간처분(고온소각) 10
 
2.2%
매립(관리형매립시설) 5
 
1.1%
재활용(토질개선에 사용) 4
 
0.9%
Other values (12) 18
 
3.9%

Length

2023-12-12T12:02:27.828887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
중간처분(일반소각 202
30.2%
제조 145
21.7%
재활용(중간가공폐기물 74
 
11.1%
사용 48
 
7.2%
재활용(농업생산활동에 41
 
6.1%
재활용(원료 40
 
6.0%
재활용(연료·고형연료제품 31
 
4.6%
매립(민간관리형매립시설 20
 
3.0%
재활용(직접 12
 
1.8%
제품제조 12
 
1.8%
Other values (16) 44
 
6.6%
Distinct96
Distinct (%)21.0%
Missing0
Missing (%)0.0%
Memory size3.7 KiB
2023-12-12T12:02:28.106394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length36
Median length34
Mean length22.97593
Min length19

Characters and Unicode

Total characters10500
Distinct characters150
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique20 ?
Unique (%)4.4%

Sample

1st row충청북도 증평군 도안면 모래재로 78
2nd row충청북도 증평군 도안면 모래재로 78
3rd row충청북도 증평군 증평읍 두산로 40
4th row충청북도 증평군 증평읍 두산로 40
5th row충청북도 증평군 도안면 원명로 45
ValueCountFrequency (%)
충청북도 457
18.7%
증평군 457
18.7%
증평읍 283
 
11.6%
도안면 174
 
7.1%
중앙로 119
 
4.9%
114 54
 
2.2%
증평2산단로 52
 
2.1%
원명로 36
 
1.5%
삼보로 26
 
1.1%
149 24
 
1.0%
Other values (142) 765
31.3%
2023-12-12T12:02:28.537906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2003
19.1%
822
 
7.8%
816
 
7.8%
637
 
6.1%
467
 
4.4%
462
 
4.4%
460
 
4.4%
457
 
4.4%
411
 
3.9%
1 308
 
2.9%
Other values (140) 3657
34.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6691
63.7%
Space Separator 2003
 
19.1%
Decimal Number 1469
 
14.0%
Connector Punctuation 118
 
1.1%
Dash Punctuation 93
 
0.9%
Uppercase Letter 46
 
0.4%
Close Punctuation 37
 
0.4%
Open Punctuation 37
 
0.4%
Other Punctuation 6
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
822
12.3%
816
12.2%
637
 
9.5%
467
 
7.0%
462
 
6.9%
460
 
6.9%
457
 
6.8%
411
 
6.1%
283
 
4.2%
175
 
2.6%
Other values (120) 1701
25.4%
Decimal Number
ValueCountFrequency (%)
1 308
21.0%
4 233
15.9%
2 192
13.1%
3 177
12.0%
5 142
9.7%
0 112
 
7.6%
6 84
 
5.7%
8 78
 
5.3%
7 76
 
5.2%
9 67
 
4.6%
Uppercase Letter
ValueCountFrequency (%)
K 21
45.7%
S 15
32.6%
L 8
 
17.4%
B 2
 
4.3%
Space Separator
ValueCountFrequency (%)
2003
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 118
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 93
100.0%
Close Punctuation
ValueCountFrequency (%)
) 37
100.0%
Open Punctuation
ValueCountFrequency (%)
( 37
100.0%
Other Punctuation
ValueCountFrequency (%)
& 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6691
63.7%
Common 3763
35.8%
Latin 46
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
822
12.3%
816
12.2%
637
 
9.5%
467
 
7.0%
462
 
6.9%
460
 
6.9%
457
 
6.8%
411
 
6.1%
283
 
4.2%
175
 
2.6%
Other values (120) 1701
25.4%
Common
ValueCountFrequency (%)
2003
53.2%
1 308
 
8.2%
4 233
 
6.2%
2 192
 
5.1%
3 177
 
4.7%
5 142
 
3.8%
_ 118
 
3.1%
0 112
 
3.0%
- 93
 
2.5%
6 84
 
2.2%
Other values (6) 301
 
8.0%
Latin
ValueCountFrequency (%)
K 21
45.7%
S 15
32.6%
L 8
 
17.4%
B 2
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6691
63.7%
ASCII 3809
36.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2003
52.6%
1 308
 
8.1%
4 233
 
6.1%
2 192
 
5.0%
3 177
 
4.6%
5 142
 
3.7%
_ 118
 
3.1%
0 112
 
2.9%
- 93
 
2.4%
6 84
 
2.2%
Other values (10) 347
 
9.1%
Hangul
ValueCountFrequency (%)
822
12.3%
816
12.2%
637
 
9.5%
467
 
7.0%
462
 
6.9%
460
 
6.9%
457
 
6.8%
411
 
6.1%
283
 
4.2%
175
 
2.6%
Other values (120) 1701
25.4%

신고년도
Real number (ℝ)

HIGH CORRELATION 

Distinct23
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2013.7484
Minimum1999
Maximum2023
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.1 KiB
2023-12-12T12:02:28.683413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1999
5-th percentile2002
Q12007
median2016
Q32019
95-th percentile2022
Maximum2023
Range24
Interquartile range (IQR)12

Descriptive statistics

Standard deviation6.4140849
Coefficient of variation (CV)0.0031851472
Kurtosis-1.0551132
Mean2013.7484
Median Absolute Deviation (MAD)5
Skewness-0.43339258
Sum920283
Variance41.140485
MonotonicityNot monotonic
2023-12-12T12:02:28.843121image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=23)
ValueCountFrequency (%)
2006 55
12.0%
2007 48
10.5%
2021 47
10.3%
2018 46
10.1%
2016 37
 
8.1%
2019 33
 
7.2%
2022 30
 
6.6%
2017 24
 
5.3%
2011 21
 
4.6%
2010 19
 
4.2%
Other values (13) 97
21.2%
ValueCountFrequency (%)
1999 2
 
0.4%
2000 10
 
2.2%
2001 5
 
1.1%
2002 11
 
2.4%
2004 1
 
0.2%
2005 2
 
0.4%
2006 55
12.0%
2007 48
10.5%
2009 2
 
0.4%
2010 19
 
4.2%
ValueCountFrequency (%)
2023 5
 
1.1%
2022 30
6.6%
2021 47
10.3%
2020 19
4.2%
2019 33
7.2%
2018 46
10.1%
2017 24
5.3%
2016 37
8.1%
2015 6
 
1.3%
2014 13
 
2.8%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size3.7 KiB
Minimum2023-04-27 00:00:00
Maximum2023-04-27 00:00:00
2023-12-12T12:02:29.071877image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:02:29.175957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-12T12:02:22.643747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T12:02:29.259591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분폐기물 종류전화번호처리방법사업장도로명주소(지번포함)신고년도
구분1.0001.0000.9960.8150.9960.767
폐기물 종류1.0001.0000.9320.9620.9210.817
전화번호0.9960.9321.0000.9661.0000.997
처리방법0.8150.9620.9661.0000.9620.645
사업장도로명주소(지번포함)0.9960.9211.0000.9621.0000.996
신고년도0.7670.8170.9970.6450.9961.000
2023-12-12T12:02:29.380095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분처리방법
구분1.0000.658
처리방법0.6581.000
2023-12-12T12:02:29.476660image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
신고년도구분처리방법
신고년도1.0000.6000.270
구분0.6001.0000.658
처리방법0.2700.6581.000

Missing values

2023-12-12T12:02:22.832555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T12:02:23.040079image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분상호폐기물 종류전화번호운반자처리업소명처리방법사업장도로명주소(지번포함)신고년도데이터기준일자
0사업장일반풍남레미콘(주)폐콘크리트043-836-7805우진환경우진환경재활용(파쇄.분쇄)충청북도 증평군 도안면 모래재로 7819992023-04-27
1사업장일반풍남레미콘(주)폐수처리오니043-836-7805자가처리자가재활용(원료가공)충청북도 증평군 도안면 모래재로 7819992023-04-27
2사업장일반성산정밀(주)폐합성수지류043-836-0750백로환경아남환경중간처분(일반소각)충청북도 증평군 증평읍 두산로 4020002023-04-27
3사업장일반성산정밀(주)분진043-836-0750백로환경아남환경매립(민간관리형매립시설)충청북도 증평군 증평읍 두산로 4020002023-04-27
4사업장일반(주)한국알미늄폐가구류_ 폐도장목_ 폐목재포장재_ 폐전선드럼(접착제_ 페인트_ 기름_ 콘크리트 등의 물질이 사용된 목재를 말한다)043-836-8801금성충북자원 재생산업재활용(연료·고형연료제품 제조)충청북도 증평군 도안면 원명로 4520002023-04-27
5사업장일반(주)한국알미늄사업장폐기물 소각시설 소각재(바닥재와 비산재가 혼합된 경우를 말한다)043-836-8801미래이.에스(주)(주)제이에이그린매립(민간관리형매립시설)충청북도 증평군 도안면 원명로 4520002023-04-27
6사업장일반(주)한국알미늄폐합성수지류(폐염화비닐수지류는 제외한다)043-836-8801상훈(자)정우리사이클링(주)재활용(중간가공폐기물 제조)충청북도 증평군 도안면 원명로 4520002023-04-27
7사업장일반(주)한국알미늄그 밖의 공정오니043-836-8801한세이프에너지(주)한세이프에너지(주)중간처분(일반소각)충청북도 증평군 도안면 원명로 4520002023-04-27
8사업장일반(주)한국알미늄그 밖의 분진043-836-8801청풍환경한맥테코산업매립(민간관리형매립시설)충청북도 증평군 도안면 원명로 4520002023-04-27
9사업장일반(주)한국알미늄폐합성수지류(폐염화비닐수지류는 제외한다)043-836-8801명원테크명원테크재활용(중간가공폐기물 제조)충청북도 증평군 도안면 원명로 4520002023-04-27
구분상호폐기물 종류전화번호운반자처리업소명처리방법사업장도로명주소(지번포함)신고년도데이터기준일자
447지정(주)피유팩토리그 밖의 폐유기용제043-838-0650(주)삼동그린대양그린산업(주)재활용(연료·고형연료제품 제조)충청북도 증평군 도안면 증평2산단로 57 (주)피유팩토리20222023-04-27
448지정(주)피유팩토리그 밖의 폐유기용제043-838-0650(주)에이치아이에너지에이스켐(주)재활용(연료·고형연료제품 제조)충청북도 증평군 도안면 증평2산단로 57 (주)피유팩토리20222023-04-27
449지정(의)여덕의료재단증평장례식장손상성폐기물043-838-6663(주)메덱스더블유(주)이메디원중간처분(일반소각)충청북도 증평군 증평읍 중앙로 114_ 지하 1층20222023-04-27
450지정(의)여덕의료재단증평장례식장일반의료폐기물043-838-6663(주)메덱스더블유(주)이메디원중간처분(일반소각)충청북도 증평군 증평읍 중앙로 114_ 지하 1층20222023-04-27
451지정에스디바이오센서 주식회사손상성폐기물031-300-0400주식회사 세안이에스(주)스테리사이클 코리아중간처분(일반소각)충청북도 증평군 증평읍 증평산단로 1420222023-04-27
452지정에스디바이오센서 주식회사생물ㆍ화학폐기물031-300-0400주식회사 세안이에스(주)스테리사이클 코리아중간처분(일반소각)충청북도 증평군 증평읍 증평산단로 1420222023-04-27
453지정에스디바이오센서 주식회사병리계폐기물031-300-0400주식회사 세안이에스(주)스테리사이클 코리아중간처분(일반소각)충청북도 증평군 증평읍 증평산단로 1420222023-04-27
454지정에스디바이오센서 주식회사생물ㆍ화학폐기물031-300-0400주식회사 세안이에스(주)스테리사이클 코리아중간처분(일반소각)충청북도 증평군 증평읍 증평산단로 1420222023-04-27
455지정증평군청그 밖의 폐농약043-835-3646천지이에스(주)신대한정유산업(주)중간처분(고온소각)충청북도 증평군 증평읍 광장로 88_ 증평군청20222023-04-27
456지정증평군청그 밖의 폐농약043-835-3646천지이에스(주)신대한정유산업(주)중간처분(고온소각)충청북도 증평군 증평읍 광장로 88_ 증평군청20222023-04-27

Duplicate rows

Most frequently occurring

구분상호폐기물 종류전화번호운반자처리업소명처리방법사업장도로명주소(지번포함)신고년도데이터기준일자# duplicates
0사업장일반(주)두산전자사업증평공장폐합성수지류(폐염화비닐수지류는 제외한다)043-820-8225(주)유진유포리아(주)유진유포리아재활용(원료 제조)충청북도 증평군 증평읍 두산로 4020062023-04-272
1사업장일반(주)퓨엠폐합성수지류(폐염화비닐수지류는 제외한다)043-838-9562현무환경(주)(주)다나에너지솔루션중간처분(일반소각)충청북도 증평군 도안면 증평2산단로 5320192023-04-272
2사업장일반(주)퓨엠폐합성수지류(폐염화비닐수지류는 제외한다)043-838-9562현무환경(주)(주)중부에너지공사재활용(연료·고형연료제품 제조)충청북도 증평군 도안면 증평2산단로 5320192023-04-272
3사업장일반(주)퓨엠폐합성수지류(폐염화비닐수지류는 제외한다)043-838-9562현무환경(주)아세아환경(주)재활용(중간가공폐기물 제조)충청북도 증평군 도안면 증평2산단로 5320192023-04-272
4사업장일반(주)퓨엠 증평2지점폐합성수지류(폐염화비닐수지류는 제외한다)043-838-9572현무환경(주)(주)에코비트에너지세종재활용(중간가공폐기물 제조)충청북도 증평군 도안면 증평2산단로 19720222023-04-272
5사업장일반(주)퓨엠 증평2지점폐합성수지류(폐염화비닐수지류는 제외한다)043-838-9572현무환경(주)(주)에코비트에너지세종중간처분(일반소각)충청북도 증평군 도안면 증평2산단로 19720222023-04-272
6사업장일반(주)퓨엠 증평2지점폐합성수지류(폐염화비닐수지류는 제외한다)043-838-9572현무환경(주)그린에코넥서스(주)평택사업소재활용(연료·고형연료제품 제조)충청북도 증평군 도안면 증평2산단로 19720222023-04-272
7지정(의)여덕의료재단 증평병원격리의료폐기물043-760-8885주식회사메덱더블유(주)이메디원중간처분(일반소각)충청북도 증평군 증평읍 중앙로 11420212023-04-272
8지정(의)여덕의료재단 증평병원병리계폐기물043-760-8885주식회사메덱더블유(주)이메디원중간처분(일반소각)충청북도 증평군 증평읍 중앙로 11420212023-04-272
9지정(의)여덕의료재단 증평병원생물ㆍ화학폐기물043-760-8885주식회사메덱더블유(주)이메디원중간처분(일반소각)충청북도 증평군 증평읍 중앙로 11420212023-04-272