Overview

Dataset statistics

Number of variables4
Number of observations560
Missing cells0
Missing cells (%)0.0%
Duplicate rows54
Duplicate rows (%)9.6%
Total size in memory17.6 KiB
Average record size in memory32.2 B

Variable types

Text2
Categorical2

Dataset

Description보령시의 사업장 폐기물 배출자 신고현황 데이터 입니다. 상호명, 폐기물 종류(폐석재, 무기성오니, 폐합성수지류, 폐목재류 등), 처리방법(재활용, 중간처분 등) 항목으로 구성되어있습니다.
Author충청남도 보령시
URLhttps://www.data.go.kr/data/15081216/fileData.do

Alerts

데이터기준일 has constant value ""Constant
Dataset has 54 (9.6%) duplicate rowsDuplicates

Reproduction

Analysis started2024-04-13 12:25:29.923824
Analysis finished2024-04-13 12:25:32.947390
Duration3.02 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

상호
Text

Distinct193
Distinct (%)34.5%
Missing0
Missing (%)0.0%
Memory size4.5 KiB
2024-04-13T21:25:33.595140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length14
Mean length9.9875
Min length3

Characters and Unicode

Total characters5593
Distinct characters238
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique93 ?
Unique (%)16.6%

Sample

1st row(구)아시아테크
2nd row(사)웅천농공단지 입주기업체협의체
3rd row(사)웅천농공단지 입주기업체협의체
4th row(주)건양
5th row(주)경성산업
ValueCountFrequency (%)
한국중부발전(주 60
 
8.7%
신보령발전본부 60
 
8.7%
한국중부발전(주)보령발전본부 59
 
8.5%
코리아휠(주)보령공장 18
 
2.6%
주식회사 16
 
2.3%
삼원환경산업(주 14
 
2.0%
㈜함라 13
 
1.9%
보령시시설관리공단 13
 
1.9%
주)보령환경산업 12
 
1.7%
한국중부발전주식회사보령화력본부 11
 
1.6%
Other values (205) 416
60.1%
2024-04-13T21:25:34.591253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
364
 
6.5%
) 344
 
6.2%
( 344
 
6.2%
274
 
4.9%
264
 
4.7%
263
 
4.7%
258
 
4.6%
257
 
4.6%
164
 
2.9%
158
 
2.8%
Other values (228) 2903
51.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4713
84.3%
Close Punctuation 344
 
6.2%
Open Punctuation 344
 
6.2%
Space Separator 132
 
2.4%
Other Symbol 39
 
0.7%
Decimal Number 11
 
0.2%
Uppercase Letter 10
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
364
 
7.7%
274
 
5.8%
264
 
5.6%
263
 
5.6%
258
 
5.5%
257
 
5.5%
164
 
3.5%
158
 
3.4%
136
 
2.9%
130
 
2.8%
Other values (215) 2445
51.9%
Uppercase Letter
ValueCountFrequency (%)
F 3
30.0%
R 2
20.0%
P 2
20.0%
B 1
 
10.0%
K 1
 
10.0%
A 1
 
10.0%
Decimal Number
ValueCountFrequency (%)
2 9
81.8%
8 1
 
9.1%
5 1
 
9.1%
Close Punctuation
ValueCountFrequency (%)
) 344
100.0%
Open Punctuation
ValueCountFrequency (%)
( 344
100.0%
Space Separator
ValueCountFrequency (%)
132
100.0%
Other Symbol
ValueCountFrequency (%)
39
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4752
85.0%
Common 831
 
14.9%
Latin 10
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
364
 
7.7%
274
 
5.8%
264
 
5.6%
263
 
5.5%
258
 
5.4%
257
 
5.4%
164
 
3.5%
158
 
3.3%
136
 
2.9%
130
 
2.7%
Other values (216) 2484
52.3%
Common
ValueCountFrequency (%)
) 344
41.4%
( 344
41.4%
132
 
15.9%
2 9
 
1.1%
8 1
 
0.1%
5 1
 
0.1%
Latin
ValueCountFrequency (%)
F 3
30.0%
R 2
20.0%
P 2
20.0%
B 1
 
10.0%
K 1
 
10.0%
A 1
 
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4713
84.3%
ASCII 841
 
15.0%
None 39
 
0.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
364
 
7.7%
274
 
5.8%
264
 
5.6%
263
 
5.6%
258
 
5.5%
257
 
5.5%
164
 
3.5%
158
 
3.4%
136
 
2.9%
130
 
2.8%
Other values (215) 2445
51.9%
ASCII
ValueCountFrequency (%)
) 344
40.9%
( 344
40.9%
132
 
15.7%
2 9
 
1.1%
F 3
 
0.4%
R 2
 
0.2%
P 2
 
0.2%
B 1
 
0.1%
K 1
 
0.1%
A 1
 
0.1%
Other values (2) 2
 
0.2%
None
ValueCountFrequency (%)
39
100.0%
Distinct99
Distinct (%)17.7%
Missing0
Missing (%)0.0%
Memory size4.5 KiB
2024-04-13T21:25:35.437016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length88
Median length70
Mean length11.946429
Min length2

Characters and Unicode

Total characters6690
Distinct characters196
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique45 ?
Unique (%)8.0%

Sample

1st row폐합성수지
2nd row폐활성탄
3rd row폐흡착제
4th row폐콘크리트
5th row폐수처리오니
ValueCountFrequency (%)
밖의 94
 
7.4%
94
 
7.4%
제외한다 76
 
6.0%
폐합성수지류(폐염화비닐수지류는 72
 
5.7%
발생한 52
 
4.1%
폐수처리오니 49
 
3.9%
석탄재 48
 
3.8%
과정에서 29
 
2.3%
폐합성수지류 29
 
2.3%
처리하는 26
 
2.1%
Other values (177) 693
54.9%
2024-04-13T21:25:36.514033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
708
 
10.6%
547
 
8.2%
306
 
4.6%
289
 
4.3%
260
 
3.9%
205
 
3.1%
200
 
3.0%
172
 
2.6%
144
 
2.2%
141
 
2.1%
Other values (186) 3718
55.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5694
85.1%
Space Separator 708
 
10.6%
Open Punctuation 122
 
1.8%
Close Punctuation 122
 
1.8%
Connector Punctuation 38
 
0.6%
Decimal Number 5
 
0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
547
 
9.6%
306
 
5.4%
289
 
5.1%
260
 
4.6%
205
 
3.6%
200
 
3.5%
172
 
3.0%
144
 
2.5%
141
 
2.5%
136
 
2.4%
Other values (177) 3294
57.9%
Open Punctuation
ValueCountFrequency (%)
( 121
99.2%
1
 
0.8%
Close Punctuation
ValueCountFrequency (%)
) 121
99.2%
1
 
0.8%
Decimal Number
ValueCountFrequency (%)
1 4
80.0%
7 1
 
20.0%
Space Separator
ValueCountFrequency (%)
708
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 38
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5694
85.1%
Common 996
 
14.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
547
 
9.6%
306
 
5.4%
289
 
5.1%
260
 
4.6%
205
 
3.6%
200
 
3.5%
172
 
3.0%
144
 
2.5%
141
 
2.5%
136
 
2.4%
Other values (177) 3294
57.9%
Common
ValueCountFrequency (%)
708
71.1%
( 121
 
12.1%
) 121
 
12.1%
_ 38
 
3.8%
1 4
 
0.4%
. 1
 
0.1%
1
 
0.1%
1
 
0.1%
7 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5638
84.3%
ASCII 994
 
14.9%
Compat Jamo 56
 
0.8%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
708
71.2%
( 121
 
12.2%
) 121
 
12.2%
_ 38
 
3.8%
1 4
 
0.4%
. 1
 
0.1%
7 1
 
0.1%
Hangul
ValueCountFrequency (%)
547
 
9.7%
306
 
5.4%
289
 
5.1%
260
 
4.6%
205
 
3.6%
200
 
3.5%
172
 
3.1%
144
 
2.6%
141
 
2.5%
136
 
2.4%
Other values (176) 3238
57.4%
Compat Jamo
ValueCountFrequency (%)
56
100.0%
None
ValueCountFrequency (%)
1
50.0%
1
50.0%

처리방법
Categorical

Distinct38
Distinct (%)6.8%
Missing0
Missing (%)0.0%
Memory size4.5 KiB
재활용(직접 제품제조)
81 
매립(민간관리형매립시설)
79 
중간처분(일반소각)
77 
재활용(원료 제조)
46 
재활용(중간가공폐기물 제조)
46 
Other values (33)
231 

Length

Max length19
Median length15
Mean length11.576786
Min length2

Unique

Unique14 ?
Unique (%)2.5%

Sample

1st row재활용(기타)
2nd row재활용(직접 제품제조)
3rd row재활용(직접 제품제조)
4th row중간처분(파쇄.분쇄)
5th row재활용(기타)

Common Values

ValueCountFrequency (%)
재활용(직접 제품제조) 81
14.5%
매립(민간관리형매립시설) 79
14.1%
중간처분(일반소각) 77
13.8%
재활용(원료 제조) 46
8.2%
재활용(중간가공폐기물 제조) 46
8.2%
재활용(기타) 43
7.7%
재활용(성토재·복토재 등으로 사용) 31
 
5.5%
중간처분(파쇄.분쇄) 23
 
4.1%
재활용(파쇄.분쇄) 22
 
3.9%
재활용(연료·고형연료제품 제조) 19
 
3.4%
Other values (28) 93
16.6%

Length

2024-04-13T21:25:36.759808image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
제조 111
13.3%
재활용(직접 84
10.1%
제품제조 81
9.7%
매립(민간관리형매립시설 79
9.5%
중간처분(일반소각 77
9.2%
재활용(원료 46
 
5.5%
재활용(중간가공폐기물 46
 
5.5%
사용 46
 
5.5%
재활용(기타 43
 
5.1%
재활용(성토재·복토재 31
 
3.7%
Other values (33) 191
22.9%

데이터기준일
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.5 KiB
2024-04-08
560 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2024-04-08
2nd row2024-04-08
3rd row2024-04-08
4th row2024-04-08
5th row2024-04-08

Common Values

ValueCountFrequency (%)
2024-04-08 560
100.0%

Length

2024-04-13T21:25:36.981290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-13T21:25:37.148199image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2024-04-08 560
100.0%

Correlations

2024-04-13T21:25:37.277930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
폐기물 종류처리방법
폐기물 종류1.0000.950
처리방법0.9501.000

Missing values

2024-04-13T21:25:32.521029image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-13T21:25:32.818079image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

상호폐기물 종류처리방법데이터기준일
0(구)아시아테크폐합성수지재활용(기타)2024-04-08
1(사)웅천농공단지 입주기업체협의체폐활성탄재활용(직접 제품제조)2024-04-08
2(사)웅천농공단지 입주기업체협의체폐흡착제재활용(직접 제품제조)2024-04-08
3(주)건양폐콘크리트중간처분(파쇄.분쇄)2024-04-08
4(주)경성산업폐수처리오니재활용(기타)2024-04-08
5(주)다원피씨에스폐콘크리트중간처분(파쇄.분쇄)2024-04-08
6(주)대광폐수처리오니재활용(기타)2024-04-08
7(주)대광폐석재류재활용(기타)2024-04-08
8(주)대천리조트식물성잔재물재활용(사료화)2024-04-08
9(주)대천리조트식물성잔재물재활용(퇴비화)2024-04-08
상호폐기물 종류처리방법데이터기준일
550홈플러스(주)보령점폐식용유(식용을 목적으로 식품 재료와 원료를 제조ㆍ조리ㆍ가공하거나 식용유를 유통ㆍ사용 또는 음식물류 폐기물을 처리하는 과정에서 발생하는 기름을 말한다)재활용(중간가공폐기물 제조)2024-04-08
551홈플러스(주)보령점그 밖의 폐기물중간처분(일반소각)2024-04-08
552홍일씨푸드(주)수산물가공잔재물재활용(사료화)2024-04-08
553환경시설관리공사사업장폐기물매립(민간관리형매립시설)2024-04-08
554환경시설관리공사오니류해역배출2024-04-08
555환경시설관리공사사업장폐기물매립(민간관리형매립시설)2024-04-08
556환경시설관리공사오니류재활용(퇴비화)2024-04-08
557환경시설관리공사사업장폐기물매립(민간관리형매립시설)2024-04-08
558환경시설관리공사 보령사업소폐수처리오니기타재활용2024-04-08
559황금석재폐수처리오니재활용(기타)2024-04-08

Duplicate rows

Most frequently occurring

상호폐기물 종류처리방법데이터기준일# duplicates
47한국중부발전(주)보령발전본부석탄재재활용(직접 제품제조)2024-04-0824
33한국중부발전(주) 신보령발전본부석탄재재활용(직접 제품제조)2024-04-0815
11금화식품(주)보령공장그 밖의 동물성잔재물재활용(농업생산활동에 사용)2024-04-084
29한국중부발전(주) 신보령발전본부그 밖의 폐기물중간처분(일반소각)2024-04-084
35한국중부발전(주) 신보령발전본부폐수처리오니매립(민간관리형매립시설)2024-04-084
4(주)은포산업개발폐합성수지류(폐염화비닐수지류는 제외한다)재활용(직접 에너지회수)2024-04-083
5(주)테크로스 환경서비스그 밖의 폐수처리오니재활용(토질개선에 사용)2024-04-083
7㈜함라그 밖의 폐목재류재활용(원료 제조)2024-04-083
27코리아휠(주)보령공장폐흡착제재활용(직접 제품제조)2024-04-083
46한국중부발전(주)보령발전본부석탄재재활용(성토재·복토재 등으로 사용)2024-04-083