Overview

Dataset statistics

Number of variables3
Number of observations278
Missing cells0
Missing cells (%)0.0%
Duplicate rows26
Duplicate rows (%)9.4%
Total size in memory6.6 KiB
Average record size in memory24.5 B

Variable types

Text3

Dataset

Description사업장 폐기물 신고현황
Author강원도 홍천군
URLhttps://www.data.go.kr/data/15061495/fileData.do

Alerts

Dataset has 26 (9.4%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 23:24:59.496263
Analysis finished2023-12-12 23:25:00.164267
Duration0.67 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

상호
Text

Distinct130
Distinct (%)46.8%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2023-12-13T08:25:00.413517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length14
Mean length8.8956835
Min length3

Characters and Unicode

Total characters2473
Distinct characters213
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique84 ?
Unique (%)30.2%

Sample

1st row세이지우드 홍천
2nd row사람과안전건설화재에너지연구원
3rd row사람과안전건설화재에너지연구원
4th row힐드로사이 주식회사
5th row탁자원
ValueCountFrequency (%)
하이트진로(주)강원공장 23
 
7.6%
환경시설관리주식회사 18
 
6.0%
주)소노호텔앤리조트 11
 
3.6%
강원도시가스(주 10
 
3.3%
홍천 7
 
2.3%
홍천군청 7
 
2.3%
육군제5397부대 7
 
2.3%
주식회사 6
 
2.0%
주)대명티피앤이 6
 
2.0%
한국도로공사홍천지사 6
 
2.0%
Other values (128) 201
66.6%
2023-12-13T08:25:00.887515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
193
 
7.8%
) 182
 
7.4%
( 182
 
7.4%
60
 
2.4%
55
 
2.2%
54
 
2.2%
50
 
2.0%
47
 
1.9%
46
 
1.9%
43
 
1.7%
Other values (203) 1561
63.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2030
82.1%
Close Punctuation 182
 
7.4%
Open Punctuation 182
 
7.4%
Decimal Number 44
 
1.8%
Space Separator 24
 
1.0%
Uppercase Letter 7
 
0.3%
Other Symbol 3
 
0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
193
 
9.5%
60
 
3.0%
55
 
2.7%
54
 
2.7%
50
 
2.5%
47
 
2.3%
46
 
2.3%
43
 
2.1%
41
 
2.0%
41
 
2.0%
Other values (184) 1400
69.0%
Decimal Number
ValueCountFrequency (%)
5 10
22.7%
9 9
20.5%
3 9
20.5%
7 8
18.2%
1 4
 
9.1%
6 2
 
4.5%
2 1
 
2.3%
0 1
 
2.3%
Uppercase Letter
ValueCountFrequency (%)
C 2
28.6%
E 1
14.3%
N 1
14.3%
G 1
14.3%
S 1
14.3%
K 1
14.3%
Close Punctuation
ValueCountFrequency (%)
) 182
100.0%
Open Punctuation
ValueCountFrequency (%)
( 182
100.0%
Space Separator
ValueCountFrequency (%)
24
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2033
82.2%
Common 433
 
17.5%
Latin 7
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
193
 
9.5%
60
 
3.0%
55
 
2.7%
54
 
2.7%
50
 
2.5%
47
 
2.3%
46
 
2.3%
43
 
2.1%
41
 
2.0%
41
 
2.0%
Other values (185) 1403
69.0%
Common
ValueCountFrequency (%)
) 182
42.0%
( 182
42.0%
24
 
5.5%
5 10
 
2.3%
9 9
 
2.1%
3 9
 
2.1%
7 8
 
1.8%
1 4
 
0.9%
6 2
 
0.5%
2 1
 
0.2%
Other values (2) 2
 
0.5%
Latin
ValueCountFrequency (%)
C 2
28.6%
E 1
14.3%
N 1
14.3%
G 1
14.3%
S 1
14.3%
K 1
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2030
82.1%
ASCII 440
 
17.8%
None 3
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
193
 
9.5%
60
 
3.0%
55
 
2.7%
54
 
2.7%
50
 
2.5%
47
 
2.3%
46
 
2.3%
43
 
2.1%
41
 
2.0%
41
 
2.0%
Other values (184) 1400
69.0%
ASCII
ValueCountFrequency (%)
) 182
41.4%
( 182
41.4%
24
 
5.5%
5 10
 
2.3%
9 9
 
2.0%
3 9
 
2.0%
7 8
 
1.8%
1 4
 
0.9%
6 2
 
0.5%
C 2
 
0.5%
Other values (8) 8
 
1.8%
None
ValueCountFrequency (%)
3
100.0%
Distinct95
Distinct (%)34.2%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2023-12-13T08:25:01.134888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length42
Median length35
Mean length18.388489
Min length1

Characters and Unicode

Total characters5112
Distinct characters172
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique52 ?
Unique (%)18.7%

Sample

1st row강원도 홍천군 두촌면 광석로 898-160
2nd row강원도 홍천군 북방면 송학정로 23-42
3rd row강원도 홍천군 북방면 송학정로 23-42
4th row강원도 홍천군 남면 한서로 2840
5th row강원도 홍천군 북방면 팔봉산로 2493 (외2필지)
ValueCountFrequency (%)
강원도 213
18.2%
홍천군 210
18.0%
홍천읍 92
 
7.9%
북방면 64
 
5.5%
12 26
 
2.2%
서면 25
 
2.1%
소매곡길 25
 
2.1%
도둔길 23
 
2.0%
49 23
 
2.0%
한치골길 21
 
1.8%
Other values (181) 447
38.2%
2023-12-13T08:25:01.556103image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1009
19.7%
325
 
6.4%
323
 
6.3%
252
 
4.9%
225
 
4.4%
215
 
4.2%
213
 
4.2%
2 158
 
3.1%
153
 
3.0%
120
 
2.3%
Other values (162) 2119
41.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3261
63.8%
Space Separator 1009
 
19.7%
Decimal Number 688
 
13.5%
Dash Punctuation 52
 
1.0%
Close Punctuation 38
 
0.7%
Open Punctuation 38
 
0.7%
Other Punctuation 18
 
0.4%
Connector Punctuation 8
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
325
 
10.0%
323
 
9.9%
252
 
7.7%
225
 
6.9%
215
 
6.6%
213
 
6.5%
153
 
4.7%
120
 
3.7%
94
 
2.9%
81
 
2.5%
Other values (146) 1260
38.6%
Decimal Number
ValueCountFrequency (%)
2 158
23.0%
1 115
16.7%
4 79
11.5%
3 78
11.3%
6 61
 
8.9%
9 55
 
8.0%
7 40
 
5.8%
8 40
 
5.8%
0 39
 
5.7%
5 23
 
3.3%
Space Separator
ValueCountFrequency (%)
1009
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 52
100.0%
Close Punctuation
ValueCountFrequency (%)
) 38
100.0%
Open Punctuation
ValueCountFrequency (%)
( 38
100.0%
Other Punctuation
ValueCountFrequency (%)
: 18
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3261
63.8%
Common 1851
36.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
325
 
10.0%
323
 
9.9%
252
 
7.7%
225
 
6.9%
215
 
6.6%
213
 
6.5%
153
 
4.7%
120
 
3.7%
94
 
2.9%
81
 
2.5%
Other values (146) 1260
38.6%
Common
ValueCountFrequency (%)
1009
54.5%
2 158
 
8.5%
1 115
 
6.2%
4 79
 
4.3%
3 78
 
4.2%
6 61
 
3.3%
9 55
 
3.0%
- 52
 
2.8%
7 40
 
2.2%
8 40
 
2.2%
Other values (6) 164
 
8.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3261
63.8%
ASCII 1851
36.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1009
54.5%
2 158
 
8.5%
1 115
 
6.2%
4 79
 
4.3%
3 78
 
4.2%
6 61
 
3.3%
9 55
 
3.0%
- 52
 
2.8%
7 40
 
2.2%
8 40
 
2.2%
Other values (6) 164
 
8.9%
Hangul
ValueCountFrequency (%)
325
 
10.0%
323
 
9.9%
252
 
7.7%
225
 
6.9%
215
 
6.6%
213
 
6.5%
153
 
4.7%
120
 
3.7%
94
 
2.9%
81
 
2.5%
Other values (146) 1260
38.6%
Distinct58
Distinct (%)20.9%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2023-12-13T08:25:01.820590image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length84
Median length49
Mean length14.201439
Min length3

Characters and Unicode

Total characters3948
Distinct characters169
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique22 ?
Unique (%)7.9%

Sample

1st row임목폐목재(건설공사_ 산지개간 등의 과정에서 발생된 나무뿌리_ 가지_ 줄기 등을 말한다)
2nd row폐벽돌
3rd row폐합성수지류(폐염화비닐수지류는 제외한다)
4th row폐합성수지류(폐염화비닐수지류는 제외한다)
5th row폐합성수지류(폐염화비닐수지류는 제외한다)
ValueCountFrequency (%)
제외한다 51
 
7.0%
43
 
5.9%
밖의 43
 
5.9%
폐합성수지류(폐염화비닐수지류는 39
 
5.3%
폐목재류 33
 
4.5%
1등급 27
 
3.7%
폐수처리오니 21
 
2.9%
폐합성수지류 20
 
2.7%
폐콘크리트 19
 
2.6%
말한다 15
 
2.1%
Other values (124) 420
57.5%
2023-12-13T08:25:02.257654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
455
 
11.5%
267
 
6.8%
167
 
4.2%
147
 
3.7%
120
 
3.0%
106
 
2.7%
100
 
2.5%
95
 
2.4%
93
 
2.4%
85
 
2.2%
Other values (159) 2313
58.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3273
82.9%
Space Separator 455
 
11.5%
Close Punctuation 75
 
1.9%
Open Punctuation 75
 
1.9%
Decimal Number 35
 
0.9%
Connector Punctuation 25
 
0.6%
Other Punctuation 10
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
267
 
8.2%
167
 
5.1%
147
 
4.5%
120
 
3.7%
106
 
3.2%
100
 
3.1%
95
 
2.9%
93
 
2.8%
85
 
2.6%
81
 
2.5%
Other values (150) 2012
61.5%
Close Punctuation
ValueCountFrequency (%)
) 71
94.7%
4
 
5.3%
Open Punctuation
ValueCountFrequency (%)
( 71
94.7%
4
 
5.3%
Decimal Number
ValueCountFrequency (%)
1 33
94.3%
8 2
 
5.7%
Space Separator
ValueCountFrequency (%)
455
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 25
100.0%
Other Punctuation
ValueCountFrequency (%)
. 10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3273
82.9%
Common 675
 
17.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
267
 
8.2%
167
 
5.1%
147
 
4.5%
120
 
3.7%
106
 
3.2%
100
 
3.1%
95
 
2.9%
93
 
2.8%
85
 
2.6%
81
 
2.5%
Other values (150) 2012
61.5%
Common
ValueCountFrequency (%)
455
67.4%
) 71
 
10.5%
( 71
 
10.5%
1 33
 
4.9%
_ 25
 
3.7%
. 10
 
1.5%
4
 
0.6%
4
 
0.6%
8 2
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3249
82.3%
ASCII 667
 
16.9%
Compat Jamo 24
 
0.6%
None 8
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
455
68.2%
) 71
 
10.6%
( 71
 
10.6%
1 33
 
4.9%
_ 25
 
3.7%
. 10
 
1.5%
8 2
 
0.3%
Hangul
ValueCountFrequency (%)
267
 
8.2%
167
 
5.1%
147
 
4.5%
120
 
3.7%
106
 
3.3%
100
 
3.1%
95
 
2.9%
93
 
2.9%
85
 
2.6%
81
 
2.5%
Other values (149) 1988
61.2%
Compat Jamo
ValueCountFrequency (%)
24
100.0%
None
ValueCountFrequency (%)
4
50.0%
4
50.0%

Correlations

2023-12-13T08:25:02.671360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업장도로명주소폐기물 종류
사업장도로명주소1.0000.921
폐기물 종류0.9211.000

Missing values

2023-12-13T08:25:00.037227image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:25:00.124372image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

상호사업장도로명주소폐기물 종류
0세이지우드 홍천강원도 홍천군 두촌면 광석로 898-160임목폐목재(건설공사_ 산지개간 등의 과정에서 발생된 나무뿌리_ 가지_ 줄기 등을 말한다)
1사람과안전건설화재에너지연구원강원도 홍천군 북방면 송학정로 23-42폐벽돌
2사람과안전건설화재에너지연구원강원도 홍천군 북방면 송학정로 23-42폐합성수지류(폐염화비닐수지류는 제외한다)
3힐드로사이 주식회사강원도 홍천군 남면 한서로 2840폐합성수지류(폐염화비닐수지류는 제외한다)
4탁자원강원도 홍천군 북방면 팔봉산로 2493 (외2필지)폐합성수지류(폐염화비닐수지류는 제외한다)
5세이지우드 홍천강원도 홍천군 두촌면 광석로 898-160폐합성수지류(폐염화비닐수지류는 제외한다)
6(주)권선개발강원도 홍천군 북방면 영서로 3080-22석재ㆍ골재폐수처리오니(석재ㆍ골재 생산 시 발생한 폐수를 처리하는 과정에서 발생한 오니로 한정한다)
7은성산업 주식회사강원도 홍천군 북방면 영서로 3014-15석재ㆍ골재폐수처리오니(석재ㆍ골재 생산 시 발생한 폐수를 처리하는 과정에서 발생한 오니로 한정한다)
8은성산업 주식회사강원도 홍천군 북방면 영서로 3014-15석재ㆍ골재폐수처리오니(석재ㆍ골재 생산 시 발생한 폐수를 처리하는 과정에서 발생한 오니로 한정한다)
9(주)산돌식품강원도 홍천군 홍천읍 설악로 1009-9그 밖의 폐수처리오니
상호사업장도로명주소폐기물 종류
268하이트진로(주)강원공장강원도 홍천군 북방면 도둔길 49폐합성수지류(폐염화비닐수지류는 제외한다)
269하이트진로(주)강원공장강원도 홍천군 북방면 도둔길 49그 밖의 공정오니
270하이트진로(주)강원공장강원도 홍천군 북방면 도둔길 49그 밖의 무기성오니
271하이트진로(주)강원공장강원도 홍천군 북방면 도둔길 49폐합성수지류(폐염화비닐수지류는 제외한다)
272(주)고려가구강원도 홍천군 홍천읍 농공단지길 23-26폐목재류
273(주)고려가구강원도 홍천군 홍천읍 농공단지길 23-26폐종이류
274홍천 아산병원강원도 홍천군 홍천읍 산림공원1길 17폐수처리오니
275홍천 아산병원강원도 홍천군 홍천읍 산림공원1길 17종량제봉투 배출 폐기물(합성수지 종량제 봉투에 배출되는 폐기물을 말한다)
276홍천 아산병원강원도 홍천군 홍천읍 산림공원1길 17폐합성수지류(폐염화비닐수지류는 제외한다)
277홍천 아산병원강원도 홍천군 홍천읍 산림공원1길 17폐유리병류(「자원의 절약과 재활용촉진에 관한 법률 시행령」 제18조제1호에 해당하는 것을 말한다)

Duplicate rows

Most frequently occurring

상호사업장도로명주소폐기물 종류# duplicates
19하이트진로(주)강원공장강원도 홍천군 북방면 도둔길 49그 밖의 폐수처리오니8
20하이트진로(주)강원공장강원도 홍천군 북방면 도둔길 49폐합성수지류(폐염화비닐수지류는 제외한다)7
9강원도시가스(주)강원도 홍천군 북방면 소매곡길 12가축분뇨처리오니6
24환경시설관리주식회사강원도 홍천군 북방면 소매곡길 12 (폐기물배출현장:홍천축분 및 분뇨처리장)분뇨처리오니6
25환경시설관리주식회사강원도 홍천군 북방면 소매곡길 12 (폐기물배출현장:홍천하수처리장)하수처리오니6
2(주)소노호텔앤리조트강원도 홍천군 서면 한치골길 262하수처리오니4
13육군제5397부대폐목재류(원목의 용도 그대로 사용하는 나무뿌리ㆍ가지 등을 제거한 원줄기는 제외한다.)3
14육군제5397부대폐합성수지류3
22홍천군청(관광레저과)강원도 홍천군 홍천읍 석화로 93폐합성수지류3
0(주)대우건설폐목재류 1등급2