Overview

Dataset statistics

Number of variables6
Number of observations123
Missing cells0
Missing cells (%)0.0%
Duplicate rows19
Duplicate rows (%)15.4%
Total size in memory5.9 KiB
Average record size in memory49.1 B

Variable types

Text3
DateTime1
Categorical2

Dataset

Description경상북도 영덕군에 위치한 사업장폐기물배출자 신고업체 상호, 연락처, 주소, 신고일, 폐기물 종류 등 사업장폐기물 배출자 신고현황 데이터를 아래와 같이 제공하고자 합니다.
URLhttps://www.data.go.kr/data/15067635/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
Dataset has 19 (15.4%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 04:02:33.761113
Analysis finished2023-12-12 04:02:34.439081
Duration0.68 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

상호
Text

Distinct54
Distinct (%)43.9%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-12T13:02:34.626116image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length14
Mean length8.2276423
Min length4

Characters and Unicode

Total characters1012
Distinct characters132
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique26 ?
Unique (%)21.1%

Sample

1st row주식회사 국원건설
2nd row주식회사 국원건설
3rd row주식회사 국원건설
4th row영덕군청
5th row노락의료재단 영덕제일요양병원
ValueCountFrequency (%)
주)신화제2공장 10
 
7.6%
주)미성종합환경 9
 
6.8%
영덕공공하수처리시설 6
 
4.5%
고든통상(주)강구공장 5
 
3.8%
대호수산(주 5
 
3.8%
주식회사 4
 
3.0%
영해공공하수처리시설 4
 
3.0%
광명환경(주 4
 
3.0%
영덕군수산물건조영어조합 4
 
3.0%
영덕군청 3
 
2.3%
Other values (50) 78
59.1%
2023-12-12T13:02:35.205760image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
78
 
7.7%
( 76
 
7.5%
) 76
 
7.5%
41
 
4.1%
41
 
4.1%
32
 
3.2%
31
 
3.1%
29
 
2.9%
24
 
2.4%
19
 
1.9%
Other values (122) 565
55.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 838
82.8%
Open Punctuation 76
 
7.5%
Close Punctuation 76
 
7.5%
Decimal Number 13
 
1.3%
Space Separator 9
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
78
 
9.3%
41
 
4.9%
41
 
4.9%
32
 
3.8%
31
 
3.7%
29
 
3.5%
24
 
2.9%
19
 
2.3%
19
 
2.3%
19
 
2.3%
Other values (116) 505
60.3%
Decimal Number
ValueCountFrequency (%)
2 10
76.9%
1 2
 
15.4%
3 1
 
7.7%
Open Punctuation
ValueCountFrequency (%)
( 76
100.0%
Close Punctuation
ValueCountFrequency (%)
) 76
100.0%
Space Separator
ValueCountFrequency (%)
9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 838
82.8%
Common 174
 
17.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
78
 
9.3%
41
 
4.9%
41
 
4.9%
32
 
3.8%
31
 
3.7%
29
 
3.5%
24
 
2.9%
19
 
2.3%
19
 
2.3%
19
 
2.3%
Other values (116) 505
60.3%
Common
ValueCountFrequency (%)
( 76
43.7%
) 76
43.7%
2 10
 
5.7%
9
 
5.2%
1 2
 
1.1%
3 1
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 838
82.8%
ASCII 174
 
17.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
78
 
9.3%
41
 
4.9%
41
 
4.9%
32
 
3.8%
31
 
3.7%
29
 
3.5%
24
 
2.9%
19
 
2.3%
19
 
2.3%
19
 
2.3%
Other values (116) 505
60.3%
ASCII
ValueCountFrequency (%)
( 76
43.7%
) 76
43.7%
2 10
 
5.7%
9
 
5.2%
1 2
 
1.1%
3 1
 
0.6%
Distinct62
Distinct (%)50.4%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-12T13:02:35.582033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.98374
Min length9

Characters and Unicode

Total characters1474
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique36 ?
Unique (%)29.3%

Sample

1st row032-431-6488
2nd row032-431-6488
3rd row032-431-6488
4th row054-730-6182
5th row054-733-1771
ValueCountFrequency (%)
054-734-8885 9
 
7.3%
054-734-6545 6
 
4.8%
054-733-6902 5
 
4.0%
054-733-2192 5
 
4.0%
054-733-7968 4
 
3.2%
054-733-3367 4
 
3.2%
054-734-0492 4
 
3.2%
054-734-5250 3
 
2.4%
054-732-8131 3
 
2.4%
054-730-6205 3
 
2.4%
Other values (53) 78
62.9%
2023-12-12T13:02:36.170153image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 243
16.5%
4 221
15.0%
3 205
13.9%
0 183
12.4%
5 175
11.9%
7 161
10.9%
6 70
 
4.7%
2 70
 
4.7%
8 61
 
4.1%
1 50
 
3.4%
Other values (2) 35
 
2.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1230
83.4%
Dash Punctuation 243
 
16.5%
Space Separator 1
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 221
18.0%
3 205
16.7%
0 183
14.9%
5 175
14.2%
7 161
13.1%
6 70
 
5.7%
2 70
 
5.7%
8 61
 
5.0%
1 50
 
4.1%
9 34
 
2.8%
Dash Punctuation
ValueCountFrequency (%)
- 243
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1474
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 243
16.5%
4 221
15.0%
3 205
13.9%
0 183
12.4%
5 175
11.9%
7 161
10.9%
6 70
 
4.7%
2 70
 
4.7%
8 61
 
4.1%
1 50
 
3.4%
Other values (2) 35
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1474
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 243
16.5%
4 221
15.0%
3 205
13.9%
0 183
12.4%
5 175
11.9%
7 161
10.9%
6 70
 
4.7%
2 70
 
4.7%
8 61
 
4.1%
1 50
 
3.4%
Other values (2) 35
 
2.4%
Distinct56
Distinct (%)45.5%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-12T13:02:36.602310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length37
Mean length23.349593
Min length20

Characters and Unicode

Total characters2872
Distinct characters115
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique28 ?
Unique (%)22.8%

Sample

1st row경상북도 영덕군 남정면 장사리 583
2nd row경상북도 영덕군 남정면 장사리 583
3rd row경상북도 영덕군 남정면 장사리 583
4th row경상북도 영덕군 영덕읍 남석리 310-3 영덕군청
5th row경상북도 영덕군 영덕읍 우곡리 322-2
ValueCountFrequency (%)
경상북도 120
18.5%
영덕군 120
18.5%
강구면 58
 
8.9%
금호리 29
 
4.5%
영덕읍 27
 
4.2%
강구리 21
 
3.2%
화수리 14
 
2.2%
축산면 11
 
1.7%
10
 
1.5%
1135-2 10
 
1.5%
Other values (116) 230
35.4%
2023-12-12T13:02:37.204493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
648
22.6%
158
 
5.5%
150
 
5.2%
125
 
4.4%
124
 
4.3%
121
 
4.2%
121
 
4.2%
121
 
4.2%
120
 
4.2%
1 101
 
3.5%
Other values (105) 1083
37.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1678
58.4%
Space Separator 648
 
22.6%
Decimal Number 447
 
15.6%
Dash Punctuation 80
 
2.8%
Close Punctuation 5
 
0.2%
Open Punctuation 5
 
0.2%
Connector Punctuation 5
 
0.2%
Uppercase Letter 3
 
0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
158
 
9.4%
150
 
8.9%
125
 
7.4%
124
 
7.4%
121
 
7.2%
121
 
7.2%
121
 
7.2%
120
 
7.2%
97
 
5.8%
81
 
4.8%
Other values (86) 460
27.4%
Decimal Number
ValueCountFrequency (%)
1 101
22.6%
3 64
14.3%
5 58
13.0%
2 57
12.8%
4 49
11.0%
0 33
 
7.4%
6 29
 
6.5%
8 28
 
6.3%
7 16
 
3.6%
9 12
 
2.7%
Uppercase Letter
ValueCountFrequency (%)
B 1
33.3%
A 1
33.3%
Y 1
33.3%
Space Separator
ValueCountFrequency (%)
648
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 80
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 5
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1678
58.4%
Common 1191
41.5%
Latin 3
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
158
 
9.4%
150
 
8.9%
125
 
7.4%
124
 
7.4%
121
 
7.2%
121
 
7.2%
121
 
7.2%
120
 
7.2%
97
 
5.8%
81
 
4.8%
Other values (86) 460
27.4%
Common
ValueCountFrequency (%)
648
54.4%
1 101
 
8.5%
- 80
 
6.7%
3 64
 
5.4%
5 58
 
4.9%
2 57
 
4.8%
4 49
 
4.1%
0 33
 
2.8%
6 29
 
2.4%
8 28
 
2.4%
Other values (6) 44
 
3.7%
Latin
ValueCountFrequency (%)
B 1
33.3%
A 1
33.3%
Y 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1678
58.4%
ASCII 1194
41.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
648
54.3%
1 101
 
8.5%
- 80
 
6.7%
3 64
 
5.4%
5 58
 
4.9%
2 57
 
4.8%
4 49
 
4.1%
0 33
 
2.8%
6 29
 
2.4%
8 28
 
2.3%
Other values (9) 47
 
3.9%
Hangul
ValueCountFrequency (%)
158
 
9.4%
150
 
8.9%
125
 
7.4%
124
 
7.4%
121
 
7.2%
121
 
7.2%
121
 
7.2%
120
 
7.2%
97
 
5.8%
81
 
4.8%
Other values (86) 460
27.4%
Distinct58
Distinct (%)47.2%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
Minimum1996-01-26 00:00:00
Maximum2022-03-28 00:00:00
2023-12-12T13:02:37.406273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:02:37.894035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

폐기물 종류
Categorical

Distinct32
Distinct (%)26.0%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
그 밖의 폐수처리오니
21 
수산물가공잔재물
19 
폐수처리오니
17 
폐합성수지류(폐염화비닐수지류는 제외한다)
15 
폐콘크리트
Other values (27)
42 

Length

Max length54
Median length49
Mean length11.235772
Min length4

Unique

Unique20 ?
Unique (%)16.3%

Sample

1st row폐수처리오니
2nd row폐수처리오니
3rd row폐수처리오니
4th row동물사체
5th row그 밖의 폐섬유

Common Values

ValueCountFrequency (%)
그 밖의 폐수처리오니 21
17.1%
수산물가공잔재물 19
15.4%
폐수처리오니 17
13.8%
폐합성수지류(폐염화비닐수지류는 제외한다) 15
12.2%
폐콘크리트 9
 
7.3%
하수처리오니 6
 
4.9%
폐합성수지류 4
 
3.3%
사업장폐기물 3
 
2.4%
그 밖의 폐기물 3
 
2.4%
그 밖의 폐목재류 2
 
1.6%
Other values (22) 24
19.5%

Length

2023-12-12T13:02:38.068806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
폐수처리오니 39
15.5%
35
13.9%
밖의 35
13.9%
수산물가공잔재물 19
 
7.5%
폐합성수지류(폐염화비닐수지류는 15
 
6.0%
제외한다 15
 
6.0%
폐콘크리트 9
 
3.6%
하수처리오니 6
 
2.4%
발생한 4
 
1.6%
폐합성수지류 4
 
1.6%
Other values (52) 71
28.2%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-08-25
123 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-08-25
2nd row2023-08-25
3rd row2023-08-25
4th row2023-08-25
5th row2023-08-25

Common Values

ValueCountFrequency (%)
2023-08-25 123
100.0%

Length

2023-12-12T13:02:38.201545image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:02:38.361685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-08-25 123
100.0%

Correlations

2023-12-12T13:02:38.463951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
상호연락처사업장지번주소신고일폐기물 종류
상호1.0000.9991.0001.0000.914
연락처0.9991.0000.9990.9990.922
사업장지번주소1.0000.9991.0001.0000.947
신고일1.0000.9991.0001.0000.966
폐기물 종류0.9140.9220.9470.9661.000

Missing values

2023-12-12T13:02:34.263298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T13:02:34.386240image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

상호연락처사업장지번주소신고일폐기물 종류데이터기준일자
0주식회사 국원건설032-431-6488경상북도 영덕군 남정면 장사리 5832022-03-28폐수처리오니2023-08-25
1주식회사 국원건설032-431-6488경상북도 영덕군 남정면 장사리 5832022-03-28폐수처리오니2023-08-25
2주식회사 국원건설032-431-6488경상북도 영덕군 남정면 장사리 5832022-03-28폐수처리오니2023-08-25
3영덕군청054-730-6182경상북도 영덕군 영덕읍 남석리 310-3 영덕군청2022-03-17동물사체2023-08-25
4노락의료재단 영덕제일요양병원054-733-1771경상북도 영덕군 영덕읍 우곡리 322-22022-02-23그 밖의 폐섬유2023-08-25
5영덕군농업기술센터054-730-6483경상북도 영덕군 영덕읍 구미리 167-1 농업기술센터2021-07-05폐보드류2023-08-25
6야바시라임스톤텍스코리아(주)054-734-5334경상북도 영덕군 영덕읍 남산리 551-42021-01-06폐합성수지류(폐염화비닐수지류는 제외한다)2023-08-25
7(주)대흥토건070-4772-9391충청북도 충주시 중앙탑면 용전리 327 (주)대흥레미콘2020-11-04폐수처리오니2023-08-25
8광명환경(주)054 7326366경상북도 영덕군 영덕읍 화수리 22 외 1필(23번지)2020-09-25폐합성수지류(폐염화비닐수지류는 제외한다)2023-08-25
9(주)동진레미콘 영덕3공장054-293-3736경상북도 영덕군 남정면 부흥리 2642020-08-05폐콘크리트2023-08-25
상호연락처사업장지번주소신고일폐기물 종류데이터기준일자
113명진식품054-733-4004경상북도 영덕군 강구면 금호리 718-62002-09-26수산물가공잔재물2023-08-25
114금호산업054-732-2311경상북도 영덕군 강구면 금호리 1135-41996-03-28사업장폐기물2023-08-25
115금호산업054-732-2311경상북도 영덕군 강구면 금호리 1135-41996-03-28폐수처리오니2023-08-25
116고든통상(주)강구공장054-733-6902경상북도 영덕군 강구면 금호리 1053-12002-03-18수산물가공잔재물2023-08-25
117고든통상(주)강구공장054-733-6902경상북도 영덕군 강구면 금호리 1053-12002-03-18그 밖의 폐수처리오니2023-08-25
118고든통상(주)강구공장054-733-6902경상북도 영덕군 강구면 금호리 1053-12002-03-18수산물가공잔재물2023-08-25
119고든통상(주)강구공장054-733-6902경상북도 영덕군 강구면 금호리 1053-12002-03-18그 밖의 폐수처리오니2023-08-25
120고든통상(주)강구공장054-733-6902경상북도 영덕군 강구면 금호리 1053-12002-03-18폐합성수지류(폐염화비닐수지류는 제외한다)2023-08-25
121(주)신화1공장054-734-4673경상북도 영덕군 강구면 금호리 11351996-03-27그 밖의 폐수처리오니2023-08-25
122(주)신화1공장054-734-4673경상북도 영덕군 강구면 금호리 11351996-03-27수산물가공잔재물2023-08-25

Duplicate rows

Most frequently occurring

상호연락처사업장지번주소신고일폐기물 종류데이터기준일자# duplicates
1(주)미성종합환경054-734-8885경상북도 영덕군 영덕읍 화수리 40-32018-01-25폐합성수지류(폐염화비닐수지류는 제외한다)2023-08-256
7대호수산(주)054-733-2192경상북도 영덕군 강구면 강구리 161996-12-16수산물가공잔재물2023-08-253
9영덕공공하수처리시설054-734-6545경상북도 영덕군 강구면 금호리 8452001-10-31하수처리오니2023-08-253
13영해공공하수처리시설054-733-3367경상북도 영덕군 영해면 연평리 4-42006-10-13하수처리오니2023-08-253
14주식회사 국원건설032-431-6488경상북도 영덕군 남정면 장사리 5832022-03-28폐수처리오니2023-08-253
0(주)대성산업054-734-4247경상북도 영덕군 강구면 강구리 363-252002-09-30폐수처리오니2023-08-252
2(주)세웅수산054-732-8131경상북도 영덕군 강구면 강구리 921998-04-11그 밖의 폐수처리오니2023-08-252
3고든통상(주)강구공장054-733-6902경상북도 영덕군 강구면 금호리 1053-12002-03-18그 밖의 폐수처리오니2023-08-252
4고든통상(주)강구공장054-733-6902경상북도 영덕군 강구면 금호리 1053-12002-03-18수산물가공잔재물2023-08-252
5광명환경(주)054-732-6366경상북도 영덕군 영덕읍 화수리 222019-04-19폐합성수지류(폐염화비닐수지류는 제외한다)2023-08-252