Overview

Dataset statistics

Number of variables6
Number of observations175
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)0.6%
Total size in memory8.3 KiB
Average record size in memory48.8 B

Variable types

Text4
Categorical1
DateTime1

Dataset

Description강원도 동해시의 사업장폐기물 배출자 신고정보(신고번호, 사업장명, 발생폐기물, 사업장 주소, 신고유형)를 안내해드립니다.
URLhttps://www.data.go.kr/data/15060366/fileData.do

Alerts

신고유형 has constant value ""Constant
기준일 has constant value ""Constant
Dataset has 1 (0.6%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 16:30:49.831220
Analysis finished2023-12-12 16:30:50.540147
Duration0.71 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct89
Distinct (%)50.9%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2023-12-13T01:30:50.755916image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length21
Mean length21
Min length21

Characters and Unicode

Total characters3675
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique57 ?
Unique (%)32.6%

Sample

1st row4210000-31-1995-00030
2nd row4210000-31-1995-00030
3rd row4210000-31-1995-00030
4th row4210000-31-2001-00001
5th row4210000-31-2001-00001
ValueCountFrequency (%)
4210000-31-2002-00005 13
 
7.4%
4210000-31-2001-00003 10
 
5.7%
4210000-31-2016-00001 9
 
5.1%
4210000-31-2009-00018 7
 
4.0%
4210000-31-2002-00004 7
 
4.0%
4210000-31-2009-00009 5
 
2.9%
4210000-31-2001-00009 5
 
2.9%
4210000-31-2004-00001 5
 
2.9%
4210000-31-2007-00016 4
 
2.3%
4210000-31-2003-00003 4
 
2.3%
Other values (79) 106
60.6%
2023-12-13T01:30:51.171175image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 1667
45.4%
- 525
 
14.3%
1 494
 
13.4%
2 403
 
11.0%
3 215
 
5.9%
4 203
 
5.5%
9 45
 
1.2%
5 40
 
1.1%
6 33
 
0.9%
8 29
 
0.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3150
85.7%
Dash Punctuation 525
 
14.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1667
52.9%
1 494
 
15.7%
2 403
 
12.8%
3 215
 
6.8%
4 203
 
6.4%
9 45
 
1.4%
5 40
 
1.3%
6 33
 
1.0%
8 29
 
0.9%
7 21
 
0.7%
Dash Punctuation
ValueCountFrequency (%)
- 525
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3675
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 1667
45.4%
- 525
 
14.3%
1 494
 
13.4%
2 403
 
11.0%
3 215
 
5.9%
4 203
 
5.5%
9 45
 
1.2%
5 40
 
1.1%
6 33
 
0.9%
8 29
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3675
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 1667
45.4%
- 525
 
14.3%
1 494
 
13.4%
2 403
 
11.0%
3 215
 
5.9%
4 203
 
5.5%
9 45
 
1.2%
5 40
 
1.1%
6 33
 
0.9%
8 29
 
0.8%
Distinct87
Distinct (%)49.7%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2023-12-13T01:30:51.418013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length16
Mean length10.171429
Min length2

Characters and Unicode

Total characters1780
Distinct characters179
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique53 ?
Unique (%)30.3%

Sample

1st row쌍용씨앤이(주)동해사업소
2nd row쌍용씨앤이(주)동해사업소
3rd row쌍용씨앤이(주)동해사업소
4th row(주)대신산업
5th row(주)대신산업
ValueCountFrequency (%)
동해공장 25
 
10.0%
쌍용씨앤이(주 13
 
5.2%
한국동서발전(주 10
 
4.0%
동해발전본부 10
 
4.0%
주)지에스동해전력 9
 
3.6%
주식회사 9
 
3.6%
엘에스전선(주 7
 
2.8%
주)삼표시멘트 7
 
2.8%
동해공장(db 5
 
2.0%
metal 5
 
2.0%
Other values (92) 149
59.8%
2023-12-13T01:30:51.772952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
) 135
 
7.6%
( 135
 
7.6%
133
 
7.5%
112
 
6.3%
86
 
4.8%
74
 
4.2%
44
 
2.5%
41
 
2.3%
38
 
2.1%
31
 
1.7%
Other values (169) 951
53.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1393
78.3%
Close Punctuation 135
 
7.6%
Open Punctuation 135
 
7.6%
Space Separator 74
 
4.2%
Uppercase Letter 22
 
1.2%
Lowercase Letter 20
 
1.1%
Decimal Number 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
133
 
9.5%
112
 
8.0%
86
 
6.2%
44
 
3.2%
41
 
2.9%
38
 
2.7%
31
 
2.2%
30
 
2.2%
29
 
2.1%
25
 
1.8%
Other values (153) 824
59.2%
Uppercase Letter
ValueCountFrequency (%)
M 5
22.7%
D 5
22.7%
B 5
22.7%
L 2
 
9.1%
H 2
 
9.1%
F 1
 
4.5%
R 1
 
4.5%
T 1
 
4.5%
Lowercase Letter
ValueCountFrequency (%)
a 5
25.0%
e 5
25.0%
t 5
25.0%
l 5
25.0%
Close Punctuation
ValueCountFrequency (%)
) 135
100.0%
Open Punctuation
ValueCountFrequency (%)
( 135
100.0%
Space Separator
ValueCountFrequency (%)
74
100.0%
Decimal Number
ValueCountFrequency (%)
1 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1393
78.3%
Common 345
 
19.4%
Latin 42
 
2.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
133
 
9.5%
112
 
8.0%
86
 
6.2%
44
 
3.2%
41
 
2.9%
38
 
2.7%
31
 
2.2%
30
 
2.2%
29
 
2.1%
25
 
1.8%
Other values (153) 824
59.2%
Latin
ValueCountFrequency (%)
a 5
11.9%
M 5
11.9%
e 5
11.9%
t 5
11.9%
D 5
11.9%
l 5
11.9%
B 5
11.9%
L 2
 
4.8%
H 2
 
4.8%
F 1
 
2.4%
Other values (2) 2
 
4.8%
Common
ValueCountFrequency (%)
) 135
39.1%
( 135
39.1%
74
21.4%
1 1
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1393
78.3%
ASCII 387
 
21.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
) 135
34.9%
( 135
34.9%
74
19.1%
a 5
 
1.3%
M 5
 
1.3%
e 5
 
1.3%
t 5
 
1.3%
D 5
 
1.3%
l 5
 
1.3%
B 5
 
1.3%
Other values (6) 8
 
2.1%
Hangul
ValueCountFrequency (%)
133
 
9.5%
112
 
8.0%
86
 
6.2%
44
 
3.2%
41
 
2.9%
38
 
2.7%
31
 
2.2%
30
 
2.2%
29
 
2.1%
25
 
1.8%
Other values (153) 824
59.2%
Distinct64
Distinct (%)36.6%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2023-12-13T01:30:52.006582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length47
Median length44
Mean length9.36
Min length2

Characters and Unicode

Total characters1638
Distinct characters144
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique37 ?
Unique (%)21.1%

Sample

1st row자동차 폐타이어
2nd row폐합성고무류
3rd row폐합성수지류(폐염화비닐수지류 제외)
4th row건설오니
5th row폐콘크리트
ValueCountFrequency (%)
29
 
9.1%
밖의 29
 
9.1%
폐콘크리트 18
 
5.7%
제외 16
 
5.0%
수산물가공잔재물 15
 
4.7%
폐수처리오니 14
 
4.4%
폐합성수지류(폐염화비닐수지류 13
 
4.1%
폐목재류 9
 
2.8%
폐합성고무류 8
 
2.5%
폐기물 8
 
2.5%
Other values (85) 159
50.0%
2023-12-13T01:30:52.352803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
146
 
8.9%
143
 
8.7%
76
 
4.6%
69
 
4.2%
64
 
3.9%
51
 
3.1%
44
 
2.7%
42
 
2.6%
42
 
2.6%
37
 
2.3%
Other values (134) 924
56.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1441
88.0%
Space Separator 143
 
8.7%
Close Punctuation 23
 
1.4%
Open Punctuation 23
 
1.4%
Other Punctuation 7
 
0.4%
Connector Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
146
 
10.1%
76
 
5.3%
69
 
4.8%
64
 
4.4%
51
 
3.5%
44
 
3.1%
42
 
2.9%
42
 
2.9%
37
 
2.6%
35
 
2.4%
Other values (128) 835
57.9%
Other Punctuation
ValueCountFrequency (%)
· 5
71.4%
. 2
 
28.6%
Space Separator
ValueCountFrequency (%)
143
100.0%
Close Punctuation
ValueCountFrequency (%)
) 23
100.0%
Open Punctuation
ValueCountFrequency (%)
( 23
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1441
88.0%
Common 197
 
12.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
146
 
10.1%
76
 
5.3%
69
 
4.8%
64
 
4.4%
51
 
3.5%
44
 
3.1%
42
 
2.9%
42
 
2.9%
37
 
2.6%
35
 
2.4%
Other values (128) 835
57.9%
Common
ValueCountFrequency (%)
143
72.6%
) 23
 
11.7%
( 23
 
11.7%
· 5
 
2.5%
. 2
 
1.0%
_ 1
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1441
88.0%
ASCII 192
 
11.7%
None 5
 
0.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
146
 
10.1%
76
 
5.3%
69
 
4.8%
64
 
4.4%
51
 
3.5%
44
 
3.1%
42
 
2.9%
42
 
2.9%
37
 
2.6%
35
 
2.4%
Other values (128) 835
57.9%
ASCII
ValueCountFrequency (%)
143
74.5%
) 23
 
12.0%
( 23
 
12.0%
. 2
 
1.0%
_ 1
 
0.5%
None
ValueCountFrequency (%)
· 5
100.0%
Distinct80
Distinct (%)45.7%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2023-12-13T01:30:52.582561image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length39
Median length35
Mean length26.765714
Min length23

Characters and Unicode

Total characters4684
Distinct characters115
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique49 ?
Unique (%)28.0%

Sample

1st row강원특별자치도 동해시 삼화로 367 쌍용자원개발 (삼화동)
2nd row강원특별자치도 동해시 삼화로 367 쌍용자원개발 (삼화동)
3rd row강원특별자치도 동해시 삼화로 367 쌍용자원개발 (삼화동)
4th row강원특별자치도 동해시 대동로 161-7 (송정동)
5th row강원특별자치도 동해시 대동로 161-7 (송정동)
ValueCountFrequency (%)
동해시 175
19.2%
강원특별자치도 168
18.4%
구호동 55
 
6.0%
송정동 36
 
3.9%
대동로 34
 
3.7%
삼화동 22
 
2.4%
효자로 20
 
2.2%
246 19
 
2.1%
210 14
 
1.5%
36 14
 
1.5%
Other values (129) 356
39.0%
2023-12-13T01:30:52.945642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
738
 
15.8%
420
 
9.0%
205
 
4.4%
199
 
4.2%
179
 
3.8%
) 179
 
3.8%
( 179
 
3.8%
178
 
3.8%
177
 
3.8%
175
 
3.7%
Other values (105) 2055
43.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2997
64.0%
Space Separator 738
 
15.8%
Decimal Number 539
 
11.5%
Close Punctuation 179
 
3.8%
Open Punctuation 179
 
3.8%
Dash Punctuation 27
 
0.6%
Uppercase Letter 18
 
0.4%
Connector Punctuation 7
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
420
14.0%
205
 
6.8%
199
 
6.6%
179
 
6.0%
178
 
5.9%
177
 
5.9%
175
 
5.8%
175
 
5.8%
168
 
5.6%
168
 
5.6%
Other values (88) 953
31.8%
Decimal Number
ValueCountFrequency (%)
1 109
20.2%
2 90
16.7%
6 89
16.5%
4 59
10.9%
0 42
 
7.8%
7 40
 
7.4%
5 35
 
6.5%
3 35
 
6.5%
9 27
 
5.0%
8 13
 
2.4%
Uppercase Letter
ValueCountFrequency (%)
S 9
50.0%
G 9
50.0%
Space Separator
ValueCountFrequency (%)
738
100.0%
Close Punctuation
ValueCountFrequency (%)
) 179
100.0%
Open Punctuation
ValueCountFrequency (%)
( 179
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 27
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2997
64.0%
Common 1669
35.6%
Latin 18
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
420
14.0%
205
 
6.8%
199
 
6.6%
179
 
6.0%
178
 
5.9%
177
 
5.9%
175
 
5.8%
175
 
5.8%
168
 
5.6%
168
 
5.6%
Other values (88) 953
31.8%
Common
ValueCountFrequency (%)
738
44.2%
) 179
 
10.7%
( 179
 
10.7%
1 109
 
6.5%
2 90
 
5.4%
6 89
 
5.3%
4 59
 
3.5%
0 42
 
2.5%
7 40
 
2.4%
5 35
 
2.1%
Other values (5) 109
 
6.5%
Latin
ValueCountFrequency (%)
S 9
50.0%
G 9
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2997
64.0%
ASCII 1687
36.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
738
43.7%
) 179
 
10.6%
( 179
 
10.6%
1 109
 
6.5%
2 90
 
5.3%
6 89
 
5.3%
4 59
 
3.5%
0 42
 
2.5%
7 40
 
2.4%
5 35
 
2.1%
Other values (7) 127
 
7.5%
Hangul
ValueCountFrequency (%)
420
14.0%
205
 
6.8%
199
 
6.6%
179
 
6.0%
178
 
5.9%
177
 
5.9%
175
 
5.8%
175
 
5.8%
168
 
5.6%
168
 
5.6%
Other values (88) 953
31.8%

신고유형
Categorical

CONSTANT 

Distinct1
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
사업장폐기물배출자(2호)
175 

Length

Max length13
Median length13
Mean length13
Min length13

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row사업장폐기물배출자(2호)
2nd row사업장폐기물배출자(2호)
3rd row사업장폐기물배출자(2호)
4th row사업장폐기물배출자(2호)
5th row사업장폐기물배출자(2호)

Common Values

ValueCountFrequency (%)
사업장폐기물배출자(2호) 175
100.0%

Length

2023-12-13T01:30:53.116554image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:30:53.206990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사업장폐기물배출자(2호 175
100.0%

기준일
Date

CONSTANT 

Distinct1
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
Minimum2023-07-03 00:00:00
Maximum2023-07-03 00:00:00
2023-12-13T01:30:53.294680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:30:53.409698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Correlations

2023-12-13T01:30:53.496119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
신고번호사업장명발생폐기물사업장 주소
신고번호1.0001.0000.0001.000
사업장명1.0001.0000.0001.000
발생폐기물0.0000.0001.0000.000
사업장 주소1.0001.0000.0001.000

Missing values

2023-12-13T01:30:50.406865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T01:30:50.502605image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

신고번호사업장명발생폐기물사업장 주소신고유형기준일
04210000-31-1995-00030쌍용씨앤이(주)동해사업소자동차 폐타이어강원특별자치도 동해시 삼화로 367 쌍용자원개발 (삼화동)사업장폐기물배출자(2호)2023-07-03
14210000-31-1995-00030쌍용씨앤이(주)동해사업소폐합성고무류강원특별자치도 동해시 삼화로 367 쌍용자원개발 (삼화동)사업장폐기물배출자(2호)2023-07-03
24210000-31-1995-00030쌍용씨앤이(주)동해사업소폐합성수지류(폐염화비닐수지류 제외)강원특별자치도 동해시 삼화로 367 쌍용자원개발 (삼화동)사업장폐기물배출자(2호)2023-07-03
34210000-31-2001-00001(주)대신산업건설오니강원특별자치도 동해시 대동로 161-7 (송정동)사업장폐기물배출자(2호)2023-07-03
44210000-31-2001-00001(주)대신산업폐콘크리트강원특별자치도 동해시 대동로 161-7 (송정동)사업장폐기물배출자(2호)2023-07-03
54210000-31-2001-00003한국동서발전(주) 동해발전본부그 밖의 동·식물성잔재물강원특별자치도 동해시 공단9로 145 (구호동)사업장폐기물배출자(2호)2023-07-03
64210000-31-2001-00003한국동서발전(주) 동해발전본부그 밖의 연소잔재물강원특별자치도 동해시 공단9로 145 (구호동)사업장폐기물배출자(2호)2023-07-03
74210000-31-2001-00003한국동서발전(주) 동해발전본부그 밖의 폐목재류강원특별자치도 동해시 공단9로 145 (구호동)사업장폐기물배출자(2호)2023-07-03
84210000-31-2001-00003한국동서발전(주) 동해발전본부석탄재강원특별자치도 동해시 공단9로 145 (구호동)사업장폐기물배출자(2호)2023-07-03
94210000-31-2001-00003한국동서발전(주) 동해발전본부폐내화물강원특별자치도 동해시 공단9로 145 (구호동)사업장폐기물배출자(2호)2023-07-03
신고번호사업장명발생폐기물사업장 주소신고유형기준일
1654210000-31-2018-00001삼화공영주식회사그 밖의 무기성오니강원특별자치도 동해시 쉬눔길 24 삼화레미콘 (추암동)사업장폐기물배출자(2호)2023-07-03
1664210000-31-2018-00001삼화공영주식회사폐콘크리트강원특별자치도 동해시 쉬눔길 24 삼화레미콘 (추암동)사업장폐기물배출자(2호)2023-07-03
1674210000-31-2018-00002해군제1수리창샌드블라스트폐사강원특별자치도 동해시 대동로 430 사서함 604-22-2호 (송정동)사업장폐기물배출자(2호)2023-07-03
1684210000-31-2019-00001동해조선FRT폐합성수지류(폐염화비닐수지류 제외)강원특별자치도 동해시 일출로 92-36 가동 (묵호진동)사업장폐기물배출자(2호)2023-07-03
1694210000-31-2019-00002거성산업(주)동해폐활성탄강원특별자치도 동해시 공단8로 98 (주)거성산업 (구호동)사업장폐기물배출자(2호)2023-07-03
1704210000-31-2020-00001(주)안성엔지니어링폐콘크리트강원특별자치도 동해시 대동로 210 (송정동)사업장폐기물배출자(2호)2023-07-03
1714210000-31-2020-00002동해시 상하수도사업소정수처리오니강원특별자치도 동해시 새골길 50 상하수도사업소 (쇄운동)사업장폐기물배출자(2호)2023-07-03
1724210000-31-2021-00001합자회사 농수푸드수산물가공잔재물강원특별자치도 동해시 공단7로 83 농수푸드 (구호동)사업장폐기물배출자(2호)2023-07-03
1734210000-31-2022-00001(주)한화 강원지사폐합성수지류(폐염화비닐수지류 제외)강원특별자치도 동해시 설운길 276 (쇄운동)사업장폐기물배출자(2호)2023-07-03
1744211000-31-2023-00001주식회사 화남코퍼레이션수산물가공잔재물강원특별자치도 동해시 공단1로 177_ 1동 (구호동)사업장폐기물배출자(2호)2023-07-03

Duplicate rows

Most frequently occurring

신고번호사업장명발생폐기물사업장 주소신고유형기준일# duplicates
04210000-31-2009-00033(주)그린빌 동해송정 LH아파트그 밖의 폐기물강원특별자치도 동해시 동해역길 19 (송정동)사업장폐기물배출자(2호)2023-07-032