Overview

Dataset statistics

Number of variables4
Number of observations809
Missing cells0
Missing cells (%)0.0%
Duplicate rows61
Duplicate rows (%)7.5%
Total size in memory26.2 KiB
Average record size in memory33.2 B

Variable types

Text3
Numeric1

Dataset

Description전라남도 영암군 사업장폐기물배출자의 신고현황에 대한 데이터로 상호, 사업장도로명주소, 폐기물 종류, 배출량등의 항목을 제공합니다.
Author전라남도 영암군
URLhttps://www.data.go.kr/data/15060900/fileData.do

Alerts

Dataset has 61 (7.5%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 21:59:36.052991
Analysis finished2023-12-12 21:59:36.666051
Duration0.61 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

상호
Text

Distinct299
Distinct (%)37.0%
Missing0
Missing (%)0.0%
Memory size6.4 KiB
2023-12-13T06:59:36.836932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length15
Mean length8.6155748
Min length3

Characters and Unicode

Total characters6970
Distinct characters265
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique122 ?
Unique (%)15.1%

Sample

1st row코스틸산업(주)
2nd row현대인프라솔루션주식회사
3rd row현대인프라솔루션주식회사
4th row(주)제이비앤아이 영암사업장
5th row(주)제이비앤아이 영암사업장
ValueCountFrequency (%)
현대삼호중공업(주 35
 
4.0%
보워터코리아(유 27
 
3.1%
케이씨(주 21
 
2.4%
유)미래환경 17
 
2.0%
현대힘스(주)대불1공장 15
 
1.7%
해군제3함대사령부 13
 
1.5%
주)삼호로커스 11
 
1.3%
영암그린에너지(주 11
 
1.3%
주)엘케이스틸 10
 
1.2%
주)에스엠에스1공장 9
 
1.0%
Other values (307) 697
80.5%
2023-12-13T06:59:37.277669image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
) 719
 
10.3%
( 719
 
10.3%
571
 
8.2%
283
 
4.1%
247
 
3.5%
231
 
3.3%
192
 
2.8%
152
 
2.2%
147
 
2.1%
131
 
1.9%
Other values (255) 3578
51.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5334
76.5%
Close Punctuation 719
 
10.3%
Open Punctuation 719
 
10.3%
Decimal Number 95
 
1.4%
Space Separator 57
 
0.8%
Uppercase Letter 38
 
0.5%
Other Punctuation 4
 
0.1%
Lowercase Letter 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
571
 
10.7%
283
 
5.3%
247
 
4.6%
231
 
4.3%
192
 
3.6%
152
 
2.8%
147
 
2.8%
131
 
2.5%
124
 
2.3%
106
 
2.0%
Other values (234) 3150
59.1%
Uppercase Letter
ValueCountFrequency (%)
K 7
18.4%
S 7
18.4%
G 6
15.8%
E 3
7.9%
N 3
7.9%
M 3
7.9%
C 3
7.9%
Y 2
 
5.3%
D 2
 
5.3%
A 1
 
2.6%
Decimal Number
ValueCountFrequency (%)
1 42
44.2%
3 27
28.4%
2 26
27.4%
Other Punctuation
ValueCountFrequency (%)
. 2
50.0%
& 2
50.0%
Lowercase Letter
ValueCountFrequency (%)
p 2
50.0%
s 2
50.0%
Close Punctuation
ValueCountFrequency (%)
) 719
100.0%
Open Punctuation
ValueCountFrequency (%)
( 719
100.0%
Space Separator
ValueCountFrequency (%)
57
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5334
76.5%
Common 1594
 
22.9%
Latin 42
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
571
 
10.7%
283
 
5.3%
247
 
4.6%
231
 
4.3%
192
 
3.6%
152
 
2.8%
147
 
2.8%
131
 
2.5%
124
 
2.3%
106
 
2.0%
Other values (234) 3150
59.1%
Latin
ValueCountFrequency (%)
K 7
16.7%
S 7
16.7%
G 6
14.3%
E 3
7.1%
N 3
7.1%
M 3
7.1%
C 3
7.1%
p 2
 
4.8%
s 2
 
4.8%
Y 2
 
4.8%
Other values (3) 4
9.5%
Common
ValueCountFrequency (%)
) 719
45.1%
( 719
45.1%
57
 
3.6%
1 42
 
2.6%
3 27
 
1.7%
2 26
 
1.6%
. 2
 
0.1%
& 2
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5334
76.5%
ASCII 1636
 
23.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
) 719
43.9%
( 719
43.9%
57
 
3.5%
1 42
 
2.6%
3 27
 
1.7%
2 26
 
1.6%
K 7
 
0.4%
S 7
 
0.4%
G 6
 
0.4%
E 3
 
0.2%
Other values (11) 23
 
1.4%
Hangul
ValueCountFrequency (%)
571
 
10.7%
283
 
5.3%
247
 
4.6%
231
 
4.3%
192
 
3.6%
152
 
2.8%
147
 
2.8%
131
 
2.5%
124
 
2.3%
106
 
2.0%
Other values (234) 3150
59.1%
Distinct218
Distinct (%)26.9%
Missing0
Missing (%)0.0%
Memory size6.4 KiB
2023-12-13T06:59:37.584103image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length37
Median length35
Mean length19.91471
Min length1

Characters and Unicode

Total characters16111
Distinct characters170
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique88 ?
Unique (%)10.9%

Sample

1st row전라남도 영암군 삼호읍 대불주거1로 192_ 코스틸산업
2nd row전라남도 영암군 삼호읍 용앙로 520_ 지팸중공업(주)
3rd row전라남도 영암군 삼호읍 용앙로 520_ 지팸중공업(주)
4th row전라남도 영암군 삼호읍 자유무역로 194
5th row전라남도 영암군 삼호읍 자유무역로 194
ValueCountFrequency (%)
전라남도 713
19.4%
영암군 713
19.4%
삼호읍 642
17.5%
나불로 76
 
2.1%
대불산단6로 76
 
2.1%
대불산단3로 64
 
1.7%
용앙로 55
 
1.5%
대불로 44
 
1.2%
산단서부로 44
 
1.2%
대불산단1로 37
 
1.0%
Other values (270) 1208
32.9%
2023-12-13T06:59:38.053341image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3061
19.0%
731
 
4.5%
731
 
4.5%
724
 
4.5%
717
 
4.5%
717
 
4.5%
714
 
4.4%
713
 
4.4%
677
 
4.2%
660
 
4.1%
Other values (160) 6666
41.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10498
65.2%
Space Separator 3061
 
19.0%
Decimal Number 2191
 
13.6%
Close Punctuation 121
 
0.8%
Open Punctuation 116
 
0.7%
Uppercase Letter 43
 
0.3%
Connector Punctuation 41
 
0.3%
Dash Punctuation 40
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
731
 
7.0%
731
 
7.0%
724
 
6.9%
717
 
6.8%
717
 
6.8%
714
 
6.8%
713
 
6.8%
677
 
6.4%
660
 
6.3%
649
 
6.2%
Other values (142) 3465
33.0%
Decimal Number
ValueCountFrequency (%)
1 368
16.8%
3 349
15.9%
2 322
14.7%
0 238
10.9%
5 192
8.8%
6 184
8.4%
7 152
6.9%
9 147
 
6.7%
8 133
 
6.1%
4 106
 
4.8%
Uppercase Letter
ValueCountFrequency (%)
C 21
48.8%
K 21
48.8%
A 1
 
2.3%
Space Separator
ValueCountFrequency (%)
3061
100.0%
Close Punctuation
ValueCountFrequency (%)
) 121
100.0%
Open Punctuation
ValueCountFrequency (%)
( 116
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 41
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 40
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 10498
65.2%
Common 5570
34.6%
Latin 43
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
731
 
7.0%
731
 
7.0%
724
 
6.9%
717
 
6.8%
717
 
6.8%
714
 
6.8%
713
 
6.8%
677
 
6.4%
660
 
6.3%
649
 
6.2%
Other values (142) 3465
33.0%
Common
ValueCountFrequency (%)
3061
55.0%
1 368
 
6.6%
3 349
 
6.3%
2 322
 
5.8%
0 238
 
4.3%
5 192
 
3.4%
6 184
 
3.3%
7 152
 
2.7%
9 147
 
2.6%
8 133
 
2.4%
Other values (5) 424
 
7.6%
Latin
ValueCountFrequency (%)
C 21
48.8%
K 21
48.8%
A 1
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10498
65.2%
ASCII 5613
34.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3061
54.5%
1 368
 
6.6%
3 349
 
6.2%
2 322
 
5.7%
0 238
 
4.2%
5 192
 
3.4%
6 184
 
3.3%
7 152
 
2.7%
9 147
 
2.6%
8 133
 
2.4%
Other values (8) 467
 
8.3%
Hangul
ValueCountFrequency (%)
731
 
7.0%
731
 
7.0%
724
 
6.9%
717
 
6.8%
717
 
6.8%
714
 
6.8%
713
 
6.8%
677
 
6.4%
660
 
6.3%
649
 
6.2%
Other values (142) 3465
33.0%
Distinct84
Distinct (%)10.4%
Missing0
Missing (%)0.0%
Memory size6.4 KiB
2023-12-13T06:59:38.287718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length88
Median length54
Mean length14.47466
Min length2

Characters and Unicode

Total characters11710
Distinct characters169
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique33 ?
Unique (%)4.1%

Sample

1st row그 밖의 광재류
2nd row폐합성수지류(폐염화비닐수지류는 제외한다)
3rd row폐토사
4th row그 밖의 분진
5th row폐합성수지류(폐염화비닐수지류는 제외한다)
ValueCountFrequency (%)
제외한다 220
 
10.7%
215
 
10.4%
밖의 215
 
10.4%
폐합성수지류(폐염화비닐수지류는 175
 
8.5%
폐합성수지류 118
 
5.7%
폐금속류 69
 
3.3%
분진 65
 
3.2%
발생되는 43
 
2.1%
것은 42
 
2.0%
소각시설에서 42
 
2.0%
Other values (144) 858
41.6%
2023-12-13T06:59:38.657951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1279
 
10.9%
870
 
7.4%
658
 
5.6%
541
 
4.6%
519
 
4.4%
346
 
3.0%
343
 
2.9%
335
 
2.9%
( 269
 
2.3%
) 269
 
2.3%
Other values (159) 6281
53.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9765
83.4%
Space Separator 1279
 
10.9%
Open Punctuation 269
 
2.3%
Close Punctuation 269
 
2.3%
Connector Punctuation 119
 
1.0%
Decimal Number 8
 
0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
870
 
8.9%
658
 
6.7%
541
 
5.5%
519
 
5.3%
346
 
3.5%
343
 
3.5%
335
 
3.4%
263
 
2.7%
241
 
2.5%
237
 
2.4%
Other values (151) 5412
55.4%
Decimal Number
ValueCountFrequency (%)
2 3
37.5%
3 3
37.5%
1 2
25.0%
Space Separator
ValueCountFrequency (%)
1279
100.0%
Open Punctuation
ValueCountFrequency (%)
( 269
100.0%
Close Punctuation
ValueCountFrequency (%)
) 269
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 119
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9765
83.4%
Common 1945
 
16.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
870
 
8.9%
658
 
6.7%
541
 
5.5%
519
 
5.3%
346
 
3.5%
343
 
3.5%
335
 
3.4%
263
 
2.7%
241
 
2.5%
237
 
2.4%
Other values (151) 5412
55.4%
Common
ValueCountFrequency (%)
1279
65.8%
( 269
 
13.8%
) 269
 
13.8%
_ 119
 
6.1%
2 3
 
0.2%
3 3
 
0.2%
1 2
 
0.1%
. 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9754
83.3%
ASCII 1945
 
16.6%
Compat Jamo 11
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1279
65.8%
( 269
 
13.8%
) 269
 
13.8%
_ 119
 
6.1%
2 3
 
0.2%
3 3
 
0.2%
1 2
 
0.1%
. 1
 
0.1%
Hangul
ValueCountFrequency (%)
870
 
8.9%
658
 
6.7%
541
 
5.5%
519
 
5.3%
346
 
3.5%
343
 
3.5%
335
 
3.4%
263
 
2.7%
241
 
2.5%
237
 
2.4%
Other values (150) 5401
55.4%
Compat Jamo
ValueCountFrequency (%)
11
100.0%

배출량(톤)
Real number (ℝ)

Distinct153
Distinct (%)18.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean985.06675
Minimum0
Maximum150000
Zeros1
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size7.2 KiB
2023-12-13T06:59:38.782617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile10
Q140
median100
Q3240
95-th percentile1768.8
Maximum150000
Range150000
Interquartile range (IQR)200

Descriptive statistics

Standard deviation7747.2337
Coefficient of variation (CV)7.8646789
Kurtosis232.48487
Mean985.06675
Median Absolute Deviation (MAD)70
Skewness14.404818
Sum796919
Variance60019630
MonotonicityNot monotonic
2023-12-13T06:59:38.913607image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
60.0 89
 
11.0%
120.0 77
 
9.5%
30.0 38
 
4.7%
24.0 36
 
4.4%
100.0 34
 
4.2%
300.0 33
 
4.1%
36.0 30
 
3.7%
240.0 25
 
3.1%
200.0 21
 
2.6%
50.0 20
 
2.5%
Other values (143) 406
50.2%
ValueCountFrequency (%)
0.0 1
 
0.1%
0.24 1
 
0.1%
0.264 1
 
0.1%
0.3 1
 
0.1%
1.0 3
0.4%
2.1 1
 
0.1%
3.0 2
 
0.2%
3.6 3
0.4%
4.8 1
 
0.1%
5.0 6
0.7%
ValueCountFrequency (%)
150000.0 1
0.1%
100000.0 1
0.1%
94400.0 1
0.1%
60000.0 1
0.1%
36000.0 1
0.1%
30000.0 1
0.1%
25000.0 2
0.2%
12240.0 1
0.1%
10470.0 1
0.1%
9000.0 1
0.1%

Interactions

2023-12-13T06:59:36.410323image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T06:59:38.996565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
폐기물 종류배출량(톤)
폐기물 종류1.0000.644
배출량(톤)0.6441.000

Missing values

2023-12-13T06:59:36.540196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T06:59:36.628912image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

상호사업장도로명주소폐기물 종류배출량(톤)
0코스틸산업(주)전라남도 영암군 삼호읍 대불주거1로 192_ 코스틸산업그 밖의 광재류20.0
1현대인프라솔루션주식회사전라남도 영암군 삼호읍 용앙로 520_ 지팸중공업(주)폐합성수지류(폐염화비닐수지류는 제외한다)36.0
2현대인프라솔루션주식회사전라남도 영암군 삼호읍 용앙로 520_ 지팸중공업(주)폐토사24.0
3(주)제이비앤아이 영암사업장전라남도 영암군 삼호읍 자유무역로 194그 밖의 분진60.0
4(주)제이비앤아이 영암사업장전라남도 영암군 삼호읍 자유무역로 194폐합성수지류(폐염화비닐수지류는 제외한다)50.0
5(주)케이에스 영암지점전라남도 영암군 삼호읍 나불로3길 8폐아스팔트콘크리트500.0
6(주)에스에이치전라남도 영암군 삼호읍 용앙로 154 (유) 세화산업폐합성수지류(폐염화비닐수지류는 제외한다)40.0
7(유)세븐에프알피조선전라남도 영암군 삼호읍 용앙로 487_ 마린이노테크폐합성수지류(폐염화비닐수지류는 제외한다)24.0
8케이에스조선소전라남도 영암군 삼호읍 삼불1길 31_ 성용엔지니어링폐합성수지류(폐염화비닐수지류는 제외한다)60.0
9영암군청전라남도 영암군 영암읍 군청로 1_ 영암군청폐합성수지류(폐염화비닐수지류는 제외한다)386.01
상호사업장도로명주소폐기물 종류배출량(톤)
799케이씨(주)전라남도 영암군 삼호읍 산단서부로 85_ KC(주)그 밖의 분진120.0
800케이씨(주)전라남도 영암군 삼호읍 산단서부로 85_ KC(주)폐수처리오니900.0
801케이씨(주)전라남도 영암군 삼호읍 산단서부로 85_ KC(주)폐내화물100.0
802케이씨(주)전라남도 영암군 삼호읍 산단서부로 85_ KC(주)금속성폐촉매25.0
803케이씨(주)전라남도 영암군 삼호읍 산단서부로 85_ KC(주)폐합성수지류(폐염화비닐수지류는 제외한다)100.0
804케이씨(주)전라남도 영암군 삼호읍 산단서부로 85_ KC(주)보크사이트잔재물30000.0
805케이씨(주)전라남도 영암군 삼호읍 산단서부로 85_ KC(주)폐수처리오니2500.0
806케이씨(주)전라남도 영암군 삼호읍 산단서부로 85_ KC(주)폐합성수지류(폐염화비닐수지류는 제외한다)100.0
807케이씨(주)전라남도 영암군 삼호읍 산단서부로 85_ KC(주)보크사이트잔재물60000.0
808케이씨(주)전라남도 영암군 삼호읍 산단서부로 85_ KC(주)보크사이트잔재물150000.0

Duplicate rows

Most frequently occurring

상호사업장도로명주소폐기물 종류배출량(톤)# duplicates
6(유)천하환경전라남도 영암군 삼호읍 백야길 29-215폐합성수지류(폐염화비닐수지류는 제외한다)360.04
42영암그린에너지(주)전라남도 영암군 삼호읍 대불산단6로 31폐합성수지류(폐염화비닐수지류는 제외한다)1200.04
57현대힘스(주)대불1공장전라남도 영암군 삼호읍 대불산단3로 20그 밖의 광재류250.04
4(유)미래환경전라남도 영암군 학산면 녹색로 3030폐합성수지류(폐염화비닐수지류는 제외한다)175.03
9(유)현성산업폐사(샌드블라스트 폐사)200.03
11(주)보석산업전라남도 영암군 삼호읍 대불산단6로 37-21폐합성수지류(폐염화비닐수지류는 제외한다)60.03
36보워터코리아(유)전라남도 영암군 삼호읍 나불로 230그 밖의 소각시설 중 바닥재와 비산재가 분리ㆍ배출되지 아니하는 시설에서 발생하는 소각재6000.03
0(유)대원에스피전라남도 영암군 삼호읍 소등로 15분진(대기오염방지시설에서 포집된 것에 한정하되_ 소각시설에서 발생되는 것은 제외한다)30.02
1(유)두성중공업전라남도 영암군 삼호읍 대불산단3로 201폐합성수지류50.02
2(유)미래환경전라남도 영암군 학산면 녹색로 3030폐합성수지류(폐염화비닐수지류는 제외한다)50.02