Overview

Dataset statistics

Number of variables: 5
Number of observations: 254
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 24
Duplicate rows (%): 9.4%
Total size in memory: 10.3 KiB
Average record size in memory: 41.5 B
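
The percentage and the average record size follow from the raw counts. A minimal sketch of that arithmetic, assuming 1 KiB means 1024 bytes; the variable names are illustrative and not taken from the profiling tool:

```python
# Figures copied from the dataset statistics in this report.
n_rows = 254
n_duplicate_rows = 24
total_size_kib = 10.3

# Share of duplicate rows as a percentage of all observations.
duplicate_pct = 100 * n_duplicate_rows / n_rows

# Average record size in bytes, assuming 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_rows

print(f"{duplicate_pct:.1f}%")
print(f"{avg_record_bytes:.1f} B")
```

Both results round to the values reported above (9.4% and 41.5 B).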

Variable types

Text: 2
Categorical: 2
Numeric: 1

Dataset

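Description: Reported status of business waste dischargers in Dong-gu, Daegu Metropolitan City (대구광역시 동구). The dataset includes fields such as business name, waste type, discharge amount, and address.
Author: 대구광역시 동구 (Dong-gu, Daegu Metropolitan City)
URL: https://www.data.go.kr/data/15060399/fileData.do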

Alerts

생활계구분 has constant value "생활계" [Constant]
Dataset has 24 (9.4%) duplicate rows [Duplicates]
배출량(톤) is highly overall correlated with 폐기물 종류 [High correlation]
폐기물 종류 is highly overall correlated with 배출량(톤) [High correlation]

Reproduction

Analysis started: 2024-04-21 01:25:09.499309
Analysis finished: 2024-04-21 01:25:11.319234
Duration: 1.82 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
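
The reported duration is the difference between the two timestamps. A short sketch of that check using only the standard library (the variable names are illustrative):

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
started = datetime.fromisoformat("2024-04-21 01:25:09.499309")
finished = datetime.fromisoformat("2024-04-21 01:25:11.319234")

# Elapsed wall-clock time in seconds.
duration_s = (finished - started).total_seconds()

print(f"{duration_s:.2f} seconds")
```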

Variables

상호
Text

Distinct: 68
Distinct (%): 26.8%
Missing: 0
Missing (%): 0.0%
Memory size: 2.1 KiB

Length

Max length: 23
Median length: 16
Mean length: 9.5866142
Min length: 3

Characters and Unicode

Total characters: 2435
Distinct characters: 182
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
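
As an illustration of such character properties, the sketch below counts characters by Unicode general category with Python's standard `unicodedata` module. This is a stand-in, not necessarily what the profiling tool uses internally, and `unicodedata` does not expose the script or block properties reported here. The helper name `category_counts` is hypothetical.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count characters in `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)


# One 상호 value taken from the sample rows of this report.
sample = "(주)풀무원푸드앤컬쳐 공군제11전투비행단"
counts = category_counts(sample)

# Lo = Other Letter, Ps = Open Punctuation, Pe = Close Punctuation,
# Zs = Space Separator, Nd = Decimal Number.
for code, n in counts.most_common():
    print(code, n)
```

The two-letter codes correspond to the category names in the tables of this section, for example `Lo` for Other Letter and `Nd` for Decimal Number.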

Unique

Unique: 16
Unique (%): 6.3%
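
Distinct counts how many different values occur, while Unique counts the values that occur exactly once. A minimal sketch of the distinction, using the first five 상호 values of this dataset as input; it follows the usual definitions and is assumed, not confirmed, to match the profiling tool's own computation:

```python
from collections import Counter

# First five 상호 values from the dataset sample.
values = [
    "선푸드",
    "어썸마켓",
    "어썸마켓",
    "어썸마켓",
    "(주)풀무원푸드앤컬쳐 공군제11전투비행단",
]

counts = Counter(values)

# Distinct: number of different values.
n_distinct = len(counts)

# Unique: number of values that appear exactly once.
n_unique = sum(1 for c in counts.values() if c == 1)

print(n_distinct, n_unique)
```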

Sample

1st row: 선푸드
2nd row: 어썸마켓
3rd row: 어썸마켓
4th row: 어썸마켓
5th row: (주)풀무원푸드앤컬쳐 공군제11전투비행단

Value | Count | Frequency (%)
주)신세계동대구복합환승센터 | 18 | 5.8%
주)이마트 | 14 | 4.5%
반야월점 | 14 | 4.5%
롯데쇼핑(주)롯데마트대구율하점 | 14 | 4.5%
홈플러스(주 | 13 | 4.2%
주식회사 | 11 | 3.6%
의)열경의료재단 | 7 | 2.3%
대동병원 | 7 | 2.3%
한국맥도날드(유)방촌동점 | 7 | 2.3%
롯데쇼핑(주)롯데아울렛 | 7 | 2.3%
Other values (70) | 196 | 63.6%

Most occurring characters

ValueCountFrequency (%)
( 139
 
5.7%
) 139
 
5.7%
120
 
4.9%
84
 
3.4%
70
 
2.9%
55
 
2.3%
54
 
2.2%
54
 
2.2%
54
 
2.2%
54
 
2.2%
Other values (172) 1612
66.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2097
86.1%
Open Punctuation 139
 
5.7%
Close Punctuation 139
 
5.7%
Space Separator 54
 
2.2%
Decimal Number 4
 
0.2%
Uppercase Letter 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
120
 
5.7%
84
 
4.0%
70
 
3.3%
55
 
2.6%
54
 
2.6%
54
 
2.6%
54
 
2.6%
52
 
2.5%
43
 
2.1%
43
 
2.1%
Other values (166) 1468
70.0%
Uppercase Letter
ValueCountFrequency (%)
Y 1
50.0%
G 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 139
100.0%
Close Punctuation
ValueCountFrequency (%)
) 139
100.0%
Space Separator
ValueCountFrequency (%)
54
100.0%
Decimal Number
ValueCountFrequency (%)
1 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2097
86.1%
Common 336
 
13.8%
Latin 2
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
120
 
5.7%
84
 
4.0%
70
 
3.3%
55
 
2.6%
54
 
2.6%
54
 
2.6%
54
 
2.6%
52
 
2.5%
43
 
2.1%
43
 
2.1%
Other values (166) 1468
70.0%
Common
ValueCountFrequency (%)
( 139
41.4%
) 139
41.4%
54
 
16.1%
1 4
 
1.2%
Latin
ValueCountFrequency (%)
Y 1
50.0%
G 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2097
86.1%
ASCII 338
 
13.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 139
41.1%
) 139
41.1%
54
 
16.0%
1 4
 
1.2%
Y 1
 
0.3%
G 1
 
0.3%
Hangul
ValueCountFrequency (%)
120
 
5.7%
84
 
4.0%
70
 
3.3%
55
 
2.6%
54
 
2.6%
54
 
2.6%
54
 
2.6%
52
 
2.5%
43
 
2.1%
43
 
2.1%
Other values (166) 1468
70.0%

생활계구분
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.1 KiB
생활계: 254

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 생활계
2nd row: 생활계
3rd row: 생활계
4th row: 생활계
5th row: 생활계

Common Values

Value | Count | Frequency (%)
생활계 | 254 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
생활계 254
100.0%

폐기물 종류
Categorical

HIGH CORRELATION 

Distinct: 30
Distinct (%): 11.8%
Missing: 0
Missing (%): 0.0%
Memory size: 2.1 KiB
폐합성수지류(폐염화비닐수지류는 제외한다): 65
음식물류폐기물: 39
폐합성수지류: 25
폐합성고무류: 14
폐종이류: 14
Other values (25): 97

Length

Max length: 84
Median length: 47
Mean length: 16.480315
Min length: 3

Unique

Unique: 11
Unique (%): 4.3%

Sample

1st row: 축산물가공잔재물(동물성 유지류는 제외한다)
2nd row: 폐합성수지류(폐염화비닐수지류는 제외한다)
3rd row: 폐합성수지류(폐염화비닐수지류는 제외한다)
4th row: 음식물류폐기물
5th row: 음식물류폐기물

Common Values

Value | Count | Frequency (%)
폐합성수지류(폐염화비닐수지류는 제외한다) | 65 | 25.6%
음식물류폐기물 | 39 | 15.4%
폐합성수지류 | 25 | 9.8%
폐합성고무류 | 14 | 5.5%
폐종이류 | 14 | 5.5%
폐가구류_ 폐도장목_ 폐목재포장재_ 폐전선드럼(접착제_ 페인트_ 기름_ 콘크리트 등의 물질이 사용된 목재를 말한다) | 14 | 5.5%
폐합성섬유 | 12 | 4.7%
그 밖의 폐목재류 | 8 | 3.1%
폐가구류_ 폐도장목_ 폐목재포장재_ 폐전선드럼(원목상태의 깨끗한 목재를 말한다) | 7 | 2.8%
그 밖의 폐섬유 | 7 | 2.8%
Other values (20) | 49 | 19.3%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
제외한다 75
 
11.5%
폐합성수지류(폐염화비닐수지류는 65
 
10.0%
음식물류폐기물 39
 
6.0%
폐합성수지류 25
 
3.8%
말한다 23
 
3.5%
폐도장목 21
 
3.2%
폐목재포장재 21
 
3.2%
목재를 21
 
3.2%
폐가구류 21
 
3.2%
19
 
2.9%
Other values (63) 322
49.4%

배출량(톤)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 56
Distinct (%): 22.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 110.96535
Minimum: 0.5
Maximum: 1920
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.4 KiB

Quantile statistics

Minimum: 0.5
5-th percentile: 1
Q1: 12
Median: 42
Q3: 83
95-th percentile: 467.5
Maximum: 1920
Range: 1919.5
Interquartile range (IQR): 71

Descriptive statistics

Standard deviation: 220.45503
Coefficient of variation (CV): 1.9867015
Kurtosis: 24.888314
Mean: 110.96535
Median Absolute Deviation (MAD): 30
Skewness: 4.3917159
Sum: 28185.2
Variance: 48600.422
Monotonicity: Not monotonic
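
Several of these figures are related by standard formulas: the mean is the sum divided by the number of observations, the coefficient of variation is the standard deviation divided by the mean, the variance is the squared standard deviation, and the IQR and range are differences of quantiles. A small consistency check on the reported values (names are illustrative; inputs are the rounded figures from this report, so results agree only up to rounding):

```python
# Values copied from the 배출량(톤) statistics in this report.
n = 254
total = 28185.2
std = 220.45503
q1, q3 = 12, 83
minimum, maximum = 0.5, 1920

mean = total / n                 # arithmetic mean
cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance as squared standard deviation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range

print(round(mean, 5), round(cv, 5), round(variance, 2), iqr, value_range)
```

A CV close to 2, together with the large positive skewness, indicates a strongly right-skewed distribution in which a few large dischargers dominate the total.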
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
60.0 | 29 | 11.4%
12.0 | 28 | 11.0%
36.0 | 24 | 9.4%
1.0 | 19 | 7.5%
6.0 | 14 | 5.5%
48.0 | 12 | 4.7%
180.0 | 10 | 3.9%
24.0 | 10 | 3.9%
72.0 | 10 | 3.9%
54.0 | 7 | 2.8%
Other values (46) | 91 | 35.8%

Minimum 10 values

Value | Count | Frequency (%)
0.5 | 1 | 0.4%
0.6 | 4 | 1.6%
0.96 | 1 | 0.4%
1.0 | 19 | 7.5%
1.2 | 1 | 0.4%
2.0 | 1 | 0.4%
3.0 | 5 | 2.0%
3.6 | 1 | 0.4%
3.8 | 1 | 0.4%
4.32 | 1 | 0.4%

Maximum 10 values

Value | Count | Frequency (%)
1920.0 | 1 | 0.4%
1300.0 | 1 | 0.4%
1200.0 | 1 | 0.4%
960.0 | 3 | 1.2%
840.0 | 1 | 0.4%
600.0 | 3 | 1.2%
540.0 | 2 | 0.8%
500.0 | 1 | 0.4%
450.0 | 1 | 0.4%
400.0 | 1 | 0.4%

사업장도로명주소
Text

Distinct: 66
Distinct (%): 26.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.1 KiB

Length

Max length: 41
Median length: 36
Mean length: 24.26378
Min length: 19

Characters and Unicode

Total characters: 6163
Distinct characters: 112
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 12
Unique (%): 4.7%

Sample

1st row: 대구광역시 동구 입석로 97-25 (검사동)
2nd row: 대구광역시 동구 동촌로 137_ 1층 (검사동)
3rd row: 대구광역시 동구 동촌로 137_ 1층 (검사동)
4th row: 대구광역시 동구 동촌로 137_ 1층 (검사동)
5th row: 대구광역시 동구 아양로 352_ 1층 201호 (입석동)

Value | Count | Frequency (%)
대구광역시 | 254 | 19.3%
동구 | 254 | 19.3%
동촌로 | 43 | 3.3%
신암동 | 35 | 2.7%
신서동 | 33 | 2.5%
신천동 | 33 | 2.5%
안심로 | 32 | 2.4%
방촌동 | 28 | 2.1%
효목동 | 23 | 1.7%
검사동 | 22 | 1.7%
Other values (124) | 561 | 42.6%

Most occurring characters

ValueCountFrequency (%)
1064
17.3%
641
 
10.4%
537
 
8.7%
286
 
4.6%
272
 
4.4%
258
 
4.2%
254
 
4.1%
254
 
4.1%
) 253
 
4.1%
( 253
 
4.1%
Other values (102) 2091
33.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3705
60.1%
Space Separator 1064
 
17.3%
Decimal Number 805
 
13.1%
Close Punctuation 253
 
4.1%
Open Punctuation 253
 
4.1%
Connector Punctuation 45
 
0.7%
Dash Punctuation 35
 
0.6%
Math Symbol 2
 
< 0.1%
Uppercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
641
17.3%
537
14.5%
286
 
7.7%
272
 
7.3%
258
 
7.0%
254
 
6.9%
254
 
6.9%
111
 
3.0%
79
 
2.1%
57
 
1.5%
Other values (85) 956
25.8%
Decimal Number
ValueCountFrequency (%)
1 186
23.1%
2 104
12.9%
9 87
10.8%
3 74
 
9.2%
7 70
 
8.7%
5 64
 
8.0%
4 64
 
8.0%
8 63
 
7.8%
6 50
 
6.2%
0 43
 
5.3%
Space Separator
ValueCountFrequency (%)
1064
100.0%
Close Punctuation
ValueCountFrequency (%)
) 253
100.0%
Open Punctuation
ValueCountFrequency (%)
( 253
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 45
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 35
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%
Uppercase Letter
ValueCountFrequency (%)
D 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3705
60.1%
Common 2457
39.9%
Latin 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
641
17.3%
537
14.5%
286
 
7.7%
272
 
7.3%
258
 
7.0%
254
 
6.9%
254
 
6.9%
111
 
3.0%
79
 
2.1%
57
 
1.5%
Other values (85) 956
25.8%
Common
ValueCountFrequency (%)
1064
43.3%
) 253
 
10.3%
( 253
 
10.3%
1 186
 
7.6%
2 104
 
4.2%
9 87
 
3.5%
3 74
 
3.0%
7 70
 
2.8%
5 64
 
2.6%
4 64
 
2.6%
Other values (6) 238
 
9.7%
Latin
ValueCountFrequency (%)
D 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3705
60.1%
ASCII 2458
39.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1064
43.3%
) 253
 
10.3%
( 253
 
10.3%
1 186
 
7.6%
2 104
 
4.2%
9 87
 
3.5%
3 74
 
3.0%
7 70
 
2.8%
5 64
 
2.6%
4 64
 
2.6%
Other values (7) 239
 
9.7%
Hangul
ValueCountFrequency (%)
641
17.3%
537
14.5%
286
 
7.7%
272
 
7.3%
258
 
7.0%
254
 
6.9%
254
 
6.9%
111
 
3.0%
79
 
2.1%
57
 
1.5%
Other values (85) 956
25.8%

Interactions


Correlations

Variable | 상호 | 폐기물 종류 | 배출량(톤) | 사업장도로명주소
상호 | 1.000 | 0.886 | 0.907 | 1.000
폐기물 종류 | 0.886 | 1.000 | 0.825 | 0.893
배출량(톤) | 0.907 | 0.825 | 1.000 | 0.855
사업장도로명주소 | 1.000 | 0.893 | 0.855 | 1.000

Variable | 배출량(톤) | 폐기물 종류
배출량(톤) | 1.000 | 0.502
폐기물 종류 | 0.502 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 상호 | 생활계구분 | 폐기물 종류 | 배출량(톤) | 사업장도로명주소
0 | 선푸드 | 생활계 | 축산물가공잔재물(동물성 유지류는 제외한다) | 1300.0 | 대구광역시 동구 입석로 97-25 (검사동)
1 | 어썸마켓 | 생활계 | 폐합성수지류(폐염화비닐수지류는 제외한다) | 240.0 | 대구광역시 동구 동촌로 137_ 1층 (검사동)
2 | 어썸마켓 | 생활계 | 폐합성수지류(폐염화비닐수지류는 제외한다) | 540.0 | 대구광역시 동구 동촌로 137_ 1층 (검사동)
3 | 어썸마켓 | 생활계 | 음식물류폐기물 | 600.0 | 대구광역시 동구 동촌로 137_ 1층 (검사동)
4 | (주)풀무원푸드앤컬쳐 공군제11전투비행단 | 생활계 | 음식물류폐기물 | 960.0 | 대구광역시 동구 아양로 352_ 1층 201호 (입석동)
5 | 주식회사 우성티오티 | 생활계 | 폐합성수지류(폐염화비닐수지류는 제외한다) | 20.0 | 대구광역시 동구 매여로1길 37-11 (상매동)
6 | 주식회사 우성티오티 | 생활계 | 화학점결폐주물사 | 1200.0 | 대구광역시 동구 매여로1길 37-11 (상매동)
7 | 롯데칠성음료(주)대구영업 | 생활계 | 그 밖의 폐전기전자제품류 | 400.0 | 대구광역시 동구 신암남로24길 81 (신암동)
8 | 주식회사 제이스글로벌 | 생활계 | 폐합성수지류(폐염화비닐수지류는 제외한다) | 360.0 | 대구광역시 동구 동부로26길 6 (신천동)
9 | 주식회사 제이스글로벌 | 생활계 | 음식물류폐기물 | 240.0 | 대구광역시 동구 동부로26길 6 (신천동)

Last rows

Index | 상호 | 생활계구분 | 폐기물 종류 | 배출량(톤) | 사업장도로명주소
244 | 국제오피스텔 | 생활계 | 폐합성수지류 | 72.0 | 대구광역시 동구 동대구로 432 (신천동)
245 | 국제오피스텔 | 생활계 | 폐지류 | 6.0 | 대구광역시 동구 동대구로 432 (신천동)
246 | 영신고등학교 | 생활계 | 폐식용유(식용을 목적으로 식품 재료와 원료를 제조ㆍ조리ㆍ가공하거나 식용유를 유통ㆍ사용 또는 음식물류 폐기물을 처리하는 과정에서 발생하는 기름을 말한다) | 2.0 | 대구광역시 동구 팔공로50길 32 (봉무동)
247 | 영신고등학교 | 생활계 | 폐합성수지류 | 120.0 | 대구광역시 동구 팔공로50길 32 (봉무동)
248 | 영신고등학교 | 생활계 | 음식물류폐기물 | 48.0 | 대구광역시 동구 팔공로50길 32 (봉무동)
249 | 영신고등학교 | 생활계 | 폐합성수지류 | 12.0 | 대구광역시 동구 팔공로50길 32 (봉무동)
250 | 대구파티마병원 | 생활계 | 음식물류폐기물 | 108.0 | 대구광역시 동구 아양로 99 (신암동)
251 | 대구파티마병원 | 생활계 | 폐합성수지류(폐염화비닐수지류는 제외한다) | 300.0 | 대구광역시 동구 아양로 99 (신암동)
252 | 대구파티마병원 | 생활계 | 폐합성수지류(폐염화비닐수지류는 제외한다) | 300.0 | 대구광역시 동구 아양로 99 (신암동)
253 | 대구파티마병원 | 생활계 | 그 밖의 폐섬유 | 90.0 | 대구광역시 동구 아양로 99 (신암동)

Duplicate rows

Most frequently occurring

Index | 상호 | 생활계구분 | 폐기물 종류 | 배출량(톤) | 사업장도로명주소 | # duplicates
0 | (의)열경의료재단 | 생활계 | 폐합성수지류(폐염화비닐수지류는 제외한다) | 54.0 | 대구광역시 동구 화랑로 81 (효목동) | 2
1 | (주)스타하우스 | 생활계 | 폐합성수지류(폐염화비닐수지류는 제외한다) | 60.0 | 대구광역시 동구 동촌로 316 (방촌동) | 2
2 | (주)신세계동대구복합환승센터 | 생활계 | 폐합성고무류 | 36.0 | 대구광역시 동구 동부로 149 (신천동_ 동대구역복합환승센터) | 2
3 | (주)신세계동대구복합환승센터 | 생활계 | 폐합성섬유 | 36.0 | 대구광역시 동구 동부로 149 (신천동_ 동대구역복합환승센터) | 2
4 | (주)이마트 반야월점 | 생활계 | 폐가구류_ 폐도장목_ 폐목재포장재_ 폐전선드럼(원목상태의 깨끗한 목재를 말한다) | 12.0 | 대구광역시 동구 안심로 389-2 (신서동) | 2
5 | (주)이마트 반야월점 | 생활계 | 폐합성수지류 | 204.0 | 대구광역시 동구 안심로 389-2 (신서동) | 2
6 | (주)푸드맘 | 생활계 | 폐합성수지류(폐염화비닐수지류는 제외한다) | 24.0 | 대구광역시 동구 동촌로 391-7 (용계동) | 2
7 | (합)궁전라벤더 | 생활계 | 폐합성수지류(폐염화비닐수지류는 제외한다) | 54.0 | 대구광역시 동구 아양로 52 (신암동) | 2
8 | 강동고등학교 | 생활계 | 폐합성수지류(폐염화비닐수지류는 제외한다) | 72.0 | 대구광역시 동구 금강로 65 (신서동) | 2
9 | 강동중학교 | 생활계 | 폐합성수지류(폐염화비닐수지류는 제외한다) | 60.0 | 대구광역시 동구 동호로 112 (신서동_ 강동중학교) | 2
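
One way to find repeated records like these is to count identical rows. The sketch below uses `collections.Counter` on the last four rows of the dataset sample (indices 250 to 253); whether the profiling tool counts duplicates in exactly this way is an assumption, and the variable names are illustrative.

```python
from collections import Counter

# Rows 250-253 of the dataset, as tuples of
# (상호, 생활계구분, 폐기물 종류, 배출량(톤), 사업장도로명주소).
address = "대구광역시 동구 아양로 99 (신암동)"
rows = [
    ("대구파티마병원", "생활계", "음식물류폐기물", 108.0, address),
    ("대구파티마병원", "생활계", "폐합성수지류(폐염화비닐수지류는 제외한다)", 300.0, address),
    ("대구파티마병원", "생활계", "폐합성수지류(폐염화비닐수지류는 제외한다)", 300.0, address),
    ("대구파티마병원", "생활계", "그 밖의 폐섬유", 90.0, address),
]

# Count how often each complete row occurs.
row_counts = Counter(rows)

# Keep only the rows that occur more than once.
duplicates = {row: n for row, n in row_counts.items() if n > 1}

for row, n in duplicates.items():
    print(n, row[0], row[2], row[3])
```

Since the same business can legitimately report the same waste type and amount more than once, identical rows are worth reviewing before they are dropped.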