Overview

Dataset statistics

Number of variables7
Number of observations175
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory10.0 KiB
Average record size in memory58.8 B

Variable types

Numeric2
Text2
Categorical2
DateTime1

Dataset

Description폐기물관리법 제17조에 따른 충청북도 보은군 사업장폐기물배출자 현황에 따른 데이터로 상호명, 주소, 배출종류 등에 관한 내용입니다.
Author충청북도 보은군
URLhttps://www.data.go.kr/data/15060747/fileData.do

Alerts

데이터기준일 has constant value ""Constant
배출량(톤) is highly overall correlated with 폐기물 종류High correlation
폐기물 종류 is highly overall correlated with 배출량(톤)High correlation
구분 is highly imbalanced (52.2%)Imbalance
구 분 has unique valuesUnique

Reproduction

Analysis started2023-12-12 08:46:32.653519
Analysis finished2023-12-12 08:46:34.052755
Duration1.4 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구 분
Real number (ℝ)

UNIQUE 

Distinct175
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean88
Minimum1
Maximum175
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2023-12-12T17:46:34.193453image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile9.7
Q144.5
median88
Q3131.5
95-th percentile166.3
Maximum175
Range174
Interquartile range (IQR)87

Descriptive statistics

Standard deviation50.662281
Coefficient of variation (CV)0.57570773
Kurtosis-1.2
Mean88
Median Absolute Deviation (MAD)44
Skewness0
Sum15400
Variance2566.6667
MonotonicityStrictly increasing
2023-12-12T17:46:34.353326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.6%
2 1
 
0.6%
113 1
 
0.6%
114 1
 
0.6%
115 1
 
0.6%
116 1
 
0.6%
117 1
 
0.6%
118 1
 
0.6%
119 1
 
0.6%
120 1
 
0.6%
Other values (165) 165
94.3%
ValueCountFrequency (%)
1 1
0.6%
2 1
0.6%
3 1
0.6%
4 1
0.6%
5 1
0.6%
6 1
0.6%
7 1
0.6%
8 1
0.6%
9 1
0.6%
10 1
0.6%
ValueCountFrequency (%)
175 1
0.6%
174 1
0.6%
173 1
0.6%
172 1
0.6%
171 1
0.6%
170 1
0.6%
169 1
0.6%
168 1
0.6%
167 1
0.6%
166 1
0.6%

상호
Text

Distinct59
Distinct (%)33.7%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2023-12-12T17:46:34.570341image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length14
Mean length11.074286
Min length2

Characters and Unicode

Total characters1938
Distinct characters155
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique26 ?
Unique (%)14.9%

Sample

1st row보은농협미곡처리장
2nd row남보은농협미곡종합처리장
3rd row(주)테크윈보은2지점
4th row한국도로공사보은지사
5th row주식회사 월드피씨
ValueCountFrequency (%)
주)한화 22
 
8.8%
보은사업장 22
 
8.8%
화약/방산 22
 
8.8%
주식회사 15
 
6.0%
주)이킴 12
 
4.8%
주)테크로스환경서비스[속리산사업소 10
 
4.0%
주)우진플라임 10
 
4.0%
주)한국카본보은공장 8
 
3.2%
주)테크로스환경서비스[보은사업소 7
 
2.8%
주)대광주철 7
 
2.8%
Other values (55) 115
46.0%
2023-12-12T17:46:35.021931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
170
 
8.8%
( 141
 
7.3%
) 141
 
7.3%
75
 
3.9%
68
 
3.5%
67
 
3.5%
62
 
3.2%
54
 
2.8%
53
 
2.7%
51
 
2.6%
Other values (145) 1056
54.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1502
77.5%
Open Punctuation 164
 
8.5%
Close Punctuation 164
 
8.5%
Space Separator 75
 
3.9%
Other Punctuation 23
 
1.2%
Uppercase Letter 6
 
0.3%
Decimal Number 3
 
0.2%
Other Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
170
 
11.3%
68
 
4.5%
67
 
4.5%
62
 
4.1%
54
 
3.6%
53
 
3.5%
51
 
3.4%
48
 
3.2%
47
 
3.1%
37
 
2.5%
Other values (129) 845
56.3%
Uppercase Letter
ValueCountFrequency (%)
L 1
16.7%
B 1
16.7%
H 1
16.7%
N 1
16.7%
U 1
16.7%
S 1
16.7%
Open Punctuation
ValueCountFrequency (%)
( 141
86.0%
[ 23
 
14.0%
Close Punctuation
ValueCountFrequency (%)
) 141
86.0%
] 23
 
14.0%
Other Punctuation
ValueCountFrequency (%)
/ 22
95.7%
& 1
 
4.3%
Decimal Number
ValueCountFrequency (%)
2 2
66.7%
1 1
33.3%
Space Separator
ValueCountFrequency (%)
75
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1503
77.6%
Common 429
 
22.1%
Latin 6
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
170
 
11.3%
68
 
4.5%
67
 
4.5%
62
 
4.1%
54
 
3.6%
53
 
3.5%
51
 
3.4%
48
 
3.2%
47
 
3.1%
37
 
2.5%
Other values (130) 846
56.3%
Common
ValueCountFrequency (%)
( 141
32.9%
) 141
32.9%
75
17.5%
] 23
 
5.4%
[ 23
 
5.4%
/ 22
 
5.1%
2 2
 
0.5%
1 1
 
0.2%
& 1
 
0.2%
Latin
ValueCountFrequency (%)
L 1
16.7%
B 1
16.7%
H 1
16.7%
N 1
16.7%
U 1
16.7%
S 1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1502
77.5%
ASCII 435
 
22.4%
None 1
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
170
 
11.3%
68
 
4.5%
67
 
4.5%
62
 
4.1%
54
 
3.6%
53
 
3.5%
51
 
3.4%
48
 
3.2%
47
 
3.1%
37
 
2.5%
Other values (129) 845
56.3%
ASCII
ValueCountFrequency (%)
( 141
32.4%
) 141
32.4%
75
17.2%
] 23
 
5.3%
[ 23
 
5.3%
/ 22
 
5.1%
2 2
 
0.5%
L 1
 
0.2%
B 1
 
0.2%
H 1
 
0.2%
Other values (5) 5
 
1.1%
None
ValueCountFrequency (%)
1
100.0%
Distinct56
Distinct (%)32.0%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2023-12-12T17:46:35.295393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length39
Median length35
Mean length24.977143
Min length19

Characters and Unicode

Total characters4371
Distinct characters117
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)14.3%

Sample

1st row충청북도 보은군 장안면 장안로 10-26
2nd row충청북도 보은군 탄부면 하장1길 7_ 남보은농협탄부미곡종합처리장
3rd row충청북도 보은군 삼승면 남부로 3749-6_ 공장
4th row충청북도 보은군 보은읍 남부로 4045_ 한국도로공사
5th row충청북도 보은군 삼승면 남부로 3790-59 (주)월드피시
ValueCountFrequency (%)
충청북도 175
18.6%
보은군 175
18.6%
남부로 53
 
5.6%
삼승면 50
 
5.3%
보은읍 43
 
4.6%
장안면 30
 
3.2%
내북면 28
 
3.0%
회인내북로 22
 
2.3%
857 22
 
2.3%
매화구인로 22
 
2.3%
Other values (96) 319
34.0%
2023-12-12T17:46:35.678847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
769
 
17.6%
230
 
5.3%
227
 
5.2%
224
 
5.1%
180
 
4.1%
177
 
4.0%
175
 
4.0%
175
 
4.0%
143
 
3.3%
132
 
3.0%
Other values (107) 1939
44.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2695
61.7%
Space Separator 769
 
17.6%
Decimal Number 720
 
16.5%
Dash Punctuation 83
 
1.9%
Open Punctuation 37
 
0.8%
Close Punctuation 37
 
0.8%
Connector Punctuation 27
 
0.6%
Uppercase Letter 2
 
< 0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
230
 
8.5%
227
 
8.4%
224
 
8.3%
180
 
6.7%
177
 
6.6%
175
 
6.5%
175
 
6.5%
143
 
5.3%
132
 
4.9%
56
 
2.1%
Other values (89) 976
36.2%
Decimal Number
ValueCountFrequency (%)
3 118
16.4%
7 95
13.2%
2 93
12.9%
5 78
10.8%
1 72
10.0%
0 66
9.2%
6 65
9.0%
8 63
8.8%
4 44
 
6.1%
9 26
 
3.6%
Uppercase Letter
ValueCountFrequency (%)
C 1
50.0%
F 1
50.0%
Space Separator
ValueCountFrequency (%)
769
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 83
100.0%
Open Punctuation
ValueCountFrequency (%)
( 37
100.0%
Close Punctuation
ValueCountFrequency (%)
) 37
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 27
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2695
61.7%
Common 1674
38.3%
Latin 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
230
 
8.5%
227
 
8.4%
224
 
8.3%
180
 
6.7%
177
 
6.6%
175
 
6.5%
175
 
6.5%
143
 
5.3%
132
 
4.9%
56
 
2.1%
Other values (89) 976
36.2%
Common
ValueCountFrequency (%)
769
45.9%
3 118
 
7.0%
7 95
 
5.7%
2 93
 
5.6%
- 83
 
5.0%
5 78
 
4.7%
1 72
 
4.3%
0 66
 
3.9%
6 65
 
3.9%
8 63
 
3.8%
Other values (6) 172
 
10.3%
Latin
ValueCountFrequency (%)
C 1
50.0%
F 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2695
61.7%
ASCII 1676
38.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
769
45.9%
3 118
 
7.0%
7 95
 
5.7%
2 93
 
5.5%
- 83
 
5.0%
5 78
 
4.7%
1 72
 
4.3%
0 66
 
3.9%
6 65
 
3.9%
8 63
 
3.8%
Other values (8) 174
 
10.4%
Hangul
ValueCountFrequency (%)
230
 
8.5%
227
 
8.4%
224
 
8.3%
180
 
6.7%
177
 
6.6%
175
 
6.5%
175
 
6.5%
143
 
5.3%
132
 
4.9%
56
 
2.1%
Other values (89) 976
36.2%

구분
Categorical

IMBALANCE 

Distinct2
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
배출시설계
157 
비배출시설계
18 

Length

Max length6
Median length5
Mean length5.1028571
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row배출시설계
2nd row배출시설계
3rd row배출시설계
4th row배출시설계
5th row배출시설계

Common Values

ValueCountFrequency (%)
배출시설계 157
89.7%
비배출시설계 18
 
10.3%

Length

2023-12-12T17:46:35.812939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:46:35.927989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
배출시설계 157
89.7%
비배출시설계 18
 
10.3%

폐기물 종류
Categorical

HIGH CORRELATION 

Distinct36
Distinct (%)20.6%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
폐합성수지류(폐염화비닐수지류는 제외한다)
45 
하수처리오니
16 
그 밖의 폐수처리오니
15 
그 밖의 식물성잔재물
14 
폐합성수지류
Other values (31)
77 

Length

Max length84
Median length49
Mean length14.325714
Min length2

Unique

Unique13 ?
Unique (%)7.4%

Sample

1st row왕겨
2nd row왕겨
3rd row폐합성수지류(폐염화비닐수지류는 제외한다)
4th row폐토사
5th row폐콘크리트

Common Values

ValueCountFrequency (%)
폐합성수지류(폐염화비닐수지류는 제외한다) 45
25.7%
하수처리오니 16
 
9.1%
그 밖의 폐수처리오니 15
 
8.6%
그 밖의 식물성잔재물 14
 
8.0%
폐합성수지류 8
 
4.6%
그 밖의 폐기물 7
 
4.0%
폐콘크리트 6
 
3.4%
폐가구류_ 폐도장목_ 폐목재포장재_ 폐전선드럼(원목상태의 깨끗한 목재를 말한다) 6
 
3.4%
그 밖의 분진 6
 
3.4%
폐합성섬유 5
 
2.9%
Other values (26) 47
26.9%

Length

2023-12-12T17:46:36.113661image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
49
 
11.6%
밖의 49
 
11.6%
폐합성수지류(폐염화비닐수지류는 45
 
10.6%
제외한다 45
 
10.6%
폐수처리오니 19
 
4.5%
하수처리오니 16
 
3.8%
식물성잔재물 14
 
3.3%
말한다 11
 
2.6%
폐합성수지류 8
 
1.9%
폐기물 7
 
1.7%
Other values (63) 161
38.0%

배출량(톤)
Real number (ℝ)

HIGH CORRELATION 

Distinct50
Distinct (%)28.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean326.58126
Minimum0.4
Maximum3500
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2023-12-12T17:46:36.293066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.4
5-th percentile10
Q149
median96
Q3360
95-th percentile1200
Maximum3500
Range3499.6
Interquartile range (IQR)311

Descriptive statistics

Standard deviation518.34015
Coefficient of variation (CV)1.5871705
Kurtosis9.4713321
Mean326.58126
Median Absolute Deviation (MAD)76
Skewness2.6841779
Sum57151.72
Variance268676.51
MonotonicityNot monotonic
2023-12-12T17:46:36.474097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
50.0 17
 
9.7%
60.0 17
 
9.7%
120.0 13
 
7.4%
600.0 11
 
6.3%
1200.0 10
 
5.7%
100.0 9
 
5.1%
12.0 8
 
4.6%
30.0 7
 
4.0%
360.0 6
 
3.4%
20.0 6
 
3.4%
Other values (40) 71
40.6%
ValueCountFrequency (%)
0.4 1
 
0.6%
1.0 1
 
0.6%
1.2 1
 
0.6%
4.0 1
 
0.6%
5.0 2
 
1.1%
6.0 1
 
0.6%
8.4 1
 
0.6%
10.0 2
 
1.1%
12.0 8
4.6%
18.0 2
 
1.1%
ValueCountFrequency (%)
3500.0 1
 
0.6%
2400.0 1
 
0.6%
2000.0 2
 
1.1%
1500.0 4
 
2.3%
1200.0 10
5.7%
1140.0 2
 
1.1%
1080.0 2
 
1.1%
1000.0 2
 
1.1%
960.0 2
 
1.1%
600.0 11
6.3%

데이터기준일
Date

CONSTANT 

Distinct1
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
Minimum2022-09-14 00:00:00
Maximum2022-09-14 00:00:00
2023-12-12T17:46:36.627139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:46:36.762137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-12T17:46:33.293009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:46:33.081512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:46:33.401830image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:46:33.176843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T17:46:36.854209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구 분상호사업장도로명주소구분폐기물 종류배출량(톤)
구 분1.0000.9900.9840.4670.7530.473
상호0.9901.0001.0000.9690.9040.727
사업장도로명주소0.9841.0001.0000.9840.8930.578
구분0.4670.9690.9841.0000.4730.000
폐기물 종류0.7530.9040.8930.4731.0000.915
배출량(톤)0.4730.7270.5780.0000.9151.000
2023-12-12T17:46:36.997615image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분폐기물 종류
구분1.0000.337
폐기물 종류0.3371.000
2023-12-12T17:46:37.112813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구 분배출량(톤)구분폐기물 종류
구 분1.000-0.1560.3490.341
배출량(톤)-0.1561.0000.0000.594
구분0.3490.0001.0000.337
폐기물 종류0.3410.5940.3371.000

Missing values

2023-12-12T17:46:33.856648image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:46:34.004893image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구 분상호사업장도로명주소구분폐기물 종류배출량(톤)데이터기준일
01보은농협미곡처리장충청북도 보은군 장안면 장안로 10-26배출시설계왕겨1080.02022-09-14
12남보은농협미곡종합처리장충청북도 보은군 탄부면 하장1길 7_ 남보은농협탄부미곡종합처리장배출시설계왕겨1200.02022-09-14
23(주)테크윈보은2지점충청북도 보은군 삼승면 남부로 3749-6_ 공장배출시설계폐합성수지류(폐염화비닐수지류는 제외한다)12.02022-09-14
34한국도로공사보은지사충청북도 보은군 보은읍 남부로 4045_ 한국도로공사배출시설계폐토사204.02022-09-14
45주식회사 월드피씨충청북도 보은군 삼승면 남부로 3790-59 (주)월드피시배출시설계폐콘크리트360.02022-09-14
56BH케미칼충청북도 보은군 보은읍 보청대로 1469-22_ 공장배출시설계폐합성수지류(폐염화비닐수지류는 제외한다)120.02022-09-14
67(주)SUN&L 보은공장충청북도 보은군 보은읍 보은미원로 156_ 공장배출시설계폐합성수지류(폐염화비닐수지류는 제외한다)30.02022-09-14
78(주)한국신소재보은공장충청북도 보은군 삼승면 남부로 3750-273 (주) 한국 신소재배출시설계폐합성수지류(폐염화비닐수지류는 제외한다)1200.02022-09-14
89(주)한국신소재보은공장충청북도 보은군 삼승면 남부로 3750-273 (주) 한국 신소재배출시설계폐합성섬유1200.02022-09-14
910(주)한국신소재보은공장충청북도 보은군 삼승면 남부로 3750-273 (주) 한국 신소재배출시설계폐가구류_ 폐도장목_ 폐목재포장재_ 폐전선드럼(원목상태의 깨끗한 목재를 말한다)360.02022-09-14
구 분상호사업장도로명주소구분폐기물 종류배출량(톤)데이터기준일
165166(주)이킴충청북도 보은군 보은읍 금굴4길 35배출시설계그 밖의 식물성잔재물1140.02022-09-14
166167(주)이킴충청북도 보은군 보은읍 금굴4길 35배출시설계폐합성수지류(폐염화비닐수지류는 제외한다)120.02022-09-14
167168(주)이킴충청북도 보은군 보은읍 금굴4길 35배출시설계그 밖의 폐수처리오니12.02022-09-14
168169(주)대광주철충청북도 보은군 산외면 산외로 126배출시설계그 밖의 분진270.02022-09-14
169170(주)대광주철충청북도 보은군 산외면 산외로 126배출시설계그 밖의 광재류500.02022-09-14
170171(주)대광주철충청북도 보은군 산외면 산외로 126배출시설계그 밖의 분진270.02022-09-14
171172(주)대광주철충청북도 보은군 산외면 산외로 126배출시설계그 밖의 광재류100.02022-09-14
172173(주)대광주철충청북도 보은군 산외면 산외로 126배출시설계화학점결폐주물사1500.02022-09-14
173174(주)대광주철충청북도 보은군 산외면 산외로 126배출시설계폐합성수지류(폐염화비닐수지류는 제외한다)60.02022-09-14
174175(주)대광주철충청북도 보은군 산외면 산외로 126배출시설계화학점결폐주물사1500.02022-09-14