Overview

Dataset statistics

Number of variables5
Number of observations166
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.9 KiB
Average record size in memory42.8 B

Variable types

Numeric2
Text2
Categorical1

Dataset

Description대전광역시 동구 사업장폐기물 배출자신고현황에 관한 데이터로 업체명, 사업장도로명주소, 폐기물 종류, 배출량(톤) 등에 관한 내용을 포함하고 있습니다.
Author대전광역시 동구
URLhttps://www.data.go.kr/data/15081140/fileData.do

Alerts

연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 10:44:41.271514
Analysis finished2023-12-12 10:44:42.645249
Duration1.37 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct166
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean83.5
Minimum1
Maximum166
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.6 KiB
2023-12-12T19:44:42.782501image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile9.25
Q142.25
median83.5
Q3124.75
95-th percentile157.75
Maximum166
Range165
Interquartile range (IQR)82.5

Descriptive statistics

Standard deviation48.064193
Coefficient of variation (CV)0.57561908
Kurtosis-1.2
Mean83.5
Median Absolute Deviation (MAD)41.5
Skewness0
Sum13861
Variance2310.1667
MonotonicityStrictly increasing
2023-12-12T19:44:43.055197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.6%
106 1
 
0.6%
108 1
 
0.6%
109 1
 
0.6%
110 1
 
0.6%
111 1
 
0.6%
112 1
 
0.6%
113 1
 
0.6%
114 1
 
0.6%
115 1
 
0.6%
Other values (156) 156
94.0%
ValueCountFrequency (%)
1 1
0.6%
2 1
0.6%
3 1
0.6%
4 1
0.6%
5 1
0.6%
6 1
0.6%
7 1
0.6%
8 1
0.6%
9 1
0.6%
10 1
0.6%
ValueCountFrequency (%)
166 1
0.6%
165 1
0.6%
164 1
0.6%
163 1
0.6%
162 1
0.6%
161 1
0.6%
160 1
0.6%
159 1
0.6%
158 1
0.6%
157 1
0.6%

상호
Text

Distinct54
Distinct (%)32.5%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2023-12-12T19:44:43.380799image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length14
Mean length8.1686747
Min length3

Characters and Unicode

Total characters1356
Distinct characters140
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique24 ?
Unique (%)14.5%

Sample

1st row우송정보대학 산학협력단
2nd row전진상사
3rd row전진상사
4th row전진상사
5th row전진상사
ValueCountFrequency (%)
새빛기업(주 15
 
7.5%
한국전력공사 11
 
5.5%
대전세종충남본부 10
 
5.0%
국보환경(주 9
 
4.5%
레노텍(주 8
 
4.0%
우송대학교 7
 
3.5%
㈜이마트 7
 
3.5%
대전터미널점 7
 
3.5%
대전대학교 7
 
3.5%
홈플러스(주)대전가오점 7
 
3.5%
Other values (48) 111
55.8%
2023-12-12T19:44:43.859062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
90
 
6.6%
82
 
6.0%
( 63
 
4.6%
) 63
 
4.6%
60
 
4.4%
34
 
2.5%
33
 
2.4%
32
 
2.4%
32
 
2.4%
31
 
2.3%
Other values (130) 836
61.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1165
85.9%
Open Punctuation 63
 
4.6%
Close Punctuation 63
 
4.6%
Space Separator 33
 
2.4%
Other Symbol 31
 
2.3%
Decimal Number 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
90
 
7.7%
82
 
7.0%
60
 
5.2%
34
 
2.9%
32
 
2.7%
32
 
2.7%
30
 
2.6%
26
 
2.2%
25
 
2.1%
25
 
2.1%
Other values (125) 729
62.6%
Open Punctuation
ValueCountFrequency (%)
( 63
100.0%
Close Punctuation
ValueCountFrequency (%)
) 63
100.0%
Space Separator
ValueCountFrequency (%)
33
100.0%
Other Symbol
ValueCountFrequency (%)
31
100.0%
Decimal Number
ValueCountFrequency (%)
2 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1196
88.2%
Common 160
 
11.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
90
 
7.5%
82
 
6.9%
60
 
5.0%
34
 
2.8%
32
 
2.7%
32
 
2.7%
31
 
2.6%
30
 
2.5%
26
 
2.2%
25
 
2.1%
Other values (126) 754
63.0%
Common
ValueCountFrequency (%)
( 63
39.4%
) 63
39.4%
33
20.6%
2 1
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1165
85.9%
ASCII 160
 
11.8%
None 31
 
2.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
90
 
7.7%
82
 
7.0%
60
 
5.2%
34
 
2.9%
32
 
2.7%
32
 
2.7%
30
 
2.6%
26
 
2.2%
25
 
2.1%
25
 
2.1%
Other values (125) 729
62.6%
ASCII
ValueCountFrequency (%)
( 63
39.4%
) 63
39.4%
33
20.6%
2 1
 
0.6%
None
ValueCountFrequency (%)
31
100.0%
Distinct52
Distinct (%)31.3%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2023-12-12T19:44:44.185174image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length49
Median length40
Mean length28.813253
Min length20

Characters and Unicode

Total characters4783
Distinct characters145
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique23 ?
Unique (%)13.9%

Sample

1st row대전광역시 동구 동대전로131번길 53_ 우송고등학교_우송중학교 1층 (자양동)
2nd row대전광역시 동구 산서로 1651 (대별동)
3rd row대전광역시 동구 산서로 1651 (대별동)
4th row대전광역시 동구 산서로 1651 (대별동)
5th row대전광역시 동구 산서로 1651 (대별동)
ValueCountFrequency (%)
대전광역시 159
 
17.2%
동구 140
 
15.2%
용전동 25
 
2.7%
대전로 20
 
2.2%
가오동 20
 
2.2%
하소동 19
 
2.1%
산내로450번길 17
 
1.8%
대성동 16
 
1.7%
277-20 15
 
1.6%
자양동 15
 
1.6%
Other values (155) 478
51.7%
2023-12-12T19:44:44.653923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
759
 
15.9%
357
 
7.5%
299
 
6.3%
255
 
5.3%
177
 
3.7%
1 174
 
3.6%
( 174
 
3.6%
) 174
 
3.6%
171
 
3.6%
167
 
3.5%
Other values (135) 2076
43.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2851
59.6%
Space Separator 759
 
15.9%
Decimal Number 713
 
14.9%
Open Punctuation 174
 
3.6%
Close Punctuation 174
 
3.6%
Connector Punctuation 73
 
1.5%
Dash Punctuation 31
 
0.6%
Uppercase Letter 8
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
357
 
12.5%
299
 
10.5%
255
 
8.9%
177
 
6.2%
171
 
6.0%
167
 
5.9%
166
 
5.8%
160
 
5.6%
68
 
2.4%
67
 
2.4%
Other values (119) 964
33.8%
Decimal Number
ValueCountFrequency (%)
1 174
24.4%
2 109
15.3%
7 81
11.4%
0 74
10.4%
6 66
 
9.3%
5 51
 
7.2%
4 45
 
6.3%
3 41
 
5.8%
8 39
 
5.5%
9 33
 
4.6%
Space Separator
ValueCountFrequency (%)
759
100.0%
Open Punctuation
ValueCountFrequency (%)
( 174
100.0%
Close Punctuation
ValueCountFrequency (%)
) 174
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 73
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 31
100.0%
Uppercase Letter
ValueCountFrequency (%)
A 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2851
59.6%
Common 1924
40.2%
Latin 8
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
357
 
12.5%
299
 
10.5%
255
 
8.9%
177
 
6.2%
171
 
6.0%
167
 
5.9%
166
 
5.8%
160
 
5.6%
68
 
2.4%
67
 
2.4%
Other values (119) 964
33.8%
Common
ValueCountFrequency (%)
759
39.4%
1 174
 
9.0%
( 174
 
9.0%
) 174
 
9.0%
2 109
 
5.7%
7 81
 
4.2%
0 74
 
3.8%
_ 73
 
3.8%
6 66
 
3.4%
5 51
 
2.7%
Other values (5) 189
 
9.8%
Latin
ValueCountFrequency (%)
A 8
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2851
59.6%
ASCII 1932
40.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
759
39.3%
1 174
 
9.0%
( 174
 
9.0%
) 174
 
9.0%
2 109
 
5.6%
7 81
 
4.2%
0 74
 
3.8%
_ 73
 
3.8%
6 66
 
3.4%
5 51
 
2.6%
Other values (6) 197
 
10.2%
Hangul
ValueCountFrequency (%)
357
 
12.5%
299
 
10.5%
255
 
8.9%
177
 
6.2%
171
 
6.0%
167
 
5.9%
166
 
5.8%
160
 
5.6%
68
 
2.4%
67
 
2.4%
Other values (119) 964
33.8%

폐기물 종류
Categorical

Distinct27
Distinct (%)16.3%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
폐합성수지류(폐염화비닐수지류는 제외한다)
45 
그 밖의 폐기물
19 
폐유리
18 
그 밖의 폐합성고분자화합물(합성수지류로 피복된 폐전선을 포함한다)
13 
하수준설토
12 
Other values (22)
59 

Length

Max length84
Median length57
Mean length20.524096
Min length3

Unique

Unique8 ?
Unique (%)4.8%

Sample

1st row그 밖의 폐기물
2nd row폐유리
3rd row폐도자기조각
4th row폐유리
5th row폐유리

Common Values

ValueCountFrequency (%)
폐합성수지류(폐염화비닐수지류는 제외한다) 45
27.1%
그 밖의 폐기물 19
11.4%
폐유리 18
 
10.8%
그 밖의 폐합성고분자화합물(합성수지류로 피복된 폐전선을 포함한다) 13
 
7.8%
하수준설토 12
 
7.2%
임목폐목재(건설공사ㆍ산지개간 등의 과정에서 발생된 나무뿌리ㆍ가지ㆍ줄기 등을 말한다) 12
 
7.2%
음식물류폐기물 7
 
4.2%
그 밖의 식물성잔재물 4
 
2.4%
폐가구류_ 폐도장목_ 폐목재포장재_ 폐전선드럼(원목상태의 깨끗한 목재를 말한다) 4
 
2.4%
폐전주(폐애자_ 폐근가 및 폐합성수지제 커버류 등을 포함한다) 3
 
1.8%
Other values (17) 29
17.5%

Length

2023-12-12T19:44:44.822232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
제외한다 46
 
8.3%
폐합성수지류(폐염화비닐수지류는 45
 
8.1%
38
 
6.9%
밖의 38
 
6.9%
말한다 26
 
4.7%
등의 20
 
3.6%
폐기물 19
 
3.4%
폐유리 18
 
3.2%
등을 18
 
3.2%
과정에서 17
 
3.1%
Other values (64) 269
48.6%

배출량(톤)
Real number (ℝ)

Distinct56
Distinct (%)33.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean250.17735
Minimum0.96
Maximum3000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.6 KiB
2023-12-12T19:44:44.984879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.96
5-th percentile9.25
Q127
median93
Q3285
95-th percentile900
Maximum3000
Range2999.04
Interquartile range (IQR)258

Descriptive statistics

Standard deviation394.17827
Coefficient of variation (CV)1.5755954
Kurtosis16.030636
Mean250.17735
Median Absolute Deviation (MAD)75
Skewness3.3295358
Sum41529.44
Variance155376.51
MonotonicityNot monotonic
2023-12-12T19:44:45.158991image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
36.0 15
 
9.0%
700.0 13
 
7.8%
120.0 10
 
6.0%
240.0 8
 
4.8%
25.0 7
 
4.2%
200.0 7
 
4.2%
100.0 6
 
3.6%
150.0 5
 
3.0%
30.0 4
 
2.4%
24.0 4
 
2.4%
Other values (46) 87
52.4%
ValueCountFrequency (%)
0.96 1
 
0.6%
5.0 1
 
0.6%
6.0 1
 
0.6%
8.0 2
1.2%
8.4 3
1.8%
9.0 1
 
0.6%
10.0 4
2.4%
12.0 3
1.8%
14.0 2
1.2%
15.0 3
1.8%
ValueCountFrequency (%)
3000.0 1
 
0.6%
1872.0 1
 
0.6%
1600.0 1
 
0.6%
1500.0 1
 
0.6%
1200.0 2
 
1.2%
912.0 1
 
0.6%
900.0 4
 
2.4%
804.0 1
 
0.6%
720.0 1
 
0.6%
700.0 13
7.8%

Interactions

2023-12-12T19:44:41.989739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:44:41.704672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:44:42.121897image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:44:41.839861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T19:44:45.315213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번상호사업장도로명주소폐기물 종류배출량(톤)
연번1.0000.9820.9790.7760.378
상호0.9821.0001.0000.9650.521
사업장도로명주소0.9791.0001.0000.9490.582
폐기물 종류0.7760.9650.9491.0000.774
배출량(톤)0.3780.5210.5820.7741.000
2023-12-12T19:44:45.432921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번배출량(톤)폐기물 종류
연번1.000-0.1550.391
배출량(톤)-0.1551.0000.416
폐기물 종류0.3910.4161.000

Missing values

2023-12-12T19:44:42.322155image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T19:44:42.535131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번상호사업장도로명주소폐기물 종류배출량(톤)
01우송정보대학 산학협력단대전광역시 동구 동대전로131번길 53_ 우송고등학교_우송중학교 1층 (자양동)그 밖의 폐기물36.0
12전진상사대전광역시 동구 산서로 1651 (대별동)폐유리700.0
23전진상사대전광역시 동구 산서로 1651 (대별동)폐도자기조각240.0
34전진상사대전광역시 동구 산서로 1651 (대별동)폐유리700.0
45전진상사대전광역시 동구 산서로 1651 (대별동)폐유리700.0
56전진상사대전광역시 동구 산서로 1651 (대별동)폐유리700.0
67(주)금강푸드대전광역시 동구 물류로14번길 73-30 (구도동)그 밖의 식물성잔재물912.0
78해태제과식품(주) 식품충청영업소대전광역시 동구 우암로277번길 131 (가양동)폐합성수지류(폐염화비닐수지류는 제외한다)36.0
89해태제과식품(주) 식품충청영업소대전광역시 동구 우암로277번길 131 (가양동)그 밖의 폐기물36.0
910주식회사 사모스대전광역시 동구 하소중로 22 (하소동)목재가공공장 부산물(접착제_ 페인트_ 기름_ 콘크리트 등의 물질이 사용된 목재부산물 및 분진을 말한다)72.0
연번상호사업장도로명주소폐기물 종류배출량(톤)
156157통우통신대전광역시 대덕구 덕암북로72번길 71 (덕암동)그 밖의 폐합성고분자화합물(합성수지류로 피복된 폐전선을 포함한다)10.0
157158통우통신대전광역시 대덕구 덕암북로72번길 71 (덕암동)폐전주(폐애자_ 폐근가 및 폐합성수지제 커버류 등을 포함한다)50.0
158159대전광역시 동구청(건설과)대전광역시 동구 동구청로 147_ 동구청 (가오동)하수준설토50.0
159160대전광역시 동구청(건설과)대전광역시 동구 동구청로 147_ 동구청 (가오동)하수준설토50.0
160161계룡건설산업㈜대전광역시 서구 문정로48번길 48 (탄방동)임목폐목재(건설공사_ 산지개간 등의 과정에서 발생된 나무뿌리_ 가지_ 줄기 등을 말한다)100.0
161162코오롱글로벌㈜경기도 과천시 코오롱로 11_ 코오롱 (별양동)폐토사1600.0
162163코오롱글로벌㈜경기도 과천시 코오롱로 11_ 코오롱 (별양동)그 밖의 폐기물240.0
163164㈜한화서울특별시 중구 청계천로 86_ 한화빌딩 (장교동)폐발포합성수지40.0
164165㈜한화서울특별시 중구 청계천로 86_ 한화빌딩 (장교동)임목폐목재(건설공사_ 산지개간 등의 과정에서 발생된 나무뿌리_ 가지_ 줄기 등을 말한다)25.0
165166(주)에코비트워터대전광역시 유성구 엑스포로 326 (원촌동)하수준설토150.0