Overview

Dataset statistics

Number of variables4
Number of observations664
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory21.5 KiB
Average record size in memory33.2 B

Variable types

Numeric1
Text3

Dataset

Description사업장폐기물에 대한 데이터(폐기물 업체 상호, 폐기물 종류, 사업장 도로명주소 등 포함하여 제공합니다) 사업장폐기물배출자 신고현황 등을 제공합니다
URLhttps://www.data.go.kr/data/15083944/fileData.do

Alerts

연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 10:42:03.773169
Analysis finished2023-12-12 10:42:04.600180
Duration0.83 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct664
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean332.5
Minimum1
Maximum664
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size6.0 KiB
2023-12-12T19:42:04.712307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile34.15
Q1166.75
median332.5
Q3498.25
95-th percentile630.85
Maximum664
Range663
Interquartile range (IQR)331.5

Descriptive statistics

Standard deviation191.82457
Coefficient of variation (CV)0.57691601
Kurtosis-1.2
Mean332.5
Median Absolute Deviation (MAD)166
Skewness0
Sum220780
Variance36796.667
MonotonicityStrictly increasing
2023-12-12T19:42:04.916921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
438 1
 
0.2%
440 1
 
0.2%
441 1
 
0.2%
442 1
 
0.2%
443 1
 
0.2%
444 1
 
0.2%
445 1
 
0.2%
446 1
 
0.2%
447 1
 
0.2%
Other values (654) 654
98.5%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
664 1
0.2%
663 1
0.2%
662 1
0.2%
661 1
0.2%
660 1
0.2%
659 1
0.2%
658 1
0.2%
657 1
0.2%
656 1
0.2%
655 1
0.2%

상호
Text

Distinct200
Distinct (%)30.1%
Missing0
Missing (%)0.0%
Memory size5.3 KiB
2023-12-12T19:42:05.257042image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length21
Mean length10.207831
Min length3

Characters and Unicode

Total characters6778
Distinct characters271
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique73 ?
Unique (%)11.0%

Sample

1st row부산광역시 푸른도시가꾸기사업소
2nd row(주)알파로보틱스
3rd row(주)비엠티
4th row(주)대일
5th row주식회사 유창에스엔티
ValueCountFrequency (%)
주)한솔그린 28
 
3.1%
주식회사 28
 
3.1%
부산환경공단 25
 
2.8%
기장공장 18
 
2.0%
기장사업소 18
 
2.0%
리클린대구(주 17
 
1.9%
주)성화그린 15
 
1.7%
푸른이앤지(주 15
 
1.7%
롯데쇼핑(주 14
 
1.6%
일광개발(주 13
 
1.4%
Other values (221) 706
78.7%
2023-12-12T19:42:05.799374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
525
 
7.7%
( 499
 
7.4%
) 499
 
7.4%
233
 
3.4%
188
 
2.8%
170
 
2.5%
165
 
2.4%
139
 
2.1%
123
 
1.8%
116
 
1.7%
Other values (261) 4121
60.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5378
79.3%
Open Punctuation 499
 
7.4%
Close Punctuation 499
 
7.4%
Space Separator 233
 
3.4%
Decimal Number 59
 
0.9%
Uppercase Letter 54
 
0.8%
Lowercase Letter 45
 
0.7%
Other Punctuation 8
 
0.1%
Other Symbol 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
525
 
9.8%
188
 
3.5%
170
 
3.2%
165
 
3.1%
139
 
2.6%
123
 
2.3%
116
 
2.2%
115
 
2.1%
93
 
1.7%
91
 
1.7%
Other values (240) 3653
67.9%
Uppercase Letter
ValueCountFrequency (%)
C 12
22.2%
N 12
22.2%
H 9
16.7%
S 8
14.8%
T 8
14.8%
I 5
9.3%
Decimal Number
ValueCountFrequency (%)
2 23
39.0%
1 16
27.1%
0 10
16.9%
9 5
 
8.5%
4 5
 
8.5%
Lowercase Letter
ValueCountFrequency (%)
e 9
20.0%
p 9
20.0%
o 9
20.0%
r 9
20.0%
y 9
20.0%
Open Punctuation
ValueCountFrequency (%)
( 499
100.0%
Close Punctuation
ValueCountFrequency (%)
) 499
100.0%
Space Separator
ValueCountFrequency (%)
233
100.0%
Other Punctuation
ValueCountFrequency (%)
& 8
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5381
79.4%
Common 1298
 
19.2%
Latin 99
 
1.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
525
 
9.8%
188
 
3.5%
170
 
3.2%
165
 
3.1%
139
 
2.6%
123
 
2.3%
116
 
2.2%
115
 
2.1%
93
 
1.7%
91
 
1.7%
Other values (241) 3656
67.9%
Latin
ValueCountFrequency (%)
C 12
12.1%
N 12
12.1%
e 9
9.1%
p 9
9.1%
o 9
9.1%
r 9
9.1%
y 9
9.1%
H 9
9.1%
S 8
8.1%
T 8
8.1%
Common
ValueCountFrequency (%)
( 499
38.4%
) 499
38.4%
233
18.0%
2 23
 
1.8%
1 16
 
1.2%
0 10
 
0.8%
& 8
 
0.6%
9 5
 
0.4%
4 5
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5378
79.3%
ASCII 1397
 
20.6%
None 3
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
525
 
9.8%
188
 
3.5%
170
 
3.2%
165
 
3.1%
139
 
2.6%
123
 
2.3%
116
 
2.2%
115
 
2.1%
93
 
1.7%
91
 
1.7%
Other values (240) 3653
67.9%
ASCII
ValueCountFrequency (%)
( 499
35.7%
) 499
35.7%
233
16.7%
2 23
 
1.6%
1 16
 
1.1%
C 12
 
0.9%
N 12
 
0.9%
0 10
 
0.7%
e 9
 
0.6%
p 9
 
0.6%
Other values (10) 75
 
5.4%
None
ValueCountFrequency (%)
3
100.0%
Distinct77
Distinct (%)11.6%
Missing0
Missing (%)0.0%
Memory size5.3 KiB
2023-12-12T19:42:06.166995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length84
Median length57
Mean length14.578313
Min length3

Characters and Unicode

Total characters9680
Distinct characters181
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)3.8%

Sample

1st row폐합성수지류(폐염화비닐수지류는 제외한다)
2nd row폐합성수지류(폐염화비닐수지류는 제외한다)
3rd row폐합성수지류(폐염화비닐수지류는 제외한다)
4th row폐합성수지류(폐염화비닐수지류는 제외한다)
5th row석재ㆍ골재폐수처리오니(석재ㆍ골재 생산 시 발생한 폐수를 처리하는 과정에서 발생한 오니로 한정한다)
ValueCountFrequency (%)
209
 
12.4%
밖의 209
 
12.4%
제외한다 190
 
11.3%
폐합성수지류(폐염화비닐수지류는 178
 
10.5%
폐기물 107
 
6.3%
폐목재류 37
 
2.2%
음식물류폐기물 33
 
2.0%
말한다 31
 
1.8%
폐수처리오니 30
 
1.8%
하수처리오니 23
 
1.4%
Other values (140) 641
38.0%
2023-12-12T19:42:06.772882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1030
 
10.6%
778
 
8.0%
517
 
5.3%
463
 
4.8%
427
 
4.4%
308
 
3.2%
271
 
2.8%
266
 
2.7%
247
 
2.6%
236
 
2.4%
Other values (171) 5137
53.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8087
83.5%
Space Separator 1030
 
10.6%
Open Punctuation 240
 
2.5%
Close Punctuation 240
 
2.5%
Connector Punctuation 58
 
0.6%
Decimal Number 18
 
0.2%
Other Punctuation 7
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
778
 
9.6%
517
 
6.4%
463
 
5.7%
427
 
5.3%
308
 
3.8%
271
 
3.4%
266
 
3.3%
247
 
3.1%
236
 
2.9%
235
 
2.9%
Other values (161) 4339
53.7%
Decimal Number
ValueCountFrequency (%)
1 12
66.7%
8 5
27.8%
2 1
 
5.6%
Open Punctuation
ValueCountFrequency (%)
( 235
97.9%
5
 
2.1%
Close Punctuation
ValueCountFrequency (%)
) 235
97.9%
5
 
2.1%
Space Separator
ValueCountFrequency (%)
1030
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 58
100.0%
Other Punctuation
ValueCountFrequency (%)
. 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8087
83.5%
Common 1593
 
16.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
778
 
9.6%
517
 
6.4%
463
 
5.7%
427
 
5.3%
308
 
3.8%
271
 
3.4%
266
 
3.3%
247
 
3.1%
236
 
2.9%
235
 
2.9%
Other values (161) 4339
53.7%
Common
ValueCountFrequency (%)
1030
64.7%
( 235
 
14.8%
) 235
 
14.8%
_ 58
 
3.6%
1 12
 
0.8%
. 7
 
0.4%
5
 
0.3%
5
 
0.3%
8 5
 
0.3%
2 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8055
83.2%
ASCII 1583
 
16.4%
Compat Jamo 32
 
0.3%
None 10
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1030
65.1%
( 235
 
14.8%
) 235
 
14.8%
_ 58
 
3.7%
1 12
 
0.8%
. 7
 
0.4%
8 5
 
0.3%
2 1
 
0.1%
Hangul
ValueCountFrequency (%)
778
 
9.7%
517
 
6.4%
463
 
5.7%
427
 
5.3%
308
 
3.8%
271
 
3.4%
266
 
3.3%
247
 
3.1%
236
 
2.9%
235
 
2.9%
Other values (160) 4307
53.5%
Compat Jamo
ValueCountFrequency (%)
32
100.0%
None
ValueCountFrequency (%)
5
50.0%
5
50.0%
Distinct192
Distinct (%)28.9%
Missing0
Missing (%)0.0%
Memory size5.3 KiB
2023-12-12T19:42:07.053883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length54
Median length45
Mean length26.275602
Min length1

Characters and Unicode

Total characters17447
Distinct characters246
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique64 ?
Unique (%)9.6%

Sample

1st row부산광역시 연제구 고분로191번길 6 (연산동_ 부산광역시푸른도시가꾸기사업소)
2nd row부산광역시 기장군 장안읍 명례산단7로 30 (주)알파로보틱스
3rd row부산광역시 기장군 장안읍 장안산단2로 17
4th row부산광역시 기장군 정관면 산단4로 99
5th row부산광역시 기장군 정관읍 산단5로 100-136_ 대양테크
ValueCountFrequency (%)
부산광역시 647
18.0%
기장군 633
17.6%
정관읍 164
 
4.6%
기장읍 159
 
4.4%
장안읍 133
 
3.7%
정관면 97
 
2.7%
기장대로 62
 
1.7%
일광읍 58
 
1.6%
산단7로 52
 
1.4%
기장해안로 47
 
1.3%
Other values (339) 1552
43.1%
2023-12-12T19:42:07.544469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2962
 
17.0%
1127
 
6.5%
988
 
5.7%
939
 
5.4%
767
 
4.4%
727
 
4.2%
695
 
4.0%
651
 
3.7%
633
 
3.6%
531
 
3.0%
Other values (236) 7427
42.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 11323
64.9%
Space Separator 2962
 
17.0%
Decimal Number 2417
 
13.9%
Dash Punctuation 183
 
1.0%
Close Punctuation 166
 
1.0%
Open Punctuation 166
 
1.0%
Connector Punctuation 151
 
0.9%
Uppercase Letter 54
 
0.3%
Other Punctuation 24
 
0.1%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1127
 
10.0%
988
 
8.7%
939
 
8.3%
767
 
6.8%
727
 
6.4%
695
 
6.1%
651
 
5.7%
633
 
5.6%
531
 
4.7%
519
 
4.6%
Other values (208) 3746
33.1%
Decimal Number
ValueCountFrequency (%)
1 451
18.7%
2 364
15.1%
3 270
11.2%
5 250
10.3%
6 246
10.2%
7 215
8.9%
0 171
 
7.1%
4 169
 
7.0%
9 165
 
6.8%
8 116
 
4.8%
Uppercase Letter
ValueCountFrequency (%)
C 14
25.9%
A 14
25.9%
S 12
22.2%
T 11
20.4%
K 1
 
1.9%
N 1
 
1.9%
H 1
 
1.9%
Other Punctuation
ValueCountFrequency (%)
* 11
45.8%
& 11
45.8%
: 2
 
8.3%
Close Punctuation
ValueCountFrequency (%)
) 155
93.4%
] 11
 
6.6%
Open Punctuation
ValueCountFrequency (%)
( 155
93.4%
[ 11
 
6.6%
Space Separator
ValueCountFrequency (%)
2962
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 183
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 151
100.0%
Lowercase Letter
ValueCountFrequency (%)
n 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 11323
64.9%
Common 6069
34.8%
Latin 55
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1127
 
10.0%
988
 
8.7%
939
 
8.3%
767
 
6.8%
727
 
6.4%
695
 
6.1%
651
 
5.7%
633
 
5.6%
531
 
4.7%
519
 
4.6%
Other values (208) 3746
33.1%
Common
ValueCountFrequency (%)
2962
48.8%
1 451
 
7.4%
2 364
 
6.0%
3 270
 
4.4%
5 250
 
4.1%
6 246
 
4.1%
7 215
 
3.5%
- 183
 
3.0%
0 171
 
2.8%
4 169
 
2.8%
Other values (10) 788
 
13.0%
Latin
ValueCountFrequency (%)
C 14
25.5%
A 14
25.5%
S 12
21.8%
T 11
20.0%
K 1
 
1.8%
N 1
 
1.8%
H 1
 
1.8%
n 1
 
1.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 11323
64.9%
ASCII 6124
35.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2962
48.4%
1 451
 
7.4%
2 364
 
5.9%
3 270
 
4.4%
5 250
 
4.1%
6 246
 
4.0%
7 215
 
3.5%
- 183
 
3.0%
0 171
 
2.8%
4 169
 
2.8%
Other values (18) 843
 
13.8%
Hangul
ValueCountFrequency (%)
1127
 
10.0%
988
 
8.7%
939
 
8.3%
767
 
6.8%
727
 
6.4%
695
 
6.1%
651
 
5.7%
633
 
5.6%
531
 
4.7%
519
 
4.6%
Other values (208) 3746
33.1%

Interactions

2023-12-12T19:42:04.274471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T19:42:07.666465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번폐기물 종류
연번1.0000.629
폐기물 종류0.6291.000

Missing values

2023-12-12T19:42:04.433038image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T19:42:04.548661image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번상호폐기물 종류사업장도로명주소
01부산광역시 푸른도시가꾸기사업소폐합성수지류(폐염화비닐수지류는 제외한다)부산광역시 연제구 고분로191번길 6 (연산동_ 부산광역시푸른도시가꾸기사업소)
12(주)알파로보틱스폐합성수지류(폐염화비닐수지류는 제외한다)부산광역시 기장군 장안읍 명례산단7로 30 (주)알파로보틱스
23(주)비엠티폐합성수지류(폐염화비닐수지류는 제외한다)부산광역시 기장군 장안읍 장안산단2로 17
34(주)대일폐합성수지류(폐염화비닐수지류는 제외한다)부산광역시 기장군 정관면 산단4로 99
45주식회사 유창에스엔티석재ㆍ골재폐수처리오니(석재ㆍ골재 생산 시 발생한 폐수를 처리하는 과정에서 발생한 오니로 한정한다)부산광역시 기장군 정관읍 산단5로 100-136_ 대양테크
56주식회사 유창에스엔티폐석재부산광역시 기장군 정관읍 산단5로 100-136_ 대양테크
67일진엔티에스㈜폐합성수지류(폐염화비닐수지류는 제외한다)부산광역시 기장군 장안읍 명례산단6로 162
78㈜아이씨맥스 부산기장공장폐합성수지류(폐염화비닐수지류는 제외한다)부산광역시 기장군 장안읍 오리산단5로 56
89㈜아이씨맥스 부산기장공장폐합성수지류(폐염화비닐수지류는 제외한다)부산광역시 기장군 장안읍 오리산단5로 56
910주식회사 일영이푸드축산물가공잔재물(동물성 유지류는 제외한다)부산광역시 기장군 정관읍 산단7로 65_ 일영이푸드
연번상호폐기물 종류사업장도로명주소
654655(주)제일선재그 밖의 광재류부산광역시 기장군 정관읍 산단4로 2-27 (주)제일선재
655656동진특수고무폐합성고무류부산광역시 기장군 정관면 정관중앙로 211
656657동래석재석재ㆍ골재폐수처리오니(석재ㆍ골재 생산 시 발생한 폐수를 처리하는 과정에서 발생한 오니로 한정한다)부산광역시 기장군 정관면 산단4로 2-13
657658동래석재폐석재부산광역시 기장군 정관면 산단4로 2-13
658659아시아드컨트리클럽(주)음식물쓰레기부산광역시 기장군 일광읍 이천8길 100
659660주식회사영남환경폐합성수지부산광역시 기장군 정관면 양수길 55-52
660661국립수산과학원건설폐기물부산광역시 기장군 기장읍 기장해안로 216
661662국립수산과학원폐합성수지부산광역시 기장군 기장읍 기장해안로 216
662663(재)한국 천부교 전도관 유지재단소각잔재물부산광역시 기장군 기장읍 차성동로19번길 1
663664(주)고려화학그 밖의 공정오니부산광역시 기장군 정관읍 용수공단2길 85 (고려화학)