Overview

Dataset statistics

Number of variables5
Number of observations139
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.7 KiB
Average record size in memory41.9 B

Variable types

DateTime1
Text1
Categorical2
Numeric1

Dataset

Description매립장 내 폐기물반입을 하기위한 반입협의신청 정보와 그에 따른 현장 실사 정보입니다. 개방항목 : 협의년월, 지자체명, 폐기물명, 협의량,신청단위 항목을 제공합니다.
URLhttps://www.data.go.kr/data/15064375/fileData.do

Reproduction

Analysis started2023-12-12 11:03:10.284152
Analysis finished2023-12-12 11:03:11.240674
Duration0.96 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct59
Distinct (%)42.4%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
Minimum2001-01-01 00:00:00
Maximum2022-08-01 00:00:00
2023-12-12T20:03:11.367935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:03:11.639518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct68
Distinct (%)48.9%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-12T20:03:12.046751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length6
Mean length6.5755396
Min length4

Characters and Unicode

Total characters914
Distinct characters92
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique39 ?
Unique (%)28.1%

Sample

1st row서울시(자원순환과)
2nd row인천환경공단(청라)
3rd row서울시서대문구
4th row서울시중랑구
5th row김포시맑은물사업소
ValueCountFrequency (%)
경기도양주시 13
 
9.4%
경기도고양시 9
 
6.5%
경기도안산시 6
 
4.3%
경기도청 5
 
3.6%
서울시(자원순환과 4
 
2.9%
인천시중구 4
 
2.9%
경기도포천시 4
 
2.9%
인천시서구 4
 
2.9%
경기도파주시 4
 
2.9%
서울시동작구 4
 
2.9%
Other values (58) 82
59.0%
2023-12-12T20:03:12.620991image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
123
 
13.5%
81
 
8.9%
74
 
8.1%
69
 
7.5%
40
 
4.4%
37
 
4.0%
31
 
3.4%
30
 
3.3%
26
 
2.8%
23
 
2.5%
Other values (82) 380
41.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 902
98.7%
Close Punctuation 6
 
0.7%
Open Punctuation 6
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
123
 
13.6%
81
 
9.0%
74
 
8.2%
69
 
7.6%
40
 
4.4%
37
 
4.1%
31
 
3.4%
30
 
3.3%
26
 
2.9%
23
 
2.5%
Other values (80) 368
40.8%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 902
98.7%
Common 12
 
1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
123
 
13.6%
81
 
9.0%
74
 
8.2%
69
 
7.6%
40
 
4.4%
37
 
4.1%
31
 
3.4%
30
 
3.3%
26
 
2.9%
23
 
2.5%
Other values (80) 368
40.8%
Common
ValueCountFrequency (%)
) 6
50.0%
( 6
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 902
98.7%
ASCII 12
 
1.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
123
 
13.6%
81
 
9.0%
74
 
8.2%
69
 
7.6%
40
 
4.4%
37
 
4.1%
31
 
3.4%
30
 
3.3%
26
 
2.9%
23
 
2.5%
Other values (80) 368
40.8%
ASCII
ValueCountFrequency (%)
) 6
50.0%
( 6
50.0%

폐기물명
Categorical

Distinct7
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
사업장배출계(기타)
53 
사업장비배출시설계(자가)_협의폐기물
44 
광재등17종
16 
정수오니
12 
하수준설토
 
5
Other values (2)

Length

Max length19
Median length10
Mean length11.374101
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row광재등17종
2nd row광재등17종
3rd row사업장비배출시설계(자가)_협의폐기물
4th row사업장비배출시설계(자가)_협의폐기물
5th row정수오니

Common Values

ValueCountFrequency (%)
사업장배출계(기타) 53
38.1%
사업장비배출시설계(자가)_협의폐기물 44
31.7%
광재등17종 16
 
11.5%
정수오니 12
 
8.6%
하수준설토 5
 
3.6%
오니류등4종 5
 
3.6%
하수오니 4
 
2.9%

Length

2023-12-12T20:03:12.831522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:03:13.530135image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사업장배출계(기타 53
38.1%
사업장비배출시설계(자가)_협의폐기물 44
31.7%
광재등17종 16
 
11.5%
정수오니 12
 
8.6%
하수준설토 5
 
3.6%
오니류등4종 5
 
3.6%
하수오니 4
 
2.9%

협의량
Real number (ℝ)

Distinct92
Distinct (%)66.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1535.1978
Minimum9
Maximum40000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.4 KiB
2023-12-12T20:03:13.739541image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum9
5-th percentile30
Q1120
median292
Q31000
95-th percentile5100
Maximum40000
Range39991
Interquartile range (IQR)880

Descriptive statistics

Standard deviation4528.7796
Coefficient of variation (CV)2.9499648
Kurtosis44.866173
Mean1535.1978
Median Absolute Deviation (MAD)208
Skewness6.2549751
Sum213392.5
Variance20509845
MonotonicityNot monotonic
2023-12-12T20:03:13.960937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
200.0 8
 
5.8%
100.0 5
 
3.6%
3000.0 4
 
2.9%
1000.0 4
 
2.9%
30.0 4
 
2.9%
600.0 4
 
2.9%
1800.0 3
 
2.2%
162.0 3
 
2.2%
400.0 3
 
2.2%
120.0 3
 
2.2%
Other values (82) 98
70.5%
ValueCountFrequency (%)
9.0 1
 
0.7%
19.0 1
 
0.7%
22.0 1
 
0.7%
23.0 1
 
0.7%
27.0 1
 
0.7%
30.0 4
2.9%
35.0 1
 
0.7%
37.5 1
 
0.7%
40.0 2
1.4%
44.0 1
 
0.7%
ValueCountFrequency (%)
40000.0 1
0.7%
23534.0 2
1.4%
8496.0 1
0.7%
8400.0 1
0.7%
7200.0 1
0.7%
6000.0 1
0.7%
5000.0 1
0.7%
4100.0 1
0.7%
4000.0 1
0.7%
3800.0 1
0.7%

신청단위
Categorical

Distinct3
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
톤/월
93 
톤/년
36 
총/기간
10 

Length

Max length4
Median length3
Mean length3.0719424
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row톤/월
2nd row톤/월
3rd row톤/월
4th row톤/월
5th row톤/년

Common Values

ValueCountFrequency (%)
톤/월 93
66.9%
톤/년 36
 
25.9%
총/기간 10
 
7.2%

Length

2023-12-12T20:03:14.188780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:03:14.330125image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
톤/월 93
66.9%
톤/년 36
 
25.9%
총/기간 10
 
7.2%

Interactions

2023-12-12T20:03:10.807317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T20:03:14.445441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
협의년월지자체명폐기물명협의량신청단위
협의년월1.0000.9720.6460.8200.805
지자체명0.9721.0000.9300.9340.844
폐기물명0.6460.9301.0000.3980.598
협의량0.8200.9340.3981.0000.416
신청단위0.8050.8440.5980.4161.000
2023-12-12T20:03:14.592983image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
폐기물명신청단위
폐기물명1.0000.484
신청단위0.4841.000
2023-12-12T20:03:14.749889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
협의량폐기물명신청단위
협의량1.0000.2660.343
폐기물명0.2661.0000.484
신청단위0.3430.4841.000

Missing values

2023-12-12T20:03:11.026433image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T20:03:11.180868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

협의년월지자체명폐기물명협의량신청단위
02001-01서울시(자원순환과)광재등17종4100.0톤/월
12002-04인천환경공단(청라)광재등17종3000.0톤/월
22003-04서울시서대문구사업장비배출시설계(자가)_협의폐기물390.0톤/월
32003-04서울시중랑구사업장비배출시설계(자가)_협의폐기물200.0톤/월
42003-09김포시맑은물사업소정수오니3800.0톤/년
52003-09서울대공원사업장비배출시설계(자가)_협의폐기물120.0톤/년
62003-11서울시관악구사업장비배출시설계(자가)_협의폐기물400.0톤/월
72003-12경기도김포시사업장배출계(기타)400.0톤/월
82003-12경기도김포시사업장배출계(기타)612.0톤/월
92004-10인천시남동구사업장비배출시설계(자가)_협의폐기물500.0톤/월
협의년월지자체명폐기물명협의량신청단위
1292020-08한강사업본부사업장비배출시설계(자가)_협의폐기물600.0톤/년
1302021-10경기도부천시사업장비배출시설계(자가)_협의폐기물100.0톤/월
1312021-10성남맑은물관리사업소사업장비배출시설계(자가)_협의폐기물2880.0톤/년
1322021-10성남맑은물관리사업소하수준설토1000.0톤/년
1332021-11경기도고양시사업장배출계(기타)108.0톤/월
1342021-11경기도이천시광재등17종2126.0톤/년
1352021-12남양주시상하수도관리센터하수오니1200.0톤/년
1362022-08서울시관악구사업장비배출시설계(자가)_협의폐기물490.0톤/년
1372022-08서울시동작구사업장비배출시설계(자가)_협의폐기물490.0톤/년
1382022-08서울시영등포구사업장비배출시설계(자가)_협의폐기물490.0톤/년