Overview

Dataset statistics

Number of variables3
Number of observations182
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.7 KiB
Average record size in memory26.7 B

Variable types

Numeric2
Text1

Dataset

Description수도권매립지에 반입되는 폐기물의 코드, 단가 등 폐기물 정보입니다.개방항목 : 폐기물코드, 폐기물명, 폐기물단가(원) 의 항목을 제공합니다.
Author수도권매립지관리공사
URLhttps://www.data.go.kr/data/15064397/fileData.do

Alerts

폐기물코드 is highly overall correlated with 폐기물단가(원)High correlation
폐기물단가(원) is highly overall correlated with 폐기물코드High correlation
폐기물코드 has unique valuesUnique
폐기물단가(원) has 77 (42.3%) zerosZeros

Reproduction

Analysis started2024-04-13 11:38:31.997528
Analysis finished2024-04-13 11:38:35.637124
Duration3.64 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

폐기물코드
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct182
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean512159.82
Minimum20199
Maximum570101
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2024-04-13T20:38:35.857475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20199
5-th percentile400206.05
Q1510827.25
median521150
Q3560109.25
95-th percentile561011.95
Maximum570101
Range549902
Interquartile range (IQR)49282

Descriptive statistics

Standard deviation83588.815
Coefficient of variation (CV)0.16320846
Kurtosis18.839783
Mean512159.82
Median Absolute Deviation (MAD)28965.5
Skewness-3.9657651
Sum93213088
Variance6.98709 × 109
MonotonicityNot monotonic
2024-04-13T20:38:36.290136image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
561005 1
 
0.5%
510903 1
 
0.5%
560111 1
 
0.5%
561011 1
 
0.5%
550105 1
 
0.5%
560103 1
 
0.5%
550131 1
 
0.5%
550119 1
 
0.5%
400311 1
 
0.5%
512900 1
 
0.5%
Other values (172) 172
94.5%
ValueCountFrequency (%)
20199 1
0.5%
60101 1
0.5%
60103 1
0.5%
60199 1
0.5%
400101 1
0.5%
400102 1
0.5%
400103 1
0.5%
400104 1
0.5%
400105 1
0.5%
400206 1
0.5%
ValueCountFrequency (%)
570101 1
0.5%
562004 1
0.5%
562003 1
0.5%
562002 1
0.5%
562001 1
0.5%
561016 1
0.5%
561015 1
0.5%
561014 1
0.5%
561013 1
0.5%
561012 1
0.5%
Distinct166
Distinct (%)91.2%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2024-04-13T20:38:37.199438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length44
Median length18
Mean length6.7912088
Min length2

Characters and Unicode

Total characters1236
Distinct characters210
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique150 ?
Unique (%)82.4%

Sample

1st row2단계건조슬러지(처분)
2nd row2단계시설 준설물
3rd row3단계건조슬러지(연료탄)
4th row3단계건조슬러지(처분)
5th rowAll-Dash
ValueCountFrequency (%)
5
 
2.2%
밖의 5
 
2.2%
무기성 3
 
1.3%
처리물 3
 
1.3%
3
 
1.3%
폐금속류 2
 
0.9%
폐합성수지 2
 
0.9%
그밖의 2
 
0.9%
정수오니 2
 
0.9%
폐블럭 2
 
0.9%
Other values (179) 198
87.2%
2024-04-13T20:38:38.526888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
77
 
6.2%
45
 
3.6%
41
 
3.3%
( 37
 
3.0%
) 37
 
3.0%
32
 
2.6%
29
 
2.3%
28
 
2.3%
27
 
2.2%
24
 
1.9%
Other values (200) 859
69.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1054
85.3%
Space Separator 45
 
3.6%
Open Punctuation 37
 
3.0%
Close Punctuation 37
 
3.0%
Decimal Number 30
 
2.4%
Uppercase Letter 15
 
1.2%
Other Punctuation 8
 
0.6%
Lowercase Letter 5
 
0.4%
Dash Punctuation 3
 
0.2%
Connector Punctuation 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
77
 
7.3%
41
 
3.9%
32
 
3.0%
29
 
2.8%
28
 
2.7%
27
 
2.6%
24
 
2.3%
23
 
2.2%
20
 
1.9%
20
 
1.9%
Other values (176) 733
69.5%
Decimal Number
ValueCountFrequency (%)
2 8
26.7%
1 7
23.3%
3 5
16.7%
0 4
13.3%
5 3
 
10.0%
7 2
 
6.7%
4 1
 
3.3%
Uppercase Letter
ValueCountFrequency (%)
F 3
20.0%
R 3
20.0%
S 3
20.0%
G 2
13.3%
C 2
13.3%
D 1
 
6.7%
A 1
 
6.7%
Lowercase Letter
ValueCountFrequency (%)
l 2
40.0%
h 1
20.0%
s 1
20.0%
a 1
20.0%
Space Separator
ValueCountFrequency (%)
45
100.0%
Open Punctuation
ValueCountFrequency (%)
( 37
100.0%
Close Punctuation
ValueCountFrequency (%)
) 37
100.0%
Other Punctuation
ValueCountFrequency (%)
% 8
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1054
85.3%
Common 162
 
13.1%
Latin 20
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
77
 
7.3%
41
 
3.9%
32
 
3.0%
29
 
2.8%
28
 
2.7%
27
 
2.6%
24
 
2.3%
23
 
2.2%
20
 
1.9%
20
 
1.9%
Other values (176) 733
69.5%
Common
ValueCountFrequency (%)
45
27.8%
( 37
22.8%
) 37
22.8%
2 8
 
4.9%
% 8
 
4.9%
1 7
 
4.3%
3 5
 
3.1%
0 4
 
2.5%
5 3
 
1.9%
- 3
 
1.9%
Other values (3) 5
 
3.1%
Latin
ValueCountFrequency (%)
F 3
15.0%
R 3
15.0%
S 3
15.0%
l 2
10.0%
G 2
10.0%
C 2
10.0%
h 1
 
5.0%
s 1
 
5.0%
a 1
 
5.0%
D 1
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1053
85.2%
ASCII 182
 
14.7%
Compat Jamo 1
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
77
 
7.3%
41
 
3.9%
32
 
3.0%
29
 
2.8%
28
 
2.7%
27
 
2.6%
24
 
2.3%
23
 
2.2%
20
 
1.9%
20
 
1.9%
Other values (175) 732
69.5%
ASCII
ValueCountFrequency (%)
45
24.7%
( 37
20.3%
) 37
20.3%
2 8
 
4.4%
% 8
 
4.4%
1 7
 
3.8%
3 5
 
2.7%
0 4
 
2.2%
5 3
 
1.6%
F 3
 
1.6%
Other values (14) 25
13.7%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

폐기물단가(원)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct17
Distinct (%)9.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean75635.764
Minimum0
Maximum900000
Zeros77
Zeros (%)42.3%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2024-04-13T20:38:38.902219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median99999
Q3126840
95-th percentile147497
Maximum900000
Range900000
Interquartile range (IQR)126840

Descriptive statistics

Standard deviation89206.942
Coefficient of variation (CV)1.1794281
Kurtosis38.949219
Mean75635.764
Median Absolute Deviation (MAD)47498
Skewness4.3032361
Sum13765709
Variance7.9578785 × 109
MonotonicityNot monotonic
2024-04-13T20:38:39.260631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=17)
ValueCountFrequency (%)
0 77
42.3%
147497 38
20.9%
125191 20
 
11.0%
99999 12
 
6.6%
126840 9
 
4.9%
116855 5
 
2.7%
148270 5
 
2.7%
80453 4
 
2.2%
60500 4
 
2.2%
107058 1
 
0.5%
Other values (7) 7
 
3.8%
ValueCountFrequency (%)
0 77
42.3%
21811 1
 
0.5%
23500 1
 
0.5%
60500 4
 
2.2%
79408 1
 
0.5%
80453 4
 
2.2%
87608 1
 
0.5%
97963 1
 
0.5%
99999 12
 
6.6%
107058 1
 
0.5%
ValueCountFrequency (%)
900000 1
 
0.5%
148270 5
 
2.7%
147497 38
20.9%
126840 9
 
4.9%
125191 20
11.0%
116855 5
 
2.7%
108670 1
 
0.5%
107058 1
 
0.5%
99999 12
 
6.6%
97963 1
 
0.5%

Interactions

2024-04-13T20:38:34.703836image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-13T20:38:34.211341image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-13T20:38:34.945431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-13T20:38:34.465905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-13T20:38:39.485936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
폐기물코드폐기물단가(원)
폐기물코드1.0000.518
폐기물단가(원)0.5181.000
2024-04-13T20:38:39.717050image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
폐기물코드폐기물단가(원)
폐기물코드1.000-0.704
폐기물단가(원)-0.7041.000

Missing values

2024-04-13T20:38:35.258680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-13T20:38:35.518927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

폐기물코드폐기물명폐기물단가(원)
05610052단계건조슬러지(처분)0
15601392단계시설 준설물0
25620043단계건조슬러지(연료탄)0
35610093단계건조슬러지(처분)0
4550129All-Dash0
5562001SRF0
6560133SRF반입장폐기물0
7560134SRF시설 고철(무상공급)0
8520400가내공업116855
9519900건설공사로 인하여 발생되는 그 밖의 폐기물147497
폐기물코드폐기물명폐기물단가(원)
172510205하수준설토79408
173561010하수준설토(외곽수로)0
174560148하수준설토(재활용)0
175400414혼합건설폐기물147497
176513400혼합건설폐기물147497
177550111활성탄99999
178550106황산0
179550117황산(70%)0
180550114황산제이철(11%)99999
181550107황산제일철0