Overview

Dataset statistics

Number of variables6
Number of observations84
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.2 KiB
Average record size in memory51.6 B

Variable types

Numeric2
Categorical2
Text2

Dataset

Description경상남도 생활폐기물 처리시설 집계현황입니다. (매립시설 24개소, 소각시설 20개소, 음식물류 폐기물 공공처리시설 12개소, 폐기물 에너지화시설 6개소, 재활용품 선별장 22개소)
Author공공데이터포털
URLhttps://www.data.go.kr/data/15088730/fileData.do

Alerts

연번 is highly overall correlated with 용량 and 2 other fieldsHigh correlation
용량 is highly overall correlated with 연번High correlation
시설구분 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
용량 단위 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
연번 has unique valuesUnique

Reproduction

Analysis started2024-04-16 22:34:05.529121
Analysis finished2024-04-16 22:34:06.290705
Duration0.76 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct84
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean42.5
Minimum1
Maximum84
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size888.0 B
2024-04-17T07:34:06.351085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.15
Q121.75
median42.5
Q363.25
95-th percentile79.85
Maximum84
Range83
Interquartile range (IQR)41.5

Descriptive statistics

Standard deviation24.392622
Coefficient of variation (CV)0.57394404
Kurtosis-1.2
Mean42.5
Median Absolute Deviation (MAD)21
Skewness0
Sum3570
Variance595
MonotonicityStrictly increasing
2024-04-17T07:34:06.465814image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.2%
55 1
 
1.2%
63 1
 
1.2%
62 1
 
1.2%
61 1
 
1.2%
60 1
 
1.2%
59 1
 
1.2%
58 1
 
1.2%
57 1
 
1.2%
56 1
 
1.2%
Other values (74) 74
88.1%
ValueCountFrequency (%)
1 1
1.2%
2 1
1.2%
3 1
1.2%
4 1
1.2%
5 1
1.2%
6 1
1.2%
7 1
1.2%
8 1
1.2%
9 1
1.2%
10 1
1.2%
ValueCountFrequency (%)
84 1
1.2%
83 1
1.2%
82 1
1.2%
81 1
1.2%
80 1
1.2%
79 1
1.2%
78 1
1.2%
77 1
1.2%
76 1
1.2%
75 1
1.2%

시설구분
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)6.0%
Missing0
Missing (%)0.0%
Memory size804.0 B
매립시설
24 
재활용품선별장
22 
소각시설
20 
음식물처리시설
12 
에너지화시설

Length

Max length7
Median length4
Mean length5.3571429
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row매립시설
2nd row매립시설
3rd row매립시설
4th row매립시설
5th row매립시설

Common Values

ValueCountFrequency (%)
매립시설 24
28.6%
재활용품선별장 22
26.2%
소각시설 20
23.8%
음식물처리시설 12
14.3%
에너지화시설 6
 
7.1%

Length

2024-04-17T07:34:06.594290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T07:34:06.689651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
매립시설 24
28.6%
재활용품선별장 22
26.2%
소각시설 20
23.8%
음식물처리시설 12
14.3%
에너지화시설 6
 
7.1%
Distinct53
Distinct (%)63.1%
Missing0
Missing (%)0.0%
Memory size804.0 B
2024-04-17T07:34:06.882259image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length3
Mean length5.8214286
Min length3

Characters and Unicode

Total characters489
Distinct characters84
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique34 ?
Unique (%)40.5%

Sample

1st row창원 천선 매립장
2nd row창원 덕동 매립장
3rd row창원 덕산 매립장
4th row진주권 광역쓰레기매립장
5th row명정(2차) 폐기물 매립장
ValueCountFrequency (%)
양산시 5
 
4.8%
매립장 5
 
4.8%
통영시 4
 
3.8%
거제시 4
 
3.8%
창원시 3
 
2.9%
창원 3
 
2.9%
폐기물매립장 3
 
2.9%
하동군 3
 
2.9%
창원시(마산 3
 
2.9%
밀양시 3
 
2.9%
Other values (53) 68
65.4%
2024-04-17T07:34:07.199530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
46
 
9.4%
22
 
4.5%
20
 
4.1%
20
 
4.1%
19
 
3.9%
19
 
3.9%
19
 
3.9%
19
 
3.9%
17
 
3.5%
( 16
 
3.3%
Other values (74) 272
55.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 433
88.5%
Space Separator 20
 
4.1%
Open Punctuation 16
 
3.3%
Close Punctuation 16
 
3.3%
Decimal Number 4
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
46
 
10.6%
22
 
5.1%
20
 
4.6%
19
 
4.4%
19
 
4.4%
19
 
4.4%
19
 
4.4%
17
 
3.9%
16
 
3.7%
15
 
3.5%
Other values (69) 221
51.0%
Decimal Number
ValueCountFrequency (%)
2 3
75.0%
1 1
 
25.0%
Space Separator
ValueCountFrequency (%)
20
100.0%
Open Punctuation
ValueCountFrequency (%)
( 16
100.0%
Close Punctuation
ValueCountFrequency (%)
) 16
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 433
88.5%
Common 56
 
11.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
46
 
10.6%
22
 
5.1%
20
 
4.6%
19
 
4.4%
19
 
4.4%
19
 
4.4%
19
 
4.4%
17
 
3.9%
16
 
3.7%
15
 
3.5%
Other values (69) 221
51.0%
Common
ValueCountFrequency (%)
20
35.7%
( 16
28.6%
) 16
28.6%
2 3
 
5.4%
1 1
 
1.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 433
88.5%
ASCII 56
 
11.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
46
 
10.6%
22
 
5.1%
20
 
4.6%
19
 
4.4%
19
 
4.4%
19
 
4.4%
19
 
4.4%
17
 
3.9%
16
 
3.7%
15
 
3.5%
Other values (69) 221
51.0%
ASCII
ValueCountFrequency (%)
20
35.7%
( 16
28.6%
) 16
28.6%
2 3
 
5.4%
1 1
 
1.8%
Distinct71
Distinct (%)84.5%
Missing0
Missing (%)0.0%
Memory size804.0 B
2024-04-17T07:34:07.430924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length22
Mean length14.785714
Min length6

Characters and Unicode

Total characters1242
Distinct characters109
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique59 ?
Unique (%)70.2%

Sample

1st row성산구 공단로 788번길 97
2nd row마산합포구 가포로 615-119
3rd row진해구 천자로 101
4th row내동면 유수길 75번길 63
5th row평인일주로 1074-30
ValueCountFrequency (%)
성산구 7
 
2.8%
창곡로 6
 
2.4%
내동면 5
 
2.0%
한내8길 5
 
2.0%
금성면 5
 
2.0%
김해대로 4
 
1.6%
108번길 4
 
1.6%
유수길 4
 
1.6%
마산합포구 4
 
1.6%
함양읍 4
 
1.6%
Other values (134) 204
81.0%
2024-04-17T07:34:07.814695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
172
 
13.8%
1 79
 
6.4%
51
 
4.1%
2 41
 
3.3%
- 41
 
3.3%
40
 
3.2%
5 39
 
3.1%
8 38
 
3.1%
37
 
3.0%
7 33
 
2.7%
Other values (99) 671
54.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 636
51.2%
Decimal Number 357
28.7%
Space Separator 172
 
13.8%
Dash Punctuation 41
 
3.3%
Close Punctuation 18
 
1.4%
Open Punctuation 18
 
1.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
51
 
8.0%
40
 
6.3%
37
 
5.8%
29
 
4.6%
20
 
3.1%
18
 
2.8%
18
 
2.8%
18
 
2.8%
16
 
2.5%
15
 
2.4%
Other values (85) 374
58.8%
Decimal Number
ValueCountFrequency (%)
1 79
22.1%
2 41
11.5%
5 39
10.9%
8 38
10.6%
7 33
9.2%
3 29
 
8.1%
9 29
 
8.1%
0 25
 
7.0%
6 24
 
6.7%
4 20
 
5.6%
Space Separator
ValueCountFrequency (%)
172
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 41
100.0%
Close Punctuation
ValueCountFrequency (%)
) 18
100.0%
Open Punctuation
ValueCountFrequency (%)
( 18
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 636
51.2%
Common 606
48.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
51
 
8.0%
40
 
6.3%
37
 
5.8%
29
 
4.6%
20
 
3.1%
18
 
2.8%
18
 
2.8%
18
 
2.8%
16
 
2.5%
15
 
2.4%
Other values (85) 374
58.8%
Common
ValueCountFrequency (%)
172
28.4%
1 79
13.0%
2 41
 
6.8%
- 41
 
6.8%
5 39
 
6.4%
8 38
 
6.3%
7 33
 
5.4%
3 29
 
4.8%
9 29
 
4.8%
0 25
 
4.1%
Other values (4) 80
13.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 636
51.2%
ASCII 606
48.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
172
28.4%
1 79
13.0%
2 41
 
6.8%
- 41
 
6.8%
5 39
 
6.4%
8 38
 
6.3%
7 33
 
5.4%
3 29
 
4.8%
9 29
 
4.8%
0 25
 
4.1%
Other values (4) 80
13.2%
Hangul
ValueCountFrequency (%)
51
 
8.0%
40
 
6.3%
37
 
5.8%
29
 
4.6%
20
 
3.1%
18
 
2.8%
18
 
2.8%
18
 
2.8%
16
 
2.5%
15
 
2.4%
Other values (85) 374
58.8%

용량
Real number (ℝ)

HIGH CORRELATION 

Distinct48
Distinct (%)57.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean315532.73
Minimum5
Maximum5854955
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size888.0 B
2024-04-17T07:34:07.942386image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum5
5-th percentile10
Q120
median80
Q389465.75
95-th percentile2622918.5
Maximum5854955
Range5854950
Interquartile range (IQR)89445.75

Descriptive statistics

Standard deviation947449.69
Coefficient of variation (CV)3.0026986
Kurtosis17.162296
Mean315532.73
Median Absolute Deviation (MAD)70
Skewness3.9818751
Sum26504749
Variance8.9766092 × 1011
MonotonicityNot monotonic
2024-04-17T07:34:08.063058image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=48)
ValueCountFrequency (%)
10 8
 
9.5%
100 6
 
7.1%
200 4
 
4.8%
30 4
 
4.8%
15 4
 
4.8%
20 4
 
4.8%
50 4
 
4.8%
80 3
 
3.6%
40 3
 
3.6%
60 2
 
2.4%
Other values (38) 42
50.0%
ValueCountFrequency (%)
5 1
 
1.2%
6 1
 
1.2%
8 2
 
2.4%
10 8
9.5%
15 4
4.8%
16 1
 
1.2%
19 1
 
1.2%
20 4
4.8%
24 1
 
1.2%
30 4
4.8%
ValueCountFrequency (%)
5854955 1
1.2%
3810537 1
1.2%
3254000 1
1.2%
3019890 1
1.2%
2810950 1
1.2%
1557407 1
1.2%
1040000 1
1.2%
1033493 1
1.2%
1000994 1
1.2%
432211 1
1.2%

용량 단위
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)3.6%
Missing0
Missing (%)0.0%
Memory size804.0 B
톤/일
58 
24 
kw/h
 
2

Length

Max length4
Median length3
Mean length2.452381
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
톤/일 58
69.0%
24
28.6%
kw/h 2
 
2.4%

Length

2024-04-17T07:34:08.184872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T07:34:08.273582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
톤/일 58
69.0%
24
28.6%
kw/h 2
 
2.4%

Interactions

2024-04-17T07:34:06.025408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T07:34:05.896777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T07:34:06.086222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T07:34:05.955120image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-17T07:34:08.335914image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번시설구분시설명소재지용량용량 단위
연번1.0000.9900.7260.7200.3860.820
시설구분0.9901.0000.6440.0000.0970.784
시설명0.7260.6441.0000.9661.0001.000
소재지0.7200.0000.9661.0000.5660.000
용량0.3860.0971.0000.5661.0000.396
용량 단위0.8200.7841.0000.0000.3961.000
2024-04-17T07:34:08.417290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시설구분용량 단위
시설구분1.0000.789
용량 단위0.7891.000
2024-04-17T07:34:08.488642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번용량시설구분용량 단위
연번1.000-0.7560.8270.689
용량-0.7561.0000.0530.282
시설구분0.8270.0531.0000.789
용량 단위0.6890.2820.7891.000

Missing values

2024-04-17T07:34:06.176940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-17T07:34:06.257820image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번시설구분시설명소재지용량용량 단위
01매립시설창원 천선 매립장성산구 공단로 788번길 973810537
12매립시설창원 덕동 매립장마산합포구 가포로 615-1193254000
23매립시설창원 덕산 매립장진해구 천자로 1011000994
34매립시설진주권 광역쓰레기매립장내동면 유수길 75번길 635854955
45매립시설명정(2차) 폐기물 매립장평인일주로 1074-302810950
56매립시설사등 폐기물매립장환경길 571040000
67매립시설진영 폐기물매립장김해대로 832-591557407
78매립시설일반폐기물매립장무안면 신생길 53-280432211
89매립시설하청 쓰레기매립장하청면 한내8길1033493
910매립시설유산 폐기물매립장유산공단10길 1113019890
연번시설구분시설명소재지용량용량 단위
7475재활용품선별장산청군생비량면 진산로 2192-778톤/일
7576재활용품선별장함양군함양읍 함양남서로 10215톤/일
7677재활용품선별장거창군거창읍 심소정길 139-1415톤/일
7778재활용품선별장합천군대양면 합천대로 2586-516톤/일
7879에너지화시설창원시(매립가스)마산합포구 가포로 615-119900kw/h
7980에너지화시설진주시(매립가스)내동면 유수길 75번길 63925kw/h
8081에너지화시설창원시(음폐수바이오가스)성산구 창곡로 108번길 8200톤/일
8182에너지화시설진주시(음폐수바이오가스)내동면 유수길 75번길 63110톤/일
8283에너지화시설김해시(음폐수바이오가스)김해대로 835번길 13-72100톤/일
8384에너지화시설사천시(음식물류)사천시 환경길 5540톤/일