Overview

Dataset statistics

Number of variables5
Number of observations113
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.7 KiB
Average record size in memory42.2 B

Variable types

Categorical1
Text2
Numeric1
DateTime1

Dataset

Description대구광역시 달성군의 대형폐기물 처리 수수로 정보로 폐기물 구분, 폐기물명, 폐기물 규격, 수수료 등의 데이터를 포함하고 있습니다.
Author대구광역시 달성군
URLhttps://www.data.go.kr/data/15093675/fileData.do

Alerts

데이터기준일자 has constant value ""Constant

Reproduction

Analysis started2023-12-11 23:19:02.074598
Analysis finished2023-12-11 23:19:02.544567
Duration0.47 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

폐기물 구분
Categorical

Distinct4
Distinct (%)3.5%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
생활용품
41 
가구류
35 
가전제품류
31 
기타류

Length

Max length5
Median length4
Mean length3.9115044
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row가전제품류
2nd row가전제품류
3rd row가전제품류
4th row가전제품류
5th row가전제품류

Common Values

ValueCountFrequency (%)
생활용품 41
36.3%
가구류 35
31.0%
가전제품류 31
27.4%
기타류 6
 
5.3%

Length

2023-12-12T08:19:02.619465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:19:02.736950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
생활용품 41
36.3%
가구류 35
31.0%
가전제품류 31
27.4%
기타류 6
 
5.3%
Distinct61
Distinct (%)54.0%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
2023-12-12T08:19:02.963492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length9
Mean length3.539823
Min length2

Characters and Unicode

Total characters400
Distinct characters125
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique26 ?
Unique (%)23.0%

Sample

1st rowT.V
2nd rowT.V
3rd row가스오븐렌지
4th row가습기
5th row공기청정기
ValueCountFrequency (%)
6
 
4.9%
서랍장 4
 
3.3%
침대 4
 
3.3%
피아노 3
 
2.5%
이불 3
 
2.5%
장식장 3
 
2.5%
거울·액자 3
 
2.5%
벽시계 3
 
2.5%
에어컨디셔너 3
 
2.5%
책장(책꽂이 3
 
2.5%
Other values (54) 87
71.3%
2023-12-12T08:19:03.317263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
26
 
6.5%
24
 
6.0%
12
 
3.0%
9
 
2.2%
9
 
2.2%
( 9
 
2.2%
) 9
 
2.2%
8
 
2.0%
8
 
2.0%
8
 
2.0%
Other values (115) 278
69.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 353
88.2%
Other Punctuation 10
 
2.5%
Uppercase Letter 10
 
2.5%
Space Separator 9
 
2.2%
Open Punctuation 9
 
2.2%
Close Punctuation 9
 
2.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
26
 
7.4%
24
 
6.8%
12
 
3.4%
9
 
2.5%
8
 
2.3%
8
 
2.3%
8
 
2.3%
8
 
2.3%
7
 
2.0%
6
 
1.7%
Other values (104) 237
67.1%
Uppercase Letter
ValueCountFrequency (%)
R 2
20.0%
P 2
20.0%
F 2
20.0%
T 2
20.0%
V 2
20.0%
Other Punctuation
ValueCountFrequency (%)
. 6
60.0%
· 3
30.0%
, 1
 
10.0%
Space Separator
ValueCountFrequency (%)
9
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 353
88.2%
Common 37
 
9.2%
Latin 10
 
2.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
26
 
7.4%
24
 
6.8%
12
 
3.4%
9
 
2.5%
8
 
2.3%
8
 
2.3%
8
 
2.3%
8
 
2.3%
7
 
2.0%
6
 
1.7%
Other values (104) 237
67.1%
Common
ValueCountFrequency (%)
9
24.3%
( 9
24.3%
) 9
24.3%
. 6
16.2%
· 3
 
8.1%
, 1
 
2.7%
Latin
ValueCountFrequency (%)
R 2
20.0%
P 2
20.0%
F 2
20.0%
T 2
20.0%
V 2
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 353
88.2%
ASCII 44
 
11.0%
None 3
 
0.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
26
 
7.4%
24
 
6.8%
12
 
3.4%
9
 
2.5%
8
 
2.3%
8
 
2.3%
8
 
2.3%
8
 
2.3%
7
 
2.0%
6
 
1.7%
Other values (104) 237
67.1%
ASCII
ValueCountFrequency (%)
9
20.5%
( 9
20.5%
) 9
20.5%
. 6
13.6%
R 2
 
4.5%
P 2
 
4.5%
F 2
 
4.5%
T 2
 
4.5%
V 2
 
4.5%
, 1
 
2.3%
None
ValueCountFrequency (%)
· 3
100.0%
Distinct94
Distinct (%)83.2%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
2023-12-12T08:19:03.563616image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length15
Mean length7.7079646
Min length2

Characters and Unicode

Total characters871
Distinct characters116
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique83 ?
Unique (%)73.5%

Sample

1st row42인치 미만
2nd row42인치 이상
3rd row높이 1m 이상
4th row모든 규격
5th row높이 1m 이상
ValueCountFrequency (%)
모든 12
 
6.7%
이상 10
 
5.6%
규격 9
 
5.0%
미만 7
 
3.9%
1쪽 4
 
2.2%
4
 
2.2%
높이 3
 
1.7%
이하 3
 
1.7%
종류 3
 
1.7%
120cm×180cm미만 2
 
1.1%
Other values (101) 123
68.3%
2023-12-12T08:19:03.900630image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
67
 
7.7%
0 59
 
6.8%
m 49
 
5.6%
1 48
 
5.5%
47
 
5.4%
c 42
 
4.8%
28
 
3.2%
22
 
2.5%
5 22
 
2.5%
21
 
2.4%
Other values (106) 466
53.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 410
47.1%
Decimal Number 195
22.4%
Lowercase Letter 108
 
12.4%
Space Separator 67
 
7.7%
Math Symbol 19
 
2.2%
Open Punctuation 18
 
2.1%
Close Punctuation 18
 
2.1%
Other Punctuation 18
 
2.1%
Other Symbol 17
 
2.0%
Uppercase Letter 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
47
 
11.5%
28
 
6.8%
22
 
5.4%
21
 
5.1%
19
 
4.6%
16
 
3.9%
15
 
3.7%
15
 
3.7%
15
 
3.7%
12
 
2.9%
Other values (83) 200
48.8%
Decimal Number
ValueCountFrequency (%)
0 59
30.3%
1 48
24.6%
5 22
 
11.3%
2 18
 
9.2%
6 11
 
5.6%
8 11
 
5.6%
9 9
 
4.6%
3 8
 
4.1%
4 6
 
3.1%
7 3
 
1.5%
Lowercase Letter
ValueCountFrequency (%)
m 49
45.4%
c 42
38.9%
g 9
 
8.3%
k 8
 
7.4%
Other Symbol
ValueCountFrequency (%)
11
64.7%
6
35.3%
Other Punctuation
ValueCountFrequency (%)
. 10
55.6%
, 8
44.4%
Space Separator
ValueCountFrequency (%)
67
100.0%
Math Symbol
ValueCountFrequency (%)
× 19
100.0%
Open Punctuation
ValueCountFrequency (%)
( 18
100.0%
Close Punctuation
ValueCountFrequency (%)
) 18
100.0%
Uppercase Letter
ValueCountFrequency (%)
K 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 410
47.1%
Common 352
40.4%
Latin 109
 
12.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
47
 
11.5%
28
 
6.8%
22
 
5.4%
21
 
5.1%
19
 
4.6%
16
 
3.9%
15
 
3.7%
15
 
3.7%
15
 
3.7%
12
 
2.9%
Other values (83) 200
48.8%
Common
ValueCountFrequency (%)
67
19.0%
0 59
16.8%
1 48
13.6%
5 22
 
6.2%
× 19
 
5.4%
2 18
 
5.1%
( 18
 
5.1%
) 18
 
5.1%
6 11
 
3.1%
11
 
3.1%
Other values (8) 61
17.3%
Latin
ValueCountFrequency (%)
m 49
45.0%
c 42
38.5%
g 9
 
8.3%
k 8
 
7.3%
K 1
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 425
48.8%
Hangul 410
47.1%
None 19
 
2.2%
CJK Compat 17
 
2.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
67
15.8%
0 59
13.9%
m 49
11.5%
1 48
11.3%
c 42
9.9%
5 22
 
5.2%
2 18
 
4.2%
( 18
 
4.2%
) 18
 
4.2%
6 11
 
2.6%
Other values (10) 73
17.2%
Hangul
ValueCountFrequency (%)
47
 
11.5%
28
 
6.8%
22
 
5.4%
21
 
5.1%
19
 
4.6%
16
 
3.9%
15
 
3.7%
15
 
3.7%
15
 
3.7%
12
 
2.9%
Other values (83) 200
48.8%
None
ValueCountFrequency (%)
× 19
100.0%
CJK Compat
ValueCountFrequency (%)
11
64.7%
6
35.3%

수수료
Real number (ℝ)

Distinct14
Distinct (%)12.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4610.6195
Minimum1000
Maximum20000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2023-12-12T08:19:03.994680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1000
5-th percentile1000
Q12000
median3000
Q36000
95-th percentile13000
Maximum20000
Range19000
Interquartile range (IQR)4000

Descriptive statistics

Standard deviation3808.9032
Coefficient of variation (CV)0.82611528
Kurtosis4.1820562
Mean4610.6195
Median Absolute Deviation (MAD)2000
Skewness1.9276083
Sum521000
Variance14507743
MonotonicityNot monotonic
2023-12-12T08:19:04.078073image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%)
2000 26
23.0%
3000 23
20.4%
5000 17
15.0%
1000 14
12.4%
6000 10
 
8.8%
7000 6
 
5.3%
10000 4
 
3.5%
9000 3
 
2.7%
8000 2
 
1.8%
15000 2
 
1.8%
Other values (4) 6
 
5.3%
ValueCountFrequency (%)
1000 14
12.4%
2000 26
23.0%
3000 23
20.4%
4000 1
 
0.9%
5000 17
15.0%
6000 10
 
8.8%
7000 6
 
5.3%
8000 2
 
1.8%
9000 3
 
2.7%
10000 4
 
3.5%
ValueCountFrequency (%)
20000 1
 
0.9%
18000 2
 
1.8%
15000 2
 
1.8%
13000 2
 
1.8%
10000 4
 
3.5%
9000 3
 
2.7%
8000 2
 
1.8%
7000 6
 
5.3%
6000 10
8.8%
5000 17
15.0%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
Minimum2022-10-28 00:00:00
Maximum2022-10-28 00:00:00
2023-12-12T08:19:04.156667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T08:19:04.236534image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-12T08:19:02.307671image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T08:19:04.297627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
폐기물 구분폐기물 명폐기물 규격수수료
폐기물 구분1.0001.0000.9540.276
폐기물 명1.0001.0000.0000.000
폐기물 규격0.9540.0001.0000.935
수수료0.2760.0000.9351.000
2023-12-12T08:19:04.384334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
수수료폐기물 구분
수수료1.0000.201
폐기물 구분0.2011.000

Missing values

2023-12-12T08:19:02.419294image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T08:19:02.511709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

폐기물 구분폐기물 명폐기물 규격수수료데이터기준일자
0가전제품류T.V42인치 미만30002022-10-28
1가전제품류T.V42인치 이상60002022-10-28
2가전제품류가스오븐렌지높이 1m 이상50002022-10-28
3가전제품류가습기모든 규격10002022-10-28
4가전제품류공기청정기높이 1m 이상30002022-10-28
5가전제품류기타소형가전모든제품10002022-10-28
6가전제품류난로전기난로20002022-10-28
7가전제품류난로석유난로30002022-10-28
8가전제품류냉장고500리터 이상90002022-10-28
9가전제품류냉장고300리터이상70002022-10-28
폐기물 구분폐기물 명폐기물 규격수수료데이터기준일자
103생활용품피아노그랜드180002022-10-28
104생활용품항아리대형 1개당 1㎥이상50002022-10-28
105생활용품항아리중형1개당 (0.5㎥이상 1㎥미만)20002022-10-28
106생활용품항아리소형1개당 0.5㎥ 미만10002022-10-28
107기타류마대20Kg당30002022-10-28
108기타류물탱크(F.R.P)1㎥ 이상100002022-10-28
109기타류물탱크(F.R.P)1㎥ 미만50002022-10-28
110기타류소화기20kg 이상180002022-10-28
111기타류소화기6.5kg 이하50002022-10-28
112기타류소화기3.5kg 이하30002022-10-28