Overview

Dataset statistics

Number of variables5
Number of observations47
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)2.1%
Total size in memory2.0 KiB
Average record size in memory43.8 B

Variable types

Categorical2
Text2
Numeric1

Dataset

Description경상북도 봉화군 폐기물관리조례 [별표1]에 규정된 대형폐기물 품명, 규격, 처리수수료에 관한 데이터 자료 입니다.
Author경상북도 봉화군
URLhttps://www.data.go.kr/data/15094090/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
Dataset has 1 (2.1%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 05:25:16.036992
Analysis finished2023-12-12 05:25:16.461773
Duration0.42 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

폐기물 구분
Categorical

Distinct3
Distinct (%)6.4%
Missing0
Missing (%)0.0%
Memory size508.0 B
가구류
22 
가전제품
20 
기타

Length

Max length4
Median length3
Mean length3.3191489
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row가전제품
2nd row가전제품
3rd row가전제품
4th row가전제품
5th row가전제품

Common Values

ValueCountFrequency (%)
가구류 22
46.8%
가전제품 20
42.6%
기타 5
 
10.6%

Length

2023-12-12T14:25:16.534944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:25:16.655144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
가구류 22
46.8%
가전제품 20
42.6%
기타 5
 
10.6%
Distinct26
Distinct (%)55.3%
Missing0
Missing (%)0.0%
Memory size508.0 B
2023-12-12T14:25:16.828320image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length7
Mean length3.8085106
Min length2

Characters and Unicode

Total characters179
Distinct characters72
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9 ?
Unique (%)19.1%

Sample

1st row냉장고
2nd row냉장고
3rd row냉장고
4th row텔레비전
5th row텔레비전
ValueCountFrequency (%)
냉장고 3
 
5.5%
소파 3
 
5.5%
에어컨 3
 
5.5%
컴퓨터 3
 
5.5%
침대 2
 
3.6%
텔레비전 2
 
3.6%
소화기 2
 
3.6%
피아노 2
 
3.6%
식탁 2
 
3.6%
서랍장 2
 
3.6%
Other values (24) 31
56.4%
2023-12-12T14:25:17.143923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
14
 
7.8%
11
 
6.1%
8
 
4.5%
6
 
3.4%
5
 
2.8%
5
 
2.8%
4
 
2.2%
4
 
2.2%
4
 
2.2%
4
 
2.2%
Other values (62) 114
63.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 165
92.2%
Space Separator 8
 
4.5%
Other Punctuation 4
 
2.2%
Open Punctuation 1
 
0.6%
Close Punctuation 1
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
14
 
8.5%
11
 
6.7%
6
 
3.6%
5
 
3.0%
5
 
3.0%
4
 
2.4%
4
 
2.4%
4
 
2.4%
4
 
2.4%
4
 
2.4%
Other values (57) 104
63.0%
Other Punctuation
ValueCountFrequency (%)
, 3
75.0%
. 1
 
25.0%
Space Separator
ValueCountFrequency (%)
8
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 165
92.2%
Common 14
 
7.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
14
 
8.5%
11
 
6.7%
6
 
3.6%
5
 
3.0%
5
 
3.0%
4
 
2.4%
4
 
2.4%
4
 
2.4%
4
 
2.4%
4
 
2.4%
Other values (57) 104
63.0%
Common
ValueCountFrequency (%)
8
57.1%
, 3
 
21.4%
. 1
 
7.1%
( 1
 
7.1%
) 1
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 165
92.2%
ASCII 14
 
7.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
14
 
8.5%
11
 
6.7%
6
 
3.6%
5
 
3.0%
5
 
3.0%
4
 
2.4%
4
 
2.4%
4
 
2.4%
4
 
2.4%
4
 
2.4%
Other values (57) 104
63.0%
ASCII
ValueCountFrequency (%)
8
57.1%
, 3
 
21.4%
. 1
 
7.1%
( 1
 
7.1%
) 1
 
7.1%
Distinct35
Distinct (%)74.5%
Missing0
Missing (%)0.0%
Memory size508.0 B
2023-12-12T14:25:17.368931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length9
Mean length5.9787234
Min length2

Characters and Unicode

Total characters281
Distinct characters54
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique32 ?
Unique (%)68.1%

Sample

1st row500ℓ 이상
2nd row300ℓ 이상
3rd row300ℓ 미만
4th row25형(63.5㎝) 이상
5th row25형(63.5㎝) 미만
ValueCountFrequency (%)
모든 11
 
12.0%
이상 11
 
12.0%
규격 11
 
12.0%
미만 8
 
8.7%
1m 6
 
6.5%
2인용 3
 
3.3%
1인용 3
 
3.3%
2
 
2.2%
3.3kg 2
 
2.2%
6인용 2
 
2.2%
Other values (24) 33
35.9%
2023-12-12T14:25:18.013229image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
47
 
16.7%
17
 
6.0%
11
 
3.9%
11
 
3.9%
11
 
3.9%
11
 
3.9%
11
 
3.9%
10
 
3.6%
10
 
3.6%
1 10
 
3.6%
Other values (44) 132
47.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 148
52.7%
Decimal Number 53
 
18.9%
Space Separator 47
 
16.7%
Lowercase Letter 17
 
6.0%
Other Punctuation 6
 
2.1%
Other Symbol 5
 
1.8%
Open Punctuation 2
 
0.7%
Close Punctuation 2
 
0.7%
Math Symbol 1
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
17
11.5%
11
 
7.4%
11
 
7.4%
11
 
7.4%
11
 
7.4%
11
 
7.4%
10
 
6.8%
10
 
6.8%
9
 
6.1%
8
 
5.4%
Other values (23) 39
26.4%
Decimal Number
ValueCountFrequency (%)
1 10
18.9%
6 9
17.0%
3 9
17.0%
0 8
15.1%
2 7
13.2%
5 5
9.4%
4 4
 
7.5%
9 1
 
1.9%
Lowercase Letter
ValueCountFrequency (%)
m 8
47.1%
3
 
17.6%
g 2
 
11.8%
k 2
 
11.8%
c 2
 
11.8%
Other Punctuation
ValueCountFrequency (%)
. 4
66.7%
, 2
33.3%
Other Symbol
ValueCountFrequency (%)
3
60.0%
2
40.0%
Space Separator
ValueCountFrequency (%)
47
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 148
52.7%
Common 119
42.3%
Latin 14
 
5.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
17
11.5%
11
 
7.4%
11
 
7.4%
11
 
7.4%
11
 
7.4%
11
 
7.4%
10
 
6.8%
10
 
6.8%
9
 
6.1%
8
 
5.4%
Other values (23) 39
26.4%
Common
ValueCountFrequency (%)
47
39.5%
1 10
 
8.4%
6 9
 
7.6%
3 9
 
7.6%
0 8
 
6.7%
2 7
 
5.9%
5 5
 
4.2%
. 4
 
3.4%
4 4
 
3.4%
3
 
2.5%
Other values (7) 13
 
10.9%
Latin
ValueCountFrequency (%)
m 8
57.1%
g 2
 
14.3%
k 2
 
14.3%
c 2
 
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 148
52.7%
ASCII 125
44.5%
CJK Compat 5
 
1.8%
Letterlike Symbols 3
 
1.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
47
37.6%
1 10
 
8.0%
6 9
 
7.2%
3 9
 
7.2%
m 8
 
6.4%
0 8
 
6.4%
2 7
 
5.6%
5 5
 
4.0%
. 4
 
3.2%
4 4
 
3.2%
Other values (8) 14
 
11.2%
Hangul
ValueCountFrequency (%)
17
11.5%
11
 
7.4%
11
 
7.4%
11
 
7.4%
11
 
7.4%
11
 
7.4%
10
 
6.8%
10
 
6.8%
9
 
6.1%
8
 
5.4%
Other values (23) 39
26.4%
Letterlike Symbols
ValueCountFrequency (%)
3
100.0%
CJK Compat
ValueCountFrequency (%)
3
60.0%
2
40.0%

수수료
Real number (ℝ)

Distinct9
Distinct (%)19.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4957.4468
Minimum1000
Maximum15000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size555.0 B
2023-12-12T14:25:18.245246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1000
5-th percentile2000
Q12000
median4000
Q35500
95-th percentile13500
Maximum15000
Range14000
Interquartile range (IQR)3500

Descriptive statistics

Standard deviation3519.8657
Coefficient of variation (CV)0.71001582
Kurtosis2.2896805
Mean4957.4468
Median Absolute Deviation (MAD)2000
Skewness1.6206704
Sum233000
Variance12389454
MonotonicityNot monotonic
2023-12-12T14:25:18.525036image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
2000 12
25.5%
4000 9
19.1%
5000 7
14.9%
3000 6
12.8%
8000 4
 
8.5%
15000 3
 
6.4%
10000 3
 
6.4%
6000 2
 
4.3%
1000 1
 
2.1%
ValueCountFrequency (%)
1000 1
 
2.1%
2000 12
25.5%
3000 6
12.8%
4000 9
19.1%
5000 7
14.9%
6000 2
 
4.3%
8000 4
 
8.5%
10000 3
 
6.4%
15000 3
 
6.4%
ValueCountFrequency (%)
15000 3
 
6.4%
10000 3
 
6.4%
8000 4
 
8.5%
6000 2
 
4.3%
5000 7
14.9%
4000 9
19.1%
3000 6
12.8%
2000 12
25.5%
1000 1
 
2.1%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size508.0 B
2023-11-01
47 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-11-01
2nd row2023-11-01
3rd row2023-11-01
4th row2023-11-01
5th row2023-11-01

Common Values

ValueCountFrequency (%)
2023-11-01 47
100.0%

Length

2023-12-12T14:25:18.824773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:25:19.034210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-11-01 47
100.0%

Interactions

2023-12-12T14:25:16.216467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T14:25:19.164469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
폐기물 구분폐기물 명폐기물 규격수수료
폐기물 구분1.0001.0000.9270.000
폐기물 명1.0001.0000.0000.000
폐기물 규격0.9270.0001.0000.907
수수료0.0000.0000.9071.000
2023-12-12T14:25:19.368798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
수수료폐기물 구분
수수료1.0000.067
폐기물 구분0.0671.000

Missing values

2023-12-12T14:25:16.320875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T14:25:16.422196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

폐기물 구분폐기물 명폐기물 규격수수료데이터기준일자
0가전제품냉장고500ℓ 이상80002023-11-01
1가전제품냉장고300ℓ 이상60002023-11-01
2가전제품냉장고300ℓ 미만40002023-11-01
3가전제품텔레비전25형(63.5㎝) 이상50002023-11-01
4가전제품텔레비전25형(63.5㎝) 미만30002023-11-01
5가전제품세탁기모든 규격40002023-11-01
6가전제품청소기모든 규격20002023-11-01
7가전제품에어컨264㎡형 이상80002023-11-01
8가전제품에어컨66㎡형 이상50002023-11-01
9가전제품에어컨66㎡형 미만30002023-11-01
폐기물 구분폐기물 명폐기물 규격수수료데이터기준일자
37가구류책장1m 미만20002023-11-01
38가구류캐비닛모든 규격40002023-11-01
39가구류탁자모든 규격30002023-11-01
40가구류식탁6인용 이상50002023-11-01
41가구류식탁6인용 미만40002023-11-01
42기타피아노업라이트100002023-11-01
43기타피아노그랜드150002023-11-01
44기타소화기3.3kg 이하30002023-11-01
45기타소화기3.3kg 초과50002023-11-01
46기타기타(자전거, 선풍기, 싱크대. 쌀통 및 이와 유사한 물건)개 당20002023-11-01

Duplicate rows

Most frequently occurring

폐기물 구분폐기물 명폐기물 규격수수료데이터기준일자# duplicates
0가전제품컴퓨터모든 규격20002023-11-012