Overview

Dataset statistics

Number of variables5
Number of observations193
Missing cells105
Missing cells (%)10.9%
Duplicate rows1
Duplicate rows (%)0.5%
Total size in memory7.9 KiB
Average record size in memory41.7 B

Variable types

Categorical1
Text2
Numeric1
DateTime1

Dataset

Description강원도 속초시 대형폐기물 처리 수수료 정보입니다. 폐기물 구분, 폐기물 명, 폐기물 규격, 수수료, 데이터 기준일자가 포함되어 있습니다.
Author강원도 속초시
URLhttps://www.data.go.kr/data/15093805/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
Dataset has 1 (0.5%) duplicate rowsDuplicates
폐기물 명 has 105 (54.4%) missing valuesMissing

Reproduction

Analysis started2023-12-11 23:55:41.510231
Analysis finished2023-12-11 23:55:41.896958
Duration0.39 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

폐기물 구분
Categorical

Distinct2
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
가구류
168 
가전제품
25 

Length

Max length4
Median length3
Mean length3.1295337
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row가구류
2nd row가구류
3rd row가구류
4th row가구류
5th row가구류

Common Values

ValueCountFrequency (%)
가구류 168
87.0%
가전제품 25
 
13.0%

Length

2023-12-12T08:55:41.964043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:55:42.117129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
가구류 168
87.0%
가전제품 25
 
13.0%

폐기물 명
Text

MISSING 

Distinct88
Distinct (%)100.0%
Missing105
Missing (%)54.4%
Memory size1.6 KiB
2023-12-12T08:55:42.358239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length9
Mean length5.2840909
Min length3

Characters and Unicode

Total characters465
Distinct characters155
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique88 ?
Unique (%)100.0%

Sample

1st row 가방
2nd row 가습기
3rd row 거 울
4th row 교 자 상
5th row 기타 폐기물
ValueCountFrequency (%)
8
 
5.3%
4
 
2.6%
3
 
2.0%
3
 
2.0%
3
 
2.0%
2
 
1.3%
받침대 2
 
1.3%
2
 
1.3%
오디오 2
 
1.3%
2
 
1.3%
Other values (118) 121
79.6%
2023-12-12T08:55:42.757392image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
149
32.0%
29
 
6.2%
11
 
2.4%
11
 
2.4%
9
 
1.9%
8
 
1.7%
7
 
1.5%
7
 
1.5%
7
 
1.5%
6
 
1.3%
Other values (145) 221
47.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 309
66.5%
Space Separator 149
32.0%
Other Punctuation 3
 
0.6%
Close Punctuation 2
 
0.4%
Open Punctuation 2
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
29
 
9.4%
11
 
3.6%
11
 
3.6%
9
 
2.9%
8
 
2.6%
7
 
2.3%
7
 
2.3%
7
 
2.3%
6
 
1.9%
5
 
1.6%
Other values (141) 209
67.6%
Space Separator
ValueCountFrequency (%)
149
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 308
66.2%
Common 156
33.5%
Han 1
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
29
 
9.4%
11
 
3.6%
11
 
3.6%
9
 
2.9%
8
 
2.6%
7
 
2.3%
7
 
2.3%
7
 
2.3%
6
 
1.9%
5
 
1.6%
Other values (140) 208
67.5%
Common
ValueCountFrequency (%)
149
95.5%
/ 3
 
1.9%
) 2
 
1.3%
( 2
 
1.3%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 308
66.2%
ASCII 156
33.5%
CJK 1
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
149
95.5%
/ 3
 
1.9%
) 2
 
1.3%
( 2
 
1.3%
Hangul
ValueCountFrequency (%)
29
 
9.4%
11
 
3.6%
11
 
3.6%
9
 
2.9%
8
 
2.6%
7
 
2.3%
7
 
2.3%
7
 
2.3%
6
 
1.9%
5
 
1.6%
Other values (140) 208
67.5%
CJK
ValueCountFrequency (%)
1
100.0%
Distinct144
Distinct (%)74.6%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2023-12-12T08:55:43.042516image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length16
Mean length8.4300518
Min length3

Characters and Unicode

Total characters1627
Distinct characters116
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique121 ?
Unique (%)62.7%

Sample

1st row 대형(높이 60cm 이상)
2nd row 중형(높이 60cm 미만)
3rd row 소형(높이 30cm 미만)
4th row 모든규격
5th row1㎡ 이상
ValueCountFrequency (%)
미만 55
 
14.7%
이상 31
 
8.3%
모든규격 23
 
6.2%
1m 22
 
5.9%
1.5m 15
 
4.0%
길이 11
 
2.9%
소형(높이 7
 
1.9%
대형(높이 7
 
1.9%
초과 6
 
1.6%
중형(높이 6
 
1.6%
Other values (115) 190
50.9%
2023-12-12T08:55:43.458857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
372
22.9%
88
 
5.4%
1 75
 
4.6%
64
 
3.9%
64
 
3.9%
m 62
 
3.8%
) 52
 
3.2%
( 52
 
3.2%
49
 
3.0%
0 48
 
3.0%
Other values (106) 701
43.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 753
46.3%
Space Separator 372
22.9%
Decimal Number 222
 
13.6%
Lowercase Letter 111
 
6.8%
Close Punctuation 52
 
3.2%
Open Punctuation 52
 
3.2%
Other Punctuation 35
 
2.2%
Other Symbol 23
 
1.4%
Uppercase Letter 6
 
0.4%
Math Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
88
 
11.7%
64
 
8.5%
64
 
8.5%
49
 
6.5%
44
 
5.8%
28
 
3.7%
28
 
3.7%
27
 
3.6%
27
 
3.6%
27
 
3.6%
Other values (79) 307
40.8%
Decimal Number
ValueCountFrequency (%)
1 75
33.8%
0 48
21.6%
5 38
17.1%
6 17
 
7.7%
2 12
 
5.4%
3 12
 
5.4%
4 10
 
4.5%
9 8
 
3.6%
8 2
 
0.9%
Lowercase Letter
ValueCountFrequency (%)
m 62
55.9%
c 22
 
19.8%
k 11
 
9.9%
g 11
 
9.9%
5
 
4.5%
Other Punctuation
ValueCountFrequency (%)
. 23
65.7%
* 6
 
17.1%
, 4
 
11.4%
/ 2
 
5.7%
Uppercase Letter
ValueCountFrequency (%)
P 2
33.3%
R 2
33.3%
F 2
33.3%
Other Symbol
ValueCountFrequency (%)
22
95.7%
1
 
4.3%
Space Separator
ValueCountFrequency (%)
372
100.0%
Close Punctuation
ValueCountFrequency (%)
) 52
100.0%
Open Punctuation
ValueCountFrequency (%)
( 52
100.0%
Math Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 762
46.8%
Hangul 753
46.3%
Latin 112
 
6.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
88
 
11.7%
64
 
8.5%
64
 
8.5%
49
 
6.5%
44
 
5.8%
28
 
3.7%
28
 
3.7%
27
 
3.6%
27
 
3.6%
27
 
3.6%
Other values (79) 307
40.8%
Common
ValueCountFrequency (%)
372
48.8%
1 75
 
9.8%
) 52
 
6.8%
( 52
 
6.8%
0 48
 
6.3%
5 38
 
5.0%
. 23
 
3.0%
22
 
2.9%
6 17
 
2.2%
2 12
 
1.6%
Other values (10) 51
 
6.7%
Latin
ValueCountFrequency (%)
m 62
55.4%
c 22
 
19.6%
k 11
 
9.8%
g 11
 
9.8%
P 2
 
1.8%
R 2
 
1.8%
F 2
 
1.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 845
51.9%
Hangul 753
46.3%
CJK Compat 23
 
1.4%
Letterlike Symbols 5
 
0.3%
Math Operators 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
372
44.0%
1 75
 
8.9%
m 62
 
7.3%
) 52
 
6.2%
( 52
 
6.2%
0 48
 
5.7%
5 38
 
4.5%
. 23
 
2.7%
c 22
 
2.6%
6 17
 
2.0%
Other values (13) 84
 
9.9%
Hangul
ValueCountFrequency (%)
88
 
11.7%
64
 
8.5%
64
 
8.5%
49
 
6.5%
44
 
5.8%
28
 
3.7%
28
 
3.7%
27
 
3.6%
27
 
3.6%
27
 
3.6%
Other values (79) 307
40.8%
CJK Compat
ValueCountFrequency (%)
22
95.7%
1
 
4.3%
Letterlike Symbols
ValueCountFrequency (%)
5
100.0%
Math Operators
ValueCountFrequency (%)
1
100.0%

수수료
Real number (ℝ)

Distinct20
Distinct (%)10.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6155.4404
Minimum1000
Maximum50000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.8 KiB
2023-12-12T08:55:43.575018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1000
5-th percentile1000
Q13000
median4000
Q38000
95-th percentile15000
Maximum50000
Range49000
Interquartile range (IQR)5000

Descriptive statistics

Standard deviation6165.8235
Coefficient of variation (CV)1.0016868
Kurtosis15.755267
Mean6155.4404
Median Absolute Deviation (MAD)2000
Skewness3.2755574
Sum1188000
Variance38017379
MonotonicityNot monotonic
2023-12-12T08:55:43.697568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=20)
ValueCountFrequency (%)
3000 44
22.8%
5000 23
11.9%
2000 22
11.4%
4000 20
10.4%
1000 17
 
8.8%
8000 15
 
7.8%
10000 12
 
6.2%
6000 7
 
3.6%
7000 6
 
3.1%
15000 5
 
2.6%
Other values (10) 22
11.4%
ValueCountFrequency (%)
1000 17
 
8.8%
2000 22
11.4%
3000 44
22.8%
4000 20
10.4%
5000 23
11.9%
6000 7
 
3.6%
7000 6
 
3.1%
8000 15
 
7.8%
9000 2
 
1.0%
10000 12
 
6.2%
ValueCountFrequency (%)
50000 1
 
0.5%
30000 3
1.6%
25000 1
 
0.5%
23000 1
 
0.5%
20000 3
1.6%
15000 5
2.6%
14000 2
 
1.0%
13000 2
 
1.0%
12000 4
2.1%
11000 3
1.6%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
Minimum2021-10-28 00:00:00
Maximum2021-10-28 00:00:00
2023-12-12T08:55:43.810990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T08:55:43.888020image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-12T08:55:41.704726image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T08:55:43.950481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
폐기물 구분폐기물 명수수료
폐기물 구분1.0001.0000.214
폐기물 명1.0001.0001.000
수수료0.2141.0001.000
2023-12-12T08:55:44.050475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
수수료폐기물 구분
수수료1.0000.251
폐기물 구분0.2511.000

Missing values

2023-12-12T08:55:41.790792image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T08:55:41.864166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

폐기물 구분폐기물 명폐기물 규격수수료데이터기준일자
0가구류가방대형(높이 60cm 이상)30002021-10-28
1가구류<NA>중형(높이 60cm 미만)20002021-10-28
2가구류<NA>소형(높이 30cm 미만)10002021-10-28
3가구류가습기모든규격20002021-10-28
4가구류거 울1㎡ 이상30002021-10-28
5가구류<NA>1㎡ 미만20002021-10-28
6가구류교 자 상4인 이상40002021-10-28
7가구류<NA>4인 미만30002021-10-28
8가구류<NA>3인 미만20002021-10-28
9가구류기타 폐기물0.5㎡ 미만30002021-10-28
폐기물 구분폐기물 명폐기물 규격수수료데이터기준일자
183가전제품<NA>40인치 미만80002021-10-28
184가전제품<NA>30인치 미만50002021-10-28
185가전제품<NA>20인치 미만30002021-10-28
186가전제품세탁기10kg 이상(드럼)80002021-10-28
187가전제품<NA>10kg 미만50002021-10-28
188가전제품<NA>5kg 미만40002021-10-28
189가전제품캔음료 자판기모든규격250002021-10-28
190가전제품온 풍 기264㎡형 이상80002021-10-28
191가전제품<NA>66㎡형 이상50002021-10-28
192가전제품<NA>66㎡형 미만40002021-10-28

Duplicate rows

Most frequently occurring

폐기물 구분폐기물 명폐기물 규격수수료데이터기준일자# duplicates
0가구류<NA>초과 1kg 당10002021-10-283