Overview

Dataset statistics

Number of variables5
Number of observations220
Missing cells1
Missing cells (%)0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.9 KiB
Average record size in memory41.6 B

Variable types

Categorical1
Text2
Numeric1
DateTime1

Dataset

Description경기도 오산시 대형폐기물 처리 수수료 정보에 대한 폐기물구분, 폐기물명, 폐기물규격, 수수료 항목을 제공합니다.
Author경기도 오산시
URLhttps://www.data.go.kr/data/15097803/fileData.do

Alerts

데이터기준일자 has constant value ""Constant

Reproduction

Analysis started2024-03-15 01:26:14.909448
Analysis finished2024-03-15 01:26:16.179592
Duration1.27 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

폐기물구분
Categorical

Distinct4
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
기타
135 
가구
56 
가전제품
28 
<NA>
 
1

Length

Max length4
Median length2
Mean length2.2636364
Min length2

Unique

Unique1 ?
Unique (%)0.5%

Sample

1st row기타
2nd row기타
3rd row기타
4th row기타
5th row기타

Common Values

ValueCountFrequency (%)
기타 135
61.4%
가구 56
25.5%
가전제품 28
 
12.7%
<NA> 1
 
0.5%

Length

2024-03-15T10:26:16.401822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T10:26:16.760483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
기타 135
61.4%
가구 56
25.5%
가전제품 28
 
12.7%
na 1
 
0.5%
Distinct141
Distinct (%)64.4%
Missing1
Missing (%)0.5%
Memory size1.8 KiB
2024-03-15T10:26:18.059543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length10
Mean length3.4429224
Min length2

Characters and Unicode

Total characters754
Distinct characters215
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique90 ?
Unique (%)41.1%

Sample

1st row가방
2nd row가방
3rd row가방
4th row가방
5th row가스(오븐)레인지
ValueCountFrequency (%)
침대 8
 
3.4%
소파 5
 
2.1%
가방 4
 
1.7%
탁자 4
 
1.7%
문갑 4
 
1.7%
식탁 4
 
1.7%
의자 3
 
1.3%
수족관 3
 
1.3%
냉장고 3
 
1.3%
시계 3
 
1.3%
Other values (141) 193
82.5%
2024-03-15T10:26:19.667337image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
39
 
5.2%
26
 
3.4%
22
 
2.9%
18
 
2.4%
15
 
2.0%
14
 
1.9%
13
 
1.7%
( 12
 
1.6%
) 12
 
1.6%
12
 
1.6%
Other values (205) 571
75.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 702
93.1%
Space Separator 15
 
2.0%
Open Punctuation 12
 
1.6%
Close Punctuation 12
 
1.6%
Other Punctuation 9
 
1.2%
Uppercase Letter 4
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
39
 
5.6%
26
 
3.7%
22
 
3.1%
18
 
2.6%
14
 
2.0%
13
 
1.9%
12
 
1.7%
12
 
1.7%
12
 
1.7%
10
 
1.4%
Other values (198) 524
74.6%
Other Punctuation
ValueCountFrequency (%)
, 8
88.9%
. 1
 
11.1%
Uppercase Letter
ValueCountFrequency (%)
T 2
50.0%
V 2
50.0%
Space Separator
ValueCountFrequency (%)
15
100.0%
Open Punctuation
ValueCountFrequency (%)
( 12
100.0%
Close Punctuation
ValueCountFrequency (%)
) 12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 702
93.1%
Common 48
 
6.4%
Latin 4
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
39
 
5.6%
26
 
3.7%
22
 
3.1%
18
 
2.6%
14
 
2.0%
13
 
1.9%
12
 
1.7%
12
 
1.7%
12
 
1.7%
10
 
1.4%
Other values (198) 524
74.6%
Common
ValueCountFrequency (%)
15
31.2%
( 12
25.0%
) 12
25.0%
, 8
16.7%
. 1
 
2.1%
Latin
ValueCountFrequency (%)
T 2
50.0%
V 2
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 702
93.1%
ASCII 52
 
6.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
39
 
5.6%
26
 
3.7%
22
 
3.1%
18
 
2.6%
14
 
2.0%
13
 
1.9%
12
 
1.7%
12
 
1.7%
12
 
1.7%
10
 
1.4%
Other values (198) 524
74.6%
ASCII
ValueCountFrequency (%)
15
28.8%
( 12
23.1%
) 12
23.1%
, 8
15.4%
T 2
 
3.8%
V 2
 
3.8%
. 1
 
1.9%
Distinct133
Distinct (%)60.5%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
2024-03-15T10:26:21.022577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length96
Median length20
Mean length7.55
Min length2

Characters and Unicode

Total characters1661
Distinct characters172
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique113 ?
Unique (%)51.4%

Sample

1st row대형 여행용(캐리어)
2nd row중형 여행용(캐리어)
3rd row소형 여행용(캐리어)
4th row골프용, 낚시용
5th row높이 또는 폭 1m이상
ValueCountFrequency (%)
규격 48
 
10.1%
모든 47
 
9.9%
이상 41
 
8.6%
미만 27
 
5.7%
높이 25
 
5.3%
1m이상 16
 
3.4%
또는 14
 
3.0%
1m 9
 
1.9%
8
 
1.7%
1㎡ 8
 
1.7%
Other values (156) 231
48.7%
2024-03-15T10:26:22.838715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
255
 
15.4%
115
 
6.9%
1 74
 
4.5%
65
 
3.9%
m 63
 
3.8%
52
 
3.1%
52
 
3.1%
51
 
3.1%
51
 
3.1%
( 49
 
3.0%
Other values (162) 834
50.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 982
59.1%
Space Separator 256
 
15.4%
Decimal Number 181
 
10.9%
Lowercase Letter 84
 
5.1%
Open Punctuation 49
 
3.0%
Close Punctuation 49
 
3.0%
Other Punctuation 33
 
2.0%
Other Symbol 14
 
0.8%
Uppercase Letter 10
 
0.6%
Math Symbol 3
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
115
 
11.7%
65
 
6.6%
52
 
5.3%
52
 
5.3%
51
 
5.2%
51
 
5.2%
45
 
4.6%
33
 
3.4%
32
 
3.3%
31
 
3.2%
Other values (132) 455
46.3%
Decimal Number
ValueCountFrequency (%)
1 74
40.9%
0 32
17.7%
2 22
 
12.2%
6 16
 
8.8%
5 14
 
7.7%
3 11
 
6.1%
4 9
 
5.0%
8 2
 
1.1%
9 1
 
0.6%
Uppercase Letter
ValueCountFrequency (%)
D 2
20.0%
V 2
20.0%
X 2
20.0%
L 2
20.0%
R 1
10.0%
T 1
10.0%
Lowercase Letter
ValueCountFrequency (%)
m 63
75.0%
c 13
 
15.5%
g 3
 
3.6%
k 3
 
3.6%
x 2
 
2.4%
Other Punctuation
ValueCountFrequency (%)
, 21
63.6%
. 10
30.3%
/ 2
 
6.1%
Space Separator
ValueCountFrequency (%)
255
99.6%
  1
 
0.4%
Math Symbol
ValueCountFrequency (%)
~ 2
66.7%
+ 1
33.3%
Open Punctuation
ValueCountFrequency (%)
( 49
100.0%
Close Punctuation
ValueCountFrequency (%)
) 49
100.0%
Other Symbol
ValueCountFrequency (%)
14
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 982
59.1%
Common 585
35.2%
Latin 94
 
5.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
115
 
11.7%
65
 
6.6%
52
 
5.3%
52
 
5.3%
51
 
5.2%
51
 
5.2%
45
 
4.6%
33
 
3.4%
32
 
3.3%
31
 
3.2%
Other values (132) 455
46.3%
Common
ValueCountFrequency (%)
255
43.6%
1 74
 
12.6%
( 49
 
8.4%
) 49
 
8.4%
0 32
 
5.5%
2 22
 
3.8%
, 21
 
3.6%
6 16
 
2.7%
5 14
 
2.4%
14
 
2.4%
Other values (9) 39
 
6.7%
Latin
ValueCountFrequency (%)
m 63
67.0%
c 13
 
13.8%
g 3
 
3.2%
k 3
 
3.2%
D 2
 
2.1%
V 2
 
2.1%
X 2
 
2.1%
x 2
 
2.1%
L 2
 
2.1%
R 1
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 982
59.1%
ASCII 664
40.0%
CJK Compat 14
 
0.8%
None 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
255
38.4%
1 74
 
11.1%
m 63
 
9.5%
( 49
 
7.4%
) 49
 
7.4%
0 32
 
4.8%
2 22
 
3.3%
, 21
 
3.2%
6 16
 
2.4%
5 14
 
2.1%
Other values (18) 69
 
10.4%
Hangul
ValueCountFrequency (%)
115
 
11.7%
65
 
6.6%
52
 
5.3%
52
 
5.3%
51
 
5.2%
51
 
5.2%
45
 
4.6%
33
 
3.4%
32
 
3.3%
31
 
3.2%
Other values (132) 455
46.3%
CJK Compat
ValueCountFrequency (%)
14
100.0%
None
ValueCountFrequency (%)
  1
100.0%

수수료
Real number (ℝ)

Distinct16
Distinct (%)7.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4213.6364
Minimum0
Maximum30000
Zeros1
Zeros (%)0.5%
Negative0
Negative (%)0.0%
Memory size2.1 KiB
2024-03-15T10:26:23.232904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1000
Q12000
median3000
Q35000
95-th percentile12000
Maximum30000
Range30000
Interquartile range (IQR)3000

Descriptive statistics

Standard deviation3789.5308
Coefficient of variation (CV)0.89934928
Kurtosis12.238671
Mean4213.6364
Median Absolute Deviation (MAD)1000
Skewness2.9353489
Sum927000
Variance14360544
MonotonicityNot monotonic
2024-03-15T10:26:23.605266image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%)
2000 54
24.5%
3000 46
20.9%
5000 28
12.7%
1000 26
11.8%
4000 26
11.8%
7000 7
 
3.2%
6000 7
 
3.2%
10000 6
 
2.7%
15000 5
 
2.3%
8000 5
 
2.3%
Other values (6) 10
 
4.5%
ValueCountFrequency (%)
0 1
 
0.5%
1000 26
11.8%
2000 54
24.5%
3000 46
20.9%
4000 26
11.8%
5000 28
12.7%
6000 7
 
3.2%
7000 7
 
3.2%
8000 5
 
2.3%
10000 6
 
2.7%
ValueCountFrequency (%)
30000 1
 
0.5%
21000 1
 
0.5%
16000 3
1.4%
15000 5
2.3%
12000 2
 
0.9%
11000 2
 
0.9%
10000 6
2.7%
8000 5
2.3%
7000 7
3.2%
6000 7
3.2%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
Minimum2023-12-31 00:00:00
Maximum2023-12-31 00:00:00
2024-03-15T10:26:23.939170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T10:26:24.244802image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2024-03-15T10:26:15.394736image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T10:26:24.456109image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
폐기물구분수수료
폐기물구분1.0000.347
수수료0.3471.000
2024-03-15T10:26:24.682049image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
수수료폐기물구분
수수료1.0000.232
폐기물구분0.2321.000

Missing values

2024-03-15T10:26:15.729736image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T10:26:16.065732image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

폐기물구분폐기물명폐기물규격수수료데이터기준일자
0기타가방대형 여행용(캐리어)30002023-12-31
1기타가방중형 여행용(캐리어)20002023-12-31
2기타가방소형 여행용(캐리어)10002023-12-31
3기타가방골프용, 낚시용30002023-12-31
4기타가스(오븐)레인지높이 또는 폭 1m이상40002023-12-31
5기타가스대(선반)1쪽(폭 60cm 이하 )20002023-12-31
6기타간판0.6m x 1.8m 이상70002023-12-31
7기타간판0.6m x 1.8m 미만40002023-12-31
8기타간판입간판20002023-12-31
9기타개수대1쪽30002023-12-31
폐기물구분폐기물명폐기물규격수수료데이터기준일자
210가구화장대폭 120cm 이상40002023-12-31
211가구화장대폭 120cm 미만30002023-12-31
212기타화환모든 규격30002023-12-31
213기타환풍기모든 규격20002023-12-31
214기타휠체어모든 규격30002023-12-31
215가전제품TV높이 또는 너비 1m이상50002023-12-31
216가전제품TV.오디오케이스모든 규격30002023-12-31
217기타가정배출소량폐기물20kg당20002023-12-31
218기타기 타커피머신, 스피커, 팩스, 가습기, 카세트라디오, 다리미, 선풍기, 비데, VTR/DVD, 전화기, 족욕기, 헤어드라이기, 전기히터, 전자레인지, 전기밥솥 등 (1m 미만)02023-12-31
219가전제품(소형 가전제품)그 밖에 이에 준하는 소형 가전제품(1m 이상)20002023-12-31