Overview

Dataset statistics

Number of variables7
Number of observations139
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.1 KiB
Average record size in memory59.9 B

Variable types

Categorical1
Text2
Numeric3
DateTime1

Dataset

Description제주특별자치도 제주시의 품목 및 규격별 대형폐기물 수수료(운반비, 처리비) 현황
Author제주특별자치도 제주시
URLhttps://www.data.go.kr/data/15071215/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
수수료 is highly overall correlated with 수집운반비 and 1 other fieldsHigh correlation
수집운반비 is highly overall correlated with 수수료 and 1 other fieldsHigh correlation
처리비 is highly overall correlated with 수수료 and 1 other fieldsHigh correlation

Reproduction

Analysis started2023-12-12 14:41:51.286501
Analysis finished2023-12-12 14:41:52.728713
Duration1.44 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

종류
Categorical

Distinct3
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
가전류
54 
기타류
44 
가구류
41 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row가전류
2nd row기타류
3rd row가전류
4th row가전류
5th row가전류

Common Values

ValueCountFrequency (%)
가전류 54
38.8%
기타류 44
31.7%
가구류 41
29.5%

Length

2023-12-12T23:41:52.791291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:41:52.892673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
가전류 54
38.8%
기타류 44
31.7%
가구류 41
29.5%

품목
Text

Distinct80
Distinct (%)57.6%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-12T23:41:53.143323image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length14
Mean length5.8129496
Min length2

Characters and Unicode

Total characters808
Distinct characters181
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique32 ?
Unique (%)23.0%

Sample

1st rowVTR_DVD(스피커별도)
2nd row가방
3rd row가스렌지
4th row가스렌지
5th row가스오븐렌지
ValueCountFrequency (%)
에어컨디셔너 5
 
3.5%
냉장고 4
 
2.8%
타이어(휠제외 3
 
2.1%
싱크대(선반별도 3
 
2.1%
수족관 3
 
2.1%
의자 3
 
2.1%
기타가전제품및사무용품 3
 
2.1%
실내기 3
 
2.1%
보일러통 3
 
2.1%
프린터 3
 
2.1%
Other values (71) 111
77.1%
2023-12-12T23:41:53.560590image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
+ 40
 
5.0%
35
 
4.3%
27
 
3.3%
22
 
2.7%
( 20
 
2.5%
) 20
 
2.5%
16
 
2.0%
14
 
1.7%
14
 
1.7%
13
 
1.6%
Other values (171) 587
72.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 716
88.6%
Math Symbol 40
 
5.0%
Open Punctuation 20
 
2.5%
Close Punctuation 20
 
2.5%
Uppercase Letter 6
 
0.7%
Space Separator 5
 
0.6%
Connector Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
35
 
4.9%
27
 
3.8%
22
 
3.1%
16
 
2.2%
14
 
2.0%
14
 
2.0%
13
 
1.8%
12
 
1.7%
12
 
1.7%
11
 
1.5%
Other values (162) 540
75.4%
Uppercase Letter
ValueCountFrequency (%)
V 2
33.3%
D 2
33.3%
T 1
16.7%
R 1
16.7%
Math Symbol
ValueCountFrequency (%)
+ 40
100.0%
Open Punctuation
ValueCountFrequency (%)
( 20
100.0%
Close Punctuation
ValueCountFrequency (%)
) 20
100.0%
Space Separator
ValueCountFrequency (%)
5
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 716
88.6%
Common 86
 
10.6%
Latin 6
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
35
 
4.9%
27
 
3.8%
22
 
3.1%
16
 
2.2%
14
 
2.0%
14
 
2.0%
13
 
1.8%
12
 
1.7%
12
 
1.7%
11
 
1.5%
Other values (162) 540
75.4%
Common
ValueCountFrequency (%)
+ 40
46.5%
( 20
23.3%
) 20
23.3%
5
 
5.8%
_ 1
 
1.2%
Latin
ValueCountFrequency (%)
V 2
33.3%
D 2
33.3%
T 1
16.7%
R 1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 716
88.6%
ASCII 92
 
11.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
+ 40
43.5%
( 20
21.7%
) 20
21.7%
5
 
5.4%
V 2
 
2.2%
D 2
 
2.2%
_ 1
 
1.1%
T 1
 
1.1%
R 1
 
1.1%
Hangul
ValueCountFrequency (%)
35
 
4.9%
27
 
3.8%
22
 
3.1%
16
 
2.2%
14
 
2.0%
14
 
2.0%
13
 
1.8%
12
 
1.7%
12
 
1.7%
11
 
1.5%
Other values (162) 540
75.4%

규격
Text

Distinct71
Distinct (%)51.1%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-12T23:41:53.793170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length16
Mean length7.6618705
Min length2

Characters and Unicode

Total characters1065
Distinct characters138
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique51 ?
Unique (%)36.7%

Sample

1st row모든규격
2nd row모든규격
3rd row대형(업소용)
4th row소형(가정용)
5th row높이1m미만
ValueCountFrequency (%)
모든규격 24
 
15.9%
소형(가정용 8
 
5.3%
대형(업소용 7
 
4.6%
1인용 6
 
4.0%
2인용 6
 
4.0%
묶음단위(길이2m×지름30cm 4
 
2.6%
소형 3
 
2.0%
싱크대 3
 
2.0%
경우 3
 
2.0%
분리 3
 
2.0%
Other values (65) 84
55.6%
2023-12-12T23:41:54.184694image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
m 62
 
5.8%
49
 
4.6%
1 48
 
4.5%
( 45
 
4.2%
) 45
 
4.2%
0 40
 
3.8%
40
 
3.8%
39
 
3.7%
. 34
 
3.2%
2 31
 
2.9%
Other values (128) 632
59.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 619
58.1%
Decimal Number 169
 
15.9%
Lowercase Letter 83
 
7.8%
Open Punctuation 45
 
4.2%
Close Punctuation 45
 
4.2%
Math Symbol 40
 
3.8%
Other Punctuation 36
 
3.4%
Uppercase Letter 13
 
1.2%
Space Separator 12
 
1.1%
Other Symbol 3
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
49
 
7.9%
40
 
6.5%
39
 
6.3%
28
 
4.5%
27
 
4.4%
26
 
4.2%
26
 
4.2%
26
 
4.2%
26
 
4.2%
26
 
4.2%
Other values (94) 306
49.4%
Decimal Number
ValueCountFrequency (%)
1 48
28.4%
0 40
23.7%
2 31
18.3%
6 20
11.8%
9 8
 
4.7%
3 8
 
4.7%
5 6
 
3.6%
4 4
 
2.4%
8 2
 
1.2%
7 2
 
1.2%
Lowercase Letter
ValueCountFrequency (%)
m 62
74.7%
c 7
 
8.4%
z 3
 
3.6%
e 3
 
3.6%
i 3
 
3.6%
s 3
 
3.6%
k 1
 
1.2%
g 1
 
1.2%
Uppercase Letter
ValueCountFrequency (%)
L 6
46.2%
P 3
23.1%
C 1
 
7.7%
F 1
 
7.7%
R 1
 
7.7%
E 1
 
7.7%
Math Symbol
ValueCountFrequency (%)
× 27
67.5%
+ 11
27.5%
~ 1
 
2.5%
1
 
2.5%
Other Punctuation
ValueCountFrequency (%)
. 34
94.4%
· 2
 
5.6%
Open Punctuation
ValueCountFrequency (%)
( 45
100.0%
Close Punctuation
ValueCountFrequency (%)
) 45
100.0%
Space Separator
ValueCountFrequency (%)
12
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 619
58.1%
Common 350
32.9%
Latin 96
 
9.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
49
 
7.9%
40
 
6.5%
39
 
6.3%
28
 
4.5%
27
 
4.4%
26
 
4.2%
26
 
4.2%
26
 
4.2%
26
 
4.2%
26
 
4.2%
Other values (94) 306
49.4%
Common
ValueCountFrequency (%)
1 48
13.7%
( 45
12.9%
) 45
12.9%
0 40
11.4%
. 34
9.7%
2 31
8.9%
× 27
7.7%
6 20
5.7%
12
 
3.4%
+ 11
 
3.1%
Other values (10) 37
10.6%
Latin
ValueCountFrequency (%)
m 62
64.6%
c 7
 
7.3%
L 6
 
6.2%
z 3
 
3.1%
e 3
 
3.1%
i 3
 
3.1%
P 3
 
3.1%
s 3
 
3.1%
C 1
 
1.0%
k 1
 
1.0%
Other values (4) 4
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 619
58.1%
ASCII 413
38.8%
None 30
 
2.8%
CJK Compat 3
 
0.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
m 62
15.0%
1 48
11.6%
( 45
10.9%
) 45
10.9%
0 40
9.7%
. 34
8.2%
2 31
7.5%
6 20
 
4.8%
12
 
2.9%
+ 11
 
2.7%
Other values (20) 65
15.7%
Hangul
ValueCountFrequency (%)
49
 
7.9%
40
 
6.5%
39
 
6.3%
28
 
4.5%
27
 
4.4%
26
 
4.2%
26
 
4.2%
26
 
4.2%
26
 
4.2%
26
 
4.2%
Other values (94) 306
49.4%
None
ValueCountFrequency (%)
× 27
90.0%
· 2
 
6.7%
1
 
3.3%
CJK Compat
ValueCountFrequency (%)
3
100.0%

수수료
Real number (ℝ)

HIGH CORRELATION 

Distinct13
Distinct (%)9.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5977.6978
Minimum1500
Maximum30000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.4 KiB
2023-12-12T23:41:54.311363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1500
5-th percentile3000
Q13000
median4500
Q37500
95-th percentile15000
Maximum30000
Range28500
Interquartile range (IQR)4500

Descriptive statistics

Standard deviation4346.0301
Coefficient of variation (CV)0.72704077
Kurtosis8.3208072
Mean5977.6978
Median Absolute Deviation (MAD)1500
Skewness2.4687892
Sum830900
Variance18887977
MonotonicityNot monotonic
2023-12-12T23:41:54.430665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%)
3000 46
33.1%
4500 29
20.9%
7500 19
13.7%
6000 13
 
9.4%
12000 12
 
8.6%
15000 5
 
3.6%
4000 4
 
2.9%
1500 3
 
2.2%
2000 2
 
1.4%
9000 2
 
1.4%
Other values (3) 4
 
2.9%
ValueCountFrequency (%)
1500 3
 
2.2%
2000 2
 
1.4%
3000 46
33.1%
4000 4
 
2.9%
4500 29
20.9%
5400 1
 
0.7%
6000 13
 
9.4%
7500 19
13.7%
9000 2
 
1.4%
12000 12
 
8.6%
ValueCountFrequency (%)
30000 1
 
0.7%
22500 2
 
1.4%
15000 5
 
3.6%
12000 12
8.6%
9000 2
 
1.4%
7500 19
13.7%
6000 13
9.4%
5400 1
 
0.7%
4500 29
20.9%
4000 4
 
2.9%

수집운반비
Real number (ℝ)

HIGH CORRELATION 

Distinct13
Distinct (%)9.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3586.6187
Minimum900
Maximum18000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.4 KiB
2023-12-12T23:41:54.591328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum900
5-th percentile1800
Q11800
median2700
Q34500
95-th percentile9000
Maximum18000
Range17100
Interquartile range (IQR)2700

Descriptive statistics

Standard deviation2607.618
Coefficient of variation (CV)0.72704077
Kurtosis8.3208072
Mean3586.6187
Median Absolute Deviation (MAD)900
Skewness2.4687892
Sum498540
Variance6799671.8
MonotonicityNot monotonic
2023-12-12T23:41:54.730626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%)
1800 46
33.1%
2700 29
20.9%
4500 19
13.7%
3600 13
 
9.4%
7200 12
 
8.6%
9000 5
 
3.6%
2400 4
 
2.9%
900 3
 
2.2%
1200 2
 
1.4%
5400 2
 
1.4%
Other values (3) 4
 
2.9%
ValueCountFrequency (%)
900 3
 
2.2%
1200 2
 
1.4%
1800 46
33.1%
2400 4
 
2.9%
2700 29
20.9%
3240 1
 
0.7%
3600 13
 
9.4%
4500 19
13.7%
5400 2
 
1.4%
7200 12
 
8.6%
ValueCountFrequency (%)
18000 1
 
0.7%
13500 2
 
1.4%
9000 5
 
3.6%
7200 12
8.6%
5400 2
 
1.4%
4500 19
13.7%
3600 13
9.4%
3240 1
 
0.7%
2700 29
20.9%
2400 4
 
2.9%

처리비
Real number (ℝ)

HIGH CORRELATION 

Distinct13
Distinct (%)9.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2391.0791
Minimum600
Maximum12000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.4 KiB
2023-12-12T23:41:54.870092image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum600
5-th percentile1200
Q11200
median1800
Q33000
95-th percentile6000
Maximum12000
Range11400
Interquartile range (IQR)1800

Descriptive statistics

Standard deviation1738.412
Coefficient of variation (CV)0.72704077
Kurtosis8.3208072
Mean2391.0791
Median Absolute Deviation (MAD)600
Skewness2.4687892
Sum332360
Variance3022076.4
MonotonicityNot monotonic
2023-12-12T23:41:54.983164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%)
1200 46
33.1%
1800 29
20.9%
3000 19
13.7%
2400 13
 
9.4%
4800 12
 
8.6%
6000 5
 
3.6%
1600 4
 
2.9%
600 3
 
2.2%
800 2
 
1.4%
3600 2
 
1.4%
Other values (3) 4
 
2.9%
ValueCountFrequency (%)
600 3
 
2.2%
800 2
 
1.4%
1200 46
33.1%
1600 4
 
2.9%
1800 29
20.9%
2160 1
 
0.7%
2400 13
 
9.4%
3000 19
13.7%
3600 2
 
1.4%
4800 12
 
8.6%
ValueCountFrequency (%)
12000 1
 
0.7%
9000 2
 
1.4%
6000 5
 
3.6%
4800 12
8.6%
3600 2
 
1.4%
3000 19
13.7%
2400 13
9.4%
2160 1
 
0.7%
1800 29
20.9%
1600 4
 
2.9%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
Minimum2020-10-23 00:00:00
Maximum2020-10-23 00:00:00
2023-12-12T23:41:55.090750image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:41:55.203936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-12T23:41:52.199116image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:41:51.655986image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:41:51.911319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:41:52.287275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:41:51.737009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:41:51.997788image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:41:52.389272image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:41:51.826183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:41:52.095606image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T23:41:55.298726image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
종류품목규격수수료수집운반비처리비
종류1.0001.0000.8800.0840.0840.084
품목1.0001.0000.0000.0000.0000.000
규격0.8800.0001.0000.9640.9640.964
수수료0.0840.0000.9641.0001.0001.000
수집운반비0.0840.0000.9641.0001.0001.000
처리비0.0840.0000.9641.0001.0001.000
2023-12-12T23:41:55.420761image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
수수료수집운반비처리비종류
수수료1.0001.0001.0000.046
수집운반비1.0001.0001.0000.046
처리비1.0001.0001.0000.046
종류0.0460.0460.0461.000

Missing values

2023-12-12T23:41:52.526489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T23:41:52.669392image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

종류품목규격수수료수집운반비처리비데이터기준일자
0가전류VTR_DVD(스피커별도)모든규격3000180012002020-10-23
1기타류가방모든규격3000180012002020-10-23
2가전류가스렌지대형(업소용)4500270018002020-10-23
3가전류가스렌지소형(가정용)3000180012002020-10-23
4가전류가스오븐렌지높이1m미만3000180012002020-10-23
5가전류가스오븐렌지높이1m이상6000360024002020-10-23
6가전류가습기모든규격3000180012002020-10-23
7기타류각종상판+파티션+판넬+칠판+화이트보드등0.9m×1.8m기준6000360024002020-10-23
8기타류각종상판+파티션+판넬+칠판+화이트보드등1m×1m기준3000180012002020-10-23
9가구류간이침대(접이식침대)1인용7500450030002020-10-23
종류품목규격수수료수집운반비처리비데이터기준일자
129가전류탈수기모든규격3000180012002020-10-23
130가전류텔레비젼12인치이상4500270018002020-10-23
131가전류텔레비젼42인치이상7500450030002020-10-23
132기타류파유리+잡재물류40kg용마대4000240016002020-10-23
133가전류팩스기기모든규격3000180012002020-10-23
134가전류프린터대형(특수용)7500450030002020-10-23
135가전류프린터소형(가정용)3000180012002020-10-23
136가전류프린터중형(사무용)4500270018002020-10-23
137가구류피아노그랜드225001350090002020-10-23
138가구류피아노어프라이트15000900060002020-10-23