Overview

Dataset statistics

Number of variables4
Number of observations283
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.5 KiB
Average record size in memory34.5 B

Variable types

Numeric2
Text2

Dataset

Description본 파일데이터는 경기도 화성시 대형폐기물 처리수수료에 대한 데이터로 연번, 품명, 규격, 부과금액의 정보를 포함하고 있습니다.
Author경기도 화성시
URLhttps://www.data.go.kr/data/15042373/fileData.do

Alerts

연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 08:00:20.450653
Analysis finished2023-12-12 08:00:21.401348
Duration0.95 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct283
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean142
Minimum1
Maximum283
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.6 KiB
2023-12-12T17:00:21.470752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile15.1
Q171.5
median142
Q3212.5
95-th percentile268.9
Maximum283
Range282
Interquartile range (IQR)141

Descriptive statistics

Standard deviation81.839273
Coefficient of variation (CV)0.57633291
Kurtosis-1.2
Mean142
Median Absolute Deviation (MAD)71
Skewness0
Sum40186
Variance6697.6667
MonotonicityStrictly increasing
2023-12-12T17:00:21.614082image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.4%
188 1
 
0.4%
194 1
 
0.4%
193 1
 
0.4%
192 1
 
0.4%
191 1
 
0.4%
190 1
 
0.4%
189 1
 
0.4%
187 1
 
0.4%
196 1
 
0.4%
Other values (273) 273
96.5%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
283 1
0.4%
282 1
0.4%
281 1
0.4%
280 1
0.4%
279 1
0.4%
278 1
0.4%
277 1
0.4%
276 1
0.4%
275 1
0.4%
274 1
0.4%

품명
Text

Distinct150
Distinct (%)53.0%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2023-12-12T17:00:21.914382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length15
Mean length4.2862191
Min length2

Characters and Unicode

Total characters1213
Distinct characters229
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique75 ?
Unique (%)26.5%

Sample

1st row가스대
2nd row개수대
3rd row라텍스
4th row라텍스
5th row문갑
ValueCountFrequency (%)
컴퓨터 14
 
4.5%
소파(응접세트 9
 
2.9%
오락기 8
 
2.6%
책장 6
 
1.9%
일반침대 6
 
1.9%
장롱 5
 
1.6%
텔레비전 5
 
1.6%
문갑 4
 
1.3%
난로 4
 
1.3%
식탁(테이블 4
 
1.3%
Other values (150) 248
79.2%
2023-12-12T17:00:22.355863image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
67
 
5.5%
36
 
3.0%
( 34
 
2.8%
) 34
 
2.8%
30
 
2.5%
29
 
2.4%
26
 
2.1%
23
 
1.9%
22
 
1.8%
21
 
1.7%
Other values (219) 891
73.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1098
90.5%
Open Punctuation 34
 
2.8%
Close Punctuation 34
 
2.8%
Space Separator 30
 
2.5%
Other Punctuation 14
 
1.2%
Uppercase Letter 3
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
67
 
6.1%
36
 
3.3%
29
 
2.6%
26
 
2.4%
23
 
2.1%
22
 
2.0%
21
 
1.9%
19
 
1.7%
18
 
1.6%
18
 
1.6%
Other values (213) 819
74.6%
Uppercase Letter
ValueCountFrequency (%)
D 2
66.7%
V 1
33.3%
Open Punctuation
ValueCountFrequency (%)
( 34
100.0%
Close Punctuation
ValueCountFrequency (%)
) 34
100.0%
Space Separator
ValueCountFrequency (%)
30
100.0%
Other Punctuation
ValueCountFrequency (%)
, 14
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1098
90.5%
Common 112
 
9.2%
Latin 3
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
67
 
6.1%
36
 
3.3%
29
 
2.6%
26
 
2.4%
23
 
2.1%
22
 
2.0%
21
 
1.9%
19
 
1.7%
18
 
1.6%
18
 
1.6%
Other values (213) 819
74.6%
Common
ValueCountFrequency (%)
( 34
30.4%
) 34
30.4%
30
26.8%
, 14
12.5%
Latin
ValueCountFrequency (%)
D 2
66.7%
V 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1098
90.5%
ASCII 115
 
9.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
67
 
6.1%
36
 
3.3%
29
 
2.6%
26
 
2.4%
23
 
2.1%
22
 
2.0%
21
 
1.9%
19
 
1.7%
18
 
1.6%
18
 
1.6%
Other values (213) 819
74.6%
ASCII
ValueCountFrequency (%)
( 34
29.6%
) 34
29.6%
30
26.1%
, 14
12.2%
D 2
 
1.7%
V 1
 
0.9%

규격
Text

Distinct174
Distinct (%)61.5%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2023-12-12T17:00:22.694126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length95
Median length38
Mean length7.0777385
Min length2

Characters and Unicode

Total characters2003
Distinct characters229
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique148 ?
Unique (%)52.3%

Sample

1st row개당
2nd row모든규격
3rd row가로 세로1m(큰수)이상
4th row가로 세로2m(큰수)미만
5th row모든규격
ValueCountFrequency (%)
모든규격 50
 
12.9%
이상 19
 
4.9%
미만 16
 
4.1%
1인용 10
 
2.6%
개당 10
 
2.6%
소형 8
 
2.1%
대형 7
 
1.8%
2인용 7
 
1.8%
대리석 7
 
1.8%
높이1m미만 5
 
1.3%
Other values (171) 250
64.3%
2023-12-12T17:00:23.131935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
106
 
5.3%
105
 
5.2%
0 94
 
4.7%
1 88
 
4.4%
m 77
 
3.8%
68
 
3.4%
61
 
3.0%
61
 
3.0%
54
 
2.7%
50
 
2.5%
Other values (219) 1239
61.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1239
61.9%
Decimal Number 341
 
17.0%
Lowercase Letter 123
 
6.1%
Space Separator 106
 
5.3%
Other Punctuation 64
 
3.2%
Open Punctuation 46
 
2.3%
Close Punctuation 46
 
2.3%
Other Symbol 16
 
0.8%
Uppercase Letter 14
 
0.7%
Math Symbol 8
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
105
 
8.5%
68
 
5.5%
61
 
4.9%
61
 
4.9%
54
 
4.4%
50
 
4.0%
50
 
4.0%
50
 
4.0%
46
 
3.7%
37
 
3.0%
Other values (190) 657
53.0%
Decimal Number
ValueCountFrequency (%)
0 94
27.6%
1 88
25.8%
2 47
13.8%
4 22
 
6.5%
6 21
 
6.2%
5 21
 
6.2%
9 17
 
5.0%
3 16
 
4.7%
8 11
 
3.2%
7 4
 
1.2%
Lowercase Letter
ValueCountFrequency (%)
m 77
62.6%
c 29
 
23.6%
k 8
 
6.5%
g 8
 
6.5%
x 1
 
0.8%
Other Punctuation
ValueCountFrequency (%)
, 44
68.8%
. 14
 
21.9%
* 6
 
9.4%
Other Symbol
ValueCountFrequency (%)
11
68.8%
4
 
25.0%
1
 
6.2%
Uppercase Letter
ValueCountFrequency (%)
L 8
57.1%
D 4
28.6%
R 2
 
14.3%
Math Symbol
ValueCountFrequency (%)
× 7
87.5%
~ 1
 
12.5%
Space Separator
ValueCountFrequency (%)
106
100.0%
Open Punctuation
ValueCountFrequency (%)
( 46
100.0%
Close Punctuation
ValueCountFrequency (%)
) 46
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1239
61.9%
Common 627
31.3%
Latin 137
 
6.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
105
 
8.5%
68
 
5.5%
61
 
4.9%
61
 
4.9%
54
 
4.4%
50
 
4.0%
50
 
4.0%
50
 
4.0%
46
 
3.7%
37
 
3.0%
Other values (190) 657
53.0%
Common
ValueCountFrequency (%)
106
16.9%
0 94
15.0%
1 88
14.0%
2 47
7.5%
( 46
7.3%
) 46
7.3%
, 44
7.0%
4 22
 
3.5%
6 21
 
3.3%
5 21
 
3.3%
Other values (11) 92
14.7%
Latin
ValueCountFrequency (%)
m 77
56.2%
c 29
 
21.2%
L 8
 
5.8%
k 8
 
5.8%
g 8
 
5.8%
D 4
 
2.9%
R 2
 
1.5%
x 1
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1239
61.9%
ASCII 741
37.0%
CJK Compat 16
 
0.8%
None 7
 
0.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
106
14.3%
0 94
12.7%
1 88
11.9%
m 77
10.4%
2 47
 
6.3%
( 46
 
6.2%
) 46
 
6.2%
, 44
 
5.9%
c 29
 
3.9%
4 22
 
3.0%
Other values (15) 142
19.2%
Hangul
ValueCountFrequency (%)
105
 
8.5%
68
 
5.5%
61
 
4.9%
61
 
4.9%
54
 
4.4%
50
 
4.0%
50
 
4.0%
50
 
4.0%
46
 
3.7%
37
 
3.0%
Other values (190) 657
53.0%
CJK Compat
ValueCountFrequency (%)
11
68.8%
4
 
25.0%
1
 
6.2%
None
ValueCountFrequency (%)
× 7
100.0%

부과금액(원)
Real number (ℝ)

Distinct18
Distinct (%)6.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4462.8975
Minimum500
Maximum50000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.6 KiB
2023-12-12T17:00:23.268911image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum500
5-th percentile2000
Q12000
median3000
Q35000
95-th percentile10000
Maximum50000
Range49500
Interquartile range (IQR)3000

Descriptive statistics

Standard deviation4026.4464
Coefficient of variation (CV)0.90220455
Kurtosis58.354833
Mean4462.8975
Median Absolute Deviation (MAD)1000
Skewness5.9092764
Sum1263000
Variance16212271
MonotonicityNot monotonic
2023-12-12T17:00:23.379384image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
3000 78
27.6%
2000 62
21.9%
4000 41
14.5%
5000 31
 
11.0%
6000 14
 
4.9%
8000 13
 
4.6%
1000 10
 
3.5%
7000 8
 
2.8%
10000 7
 
2.5%
15000 5
 
1.8%
Other values (8) 14
 
4.9%
ValueCountFrequency (%)
500 2
 
0.7%
1000 10
 
3.5%
2000 62
21.9%
3000 78
27.6%
4000 41
14.5%
5000 31
 
11.0%
6000 14
 
4.9%
7000 8
 
2.8%
8000 13
 
4.6%
9000 3
 
1.1%
ValueCountFrequency (%)
50000 1
 
0.4%
18000 1
 
0.4%
16000 1
 
0.4%
15000 5
 
1.8%
14000 1
 
0.4%
13000 1
 
0.4%
12000 4
 
1.4%
10000 7
2.5%
9000 3
 
1.1%
8000 13
4.6%

Interactions

2023-12-12T17:00:20.947293image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:00:20.742789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:00:21.053321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:00:20.835081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T17:00:23.467643image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번부과금액(원)
연번1.0000.221
부과금액(원)0.2211.000
2023-12-12T17:00:23.547746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번부과금액(원)
연번1.000-0.324
부과금액(원)-0.3241.000

Missing values

2023-12-12T17:00:21.212989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:00:21.361408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번품명규격부과금액(원)
01가스대개당2000
12개수대모든규격3000
23라텍스가로 세로1m(큰수)이상6000
34라텍스가로 세로2m(큰수)미만4000
45문갑모든규격4000
56문갑대리석 높이또는 폭(큰수)2m이상16000
67문갑대리석 높이또는 폭(큰수)1.5m미만6000
78문갑대리석 높이또는 폭(큰수)1.5m이상~2m미만12000
89문짝모든규격4000
910서랍모든규격2000
연번품명규격부과금액(원)
273274헬멧개당1000
274275헬스기구헬스싸이클3000
275276헬스기구러닝머신5000
276277헬스기구스텝퍼3000
277278형광등(갓포함)장식용3000
278279형광등(갓포함)일반용2000
279280화분대형 50cm 이상2000
280281화분소형 50cm 미만1000
281282화환모든규격4000
282283휠체어모든규격3000