Overview

Dataset statistics

Number of variables5
Number of observations91
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.8 KiB
Average record size in memory42.5 B

Variable types

Categorical2
Text2
Numeric1

Dataset

Description경기도 김포시의 대형폐수수료정보(기물의 구분, 품명, 규격, 규격에 따른 수수료, 데이터 기준일자)의 데이터를 제공하고 있습니다.
Author경기도 김포시
URLhttps://www.data.go.kr/data/15034879/fileData.do

Alerts

데이터기준일자 has constant value ""Constant

Reproduction

Analysis started2024-04-06 08:34:30.029887
Analysis finished2024-04-06 08:34:30.908092
Duration0.88 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

Distinct3
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size860.0 B
가구류
46 
기 타
28 
생활용품류
17 

Length

Max length5
Median length3
Mean length3.3736264
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row가구류
2nd row가구류
3rd row가구류
4th row가구류
5th row가구류

Common Values

ValueCountFrequency (%)
가구류 46
50.5%
기 타 28
30.8%
생활용품류 17
 
18.7%

Length

2024-04-06T17:34:31.061996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:34:31.297065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
가구류 46
38.7%
28
23.5%
28
23.5%
생활용품류 17
 
14.3%

품명
Text

Distinct50
Distinct (%)54.9%
Missing0
Missing (%)0.0%
Memory size860.0 B
2024-04-06T17:34:31.709426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length10
Mean length3.6153846
Min length2

Characters and Unicode

Total characters329
Distinct characters116
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)27.5%

Sample

1st row장롱
2nd row장롱
3rd row장롱
4th row비키니옷장
5th row소파
ValueCountFrequency (%)
침대 8
 
8.2%
소파 4
 
4.1%
의자 4
 
4.1%
싱크대 3
 
3.1%
피아노 3
 
3.1%
책장,장식장 3
 
3.1%
수족관 3
 
3.1%
책상 3
 
3.1%
장롱 3
 
3.1%
매트 2
 
2.1%
Other values (44) 61
62.9%
2024-04-06T17:34:32.431768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
24
 
7.3%
15
 
4.6%
, 13
 
4.0%
11
 
3.3%
10
 
3.0%
7
 
2.1%
7
 
2.1%
7
 
2.1%
7
 
2.1%
7
 
2.1%
Other values (106) 221
67.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 306
93.0%
Other Punctuation 13
 
4.0%
Space Separator 6
 
1.8%
Uppercase Letter 2
 
0.6%
Close Punctuation 1
 
0.3%
Open Punctuation 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
24
 
7.8%
15
 
4.9%
11
 
3.6%
10
 
3.3%
7
 
2.3%
7
 
2.3%
7
 
2.3%
7
 
2.3%
7
 
2.3%
7
 
2.3%
Other values (100) 204
66.7%
Uppercase Letter
ValueCountFrequency (%)
T 1
50.0%
V 1
50.0%
Other Punctuation
ValueCountFrequency (%)
, 13
100.0%
Space Separator
ValueCountFrequency (%)
6
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 306
93.0%
Common 21
 
6.4%
Latin 2
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
24
 
7.8%
15
 
4.9%
11
 
3.6%
10
 
3.3%
7
 
2.3%
7
 
2.3%
7
 
2.3%
7
 
2.3%
7
 
2.3%
7
 
2.3%
Other values (100) 204
66.7%
Common
ValueCountFrequency (%)
, 13
61.9%
6
28.6%
) 1
 
4.8%
( 1
 
4.8%
Latin
ValueCountFrequency (%)
T 1
50.0%
V 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 302
91.8%
ASCII 23
 
7.0%
Compat Jamo 4
 
1.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
24
 
7.9%
15
 
5.0%
11
 
3.6%
10
 
3.3%
7
 
2.3%
7
 
2.3%
7
 
2.3%
7
 
2.3%
7
 
2.3%
7
 
2.3%
Other values (99) 200
66.2%
ASCII
ValueCountFrequency (%)
, 13
56.5%
6
26.1%
T 1
 
4.3%
) 1
 
4.3%
( 1
 
4.3%
V 1
 
4.3%
Compat Jamo
ValueCountFrequency (%)
4
100.0%

규격
Text

Distinct71
Distinct (%)78.0%
Missing0
Missing (%)0.0%
Memory size860.0 B
2024-04-06T17:34:32.980043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length11
Mean length5.956044
Min length2

Characters and Unicode

Total characters542
Distinct characters118
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique65 ?
Unique (%)71.4%

Sample

1st row두짝문
2nd row한짝문
3rd row유아용장롱(1m미만)
4th row모든규격
5th row4인용 초과
ValueCountFrequency (%)
모든규격 13
 
8.7%
이상 13
 
8.7%
1m 10
 
6.7%
미만 10
 
6.7%
높이 8
 
5.4%
2인용 6
 
4.0%
1인용 5
 
3.4%
이하 4
 
2.7%
2
 
1.3%
초과 2
 
1.3%
Other values (62) 76
51.0%
2024-04-06T17:34:33.747200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
58
 
10.7%
32
 
5.9%
1 26
 
4.8%
22
 
4.1%
20
 
3.7%
19
 
3.5%
m 15
 
2.8%
14
 
2.6%
14
 
2.6%
14
 
2.6%
Other values (108) 308
56.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 362
66.8%
Space Separator 58
 
10.7%
Decimal Number 58
 
10.7%
Lowercase Letter 23
 
4.2%
Open Punctuation 13
 
2.4%
Close Punctuation 13
 
2.4%
Other Punctuation 7
 
1.3%
Math Symbol 4
 
0.7%
Uppercase Letter 3
 
0.6%
Other Symbol 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
32
 
8.8%
22
 
6.1%
20
 
5.5%
19
 
5.2%
14
 
3.9%
14
 
3.9%
14
 
3.9%
14
 
3.9%
12
 
3.3%
12
 
3.3%
Other values (87) 189
52.2%
Decimal Number
ValueCountFrequency (%)
1 26
44.8%
2 10
 
17.2%
3 7
 
12.1%
0 5
 
8.6%
4 5
 
8.6%
5 3
 
5.2%
6 2
 
3.4%
Lowercase Letter
ValueCountFrequency (%)
m 15
65.2%
g 3
 
13.0%
k 2
 
8.7%
c 2
 
8.7%
1
 
4.3%
Other Punctuation
ValueCountFrequency (%)
. 4
57.1%
, 3
42.9%
Uppercase Letter
ValueCountFrequency (%)
M 2
66.7%
K 1
33.3%
Space Separator
ValueCountFrequency (%)
58
100.0%
Open Punctuation
ValueCountFrequency (%)
( 13
100.0%
Close Punctuation
ValueCountFrequency (%)
) 13
100.0%
Math Symbol
ValueCountFrequency (%)
+ 4
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 362
66.8%
Common 155
28.6%
Latin 25
 
4.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
32
 
8.8%
22
 
6.1%
20
 
5.5%
19
 
5.2%
14
 
3.9%
14
 
3.9%
14
 
3.9%
14
 
3.9%
12
 
3.3%
12
 
3.3%
Other values (87) 189
52.2%
Common
ValueCountFrequency (%)
58
37.4%
1 26
16.8%
( 13
 
8.4%
) 13
 
8.4%
2 10
 
6.5%
3 7
 
4.5%
0 5
 
3.2%
4 5
 
3.2%
. 4
 
2.6%
+ 4
 
2.6%
Other values (5) 10
 
6.5%
Latin
ValueCountFrequency (%)
m 15
60.0%
g 3
 
12.0%
M 2
 
8.0%
k 2
 
8.0%
c 2
 
8.0%
K 1
 
4.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 361
66.6%
ASCII 178
32.8%
Letterlike Symbols 1
 
0.2%
Compat Jamo 1
 
0.2%
CJK Compat 1
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
58
32.6%
1 26
14.6%
m 15
 
8.4%
( 13
 
7.3%
) 13
 
7.3%
2 10
 
5.6%
3 7
 
3.9%
0 5
 
2.8%
4 5
 
2.8%
. 4
 
2.2%
Other values (9) 22
 
12.4%
Hangul
ValueCountFrequency (%)
32
 
8.9%
22
 
6.1%
20
 
5.5%
19
 
5.3%
14
 
3.9%
14
 
3.9%
14
 
3.9%
14
 
3.9%
12
 
3.3%
12
 
3.3%
Other values (86) 188
52.1%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
CJK Compat
ValueCountFrequency (%)
1
100.0%

부과금액
Real number (ℝ)

Distinct7
Distinct (%)7.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5824.1758
Minimum3000
Maximum40000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size951.0 B
2024-04-06T17:34:33.980968image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3000
5-th percentile3000
Q13000
median3000
Q35000
95-th percentile15000
Maximum40000
Range37000
Interquartile range (IQR)2000

Descriptive statistics

Standard deviation5689.3534
Coefficient of variation (CV)0.97685125
Kurtosis17.091155
Mean5824.1758
Median Absolute Deviation (MAD)0
Skewness3.6842415
Sum530000
Variance32368742
MonotonicityNot monotonic
2024-04-06T17:34:34.186787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
3000 50
54.9%
5000 22
24.2%
10000 12
 
13.2%
15000 4
 
4.4%
20000 1
 
1.1%
30000 1
 
1.1%
40000 1
 
1.1%
ValueCountFrequency (%)
3000 50
54.9%
5000 22
24.2%
10000 12
 
13.2%
15000 4
 
4.4%
20000 1
 
1.1%
30000 1
 
1.1%
40000 1
 
1.1%
ValueCountFrequency (%)
40000 1
 
1.1%
30000 1
 
1.1%
20000 1
 
1.1%
15000 4
 
4.4%
10000 12
 
13.2%
5000 22
24.2%
3000 50
54.9%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size860.0 B
2024-03-12
91 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2024-03-12
2nd row2024-03-12
3rd row2024-03-12
4th row2024-03-12
5th row2024-03-12

Common Values

ValueCountFrequency (%)
2024-03-12 91
100.0%

Length

2024-04-06T17:34:34.421324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:34:34.576416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2024-03-12 91
100.0%

Interactions

2024-04-06T17:34:30.448203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-06T17:34:34.683277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분품명규격부과금액
구분1.0001.0000.0000.000
품명1.0001.0000.0000.000
규격0.0000.0001.0001.000
부과금액0.0000.0001.0001.000
2024-04-06T17:34:34.997708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
부과금액구분
부과금액1.0000.000
구분0.0001.000

Missing values

2024-04-06T17:34:30.677708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-06T17:34:30.844516image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분품명규격부과금액데이터기준일자
0가구류장롱두짝문150002024-03-12
1가구류장롱한짝문100002024-03-12
2가구류장롱유아용장롱(1m미만)50002024-03-12
3가구류비키니옷장모든규격30002024-03-12
4가구류소파4인용 초과100002024-03-12
5가구류소파4인용 이하50002024-03-12
6가구류소파2인용 이하30002024-03-12
7가구류소파보조의자30002024-03-12
8가구류책상양수(상판+양서랍)100002024-03-12
9가구류책상편수(상판+외서랍)50002024-03-12
구분품명규격부과금액데이터기준일자
81기 타운동기구덤벨, 바벨 등30002024-03-12
82기 타소화기3.3kg 이하30002024-03-12
83기 타소화기3.3kg 초과50002024-03-12
84기 타전기장판, 매트1인용30002024-03-12
85기 타전기장판, 매트2인용 이상50002024-03-12
86기 타카시트1인용30002024-03-12
87기 타카시트2인용 이상50002024-03-12
88기 타캣타워모든 규격30002024-03-12
89기 타파티션1.5m 이상50002024-03-12
90기 타파티션1.5m 미만30002024-03-12