Overview

Dataset statistics

Number of variables4
Number of observations165
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.6 KiB
Average record size in memory34.8 B

Variable types

Numeric2
Text2

Dataset

Description인천광역시 연수구 대형폐기물 품목 및 수수료 데이터로서 연번, 품명, 규격, 수수료(원) 등의 항목으로 이루어져 있습니다.
Author인천광역시 연수구
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15116346&srcSe=7661IVAWM27C61E190

Alerts

연번 has unique valuesUnique

Reproduction

Analysis started2024-01-28 08:34:35.204239
Analysis finished2024-01-28 08:34:35.867848
Duration0.66 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct165
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean83
Minimum1
Maximum165
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.6 KiB
2024-01-28T17:34:35.925400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile9.2
Q142
median83
Q3124
95-th percentile156.8
Maximum165
Range164
Interquartile range (IQR)82

Descriptive statistics

Standard deviation47.775517
Coefficient of variation (CV)0.57560864
Kurtosis-1.2
Mean83
Median Absolute Deviation (MAD)41
Skewness0
Sum13695
Variance2282.5
MonotonicityStrictly increasing
2024-01-28T17:34:36.045551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.6%
105 1
 
0.6%
107 1
 
0.6%
108 1
 
0.6%
109 1
 
0.6%
110 1
 
0.6%
111 1
 
0.6%
112 1
 
0.6%
113 1
 
0.6%
114 1
 
0.6%
Other values (155) 155
93.9%
ValueCountFrequency (%)
1 1
0.6%
2 1
0.6%
3 1
0.6%
4 1
0.6%
5 1
0.6%
6 1
0.6%
7 1
0.6%
8 1
0.6%
9 1
0.6%
10 1
0.6%
ValueCountFrequency (%)
165 1
0.6%
164 1
0.6%
163 1
0.6%
162 1
0.6%
161 1
0.6%
160 1
0.6%
159 1
0.6%
158 1
0.6%
157 1
0.6%
156 1
0.6%

품명
Text

Distinct86
Distinct (%)52.1%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2024-01-28T17:34:36.268937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length13
Mean length4.2
Min length2

Characters and Unicode

Total characters693
Distinct characters179
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique43 ?
Unique (%)26.1%

Sample

1st row문갑(거실장)
2nd row문갑(거실장)
3rd row문갑(거실장)
4th row문짝
5th row문짝
ValueCountFrequency (%)
소파 9
 
5.0%
7
 
3.9%
침대틀 6
 
3.3%
식탁 6
 
3.3%
책상 5
 
2.8%
장롱(1쪽 4
 
2.2%
유아용품 3
 
1.7%
밥상 3
 
1.7%
액자 3
 
1.7%
카펫 3
 
1.7%
Other values (82) 131
72.8%
2024-01-28T17:34:36.626680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
) 32
 
4.6%
32
 
4.6%
( 32
 
4.6%
24
 
3.5%
18
 
2.6%
15
 
2.2%
13
 
1.9%
13
 
1.9%
12
 
1.7%
12
 
1.7%
Other values (169) 490
70.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 596
86.0%
Close Punctuation 32
 
4.6%
Open Punctuation 32
 
4.6%
Space Separator 15
 
2.2%
Decimal Number 7
 
1.0%
Uppercase Letter 6
 
0.9%
Other Punctuation 4
 
0.6%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
32
 
5.4%
24
 
4.0%
18
 
3.0%
13
 
2.2%
13
 
2.2%
12
 
2.0%
12
 
2.0%
12
 
2.0%
11
 
1.8%
11
 
1.8%
Other values (160) 438
73.5%
Uppercase Letter
ValueCountFrequency (%)
F 2
33.3%
R 2
33.3%
P 2
33.3%
Close Punctuation
ValueCountFrequency (%)
) 32
100.0%
Open Punctuation
ValueCountFrequency (%)
( 32
100.0%
Space Separator
ValueCountFrequency (%)
15
100.0%
Decimal Number
ValueCountFrequency (%)
1 7
100.0%
Other Punctuation
ValueCountFrequency (%)
, 4
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 596
86.0%
Common 91
 
13.1%
Latin 6
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
32
 
5.4%
24
 
4.0%
18
 
3.0%
13
 
2.2%
13
 
2.2%
12
 
2.0%
12
 
2.0%
12
 
2.0%
11
 
1.8%
11
 
1.8%
Other values (160) 438
73.5%
Common
ValueCountFrequency (%)
) 32
35.2%
( 32
35.2%
15
16.5%
1 7
 
7.7%
, 4
 
4.4%
- 1
 
1.1%
Latin
ValueCountFrequency (%)
F 2
33.3%
R 2
33.3%
P 2
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 596
86.0%
ASCII 97
 
14.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
) 32
33.0%
( 32
33.0%
15
15.5%
1 7
 
7.2%
, 4
 
4.1%
F 2
 
2.1%
R 2
 
2.1%
P 2
 
2.1%
- 1
 
1.0%
Hangul
ValueCountFrequency (%)
32
 
5.4%
24
 
4.0%
18
 
3.0%
13
 
2.2%
13
 
2.2%
12
 
2.0%
12
 
2.0%
12
 
2.0%
11
 
1.8%
11
 
1.8%
Other values (160) 438
73.5%

규격
Text

Distinct124
Distinct (%)75.2%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2024-01-28T17:34:36.874405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length15
Mean length7.3818182
Min length1

Characters and Unicode

Total characters1218
Distinct characters155
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique117 ?
Unique (%)70.9%

Sample

1st row모든 규격
2nd row돌문갑
3rd row유리(부착)문갑
4th row목재
5th row철재
ValueCountFrequency (%)
모든 33
 
10.7%
규격 33
 
10.7%
미만 24
 
7.8%
이상 23
 
7.5%
1m 7
 
2.3%
이하 4
 
1.3%
포함 4
 
1.3%
6인용 4
 
1.3%
가정용 4
 
1.3%
4
 
1.3%
Other values (130) 167
54.4%
2024-01-28T17:34:37.235668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
145
 
11.9%
( 70
 
5.7%
) 70
 
5.7%
m 48
 
3.9%
47
 
3.9%
1 41
 
3.4%
39
 
3.2%
35
 
2.9%
33
 
2.7%
33
 
2.7%
Other values (145) 657
53.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 693
56.9%
Space Separator 145
 
11.9%
Decimal Number 135
 
11.1%
Lowercase Letter 72
 
5.9%
Open Punctuation 70
 
5.7%
Close Punctuation 70
 
5.7%
Other Punctuation 27
 
2.2%
Math Symbol 4
 
0.3%
Other Symbol 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
47
 
6.8%
39
 
5.6%
35
 
5.1%
33
 
4.8%
33
 
4.8%
33
 
4.8%
32
 
4.6%
29
 
4.2%
27
 
3.9%
26
 
3.8%
Other values (123) 359
51.8%
Decimal Number
ValueCountFrequency (%)
1 41
30.4%
0 32
23.7%
2 16
 
11.9%
5 16
 
11.9%
3 10
 
7.4%
6 9
 
6.7%
4 5
 
3.7%
9 4
 
3.0%
7 2
 
1.5%
Lowercase Letter
ValueCountFrequency (%)
m 48
66.7%
c 17
 
23.6%
k 3
 
4.2%
g 3
 
4.2%
h 1
 
1.4%
Other Punctuation
ValueCountFrequency (%)
. 18
66.7%
, 9
33.3%
Math Symbol
ValueCountFrequency (%)
× 3
75.0%
~ 1
 
25.0%
Space Separator
ValueCountFrequency (%)
145
100.0%
Open Punctuation
ValueCountFrequency (%)
( 70
100.0%
Close Punctuation
ValueCountFrequency (%)
) 70
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 693
56.9%
Common 453
37.2%
Latin 72
 
5.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
47
 
6.8%
39
 
5.6%
35
 
5.1%
33
 
4.8%
33
 
4.8%
33
 
4.8%
32
 
4.6%
29
 
4.2%
27
 
3.9%
26
 
3.8%
Other values (123) 359
51.8%
Common
ValueCountFrequency (%)
145
32.0%
( 70
15.5%
) 70
15.5%
1 41
 
9.1%
0 32
 
7.1%
. 18
 
4.0%
2 16
 
3.5%
5 16
 
3.5%
3 10
 
2.2%
, 9
 
2.0%
Other values (7) 26
 
5.7%
Latin
ValueCountFrequency (%)
m 48
66.7%
c 17
 
23.6%
k 3
 
4.2%
g 3
 
4.2%
h 1
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 693
56.9%
ASCII 520
42.7%
None 3
 
0.2%
CJK Compat 2
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
145
27.9%
( 70
13.5%
) 70
13.5%
m 48
 
9.2%
1 41
 
7.9%
0 32
 
6.2%
. 18
 
3.5%
c 17
 
3.3%
2 16
 
3.1%
5 16
 
3.1%
Other values (10) 47
 
9.0%
Hangul
ValueCountFrequency (%)
47
 
6.8%
39
 
5.6%
35
 
5.1%
33
 
4.8%
33
 
4.8%
33
 
4.8%
32
 
4.6%
29
 
4.2%
27
 
3.9%
26
 
3.8%
Other values (123) 359
51.8%
None
ValueCountFrequency (%)
× 3
100.0%
CJK Compat
ValueCountFrequency (%)
2
100.0%

수수료(원)
Real number (ℝ)

Distinct16
Distinct (%)9.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6200
Minimum1000
Maximum50000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.6 KiB
2024-01-28T17:34:37.344124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1000
5-th percentile2000
Q13000
median5000
Q38000
95-th percentile12000
Maximum50000
Range49000
Interquartile range (IQR)5000

Descriptive statistics

Standard deviation5119.5465
Coefficient of variation (CV)0.8257333
Kurtosis34.465308
Mean6200
Median Absolute Deviation (MAD)2000
Skewness4.6750115
Sum1023000
Variance26209756
MonotonicityNot monotonic
2024-01-28T17:34:37.446418image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%)
5000 40
24.2%
3000 28
17.0%
10000 20
12.1%
2000 16
 
9.7%
4000 14
 
8.5%
6000 13
 
7.9%
12000 7
 
4.2%
8000 7
 
4.2%
1000 4
 
2.4%
11000 4
 
2.4%
Other values (6) 12
 
7.3%
ValueCountFrequency (%)
1000 4
 
2.4%
2000 16
 
9.7%
3000 28
17.0%
4000 14
 
8.5%
5000 40
24.2%
6000 13
 
7.9%
7000 4
 
2.4%
8000 7
 
4.2%
9000 3
 
1.8%
10000 20
12.1%
ValueCountFrequency (%)
50000 1
 
0.6%
30000 1
 
0.6%
20000 1
 
0.6%
15000 2
 
1.2%
12000 7
 
4.2%
11000 4
 
2.4%
10000 20
12.1%
9000 3
 
1.8%
8000 7
 
4.2%
7000 4
 
2.4%

Interactions

2024-01-28T17:34:35.608478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T17:34:35.451558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T17:34:35.673926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T17:34:35.536140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-28T17:34:37.519213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번품명수수료(원)
연번1.0000.9960.233
품명0.9961.0000.196
수수료(원)0.2330.1961.000
2024-01-28T17:34:37.591829image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번수수료(원)
연번1.000-0.144
수수료(원)-0.1441.000

Missing values

2024-01-28T17:34:35.761487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-28T17:34:35.833705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번품명규격수수료(원)
01문갑(거실장)모든 규격6000
12문갑(거실장)돌문갑10000
23문갑(거실장)유리(부착)문갑10000
34문짝목재4000
45문짝철재5000
56문틀소(높이 1.6m 미만)3000
67문틀대(높이 1.6m 이상)5000
78사무용칸막이(파티션)모든 규격5000
89신발장소(높이 1.5m 미만)3000
910신발장대(높이 1.5m 이상)6000
연번품명규격수수료(원)
155156항아리(화분)대(70cm 초과)5000
156157폴더매트3.3㎡(1평) 당8000
157158화환3000
158159환풍기가정용2000
159160환풍기업소용3000
160161휠체어모든 규격2000
161162물탱크(FRP 제외)2.5톤 미만30000
162163물탱크(FRP 제외)2.5톤 이상50000
163164장판5m 미만4000
164165장판5m 이상5000