Overview

Dataset statistics

Number of variables: 4
Number of observations: 388
Missing cells: 173
Missing cells (%): 11.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 12.3 KiB
Average record size in memory: 32.3 B
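
The missing-cell percentage follows from the counts above: the denominator is the total number of cells (observations times variables). A quick arithmetic check, using only the figures reported here (the variable names are illustrative):

```python
# Figures copied from the dataset statistics above.
n_variables = 4
n_observations = 388
missing_cells = 173

total_cells = n_variables * n_observations        # 1552 cells in total
missing_pct = 100 * missing_cells / total_cells   # share of empty cells

print(f"{missing_pct:.1f}%")  # 11.1%
```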

Variable types

Categorical: 1
Text: 3

Dataset

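Description: 부산광역시남구_쓰레기종량제봉투_판매소_현황_20210531 (Nam-gu, Busan Metropolitan City: status of volume-rate garbage bag retailers, as of 2021-05-31)
Author: 부산광역시 남구 (Nam-gu, Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15045459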

Alerts

전화번호 (phone number) has 173 (44.6%) missing values [Missing]

Reproduction

Analysis started: 2023-12-10 17:19:19.310566
Analysis finished: 2023-12-10 17:19:20.765260
Duration: 1.45 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

행정동 (administrative district)
Categorical

Distinct: 19
Distinct (%): 4.9%
Missing: 0
Missing (%): 0.0%
Memory size: 3.2 KiB

Value | Count
대연3동 | 52
용호1동 | 45
대연1동 | 37
문현1동 | 24
용호3동 | 24
Other values (14) | 206

Length

Max length: 5
Median length: 4
Mean length: 3.9407216
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%
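
Distinct and Unique measure different things. As a sketch, assuming Distinct counts the different values in a column and Unique counts the values that occur exactly once (the helper name `distinct_and_unique` is made up for illustration):

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (distinct, unique) for a list of non-missing values.

    distinct: number of different values
    unique:   number of values that appear exactly once
    """
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique

# Five rows that all hold the same district, as in this variable's sample.
sample = ["대연1동"] * 5
print(distinct_and_unique(sample))  # (1, 0)
```

Under that reading, a Unique count of 0 means every one of the 19 district names occurs more than once among the 388 rows.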

Sample

1st row: 대연1동
2nd row: 대연1동
3rd row: 대연1동
4th row: 대연1동
5th row: 대연1동

Common Values

Value | Count | Frequency (%)
대연3동 | 52 | 13.4%
용호1동 | 45 | 11.6%
대연1동 | 37 | 9.5%
문현1동 | 24 | 6.2%
용호3동 | 24 | 6.2%
대연5동 | 21 | 5.4%
감만1동 | 21 | 5.4%
문현2동 | 20 | 5.2%
문현3동 | 20 | 5.2%
문현4동 | 18 | 4.6%
Other values (9) | 106 | 27.3%

Length

[Figure: Histogram of lengths of the category]

Value | Count | Frequency (%)
대연3동 | 52 | 13.4%
용호1동 | 45 | 11.6%
대연1동 | 37 | 9.5%
대연5동 | 26 | 6.7%
문현1동 | 24 | 6.2%
용호3동 | 24 | 6.2%
감만1동 | 21 | 5.4%
문현2동 | 20 | 5.2%
문현3동 | 20 | 5.2%
대연4동 | 18 | 4.6%
Other values (8) | 101 | 26.0%

상호명 (business name)
Text

Distinct: 378
Distinct (%): 97.4%
Missing: 0
Missing (%): 0.0%
Memory size: 3.2 KiB

Length

Max length: 25
Median length: 16
Mean length: 7.9097938
Min length: 2

Characters and Unicode

Total characters: 3069
Distinct characters: 284
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
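
As an illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories for one of the sample values from this report; the helper name `category_counts` is illustrative, and the report's own implementation may differ. The two-letter codes map to the names used in the tables: `Lo` is Other Letter, `Lu` Uppercase Letter, `Nd` Decimal Number, `Zs` Space Separator, `So` Other Symbol.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count the characters of `text` by Unicode general category."""
    return Counter(unicodedata.category(ch) for ch in text)

# A sample value taken from this report.
counts = category_counts("GS25 대연중앙점")
for category, n in counts.most_common():
    print(category, n)
```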

Unique

Unique: 370
Unique (%): 95.4%

Sample

1st row: GS25 대연중앙점
2nd row: 한진사
3rd row: 드림마트
4th row: GS25 대연래미안
5th row: 세븐일레븐 부산대연중앙점

Value | Count | Frequency (%)
gs25 | 43 | 7.4%
씨유 | 32 | 5.5%
세븐일레븐 | 31 | 5.3%
이마트24 | 17 | 2.9%
미니스톱 | 10 | 1.7%
문현점 | 6 | 1.0%
현대마트 | 4 | 0.7%
부산대연점 | 4 | 0.7%
대연점 | 4 | 0.7%
㈜코리아세븐 | 4 | 0.7%
Other values (391) | 426 | 73.3%
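
The word table above appears to count whitespace-separated tokens after lowercasing (note `gs25` rather than `GS25`). A minimal sketch of that kind of counting, assuming plain whitespace splitting; the report's actual tokenizer may differ, and `word_counts` is an illustrative name:

```python
from collections import Counter

def word_counts(values):
    """Count lowercased, whitespace-separated tokens across all values."""
    counter = Counter()
    for value in values:
        counter.update(value.lower().split())
    return counter

# The five sample rows listed for this variable.
sample = [
    "GS25 대연중앙점",
    "한진사",
    "드림마트",
    "GS25 대연래미안",
    "세븐일레븐 부산대연중앙점",
]
counts = word_counts(sample)
print(counts.most_common(1))  # [('gs25', 2)]
```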

Most occurring characters

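Value | Count | Frequency (%)
(space) | 200 | 6.5%
(not preserved) | 187 | 6.1%
(not preserved) | 146 | 4.8%
(not preserved) | 137 | 4.5%
(not preserved) | 109 | 3.6%
2 | 76 | 2.5%
(not preserved) | 73 | 2.4%
(not preserved) | 73 | 2.4%
(not preserved) | 64 | 2.1%
(not preserved) | 64 | 2.1%
Other values (274) | 1940 | 63.2%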

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2522 | 82.2%
Space Separator | 200 | 6.5%
Decimal Number | 161 | 5.2%
Uppercase Letter | 129 | 4.2%
Other Symbol | 20 | 0.7%
Open Punctuation | 18 | 0.6%
Close Punctuation | 18 | 0.6%
Other Punctuation | 1 | < 0.1%

Most frequent character per category

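Other Letter
Value | Count | Frequency (%)
(not preserved) | 187 | 7.4%
(not preserved) | 146 | 5.8%
(not preserved) | 137 | 5.4%
(not preserved) | 109 | 4.3%
(not preserved) | 73 | 2.9%
(not preserved) | 73 | 2.9%
(not preserved) | 64 | 2.5%
(not preserved) | 64 | 2.5%
(not preserved) | 59 | 2.3%
(not preserved) | 58 | 2.3%
Other values (252) | 1552 | 61.5%

Uppercase Letter
Value | Count | Frequency (%)
S | 54 | 41.9%
G | 48 | 37.2%
K | 8 | 6.2%
C | 8 | 6.2%
U | 5 | 3.9%
J | 3 | 2.3%
M | 1 | 0.8%
N | 1 | 0.8%
P | 1 | 0.8%

Decimal Number
Value | Count | Frequency (%)
2 | 76 | 47.2%
5 | 52 | 32.3%
4 | 20 | 12.4%
3 | 4 | 2.5%
1 | 3 | 1.9%
8 | 3 | 1.9%
7 | 2 | 1.2%
6 | 1 | 0.6%

Space Separator
Value | Count | Frequency (%)
(space) | 200 | 100.0%

Other Symbol
Value | Count | Frequency (%)
㈜ | 20 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 18 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 18 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 1 | 100.0%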

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2542 | 82.8%
Common | 398 | 13.0%
Latin | 129 | 4.2%

Most frequent character per script

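Hangul
Value | Count | Frequency (%)
(not preserved) | 187 | 7.4%
(not preserved) | 146 | 5.7%
(not preserved) | 137 | 5.4%
(not preserved) | 109 | 4.3%
(not preserved) | 73 | 2.9%
(not preserved) | 73 | 2.9%
(not preserved) | 64 | 2.5%
(not preserved) | 64 | 2.5%
(not preserved) | 59 | 2.3%
(not preserved) | 58 | 2.3%
Other values (253) | 1572 | 61.8%

Common
Value | Count | Frequency (%)
(space) | 200 | 50.3%
2 | 76 | 19.1%
5 | 52 | 13.1%
4 | 20 | 5.0%
( | 18 | 4.5%
) | 18 | 4.5%
3 | 4 | 1.0%
1 | 3 | 0.8%
8 | 3 | 0.8%
7 | 2 | 0.5%
Other values (2) | 2 | 0.5%

Latin
Value | Count | Frequency (%)
S | 54 | 41.9%
G | 48 | 37.2%
K | 8 | 6.2%
C | 8 | 6.2%
U | 5 | 3.9%
J | 3 | 2.3%
M | 1 | 0.8%
N | 1 | 0.8%
P | 1 | 0.8%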

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2522 | 82.2%
ASCII | 527 | 17.2%
None | 20 | 0.7%

Most frequent character per block

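ASCII
Value | Count | Frequency (%)
(space) | 200 | 38.0%
2 | 76 | 14.4%
S | 54 | 10.2%
5 | 52 | 9.9%
G | 48 | 9.1%
4 | 20 | 3.8%
( | 18 | 3.4%
) | 18 | 3.4%
K | 8 | 1.5%
C | 8 | 1.5%
Other values (11) | 25 | 4.7%

Hangul
Value | Count | Frequency (%)
(not preserved) | 187 | 7.4%
(not preserved) | 146 | 5.8%
(not preserved) | 137 | 5.4%
(not preserved) | 109 | 4.3%
(not preserved) | 73 | 2.9%
(not preserved) | 73 | 2.9%
(not preserved) | 64 | 2.5%
(not preserved) | 64 | 2.5%
(not preserved) | 59 | 2.3%
(not preserved) | 58 | 2.3%
Other values (252) | 1552 | 61.5%

None
Value | Count | Frequency (%)
㈜ | 20 | 100.0%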

도로명주소 (road-name address)
Text

Distinct: 386
Distinct (%): 99.5%
Missing: 0
Missing (%): 0.0%
Memory size: 3.2 KiB

Length

Max length: 45
Median length: 36
Mean length: 13.386598
Min length: 5

Characters and Unicode

Total characters: 5194
Distinct characters: 188
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 385
Unique (%): 99.2%

Sample

1st row: 수영로 244
2nd row: 수영로196번길 23
3rd row: 수영로196번길 31
4th row: 수영로208번길 15, 105호
5th row: 수영로220번길 21

Value | Count | Frequency (%)
1층 | 42 | 4.2%
수영로 | 22 | 2.2%
분포로 | 15 | 1.5%
용호로 | 14 | 1.4%
유엔로 | 11 | 1.1%
동명로 | 11 | 1.1%
진남로 | 10 | 1.0%
전포대로 | 10 | 1.0%
석포로 | 9 | 0.9%
유엔평화로 | 9 | 0.9%
Other values (488) | 846 | 84.7%

Most occurring characters

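Value | Count | Frequency (%)
(space) | 620 | 11.9%
1 | 534 | 10.3%
(not preserved) | 391 | 7.5%
2 | 239 | 4.6%
(not preserved) | 204 | 3.9%
(not preserved) | 198 | 3.8%
0 | 184 | 3.5%
3 | 165 | 3.2%
, | 149 | 2.9%
4 | 145 | 2.8%
Other values (178) | 2365 | 45.5%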

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2387 | 46.0%
Decimal Number | 1856 | 35.7%
Space Separator | 620 | 11.9%
Other Punctuation | 150 | 2.9%
Dash Punctuation | 56 | 1.1%
Close Punctuation | 43 | 0.8%
Open Punctuation | 43 | 0.8%
Uppercase Letter | 35 | 0.7%
Lowercase Letter | 3 | 0.1%
Math Symbol | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not preserved) | 391 | 16.4%
(not preserved) | 204 | 8.5%
(not preserved) | 198 | 8.3%
(not preserved) | 118 | 4.9%
(not preserved) | 117 | 4.9%
(not preserved) | 65 | 2.7%
(not preserved) | 62 | 2.6%
(not preserved) | 58 | 2.4%
(not preserved) | 57 | 2.4%
(not preserved) | 52 | 2.2%
Other values (149) | 1065 | 44.6%

Decimal Number
Value | Count | Frequency (%)
1 | 534 | 28.8%
2 | 239 | 12.9%
0 | 184 | 9.9%
3 | 165 | 8.9%
4 | 145 | 7.8%
6 | 143 | 7.7%
5 | 131 | 7.1%
9 | 124 | 6.7%
7 | 98 | 5.3%
8 | 93 | 5.0%

Uppercase Letter
Value | Count | Frequency (%)
B | 9 | 25.7%
G | 5 | 14.3%
S | 5 | 14.3%
A | 4 | 11.4%
K | 3 | 8.6%
L | 3 | 8.6%
F | 2 | 5.7%
I | 2 | 5.7%
C | 2 | 5.7%

Lowercase Letter
Value | Count | Frequency (%)
e | 1 | 33.3%
s | 1 | 33.3%
k | 1 | 33.3%

Other Punctuation
Value | Count | Frequency (%)
, | 149 | 99.3%
. | 1 | 0.7%

Space Separator
Value | Count | Frequency (%)
(space) | 620 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 56 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 43 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 43 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 2769 | 53.3%
Hangul | 2387 | 46.0%
Latin | 38 | 0.7%

Most frequent character per script

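Hangul
Value | Count | Frequency (%)
(not preserved) | 391 | 16.4%
(not preserved) | 204 | 8.5%
(not preserved) | 198 | 8.3%
(not preserved) | 118 | 4.9%
(not preserved) | 117 | 4.9%
(not preserved) | 65 | 2.7%
(not preserved) | 62 | 2.6%
(not preserved) | 58 | 2.4%
(not preserved) | 57 | 2.4%
(not preserved) | 52 | 2.2%
Other values (149) | 1065 | 44.6%

Common
Value | Count | Frequency (%)
(space) | 620 | 22.4%
1 | 534 | 19.3%
2 | 239 | 8.6%
0 | 184 | 6.6%
3 | 165 | 6.0%
, | 149 | 5.4%
4 | 145 | 5.2%
6 | 143 | 5.2%
5 | 131 | 4.7%
9 | 124 | 4.5%
Other values (7) | 335 | 12.1%

Latin
Value | Count | Frequency (%)
B | 9 | 23.7%
G | 5 | 13.2%
S | 5 | 13.2%
A | 4 | 10.5%
K | 3 | 7.9%
L | 3 | 7.9%
F | 2 | 5.3%
I | 2 | 5.3%
C | 2 | 5.3%
e | 1 | 2.6%
Other values (2) | 2 | 5.3%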

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 2807 | 54.0%
Hangul | 2387 | 46.0%

Most frequent character per block

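ASCII
Value | Count | Frequency (%)
(space) | 620 | 22.1%
1 | 534 | 19.0%
2 | 239 | 8.5%
0 | 184 | 6.6%
3 | 165 | 5.9%
, | 149 | 5.3%
4 | 145 | 5.2%
6 | 143 | 5.1%
5 | 131 | 4.7%
9 | 124 | 4.4%
Other values (19) | 373 | 13.3%

Hangul
Value | Count | Frequency (%)
(not preserved) | 391 | 16.4%
(not preserved) | 204 | 8.5%
(not preserved) | 198 | 8.3%
(not preserved) | 118 | 4.9%
(not preserved) | 117 | 4.9%
(not preserved) | 65 | 2.7%
(not preserved) | 62 | 2.6%
(not preserved) | 58 | 2.4%
(not preserved) | 57 | 2.4%
(not preserved) | 52 | 2.2%
Other values (149) | 1065 | 44.6%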

전화번호 (phone number)
Text

MISSING

Distinct: 214
Distinct (%): 99.5%
Missing: 173
Missing (%): 44.6%
Memory size: 3.2 KiB

Length

Max length: 12
Median length: 8
Mean length: 8.2604651
Min length: 8

Characters and Unicode

Total characters: 1776
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
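
The mean length reported for this variable is consistent with the total character count divided by the number of non-missing values. A quick arithmetic check, using only figures from this report (388 observations, 173 of them missing a phone number); the variable names are illustrative:

```python
n_observations = 388
n_missing = 173
total_characters = 1776

n_present = n_observations - n_missing      # 215 rows with a phone number
mean_length = total_characters / n_present  # average characters per value

print(round(mean_length, 7))  # 8.2604651
```

The sample values show both lengths that occur at the extremes: `625-5837` has 8 characters, while `051-242-2426`, which includes the 051 prefix, has 12.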

Unique

Unique: 213
Unique (%): 99.1%

Sample

1st row: 625-5837
2nd row: 621-2984
3rd row: 051-242-2426
4th row: 626-7737
5th row: 628-4989

Value | Count | Frequency (%)
051-929-5599 | 2 | 0.9%
621-2292 | 1 | 0.5%
637-5666 | 1 | 0.5%
633-0933 | 1 | 0.5%
633-1141 | 1 | 0.5%
730-6580 | 1 | 0.5%
644-4394 | 1 | 0.5%
633-9630 | 1 | 0.5%
642-4148 | 1 | 0.5%
631-4980 | 1 | 0.5%
Other values (204) | 204 | 94.9%

Most occurring characters

Value | Count | Frequency (%)
6 | 297 | 16.7%
- | 229 | 12.9%
2 | 226 | 12.7%
3 | 157 | 8.8%
1 | 154 | 8.7%
4 | 147 | 8.3%
5 | 142 | 8.0%
0 | 117 | 6.6%
7 | 115 | 6.5%
8 | 105 | 5.9%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 1547 | 87.1%
Dash Punctuation | 229 | 12.9%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
6 | 297 | 19.2%
2 | 226 | 14.6%
3 | 157 | 10.1%
1 | 154 | 10.0%
4 | 147 | 9.5%
5 | 142 | 9.2%
0 | 117 | 7.6%
7 | 115 | 7.4%
8 | 105 | 6.8%
9 | 87 | 5.6%

Dash Punctuation
Value | Count | Frequency (%)
- | 229 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 1776 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
6 | 297 | 16.7%
- | 229 | 12.9%
2 | 226 | 12.7%
3 | 157 | 8.8%
1 | 154 | 8.7%
4 | 147 | 8.3%
5 | 142 | 8.0%
0 | 117 | 6.6%
7 | 115 | 6.5%
8 | 105 | 5.9%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 1776 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
6 | 297 | 16.7%
- | 229 | 12.9%
2 | 226 | 12.7%
3 | 157 | 8.8%
1 | 154 | 8.7%
4 | 147 | 8.3%
5 | 142 | 8.0%
0 | 117 | 6.6%
7 | 115 | 6.5%
8 | 105 | 5.9%

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display that lets you quickly pick out patterns in data completion.]
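
Nullity per column can also be counted directly. A small sketch over three rows taken from this report's sample, with `None` standing in for `<NA>`; the row literals are a hand-copied subset and the helper name `missing_per_column` is illustrative:

```python
COLUMNS = ["행정동", "상호명", "도로명주소", "전화번호"]

# Three rows from the report's sample; None stands in for <NA>.
rows = [
    ("대연1동", "GS25 대연중앙점", "수영로 244", None),
    ("대연1동", "한진사", "수영로196번길 23", "625-5837"),
    ("대연1동", "세븐일레븐 부산대연중앙점", "수영로220번길 21", None),
]

def missing_per_column(rows, columns):
    """Return {column: number of missing (None) entries in that column}."""
    return {
        col: sum(1 for row in rows if row[i] is None)
        for i, col in enumerate(columns)
    }

missing = missing_per_column(rows, COLUMNS)
print(missing["전화번호"])  # 2
```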

Sample

Index | 행정동 | 상호명 | 도로명주소 | 전화번호
0 | 대연1동 | GS25 대연중앙점 | 수영로 244 | <NA>
1 | 대연1동 | 한진사 | 수영로196번길 23 | 625-5837
2 | 대연1동 | 드림마트 | 수영로196번길 31 | 621-2984
3 | 대연1동 | GS25 대연래미안 | 수영로208번길 15, 105호 | 051-242-2426
4 | 대연1동 | 세븐일레븐 부산대연중앙점 | 수영로220번길 21 | <NA>
5 | 대연1동 | 씨유 대연엔젤점 | 수영로220번길 8 | 626-7737
6 | 대연1동 | GS25 대연용소점 | 수영로250번길 11-2 | <NA>
7 | 대연1동 | 경성두배로마트 | 수영로266번길 23-1 | 628-4989
8 | 대연1동 | ㈜킹스마트 | 수영로266번길 36 | 623-0012
9 | 대연1동 | GS25 대연원빌점 | 용소로64번길 140, 101호 | 624-6093

Index | 행정동 | 상호명 | 도로명주소 | 전화번호
378 | 문현4동 | 뉴빅세일마트 | 지게골로 111, 2층 | <NA>
379 | 문현4동 | 천령상회 | 지게골로 28 | 642-3947
380 | 문현4동 | 롯데쇼핑㈜ 롯데슈퍼 부산문현점 | 지게골로 45 | 636-5601
381 | 문현4동 | 동경슈퍼 | 지게골로 66 | 643-3978
382 | 문현4동 | 문현쇼핑몰 | 지게골로37 | <NA>
383 | 문현4동 | 주식회사 대웅식품 문현점 | 우암로 359 | 051-639-9696
384 | 문현4동 | 지에스25 부산곱창골목점 | 지게골로 33-4 | <NA>
385 | 문현4동 | 아이스크림특공대 | 지게골로 38 | <NA>
386 | 문현4동 | 한별슈퍼마켓 | 동제당로 170 | 647-8929
387 | 문현4동 | 세븐일레븐 부산문현장고개점 | 장고개로 105 | <NA>