Overview

Dataset statistics

Number of variables4
Number of observations400
Missing cells190
Missing cells (%)11.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory12.6 KiB
Average record size in memory32.3 B

Variable types

Categorical1
Text3

Dataset

Description부산광역시남구_쓰레기종량제봉투_판매소_현황_20220616
Author부산광역시 남구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15045459

Alerts

전화번호 has 190 (47.5%) missing valuesMissing

Reproduction

Analysis started2023-12-10 17:19:11.427926
Analysis finished2023-12-10 17:19:12.249800
Duration0.82 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

행정동
Categorical

Distinct18
Distinct (%)4.5%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
대연3동
53 
용호1동
45 
대연1동
44 
문현1동
25 
용호3동
24 
Other values (13)
209 

Length

Max length5
Median length4
Mean length3.94
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대연1동
2nd row대연1동
3rd row대연1동
4th row대연1동
5th row대연1동

Common Values

ValueCountFrequency (%)
대연3동 53
13.2%
용호1동 45
11.2%
대연1동 44
11.0%
문현1동 25
 
6.2%
용호3동 24
 
6.0%
대연5동 22
 
5.5%
감만1동 21
 
5.2%
문현3동 21
 
5.2%
문현2동 21
 
5.2%
용호2동 20
 
5.0%
Other values (8) 104
26.0%

Length

2023-12-11T02:19:12.435646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
대연3동 53
13.2%
용호1동 45
11.2%
대연1동 44
11.0%
대연5동 27
 
6.8%
문현1동 25
 
6.2%
용호3동 24
 
6.0%
문현3동 21
 
5.2%
문현2동 21
 
5.2%
감만1동 21
 
5.2%
용호2동 20
 
5.0%
Other values (7) 99
24.8%
Distinct390
Distinct (%)97.5%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
2023-12-11T02:19:13.055372image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length15
Mean length7.9625
Min length2

Characters and Unicode

Total characters3185
Distinct characters286
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique382 ?
Unique (%)95.5%

Sample

1st rowGS25 대연중앙점
2nd row한진사
3rd row드림마트
4th rowGS25 대연래미안
5th row세븐일레븐 부산대연중앙점
ValueCountFrequency (%)
gs25 47
 
7.8%
씨유 36
 
6.0%
세븐일레븐 35
 
5.8%
이마트24 20
 
3.3%
미니스톱 8
 
1.3%
문현점 6
 
1.0%
주식회사 4
 
0.7%
대연점 4
 
0.7%
㈜코리아세븐 4
 
0.7%
현대마트 4
 
0.7%
Other values (402) 437
72.2%
2023-12-11T02:19:14.440970image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
211
 
6.6%
200
 
6.3%
144
 
4.5%
134
 
4.2%
115
 
3.6%
2 83
 
2.6%
81
 
2.5%
76
 
2.4%
69
 
2.2%
69
 
2.2%
Other values (276) 2003
62.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2614
82.1%
Space Separator 211
 
6.6%
Decimal Number 170
 
5.3%
Uppercase Letter 137
 
4.3%
Other Symbol 20
 
0.6%
Open Punctuation 16
 
0.5%
Close Punctuation 16
 
0.5%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
200
 
7.7%
144
 
5.5%
134
 
5.1%
115
 
4.4%
81
 
3.1%
76
 
2.9%
69
 
2.6%
69
 
2.6%
63
 
2.4%
56
 
2.1%
Other values (254) 1607
61.5%
Uppercase Letter
ValueCountFrequency (%)
S 59
43.1%
G 52
38.0%
K 8
 
5.8%
C 7
 
5.1%
U 5
 
3.6%
J 2
 
1.5%
P 1
 
0.7%
N 1
 
0.7%
M 1
 
0.7%
R 1
 
0.7%
Decimal Number
ValueCountFrequency (%)
2 83
48.8%
5 56
32.9%
4 23
 
13.5%
3 3
 
1.8%
1 2
 
1.2%
8 2
 
1.2%
6 1
 
0.6%
Space Separator
ValueCountFrequency (%)
211
100.0%
Other Symbol
ValueCountFrequency (%)
20
100.0%
Open Punctuation
ValueCountFrequency (%)
( 16
100.0%
Close Punctuation
ValueCountFrequency (%)
) 16
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2634
82.7%
Common 414
 
13.0%
Latin 137
 
4.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
200
 
7.6%
144
 
5.5%
134
 
5.1%
115
 
4.4%
81
 
3.1%
76
 
2.9%
69
 
2.6%
69
 
2.6%
63
 
2.4%
56
 
2.1%
Other values (255) 1627
61.8%
Common
ValueCountFrequency (%)
211
51.0%
2 83
 
20.0%
5 56
 
13.5%
4 23
 
5.6%
( 16
 
3.9%
) 16
 
3.9%
3 3
 
0.7%
1 2
 
0.5%
8 2
 
0.5%
, 1
 
0.2%
Latin
ValueCountFrequency (%)
S 59
43.1%
G 52
38.0%
K 8
 
5.8%
C 7
 
5.1%
U 5
 
3.6%
J 2
 
1.5%
P 1
 
0.7%
N 1
 
0.7%
M 1
 
0.7%
R 1
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2614
82.1%
ASCII 551
 
17.3%
None 20
 
0.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
211
38.3%
2 83
 
15.1%
S 59
 
10.7%
5 56
 
10.2%
G 52
 
9.4%
4 23
 
4.2%
( 16
 
2.9%
) 16
 
2.9%
K 8
 
1.5%
C 7
 
1.3%
Other values (11) 20
 
3.6%
Hangul
ValueCountFrequency (%)
200
 
7.7%
144
 
5.5%
134
 
5.1%
115
 
4.4%
81
 
3.1%
76
 
2.9%
69
 
2.6%
69
 
2.6%
63
 
2.4%
56
 
2.1%
Other values (254) 1607
61.5%
None
ValueCountFrequency (%)
20
100.0%
Distinct398
Distinct (%)99.5%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
2023-12-11T02:19:15.123019image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length35
Mean length13.0925
Min length5

Characters and Unicode

Total characters5237
Distinct characters182
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique397 ?
Unique (%)99.2%

Sample

1st row수영로 244
2nd row수영로196번길 23
3rd row수영로196번길 31
4th row수영로208번길 15, 105호
5th row수영로220번길 21
ValueCountFrequency (%)
1층 46
 
4.5%
수영로 24
 
2.4%
용호로 15
 
1.5%
분포로 15
 
1.5%
동명로 11
 
1.1%
진남로 11
 
1.1%
유엔평화로 11
 
1.1%
유엔로 11
 
1.1%
전포대로 10
 
1.0%
85 9
 
0.9%
Other values (490) 857
84.0%
2023-12-11T02:19:16.115035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
629
 
12.0%
1 538
 
10.3%
403
 
7.7%
2 242
 
4.6%
211
 
4.0%
204
 
3.9%
0 185
 
3.5%
3 166
 
3.2%
, 153
 
2.9%
4 150
 
2.9%
Other values (172) 2356
45.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2399
45.8%
Decimal Number 1883
36.0%
Space Separator 629
 
12.0%
Other Punctuation 154
 
2.9%
Dash Punctuation 58
 
1.1%
Close Punctuation 39
 
0.7%
Open Punctuation 39
 
0.7%
Uppercase Letter 32
 
0.6%
Lowercase Letter 3
 
0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
403
 
16.8%
211
 
8.8%
204
 
8.5%
118
 
4.9%
116
 
4.8%
67
 
2.8%
62
 
2.6%
60
 
2.5%
57
 
2.4%
53
 
2.2%
Other values (143) 1048
43.7%
Decimal Number
ValueCountFrequency (%)
1 538
28.6%
2 242
12.9%
0 185
 
9.8%
3 166
 
8.8%
4 150
 
8.0%
6 147
 
7.8%
9 131
 
7.0%
5 130
 
6.9%
7 99
 
5.3%
8 95
 
5.0%
Uppercase Letter
ValueCountFrequency (%)
B 9
28.1%
G 5
15.6%
A 4
12.5%
L 3
 
9.4%
S 3
 
9.4%
C 3
 
9.4%
F 2
 
6.2%
I 2
 
6.2%
K 1
 
3.1%
Lowercase Letter
ValueCountFrequency (%)
e 1
33.3%
s 1
33.3%
k 1
33.3%
Other Punctuation
ValueCountFrequency (%)
, 153
99.4%
. 1
 
0.6%
Space Separator
ValueCountFrequency (%)
629
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 58
100.0%
Close Punctuation
ValueCountFrequency (%)
) 39
100.0%
Open Punctuation
ValueCountFrequency (%)
( 39
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2803
53.5%
Hangul 2399
45.8%
Latin 35
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
403
 
16.8%
211
 
8.8%
204
 
8.5%
118
 
4.9%
116
 
4.8%
67
 
2.8%
62
 
2.6%
60
 
2.5%
57
 
2.4%
53
 
2.2%
Other values (143) 1048
43.7%
Common
ValueCountFrequency (%)
629
22.4%
1 538
19.2%
2 242
 
8.6%
0 185
 
6.6%
3 166
 
5.9%
, 153
 
5.5%
4 150
 
5.4%
6 147
 
5.2%
9 131
 
4.7%
5 130
 
4.6%
Other values (7) 332
11.8%
Latin
ValueCountFrequency (%)
B 9
25.7%
G 5
14.3%
A 4
11.4%
L 3
 
8.6%
S 3
 
8.6%
C 3
 
8.6%
F 2
 
5.7%
I 2
 
5.7%
e 1
 
2.9%
K 1
 
2.9%
Other values (2) 2
 
5.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2838
54.2%
Hangul 2399
45.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
629
22.2%
1 538
19.0%
2 242
 
8.5%
0 185
 
6.5%
3 166
 
5.8%
, 153
 
5.4%
4 150
 
5.3%
6 147
 
5.2%
9 131
 
4.6%
5 130
 
4.6%
Other values (19) 367
12.9%
Hangul
ValueCountFrequency (%)
403
 
16.8%
211
 
8.8%
204
 
8.5%
118
 
4.9%
116
 
4.8%
67
 
2.8%
62
 
2.6%
60
 
2.5%
57
 
2.4%
53
 
2.2%
Other values (143) 1048
43.7%

전화번호
Text

MISSING 

Distinct209
Distinct (%)99.5%
Missing190
Missing (%)47.5%
Memory size3.3 KiB
2023-12-11T02:19:16.615737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length12.009524
Min length12

Characters and Unicode

Total characters2522
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique208 ?
Unique (%)99.0%

Sample

1st row051-625-5837
2nd row051-621-2984
3rd row051-242-2426
4th row051-626-7737
5th row051-628-4989
ValueCountFrequency (%)
051-929-5599 2
 
1.0%
051-624-8355 1
 
0.5%
051-645-7702 1
 
0.5%
051-632-5361 1
 
0.5%
051-632-9201 1
 
0.5%
051-644-4394 1
 
0.5%
051-633-9630 1
 
0.5%
051-642-4148 1
 
0.5%
051-631-4980 1
 
0.5%
051-646-3576 1
 
0.5%
Other values (199) 199
94.8%
2023-12-11T02:19:17.359021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 421
16.7%
1 343
13.6%
5 338
13.4%
0 310
12.3%
6 292
11.6%
2 221
8.8%
3 154
 
6.1%
4 146
 
5.8%
7 114
 
4.5%
8 99
 
3.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2101
83.3%
Dash Punctuation 421
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 343
16.3%
5 338
16.1%
0 310
14.8%
6 292
13.9%
2 221
10.5%
3 154
7.3%
4 146
6.9%
7 114
 
5.4%
8 99
 
4.7%
9 84
 
4.0%
Dash Punctuation
ValueCountFrequency (%)
- 421
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2522
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 421
16.7%
1 343
13.6%
5 338
13.4%
0 310
12.3%
6 292
11.6%
2 221
8.8%
3 154
 
6.1%
4 146
 
5.8%
7 114
 
4.5%
8 99
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2522
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 421
16.7%
1 343
13.6%
5 338
13.4%
0 310
12.3%
6 292
11.6%
2 221
8.8%
3 154
 
6.1%
4 146
 
5.8%
7 114
 
4.5%
8 99
 
3.9%

Missing values

2023-12-11T02:19:11.981786image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T02:19:12.167491image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

행정동상호명도로명주소전화번호
0대연1동GS25 대연중앙점수영로 244<NA>
1대연1동한진사수영로196번길 23051-625-5837
2대연1동드림마트수영로196번길 31051-621-2984
3대연1동GS25 대연래미안수영로208번길 15, 105호051-242-2426
4대연1동세븐일레븐 부산대연중앙점수영로220번길 21<NA>
5대연1동씨유 대연엔젤점수영로220번길 8051-626-7737
6대연1동GS25 대연용소점수영로250번길 11-2<NA>
7대연1동경성두배로마트수영로274-16051-628-4989
8대연1동㈜킹스마트수영로266번길 36051-623-0012
9대연1동GS25 대연원빌점용소로64번길 140, 101호051-624-6093
행정동상호명도로명주소전화번호
390문현4동천령상회지게골로 28051-642-3947
391문현4동롯데쇼핑㈜ 롯데슈퍼 부산문현점지게골로 45051-636-5601
392문현4동동경슈퍼지게골로 66051-643-3978
393문현4동문현쇼핑몰지게골로37<NA>
394문현4동주식회사 대웅식품 문현점우암로 359051-639-9696
395문현4동지에스25 부산곱창골목점지게골로 33-4<NA>
396문현4동아이스크림특공대지게골로 38<NA>
397문현4동한별슈퍼마켓동제당로 170051-647-8929
398문현4동세븐일레븐 부산문현장고개점장고개로 105<NA>
399문현4동공오일마트 주식회사수영로 30-9<NA>