Overview

Dataset statistics

Number of variables4
Number of observations389
Missing cells1
Missing cells (%)0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory12.3 KiB
Average record size in memory32.3 B

Variable types

Categorical1
Text3

Dataset

Description부산광역시남구_쓰레기종량제봉투_판매소_현황_20230109
Author부산광역시 남구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15045459

Reproduction

Analysis started2023-12-10 17:19:04.166948
Analysis finished2023-12-10 17:19:04.965518
Duration0.8 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

행정동
Categorical

Distinct18
Distinct (%)4.6%
Missing0
Missing (%)0.0%
Memory size3.2 KiB
대연3동
52 
용호1동
45 
대연1동
41 
대연5동
25 
문현1동
25 
Other values (13)
201 

Length

Max length5
Median length4
Mean length3.933162
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대연1동
2nd row대연1동
3rd row대연1동
4th row대연1동
5th row대연1동

Common Values

ValueCountFrequency (%)
대연3동 52
13.4%
용호1동 45
11.6%
대연1동 41
10.5%
대연5동 25
 
6.4%
문현1동 25
 
6.4%
문현3동 21
 
5.4%
용호3동 21
 
5.4%
감만1동 20
 
5.1%
문현2동 20
 
5.1%
용호2동 19
 
4.9%
Other values (8) 100
25.7%

Length

2023-12-11T02:19:05.168256image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
대연3동 52
13.4%
용호1동 45
11.6%
대연1동 41
10.5%
대연5동 28
 
7.2%
문현1동 25
 
6.4%
문현3동 21
 
5.4%
용호3동 21
 
5.4%
문현2동 20
 
5.1%
감만1동 20
 
5.1%
용호2동 19
 
4.9%
Other values (7) 97
24.9%
Distinct381
Distinct (%)97.9%
Missing0
Missing (%)0.0%
Memory size3.2 KiB
2023-12-11T02:19:05.685339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length14
Mean length8.0437018
Min length3

Characters and Unicode

Total characters3129
Distinct characters289
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique375 ?
Unique (%)96.4%

Sample

1st rowGS25 대연중앙점
2nd row한진사
3rd row드림마트
4th rowGS25 대연래미안
5th row세븐일레븐 부산대연중앙점
ValueCountFrequency (%)
gs25 45
 
7.5%
씨유 43
 
7.2%
세븐일레븐 40
 
6.7%
이마트24 19
 
3.2%
문현점 6
 
1.0%
대연점 5
 
0.8%
현대마트 4
 
0.7%
미니스톱 3
 
0.5%
탑플러스마트 3
 
0.5%
㈜코리아세븐 3
 
0.5%
Other values (392) 426
71.4%
2023-12-11T02:19:06.506611image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
213
 
6.8%
205
 
6.6%
133
 
4.3%
124
 
4.0%
114
 
3.6%
88
 
2.8%
2 80
 
2.6%
80
 
2.6%
72
 
2.3%
71
 
2.3%
Other values (279) 1949
62.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2579
82.4%
Space Separator 213
 
6.8%
Decimal Number 161
 
5.1%
Uppercase Letter 131
 
4.2%
Other Symbol 18
 
0.6%
Close Punctuation 13
 
0.4%
Open Punctuation 13
 
0.4%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
205
 
7.9%
133
 
5.2%
124
 
4.8%
114
 
4.4%
88
 
3.4%
80
 
3.1%
72
 
2.8%
71
 
2.8%
66
 
2.6%
61
 
2.4%
Other values (257) 1565
60.7%
Uppercase Letter
ValueCountFrequency (%)
S 56
42.7%
G 50
38.2%
C 7
 
5.3%
K 6
 
4.6%
U 5
 
3.8%
J 2
 
1.5%
M 1
 
0.8%
E 1
 
0.8%
R 1
 
0.8%
N 1
 
0.8%
Decimal Number
ValueCountFrequency (%)
2 80
49.7%
5 53
32.9%
4 22
 
13.7%
1 2
 
1.2%
3 2
 
1.2%
8 2
 
1.2%
Space Separator
ValueCountFrequency (%)
213
100.0%
Other Symbol
ValueCountFrequency (%)
18
100.0%
Close Punctuation
ValueCountFrequency (%)
) 13
100.0%
Open Punctuation
ValueCountFrequency (%)
( 13
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2597
83.0%
Common 401
 
12.8%
Latin 131
 
4.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
205
 
7.9%
133
 
5.1%
124
 
4.8%
114
 
4.4%
88
 
3.4%
80
 
3.1%
72
 
2.8%
71
 
2.7%
66
 
2.5%
61
 
2.3%
Other values (258) 1583
61.0%
Latin
ValueCountFrequency (%)
S 56
42.7%
G 50
38.2%
C 7
 
5.3%
K 6
 
4.6%
U 5
 
3.8%
J 2
 
1.5%
M 1
 
0.8%
E 1
 
0.8%
R 1
 
0.8%
N 1
 
0.8%
Common
ValueCountFrequency (%)
213
53.1%
2 80
 
20.0%
5 53
 
13.2%
4 22
 
5.5%
) 13
 
3.2%
( 13
 
3.2%
1 2
 
0.5%
3 2
 
0.5%
8 2
 
0.5%
, 1
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2579
82.4%
ASCII 532
 
17.0%
None 18
 
0.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
213
40.0%
2 80
 
15.0%
S 56
 
10.5%
5 53
 
10.0%
G 50
 
9.4%
4 22
 
4.1%
) 13
 
2.4%
( 13
 
2.4%
C 7
 
1.3%
K 6
 
1.1%
Other values (11) 19
 
3.6%
Hangul
ValueCountFrequency (%)
205
 
7.9%
133
 
5.2%
124
 
4.8%
114
 
4.4%
88
 
3.4%
80
 
3.1%
72
 
2.8%
71
 
2.8%
66
 
2.6%
61
 
2.4%
Other values (257) 1565
60.7%
None
ValueCountFrequency (%)
18
100.0%
Distinct388
Distinct (%)99.7%
Missing0
Missing (%)0.0%
Memory size3.2 KiB
2023-12-11T02:19:07.044822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length35
Mean length13.33419
Min length5

Characters and Unicode

Total characters5187
Distinct characters189
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique387 ?
Unique (%)99.5%

Sample

1st row수영로 244
2nd row수영로196번길 23
3rd row수영로196번길 31
4th row수영로208번길 15, 105호
5th row수영로220번길 21
ValueCountFrequency (%)
1층 46
 
4.5%
수영로 25
 
2.5%
분포로 15
 
1.5%
용호로 13
 
1.3%
동명로 11
 
1.1%
유엔로 11
 
1.1%
유엔평화로 11
 
1.1%
전포대로 10
 
1.0%
진남로 10
 
1.0%
못골로 9
 
0.9%
Other values (492) 851
84.1%
2023-12-11T02:19:07.996021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
632
 
12.2%
1 539
 
10.4%
392
 
7.6%
2 242
 
4.7%
203
 
3.9%
0 195
 
3.8%
194
 
3.7%
3 167
 
3.2%
, 162
 
3.1%
4 146
 
2.8%
Other values (179) 2315
44.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2365
45.6%
Decimal Number 1862
35.9%
Space Separator 632
 
12.2%
Other Punctuation 163
 
3.1%
Dash Punctuation 57
 
1.1%
Close Punctuation 36
 
0.7%
Open Punctuation 36
 
0.7%
Uppercase Letter 32
 
0.6%
Lowercase Letter 3
 
0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
392
 
16.6%
203
 
8.6%
194
 
8.2%
117
 
4.9%
115
 
4.9%
62
 
2.6%
61
 
2.6%
60
 
2.5%
53
 
2.2%
52
 
2.2%
Other values (150) 1056
44.7%
Decimal Number
ValueCountFrequency (%)
1 539
28.9%
2 242
13.0%
0 195
 
10.5%
3 167
 
9.0%
4 146
 
7.8%
6 134
 
7.2%
5 127
 
6.8%
9 124
 
6.7%
7 100
 
5.4%
8 88
 
4.7%
Uppercase Letter
ValueCountFrequency (%)
B 9
28.1%
G 5
15.6%
A 4
12.5%
S 3
 
9.4%
L 3
 
9.4%
C 3
 
9.4%
F 2
 
6.2%
I 2
 
6.2%
K 1
 
3.1%
Lowercase Letter
ValueCountFrequency (%)
s 1
33.3%
k 1
33.3%
e 1
33.3%
Other Punctuation
ValueCountFrequency (%)
, 162
99.4%
. 1
 
0.6%
Space Separator
ValueCountFrequency (%)
632
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 57
100.0%
Close Punctuation
ValueCountFrequency (%)
) 36
100.0%
Open Punctuation
ValueCountFrequency (%)
( 36
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2787
53.7%
Hangul 2365
45.6%
Latin 35
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
392
 
16.6%
203
 
8.6%
194
 
8.2%
117
 
4.9%
115
 
4.9%
62
 
2.6%
61
 
2.6%
60
 
2.5%
53
 
2.2%
52
 
2.2%
Other values (150) 1056
44.7%
Common
ValueCountFrequency (%)
632
22.7%
1 539
19.3%
2 242
 
8.7%
0 195
 
7.0%
3 167
 
6.0%
, 162
 
5.8%
4 146
 
5.2%
6 134
 
4.8%
5 127
 
4.6%
9 124
 
4.4%
Other values (7) 319
11.4%
Latin
ValueCountFrequency (%)
B 9
25.7%
G 5
14.3%
A 4
11.4%
S 3
 
8.6%
L 3
 
8.6%
C 3
 
8.6%
F 2
 
5.7%
I 2
 
5.7%
K 1
 
2.9%
s 1
 
2.9%
Other values (2) 2
 
5.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2822
54.4%
Hangul 2365
45.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
632
22.4%
1 539
19.1%
2 242
 
8.6%
0 195
 
6.9%
3 167
 
5.9%
, 162
 
5.7%
4 146
 
5.2%
6 134
 
4.7%
5 127
 
4.5%
9 124
 
4.4%
Other values (19) 354
12.5%
Hangul
ValueCountFrequency (%)
392
 
16.6%
203
 
8.6%
194
 
8.2%
117
 
4.9%
115
 
4.9%
62
 
2.6%
61
 
2.6%
60
 
2.5%
53
 
2.2%
52
 
2.2%
Other values (150) 1056
44.7%
Distinct192
Distinct (%)49.5%
Missing1
Missing (%)0.3%
Memory size3.2 KiB
2023-12-11T02:19:08.600778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length6
Mean length8.9742268
Min length6

Characters and Unicode

Total characters3482
Distinct characters17
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique190 ?
Unique (%)49.0%

Sample

1st row개인정보포함
2nd row051-625-5837
3rd row051-621-2984
4th row051-242-2426
5th row개인정보포함
ValueCountFrequency (%)
개인정보포함 196
50.5%
051-929-5599 2
 
0.5%
051-624-0390 1
 
0.3%
051-633-1141 1
 
0.3%
051-635-8580 1
 
0.3%
051-633-9630 1
 
0.3%
051-642-4148 1
 
0.3%
051-631-4980 1
 
0.3%
051-646-3576 1
 
0.3%
051-632-8599 1
 
0.3%
Other values (182) 182
46.9%
2023-12-11T02:19:09.541398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 385
11.1%
1 318
 
9.1%
5 312
 
9.0%
0 281
 
8.1%
6 264
 
7.6%
2 199
 
5.7%
196
 
5.6%
196
 
5.6%
196
 
5.6%
196
 
5.6%
Other values (7) 939
27.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1921
55.2%
Other Letter 1176
33.8%
Dash Punctuation 385
 
11.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 318
16.6%
5 312
16.2%
0 281
14.6%
6 264
13.7%
2 199
10.4%
3 138
7.2%
4 134
7.0%
7 107
 
5.6%
8 93
 
4.8%
9 75
 
3.9%
Other Letter
ValueCountFrequency (%)
196
16.7%
196
16.7%
196
16.7%
196
16.7%
196
16.7%
196
16.7%
Dash Punctuation
ValueCountFrequency (%)
- 385
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2306
66.2%
Hangul 1176
33.8%

Most frequent character per script

Common
ValueCountFrequency (%)
- 385
16.7%
1 318
13.8%
5 312
13.5%
0 281
12.2%
6 264
11.4%
2 199
8.6%
3 138
 
6.0%
4 134
 
5.8%
7 107
 
4.6%
8 93
 
4.0%
Hangul
ValueCountFrequency (%)
196
16.7%
196
16.7%
196
16.7%
196
16.7%
196
16.7%
196
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2306
66.2%
Hangul 1176
33.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 385
16.7%
1 318
13.8%
5 312
13.5%
0 281
12.2%
6 264
11.4%
2 199
8.6%
3 138
 
6.0%
4 134
 
5.8%
7 107
 
4.6%
8 93
 
4.0%
Hangul
ValueCountFrequency (%)
196
16.7%
196
16.7%
196
16.7%
196
16.7%
196
16.7%
196
16.7%

Missing values

2023-12-11T02:19:04.687243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T02:19:04.879472image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

행정동상호명도로명주소전화번호
0대연1동GS25 대연중앙점수영로 244개인정보포함
1대연1동한진사수영로196번길 23051-625-5837
2대연1동드림마트수영로196번길 31051-621-2984
3대연1동GS25 대연래미안수영로208번길 15, 105호051-242-2426
4대연1동세븐일레븐 부산대연중앙점수영로220번길 21개인정보포함
5대연1동씨유 대연엔젤점수영로220번길 8051-626-7737
6대연1동GS25 대연용소점수영로250번길 11-2개인정보포함
7대연1동GS25 대연원빌점용소로64번길 140, 101호051-624-6093
8대연1동세븐일레븐 부산UN평화점용소로64번길 85개인정보포함
9대연1동씨유 대연대로점유엔로 121, 1층개인정보포함
행정동상호명도로명주소전화번호
379문현4동천령상회지게골로 28051-642-3947
380문현4동롯데쇼핑㈜ 롯데슈퍼 부산문현점지게골로 45051-636-5601
381문현4동동경슈퍼지게골로 66051-643-3978
382문현4동문현쇼핑몰지게골로37개인정보포함
383문현4동주식회사 대웅식품 문현점우암로 359051-639-9696
384문현4동지에스25 부산곱창골목점지게골로 33-4개인정보포함
385문현4동아이스크림특공대지게골로 38개인정보포함
386문현4동한별슈퍼마켓동제당로 170051-647-8929
387문현4동세븐일레븐 부산문현장고개점장고개로 105개인정보포함
388문현4동공오일마트 주식회사수영로 30-9개인정보포함