Overview

Dataset statistics

Number of variables: 4
Number of observations: 393
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 12.4 KiB
Average record size in memory: 32.3 B

Variable types

Categorical: 1
Text: 3

Dataset

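Description: 부산광역시남구_쓰레기종량제봉투_판매소_현황_20230706 (Nam-gu, Busan Metropolitan City: status of volume-based garbage bag sales outlets, 2023-07-06)
Author: 부산광역시 남구 (Nam-gu, Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15045459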

Reproduction

Analysis started: 2023-12-10 17:18:57.539158
Analysis finished: 2023-12-10 17:18:58.473612
Duration: 0.93 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
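
The reported duration follows directly from the two timestamps. A minimal sketch of that arithmetic, using only the Python standard library (the variable names are illustrative and not part of the report):

```python
from datetime import datetime

# Timestamps copied from the Reproduction block.
started = datetime.fromisoformat("2023-12-10 17:18:57.539158")
finished = datetime.fromisoformat("2023-12-10 17:18:58.473612")

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```

Rounded to two decimals this agrees with the duration shown in the report.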

Variables

행정동 (administrative dong)
Categorical

Distinct: 18
Distinct (%): 4.6%
Missing: 0
Missing (%): 0.0%
Memory size: 3.2 KiB

대연3동: 53
용호1동: 45
대연1동: 41
대연5동: 25
용호2동: 25
Other values (13): 204

Length

Max length: 5
Median length: 4
Mean length: 3.9312977
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 대연1동
2nd row: 대연1동
3rd row: 대연1동
4th row: 대연1동
5th row: 대연1동

Common Values

Value | Count | Frequency (%)
대연3동 | 53 | 13.5%
용호1동 | 45 | 11.5%
대연1동 | 41 | 10.4%
대연5동 | 25 | 6.4%
용호2동 | 25 | 6.4%
문현1동 | 22 | 5.6%
문현3동 | 22 | 5.6%
문현2동 | 22 | 5.6%
용호3동 | 21 | 5.3%
대연4동 | 17 | 4.3%
Other values (8) | 100 | 25.4%
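
Each frequency in the table above is the count divided by the 393 observations, and the "Other values" row is whatever remains after the ten listed values. A small sketch of that arithmetic, with the counts copied from the table (the helper name `frequency_pct` is an illustrative assumption, not something the profiler exposes):

```python
# Counts for the ten most common 행정동 values, copied from the table above.
top_counts = {
    "대연3동": 53, "용호1동": 45, "대연1동": 41, "대연5동": 25, "용호2동": 25,
    "문현1동": 22, "문현3동": 22, "문현2동": 22, "용호3동": 21, "대연4동": 17,
}
n_observations = 393  # "Number of observations" from the overview


def frequency_pct(count, total):
    """Share of rows as a percentage, rounded to one decimal place."""
    return round(100 * count / total, 1)


# Rows not covered by the ten listed values.
other_count = n_observations - sum(top_counts.values())

table = {name: frequency_pct(c, n_observations) for name, c in top_counts.items()}
table["Other values"] = frequency_pct(other_count, n_observations)

for name, pct in table.items():
    print(f"{name}: {pct}%")
```

With 18 distinct values and ten listed, the remainder covers 8 values and 100 rows.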

Length

Histogram of lengths of the category

상호명 (business name)
Text

Distinct: 386
Distinct (%): 98.2%
Missing: 0
Missing (%): 0.0%
Memory size: 3.2 KiB

Length

Max length: 26
Median length: 15
Mean length: 8.4249364
Min length: 3
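
The length statistics count characters (Unicode code points) per value. A hedged sketch of the same summary, computed over only the first five 상호명 sample values listed in this report, so the numbers will not match the full-column figures:

```python
import statistics

# First five 상호명 values from the report's Sample block (not the full column).
sample_names = [
    "GS25 대연중앙점",
    "한진사",
    "드림마트",
    "GS25 대연래미안",
    "세븐일레븐 부산대연중앙점",
]

# len() on a str counts code points, so each Hangul syllable counts as one.
lengths = [len(name) for name in sample_names]

summary = {
    "max": max(lengths),
    "median": statistics.median(lengths),
    "mean": statistics.mean(lengths),
    "min": min(lengths),
    "total_characters": sum(lengths),
}
print(summary)
```

Over the full column, the mean length multiplied by the 393 observations gives the "Total characters" figure (8.4249364 × 393 ≈ 3311).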

Characters and Unicode

Total characters: 3311
Distinct characters: 294
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
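
As an illustration of those character properties, the sketch below tallies Unicode general categories with Python's standard `unicodedata` module. It uses three of the sample 상호명 values rather than the full column, and the `LABELS` mapping from two-letter codes to the names used in this report is an assumption of the sketch. The standard library does not expose script or block, so those are not covered here.

```python
import unicodedata
from collections import Counter

# Three sample 상호명 values from this report (not the full column).
names = ["GS25 대연중앙점", "한진사", "드림마트"]

# Two-letter general-category codes mapped to the labels used in the report.
LABELS = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
}


def category_counts(values):
    """Count characters per Unicode general category across all values."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts


counts = category_counts(names)
labeled = {LABELS.get(cat, cat): n for cat, n in counts.items()}
print(labeled)
```

Hangul syllables fall under "Other Letter", which is why that category dominates the text variables in this report.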

Unique

Unique: 381
Unique (%): 96.9%

Sample

1st row: GS25 대연중앙점
2nd row: 한진사
3rd row: 드림마트
4th row: GS25 대연래미안
5th row: 세븐일레븐 부산대연중앙점

Value | Count | Frequency (%)
gs25 | 52 | 8.3%
씨유 | 50 | 8.0%
세븐일레븐 | 40 | 6.4%
이마트24 | 22 | 3.5%
문현점 | 6 | 1.0%
현대마트 | 4 | 0.6%
부산용호점 | 3 | 0.5%
㈜코리아세븐 | 3 | 0.5%
부산대연점 | 3 | 0.5%
홈플러스㈜익스프레스 | 3 | 0.5%
Other values (398) | 437 | 70.1%

Most occurring characters

ValueCountFrequency (%)
235
 
7.1%
224
 
6.8%
130
 
3.9%
122
 
3.7%
117
 
3.5%
2 90
 
2.7%
90
 
2.7%
81
 
2.4%
77
 
2.3%
75
 
2.3%
Other values (284) 2070
62.5%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2695 | 81.4%
Space Separator | 235 | 7.1%
Decimal Number | 186 | 5.6%
Uppercase Letter | 147 | 4.4%
Other Symbol | 19 | 0.6%
Open Punctuation | 14 | 0.4%
Close Punctuation | 14 | 0.4%
Other Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
224
 
8.3%
130
 
4.8%
122
 
4.5%
117
 
4.3%
90
 
3.3%
81
 
3.0%
77
 
2.9%
75
 
2.8%
69
 
2.6%
67
 
2.5%
Other values (261) 1643
61.0%
Uppercase Letter
ValueCountFrequency (%)
S 64
43.5%
G 57
38.8%
K 7
 
4.8%
C 6
 
4.1%
U 5
 
3.4%
R 2
 
1.4%
J 2
 
1.4%
N 1
 
0.7%
E 1
 
0.7%
M 1
 
0.7%
Decimal Number
ValueCountFrequency (%)
2 90
48.4%
5 60
32.3%
4 25
 
13.4%
3 3
 
1.6%
1 3
 
1.6%
8 3
 
1.6%
7 2
 
1.1%
Space Separator
ValueCountFrequency (%)
235
100.0%
Other Symbol
ValueCountFrequency (%)
19
100.0%
Open Punctuation
ValueCountFrequency (%)
( 14
100.0%
Close Punctuation
ValueCountFrequency (%)
) 14
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2714 | 82.0%
Common | 450 | 13.6%
Latin | 147 | 4.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
224
 
8.3%
130
 
4.8%
122
 
4.5%
117
 
4.3%
90
 
3.3%
81
 
3.0%
77
 
2.8%
75
 
2.8%
69
 
2.5%
67
 
2.5%
Other values (262) 1662
61.2%
Common
ValueCountFrequency (%)
235
52.2%
2 90
 
20.0%
5 60
 
13.3%
4 25
 
5.6%
( 14
 
3.1%
) 14
 
3.1%
3 3
 
0.7%
1 3
 
0.7%
8 3
 
0.7%
7 2
 
0.4%
Latin
ValueCountFrequency (%)
S 64
43.5%
G 57
38.8%
K 7
 
4.8%
C 6
 
4.1%
U 5
 
3.4%
R 2
 
1.4%
J 2
 
1.4%
N 1
 
0.7%
E 1
 
0.7%
M 1
 
0.7%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2695 | 81.4%
ASCII | 597 | 18.0%
None | 19 | 0.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
235
39.4%
2 90
 
15.1%
S 64
 
10.7%
5 60
 
10.1%
G 57
 
9.5%
4 25
 
4.2%
( 14
 
2.3%
) 14
 
2.3%
K 7
 
1.2%
C 6
 
1.0%
Other values (12) 25
 
4.2%
Hangul
ValueCountFrequency (%)
224
 
8.3%
130
 
4.8%
122
 
4.5%
117
 
4.3%
90
 
3.3%
81
 
3.0%
77
 
2.9%
75
 
2.8%
69
 
2.6%
67
 
2.5%
Other values (261) 1643
61.0%
None
ValueCountFrequency (%)
19
100.0%

도로명주소 (road-name address)
Text

Distinct: 391
Distinct (%): 99.5%
Missing: 0
Missing (%): 0.0%
Memory size: 3.2 KiB

Length

Max length: 45
Median length: 35
Mean length: 13.651399
Min length: 5

Characters and Unicode

Total characters: 5365
Distinct characters: 188
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 389
Unique (%): 99.0%

Sample

1st row: 수영로 244
2nd row: 수영로196번길 23
3rd row: 수영로196번길 31
4th row: 수영로208번길 15, 105호
5th row: 수영로220번길 21

Value | Count | Frequency (%)
1층 | 50 | 4.8%
수영로 | 25 | 2.4%
분포로 | 15 | 1.4%
유엔평화로 | 12 | 1.2%
용호로 | 12 | 1.2%
동명로 | 11 | 1.1%
전포대로 | 11 | 1.1%
26 | 10 | 1.0%
85 | 10 | 1.0%
진남로 | 10 | 1.0%
Other values (495) | 875 | 84.1%

Most occurring characters

ValueCountFrequency (%)
657
 
12.2%
1 559
 
10.4%
396
 
7.4%
2 255
 
4.8%
0 207
 
3.9%
207
 
3.9%
198
 
3.7%
3 178
 
3.3%
, 177
 
3.3%
4 148
 
2.8%
Other values (178) 2383
44.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2418 | 45.1%
Decimal Number | 1943 | 36.2%
Space Separator | 657 | 12.2%
Other Punctuation | 178 | 3.3%
Dash Punctuation | 58 | 1.1%
Uppercase Letter | 36 | 0.7%
Open Punctuation | 35 | 0.7%
Close Punctuation | 35 | 0.7%
Lowercase Letter | 3 | 0.1%
Math Symbol | 2 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
396
 
16.4%
207
 
8.6%
198
 
8.2%
129
 
5.3%
118
 
4.9%
65
 
2.7%
64
 
2.6%
64
 
2.6%
57
 
2.4%
56
 
2.3%
Other values (149) 1064
44.0%
Decimal Number
ValueCountFrequency (%)
1 559
28.8%
2 255
13.1%
0 207
 
10.7%
3 178
 
9.2%
4 148
 
7.6%
6 140
 
7.2%
5 132
 
6.8%
9 126
 
6.5%
7 107
 
5.5%
8 91
 
4.7%
Uppercase Letter
ValueCountFrequency (%)
B 10
27.8%
G 5
13.9%
A 5
13.9%
S 4
 
11.1%
C 3
 
8.3%
L 3
 
8.3%
I 2
 
5.6%
F 2
 
5.6%
K 2
 
5.6%
Lowercase Letter
ValueCountFrequency (%)
s 1
33.3%
k 1
33.3%
e 1
33.3%
Other Punctuation
ValueCountFrequency (%)
, 177
99.4%
. 1
 
0.6%
Space Separator
ValueCountFrequency (%)
657
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 58
100.0%
Open Punctuation
ValueCountFrequency (%)
( 35
100.0%
Close Punctuation
ValueCountFrequency (%)
) 35
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 2908 | 54.2%
Hangul | 2418 | 45.1%
Latin | 39 | 0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
396
 
16.4%
207
 
8.6%
198
 
8.2%
129
 
5.3%
118
 
4.9%
65
 
2.7%
64
 
2.6%
64
 
2.6%
57
 
2.4%
56
 
2.3%
Other values (149) 1064
44.0%
Common
ValueCountFrequency (%)
657
22.6%
1 559
19.2%
2 255
 
8.8%
0 207
 
7.1%
3 178
 
6.1%
, 177
 
6.1%
4 148
 
5.1%
6 140
 
4.8%
5 132
 
4.5%
9 126
 
4.3%
Other values (7) 329
11.3%
Latin
ValueCountFrequency (%)
B 10
25.6%
G 5
12.8%
A 5
12.8%
S 4
 
10.3%
C 3
 
7.7%
L 3
 
7.7%
I 2
 
5.1%
F 2
 
5.1%
K 2
 
5.1%
s 1
 
2.6%
Other values (2) 2
 
5.1%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 2947 | 54.9%
Hangul | 2418 | 45.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
657
22.3%
1 559
19.0%
2 255
 
8.7%
0 207
 
7.0%
3 178
 
6.0%
, 177
 
6.0%
4 148
 
5.0%
6 140
 
4.8%
5 132
 
4.5%
9 126
 
4.3%
Other values (19) 368
12.5%
Hangul
ValueCountFrequency (%)
396
 
16.4%
207
 
8.6%
198
 
8.2%
129
 
5.3%
118
 
4.9%
65
 
2.7%
64
 
2.6%
64
 
2.6%
57
 
2.4%
56
 
2.3%
Other values (149) 1064
44.0%

전화번호 (phone number)
Text

Distinct: 173
Distinct (%): 44.0%
Missing: 0
Missing (%): 0.0%
Memory size: 3.2 KiB

Length

Max length: 12
Median length: 6
Mean length: 8.6412214
Min length: 6

Characters and Unicode

Total characters: 3396
Distinct characters: 17
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 171
Unique (%): 43.5%
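
"Distinct" and "Unique" measure different things: distinct counts how many different values occur, while unique counts the values that occur exactly once. The sketch below rebuilds a column with the same shape as the counts reported for this variable (220 rows holding the placeholder 개인정보포함, "contains personal information"; one number appearing twice; 171 numbers appearing once). The 171 single-occurrence entries are hypothetical stand-ins, not real values from the dataset.

```python
from collections import Counter

# Shape taken from the reported value counts for 전화번호.
values = ["개인정보포함"] * 220 + ["051-929-5599"] * 2
# Hypothetical stand-ins for the 171 numbers that each appear once.
values += [f"stand-in-{i}" for i in range(171)]

counts = Counter(values)
n = len(values)

distinct = len(counts)                                # different values present
unique = sum(1 for c in counts.values() if c == 1)    # values seen exactly once

distinct_pct = round(100 * distinct / n, 1)
unique_pct = round(100 * unique / n, 1)
print(distinct, distinct_pct, unique, unique_pct)
```

The two values that are distinct but not unique are the placeholder and the one repeated number.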

Sample

1st row: 개인정보포함
2nd row: 051-625-5837
3rd row: 051-621-2984
4th row: 051-242-2426
5th row: 개인정보포함

Value | Count | Frequency (%)
개인정보포함 | 220 | 56.0%
051-929-5599 | 2 | 0.5%
051-637-4522 | 1 | 0.3%
051-644-4394 | 1 | 0.3%
051-622-0583 | 1 | 0.3%
051-633-0933 | 1 | 0.3%
051-642-3778 | 1 | 0.3%
051-633-1141 | 1 | 0.3%
051-624-0390 | 1 | 0.3%
051-612-3363 | 1 | 0.3%
Other values (163) | 163 | 41.5%

Most occurring characters

ValueCountFrequency (%)
- 346
 
10.2%
5 286
 
8.4%
1 280
 
8.2%
0 256
 
7.5%
6 242
 
7.1%
220
 
6.5%
220
 
6.5%
220
 
6.5%
220
 
6.5%
220
 
6.5%
Other values (7) 886
26.1%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 1730 | 50.9%
Other Letter | 1320 | 38.9%
Dash Punctuation | 346 | 10.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 286
16.5%
1 280
16.2%
0 256
14.8%
6 242
14.0%
2 185
10.7%
3 119
6.9%
4 116
6.7%
7 94
 
5.4%
8 86
 
5.0%
9 66
 
3.8%
Other Letter
ValueCountFrequency (%)
220
16.7%
220
16.7%
220
16.7%
220
16.7%
220
16.7%
220
16.7%
Dash Punctuation
ValueCountFrequency (%)
- 346
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 2076 | 61.1%
Hangul | 1320 | 38.9%

Most frequent character per script

Common
ValueCountFrequency (%)
- 346
16.7%
5 286
13.8%
1 280
13.5%
0 256
12.3%
6 242
11.7%
2 185
8.9%
3 119
 
5.7%
4 116
 
5.6%
7 94
 
4.5%
8 86
 
4.1%
Hangul
ValueCountFrequency (%)
220
16.7%
220
16.7%
220
16.7%
220
16.7%
220
16.7%
220
16.7%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 2076 | 61.1%
Hangul | 1320 | 38.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 346
16.7%
5 286
13.8%
1 280
13.5%
0 256
12.3%
6 242
11.7%
2 185
8.9%
3 119
 
5.7%
4 116
 
5.6%
7 94
 
4.5%
8 86
 
4.1%
Hangul
ValueCountFrequency (%)
220
16.7%
220
16.7%
220
16.7%
220
16.7%
220
16.7%
220
16.7%

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
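
The report shows zero missing cells even though more than half of the 전화번호 values are the placeholder 개인정보포함 ("contains personal information"). A minimal sketch of why, using the first three sample rows of this report: a placeholder string is an ordinary value, so a null check does not flag it. Treating only `None` as missing is an assumption of this sketch, not a statement about the profiler's internals.

```python
# First three rows from the report's Sample block.
rows = [
    {"행정동": "대연1동", "상호명": "GS25 대연중앙점",
     "도로명주소": "수영로 244", "전화번호": "개인정보포함"},
    {"행정동": "대연1동", "상호명": "한진사",
     "도로명주소": "수영로196번길 23", "전화번호": "051-625-5837"},
    {"행정동": "대연1동", "상호명": "드림마트",
     "도로명주소": "수영로196번길 31", "전화번호": "051-621-2984"},
]

REDACTED = "개인정보포함"  # placeholder used in place of withheld phone numbers
columns = list(rows[0])

# Assumption: a cell is missing only when it is None.
missing_per_column = {
    col: sum(1 for row in rows if row[col] is None) for col in columns
}

# Counting the placeholder separately reveals the withheld values.
redacted_per_column = {
    col: sum(1 for row in rows if row[col] == REDACTED) for col in columns
}

print(missing_per_column)
print(redacted_per_column)
```

Counting the placeholder explicitly is one way to see how much of the column is actually usable.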

Sample

First rows

Index | 행정동 | 상호명 | 도로명주소 | 전화번호
0 | 대연1동 | GS25 대연중앙점 | 수영로 244 | 개인정보포함
1 | 대연1동 | 한진사 | 수영로196번길 23 | 051-625-5837
2 | 대연1동 | 드림마트 | 수영로196번길 31 | 051-621-2984
3 | 대연1동 | GS25 대연래미안 | 수영로208번길 15, 105호 | 051-242-2426
4 | 대연1동 | 세븐일레븐 부산대연중앙점 | 수영로220번길 21 | 개인정보포함
5 | 대연1동 | 씨유 대연엔젤점 | 수영로220번길 8 | 051-626-7737
6 | 대연1동 | GS25 대연용소점 | 수영로250번길 11-2 | 개인정보포함
7 | 대연1동 | GS25 대연원빌점 | 용소로64번길 140, 101호 | 051-624-6093
8 | 대연1동 | 세븐일레븐 부산UN평화점 | 용소로64번길 85 | 개인정보포함
9 | 대연1동 | 씨유 대연대로점 | 유엔로 121, 1층 | 개인정보포함

Last rows

Index | 행정동 | 상호명 | 도로명주소 | 전화번호
383 | 문현4동 | 뉴빅세일마트 | 지게골로 111, 2층 | 개인정보포함
384 | 문현4동 | 롯데쇼핑㈜ 롯데슈퍼 부산문현점 | 지게골로 45 | 051-636-5601
385 | 문현4동 | 동경슈퍼 | 지게골로 66 | 051-643-3978
386 | 문현4동 | 문현쇼핑몰 | 지게골로37 | 개인정보포함
387 | 문현4동 | 주식회사 대웅식품 문현점 | 우암로 359 | 051-639-9696
388 | 문현4동 | 지에스25 부산곱창골목점 | 지게골로 33-4 | 개인정보포함
389 | 문현4동 | 한별슈퍼마켓 | 동제당로 170 | 051-647-8929
390 | 문현4동 | 세븐일레븐 부산문현장고개점 | 장고개로 105 | 개인정보포함
391 | 문현4동 | 공오일마트 주식회사 | 수영로 30-9 | 개인정보포함
392 | 문현4동 | 씨유 문현파라곤점 | 우암로300, 1동 201,202호 | 개인정보포함