Overview

Dataset statistics

Number of variables: 4
Number of observations: 389
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 12.3 KiB
Average record size in memory: 32.3 B

Variable types

Categorical: 1
Text: 3

Dataset

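Description: Status data on volume-based waste bag (쓰레기종량제봉투) retailers in Nam-gu, Busan Metropolitan City. It provides fields such as administrative district (행정동), business name, road-name address, and phone number.
Author: Nam-gu, Busan Metropolitan City (부산광역시 남구)
URL: https://www.data.go.kr/data/15045459/fileData.do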

Reproduction

Analysis started: 2024-03-14 14:00:24.539391
Analysis finished: 2024-03-14 14:00:25.490446
Duration: 0.95 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
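
For readers who want to regenerate a report of this kind, the following is a minimal sketch, not the script that produced this page. The helper name `build_report`, the `cp949` encoding, the report title, and the output path are all assumptions; only the library (ydata-profiling) and the dataset URL come from the sections above.

```python
def build_report(csv_path, out_path="report.html", encoding="cp949"):
    """Profile a CSV file and write an HTML report (hypothetical helper)."""
    # Imported lazily so the module still loads where the libraries are absent.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the file downloaded from data.go.kr is CP949-encoded.
    df = pd.read_csv(csv_path, encoding=encoding)

    report = ProfileReport(
        df,
        title="Nam-gu waste bag retailers",  # illustrative title
        dataset={
            "description": "Volume-based waste bag retailers in Nam-gu, Busan",
            "url": "https://www.data.go.kr/data/15045459/fileData.do",
        },
    )
    report.to_file(out_path)
    return out_path
```

Calling `build_report("namgu_bag_retailers.csv")` with a locally saved copy of the file (the file name is hypothetical) should write `report.html` next to the script.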

Variables

행정동
Categorical

Distinct: 18
Distinct (%): 4.6%
Missing: 0
Missing (%): 0.0%
Memory size: 3.2 KiB

대연3동 | 53
용호1동 | 46
대연1동 | 40
용호2동 | 26
대연5동 | 24
Other values (13) | 200

Length

Max length: 5
Median length: 4
Mean length: 3.9280206
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 대연1동
2nd row: 대연1동
3rd row: 대연1동
4th row: 대연1동
5th row: 대연1동

Common Values

Value | Count | Frequency (%)
대연3동 | 53 | 13.6%
용호1동 | 46 | 11.8%
대연1동 | 40 | 10.3%
용호2동 | 26 | 6.7%
대연5동 | 24 | 6.2%
문현1동 | 23 | 5.9%
용호3동 | 21 | 5.4%
문현3동 | 20 | 5.1%
문현2동 | 19 | 4.9%
대연4동 | 18 | 4.6%
Other values (8) | 99 | 25.4%
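
The Frequency (%) column is each count divided by the number of observations (389). A small sketch of that arithmetic, using three counts from the table above; the helper name `frequency_pct` is illustrative and not part of ydata-profiling:

```python
def frequency_pct(count, total, digits=1):
    """Return count as a percentage of total, rounded to `digits` decimals."""
    return round(100 * count / total, digits)

N_OBSERVATIONS = 389  # "Number of observations" from the overview

counts = {"대연3동": 53, "용호1동": 46, "대연1동": 40}
for name, count in counts.items():
    print(name, count, f"{frequency_pct(count, N_OBSERVATIONS)}%")
```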

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
대연3동 | 53 | 13.6%
용호1동 | 46 | 11.8%
대연1동 | 40 | 10.3%
대연5동 | 27 | 6.9%
용호2동 | 26 | 6.7%
문현1동 | 23 | 5.9%
용호3동 | 21 | 5.4%
문현3동 | 20 | 5.1%
문현2동 | 19 | 4.9%
대연4동 | 18 | 4.6%
Other values (7) | 96 | 24.7%

상호명
Text

Distinct: 381
Distinct (%): 97.9%
Missing: 0
Missing (%): 0.0%
Memory size: 3.2 KiB

Length

Max length: 26
Median length: 15
Mean length: 8.4910026
Min length: 3
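
As a rough illustration of how such length statistics can be computed, the sketch below applies them to the five sample rows listed for this variable under Sample. It assumes length is counted in Unicode code points (Python's `len`), which may not match the report's internals exactly; the function name `length_stats` is hypothetical.

```python
import statistics

def length_stats(values):
    """Return max, median, mean, and min string length for an iterable of strings."""
    lengths = [len(v) for v in values]
    return {
        "max": max(lengths),
        "median": statistics.median(lengths),
        "mean": statistics.mean(lengths),
        "min": min(lengths),
    }

# The five sample rows of this variable, not the full 389-row column.
sample = ["GS25 대연중앙점", "한진사", "드림마트", "GS25 대연래미안", "세븐일레븐 부산대연중앙점"]
print(length_stats(sample))
```

Because only five rows are used, the output describes the sample and is not expected to reproduce the figures reported for the full column.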

Characters and Unicode

Total characters: 3303
Distinct characters: 299
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
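
To make the idea of character properties concrete, here is a minimal sketch that counts Unicode general categories for one sample value using Python's standard `unicodedata` module. The mapping from two-letter codes to the long names used in this report covers only the codes that occur in the example, and the function name is illustrative. The standard library does not expose script or block properties, so those are not shown.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that occur in this example.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
}

def category_counts(text):
    """Count characters in `text` by Unicode general category."""
    counts = Counter(unicodedata.category(ch) for ch in text)
    # Fall back to the raw two-letter code for categories not in the mapping.
    return {CATEGORY_NAMES.get(code, code): n for code, n in counts.items()}

print(category_counts("GS25 대연중앙점"))
```

For this value the Hangul syllables fall under Other Letter, `G` and `S` under Uppercase Letter, `2` and `5` under Decimal Number, and the space under Space Separator.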

Unique

Unique: 375
Unique (%): 96.4%

Sample

1st row: GS25 대연중앙점
2nd row: 한진사
3rd row: 드림마트
4th row: GS25 대연래미안
5th row: 세븐일레븐 부산대연중앙점
Value | Count | Frequency (%)
씨유 | 53 | 8.6%
gs25 | 48 | 7.8%
세븐일레븐 | 39 | 6.3%
이마트24 | 22 | 3.6%
지에스25 | 5 | 0.8%
문현점 | 5 | 0.8%
현대마트 | 4 | 0.6%
지에스더프레시 | 4 | 0.6%
경성대점 | 3 | 0.5%
부산용호점 | 3 | 0.5%
Other values (392) | 433 | 70.0%

Most occurring characters

ValueCountFrequency (%)
234
 
7.1%
225
 
6.8%
127
 
3.8%
124
 
3.8%
120
 
3.6%
88
 
2.7%
2 88
 
2.7%
79
 
2.4%
78
 
2.4%
72
 
2.2%
Other values (289) 2068
62.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2693 | 81.5%
Space Separator | 234 | 7.1%
Decimal Number | 186 | 5.6%
Uppercase Letter | 137 | 4.1%
Other Symbol | 19 | 0.6%
Open Punctuation | 16 | 0.5%
Close Punctuation | 16 | 0.5%
Control | 1 | < 0.1%
Other Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
225
 
8.4%
127
 
4.7%
124
 
4.6%
120
 
4.5%
88
 
3.3%
79
 
2.9%
78
 
2.9%
72
 
2.7%
69
 
2.6%
64
 
2.4%
Other values (264) 1647
61.2%
Uppercase Letter
ValueCountFrequency (%)
S 59
43.1%
G 52
38.0%
K 7
 
5.1%
C 6
 
4.4%
U 5
 
3.6%
J 2
 
1.5%
R 2
 
1.5%
E 1
 
0.7%
P 1
 
0.7%
M 1
 
0.7%
Decimal Number
ValueCountFrequency (%)
2 88
47.3%
5 58
31.2%
4 25
 
13.4%
1 5
 
2.7%
3 3
 
1.6%
8 3
 
1.6%
7 2
 
1.1%
0 2
 
1.1%
Space Separator
ValueCountFrequency (%)
234
100.0%
Other Symbol
ValueCountFrequency (%)
19
100.0%
Open Punctuation
ValueCountFrequency (%)
( 16
100.0%
Close Punctuation
ValueCountFrequency (%)
) 16
100.0%
Control
ValueCountFrequency (%)
1
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2712 | 82.1%
Common | 454 | 13.7%
Latin | 137 | 4.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
225
 
8.3%
127
 
4.7%
124
 
4.6%
120
 
4.4%
88
 
3.2%
79
 
2.9%
78
 
2.9%
72
 
2.7%
69
 
2.5%
64
 
2.4%
Other values (265) 1666
61.4%
Common
ValueCountFrequency (%)
234
51.5%
2 88
 
19.4%
5 58
 
12.8%
4 25
 
5.5%
( 16
 
3.5%
) 16
 
3.5%
1 5
 
1.1%
3 3
 
0.7%
8 3
 
0.7%
7 2
 
0.4%
Other values (3) 4
 
0.9%
Latin
ValueCountFrequency (%)
S 59
43.1%
G 52
38.0%
K 7
 
5.1%
C 6
 
4.4%
U 5
 
3.6%
J 2
 
1.5%
R 2
 
1.5%
E 1
 
0.7%
P 1
 
0.7%
M 1
 
0.7%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2693 | 81.5%
ASCII | 591 | 17.9%
None | 19 | 0.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
234
39.6%
2 88
 
14.9%
S 59
 
10.0%
5 58
 
9.8%
G 52
 
8.8%
4 25
 
4.2%
( 16
 
2.7%
) 16
 
2.7%
K 7
 
1.2%
C 6
 
1.0%
Other values (14) 30
 
5.1%
Hangul
ValueCountFrequency (%)
225
 
8.4%
127
 
4.7%
124
 
4.6%
120
 
4.5%
88
 
3.3%
79
 
2.9%
78
 
2.9%
72
 
2.7%
69
 
2.6%
64
 
2.4%
Other values (264) 1647
61.2%
None
ValueCountFrequency (%)
19
100.0%

도로명주소
Text

Distinct: 387
Distinct (%): 99.5%
Missing: 0
Missing (%): 0.0%
Memory size: 3.2 KiB

Length

Max length: 49
Median length: 38
Mean length: 14.403599
Min length: 5

Characters and Unicode

Total characters: 5603
Distinct characters: 197
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 385
Unique (%): 99.0%

Sample

1st row: 수영로 244
2nd row: 수영로196번길 23
3rd row: 수영로196번길 31
4th row: 수영로208번길 15, 105호
5th row: 수영로220번길 21
Value | Count | Frequency (%)
1층 | 50 | 4.7%
수영로 | 26 | 2.4%
분포로 | 17 | 1.6%
유엔평화로 | 12 | 1.1%
용호로 | 12 | 1.1%
85 | 11 | 1.0%
동명로 | 11 | 1.0%
진남로 | 10 | 0.9%
전포대로 | 10 | 0.9%
유엔로 | 9 | 0.8%
Other values (527) | 898 | 84.2%

Most occurring characters

ValueCountFrequency (%)
683
 
12.2%
1 577
 
10.3%
394
 
7.0%
2 259
 
4.6%
0 208
 
3.7%
202
 
3.6%
, 196
 
3.5%
194
 
3.5%
3 177
 
3.2%
147
 
2.6%
Other values (187) 2566
45.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2549 | 45.5%
Decimal Number | 1965 | 35.1%
Space Separator | 683 | 12.2%
Other Punctuation | 197 | 3.5%
Dash Punctuation | 60 | 1.1%
Open Punctuation | 54 | 1.0%
Close Punctuation | 54 | 1.0%
Uppercase Letter | 34 | 0.6%
Lowercase Letter | 3 | 0.1%
Math Symbol | 2 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
394
 
15.5%
202
 
7.9%
194
 
7.6%
147
 
5.8%
143
 
5.6%
70
 
2.7%
69
 
2.7%
65
 
2.6%
62
 
2.4%
56
 
2.2%
Other values (157) 1147
45.0%
Decimal Number
ValueCountFrequency (%)
1 577
29.4%
2 259
13.2%
0 208
 
10.6%
3 177
 
9.0%
4 143
 
7.3%
6 137
 
7.0%
5 136
 
6.9%
9 125
 
6.4%
7 110
 
5.6%
8 93
 
4.7%
Uppercase Letter
ValueCountFrequency (%)
B 10
29.4%
A 6
17.6%
G 5
14.7%
S 4
 
11.8%
L 3
 
8.8%
C 2
 
5.9%
K 2
 
5.9%
I 1
 
2.9%
F 1
 
2.9%
Lowercase Letter
ValueCountFrequency (%)
e 1
33.3%
s 1
33.3%
k 1
33.3%
Other Punctuation
ValueCountFrequency (%)
, 196
99.5%
. 1
 
0.5%
Space Separator
ValueCountFrequency (%)
683
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 60
100.0%
Open Punctuation
ValueCountFrequency (%)
( 54
100.0%
Close Punctuation
ValueCountFrequency (%)
) 54
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%
Control
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 3017 | 53.8%
Hangul | 2549 | 45.5%
Latin | 37 | 0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
394
 
15.5%
202
 
7.9%
194
 
7.6%
147
 
5.8%
143
 
5.6%
70
 
2.7%
69
 
2.7%
65
 
2.6%
62
 
2.4%
56
 
2.2%
Other values (157) 1147
45.0%
Common
ValueCountFrequency (%)
683
22.6%
1 577
19.1%
2 259
 
8.6%
0 208
 
6.9%
, 196
 
6.5%
3 177
 
5.9%
4 143
 
4.7%
6 137
 
4.5%
5 136
 
4.5%
9 125
 
4.1%
Other values (8) 376
12.5%
Latin
ValueCountFrequency (%)
B 10
27.0%
A 6
16.2%
G 5
13.5%
S 4
 
10.8%
L 3
 
8.1%
C 2
 
5.4%
K 2
 
5.4%
I 1
 
2.7%
F 1
 
2.7%
e 1
 
2.7%
Other values (2) 2
 
5.4%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 3054 | 54.5%
Hangul | 2549 | 45.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
683
22.4%
1 577
18.9%
2 259
 
8.5%
0 208
 
6.8%
, 196
 
6.4%
3 177
 
5.8%
4 143
 
4.7%
6 137
 
4.5%
5 136
 
4.5%
9 125
 
4.1%
Other values (20) 413
13.5%
Hangul
ValueCountFrequency (%)
394
 
15.5%
202
 
7.9%
194
 
7.6%
147
 
5.8%
143
 
5.6%
70
 
2.7%
69
 
2.7%
65
 
2.6%
62
 
2.4%
56
 
2.2%
Other values (157) 1147
45.0%

전화번호
Text

Distinct: 168
Distinct (%): 43.2%
Missing: 0
Missing (%): 0.0%
Memory size: 3.2 KiB

Length

Max length: 14
Median length: 7
Mean length: 9.151671
Min length: 7

Characters and Unicode

Total characters: 3560
Distinct characters: 18
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 167
Unique (%): 42.9%

Sample

1st row: 개인정보 포함
2nd row: 051-625-5837
3rd row: 051-621-2984
4th row: 051-242-2426
5th row: 개인정보 포함
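
The report distinguishes Distinct (the number of different values) from Unique (values that occur exactly once). Here Distinct is 168 and Unique is 167, which is consistent with one repeated value, the placeholder 개인정보 포함, alongside 167 one-off phone numbers. The sketch below shows the two counts on the five sample rows above only; the function name `distinct_and_unique` is illustrative.

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique

# The five sample rows, not the full 389-row column.
sample = ["개인정보 포함", "051-625-5837", "051-621-2984", "051-242-2426", "개인정보 포함"]
print(distinct_and_unique(sample))  # (4, 3)
```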
Value | Count | Frequency (%)
개인정보 | 222 | 36.3%
포함 | 222 | 36.3%
051-643-3978 | 1 | 0.2%
051-621-0078 | 1 | 0.2%
051-646-8898 | 1 | 0.2%
051-632-5361 | 1 | 0.2%
051-624-4074 | 1 | 0.2%
051-622-0583 | 1 | 0.2%
051-624-0390 | 1 | 0.2%
051-612-3363 | 1 | 0.2%
Other values (159) | 159 | 26.0%

Most occurring characters

ValueCountFrequency (%)
- 335
 
9.4%
5 275
 
7.7%
1 274
 
7.7%
0 257
 
7.2%
6 233
 
6.5%
222
 
6.2%
222
 
6.2%
222
 
6.2%
222
 
6.2%
222
 
6.2%
Other values (8) 1076
30.2%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 1671 | 46.9%
Other Letter | 1332 | 37.4%
Dash Punctuation | 335 | 9.4%
Space Separator | 222 | 6.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 275
16.5%
1 274
16.4%
0 257
15.4%
6 233
13.9%
2 179
10.7%
4 115
6.9%
3 109
 
6.5%
7 86
 
5.1%
8 81
 
4.8%
9 62
 
3.7%
Other Letter
ValueCountFrequency (%)
222
16.7%
222
16.7%
222
16.7%
222
16.7%
222
16.7%
222
16.7%
Dash Punctuation
ValueCountFrequency (%)
- 335
100.0%
Space Separator
ValueCountFrequency (%)
222
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 2228 | 62.6%
Hangul | 1332 | 37.4%

Most frequent character per script

Common
ValueCountFrequency (%)
- 335
15.0%
5 275
12.3%
1 274
12.3%
0 257
11.5%
6 233
10.5%
222
10.0%
2 179
8.0%
4 115
 
5.2%
3 109
 
4.9%
7 86
 
3.9%
Other values (2) 143
6.4%
Hangul
ValueCountFrequency (%)
222
16.7%
222
16.7%
222
16.7%
222
16.7%
222
16.7%
222
16.7%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 2228 | 62.6%
Hangul | 1332 | 37.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 335
15.0%
5 275
12.3%
1 274
12.3%
0 257
11.5%
6 233
10.5%
222
10.0%
2 179
8.0%
4 115
 
5.2%
3 109
 
4.9%
7 86
 
3.9%
Other values (2) 143
6.4%
Hangul
ValueCountFrequency (%)
222
16.7%
222
16.7%
222
16.7%
222
16.7%
222
16.7%
222
16.7%

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 행정동 | 상호명 | 도로명주소 | 전화번호
0 | 대연1동 | GS25 대연중앙점 | 수영로 244 | 개인정보 포함
1 | 대연1동 | 한진사 | 수영로196번길 23 | 051-625-5837
2 | 대연1동 | 드림마트 | 수영로196번길 31 | 051-621-2984
3 | 대연1동 | GS25 대연래미안 | 수영로208번길 15, 105호 | 051-242-2426
4 | 대연1동 | 세븐일레븐 부산대연중앙점 | 수영로220번길 21 | 개인정보 포함
5 | 대연1동 | 씨유 대연엔젤점 | 수영로220번길 8 | 051-626-7737
6 | 대연1동 | GS25 대연용소점 | 수영로250번길 11-2 | 개인정보 포함
7 | 대연1동 | GS25 대연원빌점 | 용소로64번길 140, 101호(대연동, 오션팰리스) | 개인정보 포함
8 | 대연1동 | 세븐일레븐 부산UN평화점 | 용소로64번길 85 | 개인정보 포함
9 | 대연1동 | 씨유 대연대로점 | 유엔로 121, 1층 | 개인정보 포함

Last rows

Index | 행정동 | 상호명 | 도로명주소 | 전화번호
379 | 문현4동 | 롯데쇼핑㈜ 롯데슈퍼 부산문현점 | 지게골로 45 | 051-636-5601
380 | 문현4동 | 동경슈퍼 | 지게골로 66 | 051-643-3978
381 | 문현4동 | 문현쇼핑몰 | 지게골로37 | 개인정보 포함
382 | 문현4동 | 주식회사 대웅식품 문현점 | 우암로 359 | 051-639-9696
383 | 문현4동 | 지에스25 부산곱창골목점 | 지게골로 33-4 | 개인정보 포함
384 | 문현4동 | 한별슈퍼마켓 | 동제당로 170 | 051-647-8929
385 | 문현4동 | 세븐일레븐 부산문현장고개점 | 장고개로 105 | 개인정보 포함
386 | 문현4동 | 공오일마트 주식회사 | 수영로 30-9 | 개인정보 포함
387 | 문현4동 | 씨유 문현파라곤점 | 우암로300, 1동 201,202호 | 개인정보 포함
388 | 문현4동 | GS25 부산곱창골목점 | 지게골로 33-4(문현동) | 개인정보 포함