Overview

Dataset statistics

Number of variables3
Number of observations617
Missing cells1
Missing cells (%)0.1%
Duplicate rows2
Duplicate rows (%)0.3%
Total size in memory14.6 KiB
Average record size in memory24.2 B

Variable types

Text3

Dataset

Description부산광역시남구담배소매인지정현황_20210513
Author부산광역시 남구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3081544

Alerts

Dataset has 2 (0.3%) duplicate rowsDuplicates

Reproduction

Analysis started2024-04-21 11:15:35.076756
Analysis finished2024-04-21 11:15:36.208920
Duration1.13 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct500
Distinct (%)81.0%
Missing0
Missing (%)0.0%
Memory size4.9 KiB
2024-04-21T20:15:36.703013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length18
Mean length6.9740681
Min length1

Characters and Unicode

Total characters4303
Distinct characters381
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique480 ?
Unique (%)77.8%

Sample

1st row세븐일레븐 부산용호로점
2nd row블루25 씨스페이스 못골점
3rd row베이프마스터 경성대점
4th row지에스(GS)25 대연유일점
5th row지에스(GS)25 못골시장점
ValueCountFrequency (%)
담배 94
 
11.4%
씨유 43
 
5.2%
세븐일레븐 25
 
3.0%
gs25 20
 
2.4%
이마트24 17
 
2.1%
지에스(gs)25 13
 
1.6%
주)코리아세븐 12
 
1.5%
미니스톱 10
 
1.2%
마트 5
 
0.6%
부산대연점 4
 
0.5%
Other values (526) 579
70.4%
2024-04-21T20:15:37.802682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
234
 
5.4%
209
 
4.9%
143
 
3.3%
137
 
3.2%
136
 
3.2%
106
 
2.5%
103
 
2.4%
92
 
2.1%
2 88
 
2.0%
86
 
2.0%
Other values (371) 2969
69.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3621
84.2%
Space Separator 209
 
4.9%
Decimal Number 190
 
4.4%
Uppercase Letter 157
 
3.6%
Open Punctuation 60
 
1.4%
Close Punctuation 60
 
1.4%
Lowercase Letter 3
 
0.1%
Dash Punctuation 2
 
< 0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
234
 
6.5%
143
 
3.9%
137
 
3.8%
136
 
3.8%
106
 
2.9%
103
 
2.8%
92
 
2.5%
86
 
2.4%
73
 
2.0%
70
 
1.9%
Other values (335) 2441
67.4%
Uppercase Letter
ValueCountFrequency (%)
S 58
36.9%
G 54
34.4%
C 9
 
5.7%
K 5
 
3.2%
J 4
 
2.5%
E 3
 
1.9%
N 3
 
1.9%
U 3
 
1.9%
V 2
 
1.3%
L 2
 
1.3%
Other values (9) 14
 
8.9%
Decimal Number
ValueCountFrequency (%)
2 88
46.3%
5 63
33.2%
4 21
 
11.1%
1 6
 
3.2%
3 5
 
2.6%
0 4
 
2.1%
9 1
 
0.5%
7 1
 
0.5%
6 1
 
0.5%
Lowercase Letter
ValueCountFrequency (%)
k 1
33.3%
s 1
33.3%
c 1
33.3%
Space Separator
ValueCountFrequency (%)
209
100.0%
Open Punctuation
ValueCountFrequency (%)
( 60
100.0%
Close Punctuation
ValueCountFrequency (%)
) 60
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3621
84.2%
Common 522
 
12.1%
Latin 160
 
3.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
234
 
6.5%
143
 
3.9%
137
 
3.8%
136
 
3.8%
106
 
2.9%
103
 
2.8%
92
 
2.5%
86
 
2.4%
73
 
2.0%
70
 
1.9%
Other values (335) 2441
67.4%
Latin
ValueCountFrequency (%)
S 58
36.2%
G 54
33.8%
C 9
 
5.6%
K 5
 
3.1%
J 4
 
2.5%
E 3
 
1.9%
N 3
 
1.9%
U 3
 
1.9%
V 2
 
1.2%
L 2
 
1.2%
Other values (12) 17
 
10.6%
Common
ValueCountFrequency (%)
209
40.0%
2 88
16.9%
5 63
 
12.1%
( 60
 
11.5%
) 60
 
11.5%
4 21
 
4.0%
1 6
 
1.1%
3 5
 
1.0%
0 4
 
0.8%
- 2
 
0.4%
Other values (4) 4
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3621
84.2%
ASCII 682
 
15.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
234
 
6.5%
143
 
3.9%
137
 
3.8%
136
 
3.8%
106
 
2.9%
103
 
2.8%
92
 
2.5%
86
 
2.4%
73
 
2.0%
70
 
1.9%
Other values (335) 2441
67.4%
ASCII
ValueCountFrequency (%)
209
30.6%
2 88
12.9%
5 63
 
9.2%
( 60
 
8.8%
) 60
 
8.8%
S 58
 
8.5%
G 54
 
7.9%
4 21
 
3.1%
C 9
 
1.3%
1 6
 
0.9%
Other values (26) 54
 
7.9%
Distinct506
Distinct (%)82.1%
Missing1
Missing (%)0.2%
Memory size4.9 KiB
2024-04-21T20:15:39.112409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length56
Median length45
Mean length20.873377
Min length1

Characters and Unicode

Total characters12858
Distinct characters250
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique493 ?
Unique (%)80.0%

Sample

1st row부산광역시 남구 용호동 532-6
2nd row부산광역시 남구 대연동 1398-9 롯데식육
3rd row부산광역시 남구 대연동 58-11 로티보이까페
4th row부산광역시 남구 대연동 76-1
5th row부산광역시 남구 대연동 1415-8
ValueCountFrequency (%)
부산광역시 518
18.1%
남구 518
18.1%
대연동 155
 
5.4%
155
 
5.4%
용호동 109
 
3.8%
문현동 100
 
3.5%
감만동 51
 
1.8%
1호 36
 
1.3%
우암동 29
 
1.0%
용당동 23
 
0.8%
Other values (701) 1172
40.9%
2024-04-21T20:15:40.691432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2789
21.7%
584
 
4.5%
1 576
 
4.5%
561
 
4.4%
535
 
4.2%
534
 
4.2%
527
 
4.1%
522
 
4.1%
521
 
4.1%
519
 
4.0%
Other values (240) 5190
40.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7449
57.9%
Space Separator 2789
 
21.7%
Decimal Number 2508
 
19.5%
Dash Punctuation 76
 
0.6%
Uppercase Letter 15
 
0.1%
Other Punctuation 5
 
< 0.1%
Close Punctuation 5
 
< 0.1%
Open Punctuation 5
 
< 0.1%
Math Symbol 3
 
< 0.1%
Lowercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
584
 
7.8%
561
 
7.5%
535
 
7.2%
534
 
7.2%
527
 
7.1%
522
 
7.0%
521
 
7.0%
519
 
7.0%
518
 
7.0%
437
 
5.9%
Other values (216) 2191
29.4%
Decimal Number
ValueCountFrequency (%)
1 576
23.0%
2 294
11.7%
3 288
11.5%
5 251
10.0%
4 216
 
8.6%
6 184
 
7.3%
7 181
 
7.2%
9 180
 
7.2%
8 171
 
6.8%
0 167
 
6.7%
Uppercase Letter
ValueCountFrequency (%)
S 5
33.3%
G 4
26.7%
B 3
20.0%
K 2
 
13.3%
L 1
 
6.7%
Lowercase Letter
ValueCountFrequency (%)
a 1
50.0%
e 1
50.0%
Space Separator
ValueCountFrequency (%)
2789
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 76
100.0%
Other Punctuation
ValueCountFrequency (%)
. 5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Math Symbol
ValueCountFrequency (%)
~ 3
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7449
57.9%
Common 5391
41.9%
Latin 18
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
584
 
7.8%
561
 
7.5%
535
 
7.2%
534
 
7.2%
527
 
7.1%
522
 
7.0%
521
 
7.0%
519
 
7.0%
518
 
7.0%
437
 
5.9%
Other values (216) 2191
29.4%
Common
ValueCountFrequency (%)
2789
51.7%
1 576
 
10.7%
2 294
 
5.5%
3 288
 
5.3%
5 251
 
4.7%
4 216
 
4.0%
6 184
 
3.4%
7 181
 
3.4%
9 180
 
3.3%
8 171
 
3.2%
Other values (6) 261
 
4.8%
Latin
ValueCountFrequency (%)
S 5
27.8%
G 4
22.2%
B 3
16.7%
K 2
 
11.1%
L 1
 
5.6%
1
 
5.6%
a 1
 
5.6%
e 1
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7449
57.9%
ASCII 5408
42.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2789
51.6%
1 576
 
10.7%
2 294
 
5.4%
3 288
 
5.3%
5 251
 
4.6%
4 216
 
4.0%
6 184
 
3.4%
7 181
 
3.3%
9 180
 
3.3%
8 171
 
3.2%
Other values (13) 278
 
5.1%
Hangul
ValueCountFrequency (%)
584
 
7.8%
561
 
7.5%
535
 
7.2%
534
 
7.2%
527
 
7.1%
522
 
7.0%
521
 
7.0%
519
 
7.0%
518
 
7.0%
437
 
5.9%
Other values (216) 2191
29.4%
Number Forms
ValueCountFrequency (%)
1
100.0%
Distinct580
Distinct (%)94.0%
Missing0
Missing (%)0.0%
Memory size4.9 KiB
2024-04-21T20:15:41.740105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length71
Median length50
Mean length28.119935
Min length1

Characters and Unicode

Total characters17350
Distinct characters281
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique572 ?
Unique (%)92.7%

Sample

1st row부산광역시 남구 용호로 207. 1층 102호 (용호동)
2nd row부산광역시 남구 못골번영로 22. 롯데식육 (대연동)
3rd row부산광역시 남구 용소로 15. 로티보이까페 1층 (대연동)
4th row부산광역시 남구 수영로 286 (대연동)
5th row부산광역시 남구 못골로 73 (대연동)
ValueCountFrequency (%)
부산광역시 586
 
17.2%
남구 586
 
17.2%
대연동 195
 
5.7%
용호동 128
 
3.8%
문현동 110
 
3.2%
1층 105
 
3.1%
감만동 55
 
1.6%
수영로 39
 
1.1%
101호 29
 
0.9%
우암동 28
 
0.8%
Other values (743) 1550
45.4%
2024-04-21T20:15:43.092289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2938
 
16.9%
1 839
 
4.8%
766
 
4.4%
620
 
3.6%
609
 
3.5%
601
 
3.5%
596
 
3.4%
592
 
3.4%
591
 
3.4%
587
 
3.4%
Other values (271) 8611
49.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9937
57.3%
Space Separator 2938
 
16.9%
Decimal Number 2800
 
16.1%
Close Punctuation 587
 
3.4%
Open Punctuation 587
 
3.4%
Other Punctuation 369
 
2.1%
Dash Punctuation 92
 
0.5%
Uppercase Letter 28
 
0.2%
Lowercase Letter 6
 
< 0.1%
Math Symbol 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
766
 
7.7%
620
 
6.2%
609
 
6.1%
601
 
6.0%
596
 
6.0%
592
 
6.0%
591
 
5.9%
587
 
5.9%
587
 
5.9%
327
 
3.3%
Other values (244) 4061
40.9%
Decimal Number
ValueCountFrequency (%)
1 839
30.0%
2 350
12.5%
0 296
 
10.6%
4 229
 
8.2%
3 224
 
8.0%
5 216
 
7.7%
6 202
 
7.2%
9 163
 
5.8%
7 151
 
5.4%
8 130
 
4.6%
Uppercase Letter
ValueCountFrequency (%)
S 6
21.4%
B 6
21.4%
A 6
21.4%
K 4
14.3%
G 4
14.3%
L 2
 
7.1%
Lowercase Letter
ValueCountFrequency (%)
k 2
33.3%
s 2
33.3%
a 1
16.7%
e 1
16.7%
Space Separator
ValueCountFrequency (%)
2938
100.0%
Close Punctuation
ValueCountFrequency (%)
) 587
100.0%
Open Punctuation
ValueCountFrequency (%)
( 587
100.0%
Other Punctuation
ValueCountFrequency (%)
. 369
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 92
100.0%
Math Symbol
ValueCountFrequency (%)
~ 5
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9937
57.3%
Common 7378
42.5%
Latin 35
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
766
 
7.7%
620
 
6.2%
609
 
6.1%
601
 
6.0%
596
 
6.0%
592
 
6.0%
591
 
5.9%
587
 
5.9%
587
 
5.9%
327
 
3.3%
Other values (244) 4061
40.9%
Common
ValueCountFrequency (%)
2938
39.8%
1 839
 
11.4%
) 587
 
8.0%
( 587
 
8.0%
. 369
 
5.0%
2 350
 
4.7%
0 296
 
4.0%
4 229
 
3.1%
3 224
 
3.0%
5 216
 
2.9%
Other values (6) 743
 
10.1%
Latin
ValueCountFrequency (%)
S 6
17.1%
B 6
17.1%
A 6
17.1%
K 4
11.4%
G 4
11.4%
k 2
 
5.7%
s 2
 
5.7%
L 2
 
5.7%
a 1
 
2.9%
1
 
2.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9937
57.3%
ASCII 7412
42.7%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2938
39.6%
1 839
 
11.3%
) 587
 
7.9%
( 587
 
7.9%
. 369
 
5.0%
2 350
 
4.7%
0 296
 
4.0%
4 229
 
3.1%
3 224
 
3.0%
5 216
 
2.9%
Other values (16) 777
 
10.5%
Hangul
ValueCountFrequency (%)
766
 
7.7%
620
 
6.2%
609
 
6.1%
601
 
6.0%
596
 
6.0%
592
 
6.0%
591
 
5.9%
587
 
5.9%
587
 
5.9%
327
 
3.3%
Other values (244) 4061
40.9%
Number Forms
ValueCountFrequency (%)
1
100.0%

Missing values

2024-04-21T20:15:35.867279image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T20:15:36.110451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업소명업소지번주소업소도로명주소
0세븐일레븐 부산용호로점부산광역시 남구 용호동 532-6부산광역시 남구 용호로 207. 1층 102호 (용호동)
1블루25 씨스페이스 못골점부산광역시 남구 대연동 1398-9 롯데식육부산광역시 남구 못골번영로 22. 롯데식육 (대연동)
2베이프마스터 경성대점부산광역시 남구 대연동 58-11 로티보이까페부산광역시 남구 용소로 15. 로티보이까페 1층 (대연동)
3지에스(GS)25 대연유일점부산광역시 남구 대연동 76-1부산광역시 남구 수영로 286 (대연동)
4지에스(GS)25 못골시장점부산광역시 남구 대연동 1415-8부산광역시 남구 못골로 73 (대연동)
5뉴턴전자담배 용호점부산광역시 남구 용호동 368-19부산광역시 남구 용호로 113 (용호동)
6지에스25 대연평화로점부산광역시 남구 대연동 984-18부산광역시 남구 유엔평화로 26. 1층 101호 (대연동)
7씨유 문현현대점부산광역시 남구 문현동 73-1 현대아파트부산광역시 남구 진남로198번길 9-1. 현대아파트 지하1층 101. 102. 103. 104. 108호 (문현동. 현대아파트)
8세븐일레븐 부산대연센터점부산광역시 남구 대연동 1467-1부산광역시 남구 못골번영로50번길 20. 1층 (대연동)
9지에스(GS)25 경성중앙점부산광역시 남구 대연동 72-7 산암빌딩부산광역시 남구 수영로 298. 산암빌딩 (대연동)
업소명업소지번주소업소도로명주소
607담배부산광역시 남구 감만동 103호
608담배부산광역시 남구 문현동 545번지 27 호
609담배부산광역시 남구 감만동 29번지 36 호부산광역시 남구 석포로58번길 47 (감만동)
610담배부산광역시 남구 감만동 36호부산광역시 남구 석포로 65 (감만동)
611담배부산광역시 남구 대연동 1509번지 3 호부산광역시 남구 진남로 58 (대연동)
612담배부산광역시 남구 용당동 134호
613금석당부산광역시 남구 대연동 1416번지 1 호부산광역시 남구 못골로 75 (대연동)
614담배부산광역시 남구 대연동 376번지 10 호
615담배부산광역시 남구 대연동 368번지 4 호부산광역시 남구 수영로266번길 40 (대연동)
616오륙도중앙유통<NA>부산광역시 남구 용주로6번길 2 (용호동)

Duplicate rows

Most frequently occurring

업소명업소지번주소업소도로명주소# duplicates
0담배부산광역시 남구 문현동 635번지 6 호부산광역시 남구 전포대로 74 (문현동)2
1창성슈퍼마켓부산광역시 남구 감만동 249번지 3호부산광역시 남구 양지골로55번길 14 (감만동)2