Overview

Dataset statistics

Number of variables: 3
Number of observations: 620
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 3
Duplicate rows (%): 0.5%
Total size in memory: 14.7 KiB
Average record size in memory: 24.2 B

Variable types

Text: 3

Dataset

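Description: Busan Metropolitan City Nam-gu tobacco retailer designation status_20200529 (부산광역시남구담배소매인지정현황_20200529)
Author: Nam-gu, Busan Metropolitan City (부산광역시 남구)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3081544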

Alerts

Dataset has 3 (0.5%) duplicate rows [Duplicates]

Reproduction

Analysis started: 2024-04-21 11:15:14.822636
Analysis finished: 2024-04-21 11:15:15.953204
Duration: 1.13 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

업소명 (business name)

Distinct: 494
Distinct (%): 79.7%
Missing: 0
Missing (%): 0.0%
Memory size: 5.0 KiB

Length

Max length: 24
Median length: 18
Mean length: 6.9548387
Min length: 1
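
The length statistics are plain aggregates over per-value string lengths, and the reported mean is consistent with total characters divided by observations (4312 / 620). A minimal Python sketch, assuming lengths are counted in Unicode code points and using only the five sample values of this variable listed in the report; the names `sample_names` and `length_stats` are illustrative and do not come from the profiling tool:

```python
from statistics import mean, median

# Five sample values of the first variable, copied from the report's Sample block.
sample_names = [
    "리치마트",
    "세븐일레븐 부산대연대로점",
    "지에스(GS)25 대연유일점",
    "이마트24 남구문화점",
    "후크전자담배(경성대점)",
]

# len() on a str counts Unicode code points (assumed to match the report's notion of length).
lengths = [len(name) for name in sample_names]

length_stats = {
    "max": max(lengths),
    "median": median(lengths),
    "mean": mean(lengths),
    "min": min(lengths),
}
print(length_stats)

# Cross-check of the reported mean: total characters / number of observations.
reported_mean = 4312 / 620
print(round(reported_mean, 7))
```

The five-row result only illustrates the computation; it is not expected to match the statistics of the full 620-row column.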

Characters and Unicode

Total characters: 4312
Distinct characters: 384
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
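
As a rough illustration of how such character properties can be read, the sketch below tallies the Unicode general category of each character with Python's standard `unicodedata` module. It is not the profiling tool's own implementation; the `CATEGORY_NAMES` mapping and the `category_counts` helper are assumptions introduced here to mirror the category labels used in this report.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that appear in this report (illustrative mapping).
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
    "Sm": "Math Symbol",
    "Nl": "Letter Number",
}

def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            # Fall back to the two-letter code for categories not in the mapping.
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

# One sample value from the report, used as a tiny input.
counts = category_counts(["지에스(GS)25 대연유일점"])
for name, n in counts.most_common():
    print(name, n)
```

Script and block breakdowns need data that the standard library does not expose directly, so they are left out of this sketch.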

Unique

Unique: 473
Unique (%): 76.3%

Sample

1st row: 리치마트
2nd row: 세븐일레븐 부산대연대로점
3rd row: 지에스(GS)25 대연유일점
4th row: 이마트24 남구문화점
5th row: 후크전자담배(경성대점)

Value | Count | Frequency (%)
담배 | 97 | 11.8%
씨유 | 37 | 4.5%
세븐일레븐 | 21 | 2.6%
gs25 | 19 | 2.3%
이마트24 | 15 | 1.8%
주)코리아세븐 | 12 | 1.5%
지에스(gs)25 | 11 | 1.3%
미니스톱 | 11 | 1.3%
부산지원본부 | 7 | 0.9%
국군복지단 | 6 | 0.7%
Other values (527) | 585 | 71.3%

Most occurring characters

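Value | Count | Frequency (%)
[not preserved] | 226 | 5.2%
(space) | 205 | 4.8%
[not preserved] | 137 | 3.2%
[not preserved] | 134 | 3.1%
[not preserved] | 126 | 2.9%
[not preserved] | 108 | 2.5%
[not preserved] | 105 | 2.4%
[not preserved] | 101 | 2.3%
2 | 86 | 2.0%
[not preserved] | 86 | 2.0%
Other values (374) | 2998 | 69.5%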

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 3629 | 84.2%
Space Separator | 205 | 4.8%
Decimal Number | 189 | 4.4%
Uppercase Letter | 161 | 3.7%
Close Punctuation | 61 | 1.4%
Open Punctuation | 61 | 1.4%
Lowercase Letter | 3 | 0.1%
Other Punctuation | 2 | < 0.1%
Dash Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[not preserved] | 226 | 6.2%
[not preserved] | 137 | 3.8%
[not preserved] | 134 | 3.7%
[not preserved] | 126 | 3.5%
[not preserved] | 108 | 3.0%
[not preserved] | 105 | 2.9%
[not preserved] | 101 | 2.8%
[not preserved] | 86 | 2.4%
[not preserved] | 76 | 2.1%
[not preserved] | 66 | 1.8%
Other values (338) | 2464 | 67.9%

Uppercase Letter
Value | Count | Frequency (%)
S | 58 | 36.0%
G | 54 | 33.5%
C | 11 | 6.8%
K | 6 | 3.7%
U | 5 | 3.1%
J | 4 | 2.5%
N | 3 | 1.9%
L | 3 | 1.9%
E | 3 | 1.9%
V | 2 | 1.2%
Other values (9) | 12 | 7.5%

Decimal Number
Value | Count | Frequency (%)
2 | 86 | 45.5%
5 | 62 | 32.8%
4 | 21 | 11.1%
1 | 7 | 3.7%
0 | 6 | 3.2%
3 | 4 | 2.1%
7 | 1 | 0.5%
9 | 1 | 0.5%
6 | 1 | 0.5%

Lowercase Letter
Value | Count | Frequency (%)
k | 1 | 33.3%
s | 1 | 33.3%
c | 1 | 33.3%

Space Separator
Value | Count | Frequency (%)
(space) | 205 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 61 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 61 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
. | 2 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 3629 | 84.2%
Common | 519 | 12.0%
Latin | 164 | 3.8%

Most frequent character per script

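Hangul
Value | Count | Frequency (%)
[not preserved] | 226 | 6.2%
[not preserved] | 137 | 3.8%
[not preserved] | 134 | 3.7%
[not preserved] | 126 | 3.5%
[not preserved] | 108 | 3.0%
[not preserved] | 105 | 2.9%
[not preserved] | 101 | 2.8%
[not preserved] | 86 | 2.4%
[not preserved] | 76 | 2.1%
[not preserved] | 66 | 1.8%
Other values (338) | 2464 | 67.9%

Latin
Value | Count | Frequency (%)
S | 58 | 35.4%
G | 54 | 32.9%
C | 11 | 6.7%
K | 6 | 3.7%
U | 5 | 3.0%
J | 4 | 2.4%
N | 3 | 1.8%
L | 3 | 1.8%
E | 3 | 1.8%
V | 2 | 1.2%
Other values (12) | 15 | 9.1%

Common
Value | Count | Frequency (%)
(space) | 205 | 39.5%
2 | 86 | 16.6%
5 | 62 | 11.9%
) | 61 | 11.8%
( | 61 | 11.8%
4 | 21 | 4.0%
1 | 7 | 1.3%
0 | 6 | 1.2%
3 | 4 | 0.8%
. | 2 | 0.4%
Other values (4) | 4 | 0.8%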

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 3629 | 84.2%
ASCII | 683 | 15.8%

Most frequent character per block

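Hangul
Value | Count | Frequency (%)
[not preserved] | 226 | 6.2%
[not preserved] | 137 | 3.8%
[not preserved] | 134 | 3.7%
[not preserved] | 126 | 3.5%
[not preserved] | 108 | 3.0%
[not preserved] | 105 | 2.9%
[not preserved] | 101 | 2.8%
[not preserved] | 86 | 2.4%
[not preserved] | 76 | 2.1%
[not preserved] | 66 | 1.8%
Other values (338) | 2464 | 67.9%

ASCII
Value | Count | Frequency (%)
(space) | 205 | 30.0%
2 | 86 | 12.6%
5 | 62 | 9.1%
) | 61 | 8.9%
( | 61 | 8.9%
S | 58 | 8.5%
G | 54 | 7.9%
4 | 21 | 3.1%
C | 11 | 1.6%
1 | 7 | 1.0%
Other values (26) | 57 | 8.3%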

업소지번주소 (business lot-number address)

Distinct: 489
Distinct (%): 78.9%
Missing: 0
Missing (%): 0.0%
Memory size: 5.0 KiB

Length

Max length: 56
Median length: 45
Mean length: 20.454839
Min length: 1

Characters and Unicode

Total characters: 12682
Distinct characters: 240
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 477
Unique (%): 76.9%

Sample

1st row: 부산광역시 남구 대연동 369번지 5호
2nd row: 부산광역시 남구 대연동 334번지 2호
3rd row: 부산광역시 남구 대연동 76번지 1호
4th row: 부산광역시 남구 대연동 966번지 3호
5th row: 부산광역시 남구 대연동 383번지 10호

Value | Count | Frequency (%)
부산광역시 | 503 | 17.7%
남구 | 503 | 17.7%
[not preserved] | 162 | 5.7%
대연동 | 151 | 5.3%
용호동 | 101 | 3.6%
문현동 | 98 | 3.5%
감만동 | 49 | 1.7%
1호 | 47 | 1.7%
우암동 | 27 | 1.0%
2호 | 21 | 0.7%
Other values (660) | 1174 | 41.4%

Most occurring characters

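Value | Count | Frequency (%)
(space) | 2810 | 22.2%
[not preserved] | 615 | 4.8%
1 | 578 | 4.6%
[not preserved] | 545 | 4.3%
[not preserved] | 519 | 4.1%
[not preserved] | 518 | 4.1%
[not preserved] | 510 | 4.0%
[not preserved] | 507 | 4.0%
[not preserved] | 507 | 4.0%
[not preserved] | 504 | 4.0%
Other values (230) | 5069 | 40.0%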

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 7329 | 57.8%
Space Separator | 2810 | 22.2%
Decimal Number | 2480 | 19.6%
Dash Punctuation | 29 | 0.2%
Uppercase Letter | 13 | 0.1%
Other Punctuation | 5 | < 0.1%
Close Punctuation | 5 | < 0.1%
Open Punctuation | 5 | < 0.1%
Math Symbol | 3 | < 0.1%
Lowercase Letter | 2 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[not preserved] | 615 | 8.4%
[not preserved] | 545 | 7.4%
[not preserved] | 519 | 7.1%
[not preserved] | 518 | 7.1%
[not preserved] | 510 | 7.0%
[not preserved] | 507 | 6.9%
[not preserved] | 507 | 6.9%
[not preserved] | 504 | 6.9%
[not preserved] | 503 | 6.9%
[not preserved] | 479 | 6.5%
Other values (206) | 2122 | 29.0%

Decimal Number
Value | Count | Frequency (%)
1 | 578 | 23.3%
2 | 300 | 12.1%
3 | 274 | 11.0%
5 | 257 | 10.4%
4 | 213 | 8.6%
6 | 178 | 7.2%
7 | 178 | 7.2%
9 | 172 | 6.9%
0 | 167 | 6.7%
8 | 163 | 6.6%

Uppercase Letter
Value | Count | Frequency (%)
S | 4 | 30.8%
G | 3 | 23.1%
B | 3 | 23.1%
K | 2 | 15.4%
L | 1 | 7.7%

Lowercase Letter
Value | Count | Frequency (%)
a | 1 | 50.0%
e | 1 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 2810 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 29 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
. | 5 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 5 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 5 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 3 | 100.0%

Letter Number
Value | Count | Frequency (%)
[not preserved] | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 7329 | 57.8%
Common | 5337 | 42.1%
Latin | 16 | 0.1%

Most frequent character per script

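Hangul
Value | Count | Frequency (%)
[not preserved] | 615 | 8.4%
[not preserved] | 545 | 7.4%
[not preserved] | 519 | 7.1%
[not preserved] | 518 | 7.1%
[not preserved] | 510 | 7.0%
[not preserved] | 507 | 6.9%
[not preserved] | 507 | 6.9%
[not preserved] | 504 | 6.9%
[not preserved] | 503 | 6.9%
[not preserved] | 479 | 6.5%
Other values (206) | 2122 | 29.0%

Common
Value | Count | Frequency (%)
(space) | 2810 | 52.7%
1 | 578 | 10.8%
2 | 300 | 5.6%
3 | 274 | 5.1%
5 | 257 | 4.8%
4 | 213 | 4.0%
6 | 178 | 3.3%
7 | 178 | 3.3%
9 | 172 | 3.2%
0 | 167 | 3.1%
Other values (6) | 210 | 3.9%

Latin
Value | Count | Frequency (%)
S | 4 | 25.0%
G | 3 | 18.8%
B | 3 | 18.8%
K | 2 | 12.5%
a | 1 | 6.2%
[not preserved] | 1 | 6.2%
L | 1 | 6.2%
e | 1 | 6.2%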

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 7329 | 57.8%
ASCII | 5352 | 42.2%
Number Forms | 1 | < 0.1%

Most frequent character per block

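ASCII
Value | Count | Frequency (%)
(space) | 2810 | 52.5%
1 | 578 | 10.8%
2 | 300 | 5.6%
3 | 274 | 5.1%
5 | 257 | 4.8%
4 | 213 | 4.0%
6 | 178 | 3.3%
7 | 178 | 3.3%
9 | 172 | 3.2%
0 | 167 | 3.1%
Other values (13) | 225 | 4.2%

Hangul
Value | Count | Frequency (%)
[not preserved] | 615 | 8.4%
[not preserved] | 545 | 7.4%
[not preserved] | 519 | 7.1%
[not preserved] | 518 | 7.1%
[not preserved] | 510 | 7.0%
[not preserved] | 507 | 6.9%
[not preserved] | 507 | 6.9%
[not preserved] | 504 | 6.9%
[not preserved] | 503 | 6.9%
[not preserved] | 479 | 6.5%
Other values (206) | 2122 | 29.0%

Number Forms
Value | Count | Frequency (%)
[not preserved] | 1 | 100.0%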

업소도로명주소 (business road-name address)

Distinct: 581
Distinct (%): 93.7%
Missing: 0
Missing (%): 0.0%
Memory size: 5.0 KiB

Length

Max length: 67
Median length: 50
Mean length: 27.753226
Min length: 1

Characters and Unicode

Total characters: 17207
Distinct characters: 275
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 572
Unique (%): 92.3%

Sample

1st row: 부산광역시 남구 수영로266번길 36. 1층 (대연동)
2nd row: 부산광역시 남구 수영로 256-1 (대연동)
3rd row: 부산광역시 남구 수영로 286 (대연동)
4th row: 부산광역시 남구 유엔평화로70번길 24. 1층 (대연동)
5th row: 부산광역시 남구 수영로 271. 1층 (대연동)

Value | Count | Frequency (%)
부산광역시 | 588 | 17.4%
남구 | 588 | 17.4%
대연동 | 199 | 5.9%
용호동 | 123 | 3.6%
문현동 | 111 | 3.3%
1층 | 89 | 2.6%
감만동 | 55 | 1.6%
수영로 | 42 | 1.2%
101호 | 30 | 0.9%
우암동 | 27 | 0.8%
Other values (739) | 1525 | 45.2%

Most occurring characters

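Value | Count | Frequency (%)
(space) | 2908 | 16.9%
1 | 821 | 4.8%
[not preserved] | 767 | 4.5%
[not preserved] | 624 | 3.6%
[not preserved] | 607 | 3.5%
[not preserved] | 600 | 3.5%
[not preserved] | 596 | 3.5%
[not preserved] | 593 | 3.4%
[not preserved] | 593 | 3.4%
[not preserved] | 590 | 3.4%
Other values (265) | 8508 | 49.4%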

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 9864 | 57.3%
Space Separator | 2908 | 16.9%
Decimal Number | 2775 | 16.1%
Close Punctuation | 589 | 3.4%
Open Punctuation | 589 | 3.4%
Other Punctuation | 351 | 2.0%
Dash Punctuation | 91 | 0.5%
Uppercase Letter | 28 | 0.2%
Lowercase Letter | 6 | < 0.1%
Math Symbol | 5 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[not preserved] | 767 | 7.8%
[not preserved] | 624 | 6.3%
[not preserved] | 607 | 6.2%
[not preserved] | 600 | 6.1%
[not preserved] | 596 | 6.0%
[not preserved] | 593 | 6.0%
[not preserved] | 593 | 6.0%
[not preserved] | 590 | 6.0%
[not preserved] | 589 | 6.0%
[not preserved] | 318 | 3.2%
Other values (236) | 3987 | 40.4%

Decimal Number
Value | Count | Frequency (%)
1 | 821 | 29.6%
2 | 351 | 12.6%
0 | 292 | 10.5%
4 | 229 | 8.3%
3 | 223 | 8.0%
5 | 215 | 7.7%
6 | 200 | 7.2%
9 | 166 | 6.0%
7 | 148 | 5.3%
8 | 130 | 4.7%

Uppercase Letter
Value | Count | Frequency (%)
S | 6 | 21.4%
A | 5 | 17.9%
B | 5 | 17.9%
G | 4 | 14.3%
K | 4 | 14.3%
L | 2 | 7.1%
C | 1 | 3.6%
F | 1 | 3.6%

Lowercase Letter
Value | Count | Frequency (%)
s | 2 | 33.3%
k | 2 | 33.3%
a | 1 | 16.7%
e | 1 | 16.7%

Space Separator
Value | Count | Frequency (%)
(space) | 2908 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 589 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 589 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
. | 351 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 91 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 5 | 100.0%

Letter Number
Value | Count | Frequency (%)
[not preserved] | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 9864 | 57.3%
Common | 7308 | 42.5%
Latin | 35 | 0.2%

Most frequent character per script

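Hangul
Value | Count | Frequency (%)
[not preserved] | 767 | 7.8%
[not preserved] | 624 | 6.3%
[not preserved] | 607 | 6.2%
[not preserved] | 600 | 6.1%
[not preserved] | 596 | 6.0%
[not preserved] | 593 | 6.0%
[not preserved] | 593 | 6.0%
[not preserved] | 590 | 6.0%
[not preserved] | 589 | 6.0%
[not preserved] | 318 | 3.2%
Other values (236) | 3987 | 40.4%

Common
Value | Count | Frequency (%)
(space) | 2908 | 39.8%
1 | 821 | 11.2%
) | 589 | 8.1%
( | 589 | 8.1%
. | 351 | 4.8%
2 | 351 | 4.8%
0 | 292 | 4.0%
4 | 229 | 3.1%
3 | 223 | 3.1%
5 | 215 | 2.9%
Other values (6) | 740 | 10.1%

Latin
Value | Count | Frequency (%)
S | 6 | 17.1%
A | 5 | 14.3%
B | 5 | 14.3%
G | 4 | 11.4%
K | 4 | 11.4%
s | 2 | 5.7%
k | 2 | 5.7%
L | 2 | 5.7%
a | 1 | 2.9%
[not preserved] | 1 | 2.9%
Other values (3) | 3 | 8.6%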

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 9864 | 57.3%
ASCII | 7342 | 42.7%
Number Forms | 1 | < 0.1%

Most frequent character per block

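ASCII
Value | Count | Frequency (%)
(space) | 2908 | 39.6%
1 | 821 | 11.2%
) | 589 | 8.0%
( | 589 | 8.0%
. | 351 | 4.8%
2 | 351 | 4.8%
0 | 292 | 4.0%
4 | 229 | 3.1%
3 | 223 | 3.0%
5 | 215 | 2.9%
Other values (18) | 774 | 10.5%

Hangul
Value | Count | Frequency (%)
[not preserved] | 767 | 7.8%
[not preserved] | 624 | 6.3%
[not preserved] | 607 | 6.2%
[not preserved] | 600 | 6.1%
[not preserved] | 596 | 6.0%
[not preserved] | 593 | 6.0%
[not preserved] | 593 | 6.0%
[not preserved] | 590 | 6.0%
[not preserved] | 589 | 6.0%
[not preserved] | 318 | 3.2%
Other values (236) | 3987 | 40.4%

Number Forms
Value | Count | Frequency (%)
[not preserved] | 1 | 100.0%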

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly spot patterns in data completion.
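
Nullity by column amounts to counting, for each column, the cells that hold no value. The sketch below does this in plain Python and also tallies blank strings separately, because whether a blank cell counts as missing depends on how the data was loaded; that distinction is an assumption made here, not something the report states. The helper name, column names, and rows are all hypothetical.

```python
def nullity_by_column(rows, columns):
    """Return {column: (missing_count, blank_count)} for a list of dict rows."""
    summary = {}
    for col in columns:
        # A cell is treated as missing only when it is None (or the key is absent).
        missing = sum(1 for row in rows if row.get(col) is None)
        # Blank strings are non-null values, so they are tallied separately.
        blank = sum(
            1 for row in rows
            if isinstance(row.get(col), str) and row[col].strip() == ""
        )
        summary[col] = (missing, blank)
    return summary

# Hypothetical rows for illustration only; not taken from the dataset.
columns = ["name", "lot_address", "road_address"]
rows = [
    {"name": "A", "lot_address": "lot 1", "road_address": "road 1"},
    {"name": "B", "lot_address": "lot 2", "road_address": " "},
    {"name": "C", "lot_address": "lot 3", "road_address": None},
]

nullity = nullity_by_column(rows, columns)
print(nullity)
```

Keeping the two counts apart makes it easy to see when a column has no nulls yet still contains cells with no usable content.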

Sample

Index | 업소명 (business name) | 업소지번주소 (lot-number address) | 업소도로명주소 (road-name address)
0 | 리치마트 | 부산광역시 남구 대연동 369번지 5호 | 부산광역시 남구 수영로266번길 36. 1층 (대연동)
1 | 세븐일레븐 부산대연대로점 | 부산광역시 남구 대연동 334번지 2호 | 부산광역시 남구 수영로 256-1 (대연동)
2 | 지에스(GS)25 대연유일점 | 부산광역시 남구 대연동 76번지 1호 | 부산광역시 남구 수영로 286 (대연동)
3 | 이마트24 남구문화점 | 부산광역시 남구 대연동 966번지 3호 | 부산광역시 남구 유엔평화로70번길 24. 1층 (대연동)
4 | 후크전자담배(경성대점) | 부산광역시 남구 대연동 383번지 10호 | 부산광역시 남구 수영로 271. 1층 (대연동)
5 | 재하마트(용호점) | 부산광역시 남구 용호동 128번지 4호 용호동 시장 | 부산광역시 남구 용호로90번길 30. 1층 (용호동)
6 | 세븐일레븐 문현대로점 | 부산광역시 남구 문현동 406번지 20호 | 부산광역시 남구 수영로 17. 1층 (문현동)
7 | 세븐일레븐 부산우암이편한점 | 부산광역시 남구 우암동 99번지 4호 새길카센타 | 부산광역시 남구 유엔로 34 (우암동)
8 | CJFW(주)부산항터미널감만점 | 부산광역시 남구 감만동 624번지 한진해운 | 부산광역시 남구 북항로 105. 운영건물(본관)동 4층 (감만동)
9 | 제이씨마트(J.C마트) | 부산광역시 남구 용당동 580번지 10호 용당창조아파트 | 부산광역시 남구 유엔평화로 136-1. 1층 101호 (용당동. 용당창조아파트)

Index | 업소명 (business name) | 업소지번주소 (lot-number address) | 업소도로명주소 (road-name address)
610 | 담배 | 부산광역시 남구 문현동 545번지 27 호 |
611 | 담배 | 부산광역시 남구 감만동 29번지 36 호 | 부산광역시 남구 석포로58번길 47 (감만동)
612 | 담배 | 부산광역시 남구 감만동 36호 | 부산광역시 남구 석포로 65 (감만동)
613 | 담배 | 부산광역시 남구 용호동 545번지 2 호 | 부산광역시 남구 용주로 26 (용호동)
614 | 담배 | 부산광역시 남구 대연동 1509번지 3 호 | 부산광역시 남구 진남로 58 (대연동)
615 | 담배 | 부산광역시 남구 문현동 220번지 5호 | 부산광역시 남구 수영로39번가길 42-1 (문현동)
616 | 담배 | 부산광역시 남구 용당동 134호 |
617 | 금석당 | 부산광역시 남구 대연동 1416번지 1 호 | 부산광역시 남구 못골로 75 (대연동)
618 | 담배 | 부산광역시 남구 대연동 376번지 10 호 |
619 | 담배 | 부산광역시 남구 대연동 368번지 4 호 | 부산광역시 남구 수영로266번길 40 (대연동)

Duplicate rows

Most frequently occurring

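Index | 업소명 (business name) | 업소지번주소 (lot-number address) | 업소도로명주소 (road-name address) | # duplicates
0 | 국군복지단 부산지원본부 | 부산광역시 남구 감만동 418호 | 부산광역시 남구 무민사로 17 (감만동) | 2
1 | 담배 | 부산광역시 남구 문현동 635번지 6 호 | 부산광역시 남구 전포대로 74 (문현동) | 2
2 | 창성슈퍼마켓 | 부산광역시 남구 감만동 249번지 3호 | 부산광역시 남구 양지골로55번길 14 (감만동) | 2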
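
A duplicate row here is one whose values match another row in every column. As a minimal sketch of how such groups can be found, assuming each row is represented as a tuple of its three text fields, the example below counts identical tuples with `collections.Counter`. The `duplicate_groups` helper and the sample rows are hypothetical and are not taken from the profiling tool or the dataset.

```python
from collections import Counter

def duplicate_groups(rows):
    """Return {row: occurrences} for rows that occur more than once."""
    counts = Counter(rows)
    return {row: n for row, n in counts.items() if n > 1}

# Hypothetical (name, lot address, road address) tuples for illustration only.
rows = [
    ("shop A", "lot 1", "road 1"),
    ("shop B", "lot 2", "road 2"),
    ("shop A", "lot 1", "road 1"),
    ("shop C", "lot 3", "road 3"),
]

groups = duplicate_groups(rows)
print(groups)

# Share of duplicate rows reported in the overview: 3 of 620 observations.
duplicate_share = f"{3 / 620:.1%}"
print(duplicate_share)
```

Tuples are used because they are hashable, which lets `Counter` group whole rows without comparing columns one at a time.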