Overview

Dataset statistics

Number of variables: 3
Number of observations: 598
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 2
Duplicate rows (%): 0.3%
Total size in memory: 14.1 KiB
Average record size in memory: 24.2 B

Variable types

Text: 3

Dataset

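Description: Data on the designation status of tobacco retailers in Nam-gu, Busan Metropolitan City. It provides the business name (업소명), the business lot-number address (업소지번주소), and the business road-name address (업소도로명주소).
URL: https://www.data.go.kr/data/3081544/fileData.do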

Alerts

Dataset has 2 (0.3%) duplicate rows (Duplicates)

Reproduction

Analysis started: 2023-12-12 13:02:10.464136
Analysis finished: 2023-12-12 13:02:11.048062
Duration: 0.58 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

업소명 (business name)

Distinct: 493
Distinct (%): 82.4%
Missing: 0
Missing (%): 0.0%
Memory size: 4.8 KiB

Length

Max length: 26
Median length: 18
Mean length: 7.451505
Min length: 1
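
The length figures are summary statistics over the character count of each value. As a minimal sketch using only the Python standard library (`length_summary` is a hypothetical helper written for illustration, not part of ydata-profiling), they could be computed like this:

```python
from statistics import mean, median


def length_summary(values):
    """Return max, median, mean and min of the string lengths in `values`."""
    lengths = [len(v) for v in values]
    return {
        "max": max(lengths),
        "median": median(lengths),
        "mean": mean(lengths),
        "min": min(lengths),
    }


# Illustrative input only; the real column has 598 values.
summary = length_summary(["a", "bcd", "ef"])
print(summary)
```

The mean can be cross-checked against other figures in this report: 4456 total characters over 598 observations gives 4456 / 598 ≈ 7.4515, which matches the reported mean length.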

Characters and Unicode

Total characters: 4456
Distinct characters: 395
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
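
To illustrate how such character properties can be tallied, here is a short sketch using Python's standard `unicodedata` module. `category_counts` is a hypothetical helper, and the two-letter codes it returns are Unicode general-category abbreviations (for example `Lo` = Other Letter, `Lu` = Uppercase Letter, `Nd` = Decimal Number, `Zs` = Space Separator, `Ps`/`Pe` = Open/Close Punctuation); it is not necessarily how the profiler computes its tables.

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count characters by Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts


# Illustrative input: Latin letters, digits, a space, brackets and one Hangul syllable.
counts = category_counts(["GS25 (a)", "\uc9c0"])
print(counts.most_common())
```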

Unique

Unique: 477
Unique (%): 79.8%
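
The two counts differ in meaning. Assuming "Distinct" is the number of different values and "Unique" is the number of values that occur exactly once (my reading of the report, not confirmed by it), a minimal sketch with a hypothetical `distinct_and_unique` helper would be:

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique


# "a" repeats, so it is distinct but not unique.
print(distinct_and_unique(["a", "a", "b", "c"]))
```

Both percentages in this report are taken over the 598 observations: 493 / 598 ≈ 82.4% and 477 / 598 ≈ 79.8%.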

Sample

1st row: 씨유 부산오륙도점
2nd row: 지에스25 용호파크시티점
3rd row: 지에스25 대연유엔파워점
4th row: 착한 과일슈퍼
5th row: 지에스25 용호해링턴점

Value | Count | Frequency (%)
담배 | 87 | 10.4%
씨유 | 54 | 6.5%
세븐일레븐 | 32 | 3.8%
이마트24 | 23 | 2.8%
지에스(gs)25 | 19 | 2.3%
gs25 | 17 | 2.0%
주)코리아세븐 | 8 | 1.0%
지에스25 | 6 | 0.7%
용호점 | 6 | 0.7%
경성대점 | 4 | 0.5%
Other values (529) | 577 | 69.3%
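
The counts here are per word, not per row: they sum to 833 tokens, and 87 / 833 ≈ 10.4%. A rough sketch of such a tally follows, assuming values are lowercased and split on whitespace (suggested by entries such as `gs25`, but an assumption about the profiler); `word_frequencies` is a hypothetical helper.

```python
from collections import Counter


def word_frequencies(values, top=10):
    """Return (token, count, percent of all tokens) for the most common tokens."""
    tokens = [tok for value in values for tok in value.lower().split()]
    total = len(tokens)
    return [
        (tok, n, 100 * n / total)
        for tok, n in Counter(tokens).most_common(top)
    ]


# Illustrative input only.
for tok, n, pct in word_frequencies(["GS25 A", "gs25 B"]):
    print(tok, n, f"{pct:.1f}%")
```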

Most occurring characters

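Value | Count | Frequency (%)
(not captured) | 254 | 5.7%
(space) | 239 | 5.4%
(not captured) | 147 | 3.3%
(not captured) | 133 | 3.0%
(not captured) | 126 | 2.8%
2 | 104 | 2.3%
(not captured) | 99 | 2.2%
(not captured) | 97 | 2.2%
(not captured) | 96 | 2.2%
(not captured) | 90 | 2.0%
Other values (385) | 3071 | 68.9%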

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 3687 | 82.7%
Space Separator | 239 | 5.4%
Decimal Number | 216 | 4.8%
Uppercase Letter | 176 | 3.9%
Open Punctuation | 63 | 1.4%
Close Punctuation | 63 | 1.4%
Lowercase Letter | 7 | 0.2%
Dash Punctuation | 2 | < 0.1%
Other Punctuation | 2 | < 0.1%
Other Symbol | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not captured) | 254 | 6.9%
(not captured) | 147 | 4.0%
(not captured) | 133 | 3.6%
(not captured) | 126 | 3.4%
(not captured) | 99 | 2.7%
(not captured) | 97 | 2.6%
(not captured) | 96 | 2.6%
(not captured) | 90 | 2.4%
(not captured) | 79 | 2.1%
(not captured) | 78 | 2.1%
Other values (344) | 2488 | 67.5%

Uppercase Letter
Value | Count | Frequency (%)
S | 64 | 36.4%
G | 58 | 33.0%
E | 7 | 4.0%
C | 7 | 4.0%
R | 6 | 3.4%
K | 4 | 2.3%
J | 3 | 1.7%
N | 3 | 1.7%
V | 3 | 1.7%
A | 3 | 1.7%
Other values (10) | 18 | 10.2%

Decimal Number
Value | Count | Frequency (%)
2 | 104 | 48.1%
5 | 69 | 31.9%
4 | 28 | 13.0%
1 | 5 | 2.3%
3 | 4 | 1.9%
0 | 3 | 1.4%
9 | 1 | 0.5%
7 | 1 | 0.5%
8 | 1 | 0.5%

Lowercase Letter
Value | Count | Frequency (%)
e | 2 | 28.6%
s | 1 | 14.3%
k | 1 | 14.3%
f | 1 | 14.3%
a | 1 | 14.3%
c | 1 | 14.3%

Space Separator
Value | Count | Frequency (%)
(space) | 239 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 63 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 63 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 2 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
. | 2 | 100.0%

Other Symbol
Value | Count | Frequency (%)
(not captured) | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 3688 | 82.8%
Common | 585 | 13.1%
Latin | 183 | 4.1%

Most frequent character per script

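Hangul
Value | Count | Frequency (%)
(not captured) | 254 | 6.9%
(not captured) | 147 | 4.0%
(not captured) | 133 | 3.6%
(not captured) | 126 | 3.4%
(not captured) | 99 | 2.7%
(not captured) | 97 | 2.6%
(not captured) | 96 | 2.6%
(not captured) | 90 | 2.4%
(not captured) | 79 | 2.1%
(not captured) | 78 | 2.1%
Other values (345) | 2489 | 67.5%

Latin
Value | Count | Frequency (%)
S | 64 | 35.0%
G | 58 | 31.7%
E | 7 | 3.8%
C | 7 | 3.8%
R | 6 | 3.3%
K | 4 | 2.2%
J | 3 | 1.6%
N | 3 | 1.6%
V | 3 | 1.6%
A | 3 | 1.6%
Other values (16) | 25 | 13.7%

Common
Value | Count | Frequency (%)
(space) | 239 | 40.9%
2 | 104 | 17.8%
5 | 69 | 11.8%
( | 63 | 10.8%
) | 63 | 10.8%
4 | 28 | 4.8%
1 | 5 | 0.9%
3 | 4 | 0.7%
0 | 3 | 0.5%
- | 2 | 0.3%
Other values (4) | 5 | 0.9%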

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 3687 | 82.7%
ASCII | 768 | 17.2%
None | 1 | < 0.1%
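
A block is a contiguous range of code points, so a block breakdown amounts to a range lookup. The sketch below hard-codes three standard Unicode ranges (Basic Latin, Number Forms, Hangul Syllables) and reuses the labels that appear in this report. The `BLOCKS` table and `block_of` helper are hypothetical, and mapping the report's "Hangul" label to the Hangul Syllables range is an assumption.

```python
from collections import Counter

# (first code point, last code point, label). Labels mirror this report;
# the mapping of "Hangul" to the Hangul Syllables block is an assumption.
BLOCKS = [
    (0x0000, 0x007F, "ASCII"),         # Basic Latin
    (0x2150, 0x218F, "Number Forms"),
    (0xAC00, 0xD7AF, "Hangul"),        # Hangul Syllables
]


def block_of(ch):
    """Return the label of the block containing `ch`, or "None" if unlisted."""
    cp = ord(ch)
    for start, end, name in BLOCKS:
        if start <= cp <= end:
            return name
    return "None"


def block_counts(values):
    """Count characters per block label across all strings."""
    return Counter(block_of(ch) for value in values for ch in value)


print(block_counts(["GS25 \uc9c0\uc5d0\uc2a4"]))
```

Any character outside the listed ranges falls through to "None" in this sketch.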

Most frequent character per block

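Hangul
Value | Count | Frequency (%)
(not captured) | 254 | 6.9%
(not captured) | 147 | 4.0%
(not captured) | 133 | 3.6%
(not captured) | 126 | 3.4%
(not captured) | 99 | 2.7%
(not captured) | 97 | 2.6%
(not captured) | 96 | 2.6%
(not captured) | 90 | 2.4%
(not captured) | 79 | 2.1%
(not captured) | 78 | 2.1%
Other values (344) | 2488 | 67.5%

ASCII
Value | Count | Frequency (%)
(space) | 239 | 31.1%
2 | 104 | 13.5%
5 | 69 | 9.0%
S | 64 | 8.3%
( | 63 | 8.2%
) | 63 | 8.2%
G | 58 | 7.6%
4 | 28 | 3.6%
E | 7 | 0.9%
C | 7 | 0.9%
Other values (30) | 66 | 8.6%

None
Value | Count | Frequency (%)
(not captured) | 1 | 100.0%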

업소지번주소 (business lot-number address)

Distinct: 513
Distinct (%): 85.8%
Missing: 0
Missing (%): 0.0%
Memory size: 4.8 KiB

Length

Max length: 56
Median length: 44
Mean length: 21.665552
Min length: 1

Characters and Unicode

Total characters: 12956
Distinct characters: 280
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 497
Unique (%): 83.1%

Sample

1st row: 부산광역시 남구 용호동 944 오륙도 에스케이뷰 아파트
2nd row: 부산광역시 남구 용호동 549-1 데시앙 해링턴 플레이스 파크시티
3rd row: 부산광역시 남구 대연동 867-35
4th row: 부산광역시 남구 용호동 176-30 엘지메트로시티
5th row: 부산광역시 남구 용호동 549-1 데시앙 해링턴 플레이스 파크시티

Value | Count | Frequency (%)
부산광역시 | 527 | 18.4%
남구 | 527 | 18.4%
대연동 | 166 | 5.8%
(not captured) | 142 | 5.0%
용호동 | 122 | 4.3%
문현동 | 104 | 3.6%
감만동 | 49 | 1.7%
1호 | 28 | 1.0%
우암동 | 26 | 0.9%
용당동 | 21 | 0.7%
Other values (753) | 1149 | 40.2%

Most occurring characters

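Value | Count | Frequency (%)
(space) | 2715 | 21.0%
1 | 579 | 4.5%
(not captured) | 573 | 4.4%
(not captured) | 547 | 4.2%
(not captured) | 547 | 4.2%
(not captured) | 543 | 4.2%
(not captured) | 530 | 4.1%
(not captured) | 530 | 4.1%
(not captured) | 528 | 4.1%
(not captured) | 528 | 4.1%
Other values (270) | 5336 | 41.2%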

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 7529 | 58.1%
Space Separator | 2715 | 21.0%
Decimal Number | 2538 | 19.6%
Dash Punctuation | 142 | 1.1%
Uppercase Letter | 14 | 0.1%
Close Punctuation | 5 | < 0.1%
Open Punctuation | 5 | < 0.1%
Math Symbol | 3 | < 0.1%
Other Punctuation | 2 | < 0.1%
Lowercase Letter | 2 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not captured) | 573 | 7.6%
(not captured) | 547 | 7.3%
(not captured) | 547 | 7.3%
(not captured) | 543 | 7.2%
(not captured) | 530 | 7.0%
(not captured) | 530 | 7.0%
(not captured) | 528 | 7.0%
(not captured) | 528 | 7.0%
(not captured) | 526 | 7.0%
(not captured) | 373 | 5.0%
Other values (246) | 2304 | 30.6%

Decimal Number
Value | Count | Frequency (%)
1 | 579 | 22.8%
2 | 298 | 11.7%
3 | 283 | 11.2%
5 | 265 | 10.4%
4 | 224 | 8.8%
7 | 182 | 7.2%
6 | 181 | 7.1%
9 | 181 | 7.1%
8 | 174 | 6.9%
0 | 171 | 6.7%

Uppercase Letter
Value | Count | Frequency (%)
S | 5 | 35.7%
G | 4 | 28.6%
B | 2 | 14.3%
K | 2 | 14.3%
L | 1 | 7.1%

Lowercase Letter
Value | Count | Frequency (%)
a | 1 | 50.0%
e | 1 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 2715 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 142 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 5 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 5 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 3 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
. | 2 | 100.0%

Letter Number
Value | Count | Frequency (%)
(not captured) | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 7529 | 58.1%
Common | 5410 | 41.8%
Latin | 17 | 0.1%

Most frequent character per script

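Hangul
Value | Count | Frequency (%)
(not captured) | 573 | 7.6%
(not captured) | 547 | 7.3%
(not captured) | 547 | 7.3%
(not captured) | 543 | 7.2%
(not captured) | 530 | 7.0%
(not captured) | 530 | 7.0%
(not captured) | 528 | 7.0%
(not captured) | 528 | 7.0%
(not captured) | 526 | 7.0%
(not captured) | 373 | 5.0%
Other values (246) | 2304 | 30.6%

Common
Value | Count | Frequency (%)
(space) | 2715 | 50.2%
1 | 579 | 10.7%
2 | 298 | 5.5%
3 | 283 | 5.2%
5 | 265 | 4.9%
4 | 224 | 4.1%
7 | 182 | 3.4%
6 | 181 | 3.3%
9 | 181 | 3.3%
8 | 174 | 3.2%
Other values (6) | 328 | 6.1%

Latin
Value | Count | Frequency (%)
S | 5 | 29.4%
G | 4 | 23.5%
B | 2 | 11.8%
K | 2 | 11.8%
(not captured) | 1 | 5.9%
a | 1 | 5.9%
L | 1 | 5.9%
e | 1 | 5.9%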

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 7529 | 58.1%
ASCII | 5426 | 41.9%
Number Forms | 1 | < 0.1%

Most frequent character per block

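ASCII
Value | Count | Frequency (%)
(space) | 2715 | 50.0%
1 | 579 | 10.7%
2 | 298 | 5.5%
3 | 283 | 5.2%
5 | 265 | 4.9%
4 | 224 | 4.1%
7 | 182 | 3.4%
6 | 181 | 3.3%
9 | 181 | 3.3%
8 | 174 | 3.2%
Other values (13) | 344 | 6.3%

Hangul
Value | Count | Frequency (%)
(not captured) | 573 | 7.6%
(not captured) | 547 | 7.3%
(not captured) | 547 | 7.3%
(not captured) | 543 | 7.2%
(not captured) | 530 | 7.0%
(not captured) | 530 | 7.0%
(not captured) | 528 | 7.0%
(not captured) | 528 | 7.0%
(not captured) | 526 | 7.0%
(not captured) | 373 | 5.0%
Other values (246) | 2304 | 30.6%

Number Forms
Value | Count | Frequency (%)
(not captured) | 1 | 100.0%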

업소도로명주소 (business road-name address)

Distinct: 566
Distinct (%): 94.6%
Missing: 0
Missing (%): 0.0%
Memory size: 4.8 KiB

Length

Max length: 71
Median length: 51
Mean length: 28.899666
Min length: 1

Characters and Unicode

Total characters: 17282
Distinct characters: 294
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 560
Unique (%): 93.6%

Sample

1st row: 부산광역시 남구 오륙도로 85. 상가1동 107. 108호 (용호동. 오륙도 에스케이뷰 아파트)
2nd row: 부산광역시 남구 용주로 36. 303동 106호 (용호동. 데시앙 해링턴 플레이스 파크시티)
3rd row: 부산광역시 남구 유엔로169번길 10. 1층 (대연동)
4th row: 부산광역시 남구 분포로 111. 1002동 111호 (용호동. 엘지메트로시티)
5th row: 부산광역시 남구 용주로 36. 302동 103호 (용호동. 데시앙 해링턴 플레이스 파크시티)

Value | Count | Frequency (%)
부산광역시 | 570 | 16.7%
남구 | 570 | 16.7%
대연동 | 191 | 5.6%
용호동 | 133 | 3.9%
1층 | 126 | 3.7%
문현동 | 109 | 3.2%
감만동 | 52 | 1.5%
수영로 | 40 | 1.2%
101호 | 30 | 0.9%
용호로 | 27 | 0.8%
Other values (764) | 1563 | 45.8%

Most occurring characters

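Value | Count | Frequency (%)
(space) | 2942 | 17.0%
1 | 853 | 4.9%
(not captured) | 763 | 4.4%
(not captured) | 602 | 3.5%
(not captured) | 596 | 3.4%
(not captured) | 589 | 3.4%
(not captured) | 588 | 3.4%
(not captured) | 579 | 3.4%
(not captured) | 575 | 3.3%
) | 572 | 3.3%
Other values (284) | 8623 | 49.9%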

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 9887 | 57.2%
Space Separator | 2942 | 17.0%
Decimal Number | 2797 | 16.2%
Close Punctuation | 572 | 3.3%
Open Punctuation | 572 | 3.3%
Other Punctuation | 390 | 2.3%
Dash Punctuation | 85 | 0.5%
Uppercase Letter | 27 | 0.2%
Math Symbol | 5 | < 0.1%
Lowercase Letter | 4 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not captured) | 763 | 7.7%
(not captured) | 602 | 6.1%
(not captured) | 596 | 6.0%
(not captured) | 589 | 6.0%
(not captured) | 588 | 5.9%
(not captured) | 579 | 5.9%
(not captured) | 575 | 5.8%
(not captured) | 572 | 5.8%
(not captured) | 571 | 5.8%
(not captured) | 335 | 3.4%
Other values (255) | 4117 | 41.6%

Decimal Number
Value | Count | Frequency (%)
1 | 853 | 30.5%
2 | 345 | 12.3%
0 | 313 | 11.2%
3 | 232 | 8.3%
4 | 219 | 7.8%
6 | 201 | 7.2%
5 | 200 | 7.2%
9 | 159 | 5.7%
7 | 153 | 5.5%
8 | 122 | 4.4%

Uppercase Letter
Value | Count | Frequency (%)
A | 7 | 25.9%
B | 7 | 25.9%
S | 5 | 18.5%
K | 3 | 11.1%
G | 3 | 11.1%
L | 1 | 3.7%
C | 1 | 3.7%

Lowercase Letter
Value | Count | Frequency (%)
a | 1 | 25.0%
s | 1 | 25.0%
k | 1 | 25.0%
e | 1 | 25.0%

Other Punctuation
Value | Count | Frequency (%)
. | 389 | 99.7%
, | 1 | 0.3%

Space Separator
Value | Count | Frequency (%)
(space) | 2942 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 572 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 572 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 85 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 5 | 100.0%

Letter Number
Value | Count | Frequency (%)
(not captured) | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 9887 | 57.2%
Common | 7363 | 42.6%
Latin | 32 | 0.2%

Most frequent character per script

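Hangul
Value | Count | Frequency (%)
(not captured) | 763 | 7.7%
(not captured) | 602 | 6.1%
(not captured) | 596 | 6.0%
(not captured) | 589 | 6.0%
(not captured) | 588 | 5.9%
(not captured) | 579 | 5.9%
(not captured) | 575 | 5.8%
(not captured) | 572 | 5.8%
(not captured) | 571 | 5.8%
(not captured) | 335 | 3.4%
Other values (255) | 4117 | 41.6%

Common
Value | Count | Frequency (%)
(space) | 2942 | 40.0%
1 | 853 | 11.6%
) | 572 | 7.8%
( | 572 | 7.8%
. | 389 | 5.3%
2 | 345 | 4.7%
0 | 313 | 4.3%
3 | 232 | 3.2%
4 | 219 | 3.0%
6 | 201 | 2.7%
Other values (7) | 725 | 9.8%

Latin
Value | Count | Frequency (%)
A | 7 | 21.9%
B | 7 | 21.9%
S | 5 | 15.6%
K | 3 | 9.4%
G | 3 | 9.4%
a | 1 | 3.1%
L | 1 | 3.1%
s | 1 | 3.1%
(not captured) | 1 | 3.1%
k | 1 | 3.1%
Other values (2) | 2 | 6.2%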

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 9887 | 57.2%
ASCII | 7394 | 42.8%
Number Forms | 1 | < 0.1%

Most frequent character per block

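ASCII
Value | Count | Frequency (%)
(space) | 2942 | 39.8%
1 | 853 | 11.5%
) | 572 | 7.7%
( | 572 | 7.7%
. | 389 | 5.3%
2 | 345 | 4.7%
0 | 313 | 4.2%
3 | 232 | 3.1%
4 | 219 | 3.0%
6 | 201 | 2.7%
Other values (18) | 756 | 10.2%

Hangul
Value | Count | Frequency (%)
(not captured) | 763 | 7.7%
(not captured) | 602 | 6.1%
(not captured) | 596 | 6.0%
(not captured) | 589 | 6.0%
(not captured) | 588 | 5.9%
(not captured) | 579 | 5.9%
(not captured) | 575 | 5.8%
(not captured) | 572 | 5.8%
(not captured) | 571 | 5.8%
(not captured) | 335 | 3.4%
Other values (255) | 4117 | 41.6%

Number Forms
Value | Count | Frequency (%)
(not captured) | 1 | 100.0%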

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
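
A per-column nullity count can be sketched in a few lines. The report lists 0 missing cells, yet some sample rows (for example rows 589 and 590) show no road-name address and the minimum length is 1, so the sketch below also counts whitespace-only strings. That those cells hold blank strings is an assumption about the source file, and `blank_counts` is a hypothetical helper.

```python
def blank_counts(rows, columns):
    """Count cells per column that are None or contain only whitespace."""
    counts = {col: 0 for col in columns}
    for row in rows:
        for col in columns:
            value = row.get(col)
            if value is None or not str(value).strip():
                counts[col] += 1
    return counts


# Illustrative rows only; column names are shortened for the example.
rows = [
    {"name": "x", "road": " "},
    {"name": None, "road": "y"},
]
print(blank_counts(rows, ["name", "road"]))
```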

Sample

Index | 업소명 | 업소지번주소 | 업소도로명주소
0 | 씨유 부산오륙도점 | 부산광역시 남구 용호동 944 오륙도 에스케이뷰 아파트 | 부산광역시 남구 오륙도로 85. 상가1동 107. 108호 (용호동. 오륙도 에스케이뷰 아파트)
1 | 지에스25 용호파크시티점 | 부산광역시 남구 용호동 549-1 데시앙 해링턴 플레이스 파크시티 | 부산광역시 남구 용주로 36. 303동 106호 (용호동. 데시앙 해링턴 플레이스 파크시티)
2 | 지에스25 대연유엔파워점 | 부산광역시 남구 대연동 867-35 | 부산광역시 남구 유엔로169번길 10. 1층 (대연동)
3 | 착한 과일슈퍼 | 부산광역시 남구 용호동 176-30 엘지메트로시티 | 부산광역시 남구 분포로 111. 1002동 111호 (용호동. 엘지메트로시티)
4 | 지에스25 용호해링턴점 | 부산광역시 남구 용호동 549-1 데시앙 해링턴 플레이스 파크시티 | 부산광역시 남구 용주로 36. 302동 103호 (용호동. 데시앙 해링턴 플레이스 파크시티)
5 | 지에스(GS)25부경삼광점 | 부산광역시 남구 대연동 561-2 금산빌딩 | 부산광역시 남구 용소로64번길 3. 금산빌딩 1층 (대연동)
6 | 지에스(GS)25못골제일점 | 부산광역시 남구 대연동 1465-9 | 부산광역시 남구 못골번영로40번길 25. 1층 (대연동)
7 | 재승전기철물 | 부산광역시 남구 대연동 1085-4 | 부산광역시 남구 석포로91번길 64. 1층 (대연동)
8 | ㈜코리아세븐 부산용호빌리브센트로점 | 부산광역시 남구 용호동 958 빌리브센트로 | 부산광역시 남구 분포로 61. 빌리브센트로 A동 1층 119호 (용호동)
9 | 스펀지마트 | 부산광역시 남구 대연동 537-12 | 부산광역시 남구 용소로 54. 1층 (대연동)

Index | 업소명 | 업소지번주소 | 업소도로명주소
588 | 담배 | 부산광역시 남구 감만동 205번지 29 호 | 부산광역시 남구 양지골로113번길 32 (감만동)
589 | 담배 | 부산광역시 남구 감만동 103호 | (blank)
590 | 담배 | 부산광역시 남구 문현동 545번지 27 호 | (blank)
591 | 담배 | 부산광역시 남구 감만동 29번지 36 호 | 부산광역시 남구 석포로58번길 47 (감만동)
592 | 담배 | 부산광역시 남구 감만동 36호 | 부산광역시 남구 석포로 65 (감만동)
593 | 담배 | 부산광역시 남구 대연동 1509번지 3 호 | 부산광역시 남구 진남로 58 (대연동)
594 | 담배 | 부산광역시 남구 용당동 134호 | (blank)
595 | 금석당 | 부산광역시 남구 대연동 1416번지 1 호 | 부산광역시 남구 못골로 75 (대연동)
596 | 담배 | 부산광역시 남구 대연동 376번지 10 호 | (blank)
597 | 담배 | 부산광역시 남구 대연동 368번지 4 호 | 부산광역시 남구 수영로266번길 40 (대연동)

Duplicate rows

Most frequently occurring

Index | 업소명 | 업소지번주소 | 업소도로명주소 | # duplicates
0 | 담배 | 부산광역시 남구 문현동 635번지 6 호 | 부산광역시 남구 전포대로 74 (문현동) | 2
1 | 창성슈퍼마켓 | 부산광역시 남구 감만동 249번지 3호 | 부산광역시 남구 양지골로55번길 14 (감만동) | 2
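
Duplicate rows are rows whose values match in every column. A minimal sketch of how they could be found with the standard library follows; `duplicate_rows` is a hypothetical helper and the input is illustrative, not the dataset itself.

```python
from collections import Counter


def duplicate_rows(rows):
    """Return {row: occurrences} for rows that appear more than once."""
    counts = Counter(tuple(row) for row in rows)
    return {row: n for row, n in counts.items() if n > 1}


# Each row is (name, lot-number address, road-name address); values are placeholders.
rows = [
    ("shop A", "lot 1", "road 1"),
    ("shop A", "lot 1", "road 1"),
    ("shop B", "lot 2", "road 2"),
]
print(duplicate_rows(rows))
```

The percentage in the overview is taken over all observations: 2 / 598 ≈ 0.3%.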