Overview

Dataset statistics

Number of variables4
Number of observations1074
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory34.7 KiB
Average record size in memory33.1 B

Variable types

Numeric1
Text3

Dataset

Description부산광역시해운대구_담배소매업현황_20220921
Author부산광역시 해운대구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3075756

Alerts

번호 has unique valuesUnique

Reproduction

Analysis started2023-12-10 17:30:51.045001
Analysis finished2023-12-10 17:30:52.914014
Duration1.87 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct1074
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean537.5
Minimum1
Maximum1074
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size9.6 KiB
2023-12-11T02:30:53.119879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile54.65
Q1269.25
median537.5
Q3805.75
95-th percentile1020.35
Maximum1074
Range1073
Interquartile range (IQR)536.5

Descriptive statistics

Standard deviation310.1814
Coefficient of variation (CV)0.57708167
Kurtosis-1.2
Mean537.5
Median Absolute Deviation (MAD)268.5
Skewness0
Sum577275
Variance96212.5
MonotonicityStrictly increasing
2023-12-11T02:30:53.457410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
723 1
 
0.1%
709 1
 
0.1%
710 1
 
0.1%
711 1
 
0.1%
712 1
 
0.1%
713 1
 
0.1%
714 1
 
0.1%
715 1
 
0.1%
716 1
 
0.1%
Other values (1064) 1064
99.1%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1074 1
0.1%
1073 1
0.1%
1072 1
0.1%
1071 1
0.1%
1070 1
0.1%
1069 1
0.1%
1068 1
0.1%
1067 1
0.1%
1066 1
0.1%
1065 1
0.1%
Distinct1014
Distinct (%)94.4%
Missing0
Missing (%)0.0%
Memory size8.5 KiB
2023-12-11T02:30:54.170693image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length20
Mean length7.2858473
Min length1

Characters and Unicode

Total characters7825
Distinct characters512
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique977 ?
Unique (%)91.0%

Sample

1st row진진슈퍼
2nd row장희빈
3rd row엄마밥집유통
4th row씨유 해운대마리안느점
5th row카페051 해운대자이점
ValueCountFrequency (%)
씨유 63
 
4.4%
이마트24 35
 
2.5%
세븐일레븐 35
 
2.5%
지에스(gs)25 28
 
2.0%
gs25 24
 
1.7%
주)코리아세븐 17
 
1.2%
지에스25 16
 
1.1%
더마트 8
 
0.6%
해운대점 7
 
0.5%
마트 7
 
0.5%
Other values (1074) 1186
83.2%
2023-12-11T02:30:55.225024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
401
 
5.1%
362
 
4.6%
208
 
2.7%
* 207
 
2.6%
203
 
2.6%
198
 
2.5%
2 171
 
2.2%
166
 
2.1%
160
 
2.0%
152
 
1.9%
Other values (502) 5597
71.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6378
81.5%
Decimal Number 385
 
4.9%
Space Separator 362
 
4.6%
Uppercase Letter 223
 
2.8%
Other Punctuation 213
 
2.7%
Close Punctuation 118
 
1.5%
Open Punctuation 118
 
1.5%
Lowercase Letter 25
 
0.3%
Dash Punctuation 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
401
 
6.3%
208
 
3.3%
203
 
3.2%
198
 
3.1%
166
 
2.6%
160
 
2.5%
152
 
2.4%
149
 
2.3%
125
 
2.0%
107
 
1.7%
Other values (448) 4509
70.7%
Uppercase Letter
ValueCountFrequency (%)
S 84
37.7%
G 77
34.5%
C 10
 
4.5%
U 6
 
2.7%
L 5
 
2.2%
A 5
 
2.2%
R 5
 
2.2%
T 4
 
1.8%
K 3
 
1.3%
I 3
 
1.3%
Other values (14) 21
 
9.4%
Lowercase Letter
ValueCountFrequency (%)
o 6
24.0%
l 4
16.0%
c 2
 
8.0%
a 2
 
8.0%
f 2
 
8.0%
e 2
 
8.0%
h 1
 
4.0%
t 1
 
4.0%
i 1
 
4.0%
u 1
 
4.0%
Other values (3) 3
12.0%
Decimal Number
ValueCountFrequency (%)
2 171
44.4%
5 112
29.1%
4 49
 
12.7%
1 27
 
7.0%
3 6
 
1.6%
0 5
 
1.3%
6 5
 
1.3%
9 5
 
1.3%
7 3
 
0.8%
8 2
 
0.5%
Other Punctuation
ValueCountFrequency (%)
* 207
97.2%
. 5
 
2.3%
? 1
 
0.5%
Space Separator
ValueCountFrequency (%)
362
100.0%
Close Punctuation
ValueCountFrequency (%)
) 118
100.0%
Open Punctuation
ValueCountFrequency (%)
( 118
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6378
81.5%
Common 1199
 
15.3%
Latin 248
 
3.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
401
 
6.3%
208
 
3.3%
203
 
3.2%
198
 
3.1%
166
 
2.6%
160
 
2.5%
152
 
2.4%
149
 
2.3%
125
 
2.0%
107
 
1.7%
Other values (448) 4509
70.7%
Latin
ValueCountFrequency (%)
S 84
33.9%
G 77
31.0%
C 10
 
4.0%
U 6
 
2.4%
o 6
 
2.4%
L 5
 
2.0%
A 5
 
2.0%
R 5
 
2.0%
T 4
 
1.6%
l 4
 
1.6%
Other values (27) 42
16.9%
Common
ValueCountFrequency (%)
362
30.2%
* 207
17.3%
2 171
14.3%
) 118
 
9.8%
( 118
 
9.8%
5 112
 
9.3%
4 49
 
4.1%
1 27
 
2.3%
3 6
 
0.5%
0 5
 
0.4%
Other values (7) 24
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6378
81.5%
ASCII 1447
 
18.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
401
 
6.3%
208
 
3.3%
203
 
3.2%
198
 
3.1%
166
 
2.6%
160
 
2.5%
152
 
2.4%
149
 
2.3%
125
 
2.0%
107
 
1.7%
Other values (448) 4509
70.7%
ASCII
ValueCountFrequency (%)
362
25.0%
* 207
14.3%
2 171
11.8%
) 118
 
8.2%
( 118
 
8.2%
5 112
 
7.7%
S 84
 
5.8%
G 77
 
5.3%
4 49
 
3.4%
1 27
 
1.9%
Other values (44) 122
 
8.4%
Distinct913
Distinct (%)85.0%
Missing0
Missing (%)0.0%
Memory size8.5 KiB
2023-12-11T02:30:55.819513image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length51
Median length43
Mean length23.766294
Min length1

Characters and Unicode

Total characters25525
Distinct characters327
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique900 ?
Unique (%)83.8%

Sample

1st row부산광역시 해운대구 반송동 709-33
2nd row부산광역시 해운대구 송정동 200-1
3rd row부산광역시 해운대구 우동 1498 트럼프월드센텀아파트
4th row부산광역시 해운대구 중동 1400-24 호텔마리안느
5th row부산광역시 해운대구 우동 1536 해운대자이2차
ValueCountFrequency (%)
부산광역시 925
18.9%
해운대구 925
18.9%
중동 160
 
3.3%
우동 158
 
3.2%
156
 
3.2%
반여동 140
 
2.9%
재송동 119
 
2.4%
좌동 108
 
2.2%
반송동 99
 
2.0%
송정동 61
 
1.2%
Other values (1333) 2053
41.9%
2023-12-11T02:30:56.734081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5355
21.0%
1 1418
 
5.6%
1008
 
3.9%
983
 
3.9%
973
 
3.8%
970
 
3.8%
966
 
3.8%
960
 
3.8%
948
 
3.7%
933
 
3.7%
Other values (317) 11011
43.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 14918
58.4%
Space Separator 5355
 
21.0%
Decimal Number 4932
 
19.3%
Dash Punctuation 269
 
1.1%
Uppercase Letter 23
 
0.1%
Other Punctuation 18
 
0.1%
Close Punctuation 5
 
< 0.1%
Open Punctuation 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1008
 
6.8%
983
 
6.6%
973
 
6.5%
970
 
6.5%
966
 
6.5%
960
 
6.4%
948
 
6.4%
933
 
6.3%
928
 
6.2%
927
 
6.2%
Other values (287) 5322
35.7%
Uppercase Letter
ValueCountFrequency (%)
B 6
26.1%
A 4
17.4%
C 2
 
8.7%
I 2
 
8.7%
S 2
 
8.7%
O 1
 
4.3%
R 1
 
4.3%
P 1
 
4.3%
H 1
 
4.3%
T 1
 
4.3%
Other values (2) 2
 
8.7%
Decimal Number
ValueCountFrequency (%)
1 1418
28.8%
2 588
11.9%
0 462
 
9.4%
4 462
 
9.4%
3 415
 
8.4%
9 377
 
7.6%
5 350
 
7.1%
8 296
 
6.0%
7 288
 
5.8%
6 276
 
5.6%
Other Punctuation
ValueCountFrequency (%)
. 12
66.7%
@ 3
 
16.7%
/ 2
 
11.1%
? 1
 
5.6%
Space Separator
ValueCountFrequency (%)
5355
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 269
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 14918
58.4%
Common 10584
41.5%
Latin 23
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1008
 
6.8%
983
 
6.6%
973
 
6.5%
970
 
6.5%
966
 
6.5%
960
 
6.4%
948
 
6.4%
933
 
6.3%
928
 
6.2%
927
 
6.2%
Other values (287) 5322
35.7%
Common
ValueCountFrequency (%)
5355
50.6%
1 1418
 
13.4%
2 588
 
5.6%
0 462
 
4.4%
4 462
 
4.4%
3 415
 
3.9%
9 377
 
3.6%
5 350
 
3.3%
8 296
 
2.8%
7 288
 
2.7%
Other values (8) 573
 
5.4%
Latin
ValueCountFrequency (%)
B 6
26.1%
A 4
17.4%
C 2
 
8.7%
I 2
 
8.7%
S 2
 
8.7%
O 1
 
4.3%
R 1
 
4.3%
P 1
 
4.3%
H 1
 
4.3%
T 1
 
4.3%
Other values (2) 2
 
8.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 14918
58.4%
ASCII 10607
41.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5355
50.5%
1 1418
 
13.4%
2 588
 
5.5%
0 462
 
4.4%
4 462
 
4.4%
3 415
 
3.9%
9 377
 
3.6%
5 350
 
3.3%
8 296
 
2.8%
7 288
 
2.7%
Other values (20) 596
 
5.6%
Hangul
ValueCountFrequency (%)
1008
 
6.8%
983
 
6.6%
973
 
6.5%
970
 
6.5%
966
 
6.5%
960
 
6.4%
948
 
6.4%
933
 
6.3%
928
 
6.2%
927
 
6.2%
Other values (287) 5322
35.7%
Distinct832
Distinct (%)77.5%
Missing0
Missing (%)0.0%
Memory size8.5 KiB
2023-12-11T02:30:57.175266image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length58
Median length51
Mean length26.790503
Min length1

Characters and Unicode

Total characters28773
Distinct characters333
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique829 ?
Unique (%)77.2%

Sample

1st row부산광역시 해운대구 아랫반송로21번길 23-28. 1층 (반송동)
2nd row부산광역시 해운대구 송정중앙로21번길 24-4. 1층 (송정동)
3rd row부산광역시 해운대구 센텀동로 9. 지하2층 201호 일부호 (우동. 트럼프월드센텀아파트)
4th row부산광역시 해운대구 해운대해변로 310. 호텔마리안느 1층 (중동)
5th row부산광역시 해운대구 해운대로469번길 110. 1층 103호 (우동. 해운대자이2차)
ValueCountFrequency (%)
부산광역시 833
 
16.2%
해운대구 833
 
16.2%
1층 181
 
3.5%
우동 164
 
3.2%
반여동 141
 
2.7%
중동 138
 
2.7%
반송동 92
 
1.8%
좌동 86
 
1.7%
재송동 79
 
1.5%
해운대로 56
 
1.1%
Other values (1118) 2541
49.4%
2023-12-11T02:30:57.905654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4780
 
16.6%
1 1352
 
4.7%
1123
 
3.9%
1110
 
3.9%
1101
 
3.8%
1076
 
3.7%
891
 
3.1%
886
 
3.1%
861
 
3.0%
858
 
3.0%
Other values (323) 14735
51.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 16979
59.0%
Space Separator 4780
 
16.6%
Decimal Number 4425
 
15.4%
Open Punctuation 837
 
2.9%
Close Punctuation 837
 
2.9%
Other Punctuation 722
 
2.5%
Dash Punctuation 144
 
0.5%
Uppercase Letter 46
 
0.2%
Math Symbol 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1123
 
6.6%
1110
 
6.5%
1101
 
6.5%
1076
 
6.3%
891
 
5.2%
886
 
5.2%
861
 
5.1%
858
 
5.1%
843
 
5.0%
838
 
4.9%
Other values (292) 7392
43.5%
Uppercase Letter
ValueCountFrequency (%)
B 11
23.9%
A 8
17.4%
C 7
15.2%
P 4
 
8.7%
E 4
 
8.7%
I 3
 
6.5%
T 2
 
4.3%
H 2
 
4.3%
S 2
 
4.3%
R 1
 
2.2%
Other values (2) 2
 
4.3%
Decimal Number
ValueCountFrequency (%)
1 1352
30.6%
2 600
13.6%
0 499
 
11.3%
3 401
 
9.1%
5 314
 
7.1%
4 295
 
6.7%
6 278
 
6.3%
7 252
 
5.7%
8 230
 
5.2%
9 204
 
4.6%
Other Punctuation
ValueCountFrequency (%)
. 718
99.4%
/ 2
 
0.3%
? 1
 
0.1%
@ 1
 
0.1%
Space Separator
ValueCountFrequency (%)
4780
100.0%
Open Punctuation
ValueCountFrequency (%)
( 837
100.0%
Close Punctuation
ValueCountFrequency (%)
) 837
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 144
100.0%
Math Symbol
ValueCountFrequency (%)
~ 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 16979
59.0%
Common 11748
40.8%
Latin 46
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1123
 
6.6%
1110
 
6.5%
1101
 
6.5%
1076
 
6.3%
891
 
5.2%
886
 
5.2%
861
 
5.1%
858
 
5.1%
843
 
5.0%
838
 
4.9%
Other values (292) 7392
43.5%
Common
ValueCountFrequency (%)
4780
40.7%
1 1352
 
11.5%
( 837
 
7.1%
) 837
 
7.1%
. 718
 
6.1%
2 600
 
5.1%
0 499
 
4.2%
3 401
 
3.4%
5 314
 
2.7%
4 295
 
2.5%
Other values (9) 1115
 
9.5%
Latin
ValueCountFrequency (%)
B 11
23.9%
A 8
17.4%
C 7
15.2%
P 4
 
8.7%
E 4
 
8.7%
I 3
 
6.5%
T 2
 
4.3%
H 2
 
4.3%
S 2
 
4.3%
R 1
 
2.2%
Other values (2) 2
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 16979
59.0%
ASCII 11794
41.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4780
40.5%
1 1352
 
11.5%
( 837
 
7.1%
) 837
 
7.1%
. 718
 
6.1%
2 600
 
5.1%
0 499
 
4.2%
3 401
 
3.4%
5 314
 
2.7%
4 295
 
2.5%
Other values (21) 1161
 
9.8%
Hangul
ValueCountFrequency (%)
1123
 
6.6%
1110
 
6.5%
1101
 
6.5%
1076
 
6.3%
891
 
5.2%
886
 
5.2%
861
 
5.1%
858
 
5.1%
843
 
5.0%
838
 
4.9%
Other values (292) 7392
43.5%

Interactions

2023-12-11T02:30:52.301060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-11T02:30:52.619210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T02:30:52.838332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호업소명업소지번주소업소도로명주소
01진진슈퍼부산광역시 해운대구 반송동 709-33부산광역시 해운대구 아랫반송로21번길 23-28. 1층 (반송동)
12장희빈부산광역시 해운대구 송정동 200-1부산광역시 해운대구 송정중앙로21번길 24-4. 1층 (송정동)
23엄마밥집유통부산광역시 해운대구 우동 1498 트럼프월드센텀아파트부산광역시 해운대구 센텀동로 9. 지하2층 201호 일부호 (우동. 트럼프월드센텀아파트)
34씨유 해운대마리안느점부산광역시 해운대구 중동 1400-24 호텔마리안느부산광역시 해운대구 해운대해변로 310. 호텔마리안느 1층 (중동)
45카페051 해운대자이점부산광역시 해운대구 우동 1536 해운대자이2차부산광역시 해운대구 해운대로469번길 110. 1층 103호 (우동. 해운대자이2차)
56세븐일레븐 해운대월드마크점부산광역시 해운대구 우동 1435-2 대우월드마크해운대부산광역시 해운대구 마린시티1로 137. 대우월드마크해운대 106호 (우동)
67세븐일레븐 반여선수촌2호점부산광역시 해운대구 반여동 1638 아시아선수촌아파트부산광역시 해운대구 선수촌로 122. 분산상가동 1층 101호 (반여동. 아시아선수촌아파트)
78하카전자담배 해운대역 직영점부산광역시 해운대구 우동 529-2부산광역시 해운대구 해운대로 624 (우동)
89황금로또부산광역시 해운대구 좌동 985부산광역시 해운대구 양운로 104. 1층 (좌동)
910씨유 재송센텀그린점부산광역시 해운대구 반여동 1291-1219 델피하우스생맥주부산광역시 해운대구 해운대로61번길 61. 1층 (반여동)
번호업소명업소지번주소업소도로명주소
10641065권*선부산광역시 해운대구 해운대로 631-3 (우동)
10651066김*동부산광역시 해운대구 우동 539번지 2호
10661067더마트 장산부산광역시 해운대구 우동 613번지 6 호부산광역시 해운대구 구남로21번길 13-1 (우동)
10671068해운대슈퍼부산광역시 해운대구 우동 693번지 4 호부산광역시 해운대구 해운대해변로209번길 22 (우동)
10681069김*자부산광역시 해운대구 우동 호 대우마리나 1 106
10691070조*규부산광역시 해운대구 좌동 7호
10701071정*철부산광역시 해운대구 우동 594번지 5 호
10711072세기문화사부산광역시 해운대구 좌동 호 대우2차상가
10721073삼거리식당부산광역시 해운대구 재송동 호 삼익 1 328
10731074럭키마트부산광역시 해운대구 재송동 1123-22호