Overview

Dataset statistics

Number of variables3
Number of observations1077
Missing cells2
Missing cells (%)0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory26.4 KiB
Average record size in memory25.1 B

Variable types

Numeric1
Text2

Dataset

Description부산광역시해운대구_담배소매업현황_20230922
Author부산광역시 해운대구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3075756

Reproduction

Analysis started2023-12-10 17:30:41.858352
Analysis finished2023-12-10 17:30:43.796042
Duration1.94 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

NO
Real number (ℝ)

Distinct1076
Distinct (%)100.0%
Missing1
Missing (%)0.1%
Infinite0
Infinite (%)0.0%
Mean538.5
Minimum1
Maximum1076
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size9.6 KiB
2023-12-11T02:30:44.048884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile54.75
Q1269.75
median538.5
Q3807.25
95-th percentile1022.25
Maximum1076
Range1075
Interquartile range (IQR)537.5

Descriptive statistics

Standard deviation310.75875
Coefficient of variation (CV)0.57708217
Kurtosis-1.2
Mean538.5
Median Absolute Deviation (MAD)269
Skewness0
Sum579426
Variance96571
MonotonicityStrictly increasing
2023-12-11T02:30:44.505241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
725 1
 
0.1%
711 1
 
0.1%
712 1
 
0.1%
713 1
 
0.1%
714 1
 
0.1%
715 1
 
0.1%
716 1
 
0.1%
717 1
 
0.1%
718 1
 
0.1%
Other values (1066) 1066
99.0%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1076 1
0.1%
1075 1
0.1%
1074 1
0.1%
1073 1
0.1%
1072 1
0.1%
1071 1
0.1%
1070 1
0.1%
1069 1
0.1%
1068 1
0.1%
1067 1
0.1%
Distinct1050
Distinct (%)97.6%
Missing1
Missing (%)0.1%
Memory size8.5 KiB
2023-12-11T02:30:45.274488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length20
Mean length7.4711896
Min length2

Characters and Unicode

Total characters8039
Distinct characters516
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1030 ?
Unique (%)95.7%

Sample

1st row세븐일레븐 송정그린웨이점
2nd row지에스(GS)25 트럼프월드점
3rd row지에스(GS)25 부산영산대점
4th row카페 가까운 슈퍼
5th row씨유재송동부점
ValueCountFrequency (%)
씨유 66
 
4.5%
세븐일레븐 42
 
2.9%
이마트24 40
 
2.7%
지에스(gs)25 33
 
2.3%
지에스25 25
 
1.7%
gs25 20
 
1.4%
주)코리아세븐 17
 
1.2%
홈플러스(주 7
 
0.5%
빅세일마트 6
 
0.4%
해운대점 6
 
0.4%
Other values (1116) 1199
82.1%
2023-12-11T02:30:46.289178image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
431
 
5.4%
386
 
4.8%
212
 
2.6%
200
 
2.5%
193
 
2.4%
2 178
 
2.2%
172
 
2.1%
172
 
2.1%
160
 
2.0%
157
 
2.0%
Other values (506) 5778
71.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6740
83.8%
Decimal Number 403
 
5.0%
Space Separator 386
 
4.8%
Uppercase Letter 237
 
2.9%
Close Punctuation 120
 
1.5%
Open Punctuation 120
 
1.5%
Lowercase Letter 25
 
0.3%
Other Punctuation 6
 
0.1%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
431
 
6.4%
212
 
3.1%
200
 
3.0%
193
 
2.9%
172
 
2.6%
172
 
2.6%
160
 
2.4%
157
 
2.3%
134
 
2.0%
116
 
1.7%
Other values (453) 4793
71.1%
Uppercase Letter
ValueCountFrequency (%)
S 85
35.9%
G 76
32.1%
C 9
 
3.8%
R 7
 
3.0%
T 6
 
2.5%
A 6
 
2.5%
L 5
 
2.1%
U 5
 
2.1%
E 5
 
2.1%
H 5
 
2.1%
Other values (14) 28
 
11.8%
Lowercase Letter
ValueCountFrequency (%)
o 6
24.0%
l 4
16.0%
a 2
 
8.0%
c 2
 
8.0%
f 2
 
8.0%
e 2
 
8.0%
u 1
 
4.0%
p 1
 
4.0%
b 1
 
4.0%
i 1
 
4.0%
Other values (3) 3
12.0%
Decimal Number
ValueCountFrequency (%)
2 178
44.2%
5 118
29.3%
4 56
 
13.9%
1 24
 
6.0%
3 9
 
2.2%
0 7
 
1.7%
9 4
 
1.0%
6 3
 
0.7%
7 2
 
0.5%
8 2
 
0.5%
Other Punctuation
ValueCountFrequency (%)
. 5
83.3%
? 1
 
16.7%
Space Separator
ValueCountFrequency (%)
386
100.0%
Close Punctuation
ValueCountFrequency (%)
) 120
100.0%
Open Punctuation
ValueCountFrequency (%)
( 120
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6740
83.8%
Common 1037
 
12.9%
Latin 262
 
3.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
431
 
6.4%
212
 
3.1%
200
 
3.0%
193
 
2.9%
172
 
2.6%
172
 
2.6%
160
 
2.4%
157
 
2.3%
134
 
2.0%
116
 
1.7%
Other values (453) 4793
71.1%
Latin
ValueCountFrequency (%)
S 85
32.4%
G 76
29.0%
C 9
 
3.4%
R 7
 
2.7%
T 6
 
2.3%
o 6
 
2.3%
A 6
 
2.3%
L 5
 
1.9%
U 5
 
1.9%
E 5
 
1.9%
Other values (27) 52
19.8%
Common
ValueCountFrequency (%)
386
37.2%
2 178
17.2%
) 120
 
11.6%
( 120
 
11.6%
5 118
 
11.4%
4 56
 
5.4%
1 24
 
2.3%
3 9
 
0.9%
0 7
 
0.7%
. 5
 
0.5%
Other values (6) 14
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6740
83.8%
ASCII 1299
 
16.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
431
 
6.4%
212
 
3.1%
200
 
3.0%
193
 
2.9%
172
 
2.6%
172
 
2.6%
160
 
2.4%
157
 
2.3%
134
 
2.0%
116
 
1.7%
Other values (453) 4793
71.1%
ASCII
ValueCountFrequency (%)
386
29.7%
2 178
13.7%
) 120
 
9.2%
( 120
 
9.2%
5 118
 
9.1%
S 85
 
6.5%
G 76
 
5.9%
4 56
 
4.3%
1 24
 
1.8%
C 9
 
0.7%
Other values (43) 127
 
9.8%
Distinct1064
Distinct (%)98.8%
Missing0
Missing (%)0.0%
Memory size8.5 KiB
2023-12-11T02:30:47.110365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length58
Median length49
Mean length32.29805
Min length1

Characters and Unicode

Total characters34785
Distinct characters348
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1052 ?
Unique (%)97.7%

Sample

1st row부산광역시 해운대구 해운대로 1140-17. 1층 (송정동)
2nd row부산광역시 해운대구 센텀동로 9. 106호 (우동. 트럼프월드센텀아파트)
3rd row부산광역시 해운대구 반송순환로 142. 영산대학교 해운대캠퍼스 L동 지하1층 (반송동)
4th row부산광역시 해운대구 재반로226번길 72. 1층 103호 (반여동. 현대일성아파트)
5th row부산광역시 해운대구 해운대로153번길 20. 스포츠댄스 1층 (재송동)
ValueCountFrequency (%)
부산광역시 1076
 
17.0%
해운대구 1074
 
16.9%
1층 196
 
3.1%
우동 189
 
3.0%
중동 163
 
2.6%
반여동 155
 
2.4%
좌동 102
 
1.6%
반송동 97
 
1.5%
재송동 95
 
1.5%
송정동 60
 
0.9%
Other values (1328) 3130
49.4%
2023-12-11T02:30:48.167034image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5814
 
16.7%
1 1680
 
4.8%
1414
 
4.1%
1389
 
4.0%
1357
 
3.9%
1261
 
3.6%
1137
 
3.3%
1131
 
3.3%
1107
 
3.2%
1104
 
3.2%
Other values (338) 17391
50.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 20659
59.4%
Space Separator 5814
 
16.7%
Decimal Number 5579
 
16.0%
Close Punctuation 849
 
2.4%
Open Punctuation 849
 
2.4%
Other Punctuation 770
 
2.2%
Dash Punctuation 201
 
0.6%
Uppercase Letter 52
 
0.1%
Math Symbol 7
 
< 0.1%
Lowercase Letter 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1414
 
6.8%
1389
 
6.7%
1357
 
6.6%
1261
 
6.1%
1137
 
5.5%
1131
 
5.5%
1107
 
5.4%
1104
 
5.3%
1088
 
5.3%
1083
 
5.2%
Other values (302) 8588
41.6%
Uppercase Letter
ValueCountFrequency (%)
B 16
30.8%
A 9
17.3%
C 7
13.5%
E 5
 
9.6%
P 4
 
7.7%
T 3
 
5.8%
I 2
 
3.8%
S 2
 
3.8%
H 2
 
3.8%
O 1
 
1.9%
Decimal Number
ValueCountFrequency (%)
1 1680
30.1%
2 769
13.8%
0 584
 
10.5%
3 499
 
8.9%
5 394
 
7.1%
4 385
 
6.9%
6 351
 
6.3%
7 338
 
6.1%
8 298
 
5.3%
9 281
 
5.0%
Other Punctuation
ValueCountFrequency (%)
. 746
96.9%
, 19
 
2.5%
/ 2
 
0.3%
@ 1
 
0.1%
? 1
 
0.1%
# 1
 
0.1%
Lowercase Letter
ValueCountFrequency (%)
c 2
40.0%
e 1
20.0%
h 1
20.0%
k 1
20.0%
Space Separator
ValueCountFrequency (%)
5814
100.0%
Close Punctuation
ValueCountFrequency (%)
) 849
100.0%
Open Punctuation
ValueCountFrequency (%)
( 849
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 201
100.0%
Math Symbol
ValueCountFrequency (%)
~ 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 20659
59.4%
Common 14069
40.4%
Latin 57
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1414
 
6.8%
1389
 
6.7%
1357
 
6.6%
1261
 
6.1%
1137
 
5.5%
1131
 
5.5%
1107
 
5.4%
1104
 
5.3%
1088
 
5.3%
1083
 
5.2%
Other values (302) 8588
41.6%
Common
ValueCountFrequency (%)
5814
41.3%
1 1680
 
11.9%
) 849
 
6.0%
( 849
 
6.0%
2 769
 
5.5%
. 746
 
5.3%
0 584
 
4.2%
3 499
 
3.5%
5 394
 
2.8%
4 385
 
2.7%
Other values (11) 1500
 
10.7%
Latin
ValueCountFrequency (%)
B 16
28.1%
A 9
15.8%
C 7
12.3%
E 5
 
8.8%
P 4
 
7.0%
T 3
 
5.3%
I 2
 
3.5%
S 2
 
3.5%
c 2
 
3.5%
H 2
 
3.5%
Other values (5) 5
 
8.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 20659
59.4%
ASCII 14126
40.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5814
41.2%
1 1680
 
11.9%
) 849
 
6.0%
( 849
 
6.0%
2 769
 
5.4%
. 746
 
5.3%
0 584
 
4.1%
3 499
 
3.5%
5 394
 
2.8%
4 385
 
2.7%
Other values (26) 1557
 
11.0%
Hangul
ValueCountFrequency (%)
1414
 
6.8%
1389
 
6.7%
1357
 
6.6%
1261
 
6.1%
1137
 
5.5%
1131
 
5.5%
1107
 
5.4%
1104
 
5.3%
1088
 
5.3%
1083
 
5.2%
Other values (302) 8588
41.6%

Interactions

2023-12-11T02:30:42.772674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-11T02:30:43.057951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T02:30:43.362741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T02:30:43.641738image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

NO업소명업소도로명주소
01세븐일레븐 송정그린웨이점부산광역시 해운대구 해운대로 1140-17. 1층 (송정동)
12지에스(GS)25 트럼프월드점부산광역시 해운대구 센텀동로 9. 106호 (우동. 트럼프월드센텀아파트)
23지에스(GS)25 부산영산대점부산광역시 해운대구 반송순환로 142. 영산대학교 해운대캠퍼스 L동 지하1층 (반송동)
34카페 가까운 슈퍼부산광역시 해운대구 재반로226번길 72. 1층 103호 (반여동. 현대일성아파트)
45씨유재송동부점부산광역시 해운대구 해운대로153번길 20. 스포츠댄스 1층 (재송동)
56큐마트부산광역시 해운대구 반여로 64. 1층 (반여동)
67이마트24 해운대캐슬스타점부산광역시 해운대구 중동2로34번길 6. 1층 (중동)
78지에스25 해운중동역점부산광역시 해운대구 좌동순환로 5. 이안해운대 101동 105호 (중동)
89씨유해운대상록점부산광역시 해운대구 세실로 7. 상가동 비4호 (좌동. 상록아파트)
910씨유해운대이안점부산광역시 해운대구 좌동순환로15번길 9-8 (좌동)
NO업소명업소도로명주소
10671068해운대슈퍼부산광역시 해운대구 우동 539번지 2호
10681069김정자부산광역시 해운대구 구남로21번길 13-1 (우동)
10691070조성규부산광역시 해운대구 해운대해변로209번길 22 (우동)
10701071정종철부산광역시 해운대구 우동 호 대우마리나 1 106
10711072세기문화사부산광역시 해운대구 좌동 7호
10721073삼거리식당부산광역시 해운대구해운대로 633
10731074럭키마트부산광역시 해운대구 좌동순환로 275 상가
10741075김차향부산광역시 해운대구 재송동 호 삼익 1 328
10751076김인호부산광역시 해운대구 재반로84번길 84-1
1076<NA><NA>