Overview

Dataset statistics

Number of variables4
Number of observations758
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory24.6 KiB
Average record size in memory33.2 B

Variable types

Numeric1
Text3

Dataset

Description부산광역시_강서구_담배소매인현황_20230516
Author부산광역시 강서구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15033280

Alerts

연번 has unique valuesUnique

Reproduction

Analysis started2023-12-10 17:02:35.522264
Analysis finished2023-12-10 17:02:36.770644
Duration1.25 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct758
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean379.5
Minimum1
Maximum758
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size6.8 KiB
2023-12-11T02:02:36.903872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile38.85
Q1190.25
median379.5
Q3568.75
95-th percentile720.15
Maximum758
Range757
Interquartile range (IQR)378.5

Descriptive statistics

Standard deviation218.96004
Coefficient of variation (CV)0.57696981
Kurtosis-1.2
Mean379.5
Median Absolute Deviation (MAD)189.5
Skewness0
Sum287661
Variance47943.5
MonotonicityStrictly increasing
2023-12-11T02:02:37.140967image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
511 1
 
0.1%
502 1
 
0.1%
503 1
 
0.1%
504 1
 
0.1%
505 1
 
0.1%
506 1
 
0.1%
507 1
 
0.1%
508 1
 
0.1%
509 1
 
0.1%
Other values (748) 748
98.7%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
758 1
0.1%
757 1
0.1%
756 1
0.1%
755 1
0.1%
754 1
0.1%
753 1
0.1%
752 1
0.1%
751 1
0.1%
750 1
0.1%
749 1
0.1%
Distinct703
Distinct (%)92.7%
Missing0
Missing (%)0.0%
Memory size6.1 KiB
2023-12-11T02:02:37.615976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length20
Mean length7.6952507
Min length1

Characters and Unicode

Total characters5833
Distinct characters443
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique687 ?
Unique (%)90.6%

Sample

1st row씨유 명지국제중흥점
2nd row지에스(GS)25 명지제나우스점
3rd row씨유 명지국제협성점
4th row카페051 명지중흥부영점
5th row참마트(강동점)
ValueCountFrequency (%)
씨유 52
 
5.1%
이마트24 38
 
3.7%
세븐일레븐 28
 
2.7%
지에스25 20
 
2.0%
15
 
1.5%
구내식당 11
 
1.1%
잡화점 10
 
1.0%
지에스(gs)25 9
 
0.9%
명지점 6
 
0.6%
gs25 6
 
0.6%
Other values (759) 825
80.9%
2023-12-11T02:02:38.347662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
301
 
5.2%
279
 
4.8%
175
 
3.0%
154
 
2.6%
143
 
2.5%
119
 
2.0%
115
 
2.0%
112
 
1.9%
2 108
 
1.9%
105
 
1.8%
Other values (433) 4222
72.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4981
85.4%
Space Separator 301
 
5.2%
Decimal Number 225
 
3.9%
Uppercase Letter 110
 
1.9%
Open Punctuation 78
 
1.3%
Close Punctuation 78
 
1.3%
Lowercase Letter 37
 
0.6%
Other Punctuation 16
 
0.3%
Dash Punctuation 4
 
0.1%
Other Symbol 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
279
 
5.6%
175
 
3.5%
154
 
3.1%
143
 
2.9%
119
 
2.4%
115
 
2.3%
112
 
2.2%
105
 
2.1%
102
 
2.0%
101
 
2.0%
Other values (386) 3576
71.8%
Uppercase Letter
ValueCountFrequency (%)
S 31
28.2%
G 28
25.5%
C 7
 
6.4%
R 7
 
6.4%
K 6
 
5.5%
H 5
 
4.5%
U 4
 
3.6%
D 4
 
3.6%
E 4
 
3.6%
B 3
 
2.7%
Other values (7) 11
 
10.0%
Lowercase Letter
ValueCountFrequency (%)
e 7
18.9%
n 5
13.5%
o 3
8.1%
i 3
8.1%
k 3
8.1%
a 3
8.1%
r 2
 
5.4%
h 2
 
5.4%
g 2
 
5.4%
c 2
 
5.4%
Other values (5) 5
13.5%
Decimal Number
ValueCountFrequency (%)
2 108
48.0%
5 56
24.9%
4 42
 
18.7%
1 9
 
4.0%
0 5
 
2.2%
8 3
 
1.3%
3 1
 
0.4%
6 1
 
0.4%
Other Punctuation
ValueCountFrequency (%)
. 15
93.8%
! 1
 
6.2%
Space Separator
ValueCountFrequency (%)
301
100.0%
Open Punctuation
ValueCountFrequency (%)
( 78
100.0%
Close Punctuation
ValueCountFrequency (%)
) 78
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4984
85.4%
Common 702
 
12.0%
Latin 147
 
2.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
279
 
5.6%
175
 
3.5%
154
 
3.1%
143
 
2.9%
119
 
2.4%
115
 
2.3%
112
 
2.2%
105
 
2.1%
102
 
2.0%
101
 
2.0%
Other values (387) 3579
71.8%
Latin
ValueCountFrequency (%)
S 31
21.1%
G 28
19.0%
C 7
 
4.8%
e 7
 
4.8%
R 7
 
4.8%
K 6
 
4.1%
n 5
 
3.4%
H 5
 
3.4%
U 4
 
2.7%
D 4
 
2.7%
Other values (22) 43
29.3%
Common
ValueCountFrequency (%)
301
42.9%
2 108
 
15.4%
( 78
 
11.1%
) 78
 
11.1%
5 56
 
8.0%
4 42
 
6.0%
. 15
 
2.1%
1 9
 
1.3%
0 5
 
0.7%
- 4
 
0.6%
Other values (4) 6
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4981
85.4%
ASCII 849
 
14.6%
None 3
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
301
35.5%
2 108
 
12.7%
( 78
 
9.2%
) 78
 
9.2%
5 56
 
6.6%
4 42
 
4.9%
S 31
 
3.7%
G 28
 
3.3%
. 15
 
1.8%
1 9
 
1.1%
Other values (36) 103
 
12.1%
Hangul
ValueCountFrequency (%)
279
 
5.6%
175
 
3.5%
154
 
3.1%
143
 
2.9%
119
 
2.4%
115
 
2.3%
112
 
2.2%
105
 
2.1%
102
 
2.0%
101
 
2.0%
Other values (386) 3576
71.8%
None
ValueCountFrequency (%)
3
100.0%
Distinct734
Distinct (%)96.8%
Missing0
Missing (%)0.0%
Memory size6.1 KiB
2023-12-11T02:02:38.951760image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length51
Median length47
Mean length24.253298
Min length1

Characters and Unicode

Total characters18384
Distinct characters241
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique716 ?
Unique (%)94.5%

Sample

1st row부산광역시 강서구 명지동 3400 부산명지 중흥S-클래스 프라디움
2nd row부산광역시 강서구 명지동 3604-2 부산 명지 제나우스 블루오션 오피스텔
3rd row부산광역시 강서구 명지동 3501-1
4th row부산광역시 강서구 명지동 3400 부산명지 중흥S-클래스 프라디움
5th row부산광역시 강서구 강동동 29-450
ValueCountFrequency (%)
부산광역시 754
20.0%
강서구 754
20.0%
명지동 171
 
4.5%
송정동 111
 
3.0%
대저2동 83
 
2.2%
대저1동 82
 
2.2%
1호 81
 
2.2%
50
 
1.3%
신호동 43
 
1.1%
2호 42
 
1.1%
Other values (893) 1590
42.3%
2023-12-11T02:02:40.252928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3445
18.7%
1 947
 
5.2%
827
 
4.5%
810
 
4.4%
808
 
4.4%
787
 
4.3%
775
 
4.2%
768
 
4.2%
765
 
4.2%
755
 
4.1%
Other values (231) 7697
41.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10802
58.8%
Decimal Number 3877
 
21.1%
Space Separator 3445
 
18.7%
Dash Punctuation 194
 
1.1%
Uppercase Letter 34
 
0.2%
Close Punctuation 9
 
< 0.1%
Open Punctuation 9
 
< 0.1%
Other Punctuation 8
 
< 0.1%
Lowercase Letter 4
 
< 0.1%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
827
 
7.7%
810
 
7.5%
808
 
7.5%
787
 
7.3%
775
 
7.2%
768
 
7.1%
765
 
7.1%
755
 
7.0%
755
 
7.0%
738
 
6.8%
Other values (203) 3014
27.9%
Decimal Number
ValueCountFrequency (%)
1 947
24.4%
2 543
14.0%
3 529
13.6%
5 369
 
9.5%
4 315
 
8.1%
6 277
 
7.1%
8 231
 
6.0%
7 228
 
5.9%
0 227
 
5.9%
9 211
 
5.4%
Uppercase Letter
ValueCountFrequency (%)
S 8
23.5%
A 6
17.6%
B 5
14.7%
L 4
11.8%
P 4
11.8%
D 3
 
8.8%
C 2
 
5.9%
H 1
 
2.9%
G 1
 
2.9%
Other Punctuation
ValueCountFrequency (%)
. 6
75.0%
· 2
 
25.0%
Lowercase Letter
ValueCountFrequency (%)
e 2
50.0%
i 2
50.0%
Space Separator
ValueCountFrequency (%)
3445
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 194
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 10802
58.8%
Common 7544
41.0%
Latin 38
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
827
 
7.7%
810
 
7.5%
808
 
7.5%
787
 
7.3%
775
 
7.2%
768
 
7.1%
765
 
7.1%
755
 
7.0%
755
 
7.0%
738
 
6.8%
Other values (203) 3014
27.9%
Common
ValueCountFrequency (%)
3445
45.7%
1 947
 
12.6%
2 543
 
7.2%
3 529
 
7.0%
5 369
 
4.9%
4 315
 
4.2%
6 277
 
3.7%
8 231
 
3.1%
7 228
 
3.0%
0 227
 
3.0%
Other values (7) 433
 
5.7%
Latin
ValueCountFrequency (%)
S 8
21.1%
A 6
15.8%
B 5
13.2%
L 4
10.5%
P 4
10.5%
D 3
 
7.9%
e 2
 
5.3%
C 2
 
5.3%
i 2
 
5.3%
H 1
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10802
58.8%
ASCII 7580
41.2%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3445
45.4%
1 947
 
12.5%
2 543
 
7.2%
3 529
 
7.0%
5 369
 
4.9%
4 315
 
4.2%
6 277
 
3.7%
8 231
 
3.0%
7 228
 
3.0%
0 227
 
3.0%
Other values (17) 469
 
6.2%
Hangul
ValueCountFrequency (%)
827
 
7.7%
810
 
7.5%
808
 
7.5%
787
 
7.3%
775
 
7.2%
768
 
7.1%
765
 
7.1%
755
 
7.0%
755
 
7.0%
738
 
6.8%
Other values (203) 3014
27.9%
None
ValueCountFrequency (%)
· 2
100.0%
Distinct677
Distinct (%)89.3%
Missing0
Missing (%)0.0%
Memory size6.1 KiB
2023-12-11T02:02:40.796685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length72
Median length57
Mean length29.146438
Min length1

Characters and Unicode

Total characters22093
Distinct characters281
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique674 ?
Unique (%)88.9%

Sample

1st row부산광역시 강서구 명지국제7로 133. 근린생활시설3동 120호 (명지동. 부산명지 중흥S-클래스 프라디움)
2nd row부산광역시 강서구 명지국제2로 16. 부산 명지 제나우스 블루오션 오피스텔 108.109호 (명지동)
3rd row부산광역시 강서구 명지국제5로136번길 2. 1층 (명지동)
4th row부산광역시 강서구 명지국제7로 133. 근린생활시설1동 110호 (명지동. 부산명지 중흥S-클래스 프라디움)
5th row부산광역시 강서구 낙동북로 100 (강동동)
ValueCountFrequency (%)
부산광역시 678
 
16.8%
강서구 678
 
16.8%
명지동 161
 
4.0%
1층 119
 
2.9%
송정동 104
 
2.6%
대저2동 66
 
1.6%
대저1동 57
 
1.4%
신호동 38
 
0.9%
화전동 32
 
0.8%
지사동 30
 
0.7%
Other values (898) 2076
51.4%
2023-12-11T02:02:41.821795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3581
 
16.2%
1 1072
 
4.9%
1064
 
4.8%
876
 
4.0%
744
 
3.4%
740
 
3.3%
728
 
3.3%
695
 
3.1%
695
 
3.1%
) 688
 
3.1%
Other values (271) 11210
50.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 12878
58.3%
Decimal Number 3736
 
16.9%
Space Separator 3581
 
16.2%
Close Punctuation 688
 
3.1%
Open Punctuation 688
 
3.1%
Other Punctuation 387
 
1.8%
Dash Punctuation 82
 
0.4%
Uppercase Letter 41
 
0.2%
Math Symbol 10
 
< 0.1%
Lowercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1064
 
8.3%
876
 
6.8%
744
 
5.8%
740
 
5.7%
728
 
5.7%
695
 
5.4%
695
 
5.4%
680
 
5.3%
679
 
5.3%
624
 
4.8%
Other values (243) 5353
41.6%
Decimal Number
ValueCountFrequency (%)
1 1072
28.7%
2 588
15.7%
3 365
 
9.8%
0 332
 
8.9%
4 280
 
7.5%
6 275
 
7.4%
5 251
 
6.7%
9 193
 
5.2%
8 192
 
5.1%
7 188
 
5.0%
Uppercase Letter
ValueCountFrequency (%)
A 12
29.3%
S 9
22.0%
B 8
19.5%
P 3
 
7.3%
C 3
 
7.3%
D 3
 
7.3%
R 1
 
2.4%
L 1
 
2.4%
H 1
 
2.4%
Other Punctuation
ValueCountFrequency (%)
. 382
98.7%
, 3
 
0.8%
· 2
 
0.5%
Space Separator
ValueCountFrequency (%)
3581
100.0%
Close Punctuation
ValueCountFrequency (%)
) 688
100.0%
Open Punctuation
ValueCountFrequency (%)
( 688
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 82
100.0%
Math Symbol
ValueCountFrequency (%)
~ 10
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 12878
58.3%
Common 9172
41.5%
Latin 43
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1064
 
8.3%
876
 
6.8%
744
 
5.8%
740
 
5.7%
728
 
5.7%
695
 
5.4%
695
 
5.4%
680
 
5.3%
679
 
5.3%
624
 
4.8%
Other values (243) 5353
41.6%
Common
ValueCountFrequency (%)
3581
39.0%
1 1072
 
11.7%
) 688
 
7.5%
( 688
 
7.5%
2 588
 
6.4%
. 382
 
4.2%
3 365
 
4.0%
0 332
 
3.6%
4 280
 
3.1%
6 275
 
3.0%
Other values (8) 921
 
10.0%
Latin
ValueCountFrequency (%)
A 12
27.9%
S 9
20.9%
B 8
18.6%
P 3
 
7.0%
C 3
 
7.0%
D 3
 
7.0%
e 2
 
4.7%
R 1
 
2.3%
L 1
 
2.3%
H 1
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 12878
58.3%
ASCII 9213
41.7%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3581
38.9%
1 1072
 
11.6%
) 688
 
7.5%
( 688
 
7.5%
2 588
 
6.4%
. 382
 
4.1%
3 365
 
4.0%
0 332
 
3.6%
4 280
 
3.0%
6 275
 
3.0%
Other values (17) 962
 
10.4%
Hangul
ValueCountFrequency (%)
1064
 
8.3%
876
 
6.8%
744
 
5.8%
740
 
5.7%
728
 
5.7%
695
 
5.4%
695
 
5.4%
680
 
5.3%
679
 
5.3%
624
 
4.8%
Other values (243) 5353
41.6%
None
ValueCountFrequency (%)
· 2
100.0%

Interactions

2023-12-11T02:02:36.325030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-11T02:02:36.567498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T02:02:36.714016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업소명업소지번주소업소도로명주소
01씨유 명지국제중흥점부산광역시 강서구 명지동 3400 부산명지 중흥S-클래스 프라디움부산광역시 강서구 명지국제7로 133. 근린생활시설3동 120호 (명지동. 부산명지 중흥S-클래스 프라디움)
12지에스(GS)25 명지제나우스점부산광역시 강서구 명지동 3604-2 부산 명지 제나우스 블루오션 오피스텔부산광역시 강서구 명지국제2로 16. 부산 명지 제나우스 블루오션 오피스텔 108.109호 (명지동)
23씨유 명지국제협성점부산광역시 강서구 명지동 3501-1부산광역시 강서구 명지국제5로136번길 2. 1층 (명지동)
34카페051 명지중흥부영점부산광역시 강서구 명지동 3400 부산명지 중흥S-클래스 프라디움부산광역시 강서구 명지국제7로 133. 근린생활시설1동 110호 (명지동. 부산명지 중흥S-클래스 프라디움)
45참마트(강동점)부산광역시 강서구 강동동 29-450부산광역시 강서구 낙동북로 100 (강동동)
56이마트24 부산명지행복점부산광역시 강서구 명지동 3253-13부산광역시 강서구 명지오션시티10로 139 (명지동)
67브루바틀 2호점부산광역시 강서구 명지동 3595-3 더샵 명지퍼스트월드 3단지부산광역시 강서구 명지국제2로 41. 판매시설동 1층 118호 (명지동. 더샵 명지퍼스트월드 3단지)
78빛과꿈터부산광역시 강서구 명지동 3313-1부산광역시 강서구 명지국제4로208번길 6. 102호 (명지동)
89씨유 뉴화전원룸점부산광역시 강서구 화전동 568-7부산광역시 강서구 화전산단5로132번길 14-1 (화전동)
910서원 탑마트부산광역시 강서구 대저1동 1491-43부산광역시 강서구 대저로 105-1 (대저1동)
연번업소명업소지번주소업소도로명주소
748749잡화점부산광역시 강서구 대저2동 5739-9호
749750부산광역시 강서구 대저2동 1241-1호
750751.부산광역시 강서구 대저2동 1902-4호
7517525지구대마트부산광역시 강서구 대저2동 794부산광역시 강서구 공항진입로42번길 54. 사서함 307-21호 (대저2동)
752753.부산광역시 강서구 대저1동 1061-1호
753754.부산광역시 강서구 대저1동 1435-1호
754755중리2구상회부산광역시 강서구 대저1동 1047-11호
755756.부산광역시 강서구 대저1동 1059-14호
7567572012-01-02부산광역시 강서구 대저1동 1289호
757758부산광역시 강서구 대저1동 671호