Overview

Dataset statistics

Number of variables5
Number of observations415
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory16.7 KiB
Average record size in memory41.3 B

Variable types

Numeric1
Categorical1
Text3

Dataset

Description부산광역시 수영구 담배소매인 지정 현황과 관련하여 관내 담배소매인으로 지정 된 업체의 명칭, 주소, 소매인 구분 등 정보를 제공합니다.
Author부산광역시 수영구
URLhttps://www.data.go.kr/data/15035515/fileData.do

Alerts

소매인구분 is highly imbalanced (57.4%)Imbalance
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-23 07:01:30.997177
Analysis finished2023-12-23 07:01:33.864546
Duration2.87 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct415
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean208
Minimum1
Maximum415
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.8 KiB
2023-12-23T07:01:34.360970image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile21.7
Q1104.5
median208
Q3311.5
95-th percentile394.3
Maximum415
Range414
Interquartile range (IQR)207

Descriptive statistics

Standard deviation119.94443
Coefficient of variation (CV)0.57665592
Kurtosis-1.2
Mean208
Median Absolute Deviation (MAD)104
Skewness0
Sum86320
Variance14386.667
MonotonicityStrictly increasing
2023-12-23T07:01:35.444282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
2 1
 
0.2%
285 1
 
0.2%
284 1
 
0.2%
283 1
 
0.2%
282 1
 
0.2%
281 1
 
0.2%
280 1
 
0.2%
279 1
 
0.2%
278 1
 
0.2%
Other values (405) 405
97.6%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
415 1
0.2%
414 1
0.2%
413 1
0.2%
412 1
0.2%
411 1
0.2%
410 1
0.2%
409 1
0.2%
408 1
0.2%
407 1
0.2%
406 1
0.2%

소매인구분
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size3.4 KiB
일반소매인
379 
구내소매인
 
36

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반소매인
2nd row일반소매인
3rd row일반소매인
4th row일반소매인
5th row일반소매인

Common Values

ValueCountFrequency (%)
일반소매인 379
91.3%
구내소매인 36
 
8.7%

Length

2023-12-23T07:01:36.332639image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-23T07:01:36.782330image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반소매인 379
91.3%
구내소매인 36
 
8.7%
Distinct400
Distinct (%)96.4%
Missing0
Missing (%)0.0%
Memory size3.4 KiB
2023-12-23T07:01:37.944541image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length15
Mean length8.5373494
Min length1

Characters and Unicode

Total characters3543
Distinct characters341
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique396 ?
Unique (%)95.4%

Sample

1st row성이 푸드
2nd row세븐일레븐 남천3호점
3rd row(주)코리아세븐 민락수변공원점
4th row씨유 광안타운점
5th rowCU망미우리점
ValueCountFrequency (%)
씨유 41
 
6.5%
세븐일레븐 35
 
5.6%
지에스(gs)25 25
 
4.0%
이마트24 22
 
3.5%
gs25 17
 
2.7%
없음 13
 
2.1%
주)코리아세븐 9
 
1.4%
남천점 4
 
0.6%
수영점 4
 
0.6%
주식회사 4
 
0.6%
Other values (425) 454
72.3%
2023-12-23T07:01:39.739879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
215
 
6.1%
215
 
6.1%
93
 
2.6%
2 90
 
2.5%
84
 
2.4%
83
 
2.3%
83
 
2.3%
83
 
2.3%
82
 
2.3%
78
 
2.2%
Other values (331) 2437
68.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2822
79.7%
Space Separator 215
 
6.1%
Decimal Number 195
 
5.5%
Uppercase Letter 177
 
5.0%
Open Punctuation 56
 
1.6%
Close Punctuation 56
 
1.6%
Lowercase Letter 18
 
0.5%
Dash Punctuation 2
 
0.1%
Other Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
215
 
7.6%
93
 
3.3%
84
 
3.0%
83
 
2.9%
83
 
2.9%
83
 
2.9%
82
 
2.9%
78
 
2.8%
70
 
2.5%
70
 
2.5%
Other values (289) 1881
66.7%
Uppercase Letter
ValueCountFrequency (%)
S 62
35.0%
G 54
30.5%
C 15
 
8.5%
U 10
 
5.6%
M 5
 
2.8%
B 4
 
2.3%
K 4
 
2.3%
A 4
 
2.3%
R 3
 
1.7%
E 3
 
1.7%
Other values (7) 13
 
7.3%
Lowercase Letter
ValueCountFrequency (%)
e 4
22.2%
o 4
22.2%
f 2
11.1%
p 2
11.1%
s 1
 
5.6%
m 1
 
5.6%
a 1
 
5.6%
h 1
 
5.6%
t 1
 
5.6%
u 1
 
5.6%
Decimal Number
ValueCountFrequency (%)
2 90
46.2%
5 62
31.8%
4 30
 
15.4%
1 5
 
2.6%
7 2
 
1.0%
3 2
 
1.0%
0 2
 
1.0%
6 1
 
0.5%
9 1
 
0.5%
Other Punctuation
ValueCountFrequency (%)
. 1
50.0%
& 1
50.0%
Space Separator
ValueCountFrequency (%)
215
100.0%
Open Punctuation
ValueCountFrequency (%)
( 56
100.0%
Close Punctuation
ValueCountFrequency (%)
) 56
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2822
79.7%
Common 526
 
14.8%
Latin 195
 
5.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
215
 
7.6%
93
 
3.3%
84
 
3.0%
83
 
2.9%
83
 
2.9%
83
 
2.9%
82
 
2.9%
78
 
2.8%
70
 
2.5%
70
 
2.5%
Other values (289) 1881
66.7%
Latin
ValueCountFrequency (%)
S 62
31.8%
G 54
27.7%
C 15
 
7.7%
U 10
 
5.1%
M 5
 
2.6%
e 4
 
2.1%
B 4
 
2.1%
K 4
 
2.1%
o 4
 
2.1%
A 4
 
2.1%
Other values (17) 29
14.9%
Common
ValueCountFrequency (%)
215
40.9%
2 90
17.1%
5 62
 
11.8%
( 56
 
10.6%
) 56
 
10.6%
4 30
 
5.7%
1 5
 
1.0%
- 2
 
0.4%
7 2
 
0.4%
3 2
 
0.4%
Other values (5) 6
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2822
79.7%
ASCII 721
 
20.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
215
 
7.6%
93
 
3.3%
84
 
3.0%
83
 
2.9%
83
 
2.9%
83
 
2.9%
82
 
2.9%
78
 
2.8%
70
 
2.5%
70
 
2.5%
Other values (289) 1881
66.7%
ASCII
ValueCountFrequency (%)
215
29.8%
2 90
12.5%
5 62
 
8.6%
S 62
 
8.6%
( 56
 
7.8%
) 56
 
7.8%
G 54
 
7.5%
4 30
 
4.2%
C 15
 
2.1%
U 10
 
1.4%
Other values (32) 71
 
9.8%
Distinct414
Distinct (%)99.8%
Missing0
Missing (%)0.0%
Memory size3.4 KiB
2023-12-23T07:01:40.630495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length44
Median length38
Mean length23.503614
Min length16

Characters and Unicode

Total characters9754
Distinct characters214
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique413 ?
Unique (%)99.5%

Sample

1st row부산광역시 수영구 민락동 33-5
2nd row부산광역시 수영구 남천동 6-2
3rd row부산광역시 수영구 민락동 110-47 씨플렉스
4th row부산광역시 수영구 광안동 166-4
5th row부산광역시 수영구 망미동 412-7
ValueCountFrequency (%)
부산광역시 415
20.5%
수영구 415
20.5%
광안동 154
 
7.6%
민락동 81
 
4.0%
망미동 56
 
2.8%
남천동 48
 
2.4%
수영동 44
 
2.2%
1호 26
 
1.3%
5호 11
 
0.5%
10
 
0.5%
Other values (568) 769
37.9%
2023-12-23T07:01:41.859336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1693
17.4%
602
 
6.2%
1 496
 
5.1%
463
 
4.7%
462
 
4.7%
436
 
4.5%
421
 
4.3%
421
 
4.3%
420
 
4.3%
417
 
4.3%
Other values (204) 3923
40.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5922
60.7%
Decimal Number 1925
 
19.7%
Space Separator 1693
 
17.4%
Dash Punctuation 197
 
2.0%
Uppercase Letter 8
 
0.1%
Lowercase Letter 6
 
0.1%
Other Punctuation 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
602
 
10.2%
463
 
7.8%
462
 
7.8%
436
 
7.4%
421
 
7.1%
421
 
7.1%
420
 
7.1%
417
 
7.0%
417
 
7.0%
257
 
4.3%
Other values (183) 1606
27.1%
Decimal Number
ValueCountFrequency (%)
1 496
25.8%
2 195
 
10.1%
4 192
 
10.0%
3 189
 
9.8%
5 175
 
9.1%
7 163
 
8.5%
0 154
 
8.0%
8 136
 
7.1%
6 119
 
6.2%
9 106
 
5.5%
Uppercase Letter
ValueCountFrequency (%)
A 2
25.0%
B 2
25.0%
S 1
12.5%
K 1
12.5%
G 1
12.5%
F 1
12.5%
Other Punctuation
ValueCountFrequency (%)
. 2
66.7%
, 1
33.3%
Space Separator
ValueCountFrequency (%)
1693
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 197
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5922
60.7%
Common 3818
39.1%
Latin 14
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
602
 
10.2%
463
 
7.8%
462
 
7.8%
436
 
7.4%
421
 
7.1%
421
 
7.1%
420
 
7.1%
417
 
7.0%
417
 
7.0%
257
 
4.3%
Other values (183) 1606
27.1%
Common
ValueCountFrequency (%)
1693
44.3%
1 496
 
13.0%
- 197
 
5.2%
2 195
 
5.1%
4 192
 
5.0%
3 189
 
5.0%
5 175
 
4.6%
7 163
 
4.3%
0 154
 
4.0%
8 136
 
3.6%
Other values (4) 228
 
6.0%
Latin
ValueCountFrequency (%)
e 6
42.9%
A 2
 
14.3%
B 2
 
14.3%
S 1
 
7.1%
K 1
 
7.1%
G 1
 
7.1%
F 1
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5922
60.7%
ASCII 3832
39.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1693
44.2%
1 496
 
12.9%
- 197
 
5.1%
2 195
 
5.1%
4 192
 
5.0%
3 189
 
4.9%
5 175
 
4.6%
7 163
 
4.3%
0 154
 
4.0%
8 136
 
3.5%
Other values (11) 242
 
6.3%
Hangul
ValueCountFrequency (%)
602
 
10.2%
463
 
7.8%
462
 
7.8%
436
 
7.4%
421
 
7.1%
421
 
7.1%
420
 
7.1%
417
 
7.0%
417
 
7.0%
257
 
4.3%
Other values (183) 1606
27.1%
Distinct408
Distinct (%)98.3%
Missing0
Missing (%)0.0%
Memory size3.4 KiB
2023-12-23T07:01:42.576861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length60
Median length51
Mean length30.250602
Min length1

Characters and Unicode

Total characters12554
Distinct characters235
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique405 ?
Unique (%)97.6%

Sample

1st row부산광역시 수영구 민락본동로27번길 67. 1층 (민락동)
2nd row부산광역시 수영구 광남로48번길 20 (남천동)
3rd row부산광역시 수영구 광안해변로358번길 14. 씨플렉스 (민락동)
4th row부산광역시 수영구 수영로554번길 22. 1층 (광안동)
5th row부산광역시 수영구 과정로56번길 7. 1층 (망미동)
ValueCountFrequency (%)
부산광역시 409
 
17.0%
수영구 409
 
17.0%
광안동 165
 
6.9%
민락동 78
 
3.2%
1층 76
 
3.2%
망미동 63
 
2.6%
남천동 54
 
2.2%
수영로 48
 
2.0%
수영동 42
 
1.7%
101호 28
 
1.2%
Other values (590) 1029
42.9%
2023-12-23T07:01:44.104561image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2040
 
16.2%
703
 
5.6%
605
 
4.8%
569
 
4.5%
1 487
 
3.9%
464
 
3.7%
426
 
3.4%
424
 
3.4%
415
 
3.3%
414
 
3.3%
Other values (225) 6007
47.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7424
59.1%
Space Separator 2040
 
16.2%
Decimal Number 1951
 
15.5%
Open Punctuation 407
 
3.2%
Close Punctuation 407
 
3.2%
Other Punctuation 265
 
2.1%
Dash Punctuation 47
 
0.4%
Uppercase Letter 6
 
< 0.1%
Lowercase Letter 5
 
< 0.1%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
703
 
9.5%
605
 
8.1%
569
 
7.7%
464
 
6.2%
426
 
5.7%
424
 
5.7%
415
 
5.6%
414
 
5.6%
412
 
5.5%
412
 
5.5%
Other values (202) 2580
34.8%
Decimal Number
ValueCountFrequency (%)
1 487
25.0%
2 250
12.8%
0 207
10.6%
5 180
 
9.2%
3 168
 
8.6%
6 166
 
8.5%
4 156
 
8.0%
7 139
 
7.1%
9 101
 
5.2%
8 97
 
5.0%
Uppercase Letter
ValueCountFrequency (%)
B 3
50.0%
K 1
 
16.7%
S 1
 
16.7%
A 1
 
16.7%
Other Punctuation
ValueCountFrequency (%)
. 262
98.9%
· 3
 
1.1%
Space Separator
ValueCountFrequency (%)
2040
100.0%
Open Punctuation
ValueCountFrequency (%)
( 407
100.0%
Close Punctuation
ValueCountFrequency (%)
) 407
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 47
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 5
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7424
59.1%
Common 5118
40.8%
Latin 12
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
703
 
9.5%
605
 
8.1%
569
 
7.7%
464
 
6.2%
426
 
5.7%
424
 
5.7%
415
 
5.6%
414
 
5.6%
412
 
5.5%
412
 
5.5%
Other values (202) 2580
34.8%
Common
ValueCountFrequency (%)
2040
39.9%
1 487
 
9.5%
( 407
 
8.0%
) 407
 
8.0%
. 262
 
5.1%
2 250
 
4.9%
0 207
 
4.0%
5 180
 
3.5%
3 168
 
3.3%
6 166
 
3.2%
Other values (7) 544
 
10.6%
Latin
ValueCountFrequency (%)
e 5
41.7%
B 3
25.0%
K 1
 
8.3%
S 1
 
8.3%
A 1
 
8.3%
1
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7424
59.1%
ASCII 5126
40.8%
None 3
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2040
39.8%
1 487
 
9.5%
( 407
 
7.9%
) 407
 
7.9%
. 262
 
5.1%
2 250
 
4.9%
0 207
 
4.0%
5 180
 
3.5%
3 168
 
3.3%
6 166
 
3.2%
Other values (11) 552
 
10.8%
Hangul
ValueCountFrequency (%)
703
 
9.5%
605
 
8.1%
569
 
7.7%
464
 
6.2%
426
 
5.7%
424
 
5.7%
415
 
5.6%
414
 
5.6%
412
 
5.5%
412
 
5.5%
Other values (202) 2580
34.8%
None
ValueCountFrequency (%)
· 3
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%

Interactions

2023-12-23T07:01:32.415617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-23T07:01:44.409905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번소매인구분
연번1.0000.445
소매인구분0.4451.000
2023-12-23T07:01:44.667046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번소매인구분
연번1.0000.339
소매인구분0.3391.000

Missing values

2023-12-23T07:01:33.119499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-23T07:01:33.750459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번소매인구분업소명업소지번주소업소도로명주소
01일반소매인성이 푸드부산광역시 수영구 민락동 33-5부산광역시 수영구 민락본동로27번길 67. 1층 (민락동)
12일반소매인세븐일레븐 남천3호점부산광역시 수영구 남천동 6-2부산광역시 수영구 광남로48번길 20 (남천동)
23일반소매인(주)코리아세븐 민락수변공원점부산광역시 수영구 민락동 110-47 씨플렉스부산광역시 수영구 광안해변로358번길 14. 씨플렉스 (민락동)
34일반소매인씨유 광안타운점부산광역시 수영구 광안동 166-4부산광역시 수영구 수영로554번길 22. 1층 (광안동)
45일반소매인CU망미우리점부산광역시 수영구 망미동 412-7부산광역시 수영구 과정로56번길 7. 1층 (망미동)
56일반소매인부산수영지역자활센터 GS25광안오션테라스점부산광역시 수영구 민락동 783 e편한세상 오션테라스 3단지부산광역시 수영구 광안해변로326번길 32. 112호 (민락동. e편한세상 오션테라스 3단지)
67일반소매인이마트24센텀비치푸르지오점부산광역시 수영구 민락동 108-1 센텀비치푸르지오부산광역시 수영구 광안해변로 420. 센텀비치푸르지오 상가동 209.210호 (민락동)
78일반소매인이마트24 대남교차로점부산광역시 수영구 남천동 353-4부산광역시 수영구 수영로 379. 1층 101호 (남천동)
89일반소매인씨유 망미블루점부산광역시 수영구 망미동 405-24부산광역시 수영구 과정로42번길 55. 1층 (망미동)
910일반소매인세븐일레븐 부산수영에이스점부산광역시 수영구 민락동 218-1부산광역시 수영구 광안로61번길 73 (민락동)
연번소매인구분업소명업소지번주소업소도로명주소
405406일반소매인없음부산광역시 수영구 망미동 180-16호부산광역시 수영구 구락로 105 (망미동)
406407구내소매인대성제강부산광역시 수영구 광안동 172-1번지부산광역시 수영구 수영로540번길 57 (광안동)
407408일반소매인대원슈퍼마켓부산광역시 수영구 광안동 373-15번지부산광역시 수영구 수영로 524 (광안동)
408409일반소매인하니미용실부산광역시 수영구 광안동 178-31번지부산광역시 수영구 남천바다로15번길 71 (광안동)
409410일반소매인없음부산광역시 수영구 광안동 121-23번지부산광역시 수영구 수영로606번길 55-1 (광안동)
410411일반소매인없음부산광역시 수영구 수영동 452-5호부산광역시 수영구 연수로401번길 25 (수영동)
411412일반소매인없음부산광역시 수영구 수영동 731호
412413일반소매인없음부산광역시 수영구 수영동 490-19호부산광역시 수영구 구락로43번길 16 (수영동)
413414일반소매인없음부산광역시 수영구 광안동 99-9번지부산광역시 수영구 수영로642번길 9 (광안동)
414415일반소매인진양상회부산광역시 수영구 망미동 266-32호부산광역시 수영구 수미로25번길 14 (망미동)