Overview

Dataset statistics

Number of variables5
Number of observations84
Missing cells50
Missing cells (%)11.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.4 KiB
Average record size in memory41.6 B

Variable types

Categorical1
Text3
DateTime1

Dataset

Description부산광역시서구_프랜차이즈카페_20221116
Author부산광역시 서구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15094648

Alerts

데이터기준일 has constant value ""Constant
소재지전화 has 50 (59.5%) missing valuesMissing
업소명 has unique valuesUnique

Reproduction

Analysis started2023-12-10 17:09:47.527144
Analysis finished2023-12-10 17:09:48.113256
Duration0.59 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종
Categorical

Distinct2
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Memory size804.0 B
휴게음식점
71 
일반음식점
13 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row휴게음식점
2nd row휴게음식점
3rd row휴게음식점
4th row휴게음식점
5th row휴게음식점

Common Values

ValueCountFrequency (%)
휴게음식점 71
84.5%
일반음식점 13
 
15.5%

Length

2023-12-11T02:09:48.208406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T02:09:48.366480image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
휴게음식점 71
84.5%
일반음식점 13
 
15.5%

업소명
Text

UNIQUE 

Distinct84
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size804.0 B
2023-12-11T02:09:48.686180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length14
Mean length10.702381
Min length3

Characters and Unicode

Total characters899
Distinct characters140
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique84 ?
Unique (%)100.0%

Sample

1st row투썸플레이스 송도해수욕장점
2nd row투썸플레이스 고신대복음병원점
3rd row투썸플레이스 부산서대신점
4th row까페베네 송도해수욕장점
5th row이디야 부민동아대점
ValueCountFrequency (%)
컴포즈 5
 
3.1%
동대신점 4
 
2.5%
플루800 4
 
2.5%
블루샥 4
 
2.5%
투썸플레이스 3
 
1.9%
대신푸르지오점 3
 
1.9%
공차 3
 
1.9%
부산송도점 3
 
1.9%
서대신점 3
 
1.9%
더벤티 3
 
1.9%
Other values (95) 124
78.0%
2023-12-11T02:09:49.240673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
76
 
8.5%
76
 
8.5%
46
 
5.1%
34
 
3.8%
34
 
3.8%
29
 
3.2%
28
 
3.1%
25
 
2.8%
24
 
2.7%
24
 
2.7%
Other values (130) 503
56.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 771
85.8%
Space Separator 76
 
8.5%
Decimal Number 28
 
3.1%
Uppercase Letter 8
 
0.9%
Open Punctuation 6
 
0.7%
Close Punctuation 6
 
0.7%
Lowercase Letter 4
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
76
 
9.9%
46
 
6.0%
34
 
4.4%
34
 
4.4%
29
 
3.8%
28
 
3.6%
25
 
3.2%
24
 
3.1%
24
 
3.1%
18
 
2.3%
Other values (111) 433
56.2%
Decimal Number
ValueCountFrequency (%)
0 13
46.4%
8 5
 
17.9%
1 4
 
14.3%
5 4
 
14.3%
9 1
 
3.6%
2 1
 
3.6%
Uppercase Letter
ValueCountFrequency (%)
T 2
25.0%
E 2
25.0%
L 1
12.5%
H 1
12.5%
I 1
12.5%
R 1
12.5%
Lowercase Letter
ValueCountFrequency (%)
e 1
25.0%
a 1
25.0%
f 1
25.0%
c 1
25.0%
Space Separator
ValueCountFrequency (%)
76
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 771
85.8%
Common 116
 
12.9%
Latin 12
 
1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
76
 
9.9%
46
 
6.0%
34
 
4.4%
34
 
4.4%
29
 
3.8%
28
 
3.6%
25
 
3.2%
24
 
3.1%
24
 
3.1%
18
 
2.3%
Other values (111) 433
56.2%
Latin
ValueCountFrequency (%)
T 2
16.7%
E 2
16.7%
e 1
8.3%
a 1
8.3%
f 1
8.3%
c 1
8.3%
L 1
8.3%
H 1
8.3%
I 1
8.3%
R 1
8.3%
Common
ValueCountFrequency (%)
76
65.5%
0 13
 
11.2%
( 6
 
5.2%
) 6
 
5.2%
8 5
 
4.3%
1 4
 
3.4%
5 4
 
3.4%
9 1
 
0.9%
2 1
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 771
85.8%
ASCII 128
 
14.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
76
 
9.9%
46
 
6.0%
34
 
4.4%
34
 
4.4%
29
 
3.8%
28
 
3.6%
25
 
3.2%
24
 
3.1%
24
 
3.1%
18
 
2.3%
Other values (111) 433
56.2%
ASCII
ValueCountFrequency (%)
76
59.4%
0 13
 
10.2%
( 6
 
4.7%
) 6
 
4.7%
8 5
 
3.9%
1 4
 
3.1%
5 4
 
3.1%
T 2
 
1.6%
E 2
 
1.6%
9 1
 
0.8%
Other values (9) 9
 
7.0%
Distinct83
Distinct (%)98.8%
Missing0
Missing (%)0.0%
Memory size804.0 B
2023-12-11T02:09:49.607909image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length58
Median length49
Mean length34.940476
Min length23

Characters and Unicode

Total characters2935
Distinct characters139
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique82 ?
Unique (%)97.6%

Sample

1st row부산광역시 서구 송도해변로 117 (암남동)
2nd row부산광역시 서구 감천로 255, 2층 (암남동)
3rd row부산광역시 서구 구덕로321번길 11, 1층 (서대신동2가)
4th row부산광역시 서구 암남공원로 39, 301동 103호 (암남동, 송동풍림아이원)
5th row부산광역시 서구 임시수도기념로 9 (부민동2가)
ValueCountFrequency (%)
부산광역시 84
 
15.1%
서구 84
 
15.1%
1층 38
 
6.8%
암남동 20
 
3.6%
구덕로 18
 
3.2%
송도해변로 13
 
2.3%
동대신동3가 13
 
2.3%
아미동2가 7
 
1.3%
101호 7
 
1.3%
서대신동3가 6
 
1.1%
Other values (180) 267
47.9%
2023-12-11T02:09:50.346760image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
473
 
16.1%
1 187
 
6.4%
122
 
4.2%
118
 
4.0%
112
 
3.8%
99
 
3.4%
91
 
3.1%
, 91
 
3.1%
2 89
 
3.0%
88
 
3.0%
Other values (129) 1465
49.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1660
56.6%
Decimal Number 515
 
17.5%
Space Separator 473
 
16.1%
Other Punctuation 91
 
3.1%
Close Punctuation 84
 
2.9%
Open Punctuation 84
 
2.9%
Dash Punctuation 21
 
0.7%
Uppercase Letter 6
 
0.2%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
122
 
7.3%
118
 
7.1%
112
 
6.7%
99
 
6.0%
91
 
5.5%
88
 
5.3%
85
 
5.1%
84
 
5.1%
84
 
5.1%
65
 
3.9%
Other values (110) 712
42.9%
Decimal Number
ValueCountFrequency (%)
1 187
36.3%
2 89
17.3%
3 71
 
13.8%
0 43
 
8.3%
9 26
 
5.0%
5 23
 
4.5%
8 23
 
4.5%
4 23
 
4.5%
7 20
 
3.9%
6 10
 
1.9%
Uppercase Letter
ValueCountFrequency (%)
B 3
50.0%
A 2
33.3%
R 1
 
16.7%
Space Separator
ValueCountFrequency (%)
473
100.0%
Other Punctuation
ValueCountFrequency (%)
, 91
100.0%
Close Punctuation
ValueCountFrequency (%)
) 84
100.0%
Open Punctuation
ValueCountFrequency (%)
( 84
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 21
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1660
56.6%
Common 1269
43.2%
Latin 6
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
122
 
7.3%
118
 
7.1%
112
 
6.7%
99
 
6.0%
91
 
5.5%
88
 
5.3%
85
 
5.1%
84
 
5.1%
84
 
5.1%
65
 
3.9%
Other values (110) 712
42.9%
Common
ValueCountFrequency (%)
473
37.3%
1 187
 
14.7%
, 91
 
7.2%
2 89
 
7.0%
) 84
 
6.6%
( 84
 
6.6%
3 71
 
5.6%
0 43
 
3.4%
9 26
 
2.0%
5 23
 
1.8%
Other values (6) 98
 
7.7%
Latin
ValueCountFrequency (%)
B 3
50.0%
A 2
33.3%
R 1
 
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1660
56.6%
ASCII 1275
43.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
473
37.1%
1 187
 
14.7%
, 91
 
7.1%
2 89
 
7.0%
) 84
 
6.6%
( 84
 
6.6%
3 71
 
5.6%
0 43
 
3.4%
9 26
 
2.0%
5 23
 
1.8%
Other values (9) 104
 
8.2%
Hangul
ValueCountFrequency (%)
122
 
7.3%
118
 
7.1%
112
 
6.7%
99
 
6.0%
91
 
5.5%
88
 
5.3%
85
 
5.1%
84
 
5.1%
84
 
5.1%
65
 
3.9%
Other values (110) 712
42.9%

소재지전화
Text

MISSING 

Distinct34
Distinct (%)100.0%
Missing50
Missing (%)59.5%
Memory size804.0 B
2023-12-11T02:09:50.652252image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length14
Mean length13.823529
Min length12

Characters and Unicode

Total characters470
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique34 ?
Unique (%)100.0%

Sample

1st row051 -247 -2388
2nd row051 -231 -9330
3rd row051 -257 -8111
4th row0507-1480-1274
5th row02 -3015-1100
ValueCountFrequency (%)
051 27
32.9%
231 3
 
3.7%
243 2
 
2.4%
2989 2
 
2.4%
247 2
 
2.4%
254 2
 
2.4%
2388 1
 
1.2%
1454 1
 
1.2%
051-231-7544 1
 
1.2%
255 1
 
1.2%
Other values (40) 40
48.8%
2023-12-11T02:09:51.078091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 68
14.5%
5 60
12.8%
1 60
12.8%
58
12.3%
0 53
11.3%
2 41
8.7%
4 30
6.4%
3 27
 
5.7%
7 23
 
4.9%
8 21
 
4.5%
Other values (2) 29
6.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 344
73.2%
Dash Punctuation 68
 
14.5%
Space Separator 58
 
12.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 60
17.4%
1 60
17.4%
0 53
15.4%
2 41
11.9%
4 30
8.7%
3 27
7.8%
7 23
 
6.7%
8 21
 
6.1%
6 16
 
4.7%
9 13
 
3.8%
Dash Punctuation
ValueCountFrequency (%)
- 68
100.0%
Space Separator
ValueCountFrequency (%)
58
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 470
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 68
14.5%
5 60
12.8%
1 60
12.8%
58
12.3%
0 53
11.3%
2 41
8.7%
4 30
6.4%
3 27
 
5.7%
7 23
 
4.9%
8 21
 
4.5%
Other values (2) 29
6.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 470
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 68
14.5%
5 60
12.8%
1 60
12.8%
58
12.3%
0 53
11.3%
2 41
8.7%
4 30
6.4%
3 27
 
5.7%
7 23
 
4.9%
8 21
 
4.5%
Other values (2) 29
6.2%

데이터기준일
Date

CONSTANT 

Distinct1
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size804.0 B
Minimum2022-11-16 00:00:00
Maximum2022-11-16 00:00:00
2023-12-11T02:09:51.219696image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:09:51.338176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Correlations

2023-12-11T02:09:51.450599image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종업소명소재지(도로명)소재지전화
업종1.0001.0001.0001.000
업소명1.0001.0001.0001.000
소재지(도로명)1.0001.0001.0001.000
소재지전화1.0001.0001.0001.000

Missing values

2023-12-11T02:09:47.942547image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T02:09:48.058243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종업소명소재지(도로명)소재지전화데이터기준일
0휴게음식점투썸플레이스 송도해수욕장점부산광역시 서구 송도해변로 117 (암남동)051 -247 -23882022-11-16
1휴게음식점투썸플레이스 고신대복음병원점부산광역시 서구 감천로 255, 2층 (암남동)<NA>2022-11-16
2휴게음식점투썸플레이스 부산서대신점부산광역시 서구 구덕로321번길 11, 1층 (서대신동2가)<NA>2022-11-16
3휴게음식점까페베네 송도해수욕장점부산광역시 서구 암남공원로 39, 301동 103호 (암남동, 송동풍림아이원)051 -231 -93302022-11-16
4휴게음식점이디야 부민동아대점부산광역시 서구 임시수도기념로 9 (부민동2가)<NA>2022-11-16
5휴게음식점이디야부산광역시 서구 구덕로 293, 1층 103호 (서대신동2가, 희망센츄럴타운)051 -257 -81112022-11-16
6휴게음식점(주)이디야 부산송도해상케이블카점부산광역시 서구 송도해변로 171, 3층 (암남동)<NA>2022-11-16
7휴게음식점이디야부산충무동점부산광역시 서구 충무대로 277, 1층 101호 (충무동1가, 충무 에코펠리스2차)<NA>2022-11-16
8휴게음식점이디야 부산대병원점부산광역시 서구 까치고개로 195 (아미동2가)0507-1480-12742022-11-16
9휴게음식점스타벅스부산동대신역점부산광역시 서구 구덕로322번길 7 (동대신동3가)02 -3015-11002022-11-16
업종업소명소재지(도로명)소재지전화데이터기준일
74휴게음식점블루샥 커피 부산송도점부산광역시 서구 송도해변로 97, 베스트웨스턴플러스 부산송도호텔 1층 103호 (암남동)<NA>2022-11-16
75휴게음식점블루샥 부산대학교병원점부산광역시 서구 구덕로185번길 32-5, 1층 (아미동2가)<NA>2022-11-16
76휴게음식점블루샥 서구청점부산광역시 서구 구덕로 115-2, 1층 (충무동1가)<NA>2022-11-16
77휴게음식점하이오부산광역시 서구 동대로19번길 32, 301동 102호 (동대신동3가, 브라운스톤 하이포레)051 -242 -31772022-11-16
78휴게음식점하이오커피 서대신점부산광역시 서구 부용로 14, 1층 일부호 (서대신동1가)<NA>2022-11-16
79휴게음식점달리는커피 서대신점부산광역시 서구 망양로 16-1, 1층 (서대신동3가)051- 207-15442022-11-16
80휴게음식점어랏커피 동대신점부산광역시 서구 대영로73번길 15-1, 삼익아파트 101호 (동대신동2가)<NA>2022-11-16
81휴게음식점김준호의 대단한커피 부산토성점부산광역시 서구 구덕로 지하 170 (토성동3가)<NA>2022-11-16
82휴게음식점히스피 충무점부산광역시 서구 구덕로 114-1 (충무동1가)070-4833-71082022-11-16
83휴게음식점아덴블랑제리 부산송도점부산광역시 서구 송도해변로 192, 판매시설동 RB동 108호 (암남동, 송도힐스테이트이진베이시티아파트)<NA>2022-11-16