Overview

Dataset statistics

Number of variables5
Number of observations324
Missing cells217
Missing cells (%)13.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory12.8 KiB
Average record size in memory40.4 B

Variable types

Categorical2
Text3

Dataset

Description부산광역시강서구_카페현황_20221230
Author부산광역시 강서구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15111562

Alerts

구분 has constant value ""Constant
업종명 has constant value ""Constant
소재지전화 has 216 (66.7%) missing valuesMissing

Reproduction

Analysis started2023-12-10 16:06:42.884331
Analysis finished2023-12-10 16:06:43.739467
Duration0.86 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.7 KiB
휴게음식점
324 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row휴게음식점
2nd row휴게음식점
3rd row휴게음식점
4th row휴게음식점
5th row휴게음식점

Common Values

ValueCountFrequency (%)
휴게음식점 324
100.0%

Length

2023-12-11T01:06:43.804535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:06:43.906163image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
휴게음식점 324
100.0%

업종명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.7 KiB
커피숍
324 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row커피숍
2nd row커피숍
3rd row커피숍
4th row커피숍
5th row커피숍

Common Values

ValueCountFrequency (%)
커피숍 324
100.0%

Length

2023-12-11T01:06:44.013861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:06:44.110842image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
커피숍 324
100.0%
Distinct321
Distinct (%)99.1%
Missing0
Missing (%)0.0%
Memory size2.7 KiB
2023-12-11T01:06:44.339283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length19
Mean length9.1697531
Min length2

Characters and Unicode

Total characters2971
Distinct characters379
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique318 ?
Unique (%)98.1%

Sample

1st row요거프레소 김해공항국내선점
2nd row김해공항 설빙
3rd row(주)푸르웰 가덕해양Park커피점
4th row삼성웰스토리(주)전기부산웰스토리카페
5th row스타벅스 명지오션시티점
ValueCountFrequency (%)
명지국제신도시점 13
 
2.6%
컴포즈커피 11
 
2.2%
하삼동커피 8
 
1.6%
명지점 7
 
1.4%
이디야커피 7
 
1.4%
카페 6
 
1.2%
부산명지점 6
 
1.2%
텐퍼센트 5
 
1.0%
명지오션점 5
 
1.0%
스타벅스 5
 
1.0%
Other values (358) 426
85.4%
2023-12-11T01:06:44.936379image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
175
 
5.9%
141
 
4.7%
105
 
3.5%
96
 
3.2%
96
 
3.2%
80
 
2.7%
71
 
2.4%
70
 
2.4%
66
 
2.2%
64
 
2.2%
Other values (369) 2007
67.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2396
80.6%
Space Separator 175
 
5.9%
Lowercase Letter 140
 
4.7%
Uppercase Letter 83
 
2.8%
Decimal Number 70
 
2.4%
Close Punctuation 51
 
1.7%
Open Punctuation 51
 
1.7%
Other Punctuation 4
 
0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
141
 
5.9%
105
 
4.4%
96
 
4.0%
96
 
4.0%
80
 
3.3%
71
 
3.0%
70
 
2.9%
66
 
2.8%
64
 
2.7%
44
 
1.8%
Other values (310) 1563
65.2%
Lowercase Letter
ValueCountFrequency (%)
e 24
17.1%
a 17
12.1%
r 13
 
9.3%
o 11
 
7.9%
c 9
 
6.4%
s 8
 
5.7%
t 8
 
5.7%
l 6
 
4.3%
i 5
 
3.6%
f 5
 
3.6%
Other values (11) 34
24.3%
Uppercase Letter
ValueCountFrequency (%)
A 8
 
9.6%
R 8
 
9.6%
T 7
 
8.4%
E 7
 
8.4%
F 6
 
7.2%
C 5
 
6.0%
O 5
 
6.0%
B 4
 
4.8%
I 4
 
4.8%
P 4
 
4.8%
Other values (11) 25
30.1%
Decimal Number
ValueCountFrequency (%)
1 16
22.9%
5 12
17.1%
2 10
14.3%
0 9
12.9%
9 7
10.0%
3 5
 
7.1%
4 5
 
7.1%
6 4
 
5.7%
7 1
 
1.4%
8 1
 
1.4%
Other Punctuation
ValueCountFrequency (%)
, 2
50.0%
' 1
25.0%
& 1
25.0%
Space Separator
ValueCountFrequency (%)
175
100.0%
Close Punctuation
ValueCountFrequency (%)
) 51
100.0%
Open Punctuation
ValueCountFrequency (%)
( 51
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2396
80.6%
Common 352
 
11.8%
Latin 223
 
7.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
141
 
5.9%
105
 
4.4%
96
 
4.0%
96
 
4.0%
80
 
3.3%
71
 
3.0%
70
 
2.9%
66
 
2.8%
64
 
2.7%
44
 
1.8%
Other values (310) 1563
65.2%
Latin
ValueCountFrequency (%)
e 24
 
10.8%
a 17
 
7.6%
r 13
 
5.8%
o 11
 
4.9%
c 9
 
4.0%
s 8
 
3.6%
A 8
 
3.6%
R 8
 
3.6%
t 8
 
3.6%
T 7
 
3.1%
Other values (32) 110
49.3%
Common
ValueCountFrequency (%)
175
49.7%
) 51
 
14.5%
( 51
 
14.5%
1 16
 
4.5%
5 12
 
3.4%
2 10
 
2.8%
0 9
 
2.6%
9 7
 
2.0%
3 5
 
1.4%
4 5
 
1.4%
Other values (7) 11
 
3.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2396
80.6%
ASCII 575
 
19.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
175
30.4%
) 51
 
8.9%
( 51
 
8.9%
e 24
 
4.2%
a 17
 
3.0%
1 16
 
2.8%
r 13
 
2.3%
5 12
 
2.1%
o 11
 
1.9%
2 10
 
1.7%
Other values (49) 195
33.9%
Hangul
ValueCountFrequency (%)
141
 
5.9%
105
 
4.4%
96
 
4.0%
96
 
4.0%
80
 
3.3%
71
 
3.0%
70
 
2.9%
66
 
2.8%
64
 
2.7%
44
 
1.8%
Other values (310) 1563
65.2%
Distinct321
Distinct (%)99.4%
Missing1
Missing (%)0.3%
Memory size2.7 KiB
2023-12-11T01:06:45.300177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length118
Median length54
Mean length39.074303
Min length24

Characters and Unicode

Total characters12621
Distinct characters254
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique319 ?
Unique (%)98.8%

Sample

1st row부산광역시 강서구 공항진입로 108 (대저2동,국내선2층)
2nd row부산광역시 강서구 공항진입로 108 (대저2동,국제선2층)
3rd row부산광역시 강서구 거가대로 2571 (천성동)
4th row부산광역시 강서구 녹산산업중로 333, 14호 (송정동)
5th row부산광역시 강서구 명지오션시티11로 66, 1,2층 104,202호 (명지동, 지타워)
ValueCountFrequency (%)
부산광역시 323
 
13.7%
강서구 323
 
13.7%
명지동 181
 
7.7%
1층 138
 
5.9%
일부호 70
 
3.0%
일부 26
 
1.1%
대저2동 23
 
1.0%
신호동 21
 
0.9%
대저1동 21
 
0.9%
명지국제8로 20
 
0.8%
Other values (573) 1209
51.3%
2023-12-11T01:06:45.880412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2032
 
16.1%
1 729
 
5.8%
438
 
3.5%
437
 
3.5%
437
 
3.5%
, 414
 
3.3%
404
 
3.2%
397
 
3.1%
393
 
3.1%
339
 
2.7%
Other values (244) 6601
52.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7149
56.6%
Decimal Number 2242
 
17.8%
Space Separator 2032
 
16.1%
Other Punctuation 414
 
3.3%
Close Punctuation 325
 
2.6%
Open Punctuation 325
 
2.6%
Dash Punctuation 81
 
0.6%
Uppercase Letter 38
 
0.3%
Lowercase Letter 9
 
0.1%
Math Symbol 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
438
 
6.1%
437
 
6.1%
437
 
6.1%
404
 
5.7%
397
 
5.6%
393
 
5.5%
339
 
4.7%
327
 
4.6%
326
 
4.6%
324
 
4.5%
Other values (216) 3327
46.5%
Uppercase Letter
ValueCountFrequency (%)
B 10
26.3%
S 9
23.7%
C 6
15.8%
A 5
13.2%
R 2
 
5.3%
O 1
 
2.6%
W 1
 
2.6%
E 1
 
2.6%
N 1
 
2.6%
T 1
 
2.6%
Decimal Number
ValueCountFrequency (%)
1 729
32.5%
2 319
14.2%
0 234
 
10.4%
3 212
 
9.5%
4 163
 
7.3%
5 138
 
6.2%
8 132
 
5.9%
6 130
 
5.8%
7 98
 
4.4%
9 87
 
3.9%
Space Separator
ValueCountFrequency (%)
2032
100.0%
Other Punctuation
ValueCountFrequency (%)
, 414
100.0%
Close Punctuation
ValueCountFrequency (%)
) 325
100.0%
Open Punctuation
ValueCountFrequency (%)
( 325
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 81
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 9
100.0%
Math Symbol
ValueCountFrequency (%)
~ 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7149
56.6%
Common 5425
43.0%
Latin 47
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
438
 
6.1%
437
 
6.1%
437
 
6.1%
404
 
5.7%
397
 
5.6%
393
 
5.5%
339
 
4.7%
327
 
4.6%
326
 
4.6%
324
 
4.5%
Other values (216) 3327
46.5%
Common
ValueCountFrequency (%)
2032
37.5%
1 729
 
13.4%
, 414
 
7.6%
) 325
 
6.0%
( 325
 
6.0%
2 319
 
5.9%
0 234
 
4.3%
3 212
 
3.9%
4 163
 
3.0%
5 138
 
2.5%
Other values (6) 534
 
9.8%
Latin
ValueCountFrequency (%)
B 10
21.3%
S 9
19.1%
e 9
19.1%
C 6
12.8%
A 5
10.6%
R 2
 
4.3%
O 1
 
2.1%
W 1
 
2.1%
E 1
 
2.1%
N 1
 
2.1%
Other values (2) 2
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7149
56.6%
ASCII 5472
43.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2032
37.1%
1 729
 
13.3%
, 414
 
7.6%
) 325
 
5.9%
( 325
 
5.9%
2 319
 
5.8%
0 234
 
4.3%
3 212
 
3.9%
4 163
 
3.0%
5 138
 
2.5%
Other values (18) 581
 
10.6%
Hangul
ValueCountFrequency (%)
438
 
6.1%
437
 
6.1%
437
 
6.1%
404
 
5.7%
397
 
5.6%
393
 
5.5%
339
 
4.7%
327
 
4.6%
326
 
4.6%
324
 
4.5%
Other values (216) 3327
46.5%

소재지전화
Text

MISSING 

Distinct105
Distinct (%)97.2%
Missing216
Missing (%)66.7%
Memory size2.7 KiB
2023-12-11T01:06:46.198706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length12.111111
Min length11

Characters and Unicode

Total characters1308
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique102 ?
Unique (%)94.4%

Sample

1st row051-941-5525
2nd row051-974-0021
3rd row051-715-2200
4th row051-970-9125
5th row051-271-8465
ValueCountFrequency (%)
051-633-0102 2
 
1.9%
051-973-1548 2
 
1.9%
051-715-2200 2
 
1.9%
051-201-2360 1
 
0.9%
051-517-3168 1
 
0.9%
051-941-9874 1
 
0.9%
051-711-4363 1
 
0.9%
051-899-6928 1
 
0.9%
051-990-0011 1
 
0.9%
051-522-0713 1
 
0.9%
Other values (95) 95
88.0%
2023-12-11T01:06:46.681927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 227
17.4%
- 216
16.5%
1 197
15.1%
5 163
12.5%
2 115
8.8%
7 95
7.3%
9 71
 
5.4%
8 65
 
5.0%
3 61
 
4.7%
6 49
 
3.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1092
83.5%
Dash Punctuation 216
 
16.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 227
20.8%
1 197
18.0%
5 163
14.9%
2 115
10.5%
7 95
8.7%
9 71
 
6.5%
8 65
 
6.0%
3 61
 
5.6%
6 49
 
4.5%
4 49
 
4.5%
Dash Punctuation
ValueCountFrequency (%)
- 216
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1308
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 227
17.4%
- 216
16.5%
1 197
15.1%
5 163
12.5%
2 115
8.8%
7 95
7.3%
9 71
 
5.4%
8 65
 
5.0%
3 61
 
4.7%
6 49
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1308
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 227
17.4%
- 216
16.5%
1 197
15.1%
5 163
12.5%
2 115
8.8%
7 95
7.3%
9 71
 
5.4%
8 65
 
5.0%
3 61
 
4.7%
6 49
 
3.7%

Missing values

2023-12-11T01:06:43.496386image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:06:43.597584image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T01:06:43.687547image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

구분업종명업소명소재지(도로명)소재지전화
0휴게음식점커피숍요거프레소 김해공항국내선점부산광역시 강서구 공항진입로 108 (대저2동,국내선2층)051-941-5525
1휴게음식점커피숍김해공항 설빙부산광역시 강서구 공항진입로 108 (대저2동,국제선2층)051-974-0021
2휴게음식점커피숍(주)푸르웰 가덕해양Park커피점부산광역시 강서구 거가대로 2571 (천성동)051-715-2200
3휴게음식점커피숍삼성웰스토리(주)전기부산웰스토리카페부산광역시 강서구 녹산산업중로 333, 14호 (송정동)051-970-9125
4휴게음식점커피숍스타벅스 명지오션시티점부산광역시 강서구 명지오션시티11로 66, 1,2층 104,202호 (명지동, 지타워)051-271-8465
5휴게음식점커피숍푸디스트(주) PNIT 부산신항국제터미널본관카페부산광역시 강서구 신항남로 330 (성북동, 부산신항국제터미널)051-290-8010
6휴게음식점커피숍이디야커피명지점부산광역시 강서구 명지오션시티8로16번길 25 (명지동)051-311-9199
7휴게음식점커피숍소보루부산광역시 강서구 대저로 263 (대저1동)051-972-6859
8휴게음식점커피숍이디야커피 신호점부산광역시 강서구 신호산단5로 42 (신호동)<NA>
9휴게음식점커피숍투썸플레이스 대저점부산광역시 강서구 공항로811번가길 27-1 (대저2동)<NA>
구분업종명업소명소재지(도로명)소재지전화
314휴게음식점커피숍단아헌부산광역시 강서구 명지국제2로28번길 23, 1동 103,104호 (명지동)<NA>
315휴게음식점커피숍커피베이 부산국제신도시점부산광역시 강서구 명지국제6로 99, A-209호 (명지동)<NA>
316휴게음식점커피숍썰물부산광역시 강서구 명지국제2로 80, 비주거시설동 1-52호 (명지동, e편한세상 명지)<NA>
317휴게음식점커피숍르카페(Le cafe')부산광역시 강서구 신호산단2로 43-7, 1층 일부호 (신호동)<NA>
318휴게음식점커피숍카페나루부산광역시 강서구 생곡로 230-56, 2층 (생곡동)<NA>
319휴게음식점커피숍어랏투고명지더샵점부산광역시 강서구 명지국제2로 29, 1층 111호 (명지동)<NA>
320휴게음식점커피숍카페율하온 명지국제신도시점부산광역시 강서구 명지국제7로 130, 상가2동 101-2호 (명지동, 더 힐 시그니처)051-924-5870
321휴게음식점커피숍담윤부산광역시 강서구 명지국제13로가길 1, 1층 (명지동)<NA>
322휴게음식점커피숍내림 오션시티점부산광역시 강서구 명지오션시티4로 70, 1층 105호 (명지동)<NA>
323휴게음식점커피숍부산씽크합공장 웰빙주식회사부산광역시 강서구 낙동북로 134, 1층 일부호 (강동동)<NA>