Overview

Dataset statistics

Number of variables: 4
Number of observations: 3524
Missing cells: 1151
Missing cells (%): 8.2%
Duplicate rows: 2
Duplicate rows (%): 0.1%
Total size in memory: 110.3 KiB
Average record size in memory: 32.0 B
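The missing-cell percentage follows from the raw counts: missing cells divided by the total number of cells (variables × observations). A minimal check, with illustrative variable names that are not part of the report:

```python
# Figures taken from the dataset statistics above.
n_variables = 4
n_observations = 3524
missing_cells = 1151

total_cells = n_variables * n_observations        # every cell in the table
missing_pct = 100 * missing_cells / total_cells   # share of empty cells

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```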

Variable types

Categorical: 1
Text: 3

Dataset

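Description: 부산광역시연제구_식품접객업소현황_20221114 (Busan Metropolitan City Yeonje-gu, status of food service establishments, 20221114)
Author: 부산광역시 연제구 (Yeonje-gu, Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3043636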

Alerts

Duplicates: Dataset has 2 (0.1%) duplicate rows
Missing: 소재지전화 has 1151 (32.7%) missing values

Reproduction

Analysis started: 2023-12-10 16:40:03.471476
Analysis finished: 2023-12-10 16:40:04.364024
Duration: 0.89 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

업종명 (business type)
Categorical

Distinct: 6
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 27.7 KiB

Length

Max length: 6
Median length: 5
Mean length: 5.0354711
Min length: 4
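These length statistics can be rederived from the category counts listed under Common Values, since each category contributes its character count once per observation. The sketch below assumes length means the number of Unicode code points, which is what Python's `len` returns:

```python
from statistics import mean, median

# Category counts as listed under Common Values for 업종명.
counts = {
    "일반음식점": 2485,
    "휴게음식점": 672,
    "유흥주점영업": 183,
    "단란주점": 86,
    "제과점영업": 70,
    "위탁급식영업": 28,
}

# Expand to one length per observation (3524 values in total).
lengths = [len(value) for value, n in counts.items() for _ in range(n)]

print(max(lengths), median(lengths), round(mean(lengths), 7), min(lengths))
```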

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 일반음식점
2nd row: 일반음식점
3rd row: 일반음식점
4th row: 일반음식점
5th row: 일반음식점

Common Values

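Value | Count | Frequency (%)
일반음식점 (general restaurant) | 2485 | 70.5%
휴게음식점 (refreshment restaurant, e.g. cafés and snack bars) | 672 | 19.1%
유흥주점영업 (entertainment bar) | 183 | 5.2%
단란주점 (karaoke bar) | 86 | 2.4%
제과점영업 (bakery) | 70 | 2.0%
위탁급식영업 (contract food service) | 28 | 0.8%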

Length

Histogram of lengths of the category


업소명 (business name)
Text

Distinct: 3424
Distinct (%): 97.2%
Missing: 0
Missing (%): 0.0%
Memory size: 27.7 KiB

Length

Max length: 41
Median length: 25
Mean length: 6.7905789
Min length: 1

Characters and Unicode

Total characters: 23930
Distinct characters: 883
Distinct categories: 11
Distinct scripts: 5
Distinct blocks: 7
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
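
As an illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. The two-letter codes map onto the category names used in this report (`Lo` = Other Letter, `Lu` = Uppercase Letter, `Ps` = Open Punctuation, `Pe` = Close Punctuation). This is a sketch of the idea, not necessarily how the profiling tool computes its tables; `category_counts` is a hypothetical helper name.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count Unicode general categories (e.g. 'Lo', 'Lu', 'Ps') in text."""
    return Counter(unicodedata.category(ch) for ch in text)


# One of the sample values listed for this variable.
counts = category_counts("치킨업(UP)")
print(counts)
```

The standard library does not expose the script or block properties, so reproducing those tables would need an additional data source.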

Unique

Unique: 3341
Unique (%): 94.8%

Sample

1st row: 컴포즈교대점
2nd row: 언양진미식당
3rd row: 용문각(2호점)
4th row: 치킨업(UP)
5th row: 밀양돼지국밥

Words

Value | Count | Frequency (%)
연산점 | 130 | 2.5%
시청점 | 38 | 0.7%
부산시청점 | 33 | 0.6%
세븐일레븐 | 24 | 0.5%
씨유 | 23 | 0.5%
부산연산점 | 20 | 0.4%
커피 | 17 | 0.3%
연산토곡점 | 16 | 0.3%
카페 | 15 | 0.3%
칼국수 | 13 | 0.3%
Other values (3932) | 4777 | 93.6%

Most occurring characters

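Value | Count | Frequency (%)
(space) | 1596 | 6.7%
(glyph lost) | 889 | 3.7%
(glyph lost) | 710 | 3.0%
(glyph lost) | 569 | 2.4%
(glyph lost) | 480 | 2.0%
(glyph lost) | 417 | 1.7%
(glyph lost) | 312 | 1.3%
( | 294 | 1.2%
) | 294 | 1.2%
(glyph lost) | 278 | 1.2%
Other values (873) | 18091 | 75.6%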

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 20256 | 84.6%
Space Separator | 1596 | 6.7%
Uppercase Letter | 541 | 2.3%
Lowercase Letter | 466 | 1.9%
Decimal Number | 405 | 1.7%
Open Punctuation | 294 | 1.2%
Close Punctuation | 294 | 1.2%
Other Punctuation | 71 | 0.3%
Letter Number | 3 | < 0.1%
Other Symbol | 2 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(glyph lost) | 889 | 4.4%
(glyph lost) | 710 | 3.5%
(glyph lost) | 569 | 2.8%
(glyph lost) | 480 | 2.4%
(glyph lost) | 417 | 2.1%
(glyph lost) | 312 | 1.5%
(glyph lost) | 278 | 1.4%
(glyph lost) | 255 | 1.3%
(glyph lost) | 223 | 1.1%
(glyph lost) | 215 | 1.1%
Other values (798) | 15908 | 78.5%

Uppercase Letter
Value | Count | Frequency (%)
C | 69 | 12.8%
S | 49 | 9.1%
G | 39 | 7.2%
E | 36 | 6.7%
B | 36 | 6.7%
O | 34 | 6.3%
A | 30 | 5.5%
P | 27 | 5.0%
T | 21 | 3.9%
M | 19 | 3.5%
Other values (16) | 181 | 33.5%

Lowercase Letter
Value | Count | Frequency (%)
e | 81 | 17.4%
a | 46 | 9.9%
o | 40 | 8.6%
i | 33 | 7.1%
n | 32 | 6.9%
f | 31 | 6.7%
r | 27 | 5.8%
c | 26 | 5.6%
h | 17 | 3.6%
t | 17 | 3.6%
Other values (14) | 116 | 24.9%

Decimal Number
Value | Count | Frequency (%)
2 | 101 | 24.9%
5 | 68 | 16.8%
1 | 67 | 16.5%
0 | 47 | 11.6%
3 | 32 | 7.9%
9 | 23 | 5.7%
8 | 21 | 5.2%
4 | 18 | 4.4%
6 | 15 | 3.7%
7 | 13 | 3.2%

Other Punctuation
Value | Count | Frequency (%)
& | 42 | 59.2%
. | 19 | 26.8%
' | 3 | 4.2%
% | 2 | 2.8%
! | 2 | 2.8%
# | 1 | 1.4%
· | 1 | 1.4%
; | 1 | 1.4%

Letter Number
Value | Count | Frequency (%)
(glyph lost) | 2 | 66.7%
(glyph lost) | 1 | 33.3%

Space Separator
Value | Count | Frequency (%)
(space) | 1596 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 294 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 294 | 100.0%

Other Symbol
Value | Count | Frequency (%)
(glyph lost) | 2 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 2 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 20237 | 84.6%
Common | 2664 | 11.1%
Latin | 1010 | 4.2%
Han | 17 | 0.1%
Hiragana | 2 | < 0.1%

Most frequent character per script

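Hangul
Value | Count | Frequency (%)
(glyph lost) | 889 | 4.4%
(glyph lost) | 710 | 3.5%
(glyph lost) | 569 | 2.8%
(glyph lost) | 480 | 2.4%
(glyph lost) | 417 | 2.1%
(glyph lost) | 312 | 1.5%
(glyph lost) | 278 | 1.4%
(glyph lost) | 255 | 1.3%
(glyph lost) | 223 | 1.1%
(glyph lost) | 215 | 1.1%
Other values (780) | 15889 | 78.5%

Latin
Value | Count | Frequency (%)
e | 81 | 8.0%
C | 69 | 6.8%
S | 49 | 4.9%
a | 46 | 4.6%
o | 40 | 4.0%
G | 39 | 3.9%
E | 36 | 3.6%
B | 36 | 3.6%
O | 34 | 3.4%
i | 33 | 3.3%
Other values (42) | 547 | 54.2%

Common
Value | Count | Frequency (%)
(space) | 1596 | 59.9%
( | 294 | 11.0%
) | 294 | 11.0%
2 | 101 | 3.8%
5 | 68 | 2.6%
1 | 67 | 2.5%
0 | 47 | 1.8%
& | 42 | 1.6%
3 | 32 | 1.2%
9 | 23 | 0.9%
Other values (13) | 100 | 3.8%

Han
Value | Count | Frequency (%)
(glyph lost) | 2 | 11.8%
(glyph lost) | 1 | 5.9%
(glyph lost) | 1 | 5.9%
(glyph lost) | 1 | 5.9%
(glyph lost) | 1 | 5.9%
(glyph lost) | 1 | 5.9%
(glyph lost) | 1 | 5.9%
(glyph lost) | 1 | 5.9%
(glyph lost) | 1 | 5.9%
(glyph lost) | 1 | 5.9%
Other values (6) | 6 | 35.3%

Hiragana
Value | Count | Frequency (%)
(glyph lost) | 1 | 50.0%
(glyph lost) | 1 | 50.0%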

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 20237 | 84.6%
ASCII | 3668 | 15.3%
CJK | 17 | 0.1%
Number Forms | 3 | < 0.1%
Letterlike Symbols | 2 | < 0.1%
Hiragana | 2 | < 0.1%
None | 1 | < 0.1%

Most frequent character per block

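ASCII
Value | Count | Frequency (%)
(space) | 1596 | 43.5%
( | 294 | 8.0%
) | 294 | 8.0%
2 | 101 | 2.8%
e | 81 | 2.2%
C | 69 | 1.9%
5 | 68 | 1.9%
1 | 67 | 1.8%
S | 49 | 1.3%
0 | 47 | 1.3%
Other values (61) | 1002 | 27.3%

Hangul
Value | Count | Frequency (%)
(glyph lost) | 889 | 4.4%
(glyph lost) | 710 | 3.5%
(glyph lost) | 569 | 2.8%
(glyph lost) | 480 | 2.4%
(glyph lost) | 417 | 2.1%
(glyph lost) | 312 | 1.5%
(glyph lost) | 278 | 1.4%
(glyph lost) | 255 | 1.3%
(glyph lost) | 223 | 1.1%
(glyph lost) | 215 | 1.1%
Other values (780) | 15889 | 78.5%

CJK
Value | Count | Frequency (%)
(glyph lost) | 2 | 11.8%
(glyph lost) | 1 | 5.9%
(glyph lost) | 1 | 5.9%
(glyph lost) | 1 | 5.9%
(glyph lost) | 1 | 5.9%
(glyph lost) | 1 | 5.9%
(glyph lost) | 1 | 5.9%
(glyph lost) | 1 | 5.9%
(glyph lost) | 1 | 5.9%
(glyph lost) | 1 | 5.9%
Other values (6) | 6 | 35.3%

Letterlike Symbols
Value | Count | Frequency (%)
(glyph lost) | 2 | 100.0%

Number Forms
Value | Count | Frequency (%)
(glyph lost) | 2 | 66.7%
(glyph lost) | 1 | 33.3%

Hiragana
Value | Count | Frequency (%)
(glyph lost) | 1 | 50.0%
(glyph lost) | 1 | 50.0%

None
Value | Count | Frequency (%)
· | 1 | 100.0%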

소재지(도로명) (location, road-name address)
Text

Distinct: 3067
Distinct (%): 87.0%
Missing: 0
Missing (%): 0.0%
Memory size: 27.7 KiB

Length

Max length: 67
Median length: 60
Mean length: 30.13025
Min length: 1

Characters and Unicode

Total characters: 106179
Distinct characters: 305
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 2771
Unique (%): 78.6%

Sample

1st row: 부산광역시 연제구 교대로 18 (거제동)
2nd row: 부산광역시 연제구 교대로 11 (거제동)
3rd row: 부산광역시 연제구 중앙대로1175번길 33-1 (거제동)
4th row: 부산광역시 연제구 거제천로 183 (거제동)
5th row: 부산광역시 연제구 중앙대로1120번길 13 (연산동)

Words

Value | Count | Frequency (%)
부산광역시 | 3517 | 16.6%
연제구 | 3517 | 16.6%
연산동 | 2806 | 13.2%
1층 | 1344 | 6.3%
거제동 | 736 | 3.5%
2층 | 207 | 1.0%
과정로 | 195 | 0.9%
월드컵대로 | 173 | 0.8%
반송로 | 167 | 0.8%
일부호 | 163 | 0.8%
Other values (1460) | 8417 | 39.6%

Most occurring characters

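Value | Count | Frequency (%)
(space) | 19867 | 18.7%
(glyph lost) | 6755 | 6.4%
(glyph lost) | 6473 | 6.1%
1 | 5556 | 5.2%
(glyph lost) | 4740 | 4.5%
(glyph lost) | 3911 | 3.7%
(glyph lost) | 3911 | 3.7%
(glyph lost) | 3770 | 3.6%
( | 3645 | 3.4%
) | 3645 | 3.4%
Other values (295) | 43906 | 41.4%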

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 62087 | 58.5%
Space Separator | 19867 | 18.7%
Decimal Number | 16064 | 15.1%
Open Punctuation | 3645 | 3.4%
Close Punctuation | 3645 | 3.4%
Dash Punctuation | 518 | 0.5%
Uppercase Letter | 292 | 0.3%
Other Punctuation | 30 | < 0.1%
Math Symbol | 24 | < 0.1%
Lowercase Letter | 7 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(glyph lost) | 6755 | 10.9%
(glyph lost) | 6473 | 10.4%
(glyph lost) | 4740 | 7.6%
(glyph lost) | 3911 | 6.3%
(glyph lost) | 3911 | 6.3%
(glyph lost) | 3770 | 6.1%
(glyph lost) | 3566 | 5.7%
(glyph lost) | 3530 | 5.7%
(glyph lost) | 3521 | 5.7%
(glyph lost) | 3520 | 5.7%
Other values (259) | 18390 | 29.6%

Uppercase Letter
Value | Count | Frequency (%)
B | 40 | 13.7%
E | 37 | 12.7%
S | 33 | 11.3%
W | 32 | 11.0%
A | 29 | 9.9%
K | 28 | 9.6%
I | 27 | 9.2%
V | 26 | 8.9%
C | 19 | 6.5%
D | 9 | 3.1%
Other values (4) | 12 | 4.1%

Decimal Number
Value | Count | Frequency (%)
1 | 5556 | 34.6%
2 | 2300 | 14.3%
3 | 1679 | 10.5%
0 | 1305 | 8.1%
4 | 1180 | 7.3%
5 | 1097 | 6.8%
8 | 819 | 5.1%
6 | 797 | 5.0%
7 | 717 | 4.5%
9 | 614 | 3.8%

Lowercase Letter
Value | Count | Frequency (%)
e | 4 | 57.1%
s | 1 | 14.3%
b | 1 | 14.3%
a | 1 | 14.3%

Other Punctuation
Value | Count | Frequency (%)
. | 18 | 60.0%
· | 7 | 23.3%
& | 5 | 16.7%

Space Separator
Value | Count | Frequency (%)
(space) | 19867 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 3645 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 3645 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 518 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 24 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 62087 | 58.5%
Common | 43793 | 41.2%
Latin | 299 | 0.3%

Most frequent character per script

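Hangul
Value | Count | Frequency (%)
(glyph lost) | 6755 | 10.9%
(glyph lost) | 6473 | 10.4%
(glyph lost) | 4740 | 7.6%
(glyph lost) | 3911 | 6.3%
(glyph lost) | 3911 | 6.3%
(glyph lost) | 3770 | 6.1%
(glyph lost) | 3566 | 5.7%
(glyph lost) | 3530 | 5.7%
(glyph lost) | 3521 | 5.7%
(glyph lost) | 3520 | 5.7%
Other values (259) | 18390 | 29.6%

Common
Value | Count | Frequency (%)
(space) | 19867 | 45.4%
1 | 5556 | 12.7%
( | 3645 | 8.3%
) | 3645 | 8.3%
2 | 2300 | 5.3%
3 | 1679 | 3.8%
0 | 1305 | 3.0%
4 | 1180 | 2.7%
5 | 1097 | 2.5%
8 | 819 | 1.9%
Other values (8) | 2700 | 6.2%

Latin
Value | Count | Frequency (%)
B | 40 | 13.4%
E | 37 | 12.4%
S | 33 | 11.0%
W | 32 | 10.7%
A | 29 | 9.7%
K | 28 | 9.4%
I | 27 | 9.0%
V | 26 | 8.7%
C | 19 | 6.4%
D | 9 | 3.0%
Other values (8) | 19 | 6.4%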

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 62086 | 58.5%
ASCII | 44085 | 41.5%
None | 7 | < 0.1%
Compat Jamo | 1 | < 0.1%

Most frequent character per block

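ASCII
Value | Count | Frequency (%)
(space) | 19867 | 45.1%
1 | 5556 | 12.6%
( | 3645 | 8.3%
) | 3645 | 8.3%
2 | 2300 | 5.2%
3 | 1679 | 3.8%
0 | 1305 | 3.0%
4 | 1180 | 2.7%
5 | 1097 | 2.5%
8 | 819 | 1.9%
Other values (25) | 2992 | 6.8%

Hangul
Value | Count | Frequency (%)
(glyph lost) | 6755 | 10.9%
(glyph lost) | 6473 | 10.4%
(glyph lost) | 4740 | 7.6%
(glyph lost) | 3911 | 6.3%
(glyph lost) | 3911 | 6.3%
(glyph lost) | 3770 | 6.1%
(glyph lost) | 3566 | 5.7%
(glyph lost) | 3530 | 5.7%
(glyph lost) | 3521 | 5.7%
(glyph lost) | 3520 | 5.7%
Other values (258) | 18389 | 29.6%

None
Value | Count | Frequency (%)
· | 7 | 100.0%

Compat Jamo
Value | Count | Frequency (%)
(glyph lost) | 1 | 100.0%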

소재지전화 (location phone number)
Text

MISSING

Distinct: 2302
Distinct (%): 97.0%
Missing: 1151
Missing (%): 32.7%
Memory size: 27.7 KiB

Length

Max length: 14
Median length: 12
Mean length: 12.024442
Min length: 5

Characters and Unicode

Total characters: 28534
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 2245
Unique (%): 94.6%

Sample

1st row: 051-506-0355
2nd row: 051-504-7226
3rd row: 051-864-6005
4th row: 051-866-9050
5th row: 051-996-7666

Words

Value | Count | Frequency (%)
051-500-8000 | 7 | 0.3%
051-890-8023 | 5 | 0.2%
051-851-2190 | 4 | 0.2%
051-850-8000 | 3 | 0.1%
051-507-3939 | 3 | 0.1%
051-862-6974 | 3 | 0.1%
051 | 3 | 0.1%
051-851-8859 | 2 | 0.1%
051-867-3040 | 2 | 0.1%
051-853-1600 | 2 | 0.1%
Other values (2292) | 2339 | 98.6%

Most occurring characters

Value | Count | Frequency (%)
- | 4746 | 16.6%
5 | 4663 | 16.3%
0 | 3991 | 14.0%
1 | 3655 | 12.8%
8 | 2919 | 10.2%
6 | 1979 | 6.9%
7 | 1647 | 5.8%
2 | 1467 | 5.1%
3 | 1345 | 4.7%
9 | 1188 | 4.2%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 23788 | 83.4%
Dash Punctuation | 4746 | 16.6%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
5 | 4663 | 19.6%
0 | 3991 | 16.8%
1 | 3655 | 15.4%
8 | 2919 | 12.3%
6 | 1979 | 8.3%
7 | 1647 | 6.9%
2 | 1467 | 6.2%
3 | 1345 | 5.7%
9 | 1188 | 5.0%
4 | 934 | 3.9%

Dash Punctuation
Value | Count | Frequency (%)
- | 4746 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 28534 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 4746 | 16.6%
5 | 4663 | 16.3%
0 | 3991 | 14.0%
1 | 3655 | 12.8%
8 | 2919 | 10.2%
6 | 1979 | 6.9%
7 | 1647 | 5.8%
2 | 1467 | 5.1%
3 | 1345 | 4.7%
9 | 1188 | 4.2%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 28534 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 4746 | 16.6%
5 | 4663 | 16.3%
0 | 3991 | 14.0%
1 | 3655 | 12.8%
8 | 2919 | 10.2%
6 | 1979 | 6.9%
7 | 1647 | 5.8%
2 | 1467 | 5.1%
3 | 1345 | 4.7%
9 | 1188 | 4.2%

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
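
The per-column nullity that these plots visualise is a count of missing values in each column. A minimal sketch using a few rows copied from the Sample section, with `None` standing in for `<NA>`; `missing_per_column` is a hypothetical helper, not part of the profiling tool:

```python
# A few rows copied from the Sample section; None stands for <NA>.
rows = [
    {"업소명": "트레이더스베이커리 연산점", "소재지전화": "051-968-5648"},
    {"업소명": "토리제과", "소재지전화": None},
    {"업소명": "프흐미에", "소재지전화": None},
    {"업소명": "홈플러스(주)아시아드점", "소재지전화": "051-890-8023"},
]


def missing_per_column(rows):
    """Return {column: number of rows where the value is None}."""
    columns = rows[0].keys()
    return {col: sum(row[col] is None for row in rows) for col in columns}


print(missing_per_column(rows))

# The same ratio for the full 소재지전화 column, from the figures in the report:
full_column_pct = 100 * 1151 / 3524
print(f"{full_column_pct:.1f}%")
```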

Sample

First rows

Index | 업종명 | 업소명 | 소재지(도로명) | 소재지전화
0 | 일반음식점 | 컴포즈교대점 | 부산광역시 연제구 교대로 18 (거제동) | 051-506-0355
1 | 일반음식점 | 언양진미식당 | 부산광역시 연제구 교대로 11 (거제동) | 051-504-7226
2 | 일반음식점 | 용문각(2호점) | 부산광역시 연제구 중앙대로1175번길 33-1 (거제동) | 051-864-6005
3 | 일반음식점 | 치킨업(UP) | 부산광역시 연제구 거제천로 183 (거제동) | 051-866-9050
4 | 일반음식점 | 밀양돼지국밥 | 부산광역시 연제구 중앙대로1120번길 13 (연산동) | 051-996-7666
5 | 일반음식점 | 연산식당 | 부산광역시 연제구 과정로265번가길 5 (연산동) | 051-866-5258
6 | 일반음식점 | 고성횟집 | 부산광역시 연제구 중앙대로 1116-9 (연산동) | 051-861-7666
7 | 일반음식점 | 쏘맥 | 부산광역시 연제구 중앙대로 1116-11 (연산동) | 051-867-4789
8 | 일반음식점 | 24시콩나물해장국 | 부산광역시 연제구 중앙대로 1116-11 (연산동) | 051-852-5453
9 | 일반음식점 | 참나무숯불갈비 | 부산광역시 연제구 중앙대로1133번길 13 (연산동) | 051-852-2014

Last rows

Index | 업종명 | 업소명 | 소재지(도로명) | 소재지전화
3514 | 제과점영업 | 트레이더스베이커리 연산점 | 부산광역시 연제구 좌수영로 241 이마트트레이더스연산점 지하1층 (연산동) | 051-968-5648
3515 | 제과점영업 | 토리제과 | 부산광역시 연제구 거제천로118번길 14 2층 (연산동) | <NA>
3516 | 제과점영업 | 프흐미에 | 부산광역시 연제구 월드컵대로10번길 8 1층 (연산동) | <NA>
3517 | 제과점영업 | 무무제과 | 부산광역시 연제구 연안로 5 1층 (연산동) | <NA>
3518 | 제과점영업 | 카페호밀 | 부산광역시 연제구 명륜로2번길 23 1층 (거제동) | <NA>
3519 | 제과점영업 | 베이크어케이크(BAKE A CAKE) | 부산광역시 연제구 중앙대로 1130 1층 114호 (연산동 연산동 SK VIEW(2단지)) | <NA>
3520 | 제과점영업 | 구자윤과자점 | 부산광역시 연제구 세병로 29 1층 (연산동) | <NA>
3521 | 제과점영업 | 구자윤과자점 | 부산광역시 연제구 시청로 38 1.2층 (연산동) | <NA>
3522 | 제과점영업 | 홈플러스(주)아시아드점 | 부산광역시 연제구 종합운동장로 7 홈플러스 아시아드점 지하2층 (거제동) | 051-890-8023
3523 | 제과점영업 | 홈플러스(주)아시아드점 | 부산광역시 연제구 종합운동장로 7 홈플러스 아시아드점 지하2층 (거제동) | 051-890-8023

Duplicate rows

Most frequently occurring

Index | 업종명 | 업소명 | 소재지(도로명) | 소재지전화 | # duplicates
0 | 제과점영업 | 홈플러스(주)아시아드점 | 부산광역시 연제구 종합운동장로 7 홈플러스 아시아드점 지하2층 (거제동) | 051-890-8023 | 2
1 | 휴게음식점 | 홈플러스(주)아시아드점 | 부산광역시 연제구 종합운동장로 7 홈플러스 아시아드점 지하2층 (거제동) | 051-890-8023 | 2
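
Duplicate rows like these can be found by counting identical records. The sketch below uses four rows from the end of the sample (indices 3520 to 3523), treats each row as a tuple, and keeps the ones seen more than once. It illustrates the idea only and is not the profiling tool's own implementation.

```python
from collections import Counter

# Rows 3520-3523 from the sample: (업종명, 업소명, 소재지(도로명), 소재지전화).
rows = [
    ("제과점영업", "구자윤과자점", "부산광역시 연제구 세병로 29 1층 (연산동)", None),
    ("제과점영업", "구자윤과자점", "부산광역시 연제구 시청로 38 1.2층 (연산동)", None),
    ("제과점영업", "홈플러스(주)아시아드점",
     "부산광역시 연제구 종합운동장로 7 홈플러스 아시아드점 지하2층 (거제동)", "051-890-8023"),
    ("제과점영업", "홈플러스(주)아시아드점",
     "부산광역시 연제구 종합운동장로 7 홈플러스 아시아드점 지하2층 (거제동)", "051-890-8023"),
]

# A row is a duplicate only if every field matches another row exactly.
row_counts = Counter(rows)
duplicates = {row: n for row, n in row_counts.items() if n > 1}

for row, n in duplicates.items():
    print(n, row[1], row[3])
```

The two 구자윤과자점 rows share a name but differ in address, so only the 홈플러스(주)아시아드점 row is reported.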