Overview

Dataset statistics

Number of variables: 5
Number of observations: 144
Missing cells: 90
Missing cells (%): 12.5%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 5.8 KiB
Average record size in memory: 40.9 B
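
The missing-cell percentage follows from the counts above: 90 missing cells out of 5 × 144 = 720 cells in total. A minimal sketch of that arithmetic is shown here; the function name is illustrative and is not part of ydata-profiling.

```python
def missing_cell_pct(n_missing: int, n_rows: int, n_cols: int) -> float:
    """Return missing cells as a percentage of all cells in the table."""
    total_cells = n_rows * n_cols
    if total_cells == 0:
        raise ValueError("dataset has no cells")
    return 100.0 * n_missing / total_cells


# Figures taken from the dataset statistics in this report.
pct = missing_cell_pct(n_missing=90, n_rows=144, n_cols=5)
print(f"{pct:.1f}%")  # 12.5%
```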

Variable types

Categorical: 1
Text: 4

Dataset

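Description: 부산광역시부산진구_제과점현황_20221117 (Busan Metropolitan City, Busanjin-gu: bakery status, 2022-11-17)
Author: 부산광역시 부산진구 (Busanjin-gu, Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15094117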

Alerts

업종명 has constant value "제과점영업" [Constant]
소재지전화 has 89 (61.8%) missing values [Missing]
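
The two alerts above can be reproduced with simple per-column checks. The sketch below is an illustration only: `column_alerts` and its `missing_threshold` parameter are hypothetical and do not reflect how ydata-profiling decides when to raise an alert, and the phone values are placeholders because the report does not list every value.

```python
from typing import List, Optional, Sequence


def column_alerts(name: str, values: Sequence[Optional[str]],
                  missing_threshold: float = 0.5) -> List[str]:
    """Return alert messages for a constant column or a mostly-missing column."""
    alerts = []
    present = [v for v in values if v is not None]
    # A column is constant when all non-missing values are identical.
    if present and len(set(present)) == 1:
        alerts.append(f'{name} has constant value "{present[0]}"')
    n_missing = len(values) - len(present)
    # missing_threshold is an assumed cut-off chosen for this example.
    if values and n_missing / len(values) > missing_threshold:
        share = 100.0 * n_missing / len(values)
        alerts.append(f"{name} has {n_missing} ({share:.1f}%) missing values")
    return alerts


# 144 rows, as in the report; placeholder strings stand in for real phone numbers.
business_type = ["제과점영업"] * 144
phone = [None] * 89 + [f"phone-{i}" for i in range(55)]

print(column_alerts("업종명", business_type))
print(column_alerts("소재지전화", phone))
```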

Reproduction

Analysis started: 2023-12-10 17:37:44.499836
Analysis finished: 2023-12-10 17:37:45.666513
Duration: 1.17 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
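
A report of this kind can be regenerated along the following lines. This is a sketch under assumptions: the CSV path, the output path, the report title and the file encoding are all placeholders, and the configuration actually used for this report is the one in config.json, which is not reproduced here. The imports sit inside the function so that the definition loads even where pandas and ydata-profiling are not installed.

```python
def build_profile(csv_path: str, html_path: str = "report.html",
                  encoding: str = "utf-8") -> None:
    """Profile a CSV file and write the HTML report to html_path."""
    # Third-party dependencies; both must be installed for this to run.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Read every column as text so phone numbers keep their leading zeros.
    # The encoding is an assumption; Korean public datasets are often cp949.
    df = pd.read_csv(csv_path, dtype="string", encoding=encoding)

    report = ProfileReport(df, title="Bakery status profile (placeholder title)")
    report.to_file(html_path)
```

Calling `build_profile("bakeries.csv")` with a hypothetical local copy of the dataset would write `report.html` to the working directory.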

Variables

업종명 (business type)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB
제과점영업: 144

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 제과점영업
2nd row: 제과점영업
3rd row: 제과점영업
4th row: 제과점영업
5th row: 제과점영업

Common Values

Value | Count | Frequency (%)
제과점영업 | 144 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

업소명 (business name)
Text

Distinct: 135
Distinct (%): 93.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Length

Max length: 31
Median length: 17.5
Mean length: 8.1319444
Min length: 2

Characters and Unicode

Total characters: 1171
Distinct characters: 278
Distinct categories: 8
Distinct scripts: 4
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
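
As an illustration of the character properties mentioned above, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below tallies categories for values that appear in this report; the helper name is illustrative. The standard library returns two-letter abbreviations, which correspond to the long names used in the report: `Lo` is Other Letter, `Zs` is Space Separator, `Nd` is Decimal Number, `Pd` is Dash Punctuation, `Ps` is Open Punctuation and `Pe` is Close Punctuation.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count the Unicode general category of every character in text."""
    return Counter(unicodedata.category(ch) for ch in text)


# Hangul syllables are "Lo" (Other Letter); the space is "Zs".
print(category_counts("파리바게뜨 개금백양점"))
# Digits are "Nd" (Decimal Number); the hyphen is "Pd" (Dash Punctuation).
print(category_counts("051-895-9008"))
```

Scripts and blocks are not available from `unicodedata`, so reproducing those tables would need an additional data source.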

Unique

Unique: 131
Unique (%): 91.0%

Sample

1st row: 빵굽는마을
2nd row: 스완베이커리
3rd row: 호수제과점
4th row: 파리바게뜨 개금백양점
5th row: 코코로브레드
Value | Count | Frequency (%)
파리바게뜨 | 14 | 7.7%
뚜레쥬르 | 4 | 2.2%
파리바게트신개금엘지점 | 2 | 1.1%
bakery | 2 | 1.1%
다옵스베이커리 | 2 | 1.1%
희와제과 | 2 | 1.1%
빵굽는마을 | 1 | 0.5%
cake | 1 | 0.5%
모구모구과자점인전포 | 1 | 0.5%
메종지미지니팍(masion | 1 | 0.5%
Other values (152) | 152 | 83.5%
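
The counts in this table appear to be per whitespace-separated token rather than per whole value. That is an inference, not something the report states: the rows sum to 182, which exceeds the 144 observations, and entries such as `메종지미지니팍(masion` look like fragments of longer names. Under that assumption, a minimal sketch of the tally looks like this (the helper name is illustrative, and the sample names are taken from rows shown elsewhere in this report).

```python
from collections import Counter
from typing import Iterable, Optional


def token_counts(values: Iterable[Optional[str]]) -> Counter:
    """Count whitespace-separated tokens across all non-missing values."""
    counts = Counter()
    for value in values:
        if value is None:
            continue  # missing cells contribute no tokens
        counts.update(value.split())
    return counts


names = [
    "빵굽는마을",
    "파리바게뜨 개금백양점",
    "뚜레쥬르 부산개금백병원",
    "밀키샵 전포점",
    "파밀리아 제과점",
]
counts = token_counts(names)
print(sum(counts.values()))  # 9 tokens from 5 names
```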

Most occurring characters

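Value | Count | Frequency (%)
(not preserved) | 62 | 5.3%
(not preserved) | 52 | 4.4%
(not preserved) | 41 | 3.5%
(space) | 38 | 3.2%
(not preserved) | 30 | 2.6%
(not preserved) | 29 | 2.5%
(not preserved) | 28 | 2.4%
(not preserved) | 28 | 2.4%
(not preserved) | 25 | 2.1%
(not preserved) | 21 | 1.8%
Other values (268) | 817 | 69.8%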

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 965 | 82.4%
Lowercase Letter | 66 | 5.6%
Uppercase Letter | 50 | 4.3%
Space Separator | 38 | 3.2%
Close Punctuation | 17 | 1.5%
Open Punctuation | 17 | 1.5%
Decimal Number | 14 | 1.2%
Other Punctuation | 4 | 0.3%

Most frequent character per category

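Other Letter
Value | Count | Frequency (%)
(not preserved) | 62 | 6.4%
(not preserved) | 52 | 5.4%
(not preserved) | 41 | 4.2%
(not preserved) | 30 | 3.1%
(not preserved) | 29 | 3.0%
(not preserved) | 28 | 2.9%
(not preserved) | 28 | 2.9%
(not preserved) | 25 | 2.6%
(not preserved) | 21 | 2.2%
(not preserved) | 21 | 2.2%
Other values (225) | 628 | 65.1%

Uppercase Letter
Value | Count | Frequency (%)
B | 6 | 12.0%
S | 5 | 10.0%
N | 4 | 8.0%
A | 4 | 8.0%
I | 4 | 8.0%
M | 4 | 8.0%
J | 3 | 6.0%
E | 2 | 4.0%
O | 2 | 4.0%
R | 2 | 4.0%
Other values (10) | 14 | 28.0%

Lowercase Letter
Value | Count | Frequency (%)
e | 12 | 18.2%
a | 10 | 15.2%
l | 7 | 10.6%
o | 6 | 9.1%
k | 6 | 9.1%
i | 5 | 7.6%
u | 4 | 6.1%
y | 3 | 4.5%
n | 3 | 4.5%
c | 3 | 4.5%
Other values (5) | 7 | 10.6%

Decimal Number
Value | Count | Frequency (%)
1 | 5 | 35.7%
2 | 5 | 35.7%
9 | 4 | 28.6%

Other Punctuation
Value | Count | Frequency (%)
? | 2 | 50.0%
' | 2 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 38 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 17 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 17 | 100.0%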

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 963 | 82.2%
Latin | 116 | 9.9%
Common | 90 | 7.7%
Han | 2 | 0.2%

Most frequent character per script

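Hangul
Value | Count | Frequency (%)
(not preserved) | 62 | 6.4%
(not preserved) | 52 | 5.4%
(not preserved) | 41 | 4.3%
(not preserved) | 30 | 3.1%
(not preserved) | 29 | 3.0%
(not preserved) | 28 | 2.9%
(not preserved) | 28 | 2.9%
(not preserved) | 25 | 2.6%
(not preserved) | 21 | 2.2%
(not preserved) | 21 | 2.2%
Other values (223) | 626 | 65.0%

Latin
Value | Count | Frequency (%)
e | 12 | 10.3%
a | 10 | 8.6%
l | 7 | 6.0%
o | 6 | 5.2%
k | 6 | 5.2%
B | 6 | 5.2%
S | 5 | 4.3%
i | 5 | 4.3%
N | 4 | 3.4%
A | 4 | 3.4%
Other values (25) | 51 | 44.0%

Common
Value | Count | Frequency (%)
(space) | 38 | 42.2%
) | 17 | 18.9%
( | 17 | 18.9%
1 | 5 | 5.6%
2 | 5 | 5.6%
9 | 4 | 4.4%
? | 2 | 2.2%
' | 2 | 2.2%

Han
Value | Count | Frequency (%)
(not preserved) | 1 | 50.0%
(not preserved) | 1 | 50.0%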

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 963 | 82.2%
ASCII | 206 | 17.6%
CJK | 1 | 0.1%
CJK Compat Ideographs | 1 | 0.1%

Most frequent character per block

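Hangul
Value | Count | Frequency (%)
(not preserved) | 62 | 6.4%
(not preserved) | 52 | 5.4%
(not preserved) | 41 | 4.3%
(not preserved) | 30 | 3.1%
(not preserved) | 29 | 3.0%
(not preserved) | 28 | 2.9%
(not preserved) | 28 | 2.9%
(not preserved) | 25 | 2.6%
(not preserved) | 21 | 2.2%
(not preserved) | 21 | 2.2%
Other values (223) | 626 | 65.0%

ASCII
Value | Count | Frequency (%)
(space) | 38 | 18.4%
) | 17 | 8.3%
( | 17 | 8.3%
e | 12 | 5.8%
a | 10 | 4.9%
l | 7 | 3.4%
o | 6 | 2.9%
k | 6 | 2.9%
B | 6 | 2.9%
1 | 5 | 2.4%
Other values (33) | 82 | 39.8%

CJK
Value | Count | Frequency (%)
(not preserved) | 1 | 100.0%

CJK Compat Ideographs
Value | Count | Frequency (%)
(not preserved) | 1 | 100.0%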

소재지(도로명) (location, road-name address)
Text

Distinct: 141
Distinct (%): 98.6%
Missing: 1
Missing (%): 0.7%
Memory size: 1.3 KiB

Length

Max length: 55
Median length: 46
Mean length: 35.034965
Min length: 23

Characters and Unicode

Total characters: 5010
Distinct characters: 183
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 139
Unique (%): 97.2%

Sample

1st row: 부산광역시 부산진구 동성로 79-1, 1층 (전포동)
2nd row: 부산광역시 부산진구 동평로 407 (양정동)
3rd row: 부산광역시 부산진구 거제대로60번길 34 (양정동)
4th row: 부산광역시 부산진구 백양관문로 3, 개금주공상가 1층 111,113호 (개금동)
5th row: 부산광역시 부산진구 새싹로 158 (연지동)
Value | Count | Frequency (%)
부산광역시 | 143 | 15.2%
부산진구 | 143 | 15.2%
1층 | 60 | 6.4%
전포동 | 34 | 3.6%
부전동 | 33 | 3.5%
가야대로 | 21 | 2.2%
개금동 | 14 | 1.5%
중앙대로 | 14 | 1.5%
지하1층 | 13 | 1.4%
772 | 13 | 1.4%
Other values (280) | 451 | 48.0%

Most occurring characters

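Value | Count | Frequency (%)
(space) | 796 | 15.9%
(not preserved) | 351 | 7.0%
(not preserved) | 299 | 6.0%
1 | 235 | 4.7%
(not preserved) | 185 | 3.7%
, | 156 | 3.1%
) | 152 | 3.0%
( | 152 | 3.0%
(not preserved) | 150 | 3.0%
(not preserved) | 145 | 2.9%
Other values (173) | 2389 | 47.7%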

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 3016 | 60.2%
Space Separator | 796 | 15.9%
Decimal Number | 715 | 14.3%
Other Punctuation | 157 | 3.1%
Close Punctuation | 152 | 3.0%
Open Punctuation | 152 | 3.0%
Dash Punctuation | 15 | 0.3%
Uppercase Letter | 7 | 0.1%

Most frequent character per category

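Other Letter
Value | Count | Frequency (%)
(not preserved) | 351 | 11.6%
(not preserved) | 299 | 9.9%
(not preserved) | 185 | 6.1%
(not preserved) | 150 | 5.0%
(not preserved) | 145 | 4.8%
(not preserved) | 144 | 4.8%
(not preserved) | 144 | 4.8%
(not preserved) | 144 | 4.8%
(not preserved) | 144 | 4.8%
(not preserved) | 114 | 3.8%
Other values (151) | 1196 | 39.7%

Decimal Number
Value | Count | Frequency (%)
1 | 235 | 32.9%
2 | 102 | 14.3%
7 | 75 | 10.5%
6 | 53 | 7.4%
0 | 51 | 7.1%
4 | 49 | 6.9%
5 | 40 | 5.6%
9 | 38 | 5.3%
3 | 36 | 5.0%
8 | 36 | 5.0%

Uppercase Letter
Value | Count | Frequency (%)
B | 2 | 28.6%
F | 1 | 14.3%
J | 1 | 14.3%
S | 1 | 14.3%
Y | 1 | 14.3%
K | 1 | 14.3%

Other Punctuation
Value | Count | Frequency (%)
, | 156 | 99.4%
@ | 1 | 0.6%

Space Separator
Value | Count | Frequency (%)
(space) | 796 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 152 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 152 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 15 | 100.0%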

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 3016 | 60.2%
Common | 1987 | 39.7%
Latin | 7 | 0.1%

Most frequent character per script

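Hangul
Value | Count | Frequency (%)
(not preserved) | 351 | 11.6%
(not preserved) | 299 | 9.9%
(not preserved) | 185 | 6.1%
(not preserved) | 150 | 5.0%
(not preserved) | 145 | 4.8%
(not preserved) | 144 | 4.8%
(not preserved) | 144 | 4.8%
(not preserved) | 144 | 4.8%
(not preserved) | 144 | 4.8%
(not preserved) | 114 | 3.8%
Other values (151) | 1196 | 39.7%

Common
Value | Count | Frequency (%)
(space) | 796 | 40.1%
1 | 235 | 11.8%
, | 156 | 7.9%
) | 152 | 7.6%
( | 152 | 7.6%
2 | 102 | 5.1%
7 | 75 | 3.8%
6 | 53 | 2.7%
0 | 51 | 2.6%
4 | 49 | 2.5%
Other values (6) | 166 | 8.4%

Latin
Value | Count | Frequency (%)
B | 2 | 28.6%
F | 1 | 14.3%
J | 1 | 14.3%
S | 1 | 14.3%
Y | 1 | 14.3%
K | 1 | 14.3%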

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 3016 | 60.2%
ASCII | 1994 | 39.8%

Most frequent character per block

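ASCII
Value | Count | Frequency (%)
(space) | 796 | 39.9%
1 | 235 | 11.8%
, | 156 | 7.8%
) | 152 | 7.6%
( | 152 | 7.6%
2 | 102 | 5.1%
7 | 75 | 3.8%
6 | 53 | 2.7%
0 | 51 | 2.6%
4 | 49 | 2.5%
Other values (12) | 173 | 8.7%

Hangul
Value | Count | Frequency (%)
(not preserved) | 351 | 11.6%
(not preserved) | 299 | 9.9%
(not preserved) | 185 | 6.1%
(not preserved) | 150 | 5.0%
(not preserved) | 145 | 4.8%
(not preserved) | 144 | 4.8%
(not preserved) | 144 | 4.8%
(not preserved) | 144 | 4.8%
(not preserved) | 144 | 4.8%
(not preserved) | 114 | 3.8%
Other values (151) | 1196 | 39.7%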

소재지(지번) (location, lot-number address)
Text

Distinct: 132
Distinct (%): 91.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Length

Max length: 47
Median length: 43
Mean length: 25.972222
Min length: 19

Characters and Unicode

Total characters: 3740
Distinct characters: 155
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 125
Unique (%): 86.8%

Sample

1st row: 부산광역시 부산진구 전포동 205-21 (205-25를 205-21로 합병)
2nd row: 부산광역시 부산진구 양정동 406-32
3rd row: 부산광역시 부산진구 양정동 389-13
4th row: 부산광역시 부산진구 개금동 53-3 개금주공상가
5th row: 부산광역시 부산진구 연지동 165-7
Value | Count | Frequency (%)
부산광역시 | 144 | 21.2%
부산진구 | 144 | 21.2%
전포동 | 35 | 5.2%
부전동 | 35 | 5.2%
개금동 | 15 | 2.2%
당감동 | 13 | 1.9%
503-15 | 13 | 1.9%
양정동 | 11 | 1.6%
부암동 | 10 | 1.5%
범천동 | 8 | 1.2%
Other values (203) | 251 | 37.0%

Most occurring characters

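Value | Count | Frequency (%)
(space) | 676 | 18.1%
(not preserved) | 349 | 9.3%
(not preserved) | 298 | 8.0%
1 | 159 | 4.3%
(not preserved) | 156 | 4.2%
(not preserved) | 147 | 3.9%
(not preserved) | 145 | 3.9%
(not preserved) | 145 | 3.9%
(not preserved) | 145 | 3.9%
(not preserved) | 144 | 3.9%
Other values (145) | 1376 | 36.8%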

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2192 | 58.6%
Decimal Number | 699 | 18.7%
Space Separator | 676 | 18.1%
Dash Punctuation | 126 | 3.4%
Open Punctuation | 19 | 0.5%
Close Punctuation | 18 | 0.5%
Other Punctuation | 7 | 0.2%
Uppercase Letter | 3 | 0.1%

Most frequent character per category

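Other Letter
Value | Count | Frequency (%)
(not preserved) | 349 | 15.9%
(not preserved) | 298 | 13.6%
(not preserved) | 156 | 7.1%
(not preserved) | 147 | 6.7%
(not preserved) | 145 | 6.6%
(not preserved) | 145 | 6.6%
(not preserved) | 145 | 6.6%
(not preserved) | 144 | 6.6%
(not preserved) | 75 | 3.4%
(not preserved) | 35 | 1.6%
Other values (126) | 553 | 25.2%

Decimal Number
Value | Count | Frequency (%)
1 | 159 | 22.7%
5 | 100 | 14.3%
2 | 90 | 12.9%
3 | 74 | 10.6%
6 | 59 | 8.4%
0 | 57 | 8.2%
7 | 44 | 6.3%
4 | 44 | 6.3%
8 | 39 | 5.6%
9 | 33 | 4.7%

Uppercase Letter
Value | Count | Frequency (%)
S | 1 | 33.3%
K | 1 | 33.3%
Y | 1 | 33.3%

Other Punctuation
Value | Count | Frequency (%)
, | 5 | 71.4%
@ | 2 | 28.6%

Space Separator
Value | Count | Frequency (%)
(space) | 676 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 126 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 19 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 18 | 100.0%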

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2192 | 58.6%
Common | 1545 | 41.3%
Latin | 3 | 0.1%

Most frequent character per script

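Hangul
Value | Count | Frequency (%)
(not preserved) | 349 | 15.9%
(not preserved) | 298 | 13.6%
(not preserved) | 156 | 7.1%
(not preserved) | 147 | 6.7%
(not preserved) | 145 | 6.6%
(not preserved) | 145 | 6.6%
(not preserved) | 145 | 6.6%
(not preserved) | 144 | 6.6%
(not preserved) | 75 | 3.4%
(not preserved) | 35 | 1.6%
Other values (126) | 553 | 25.2%

Common
Value | Count | Frequency (%)
(space) | 676 | 43.8%
1 | 159 | 10.3%
- | 126 | 8.2%
5 | 100 | 6.5%
2 | 90 | 5.8%
3 | 74 | 4.8%
6 | 59 | 3.8%
0 | 57 | 3.7%
7 | 44 | 2.8%
4 | 44 | 2.8%
Other values (6) | 116 | 7.5%

Latin
Value | Count | Frequency (%)
S | 1 | 33.3%
K | 1 | 33.3%
Y | 1 | 33.3%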

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2192 | 58.6%
ASCII | 1548 | 41.4%

Most frequent character per block

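ASCII
Value | Count | Frequency (%)
(space) | 676 | 43.7%
1 | 159 | 10.3%
- | 126 | 8.1%
5 | 100 | 6.5%
2 | 90 | 5.8%
3 | 74 | 4.8%
6 | 59 | 3.8%
0 | 57 | 3.7%
7 | 44 | 2.8%
4 | 44 | 2.8%
Other values (9) | 119 | 7.7%

Hangul
Value | Count | Frequency (%)
(not preserved) | 349 | 15.9%
(not preserved) | 298 | 13.6%
(not preserved) | 156 | 7.1%
(not preserved) | 147 | 6.7%
(not preserved) | 145 | 6.6%
(not preserved) | 145 | 6.6%
(not preserved) | 145 | 6.6%
(not preserved) | 144 | 6.6%
(not preserved) | 75 | 3.4%
(not preserved) | 35 | 1.6%
Other values (126) | 553 | 25.2%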

소재지전화 (location phone number)
Text

MISSING

Distinct: 53
Distinct (%): 96.4%
Missing: 89
Missing (%): 61.8%
Memory size: 1.3 KiB

Length

Max length: 13
Median length: 12
Mean length: 12.036364
Min length: 11

Characters and Unicode

Total characters: 662
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 51
Unique (%): 92.7%

Sample

1st row: 051-895-9008
2nd row: 051-807-7069
3rd row: 051-816-5394
4th row: 051-891-4598
5th row: 051-895-4379
Value | Count | Frequency (%)
051-894-9006 | 2 | 3.6%
051-807-8290 | 2 | 3.6%
051-541-0651 | 1 | 1.8%
051-610-6375 | 1 | 1.8%
051-895-1116 | 1 | 1.8%
051-891-9821 | 1 | 1.8%
051-810-3963 | 1 | 1.8%
051-643-0122 | 1 | 1.8%
051-583-9535 | 1 | 1.8%
051-908-5321 | 1 | 1.8%
Other values (43) | 43 | 78.2%

Most occurring characters

Value | Count | Frequency (%)
0 | 122 | 18.4%
- | 110 | 16.6%
1 | 96 | 14.5%
5 | 75 | 11.3%
8 | 75 | 11.3%
9 | 46 | 6.9%
6 | 34 | 5.1%
4 | 27 | 4.1%
7 | 27 | 4.1%
2 | 27 | 4.1%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 552 | 83.4%
Dash Punctuation | 110 | 16.6%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 122 | 22.1%
1 | 96 | 17.4%
5 | 75 | 13.6%
8 | 75 | 13.6%
9 | 46 | 8.3%
6 | 34 | 6.2%
4 | 27 | 4.9%
7 | 27 | 4.9%
2 | 27 | 4.9%
3 | 23 | 4.2%

Dash Punctuation
Value | Count | Frequency (%)
- | 110 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 662 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
0 | 122 | 18.4%
- | 110 | 16.6%
1 | 96 | 14.5%
5 | 75 | 11.3%
8 | 75 | 11.3%
9 | 46 | 6.9%
6 | 34 | 5.1%
4 | 27 | 4.1%
7 | 27 | 4.1%
2 | 27 | 4.1%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 662 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 122 | 18.4%
- | 110 | 16.6%
1 | 96 | 14.5%
5 | 75 | 11.3%
8 | 75 | 11.3%
9 | 46 | 6.9%
6 | 34 | 5.1%
4 | 27 | 4.1%
7 | 27 | 4.1%
2 | 27 | 4.1%

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
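
Nullity correlation can be understood as the Pearson correlation between per-column presence indicators (1 where a value is present, 0 where it is missing). The sketch below computes it for two columns using the presence pattern of the first ten sample rows of this report. The function name is illustrative, the road-address strings are placeholders, and this is not the plotting library's own implementation.

```python
from math import sqrt
from typing import Optional, Sequence


def nullity_correlation(col_a: Sequence[Optional[str]],
                        col_b: Sequence[Optional[str]]) -> Optional[float]:
    """Pearson correlation of the presence indicators of two columns.

    Returns None when either column is fully present or fully missing,
    because the correlation is undefined for a constant indicator.
    """
    if len(col_a) != len(col_b) or not col_a:
        raise ValueError("columns must be non-empty and of equal length")
    a = [0.0 if v is None else 1.0 for v in col_a]
    b = [0.0 if v is None else 1.0 for v in col_b]
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return None
    return cov / sqrt(var_a * var_b)


# Presence pattern of rows 0-9: phone is missing in rows 0, 1, 2 and 8;
# the road-name address is missing only in row 9.
phone = [None, None, None, "051-895-9008", "051-807-7069",
         "051-816-5394", "051-891-4598", "051-895-4379", None, "051-807-8290"]
road = ["present"] * 9 + [None]  # placeholder strings for present addresses

print(round(nullity_correlation(phone, road), 3))  # -0.272
```

A value near 1 means two columns tend to be present or missing together, and a value near -1 means one tends to be present when the other is missing.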

Sample

(row) | 업종명 | 업소명 | 소재지(도로명) | 소재지(지번) | 소재지전화
0 | 제과점영업 | 빵굽는마을 | 부산광역시 부산진구 동성로 79-1, 1층 (전포동) | 부산광역시 부산진구 전포동 205-21 (205-25를 205-21로 합병) | <NA>
1 | 제과점영업 | 스완베이커리 | 부산광역시 부산진구 동평로 407 (양정동) | 부산광역시 부산진구 양정동 406-32 | <NA>
2 | 제과점영업 | 호수제과점 | 부산광역시 부산진구 거제대로60번길 34 (양정동) | 부산광역시 부산진구 양정동 389-13 | <NA>
3 | 제과점영업 | 파리바게뜨 개금백양점 | 부산광역시 부산진구 백양관문로 3, 개금주공상가 1층 111,113호 (개금동) | 부산광역시 부산진구 개금동 53-3 개금주공상가 | 051-895-9008
4 | 제과점영업 | 코코로브레드 | 부산광역시 부산진구 새싹로 158 (연지동) | 부산광역시 부산진구 연지동 165-7 | 051-807-7069
5 | 제과점영업 | 밀베이커리 | 부산광역시 부산진구 중앙대로755번길 7 (부전동) | 부산광역시 부산진구 부전동 266-12 | 051-816-5394
6 | 제과점영업 | 파리휘셀베이커리 | 부산광역시 부산진구 개금본동로 22 (개금동) | 부산광역시 부산진구 개금동 25-6 | 051-891-4598
7 | 제과점영업 | 한태현베이커리 | 부산광역시 부산진구 당감서로 15 (당감동) | 부산광역시 부산진구 당감동 500-33 | 051-895-4379
8 | 제과점영업 | 왕비제과 | 부산광역시 부산진구 개금본동로 42 (개금동) | 부산광역시 부산진구 개금동 19-6 | <NA>
9 | 제과점영업 | 빠리바게트부암화승점 | <NA> | 부산광역시 부산진구 부암동 500 화승@상가21동 111호, 112호 | 051-807-8290

(row) | 업종명 | 업소명 | 소재지(도로명) | 소재지(지번) | 소재지전화
134 | 제과점영업 | 밀키샵 전포점 | 부산광역시 부산진구 서전로68번길 108, 1층 (전포동) | 부산광역시 부산진구 전포동 355-4 | <NA>
135 | 제과점영업 | (주)남포당 | 부산광역시 부산진구 가야대로 772, 지하1층 (부전동) | 부산광역시 부산진구 부전동 503-15 | <NA>
136 | 제과점영업 | 플러피 | 부산광역시 부산진구 서전로37번길 26, 1층 (전포동) | 부산광역시 부산진구 전포동 664-7 | <NA>
137 | 제과점영업 | 바이스벌사 디저트 | 부산광역시 부산진구 전포대로246번길 13-5, 102호 (전포동, 마티에르) | 부산광역시 부산진구 전포동 197-16 마티에르 | <NA>
138 | 제과점영업 | 힌터그룬트 | 부산광역시 부산진구 동성로15번길 37, 1층 (전포동) | 부산광역시 부산진구 전포동 363-56 | <NA>
139 | 제과점영업 | 뚜레쥬르 부산개금백병원 | 부산광역시 부산진구 복지로 75, 인제대학교 부산백병원 F동 1층 (개금동) | 부산광역시 부산진구 개금동 633-165 인제대학교 부산백병원 | <NA>
140 | 제과점영업 | 베희글 | 부산광역시 부산진구 서전로38번길 35, 1층 (전포동) | 부산광역시 부산진구 전포동 680-3 | <NA>
141 | 제과점영업 | 빅싸게마트 | 부산광역시 부산진구 동천로 112, 1층 (전포동) | 부산광역시 부산진구 전포동 660-5 | <NA>
142 | 제과점영업 | 노티드 롯데부산본점 | 부산광역시 부산진구 가야대로 772, 롯데백화점부산본점 지하2층 (부전동) | 부산광역시 부산진구 부전동 503-15 롯데백화점부산본점 | <NA>
143 | 제과점영업 | 파밀리아 제과점 | 부산광역시 부산진구 가야대로 772, 지하2층 (부전동) | 부산광역시 부산진구 부전동 503-15 | <NA>