Overview

Dataset statistics

Number of variables5
Number of observations232
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.2 KiB
Average record size in memory40.6 B

Variable types

Categorical1
DateTime1
Text3

Dataset

Description부산광역시부산진구숙박업현황_20221017
Author부산광역시 부산진구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15025544

Alerts

업종명 is highly imbalanced (67.1%)Imbalance

Reproduction

Analysis started2023-12-10 16:54:56.440243
Analysis finished2023-12-10 16:54:56.952767
Duration0.51 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
숙박업(일반)
218 
숙박업(생활)
 
14

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row숙박업(일반)
2nd row숙박업(일반)
3rd row숙박업(일반)
4th row숙박업(일반)
5th row숙박업(일반)

Common Values

ValueCountFrequency (%)
숙박업(일반) 218
94.0%
숙박업(생활) 14
 
6.0%

Length

2023-12-11T01:54:57.031527image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:54:57.142911image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
숙박업(일반 218
94.0%
숙박업(생활 14
 
6.0%
Distinct222
Distinct (%)95.7%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
Minimum1965-08-13 00:00:00
Maximum2022-06-29 00:00:00
2023-12-11T01:54:57.265918image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:54:57.466313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct226
Distinct (%)97.4%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2023-12-11T01:54:57.898353image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length25
Mean length6.25
Min length2

Characters and Unicode

Total characters1450
Distinct characters271
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique220 ?
Unique (%)94.8%

Sample

1st row원진
2nd row롯데빌
3rd row윙크장
4th row동광
5th row부산
ValueCountFrequency (%)
호텔 22
 
6.7%
모텔 9
 
2.7%
hotel 7
 
2.1%
서면점 4
 
1.2%
부산 4
 
1.2%
3
 
0.9%
스테이 3
 
0.9%
서면 3
 
0.9%
이화장 2
 
0.6%
부산장 2
 
0.6%
Other values (261) 269
82.0%
2023-12-11T01:54:58.565210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
129
 
8.9%
96
 
6.6%
79
 
5.4%
58
 
4.0%
38
 
2.6%
37
 
2.6%
34
 
2.3%
( 26
 
1.8%
) 26
 
1.8%
T 21
 
1.4%
Other values (261) 906
62.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1067
73.6%
Uppercase Letter 149
 
10.3%
Space Separator 96
 
6.6%
Lowercase Letter 44
 
3.0%
Decimal Number 33
 
2.3%
Open Punctuation 26
 
1.8%
Close Punctuation 26
 
1.8%
Other Punctuation 6
 
0.4%
Dash Punctuation 3
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
129
 
12.1%
79
 
7.4%
58
 
5.4%
38
 
3.6%
37
 
3.5%
34
 
3.2%
20
 
1.9%
19
 
1.8%
17
 
1.6%
17
 
1.6%
Other values (209) 619
58.0%
Uppercase Letter
ValueCountFrequency (%)
T 21
14.1%
E 19
12.8%
H 18
12.1%
O 17
11.4%
L 17
11.4%
S 10
 
6.7%
N 6
 
4.0%
J 5
 
3.4%
K 5
 
3.4%
B 4
 
2.7%
Other values (12) 27
18.1%
Lowercase Letter
ValueCountFrequency (%)
o 9
20.5%
e 7
15.9%
n 6
13.6%
t 4
9.1%
y 4
9.1%
a 3
 
6.8%
s 2
 
4.5%
m 2
 
4.5%
u 1
 
2.3%
q 1
 
2.3%
Other values (5) 5
11.4%
Decimal Number
ValueCountFrequency (%)
2 11
33.3%
5 7
21.2%
9 4
 
12.1%
7 4
 
12.1%
1 4
 
12.1%
6 1
 
3.0%
3 1
 
3.0%
4 1
 
3.0%
Other Punctuation
ValueCountFrequency (%)
. 4
66.7%
& 1
 
16.7%
, 1
 
16.7%
Space Separator
ValueCountFrequency (%)
96
100.0%
Open Punctuation
ValueCountFrequency (%)
( 26
100.0%
Close Punctuation
ValueCountFrequency (%)
) 26
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1067
73.6%
Latin 193
 
13.3%
Common 190
 
13.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
129
 
12.1%
79
 
7.4%
58
 
5.4%
38
 
3.6%
37
 
3.5%
34
 
3.2%
20
 
1.9%
19
 
1.8%
17
 
1.6%
17
 
1.6%
Other values (209) 619
58.0%
Latin
ValueCountFrequency (%)
T 21
 
10.9%
E 19
 
9.8%
H 18
 
9.3%
O 17
 
8.8%
L 17
 
8.8%
S 10
 
5.2%
o 9
 
4.7%
e 7
 
3.6%
N 6
 
3.1%
n 6
 
3.1%
Other values (27) 63
32.6%
Common
ValueCountFrequency (%)
96
50.5%
( 26
 
13.7%
) 26
 
13.7%
2 11
 
5.8%
5 7
 
3.7%
9 4
 
2.1%
. 4
 
2.1%
7 4
 
2.1%
1 4
 
2.1%
- 3
 
1.6%
Other values (5) 5
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1067
73.6%
ASCII 383
 
26.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
129
 
12.1%
79
 
7.4%
58
 
5.4%
38
 
3.6%
37
 
3.5%
34
 
3.2%
20
 
1.9%
19
 
1.8%
17
 
1.6%
17
 
1.6%
Other values (209) 619
58.0%
ASCII
ValueCountFrequency (%)
96
25.1%
( 26
 
6.8%
) 26
 
6.8%
T 21
 
5.5%
E 19
 
5.0%
H 18
 
4.7%
O 17
 
4.4%
L 17
 
4.4%
2 11
 
2.9%
S 10
 
2.6%
Other values (42) 122
31.9%
Distinct231
Distinct (%)99.6%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2023-12-11T01:54:59.010665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length39
Mean length28.711207
Min length22

Characters and Unicode

Total characters6661
Distinct characters102
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique230 ?
Unique (%)99.1%

Sample

1st row부산광역시 부산진구 부전로 169-6 (부전동)
2nd row부산광역시 부산진구 서전로 15-1 (부전동)
3rd row부산광역시 부산진구 중앙대로970번길 10 (양정동)
4th row부산광역시 부산진구 새싹로52번길 32 (부전동)
5th row부산광역시 부산진구 중앙번영로 4-1 (범천동)
ValueCountFrequency (%)
부산광역시 232
19.3%
부산진구 232
19.3%
부전동 144
 
12.0%
범천동 28
 
2.3%
초읍동 20
 
1.7%
양정동 14
 
1.2%
부전로 14
 
1.2%
중앙대로691번가길 13
 
1.1%
부전로20번길 13
 
1.1%
서면로 10
 
0.8%
Other values (272) 480
40.0%
2023-12-11T01:54:59.698072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
968
 
14.5%
649
 
9.7%
465
 
7.0%
254
 
3.8%
( 233
 
3.5%
233
 
3.5%
) 233
 
3.5%
232
 
3.5%
232
 
3.5%
232
 
3.5%
Other values (92) 2930
44.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4054
60.9%
Decimal Number 1030
 
15.5%
Space Separator 968
 
14.5%
Open Punctuation 233
 
3.5%
Close Punctuation 233
 
3.5%
Dash Punctuation 87
 
1.3%
Other Punctuation 35
 
0.5%
Math Symbol 12
 
0.2%
Uppercase Letter 9
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
649
16.0%
465
11.5%
254
 
6.3%
233
 
5.7%
232
 
5.7%
232
 
5.7%
232
 
5.7%
232
 
5.7%
232
 
5.7%
197
 
4.9%
Other values (67) 1096
27.0%
Decimal Number
ValueCountFrequency (%)
1 207
20.1%
2 123
11.9%
3 108
10.5%
5 98
9.5%
7 93
9.0%
0 93
9.0%
6 90
8.7%
4 80
 
7.8%
8 73
 
7.1%
9 65
 
6.3%
Uppercase Letter
ValueCountFrequency (%)
M 2
22.2%
J 2
22.2%
G 1
11.1%
T 1
11.1%
K 1
11.1%
B 1
11.1%
A 1
11.1%
Other Punctuation
ValueCountFrequency (%)
, 32
91.4%
. 2
 
5.7%
& 1
 
2.9%
Space Separator
ValueCountFrequency (%)
968
100.0%
Open Punctuation
ValueCountFrequency (%)
( 233
100.0%
Close Punctuation
ValueCountFrequency (%)
) 233
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 87
100.0%
Math Symbol
ValueCountFrequency (%)
~ 12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4054
60.9%
Common 2598
39.0%
Latin 9
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
649
16.0%
465
11.5%
254
 
6.3%
233
 
5.7%
232
 
5.7%
232
 
5.7%
232
 
5.7%
232
 
5.7%
232
 
5.7%
197
 
4.9%
Other values (67) 1096
27.0%
Common
ValueCountFrequency (%)
968
37.3%
( 233
 
9.0%
) 233
 
9.0%
1 207
 
8.0%
2 123
 
4.7%
3 108
 
4.2%
5 98
 
3.8%
7 93
 
3.6%
0 93
 
3.6%
6 90
 
3.5%
Other values (8) 352
 
13.5%
Latin
ValueCountFrequency (%)
M 2
22.2%
J 2
22.2%
G 1
11.1%
T 1
11.1%
K 1
11.1%
B 1
11.1%
A 1
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4054
60.9%
ASCII 2607
39.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
968
37.1%
( 233
 
8.9%
) 233
 
8.9%
1 207
 
7.9%
2 123
 
4.7%
3 108
 
4.1%
5 98
 
3.8%
7 93
 
3.6%
0 93
 
3.6%
6 90
 
3.5%
Other values (15) 361
 
13.8%
Hangul
ValueCountFrequency (%)
649
16.0%
465
11.5%
254
 
6.3%
233
 
5.7%
232
 
5.7%
232
 
5.7%
232
 
5.7%
232
 
5.7%
232
 
5.7%
197
 
4.9%
Other values (67) 1096
27.0%
Distinct229
Distinct (%)98.7%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2023-12-11T01:55:00.183539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length38
Median length22
Mean length22.357759
Min length18

Characters and Unicode

Total characters5187
Distinct characters79
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique226 ?
Unique (%)97.4%

Sample

1st row부산광역시 부산진구 부전동 346-3
2nd row부산광역시 부산진구 부전동 142-5
3rd row부산광역시 부산진구 양정동 270-1
4th row부산광역시 부산진구 부전동 392-32
5th row부산광역시 부산진구 범천동 847-33
ValueCountFrequency (%)
부산광역시 232
24.1%
부산진구 232
24.1%
부전동 143
14.8%
범천동 28
 
2.9%
초읍동 20
 
2.1%
양정동 14
 
1.5%
당감동 8
 
0.8%
개금동 6
 
0.6%
전포동 6
 
0.6%
4
 
0.4%
Other values (261) 270
28.0%
2023-12-11T01:55:00.939187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
961
18.5%
614
11.8%
465
 
9.0%
234
 
4.5%
233
 
4.5%
232
 
4.5%
232
 
4.5%
232
 
4.5%
232
 
4.5%
- 230
 
4.4%
Other values (69) 1522
29.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2853
55.0%
Decimal Number 1120
 
21.6%
Space Separator 961
 
18.5%
Dash Punctuation 230
 
4.4%
Uppercase Letter 8
 
0.2%
Math Symbol 5
 
0.1%
Other Punctuation 4
 
0.1%
Close Punctuation 3
 
0.1%
Open Punctuation 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
614
21.5%
465
16.3%
234
 
8.2%
233
 
8.2%
232
 
8.1%
232
 
8.1%
232
 
8.1%
232
 
8.1%
150
 
5.3%
28
 
1.0%
Other values (46) 201
 
7.0%
Decimal Number
ValueCountFrequency (%)
1 190
17.0%
2 187
16.7%
5 147
13.1%
3 117
10.4%
4 109
9.7%
8 85
7.6%
6 76
 
6.8%
7 75
 
6.7%
9 72
 
6.4%
0 62
 
5.5%
Uppercase Letter
ValueCountFrequency (%)
J 2
25.0%
M 2
25.0%
A 1
12.5%
G 1
12.5%
T 1
12.5%
K 1
12.5%
Other Punctuation
ValueCountFrequency (%)
, 3
75.0%
& 1
 
25.0%
Space Separator
ValueCountFrequency (%)
961
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 230
100.0%
Math Symbol
ValueCountFrequency (%)
~ 5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2853
55.0%
Common 2326
44.8%
Latin 8
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
614
21.5%
465
16.3%
234
 
8.2%
233
 
8.2%
232
 
8.1%
232
 
8.1%
232
 
8.1%
232
 
8.1%
150
 
5.3%
28
 
1.0%
Other values (46) 201
 
7.0%
Common
ValueCountFrequency (%)
961
41.3%
- 230
 
9.9%
1 190
 
8.2%
2 187
 
8.0%
5 147
 
6.3%
3 117
 
5.0%
4 109
 
4.7%
8 85
 
3.7%
6 76
 
3.3%
7 75
 
3.2%
Other values (7) 149
 
6.4%
Latin
ValueCountFrequency (%)
J 2
25.0%
M 2
25.0%
A 1
12.5%
G 1
12.5%
T 1
12.5%
K 1
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2853
55.0%
ASCII 2334
45.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
961
41.2%
- 230
 
9.9%
1 190
 
8.1%
2 187
 
8.0%
5 147
 
6.3%
3 117
 
5.0%
4 109
 
4.7%
8 85
 
3.6%
6 76
 
3.3%
7 75
 
3.2%
Other values (13) 157
 
6.7%
Hangul
ValueCountFrequency (%)
614
21.5%
465
16.3%
234
 
8.2%
233
 
8.2%
232
 
8.1%
232
 
8.1%
232
 
8.1%
232
 
8.1%
150
 
5.3%
28
 
1.0%
Other values (46) 201
 
7.0%

Missing values

2023-12-11T01:54:56.766804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:54:56.908464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종명신고일자업소명영업소 주소(도로명)영업소 주소(지번)
0숙박업(일반)1965-08-13원진부산광역시 부산진구 부전로 169-6 (부전동)부산광역시 부산진구 부전동 346-3
1숙박업(일반)1968-12-19롯데빌부산광역시 부산진구 서전로 15-1 (부전동)부산광역시 부산진구 부전동 142-5
2숙박업(일반)1968-11-18윙크장부산광역시 부산진구 중앙대로970번길 10 (양정동)부산광역시 부산진구 양정동 270-1
3숙박업(일반)1970-12-21동광부산광역시 부산진구 새싹로52번길 32 (부전동)부산광역시 부산진구 부전동 392-32
4숙박업(일반)1970-01-10부산부산광역시 부산진구 중앙번영로 4-1 (범천동)부산광역시 부산진구 범천동 847-33
5숙박업(일반)1971-12-18천애부산광역시 부산진구 범일로142번길 47-5 (범천동)부산광역시 부산진구 범천동 841-166
6숙박업(일반)1972-12-29모텔QT부산광역시 부산진구 서전로10번길 45-12 (부전동)부산광역시 부산진구 부전동 168-160
7숙박업(일반)1972-05-01온정부산광역시 부산진구 양지로5번길 30 (양정동)부산광역시 부산진구 양정동 354-8
8숙박업(일반)1973-12-31뉴부산장여관부산광역시 부산진구 부전로96번길 34-8 (부전동)부산광역시 부산진구 부전동 477-22
9숙박업(일반)1973-05-25핑크장부산광역시 부산진구 서면문화로5번길 38 (부전동)부산광역시 부산진구 부전동 477-10
업종명신고일자업소명영업소 주소(도로명)영업소 주소(지번)
222숙박업(생활)1986-03-26산들 수피아부산광역시 부산진구 범일로131번길 41-5 (범천동)부산광역시 부산진구 범천동 735-33
223숙박업(생활)2012-12-18피노키오부산광역시 부산진구 가야대로703번가길 11 (당감동)부산광역시 부산진구 당감동 955
224숙박업(생활)2016-07-05주안부산광역시 부산진구 범일로131번길 61, 2~4층 (범천동)부산광역시 부산진구 범천동 881-60
225숙박업(생활)2016-11-09부산 트레블 타이부산광역시 부산진구 서면로68번길 52 (부전동)부산광역시 부산진구 부전동 242-13
226숙박업(생활)2017-07-19엘라부산광역시 부산진구 전포대로223번길 10-1, 2층 (전포동)부산광역시 부산진구 전포동 671-11
227숙박업(생활)2017-11-02미호재 서면점부산광역시 부산진구 서전로10번길 22-7 (부전동)부산광역시 부산진구 부전동 156-50
228숙박업(생활)2020-10-21어반스테이 서면부산광역시 부산진구 중앙대로 752, 서면역 트루엘 센트럴 (부전동)부산광역시 부산진구 부전동 129-1 서면역 트루엘 센트럴
229숙박업(생활)2020-12-17스페이스앤무드부산광역시 부산진구 새싹로 22-1, 라이프빌리지 11층12층 (부전동)부산광역시 부산진구 부전동 396-14 라이프빌리지
230숙박업(생활)2021-08-04포르투나 더 원부산광역시 부산진구 복지로13번길 10 (개금동)부산광역시 부산진구 개금동 523-38
231숙박업(생활)2022-04-12아이, 집부산광역시 부산진구 서면문화로 23, MJ빌딩 13층 (부전동)부산광역시 부산진구 부전동 474-2 MJ빌딩