Overview

Dataset statistics

Number of variables4
Number of observations177
Missing cells13
Missing cells (%)1.8%
Duplicate rows3
Duplicate rows (%)1.7%
Total size in memory5.7 KiB
Average record size in memory32.7 B

Variable types

Categorical1
Text3

Dataset

Description부산광역시_동구_숙박업현황_20210125
Author부산광역시 동구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15028641

Alerts

Dataset has 3 (1.7%) duplicate rowsDuplicates
업종명 is highly imbalanced (62.1%)Imbalance
소재지전화 has 13 (7.3%) missing valuesMissing

Reproduction

Analysis started2023-12-10 16:48:19.406262
Analysis finished2023-12-10 16:48:19.783020
Duration0.38 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

IMBALANCE 

Distinct2
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
숙박업(일반)
164 
숙박업(생활)
 
13

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row숙박업(일반)
2nd row숙박업(일반)
3rd row숙박업(일반)
4th row숙박업(일반)
5th row숙박업(일반)

Common Values

ValueCountFrequency (%)
숙박업(일반) 164
92.7%
숙박업(생활) 13
 
7.3%

Length

2023-12-11T01:48:19.851429image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:48:19.956120image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
숙박업(일반 164
92.7%
숙박업(생활 13
 
7.3%
Distinct174
Distinct (%)98.3%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2023-12-11T01:48:20.271369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length20
Mean length5.6892655
Min length2

Characters and Unicode

Total characters1007
Distinct characters224
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique171 ?
Unique (%)96.6%

Sample

1st row모찌호스텔(Mozzi hostel)
2nd row부산숙박닷컴 게스트하우스
3rd row라마다앙코르부산역호텔
4th row더웨이호텔
5th row브라운도트호텔 범일점
ValueCountFrequency (%)
부산역 6
 
2.8%
게스트하우스 5
 
2.4%
레지던스 4
 
1.9%
모텔 3
 
1.4%
하운드호텔 2
 
0.9%
탑모텔 2
 
0.9%
2
 
0.9%
오름 2
 
0.9%
호텔 2
 
0.9%
워라밸 2
 
0.9%
Other values (180) 181
85.8%
2023-12-11T01:48:20.750728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
88
 
8.7%
52
 
5.2%
50
 
5.0%
40
 
4.0%
36
 
3.6%
35
 
3.5%
34
 
3.4%
26
 
2.6%
21
 
2.1%
20
 
2.0%
Other values (214) 605
60.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 897
89.1%
Space Separator 34
 
3.4%
Uppercase Letter 26
 
2.6%
Lowercase Letter 14
 
1.4%
Open Punctuation 11
 
1.1%
Close Punctuation 11
 
1.1%
Decimal Number 11
 
1.1%
Other Punctuation 2
 
0.2%
Letter Number 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
88
 
9.8%
52
 
5.8%
50
 
5.6%
40
 
4.5%
36
 
4.0%
35
 
3.9%
26
 
2.9%
21
 
2.3%
20
 
2.2%
18
 
2.0%
Other values (181) 511
57.0%
Uppercase Letter
ValueCountFrequency (%)
C 4
15.4%
O 3
11.5%
E 3
11.5%
H 3
11.5%
T 3
11.5%
L 2
7.7%
M 1
 
3.8%
I 1
 
3.8%
B 1
 
3.8%
W 1
 
3.8%
Other values (4) 4
15.4%
Lowercase Letter
ValueCountFrequency (%)
o 4
28.6%
n 2
14.3%
z 2
14.3%
h 1
 
7.1%
l 1
 
7.1%
e 1
 
7.1%
t 1
 
7.1%
s 1
 
7.1%
i 1
 
7.1%
Decimal Number
ValueCountFrequency (%)
6 4
36.4%
3 2
18.2%
9 2
18.2%
2 2
18.2%
7 1
 
9.1%
Space Separator
ValueCountFrequency (%)
34
100.0%
Open Punctuation
ValueCountFrequency (%)
( 11
100.0%
Close Punctuation
ValueCountFrequency (%)
) 11
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 897
89.1%
Common 69
 
6.9%
Latin 41
 
4.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
88
 
9.8%
52
 
5.8%
50
 
5.6%
40
 
4.5%
36
 
4.0%
35
 
3.9%
26
 
2.9%
21
 
2.3%
20
 
2.2%
18
 
2.0%
Other values (181) 511
57.0%
Latin
ValueCountFrequency (%)
o 4
 
9.8%
C 4
 
9.8%
O 3
 
7.3%
E 3
 
7.3%
H 3
 
7.3%
T 3
 
7.3%
n 2
 
4.9%
z 2
 
4.9%
L 2
 
4.9%
h 1
 
2.4%
Other values (14) 14
34.1%
Common
ValueCountFrequency (%)
34
49.3%
( 11
 
15.9%
) 11
 
15.9%
6 4
 
5.8%
3 2
 
2.9%
9 2
 
2.9%
2 2
 
2.9%
. 2
 
2.9%
7 1
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 897
89.1%
ASCII 109
 
10.8%
Number Forms 1
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
88
 
9.8%
52
 
5.8%
50
 
5.6%
40
 
4.5%
36
 
4.0%
35
 
3.9%
26
 
2.9%
21
 
2.3%
20
 
2.2%
18
 
2.0%
Other values (181) 511
57.0%
ASCII
ValueCountFrequency (%)
34
31.2%
( 11
 
10.1%
) 11
 
10.1%
6 4
 
3.7%
o 4
 
3.7%
C 4
 
3.7%
O 3
 
2.8%
E 3
 
2.8%
H 3
 
2.8%
T 3
 
2.8%
Other values (22) 29
26.6%
Number Forms
ValueCountFrequency (%)
1
100.0%
Distinct174
Distinct (%)98.3%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2023-12-11T01:48:21.030922image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length36
Mean length27.124294
Min length20

Characters and Unicode

Total characters4801
Distinct characters66
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique171 ?
Unique (%)96.6%

Sample

1st row부산광역시 동구 중앙대로196번길 16-12, 5층 (초량동)
2nd row부산광역시 동구 초량중로 60 (초량동)
3rd row부산광역시 동구 중앙대로196번길 10, 부산역라마다앙코르호텔 (초량동)
4th row부산광역시 동구 중앙대로209번길 12 (초량동)
5th row부산광역시 동구 중앙대로 528 (범일동)
ValueCountFrequency (%)
부산광역시 177
19.6%
동구 177
19.6%
초량동 109
 
12.1%
범일동 50
 
5.5%
수정동 13
 
1.4%
중앙대로196번길 13
 
1.4%
초량로13번길 10
 
1.1%
대영로243번길 10
 
1.1%
7 10
 
1.1%
중앙대로 10
 
1.1%
Other values (199) 325
36.0%
2023-12-11T01:48:21.519737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
727
 
15.1%
355
 
7.4%
1 200
 
4.2%
184
 
3.8%
178
 
3.7%
178
 
3.7%
) 177
 
3.7%
177
 
3.7%
( 177
 
3.7%
177
 
3.7%
Other values (56) 2271
47.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2742
57.1%
Decimal Number 845
 
17.6%
Space Separator 727
 
15.1%
Close Punctuation 177
 
3.7%
Open Punctuation 177
 
3.7%
Dash Punctuation 89
 
1.9%
Other Punctuation 36
 
0.7%
Math Symbol 8
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
355
12.9%
184
 
6.7%
178
 
6.5%
178
 
6.5%
177
 
6.5%
177
 
6.5%
177
 
6.5%
170
 
6.2%
143
 
5.2%
140
 
5.1%
Other values (40) 863
31.5%
Decimal Number
ValueCountFrequency (%)
1 200
23.7%
2 142
16.8%
3 102
12.1%
9 75
 
8.9%
4 66
 
7.8%
6 61
 
7.2%
7 57
 
6.7%
0 56
 
6.6%
5 43
 
5.1%
8 43
 
5.1%
Space Separator
ValueCountFrequency (%)
727
100.0%
Close Punctuation
ValueCountFrequency (%)
) 177
100.0%
Open Punctuation
ValueCountFrequency (%)
( 177
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 89
100.0%
Other Punctuation
ValueCountFrequency (%)
, 36
100.0%
Math Symbol
ValueCountFrequency (%)
~ 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2742
57.1%
Common 2059
42.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
355
12.9%
184
 
6.7%
178
 
6.5%
178
 
6.5%
177
 
6.5%
177
 
6.5%
177
 
6.5%
170
 
6.2%
143
 
5.2%
140
 
5.1%
Other values (40) 863
31.5%
Common
ValueCountFrequency (%)
727
35.3%
1 200
 
9.7%
) 177
 
8.6%
( 177
 
8.6%
2 142
 
6.9%
3 102
 
5.0%
- 89
 
4.3%
9 75
 
3.6%
4 66
 
3.2%
6 61
 
3.0%
Other values (6) 243
 
11.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2742
57.1%
ASCII 2059
42.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
727
35.3%
1 200
 
9.7%
) 177
 
8.6%
( 177
 
8.6%
2 142
 
6.9%
3 102
 
5.0%
- 89
 
4.3%
9 75
 
3.6%
4 66
 
3.2%
6 61
 
3.0%
Other values (6) 243
 
11.8%
Hangul
ValueCountFrequency (%)
355
12.9%
184
 
6.7%
178
 
6.5%
178
 
6.5%
177
 
6.5%
177
 
6.5%
177
 
6.5%
170
 
6.2%
143
 
5.2%
140
 
5.1%
Other values (40) 863
31.5%

소재지전화
Text

MISSING 

Distinct162
Distinct (%)98.8%
Missing13
Missing (%)7.3%
Memory size1.5 KiB
2023-12-11T01:48:21.811656image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.018293
Min length12

Characters and Unicode

Total characters1971
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique160 ?
Unique (%)97.6%

Sample

1st row070-7724-0070
2nd row070-4651-4112
3rd row051-922-0000
4th row051-852-3600
5th row051-791-0770
ValueCountFrequency (%)
051-468-1537 2
 
1.2%
051-463-1555 2
 
1.2%
051-463-1731 1
 
0.6%
051-467-0977 1
 
0.6%
051-467-0541 1
 
0.6%
051-467-4277 1
 
0.6%
051-467-3446 1
 
0.6%
051-467-3195 1
 
0.6%
051-467-2338 1
 
0.6%
051-467-2313 1
 
0.6%
Other values (152) 152
92.7%
2023-12-11T01:48:22.222743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 328
16.6%
0 264
13.4%
5 261
13.2%
1 254
12.9%
4 217
11.0%
6 215
10.9%
3 106
 
5.4%
7 104
 
5.3%
8 91
 
4.6%
2 87
 
4.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1643
83.4%
Dash Punctuation 328
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 264
16.1%
5 261
15.9%
1 254
15.5%
4 217
13.2%
6 215
13.1%
3 106
6.5%
7 104
 
6.3%
8 91
 
5.5%
2 87
 
5.3%
9 44
 
2.7%
Dash Punctuation
ValueCountFrequency (%)
- 328
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1971
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 328
16.6%
0 264
13.4%
5 261
13.2%
1 254
12.9%
4 217
11.0%
6 215
10.9%
3 106
 
5.4%
7 104
 
5.3%
8 91
 
4.6%
2 87
 
4.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1971
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 328
16.6%
0 264
13.4%
5 261
13.2%
1 254
12.9%
4 217
11.0%
6 215
10.9%
3 106
 
5.4%
7 104
 
5.3%
8 91
 
4.6%
2 87
 
4.4%

Missing values

2023-12-11T01:48:19.647246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:48:19.740207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종명업소명업소소재지(도로명)소재지전화
0숙박업(일반)모찌호스텔(Mozzi hostel)부산광역시 동구 중앙대로196번길 16-12, 5층 (초량동)070-7724-0070
1숙박업(일반)부산숙박닷컴 게스트하우스부산광역시 동구 초량중로 60 (초량동)070-4651-4112
2숙박업(일반)라마다앙코르부산역호텔부산광역시 동구 중앙대로196번길 10, 부산역라마다앙코르호텔 (초량동)051-922-0000
3숙박업(일반)더웨이호텔부산광역시 동구 중앙대로209번길 12 (초량동)051-852-3600
4숙박업(일반)브라운도트호텔 범일점부산광역시 동구 중앙대로 528 (범일동)051-791-0770
5숙박업(일반)하운드호텔 범일부산광역시 동구 조방로34번길 5 (범일동)051-647-6829
6숙박업(일반)CC모텔부산광역시 동구 자성공원로 9-1 (범일동)051-645-6767
7숙박업(일반)W모텔부산광역시 동구 조방로34번길 3 (범일동)051-645-6717
8숙박업(일반)지앤지모텔부산광역시 동구 조방로16번길 9 (범일동)051-633-8008
9숙박업(일반)호텔루이스부산광역시 동구 조방로38번길 8-1 (범일동)051-633-7722
업종명업소명업소소재지(도로명)소재지전화
167숙박업(생활)워라밸 게스트하우스부산광역시 동구 초량중로 11, 2층 (초량동)051-463-1555
168숙박업(생활)파밀리에게스트하우스부산광역시 동구 중앙대로214번길 3-4, 4,5층 (초량동)051-461-0080
169숙박업(생활)부산역 오름 레지던스부산광역시 동구 중앙대로180번길 16-8, 지하1층일부,지상1층일부,2~20층 (초량동)<NA>
170숙박업(생활)민트 파라다이스부산광역시 동구 대영로239번길 20 (초량동)<NA>
171숙박업(생활)영광숙박부산광역시 동구 범곡로28번길 9 (범일동)051-644-3255
172숙박업(생활)리젠시빌부산광역시 동구 조방로49번길 23-11 (범일동)051-635-1818
173숙박업(생활)복성여관부산광역시 동구 대영로243번길 55 (초량동)051-468-1676
174숙박업(생활)온팍스 레지던스부산광역시 동구 중앙대로196번길 16-12, 3~4층 (초량동)051-468-1537
175숙박업(생활)워라밸 게스트하우스부산광역시 동구 초량중로 11, 2층 (초량동)051-463-1555
176숙박업(생활)부산역 오름 레지던스부산광역시 동구 중앙대로180번길 16-8, 지하1층일부,지상1층일부,2~20층 (초량동)<NA>

Duplicate rows

Most frequently occurring

업종명업소명업소소재지(도로명)소재지전화# duplicates
0숙박업(생활)부산역 오름 레지던스부산광역시 동구 중앙대로180번길 16-8, 지하1층일부,지상1층일부,2~20층 (초량동)<NA>2
1숙박업(생활)온팍스 레지던스부산광역시 동구 중앙대로196번길 16-12, 3~4층 (초량동)051-468-15372
2숙박업(생활)워라밸 게스트하우스부산광역시 동구 초량중로 11, 2층 (초량동)051-463-15552