Overview

Dataset statistics

Number of variables3
Number of observations81
Missing cells2
Missing cells (%)0.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.0 KiB
Average record size in memory25.6 B

Variable types

Text3

Dataset

Description부산광역시연제구_숙박업현황_20221024
Author부산광역시 연제구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3082200

Alerts

소재지전화 has 2 (2.5%) missing valuesMissing
영업소 주소(도로명) has unique valuesUnique

Reproduction

Analysis started2023-12-10 16:44:54.405281
Analysis finished2023-12-10 16:44:54.982954
Duration0.58 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct77
Distinct (%)95.1%
Missing0
Missing (%)0.0%
Memory size780.0 B
2023-12-11T01:44:55.245029image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length15
Mean length5.8888889
Min length2

Characters and Unicode

Total characters477
Distinct characters148
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique73 ?
Unique (%)90.1%

Sample

1st row궁락모텔
2nd row미진장여관
3rd row제일여관
4th row세림장
5th row대원장여관
ValueCountFrequency (%)
호텔 6
 
5.9%
아르빌 2
 
2.0%
원하우스 2
 
2.0%
신성하우스 2
 
2.0%
더킹호텔 2
 
2.0%
홈텔 2
 
2.0%
앙소르 2
 
2.0%
17th(티에이치)호텔 1
 
1.0%
호텔오마이 1
 
1.0%
파라다이스모텔 1
 
1.0%
Other values (80) 80
79.2%
2023-12-11T01:44:55.802196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
49
 
10.3%
28
 
5.9%
21
 
4.4%
21
 
4.4%
17
 
3.6%
14
 
2.9%
12
 
2.5%
( 12
 
2.5%
) 12
 
2.5%
10
 
2.1%
Other values (138) 281
58.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 371
77.8%
Uppercase Letter 46
 
9.6%
Space Separator 21
 
4.4%
Open Punctuation 12
 
2.5%
Close Punctuation 12
 
2.5%
Decimal Number 7
 
1.5%
Lowercase Letter 7
 
1.5%
Dash Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
49
 
13.2%
28
 
7.5%
21
 
5.7%
17
 
4.6%
14
 
3.8%
12
 
3.2%
10
 
2.7%
7
 
1.9%
6
 
1.6%
5
 
1.3%
Other values (104) 202
54.4%
Uppercase Letter
ValueCountFrequency (%)
O 7
15.2%
B 4
 
8.7%
T 4
 
8.7%
H 3
 
6.5%
Y 3
 
6.5%
I 3
 
6.5%
N 3
 
6.5%
S 2
 
4.3%
W 2
 
4.3%
A 2
 
4.3%
Other values (9) 13
28.3%
Lowercase Letter
ValueCountFrequency (%)
g 2
28.6%
t 1
14.3%
h 1
14.3%
e 1
14.3%
u 1
14.3%
o 1
14.3%
Decimal Number
ValueCountFrequency (%)
1 3
42.9%
7 1
 
14.3%
9 1
 
14.3%
2 1
 
14.3%
5 1
 
14.3%
Space Separator
ValueCountFrequency (%)
21
100.0%
Open Punctuation
ValueCountFrequency (%)
( 12
100.0%
Close Punctuation
ValueCountFrequency (%)
) 12
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 371
77.8%
Common 53
 
11.1%
Latin 53
 
11.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
49
 
13.2%
28
 
7.5%
21
 
5.7%
17
 
4.6%
14
 
3.8%
12
 
3.2%
10
 
2.7%
7
 
1.9%
6
 
1.6%
5
 
1.3%
Other values (104) 202
54.4%
Latin
ValueCountFrequency (%)
O 7
 
13.2%
B 4
 
7.5%
T 4
 
7.5%
H 3
 
5.7%
Y 3
 
5.7%
I 3
 
5.7%
N 3
 
5.7%
S 2
 
3.8%
W 2
 
3.8%
A 2
 
3.8%
Other values (15) 20
37.7%
Common
ValueCountFrequency (%)
21
39.6%
( 12
22.6%
) 12
22.6%
1 3
 
5.7%
7 1
 
1.9%
- 1
 
1.9%
9 1
 
1.9%
2 1
 
1.9%
5 1
 
1.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 371
77.8%
ASCII 106
 
22.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
49
 
13.2%
28
 
7.5%
21
 
5.7%
17
 
4.6%
14
 
3.8%
12
 
3.2%
10
 
2.7%
7
 
1.9%
6
 
1.6%
5
 
1.3%
Other values (104) 202
54.4%
ASCII
ValueCountFrequency (%)
21
19.8%
( 12
 
11.3%
) 12
 
11.3%
O 7
 
6.6%
B 4
 
3.8%
T 4
 
3.8%
H 3
 
2.8%
Y 3
 
2.8%
I 3
 
2.8%
N 3
 
2.8%
Other values (24) 34
32.1%
Distinct81
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size780.0 B
2023-12-11T01:44:56.132649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length42
Mean length27.925926
Min length21

Characters and Unicode

Total characters2262
Distinct characters60
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique81 ?
Unique (%)100.0%

Sample

1st row부산광역시 연제구 중앙대로1120번길 17 (연산동)
2nd row부산광역시 연제구 거제시장로 24 (거제동)
3rd row부산광역시 연제구 월드컵대로 217-1 (거제동)
4th row부산광역시 연제구 과정로191번길 3 (연산동)
5th row부산광역시 연제구 거제천로230번길 98 (연산동)
ValueCountFrequency (%)
부산광역시 72
18.9%
연제구 72
18.9%
연산동 67
17.6%
과정로191번길 7
 
1.8%
반송로 6
 
1.6%
중앙대로1120번길 6
 
1.6%
과정로 5
 
1.3%
거제천로154번길 4
 
1.1%
월드컵대로 4
 
1.1%
거제천로230번길 4
 
1.1%
Other values (101) 133
35.0%
2023-12-11T01:44:56.671184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
299
 
13.2%
159
 
7.0%
159
 
7.0%
1 132
 
5.8%
101
 
4.5%
84
 
3.7%
82
 
3.6%
) 82
 
3.6%
( 82
 
3.6%
82
 
3.6%
Other values (50) 1000
44.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1352
59.8%
Decimal Number 396
 
17.5%
Space Separator 299
 
13.2%
Close Punctuation 82
 
3.6%
Open Punctuation 82
 
3.6%
Dash Punctuation 21
 
0.9%
Other Punctuation 15
 
0.7%
Math Symbol 9
 
0.4%
Uppercase Letter 6
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
159
11.8%
159
11.8%
101
 
7.5%
84
 
6.2%
82
 
6.1%
82
 
6.1%
81
 
6.0%
81
 
6.0%
81
 
6.0%
81
 
6.0%
Other values (32) 361
26.7%
Decimal Number
ValueCountFrequency (%)
1 132
33.3%
2 56
14.1%
5 41
 
10.4%
3 37
 
9.3%
4 30
 
7.6%
0 28
 
7.1%
6 24
 
6.1%
9 21
 
5.3%
8 18
 
4.5%
7 9
 
2.3%
Uppercase Letter
ValueCountFrequency (%)
C 4
66.7%
M 2
33.3%
Space Separator
ValueCountFrequency (%)
299
100.0%
Close Punctuation
ValueCountFrequency (%)
) 82
100.0%
Open Punctuation
ValueCountFrequency (%)
( 82
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 21
100.0%
Other Punctuation
ValueCountFrequency (%)
, 15
100.0%
Math Symbol
ValueCountFrequency (%)
~ 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1352
59.8%
Common 904
40.0%
Latin 6
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
159
11.8%
159
11.8%
101
 
7.5%
84
 
6.2%
82
 
6.1%
82
 
6.1%
81
 
6.0%
81
 
6.0%
81
 
6.0%
81
 
6.0%
Other values (32) 361
26.7%
Common
ValueCountFrequency (%)
299
33.1%
1 132
14.6%
) 82
 
9.1%
( 82
 
9.1%
2 56
 
6.2%
5 41
 
4.5%
3 37
 
4.1%
4 30
 
3.3%
0 28
 
3.1%
6 24
 
2.7%
Other values (6) 93
 
10.3%
Latin
ValueCountFrequency (%)
C 4
66.7%
M 2
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1352
59.8%
ASCII 910
40.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
299
32.9%
1 132
14.5%
) 82
 
9.0%
( 82
 
9.0%
2 56
 
6.2%
5 41
 
4.5%
3 37
 
4.1%
4 30
 
3.3%
0 28
 
3.1%
6 24
 
2.6%
Other values (8) 99
 
10.9%
Hangul
ValueCountFrequency (%)
159
11.8%
159
11.8%
101
 
7.5%
84
 
6.2%
82
 
6.1%
82
 
6.1%
81
 
6.0%
81
 
6.0%
81
 
6.0%
81
 
6.0%
Other values (32) 361
26.7%

소재지전화
Text

MISSING 

Distinct79
Distinct (%)100.0%
Missing2
Missing (%)2.5%
Memory size780.0 B
2023-12-11T01:44:57.030042image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length14
Mean length13.746835
Min length10

Characters and Unicode

Total characters1086
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique79 ?
Unique (%)100.0%

Sample

1st row 051- 861-6727
2nd row 051- 852-5927
3rd row 051- 503-5639
4th row051 -751 -1915
5th row 051- 865-7560
ValueCountFrequency (%)
051 67
37.9%
851 4
 
2.3%
852 4
 
2.3%
866 4
 
2.3%
3
 
1.7%
868 3
 
1.7%
863 2
 
1.1%
864 2
 
1.1%
851-9949 1
 
0.6%
6123 1
 
0.6%
Other values (86) 86
48.6%
2023-12-11T01:44:57.607049image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 158
14.5%
5 151
13.9%
146
13.4%
1 123
11.3%
0 118
10.9%
8 94
8.7%
6 75
6.9%
2 55
 
5.1%
7 52
 
4.8%
3 49
 
4.5%
Other values (2) 65
6.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 782
72.0%
Dash Punctuation 158
 
14.5%
Space Separator 146
 
13.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 151
19.3%
1 123
15.7%
0 118
15.1%
8 94
12.0%
6 75
9.6%
2 55
 
7.0%
7 52
 
6.6%
3 49
 
6.3%
9 33
 
4.2%
4 32
 
4.1%
Dash Punctuation
ValueCountFrequency (%)
- 158
100.0%
Space Separator
ValueCountFrequency (%)
146
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1086
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 158
14.5%
5 151
13.9%
146
13.4%
1 123
11.3%
0 118
10.9%
8 94
8.7%
6 75
6.9%
2 55
 
5.1%
7 52
 
4.8%
3 49
 
4.5%
Other values (2) 65
6.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1086
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 158
14.5%
5 151
13.9%
146
13.4%
1 123
11.3%
0 118
10.9%
8 94
8.7%
6 75
6.9%
2 55
 
5.1%
7 52
 
4.8%
3 49
 
4.5%
Other values (2) 65
6.0%

Correlations

2023-12-11T01:44:57.783750image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업소명영업소 주소(도로명)소재지전화
업소명1.0001.0001.000
영업소 주소(도로명)1.0001.0001.000
소재지전화1.0001.0001.000

Missing values

2023-12-11T01:44:54.806515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:44:54.933686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업소명영업소 주소(도로명)소재지전화
0궁락모텔부산광역시 연제구 중앙대로1120번길 17 (연산동)051- 861-6727
1미진장여관부산광역시 연제구 거제시장로 24 (거제동)051- 852-5927
2제일여관부산광역시 연제구 월드컵대로 217-1 (거제동)051- 503-5639
3세림장부산광역시 연제구 과정로191번길 3 (연산동)051 -751 -1915
4대원장여관부산광역시 연제구 거제천로230번길 98 (연산동)051- 865-7560
5에그(egg)모텔부산광역시 연제구 고분로13번길 13 (연산동)051 -863 -9110
6토곡부산광역시 연제구 과정로 187-1 (연산동)051- 759-2040
7BNB(비앤비)부산광역시 연제구 월드컵대로114번길 15 (연산동)051-866 -8277
8샤이어호텔부산광역시 연제구 반송로 18-6 (연산동)051- 866-4645
9오모텔부산광역시 연제구 과정로 165-2 (연산동)051 -758 -7583
업소명영업소 주소(도로명)소재지전화
71어반스테이 부산연산부산광역시 연제구 중앙대로 1116-8, 2~23층 일부호 (연산동)1644-7694-
72더킹호텔부산광역시연제구과정로161,2~10층(연산동)051-752-1999
73원하우스부산광역시연제구월드컵대로120번길5(연산동)051-852-6032
74태양빌부산광역시연제구과정로165-1(연산동)051-759-3334
75신성하우스부산광역시연제구과정로132번길5,2~4층(연산동)051-758-9318
76아르빌부산광역시연제구거제천로230번길96(연산동)051-852-8769
77앙소르부산광역시연제구중앙대로1124번길24(연산동)051-852-8792
78빅토리아코트부산광역시연제구반송로18-10(연산동)051-862-7322
79피아노홈텔부산광역시연제구중앙대로1124번길30(연산동)<NA>
80포시즌레지던스부산광역시연제구월드컵대로152,CMC빌딩8~15층(연산동)051-852-3654