Overview

Dataset statistics

Number of variables: 3
Number of observations: 94
Missing cells: 7
Missing cells (%): 2.5%
Duplicate rows: 8
Duplicate rows (%): 8.5%
Total size in memory: 2.3 KiB
Average record size in memory: 25.4 B
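
The two percentages above can be re-derived from the counts. The following is a minimal sketch, assuming the missing-cell share is taken over all cells (rows times columns) and the duplicate share over the row count; the variable names are illustrative and not taken from the profiling tool.

```python
# Counts copied from the dataset statistics above.
n_variables = 3
n_observations = 94
missing_cells = 7
duplicate_rows = 8

# Assumption: missing cells are measured against every cell (rows x columns).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: duplicate rows are measured against the number of rows.
duplicate_pct = 100 * duplicate_rows / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Duplicate rows: {duplicate_pct:.1f}%")
```

Under these assumptions the rounded results agree with the 2.5% and 8.5% reported above.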

Variable types

Text: 3

Dataset

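Description: Busan Metropolitan City Yeonje-gu, lodging business status, 20191218 (부산광역시연제구_숙박업현황_20191218)
Author: Busan Metropolitan City Yeonje-gu (부산광역시 연제구)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3082200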

Alerts

Dataset has 8 (8.5%) duplicate rows (Duplicates)
소재지전화 has 7 (7.4%) missing values (Missing)

Reproduction

Analysis started: 2023-12-10 16:44:59.093007
Analysis finished: 2023-12-10 16:44:59.580848
Duration: 0.49 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

업소명 (business name)
Text

Distinct: 85
Distinct (%): 90.4%
Missing: 0
Missing (%): 0.0%
Memory size: 884.0 B

Length

Max length: 23
Median length: 15
Mean length: 5.9042553
Min length: 2
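
As an illustration of how such length statistics arise, the sketch below computes them for five values from this variable's sample rows using only the Python standard library. It is not the profiling tool's own code, and the five values are a stand-in for the full column. The last line checks that the reported mean is consistent with the 555 total characters over 94 observations listed for this variable.

```python
import statistics
import unicodedata

# Five values from this variable's sample rows. NFC normalisation keeps each
# Hangul syllable as one code point, so len() counts visible characters.
values = [
    unicodedata.normalize("NFC", v)
    for v in ["궁락모텔", "미진장여관", "제일여관", "세림장", "1RUA(일루아)"]
]

lengths = [len(v) for v in values]

summary = {
    "max": max(lengths),
    "median": statistics.median(lengths),
    "mean": statistics.mean(lengths),
    "min": min(lengths),
}
print(summary)

# Mean length over the whole column: total characters / observations.
report_mean = 555 / 94
print(round(report_mean, 7))
```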

Characters and Unicode

Total characters: 555
Distinct characters: 165
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
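
A short sketch of what such a property lookup looks like, using Python's standard `unicodedata` module on one value from this variable. It only covers the general category; script and block lookups are not available in the standard library and are left out. The choice of sample string is illustrative.

```python
import unicodedata
from collections import Counter

# One value from this variable, normalised to NFC so that each Hangul
# syllable is a single code point.
text = unicodedata.normalize("NFC", "1RUA(일루아)")

# unicodedata.category returns the two-letter general category, for example
# "Lo" (Other Letter), "Lu" (Uppercase Letter), "Nd" (Decimal Number),
# "Ps" (Open Punctuation) and "Pe" (Close Punctuation).
category_counts = Counter(unicodedata.category(ch) for ch in text)

for category, count in category_counts.most_common():
    print(category, count)
```

The Hangul syllables fall under "Lo", which corresponds to the "Other Letter" category in the tables of this report.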

Unique

Unique: 76
Unique (%): 80.9%

Sample

1st row: 궁락모텔
2nd row: 미진장여관
3rd row: 제일여관
4th row: 세림장
5th row: 1RUA(일루아)

Words

Value | Count | Frequency (%)
호텔 | 4 | 3.6%
시애틀비호텔 | 2 | 1.8%
호텔m(목화호텔 | 2 | 1.8%
모텔 | 2 | 1.8%
태양 | 2 | 1.8%
아르빌 | 2 | 1.8%
신성하우스 | 2 | 1.8%
앙소르 | 2 | 1.8%
원하우스 | 2 | 1.8%
더킹호텔 | 2 | 1.8%
Other values (87) | 90 | 80.4%

Most occurring characters

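Value | Count | Frequency (%)
(Hangul character) | 54 | 9.7%
(Hangul character) | 29 | 5.2%
(Hangul character) | 27 | 4.9%
(Hangul character) | 20 | 3.6%
) | 19 | 3.4%
( | 19 | 3.4%
(space) | 18 | 3.2%
(Hangul character) | 16 | 2.9%
(Hangul character) | 13 | 2.3%
(Hangul character) | 13 | 2.3%
Other values (155) | 327 | 58.9%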

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 424 | 76.4%
Uppercase Letter | 48 | 8.6%
Close Punctuation | 19 | 3.4%
Open Punctuation | 19 | 3.4%
Space Separator | 18 | 3.2%
Decimal Number | 14 | 2.5%
Lowercase Letter | 10 | 1.8%
Other Punctuation | 2 | 0.4%
Dash Punctuation | 1 | 0.2%

Most frequent character per category

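Other Letter
Value | Count | Frequency (%)
(Hangul character) | 54 | 12.7%
(Hangul character) | 29 | 6.8%
(Hangul character) | 27 | 6.4%
(Hangul character) | 20 | 4.7%
(Hangul character) | 16 | 3.8%
(Hangul character) | 13 | 3.1%
(Hangul character) | 13 | 3.1%
(Hangul character) | 7 | 1.7%
(Hangul character) | 7 | 1.7%
(Hangul character) | 6 | 1.4%
Other values (115) | 232 | 54.7%

Uppercase Letter
Value | Count | Frequency (%)
U | 5 | 10.4%
M | 5 | 10.4%
O | 5 | 10.4%
B | 4 | 8.3%
I | 3 | 6.2%
T | 3 | 6.2%
R | 3 | 6.2%
A | 3 | 6.2%
N | 3 | 6.2%
S | 2 | 4.2%
Other values (8) | 12 | 25.0%

Lowercase Letter
Value | Count | Frequency (%)
g | 2 | 20.0%
t | 1 | 10.0%
h | 1 | 10.0%
p | 1 | 10.0%
m | 1 | 10.0%
a | 1 | 10.0%
e | 1 | 10.0%
u | 1 | 10.0%
o | 1 | 10.0%

Decimal Number
Value | Count | Frequency (%)
1 | 4 | 28.6%
5 | 2 | 14.3%
9 | 2 | 14.3%
6 | 2 | 14.3%
3 | 2 | 14.3%
2 | 1 | 7.1%
7 | 1 | 7.1%

Other Punctuation
Value | Count | Frequency (%)
; | 1 | 50.0%
& | 1 | 50.0%

Close Punctuation
Value | Count | Frequency (%)
) | 19 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 19 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 18 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 1 | 100.0%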

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 424 | 76.4%
Common | 73 | 13.2%
Latin | 58 | 10.5%

Most frequent character per script

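Hangul
Value | Count | Frequency (%)
(Hangul character) | 54 | 12.7%
(Hangul character) | 29 | 6.8%
(Hangul character) | 27 | 6.4%
(Hangul character) | 20 | 4.7%
(Hangul character) | 16 | 3.8%
(Hangul character) | 13 | 3.1%
(Hangul character) | 13 | 3.1%
(Hangul character) | 7 | 1.7%
(Hangul character) | 7 | 1.7%
(Hangul character) | 6 | 1.4%
Other values (115) | 232 | 54.7%

Latin
Value | Count | Frequency (%)
U | 5 | 8.6%
M | 5 | 8.6%
O | 5 | 8.6%
B | 4 | 6.9%
I | 3 | 5.2%
T | 3 | 5.2%
R | 3 | 5.2%
A | 3 | 5.2%
N | 3 | 5.2%
S | 2 | 3.4%
Other values (17) | 22 | 37.9%

Common
Value | Count | Frequency (%)
) | 19 | 26.0%
( | 19 | 26.0%
(space) | 18 | 24.7%
1 | 4 | 5.5%
5 | 2 | 2.7%
9 | 2 | 2.7%
6 | 2 | 2.7%
3 | 2 | 2.7%
2 | 1 | 1.4%
7 | 1 | 1.4%
Other values (3) | 3 | 4.1%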

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 424 | 76.4%
ASCII | 131 | 23.6%

Most frequent character per block

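Hangul
Value | Count | Frequency (%)
(Hangul character) | 54 | 12.7%
(Hangul character) | 29 | 6.8%
(Hangul character) | 27 | 6.4%
(Hangul character) | 20 | 4.7%
(Hangul character) | 16 | 3.8%
(Hangul character) | 13 | 3.1%
(Hangul character) | 13 | 3.1%
(Hangul character) | 7 | 1.7%
(Hangul character) | 7 | 1.7%
(Hangul character) | 6 | 1.4%
Other values (115) | 232 | 54.7%

ASCII
Value | Count | Frequency (%)
) | 19 | 14.5%
( | 19 | 14.5%
(space) | 18 | 13.7%
U | 5 | 3.8%
M | 5 | 3.8%
O | 5 | 3.8%
1 | 4 | 3.1%
B | 4 | 3.1%
I | 3 | 2.3%
T | 3 | 2.3%
Other values (30) | 46 | 35.1%
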
업소소재지(도로명) (business location, road-name address)
Text

Distinct: 86
Distinct (%): 91.5%
Missing: 0
Missing (%): 0.0%
Memory size: 884.0 B

Length

Max length: 45
Median length: 38
Mean length: 28.212766
Min length: 21

Characters and Unicode

Total characters: 2652
Distinct characters: 57
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2

Unique

Unique: 78
Unique (%): 83.0%

Sample

1st row: 부산광역시 연제구 중앙대로1120번길 17 (연산동)
2nd row: 부산광역시 연제구 거제시장로 24 (거제동)
3rd row: 부산광역시 연제구 월드컵대로 217-1 (거제동)
4th row: 부산광역시 연제구 과정로191번길 3 (연산동)
5th row: 부산광역시 연제구 반송로 18-14 (연산동)

Words

Value | Count | Frequency (%)
부산광역시 | 94 | 19.4%
연제구 | 94 | 19.4%
연산동 | 88 | 18.2%
반송로 | 10 | 2.1%
중앙대로1120번길 | 10 | 2.1%
과정로 | 8 | 1.7%
과정로191번길 | 7 | 1.4%
월드컵대로 | 6 | 1.2%
거제천로154번길 | 6 | 1.2%
5 | 5 | 1.0%
Other values (98) | 156 | 32.2%

Most occurring characters

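Value | Count | Frequency (%)
(space) | 390 | 14.7%
(Hangul character) | 185 | 7.0%
(Hangul character) | 185 | 7.0%
1 | 158 | 6.0%
(Hangul character) | 117 | 4.4%
(Hangul character) | 97 | 3.7%
) | 95 | 3.6%
(Hangul character) | 95 | 3.6%
( | 95 | 3.6%
(Hangul character) | 94 | 3.5%
Other values (47) | 1141 | 43.0%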

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1560 | 58.8%
Decimal Number | 459 | 17.3%
Space Separator | 390 | 14.7%
Close Punctuation | 95 | 3.6%
Open Punctuation | 95 | 3.6%
Dash Punctuation | 25 | 0.9%
Other Punctuation | 14 | 0.5%
Math Symbol | 8 | 0.3%
Uppercase Letter | 6 | 0.2%

Most frequent character per category

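Other Letter
Value | Count | Frequency (%)
(Hangul character) | 185 | 11.9%
(Hangul character) | 185 | 11.9%
(Hangul character) | 117 | 7.5%
(Hangul character) | 97 | 6.2%
(Hangul character) | 95 | 6.1%
(Hangul character) | 94 | 6.0%
(Hangul character) | 94 | 6.0%
(Hangul character) | 94 | 6.0%
(Hangul character) | 94 | 6.0%
(Hangul character) | 94 | 6.0%
Other values (29) | 411 | 26.3%

Decimal Number
Value | Count | Frequency (%)
1 | 158 | 34.4%
2 | 65 | 14.2%
5 | 50 | 10.9%
4 | 36 | 7.8%
0 | 35 | 7.6%
3 | 35 | 7.6%
6 | 27 | 5.9%
9 | 22 | 4.8%
8 | 19 | 4.1%
7 | 12 | 2.6%

Uppercase Letter
Value | Count | Frequency (%)
C | 4 | 66.7%
M | 2 | 33.3%

Space Separator
Value | Count | Frequency (%)
(space) | 390 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 95 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 95 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 25 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 14 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 8 | 100.0%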

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1560 | 58.8%
Common | 1086 | 41.0%
Latin | 6 | 0.2%

Most frequent character per script

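Hangul
Value | Count | Frequency (%)
(Hangul character) | 185 | 11.9%
(Hangul character) | 185 | 11.9%
(Hangul character) | 117 | 7.5%
(Hangul character) | 97 | 6.2%
(Hangul character) | 95 | 6.1%
(Hangul character) | 94 | 6.0%
(Hangul character) | 94 | 6.0%
(Hangul character) | 94 | 6.0%
(Hangul character) | 94 | 6.0%
(Hangul character) | 94 | 6.0%
Other values (29) | 411 | 26.3%

Common
Value | Count | Frequency (%)
(space) | 390 | 35.9%
1 | 158 | 14.5%
) | 95 | 8.7%
( | 95 | 8.7%
2 | 65 | 6.0%
5 | 50 | 4.6%
4 | 36 | 3.3%
0 | 35 | 3.2%
3 | 35 | 3.2%
6 | 27 | 2.5%
Other values (6) | 100 | 9.2%

Latin
Value | Count | Frequency (%)
C | 4 | 66.7%
M | 2 | 33.3%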

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1560 | 58.8%
ASCII | 1092 | 41.2%

Most frequent character per block

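ASCII
Value | Count | Frequency (%)
(space) | 390 | 35.7%
1 | 158 | 14.5%
) | 95 | 8.7%
( | 95 | 8.7%
2 | 65 | 6.0%
5 | 50 | 4.6%
4 | 36 | 3.3%
0 | 35 | 3.2%
3 | 35 | 3.2%
6 | 27 | 2.5%
Other values (8) | 106 | 9.7%

Hangul
Value | Count | Frequency (%)
(Hangul character) | 185 | 11.9%
(Hangul character) | 185 | 11.9%
(Hangul character) | 117 | 7.5%
(Hangul character) | 97 | 6.2%
(Hangul character) | 95 | 6.1%
(Hangul character) | 94 | 6.0%
(Hangul character) | 94 | 6.0%
(Hangul character) | 94 | 6.0%
(Hangul character) | 94 | 6.0%
(Hangul character) | 94 | 6.0%
Other values (29) | 411 | 26.3%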

소재지전화 (location phone number)
Text

MISSING

Distinct: 75
Distinct (%): 86.2%
Missing: 7
Missing (%): 7.4%
Memory size: 884.0 B

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12
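
Every non-missing value has length 12, which fits the `051-XXX-XXXX` shape of the sample values listed for this variable. The sketch below checks values against that shape. The pattern is an assumption inferred from the samples, not a rule stated in the report, and `is_valid_phone` is a hypothetical helper name.

```python
import re

# Assumed pattern, inferred from the sample rows: area code 051, then a
# 3-digit and a 4-digit group separated by hyphens (12 characters in total).
PHONE_PATTERN = re.compile(r"051-[0-9]{3}-[0-9]{4}")


def is_valid_phone(value):
    """Return True when value is a string that fully matches the assumed pattern."""
    return isinstance(value, str) and PHONE_PATTERN.fullmatch(value) is not None


# Three sample values from this variable plus one missing entry.
samples = ["051-861-6727", "051-852-5927", "051-503-5639", None]

valid = [v for v in samples if is_valid_phone(v)]
print(len(valid), "of", len(samples), "values match the assumed pattern")
```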

Characters and Unicode

Total characters: 1044
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 64
Unique (%): 73.6%

Sample

1st row: 051-861-6727
2nd row: 051-852-5927
3rd row: 051-503-5639
4th row: 051-753-0316
5th row: 051-865-7560

Words

Value | Count | Frequency (%)
051-852-3654 | 3 | 3.4%
051-852-6032 | 2 | 2.3%
051-864-2790 | 2 | 2.3%
051-759-3334 | 2 | 2.3%
051-853-1818 | 2 | 2.3%
051-752-1999 | 2 | 2.3%
051-852-7685 | 2 | 2.3%
051-852-8769 | 2 | 2.3%
051-758-9318 | 2 | 2.3%
051-852-8792 | 2 | 2.3%
Other values (65) | 66 | 75.9%

Most occurring characters

Value | Count | Frequency (%)
- | 174 | 16.7%
5 | 165 | 15.8%
1 | 133 | 12.7%
0 | 129 | 12.4%
8 | 107 | 10.2%
6 | 89 | 8.5%
2 | 59 | 5.7%
7 | 57 | 5.5%
3 | 55 | 5.3%
4 | 40 | 3.8%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 870 | 83.3%
Dash Punctuation | 174 | 16.7%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
5 | 165 | 19.0%
1 | 133 | 15.3%
0 | 129 | 14.8%
8 | 107 | 12.3%
6 | 89 | 10.2%
2 | 59 | 6.8%
7 | 57 | 6.6%
3 | 55 | 6.3%
4 | 40 | 4.6%
9 | 36 | 4.1%

Dash Punctuation
Value | Count | Frequency (%)
- | 174 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 1044 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 174 | 16.7%
5 | 165 | 15.8%
1 | 133 | 12.7%
0 | 129 | 12.4%
8 | 107 | 10.2%
6 | 89 | 8.5%
2 | 59 | 5.7%
7 | 57 | 5.5%
3 | 55 | 5.3%
4 | 40 | 3.8%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 1044 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 174 | 16.7%
5 | 165 | 15.8%
1 | 133 | 12.7%
0 | 129 | 12.4%
8 | 107 | 10.2%
6 | 89 | 8.5%
2 | 59 | 5.7%
7 | 57 | 5.5%
3 | 55 | 5.3%
4 | 40 | 3.8%

Correlations

Variable | 업소명 | 업소소재지(도로명) | 소재지전화
업소명 | 1.000 | 1.000 | 1.000
업소소재지(도로명) | 1.000 | 1.000 | 1.000
소재지전화 | 1.000 | 1.000 | 1.000

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly spot patterns in data completion.
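
To make the idea concrete, the sketch below builds a small nullity matrix by hand from three rows of this report's sample, one of which has a missing phone number. It uses plain Python lists rather than the plotting library behind the report's figures. Representing `<NA>` as `None` is an assumption of this example.

```python
# Column names and three rows copied from the report's sample table;
# <NA> is represented as None here.
columns = ["업소명", "업소소재지(도로명)", "소재지전화"]
rows = [
    ("궁락모텔", "부산광역시 연제구 중앙대로1120번길 17 (연산동)", "051-861-6727"),
    ("세림장", "부산광역시 연제구 과정로191번길 3 (연산동)", "051-753-0316"),
    ("1RUA(일루아)", "부산광역시 연제구 반송로 18-14 (연산동)", None),
]

# Nullity matrix: one boolean per cell, True where the value is missing.
nullity_matrix = [[value is None for value in row] for row in rows]

# Nullity by column: number of missing values in each column.
missing_per_column = {
    column: sum(row[i] for row in nullity_matrix)
    for i, column in enumerate(columns)
}

for row in nullity_matrix:
    print(row)
print(missing_per_column)
```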

Sample

Row | 업소명 | 업소소재지(도로명) | 소재지전화
0 | 궁락모텔 | 부산광역시 연제구 중앙대로1120번길 17 (연산동) | 051-861-6727
1 | 미진장여관 | 부산광역시 연제구 거제시장로 24 (거제동) | 051-852-5927
2 | 제일여관 | 부산광역시 연제구 월드컵대로 217-1 (거제동) | 051-503-5639
3 | 세림장 | 부산광역시 연제구 과정로191번길 3 (연산동) | 051-753-0316
4 | 1RUA(일루아) | 부산광역시 연제구 반송로 18-14 (연산동) | <NA>
5 | 대원장여관 | 부산광역시 연제구 거제천로230번길 98 (연산동) | 051-865-7560
6 | 에그(egg)모텔 | 부산광역시 연제구 고분로13번길 13 (연산동) | 051-863-9110
7 | 토곡 | 부산광역시 연제구 과정로 187-1 (연산동) | 051-759-2040
8 | BNB(비앤비) | 부산광역시 연제구 월드컵대로114번길 15 (연산동) | 051-866-8277
9 | 샤이어호텔 | 부산광역시 연제구 반송로 18-6 (연산동) | 051-866-4645

Row | 업소명 | 업소소재지(도로명) | 소재지전화
84 | 시애틀비호텔 | 부산광역시 연제구 거제천로154번길 42 (연산동) | 051-852-7685
85 | (주)영진관광 더킹호텔 | 부산광역시 연제구 과정로 161, 2~10층 (연산동) | 051-752-1999
86 | 원하우스 | 부산광역시 연제구 월드컵대로120번길 5 (연산동) | 051-852-6032
87 | 태양 빌 | 부산광역시 연제구 과정로 165-1 (연산동) | 051-759-3334
88 | 신성하우스 | 부산광역시 연제구 과정로132번길 5, 2~4층 (연산동) | 051-758-9318
89 | 아르빌 | 부산광역시 연제구 거제천로230번길 96 (연산동) | 051-852-8769
90 | 앙소르 | 부산광역시 연제구 중앙대로1124번길 24 (연산동) | 051-852-8792
91 | 빅토리아코트 | 부산광역시 연제구 반송로 18-10 (연산동) | 051-862-7322
92 | 포시즌게스트하우스 | 부산광역시 연제구 월드컵대로 152, 15층 (연산동) | 051-852-3654
93 | 팡팡레지던스 | 부산광역시 연제구 월드컵대로 152, CMC빌딩 9~14층 (연산동) | 051-852-3654

Duplicate rows

Most frequently occurring

Row | 업소명 | 업소소재지(도로명) | 소재지전화 | # duplicates
0 | (주)영진관광 더킹호텔 | 부산광역시 연제구 과정로 161, 2~10층 (연산동) | 051-752-1999 | 2
1 | 빅토리아코트 | 부산광역시 연제구 반송로 18-10 (연산동) | 051-862-7322 | 2
2 | 신성하우스 | 부산광역시 연제구 과정로132번길 5, 2~4층 (연산동) | 051-758-9318 | 2
3 | 아르빌 | 부산광역시 연제구 거제천로230번길 96 (연산동) | 051-852-8769 | 2
4 | 앙소르 | 부산광역시 연제구 중앙대로1124번길 24 (연산동) | 051-852-8792 | 2
5 | 원하우스 | 부산광역시 연제구 월드컵대로120번길 5 (연산동) | 051-852-6032 | 2
6 | 태양 빌 | 부산광역시 연제구 과정로 165-1 (연산동) | 051-759-3334 | 2
7 | 호텔M(목화호텔) | 부산광역시 연제구 중앙대로 1125 (연산동) | 051-853-1818 | 2
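
Rows like these can be found by counting identical records. The sketch below does this with `collections.Counter` on a handful of rows taken from this report. It illustrates the idea only and is not the profiling tool's implementation; the four-row list is a made-up subset for the example.

```python
from collections import Counter

# A small illustrative subset: one row from the duplicate table entered twice,
# plus two other rows from the report's sample.
rows = [
    ("아르빌", "부산광역시 연제구 거제천로230번길 96 (연산동)", "051-852-8769"),
    ("앙소르", "부산광역시 연제구 중앙대로1124번길 24 (연산동)", "051-852-8792"),
    ("아르빌", "부산광역시 연제구 거제천로230번길 96 (연산동)", "051-852-8769"),
    ("궁락모텔", "부산광역시 연제구 중앙대로1120번길 17 (연산동)", "051-861-6727"),
]

# Tuples are hashable, so each full row can be counted directly.
row_counts = Counter(rows)

# Keep only rows that occur more than once, with their occurrence count.
duplicates = {row: count for row, count in row_counts.items() if count > 1}

for row, count in duplicates.items():
    print(row[0], count)
```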