Overview

Dataset statistics

Number of variables5
Number of observations98
Missing cells7
Missing cells (%)1.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.0 KiB
Average record size in memory41.3 B

Variable types

Categorical1
Text4

Dataset

Description부산광역시 북구 관내에 있는 숙박업소 현황으로 업소명, 소재지 도로명주소, 지번주소, 전화번호(공란 개인정보) 등의 정보를 제공합니다.
Author부산광역시 북구
URLhttps://www.data.go.kr/data/3069378/fileData.do

Alerts

업종명 is highly imbalanced (85.6%)Imbalance
소재지전화 has 7 (7.1%) missing valuesMissing
영업소 주소(도로명) has unique valuesUnique

Reproduction

Analysis started2024-03-14 16:46:17.529318
Analysis finished2024-03-14 16:46:19.233843
Duration1.7 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

IMBALANCE 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size912.0 B
숙박업(일반)
96 
숙박업(생활)
 
2

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row숙박업(일반)
2nd row숙박업(일반)
3rd row숙박업(일반)
4th row숙박업(일반)
5th row숙박업(일반)

Common Values

ValueCountFrequency (%)
숙박업(일반) 96
98.0%
숙박업(생활) 2
 
2.0%

Length

2024-03-15T01:46:19.403061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T01:46:19.714696image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
숙박업(일반 96
98.0%
숙박업(생활 2
 
2.0%
Distinct96
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Memory size912.0 B
2024-03-15T01:46:20.931657image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length11
Mean length4.9387755
Min length2

Characters and Unicode

Total characters484
Distinct characters162
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique94 ?
Unique (%)95.9%

Sample

1st row역전
2nd row보림
3rd row산해
4th row밀양
5th row청포별장
ValueCountFrequency (%)
브라운도트호텔 3
 
2.7%
호텔 3
 
2.7%
스카이모텔 2
 
1.8%
덴바스타 2
 
1.8%
만덕점 2
 
1.8%
엠유(mu 2
 
1.8%
본호텔 1
 
0.9%
런더너 1
 
0.9%
티티호텔 1
 
0.9%
모텔25시 1
 
0.9%
Other values (93) 93
83.8%
2024-03-15T01:46:22.679489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
47
 
9.7%
29
 
6.0%
24
 
5.0%
19
 
3.9%
14
 
2.9%
13
 
2.7%
13
 
2.7%
) 10
 
2.1%
( 10
 
2.1%
8
 
1.7%
Other values (152) 297
61.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 427
88.2%
Uppercase Letter 18
 
3.7%
Space Separator 13
 
2.7%
Close Punctuation 10
 
2.1%
Open Punctuation 10
 
2.1%
Decimal Number 5
 
1.0%
Other Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
47
 
11.0%
29
 
6.8%
24
 
5.6%
19
 
4.4%
14
 
3.3%
13
 
3.0%
8
 
1.9%
7
 
1.6%
6
 
1.4%
6
 
1.4%
Other values (135) 254
59.5%
Uppercase Letter
ValueCountFrequency (%)
U 3
16.7%
W 3
16.7%
S 2
11.1%
O 2
11.1%
M 2
11.1%
A 2
11.1%
B 2
11.1%
H 1
 
5.6%
V 1
 
5.6%
Decimal Number
ValueCountFrequency (%)
2 2
40.0%
5 1
20.0%
1 1
20.0%
7 1
20.0%
Space Separator
ValueCountFrequency (%)
13
100.0%
Close Punctuation
ValueCountFrequency (%)
) 10
100.0%
Open Punctuation
ValueCountFrequency (%)
( 10
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 427
88.2%
Common 39
 
8.1%
Latin 18
 
3.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
47
 
11.0%
29
 
6.8%
24
 
5.6%
19
 
4.4%
14
 
3.3%
13
 
3.0%
8
 
1.9%
7
 
1.6%
6
 
1.4%
6
 
1.4%
Other values (135) 254
59.5%
Latin
ValueCountFrequency (%)
U 3
16.7%
W 3
16.7%
S 2
11.1%
O 2
11.1%
M 2
11.1%
A 2
11.1%
B 2
11.1%
H 1
 
5.6%
V 1
 
5.6%
Common
ValueCountFrequency (%)
13
33.3%
) 10
25.6%
( 10
25.6%
2 2
 
5.1%
& 1
 
2.6%
5 1
 
2.6%
1 1
 
2.6%
7 1
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 427
88.2%
ASCII 57
 
11.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
47
 
11.0%
29
 
6.8%
24
 
5.6%
19
 
4.4%
14
 
3.3%
13
 
3.0%
8
 
1.9%
7
 
1.6%
6
 
1.4%
6
 
1.4%
Other values (135) 254
59.5%
ASCII
ValueCountFrequency (%)
13
22.8%
) 10
17.5%
( 10
17.5%
U 3
 
5.3%
W 3
 
5.3%
S 2
 
3.5%
O 2
 
3.5%
M 2
 
3.5%
2 2
 
3.5%
A 2
 
3.5%
Other values (7) 8
14.0%
Distinct98
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size912.0 B
2024-03-15T01:46:23.785196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length30
Mean length25.642857
Min length20

Characters and Unicode

Total characters2513
Distinct characters59
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique98 ?
Unique (%)100.0%

Sample

1st row부산광역시 북구 낙동대로1704번길 10 (구포동)
2nd row부산광역시 북구 구포만세길 6 (구포동)
3rd row부산광역시 북구 구포만세길 36-19 (구포동)
4th row부산광역시 북구 낙동대로1694번가길 6-1 (구포동)
5th row부산광역시 북구 가람로 6-1 (구포동)
ValueCountFrequency (%)
부산광역시 98
19.9%
북구 98
19.9%
구포동 57
 
11.6%
덕천동 17
 
3.4%
낙동대로 13
 
2.6%
화명동 11
 
2.2%
만덕동 11
 
2.2%
만덕고개길 9
 
1.8%
구포만세길 8
 
1.6%
금곡대로8번길 7
 
1.4%
Other values (100) 164
33.3%
2024-03-15T01:46:25.632779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
395
 
15.7%
166
 
6.6%
138
 
5.5%
101
 
4.0%
100
 
4.0%
98
 
3.9%
98
 
3.9%
98
 
3.9%
98
 
3.9%
) 98
 
3.9%
Other values (49) 1123
44.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1510
60.1%
Space Separator 395
 
15.7%
Decimal Number 392
 
15.6%
Close Punctuation 98
 
3.9%
Open Punctuation 98
 
3.9%
Dash Punctuation 15
 
0.6%
Other Punctuation 5
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
166
 
11.0%
138
 
9.1%
101
 
6.7%
100
 
6.6%
98
 
6.5%
98
 
6.5%
98
 
6.5%
98
 
6.5%
76
 
5.0%
75
 
5.0%
Other values (34) 462
30.6%
Decimal Number
ValueCountFrequency (%)
1 85
21.7%
6 48
12.2%
8 44
11.2%
7 42
10.7%
2 37
9.4%
4 34
 
8.7%
3 33
 
8.4%
5 24
 
6.1%
0 24
 
6.1%
9 21
 
5.4%
Space Separator
ValueCountFrequency (%)
395
100.0%
Close Punctuation
ValueCountFrequency (%)
) 98
100.0%
Open Punctuation
ValueCountFrequency (%)
( 98
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 15
100.0%
Other Punctuation
ValueCountFrequency (%)
, 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1510
60.1%
Common 1003
39.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
166
 
11.0%
138
 
9.1%
101
 
6.7%
100
 
6.6%
98
 
6.5%
98
 
6.5%
98
 
6.5%
98
 
6.5%
76
 
5.0%
75
 
5.0%
Other values (34) 462
30.6%
Common
ValueCountFrequency (%)
395
39.4%
) 98
 
9.8%
( 98
 
9.8%
1 85
 
8.5%
6 48
 
4.8%
8 44
 
4.4%
7 42
 
4.2%
2 37
 
3.7%
4 34
 
3.4%
3 33
 
3.3%
Other values (5) 89
 
8.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1510
60.1%
ASCII 1003
39.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
395
39.4%
) 98
 
9.8%
( 98
 
9.8%
1 85
 
8.5%
6 48
 
4.8%
8 44
 
4.4%
7 42
 
4.2%
2 37
 
3.7%
4 34
 
3.4%
3 33
 
3.3%
Other values (5) 89
 
8.9%
Hangul
ValueCountFrequency (%)
166
 
11.0%
138
 
9.1%
101
 
6.7%
100
 
6.6%
98
 
6.5%
98
 
6.5%
98
 
6.5%
98
 
6.5%
76
 
5.0%
75
 
5.0%
Other values (34) 462
30.6%
Distinct94
Distinct (%)95.9%
Missing0
Missing (%)0.0%
Memory size912.0 B
2024-03-15T01:46:26.938407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length27
Mean length20.122449
Min length17

Characters and Unicode

Total characters1972
Distinct characters42
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique90 ?
Unique (%)91.8%

Sample

1st row부산광역시 북구 구포동 1060-55
2nd row부산광역시 북구 구포동 1069-2
3rd row부산광역시 북구 구포동 1060-254
4th row부산광역시 북구 구포동 1060-279
5th row부산광역시 북구 구포동 1054-10
ValueCountFrequency (%)
부산광역시 98
24.5%
북구 98
24.5%
구포동 58
14.5%
덕천동 17
 
4.2%
만덕동 12
 
3.0%
화명동 11
 
2.8%
2270-2 2
 
0.5%
2270-1 2
 
0.5%
2274-5 2
 
0.5%
2275-4 2
 
0.5%
Other values (98) 98
24.5%
2024-03-15T01:46:28.702989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
401
20.3%
156
 
7.9%
98
 
5.0%
98
 
5.0%
98
 
5.0%
98
 
5.0%
98
 
5.0%
98
 
5.0%
98
 
5.0%
- 93
 
4.7%
Other values (32) 636
32.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 998
50.6%
Decimal Number 478
24.2%
Space Separator 401
20.3%
Dash Punctuation 93
 
4.7%
Uppercase Letter 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
156
15.6%
98
9.8%
98
9.8%
98
9.8%
98
9.8%
98
9.8%
98
9.8%
98
9.8%
58
 
5.8%
29
 
2.9%
Other values (18) 69
6.9%
Decimal Number
ValueCountFrequency (%)
1 87
18.2%
2 64
13.4%
0 56
11.7%
5 55
11.5%
3 54
11.3%
6 43
9.0%
4 40
8.4%
7 37
7.7%
8 23
 
4.8%
9 19
 
4.0%
Uppercase Letter
ValueCountFrequency (%)
B 1
50.0%
T 1
50.0%
Space Separator
ValueCountFrequency (%)
401
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 93
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 998
50.6%
Common 972
49.3%
Latin 2
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
156
15.6%
98
9.8%
98
9.8%
98
9.8%
98
9.8%
98
9.8%
98
9.8%
98
9.8%
58
 
5.8%
29
 
2.9%
Other values (18) 69
6.9%
Common
ValueCountFrequency (%)
401
41.3%
- 93
 
9.6%
1 87
 
9.0%
2 64
 
6.6%
0 56
 
5.8%
5 55
 
5.7%
3 54
 
5.6%
6 43
 
4.4%
4 40
 
4.1%
7 37
 
3.8%
Other values (2) 42
 
4.3%
Latin
ValueCountFrequency (%)
B 1
50.0%
T 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 998
50.6%
ASCII 974
49.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
401
41.2%
- 93
 
9.5%
1 87
 
8.9%
2 64
 
6.6%
0 56
 
5.7%
5 55
 
5.6%
3 54
 
5.5%
6 43
 
4.4%
4 40
 
4.1%
7 37
 
3.8%
Other values (4) 44
 
4.5%
Hangul
ValueCountFrequency (%)
156
15.6%
98
9.8%
98
9.8%
98
9.8%
98
9.8%
98
9.8%
98
9.8%
98
9.8%
58
 
5.8%
29
 
2.9%
Other values (18) 69
6.9%

소재지전화
Text

MISSING 

Distinct91
Distinct (%)100.0%
Missing7
Missing (%)7.1%
Memory size912.0 B
2024-03-15T01:46:29.821369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters1092
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique91 ?
Unique (%)100.0%

Sample

1st row051-336-2988
2nd row051-332-1535
3rd row051-331-4488
4th row051-332-3482
5th row051-334-5726
ValueCountFrequency (%)
051-336-3279 1
 
1.1%
051-333-4685 1
 
1.1%
051-341-5540 1
 
1.1%
051-925-7725 1
 
1.1%
051-336-6673 1
 
1.1%
051-334-5665 1
 
1.1%
051-331-4610 1
 
1.1%
051-365-0806 1
 
1.1%
051-364-4504 1
 
1.1%
051-363-6565 1
 
1.1%
Other values (81) 81
89.0%
2024-03-15T01:46:31.216663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 220
20.1%
- 182
16.7%
1 142
13.0%
5 140
12.8%
0 132
12.1%
6 63
 
5.8%
8 49
 
4.5%
2 47
 
4.3%
4 42
 
3.8%
7 40
 
3.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 910
83.3%
Dash Punctuation 182
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 220
24.2%
1 142
15.6%
5 140
15.4%
0 132
14.5%
6 63
 
6.9%
8 49
 
5.4%
2 47
 
5.2%
4 42
 
4.6%
7 40
 
4.4%
9 35
 
3.8%
Dash Punctuation
ValueCountFrequency (%)
- 182
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1092
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 220
20.1%
- 182
16.7%
1 142
13.0%
5 140
12.8%
0 132
12.1%
6 63
 
5.8%
8 49
 
4.5%
2 47
 
4.3%
4 42
 
3.8%
7 40
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1092
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 220
20.1%
- 182
16.7%
1 142
13.0%
5 140
12.8%
0 132
12.1%
6 63
 
5.8%
8 49
 
4.5%
2 47
 
4.3%
4 42
 
3.8%
7 40
 
3.7%

Correlations

2024-03-15T01:46:31.461313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종명업소명영업소 주소(도로명)영업소 주소(지번)소재지전화
업종명1.0001.0001.0001.0001.000
업소명1.0001.0001.0000.9951.000
영업소 주소(도로명)1.0001.0001.0001.0001.000
영업소 주소(지번)1.0000.9951.0001.0001.000
소재지전화1.0001.0001.0001.0001.000

Missing values

2024-03-15T01:46:18.784268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T01:46:19.086212image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종명업소명영업소 주소(도로명)영업소 주소(지번)소재지전화
0숙박업(일반)역전부산광역시 북구 낙동대로1704번길 10 (구포동)부산광역시 북구 구포동 1060-55051-336-2988
1숙박업(일반)보림부산광역시 북구 구포만세길 6 (구포동)부산광역시 북구 구포동 1069-2051-332-1535
2숙박업(일반)산해부산광역시 북구 구포만세길 36-19 (구포동)부산광역시 북구 구포동 1060-254051-331-4488
3숙박업(일반)밀양부산광역시 북구 낙동대로1694번가길 6-1 (구포동)부산광역시 북구 구포동 1060-279051-332-3482
4숙박업(일반)청포별장부산광역시 북구 가람로 6-1 (구포동)부산광역시 북구 구포동 1054-10051-334-5726
5숙박업(일반)동원부산광역시 북구 낙동대로1694번나길 38 (구포동)부산광역시 북구 구포동 1060-78051-338-7804
6숙박업(일반)명성부산광역시 북구 구포만세길 33 (구포동)부산광역시 북구 구포동 1060-375051-332-6846
7숙박업(일반)유선게스트하우스부산광역시 북구 낙동대로1582번길 5 (구포동)부산광역시 북구 구포동 1120-6<NA>
8숙박업(일반)한일부산광역시 북구 구포시장길 20 (구포동)부산광역시 북구 구포동 589-23051-332-6628
9숙박업(일반)브이세븐(V7)부산광역시 북구 낙동대로 1666 (구포동)부산광역시 북구 구포동 1067-12051-335-9991
업종명업소명영업소 주소(도로명)영업소 주소(지번)소재지전화
88숙박업(일반)지모텔부산광역시 북구 낙동대로1762번길 11 (구포동)부산광역시 북구 구포동 518051-332-3400
89숙박업(일반)엠유(MU)부산광역시 북구 금곡대로303번길 81 (화명동)부산광역시 북구 화명동 2270-2051-363-2666
90숙박업(일반)덴바스타 키즈호텔부산광역시 북구 낙동대로1739번길 10 (구포동)부산광역시 북구 구포동 211-7051-337-9898
91숙박업(일반)용궁장부산광역시 북구 낙동대로1694번나길 35 (구포동)부산광역시 북구 구포동 253-6051-337-8382
92숙박업(일반)아바모텔부산광역시 북구 화명대로 6 (화명동)부산광역시 북구 화명동 2274-2051-365-7715
93숙박업(일반)브라운도트호텔 덕천점부산광역시 북구 금곡대로8번길 20 (덕천동)부산광역시 북구 덕천동 399-17<NA>
94숙박업(일반)본호텔부산광역시 북구 낙동대로 1746-8, 바르도호텔 (구포동)부산광역시 북구 구포동 365-2 바르도호텔<NA>
95숙박업(일반)티티호텔 구포부산광역시 북구 낙동대로 1684 (구포동)부산광역시 북구 구포동 1060-262<NA>
96숙박업(생활)제이아이모텔부산광역시 북구 낙동대로1570번가길 11 (구포동)부산광역시 북구 구포동 1184-18051-301-3061
97숙박업(생활)대림장부산광역시 북구 덕천로304번길 13 (만덕동)부산광역시 북구 만덕동 846-4<NA>