Overview

Dataset statistics

Number of variables5
Number of observations99
Missing cells7
Missing cells (%)1.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.0 KiB
Average record size in memory41.3 B

Variable types

Categorical1
Text4

Dataset

Description부산광역시북구_숙박업현황_20230112
Author부산광역시 북구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3069378

Alerts

업종명 is highly imbalanced (85.7%)Imbalance
소재지전화 has 7 (7.1%) missing valuesMissing
영업소 주소(도로명) has unique valuesUnique

Reproduction

Analysis started2023-12-10 16:09:42.446176
Analysis finished2023-12-10 16:09:43.298000
Duration0.85 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

IMBALANCE 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size924.0 B
숙박업(일반)
97 
숙박업(생활)
 
2

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row숙박업(일반)
2nd row숙박업(일반)
3rd row숙박업(일반)
4th row숙박업(일반)
5th row숙박업(일반)

Common Values

ValueCountFrequency (%)
숙박업(일반) 97
98.0%
숙박업(생활) 2
 
2.0%

Length

2023-12-11T01:09:43.357201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:09:43.442378image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
숙박업(일반 97
98.0%
숙박업(생활 2
 
2.0%
Distinct97
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Memory size924.0 B
2023-12-11T01:09:43.693157image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length10
Mean length4.7676768
Min length2

Characters and Unicode

Total characters472
Distinct characters154
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique95 ?
Unique (%)96.0%

Sample

1st row역전
2nd row보림
3rd row산해
4th row밀양
5th row청포별장
ValueCountFrequency (%)
호텔 3
 
2.8%
브라운도트호텔 3
 
2.8%
스카이모텔 2
 
1.9%
엠유(mu 2
 
1.9%
코코모텔 1
 
0.9%
아브레모텔 1
 
0.9%
1
 
0.9%
역전 1
 
0.9%
영빈장 1
 
0.9%
티파니 1
 
0.9%
Other values (92) 92
85.2%
2023-12-11T01:09:44.114541image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
47
 
10.0%
29
 
6.1%
25
 
5.3%
20
 
4.2%
13
 
2.8%
13
 
2.8%
( 10
 
2.1%
) 10
 
2.1%
9
 
1.9%
9
 
1.9%
Other values (144) 287
60.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 419
88.8%
Uppercase Letter 18
 
3.8%
Open Punctuation 10
 
2.1%
Close Punctuation 10
 
2.1%
Space Separator 9
 
1.9%
Decimal Number 5
 
1.1%
Other Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
47
 
11.2%
29
 
6.9%
25
 
6.0%
20
 
4.8%
13
 
3.1%
13
 
3.1%
9
 
2.1%
7
 
1.7%
6
 
1.4%
6
 
1.4%
Other values (127) 244
58.2%
Uppercase Letter
ValueCountFrequency (%)
W 3
16.7%
U 3
16.7%
M 2
11.1%
S 2
11.1%
O 2
11.1%
B 2
11.1%
A 2
11.1%
H 1
 
5.6%
V 1
 
5.6%
Decimal Number
ValueCountFrequency (%)
2 2
40.0%
5 1
20.0%
1 1
20.0%
7 1
20.0%
Open Punctuation
ValueCountFrequency (%)
( 10
100.0%
Close Punctuation
ValueCountFrequency (%)
) 10
100.0%
Space Separator
ValueCountFrequency (%)
9
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 419
88.8%
Common 35
 
7.4%
Latin 18
 
3.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
47
 
11.2%
29
 
6.9%
25
 
6.0%
20
 
4.8%
13
 
3.1%
13
 
3.1%
9
 
2.1%
7
 
1.7%
6
 
1.4%
6
 
1.4%
Other values (127) 244
58.2%
Latin
ValueCountFrequency (%)
W 3
16.7%
U 3
16.7%
M 2
11.1%
S 2
11.1%
O 2
11.1%
B 2
11.1%
A 2
11.1%
H 1
 
5.6%
V 1
 
5.6%
Common
ValueCountFrequency (%)
( 10
28.6%
) 10
28.6%
9
25.7%
2 2
 
5.7%
5 1
 
2.9%
& 1
 
2.9%
1 1
 
2.9%
7 1
 
2.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 419
88.8%
ASCII 53
 
11.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
47
 
11.2%
29
 
6.9%
25
 
6.0%
20
 
4.8%
13
 
3.1%
13
 
3.1%
9
 
2.1%
7
 
1.7%
6
 
1.4%
6
 
1.4%
Other values (127) 244
58.2%
ASCII
ValueCountFrequency (%)
( 10
18.9%
) 10
18.9%
9
17.0%
W 3
 
5.7%
U 3
 
5.7%
M 2
 
3.8%
S 2
 
3.8%
O 2
 
3.8%
2 2
 
3.8%
B 2
 
3.8%
Other values (7) 8
15.1%
Distinct99
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size924.0 B
2023-12-11T01:09:44.444063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length30
Mean length25.626263
Min length20

Characters and Unicode

Total characters2537
Distinct characters59
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique99 ?
Unique (%)100.0%

Sample

1st row부산광역시 북구 낙동대로1704번길 10 (구포동)
2nd row부산광역시 북구 구포만세길 6 (구포동)
3rd row부산광역시 북구 구포만세길 36-19 (구포동)
4th row부산광역시 북구 낙동대로1694번가길 6-1 (구포동)
5th row부산광역시 북구 가람로 6-1 (구포동)
ValueCountFrequency (%)
부산광역시 99
19.9%
북구 99
19.9%
구포동 57
 
11.4%
덕천동 18
 
3.6%
낙동대로 13
 
2.6%
만덕동 11
 
2.2%
화명동 11
 
2.2%
만덕고개길 9
 
1.8%
구포만세길 8
 
1.6%
금곡대로8번길 7
 
1.4%
Other values (102) 166
33.3%
2023-12-11T01:09:45.060751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
399
 
15.7%
167
 
6.6%
139
 
5.5%
102
 
4.0%
101
 
4.0%
99
 
3.9%
99
 
3.9%
99
 
3.9%
99
 
3.9%
) 99
 
3.9%
Other values (49) 1134
44.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1524
60.1%
Space Separator 399
 
15.7%
Decimal Number 395
 
15.6%
Close Punctuation 99
 
3.9%
Open Punctuation 99
 
3.9%
Dash Punctuation 16
 
0.6%
Other Punctuation 5
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
167
 
11.0%
139
 
9.1%
102
 
6.7%
101
 
6.6%
99
 
6.5%
99
 
6.5%
99
 
6.5%
99
 
6.5%
76
 
5.0%
76
 
5.0%
Other values (34) 467
30.6%
Decimal Number
ValueCountFrequency (%)
1 85
21.5%
6 49
12.4%
8 44
11.1%
7 42
10.6%
2 38
9.6%
4 35
8.9%
3 33
 
8.4%
0 24
 
6.1%
5 24
 
6.1%
9 21
 
5.3%
Space Separator
ValueCountFrequency (%)
399
100.0%
Close Punctuation
ValueCountFrequency (%)
) 99
100.0%
Open Punctuation
ValueCountFrequency (%)
( 99
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 16
100.0%
Other Punctuation
ValueCountFrequency (%)
, 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1524
60.1%
Common 1013
39.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
167
 
11.0%
139
 
9.1%
102
 
6.7%
101
 
6.6%
99
 
6.5%
99
 
6.5%
99
 
6.5%
99
 
6.5%
76
 
5.0%
76
 
5.0%
Other values (34) 467
30.6%
Common
ValueCountFrequency (%)
399
39.4%
) 99
 
9.8%
( 99
 
9.8%
1 85
 
8.4%
6 49
 
4.8%
8 44
 
4.3%
7 42
 
4.1%
2 38
 
3.8%
4 35
 
3.5%
3 33
 
3.3%
Other values (5) 90
 
8.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1524
60.1%
ASCII 1013
39.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
399
39.4%
) 99
 
9.8%
( 99
 
9.8%
1 85
 
8.4%
6 49
 
4.8%
8 44
 
4.3%
7 42
 
4.1%
2 38
 
3.8%
4 35
 
3.5%
3 33
 
3.3%
Other values (5) 90
 
8.9%
Hangul
ValueCountFrequency (%)
167
 
11.0%
139
 
9.1%
102
 
6.7%
101
 
6.6%
99
 
6.5%
99
 
6.5%
99
 
6.5%
99
 
6.5%
76
 
5.0%
76
 
5.0%
Other values (34) 467
30.6%
Distinct95
Distinct (%)96.0%
Missing0
Missing (%)0.0%
Memory size924.0 B
2023-12-11T01:09:45.406904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length27
Mean length20.757576
Min length17

Characters and Unicode

Total characters2055
Distinct characters42
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique91 ?
Unique (%)91.9%

Sample

1st row부산광역시 북구 구포동 1060-55
2nd row부산광역시 북구 구포동 1069-2
3rd row부산광역시 북구 구포동 1060-254
4th row부산광역시 북구 구포동 1060-279
5th row부산광역시 북구 구포동 1054-10
ValueCountFrequency (%)
부산광역시 99
23.7%
북구 99
23.7%
구포동 58
13.9%
덕천동 18
 
4.3%
t통b반 14
 
3.4%
만덕동 12
 
2.9%
화명동 11
 
2.6%
2274-5 2
 
0.5%
2270-1 2
 
0.5%
2275-4 2
 
0.5%
Other values (99) 100
24.0%
2023-12-11T01:09:45.921105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
417
20.3%
157
 
7.6%
99
 
4.8%
99
 
4.8%
99
 
4.8%
99
 
4.8%
99
 
4.8%
99
 
4.8%
99
 
4.8%
- 94
 
4.6%
Other values (32) 694
33.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1034
50.3%
Decimal Number 482
23.5%
Space Separator 417
20.3%
Dash Punctuation 94
 
4.6%
Uppercase Letter 28
 
1.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
157
15.2%
99
9.6%
99
9.6%
99
9.6%
99
9.6%
99
9.6%
99
9.6%
99
9.6%
58
 
5.6%
30
 
2.9%
Other values (18) 96
9.3%
Decimal Number
ValueCountFrequency (%)
1 87
18.0%
2 64
13.3%
5 56
11.6%
0 56
11.6%
3 55
11.4%
6 43
8.9%
4 41
8.5%
7 37
7.7%
8 24
 
5.0%
9 19
 
3.9%
Uppercase Letter
ValueCountFrequency (%)
T 14
50.0%
B 14
50.0%
Space Separator
ValueCountFrequency (%)
417
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 94
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1034
50.3%
Common 993
48.3%
Latin 28
 
1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
157
15.2%
99
9.6%
99
9.6%
99
9.6%
99
9.6%
99
9.6%
99
9.6%
99
9.6%
58
 
5.6%
30
 
2.9%
Other values (18) 96
9.3%
Common
ValueCountFrequency (%)
417
42.0%
- 94
 
9.5%
1 87
 
8.8%
2 64
 
6.4%
5 56
 
5.6%
0 56
 
5.6%
3 55
 
5.5%
6 43
 
4.3%
4 41
 
4.1%
7 37
 
3.7%
Other values (2) 43
 
4.3%
Latin
ValueCountFrequency (%)
T 14
50.0%
B 14
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1034
50.3%
ASCII 1021
49.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
417
40.8%
- 94
 
9.2%
1 87
 
8.5%
2 64
 
6.3%
5 56
 
5.5%
0 56
 
5.5%
3 55
 
5.4%
6 43
 
4.2%
4 41
 
4.0%
7 37
 
3.6%
Other values (4) 71
 
7.0%
Hangul
ValueCountFrequency (%)
157
15.2%
99
9.6%
99
9.6%
99
9.6%
99
9.6%
99
9.6%
99
9.6%
99
9.6%
58
 
5.6%
30
 
2.9%
Other values (18) 96
9.3%

소재지전화
Text

MISSING 

Distinct92
Distinct (%)100.0%
Missing7
Missing (%)7.1%
Memory size924.0 B
2023-12-11T01:09:46.265492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters1104
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique92 ?
Unique (%)100.0%

Sample

1st row051-336-2988
2nd row051-332-1535
3rd row051-331-4488
4th row051-332-3482
5th row051-334-5726
ValueCountFrequency (%)
051-332-6685 1
 
1.1%
051-365-1833 1
 
1.1%
051-341-5540 1
 
1.1%
051-333-6670 1
 
1.1%
051-336-6673 1
 
1.1%
051-334-5665 1
 
1.1%
051-331-4610 1
 
1.1%
051-365-0806 1
 
1.1%
051-364-4504 1
 
1.1%
051-363-6565 1
 
1.1%
Other values (82) 82
89.1%
2023-12-11T01:09:46.732839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 225
20.4%
- 184
16.7%
1 143
13.0%
5 140
12.7%
0 135
12.2%
6 65
 
5.9%
8 49
 
4.4%
2 46
 
4.2%
4 43
 
3.9%
7 39
 
3.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 920
83.3%
Dash Punctuation 184
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 225
24.5%
1 143
15.5%
5 140
15.2%
0 135
14.7%
6 65
 
7.1%
8 49
 
5.3%
2 46
 
5.0%
4 43
 
4.7%
7 39
 
4.2%
9 35
 
3.8%
Dash Punctuation
ValueCountFrequency (%)
- 184
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1104
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 225
20.4%
- 184
16.7%
1 143
13.0%
5 140
12.7%
0 135
12.2%
6 65
 
5.9%
8 49
 
4.4%
2 46
 
4.2%
4 43
 
3.9%
7 39
 
3.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1104
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 225
20.4%
- 184
16.7%
1 143
13.0%
5 140
12.7%
0 135
12.2%
6 65
 
5.9%
8 49
 
4.4%
2 46
 
4.2%
4 43
 
3.9%
7 39
 
3.5%

Correlations

2023-12-11T01:09:46.844333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종명업소명영업소 주소(도로명)영업소 주소(지번)소재지전화
업종명1.0001.0001.0001.0001.000
업소명1.0001.0001.0000.9951.000
영업소 주소(도로명)1.0001.0001.0001.0001.000
영업소 주소(지번)1.0000.9951.0001.0001.000
소재지전화1.0001.0001.0001.0001.000

Missing values

2023-12-11T01:09:43.153280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:09:43.257020image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종명업소명영업소 주소(도로명)영업소 주소(지번)소재지전화
0숙박업(일반)역전부산광역시 북구 낙동대로1704번길 10 (구포동)부산광역시 북구 구포동 1060-55051-336-2988
1숙박업(일반)보림부산광역시 북구 구포만세길 6 (구포동)부산광역시 북구 구포동 1069-2051-332-1535
2숙박업(일반)산해부산광역시 북구 구포만세길 36-19 (구포동)부산광역시 북구 구포동 1060-254051-331-4488
3숙박업(일반)밀양부산광역시 북구 낙동대로1694번가길 6-1 (구포동)부산광역시 북구 구포동 1060-279051-332-3482
4숙박업(일반)청포별장부산광역시 북구 가람로 6-1 (구포동)부산광역시 북구 구포동 1054-10051-334-5726
5숙박업(일반)동원부산광역시 북구 낙동대로1694번나길 38 (구포동)부산광역시 북구 구포동 1060-78051-338-7804
6숙박업(일반)명성부산광역시 북구 구포만세길 33 (구포동)부산광역시 북구 구포동 1060-375 T통B반051-332-6846
7숙박업(일반)유선게스트하우스부산광역시 북구 낙동대로1582번길 5 (구포동)부산광역시 북구 구포동 1120-6<NA>
8숙박업(일반)한일부산광역시 북구 구포시장길 20 (구포동)부산광역시 북구 구포동 589-23 T통B반051-332-6628
9숙박업(일반)호림장부산광역시 북구 만덕대로 46-2 (덕천동)부산광역시 북구 덕천동 384-5051-332-0495
업종명업소명영업소 주소(도로명)영업소 주소(지번)소재지전화
89숙박업(일반)지모텔부산광역시 북구 낙동대로1762번길 11 (구포동)부산광역시 북구 구포동 518051-332-3400
90숙박업(일반)엠유(MU)부산광역시 북구 금곡대로303번길 81 (화명동)부산광역시 북구 화명동 2270-2051-363-2666
91숙박업(일반)덴바스타 키즈호텔부산광역시 북구 낙동대로1739번길 10 (구포동)부산광역시 북구 구포동 211-7051-337-9898
92숙박업(일반)용궁장부산광역시 북구 낙동대로1694번나길 35 (구포동)부산광역시 북구 구포동 253-6051-337-8382
93숙박업(일반)아바모텔부산광역시 북구 화명대로 6 (화명동)부산광역시 북구 화명동 2274-2051-365-7715
94숙박업(일반)브라운도트호텔 덕천점부산광역시 북구 금곡대로8번길 20 (덕천동)부산광역시 북구 덕천동 399-17<NA>
95숙박업(일반)본호텔부산광역시 북구 낙동대로 1746-8, 바르도호텔 (구포동)부산광역시 북구 구포동 365-2 바르도호텔<NA>
96숙박업(일반)티티호텔 구포부산광역시 북구 낙동대로 1684 (구포동)부산광역시 북구 구포동 1060-262<NA>
97숙박업(생활)제이아이모텔부산광역시 북구 낙동대로1570번가길 11 (구포동)부산광역시 북구 구포동 1184-18051-301-3061
98숙박업(생활)대림장부산광역시 북구 덕천로304번길 13 (만덕동)부산광역시 북구 만덕동 846-4<NA>