Overview

Dataset statistics

Number of variables5
Number of observations167
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.7 KiB
Average record size in memory40.8 B

Variable types

Categorical1
Text4

Dataset

Description대구광역시 동구 관내의 숙박, 호텔 등의 공중위생업소 현황 데이터 입니다. 이 데이터는 업소명, 주소, 연락처 등의 항목으로 구성되어 있습니다.
Author대구광역시 동구
URLhttps://www.data.go.kr/data/3055360/fileData.do

Alerts

업종명 is highly imbalanced (80.6%)Imbalance

Reproduction

Analysis started2024-04-06 08:42:28.270876
Analysis finished2024-04-06 08:42:29.569958
Duration1.3 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

IMBALANCE 

Distinct2
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
숙박업(일반)
162 
숙박업(생활)
 
5

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row숙박업(일반)
2nd row숙박업(일반)
3rd row숙박업(일반)
4th row숙박업(일반)
5th row숙박업(일반)

Common Values

ValueCountFrequency (%)
숙박업(일반) 162
97.0%
숙박업(생활) 5
 
3.0%

Length

2024-04-06T17:42:29.817133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:42:30.120439image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
숙박업(일반 162
97.0%
숙박업(생활 5
 
3.0%
Distinct166
Distinct (%)99.4%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2024-04-06T17:42:30.729253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length25
Mean length6.0598802
Min length1

Characters and Unicode

Total characters1012
Distinct characters237
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique165 ?
Unique (%)98.8%

Sample

1st row장원장여관
2nd row신화여관
3rd row영빈모텔
4th row동덕여인숙
5th row서울모텔
ValueCountFrequency (%)
호텔 5
 
2.5%
hotel 4
 
2.0%
황금모텔 2
 
1.0%
하운드 2
 
1.0%
대구 2
 
1.0%
동대구역점 2
 
1.0%
22 2
 
1.0%
체리쉬호텔 1
 
0.5%
이스턴호텔 1
 
0.5%
에이치에비뉴(h.avenue 1
 
0.5%
Other values (177) 177
88.9%
2024-04-06T17:42:31.889164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
104
 
10.3%
61
 
6.0%
46
 
4.5%
32
 
3.2%
31
 
3.1%
26
 
2.6%
26
 
2.6%
25
 
2.5%
23
 
2.3%
( 20
 
2.0%
Other values (227) 618
61.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 796
78.7%
Uppercase Letter 84
 
8.3%
Lowercase Letter 37
 
3.7%
Space Separator 32
 
3.2%
Decimal Number 21
 
2.1%
Open Punctuation 20
 
2.0%
Close Punctuation 20
 
2.0%
Other Punctuation 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
104
 
13.1%
61
 
7.7%
46
 
5.8%
31
 
3.9%
26
 
3.3%
26
 
3.3%
25
 
3.1%
23
 
2.9%
13
 
1.6%
12
 
1.5%
Other values (182) 429
53.9%
Uppercase Letter
ValueCountFrequency (%)
T 9
10.7%
O 8
9.5%
A 8
9.5%
S 7
 
8.3%
E 7
 
8.3%
H 7
 
8.3%
L 6
 
7.1%
M 5
 
6.0%
Y 5
 
6.0%
K 4
 
4.8%
Other values (11) 18
21.4%
Lowercase Letter
ValueCountFrequency (%)
e 5
13.5%
o 5
13.5%
t 4
10.8%
l 4
10.8%
i 3
8.1%
n 3
8.1%
d 2
 
5.4%
m 2
 
5.4%
g 2
 
5.4%
u 2
 
5.4%
Other values (5) 5
13.5%
Decimal Number
ValueCountFrequency (%)
2 15
71.4%
1 2
 
9.5%
0 2
 
9.5%
5 1
 
4.8%
7 1
 
4.8%
Space Separator
ValueCountFrequency (%)
32
100.0%
Open Punctuation
ValueCountFrequency (%)
( 20
100.0%
Close Punctuation
ValueCountFrequency (%)
) 20
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 796
78.7%
Latin 121
 
12.0%
Common 95
 
9.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
104
 
13.1%
61
 
7.7%
46
 
5.8%
31
 
3.9%
26
 
3.3%
26
 
3.3%
25
 
3.1%
23
 
2.9%
13
 
1.6%
12
 
1.5%
Other values (182) 429
53.9%
Latin
ValueCountFrequency (%)
T 9
 
7.4%
O 8
 
6.6%
A 8
 
6.6%
S 7
 
5.8%
E 7
 
5.8%
H 7
 
5.8%
L 6
 
5.0%
e 5
 
4.1%
M 5
 
4.1%
o 5
 
4.1%
Other values (26) 54
44.6%
Common
ValueCountFrequency (%)
32
33.7%
( 20
21.1%
) 20
21.1%
2 15
15.8%
. 2
 
2.1%
1 2
 
2.1%
0 2
 
2.1%
5 1
 
1.1%
7 1
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 796
78.7%
ASCII 216
 
21.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
104
 
13.1%
61
 
7.7%
46
 
5.8%
31
 
3.9%
26
 
3.3%
26
 
3.3%
25
 
3.1%
23
 
2.9%
13
 
1.6%
12
 
1.5%
Other values (182) 429
53.9%
ASCII
ValueCountFrequency (%)
32
 
14.8%
( 20
 
9.3%
) 20
 
9.3%
2 15
 
6.9%
T 9
 
4.2%
O 8
 
3.7%
A 8
 
3.7%
S 7
 
3.2%
E 7
 
3.2%
H 7
 
3.2%
Other values (35) 83
38.4%
Distinct166
Distinct (%)99.4%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2024-04-06T17:42:32.688774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length60
Median length58
Mean length25.053892
Min length20

Characters and Unicode

Total characters4184
Distinct characters117
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique165 ?
Unique (%)98.8%

Sample

1st row대구광역시 동구 아양로 294 (입석동)
2nd row대구광역시 동구 아양로38길 5-2 (효목동)
3rd row대구광역시 동구 동부로32길 28 (신천동)
4th row대구광역시 동구 아양로 75-2 (신암동)
5th row대구광역시 동구 동부로28길 15 (신천동)
ValueCountFrequency (%)
대구광역시 167
19.3%
동구 167
19.3%
신천동 50
 
5.8%
신암동 26
 
3.0%
효목동 23
 
2.7%
동부로26길 16
 
1.8%
용수동 10
 
1.2%
신암남로 10
 
1.2%
팔공산로185길 9
 
1.0%
동부로30길 9
 
1.0%
Other values (212) 380
43.8%
2024-04-06T17:42:34.220464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
701
16.8%
418
 
10.0%
342
 
8.2%
181
 
4.3%
) 168
 
4.0%
( 168
 
4.0%
167
 
4.0%
167
 
4.0%
167
 
4.0%
167
 
4.0%
Other values (107) 1538
36.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2395
57.2%
Space Separator 701
 
16.8%
Decimal Number 664
 
15.9%
Close Punctuation 168
 
4.0%
Open Punctuation 168
 
4.0%
Dash Punctuation 49
 
1.2%
Other Punctuation 26
 
0.6%
Uppercase Letter 9
 
0.2%
Math Symbol 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
418
17.5%
342
14.3%
181
 
7.6%
167
 
7.0%
167
 
7.0%
167
 
7.0%
167
 
7.0%
102
 
4.3%
94
 
3.9%
53
 
2.2%
Other values (81) 537
22.4%
Decimal Number
ValueCountFrequency (%)
1 135
20.3%
2 124
18.7%
3 81
12.2%
6 65
9.8%
5 56
8.4%
8 55
8.3%
4 46
 
6.9%
0 44
 
6.6%
7 38
 
5.7%
9 20
 
3.0%
Uppercase Letter
ValueCountFrequency (%)
T 2
22.2%
H 1
11.1%
O 1
11.1%
E 1
11.1%
L 1
11.1%
S 1
11.1%
A 1
11.1%
Y 1
11.1%
Other Punctuation
ValueCountFrequency (%)
, 23
88.5%
. 2
 
7.7%
' 1
 
3.8%
Space Separator
ValueCountFrequency (%)
701
100.0%
Close Punctuation
ValueCountFrequency (%)
) 168
100.0%
Open Punctuation
ValueCountFrequency (%)
( 168
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 49
100.0%
Math Symbol
ValueCountFrequency (%)
~ 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2395
57.2%
Common 1780
42.5%
Latin 9
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
418
17.5%
342
14.3%
181
 
7.6%
167
 
7.0%
167
 
7.0%
167
 
7.0%
167
 
7.0%
102
 
4.3%
94
 
3.9%
53
 
2.2%
Other values (81) 537
22.4%
Common
ValueCountFrequency (%)
701
39.4%
) 168
 
9.4%
( 168
 
9.4%
1 135
 
7.6%
2 124
 
7.0%
3 81
 
4.6%
6 65
 
3.7%
5 56
 
3.1%
8 55
 
3.1%
- 49
 
2.8%
Other values (8) 178
 
10.0%
Latin
ValueCountFrequency (%)
T 2
22.2%
H 1
11.1%
O 1
11.1%
E 1
11.1%
L 1
11.1%
S 1
11.1%
A 1
11.1%
Y 1
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2395
57.2%
ASCII 1789
42.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
701
39.2%
) 168
 
9.4%
( 168
 
9.4%
1 135
 
7.5%
2 124
 
6.9%
3 81
 
4.5%
6 65
 
3.6%
5 56
 
3.1%
8 55
 
3.1%
- 49
 
2.7%
Other values (16) 187
 
10.5%
Hangul
ValueCountFrequency (%)
418
17.5%
342
14.3%
181
 
7.6%
167
 
7.0%
167
 
7.0%
167
 
7.0%
167
 
7.0%
102
 
4.3%
94
 
3.9%
53
 
2.2%
Other values (81) 537
22.4%
Distinct166
Distinct (%)99.4%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2024-04-06T17:42:35.294865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length42
Mean length19.706587
Min length17

Characters and Unicode

Total characters3291
Distinct characters74
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique165 ?
Unique (%)98.8%

Sample

1st row대구광역시 동구 입석동 999-1
2nd row대구광역시 동구 효목동 960-22
3rd row대구광역시 동구 신천동 382-4
4th row대구광역시 동구 신암동 603-165
5th row대구광역시 동구 신천동 334-16
ValueCountFrequency (%)
대구광역시 167
24.3%
동구 167
24.3%
신천동 52
 
7.6%
신암동 28
 
4.1%
효목동 25
 
3.6%
용수동 10
 
1.5%
상매동 9
 
1.3%
입석동 7
 
1.0%
중대동 6
 
0.9%
지묘동 5
 
0.7%
Other values (190) 212
30.8%
2024-04-06T17:42:36.929472image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
688
20.9%
335
 
10.2%
335
 
10.2%
174
 
5.3%
167
 
5.1%
167
 
5.1%
167
 
5.1%
- 154
 
4.7%
3 117
 
3.6%
1 112
 
3.4%
Other values (64) 875
26.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1714
52.1%
Decimal Number 727
22.1%
Space Separator 688
20.9%
Dash Punctuation 154
 
4.7%
Other Punctuation 6
 
0.2%
Close Punctuation 1
 
< 0.1%
Open Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
335
19.5%
335
19.5%
174
10.2%
167
9.7%
167
9.7%
167
9.7%
84
 
4.9%
52
 
3.0%
28
 
1.6%
25
 
1.5%
Other values (47) 180
10.5%
Decimal Number
ValueCountFrequency (%)
3 117
16.1%
1 112
15.4%
2 98
13.5%
0 71
9.8%
6 67
9.2%
5 65
8.9%
9 58
8.0%
4 53
7.3%
7 51
7.0%
8 35
 
4.8%
Other Punctuation
ValueCountFrequency (%)
, 3
50.0%
. 2
33.3%
' 1
 
16.7%
Space Separator
ValueCountFrequency (%)
688
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 154
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1714
52.1%
Common 1577
47.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
335
19.5%
335
19.5%
174
10.2%
167
9.7%
167
9.7%
167
9.7%
84
 
4.9%
52
 
3.0%
28
 
1.6%
25
 
1.5%
Other values (47) 180
10.5%
Common
ValueCountFrequency (%)
688
43.6%
- 154
 
9.8%
3 117
 
7.4%
1 112
 
7.1%
2 98
 
6.2%
0 71
 
4.5%
6 67
 
4.2%
5 65
 
4.1%
9 58
 
3.7%
4 53
 
3.4%
Other values (7) 94
 
6.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1714
52.1%
ASCII 1577
47.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
688
43.6%
- 154
 
9.8%
3 117
 
7.4%
1 112
 
7.1%
2 98
 
6.2%
0 71
 
4.5%
6 67
 
4.2%
5 65
 
4.1%
9 58
 
3.7%
4 53
 
3.4%
Other values (7) 94
 
6.0%
Hangul
ValueCountFrequency (%)
335
19.5%
335
19.5%
174
10.2%
167
9.7%
167
9.7%
167
9.7%
84
 
4.9%
52
 
3.0%
28
 
1.6%
25
 
1.5%
Other values (47) 180
10.5%
Distinct164
Distinct (%)98.2%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2024-04-06T17:42:37.536183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length12.011976
Min length12

Characters and Unicode

Total characters2006
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique162 ?
Unique (%)97.0%

Sample

1st row053-984-7667
2nd row053-742-1730
3rd row053-755-9244
4th row053-941-9379
5th row053-743-7613
ValueCountFrequency (%)
053-000-0000 3
 
1.8%
053-986-8201 2
 
1.2%
053-752-7272 1
 
0.6%
053-942-9475 1
 
0.6%
053-986-1173 1
 
0.6%
053-958-6373 1
 
0.6%
053-984-7667 1
 
0.6%
053-744-8917 1
 
0.6%
053-952-7454 1
 
0.6%
053-951-2364 1
 
0.6%
Other values (154) 154
92.2%
2024-04-06T17:42:38.543633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 334
16.7%
5 318
15.9%
0 293
14.6%
3 289
14.4%
9 152
7.6%
4 120
 
6.0%
7 119
 
5.9%
2 109
 
5.4%
8 108
 
5.4%
6 84
 
4.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1672
83.3%
Dash Punctuation 334
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 318
19.0%
0 293
17.5%
3 289
17.3%
9 152
9.1%
4 120
 
7.2%
7 119
 
7.1%
2 109
 
6.5%
8 108
 
6.5%
6 84
 
5.0%
1 80
 
4.8%
Dash Punctuation
ValueCountFrequency (%)
- 334
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2006
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 334
16.7%
5 318
15.9%
0 293
14.6%
3 289
14.4%
9 152
7.6%
4 120
 
6.0%
7 119
 
5.9%
2 109
 
5.4%
8 108
 
5.4%
6 84
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2006
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 334
16.7%
5 318
15.9%
0 293
14.6%
3 289
14.4%
9 152
7.6%
4 120
 
6.0%
7 119
 
5.9%
2 109
 
5.4%
8 108
 
5.4%
6 84
 
4.2%

Missing values

2024-04-06T17:42:29.020466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-06T17:42:29.414443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종명업소명영업소 주소(도로명)영업소 주소(지번)소재지전화
0숙박업(일반)장원장여관대구광역시 동구 아양로 294 (입석동)대구광역시 동구 입석동 999-1053-984-7667
1숙박업(일반)신화여관대구광역시 동구 아양로38길 5-2 (효목동)대구광역시 동구 효목동 960-22053-742-1730
2숙박업(일반)영빈모텔대구광역시 동구 동부로32길 28 (신천동)대구광역시 동구 신천동 382-4053-755-9244
3숙박업(일반)동덕여인숙대구광역시 동구 아양로 75-2 (신암동)대구광역시 동구 신암동 603-165053-941-9379
4숙박업(일반)서울모텔대구광역시 동구 동부로28길 15 (신천동)대구광역시 동구 신천동 334-16053-743-7613
5숙박업(일반)청수여관대구광역시 동구 반야월로 174 (신기동)대구광역시 동구 신기동 15-2053-962-5859
6숙박업(일반)신모텔대구광역시 동구 신암남로 109 (신암동)대구광역시 동구 신암동 259-42053-955-5277
7숙박업(일반)현대장여관대구광역시 동구 동부로 65 (신천동)대구광역시 동구 신천동 23-1053-752-3200
8숙박업(일반)동양장여관대구광역시 동구 동부로30길 6 (신천동)대구광역시 동구 신천동 330-7053-755-2429
9숙박업(일반)오케이(OK)모텔대구광역시 동구 동부로28길 15-1 (신천동)대구광역시 동구 신천동 334-10053-742-9035
업종명업소명영업소 주소(도로명)영업소 주소(지번)소재지전화
157숙박업(일반)제이비관광호텔(JB TOURIST HOTEL)대구광역시 동구 율암로 162, 1~7층층 (상매동)대구광역시 동구 상매동 506-6053-964-2000
158숙박업(일반)제이비한옥호텔(JBHANOKHOTEL)대구광역시 동구 율암로 156-13, 1~3층 (상매동)대구광역시 동구 상매동 506-3053-000-0000
159숙박업(일반)호텔골든캐프대구광역시 동구 율암로 156-28, 호텔골든캐프 (상매동)대구광역시 동구 상매동 5050507-1440-9067
160숙박업(일반)호텔루나대구광역시 동구 팔공산로185길 33-4 (용수동)대구광역시 동구 용수동 67-26053-982-8037
161숙박업(일반)동아장여관대구광역시 동구 효목로 34, 3층 (효목동)대구광역시 동구 효목동 557-2053-000-0000
162숙박업(생활)애플호텔펜션대구광역시 동구 팔공산로185길 33-6 (용수동)대구광역시 동구 용수동 67-25053-983-0809
163숙박업(생활)대구광역시 동구 팔공로 525 (지묘동)대구광역시 동구 지묘동 85-1053-984-0033
164숙박업(생활)팔공펜션대구광역시 동구 팔공산로185길 35 (용수동)대구광역시 동구 용수동 67-28053-981-6688
165숙박업(생활)스파펜션링스대구광역시 동구 팔공산로185길 39 (용수동)대구광역시 동구 용수동 59-18053-981-3321
166숙박업(생활)와이컬렉션 by UH FLAT 대구대구광역시 동구 동부로26길 6, 대구 메리어트 호텔 및 서비스드 레지던스 12~23층 41개호 (신천동)대구광역시 동구 신천동 326-1 대구 메리어트 호텔 및 서비스드 레지던스053-746-2288