Overview

Dataset statistics

Number of variables4
Number of observations83
Missing cells3
Missing cells (%)0.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.7 KiB
Average record size in memory33.6 B

Variable types

Categorical1
Text3

Dataset

Description대구광역시 수성구 관광숙박업 현황 (2018. 8월 기준)
Author대구광역시 수성구
URLhttp://data.daegu.go.kr/open/data/dataView.do?dataSetId=15054707&dataSetDetailId=150547072cc8d25296a27&provdMethod=FILE

Alerts

업종명 is highly imbalanced (90.6%)Imbalance
소재지전화 has 3 (3.6%) missing valuesMissing
업소명 has unique valuesUnique
업소소재지(도로명) has unique valuesUnique

Reproduction

Analysis started2024-04-17 10:27:05.568896
Analysis finished2024-04-17 10:27:05.882136
Duration0.31 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

IMBALANCE 

Distinct2
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Memory size796.0 B
숙박업(일반)
82 
숙박업(생활)
 
1

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique1 ?
Unique (%)1.2%

Sample

1st row숙박업(일반)
2nd row숙박업(일반)
3rd row숙박업(일반)
4th row숙박업(일반)
5th row숙박업(일반)

Common Values

ValueCountFrequency (%)
숙박업(일반) 82
98.8%
숙박업(생활) 1
 
1.2%

Length

2024-04-17T19:27:05.928543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T19:27:06.000911image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
숙박업(일반 82
98.8%
숙박업(생활 1
 
1.2%

업소명
Text

UNIQUE 

Distinct83
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size796.0 B
2024-04-17T19:27:06.166341image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length18
Mean length5.2771084
Min length1

Characters and Unicode

Total characters438
Distinct characters161
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique83 ?
Unique (%)100.0%

Sample

1st row호텔아리아나
2nd row오렌지모텔
3rd row(주)대구그랜드호텔
4th row송림장
5th row유림여관
ValueCountFrequency (%)
호텔아리아나 1
 
1.1%
꾸띠모텔 1
 
1.1%
젠모텔 1
 
1.1%
체리시호텔 1
 
1.1%
노블레스여관 1
 
1.1%
엘레강스 1
 
1.1%
렉스모텔 1
 
1.1%
드라마 1
 
1.1%
베르사체 1
 
1.1%
폭스모텔 1
 
1.1%
Other values (77) 77
88.5%
2024-04-17T19:27:06.465954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
51
 
11.6%
35
 
8.0%
15
 
3.4%
( 13
 
3.0%
) 13
 
3.0%
10
 
2.3%
9
 
2.1%
9
 
2.1%
6
 
1.4%
6
 
1.4%
Other values (151) 271
61.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 338
77.2%
Uppercase Letter 55
 
12.6%
Open Punctuation 13
 
3.0%
Close Punctuation 13
 
3.0%
Decimal Number 8
 
1.8%
Lowercase Letter 5
 
1.1%
Space Separator 4
 
0.9%
Dash Punctuation 1
 
0.2%
Other Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
51
 
15.1%
35
 
10.4%
15
 
4.4%
10
 
3.0%
9
 
2.7%
9
 
2.7%
6
 
1.8%
6
 
1.8%
5
 
1.5%
5
 
1.5%
Other values (118) 187
55.3%
Uppercase Letter
ValueCountFrequency (%)
O 5
 
9.1%
M 5
 
9.1%
S 5
 
9.1%
U 4
 
7.3%
T 4
 
7.3%
I 3
 
5.5%
E 3
 
5.5%
L 3
 
5.5%
P 3
 
5.5%
B 3
 
5.5%
Other values (10) 17
30.9%
Decimal Number
ValueCountFrequency (%)
2 5
62.5%
3 1
 
12.5%
5 1
 
12.5%
6 1
 
12.5%
Lowercase Letter
ValueCountFrequency (%)
o 2
40.0%
u 1
20.0%
s 1
20.0%
g 1
20.0%
Open Punctuation
ValueCountFrequency (%)
( 13
100.0%
Close Punctuation
ValueCountFrequency (%)
) 13
100.0%
Space Separator
ValueCountFrequency (%)
4
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 338
77.2%
Latin 60
 
13.7%
Common 40
 
9.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
51
 
15.1%
35
 
10.4%
15
 
4.4%
10
 
3.0%
9
 
2.7%
9
 
2.7%
6
 
1.8%
6
 
1.8%
5
 
1.5%
5
 
1.5%
Other values (118) 187
55.3%
Latin
ValueCountFrequency (%)
O 5
 
8.3%
M 5
 
8.3%
S 5
 
8.3%
U 4
 
6.7%
T 4
 
6.7%
I 3
 
5.0%
E 3
 
5.0%
L 3
 
5.0%
P 3
 
5.0%
B 3
 
5.0%
Other values (14) 22
36.7%
Common
ValueCountFrequency (%)
( 13
32.5%
) 13
32.5%
2 5
 
12.5%
4
 
10.0%
3 1
 
2.5%
5 1
 
2.5%
6 1
 
2.5%
- 1
 
2.5%
. 1
 
2.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 338
77.2%
ASCII 100
 
22.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
51
 
15.1%
35
 
10.4%
15
 
4.4%
10
 
3.0%
9
 
2.7%
9
 
2.7%
6
 
1.8%
6
 
1.8%
5
 
1.5%
5
 
1.5%
Other values (118) 187
55.3%
ASCII
ValueCountFrequency (%)
( 13
 
13.0%
) 13
 
13.0%
O 5
 
5.0%
M 5
 
5.0%
S 5
 
5.0%
2 5
 
5.0%
U 4
 
4.0%
4
 
4.0%
T 4
 
4.0%
I 3
 
3.0%
Other values (23) 39
39.0%
Distinct83
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size796.0 B
2024-04-17T19:27:06.683902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length33
Mean length24.939759
Min length20

Characters and Unicode

Total characters2070
Distinct characters55
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique83 ?
Unique (%)100.0%

Sample

1st row대구광역시 수성구 동대구로 27 (두산동)
2nd row대구광역시 수성구 용학로25길 14 (두산동)
3rd row대구광역시 수성구 동대구로 305 (범어동)
4th row대구광역시 수성구 수성로 4 (상동)
5th row대구광역시 수성구 신천동로 268-1 (중동)
ValueCountFrequency (%)
대구광역시 83
19.6%
수성구 83
19.6%
두산동 34
 
8.0%
황금동 23
 
5.4%
청수로26길 13
 
3.1%
동대구로25길 11
 
2.6%
동대구로 9
 
2.1%
범어동 8
 
1.9%
청수로25길 7
 
1.7%
수성로 7
 
1.7%
Other values (97) 146
34.4%
2024-04-17T19:27:07.038378image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
344
16.6%
191
 
9.2%
115
 
5.6%
111
 
5.4%
108
 
5.2%
91
 
4.4%
2 86
 
4.2%
) 83
 
4.0%
83
 
4.0%
83
 
4.0%
Other values (45) 775
37.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1232
59.5%
Space Separator 344
 
16.6%
Decimal Number 309
 
14.9%
Close Punctuation 83
 
4.0%
Open Punctuation 83
 
4.0%
Dash Punctuation 18
 
0.9%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
191
15.5%
115
9.3%
111
9.0%
108
8.8%
91
 
7.4%
83
 
6.7%
83
 
6.7%
83
 
6.7%
81
 
6.6%
45
 
3.7%
Other values (30) 241
19.6%
Decimal Number
ValueCountFrequency (%)
2 86
27.8%
1 55
17.8%
5 43
13.9%
6 35
11.3%
4 26
 
8.4%
3 20
 
6.5%
8 15
 
4.9%
0 14
 
4.5%
7 12
 
3.9%
9 3
 
1.0%
Space Separator
ValueCountFrequency (%)
344
100.0%
Close Punctuation
ValueCountFrequency (%)
) 83
100.0%
Open Punctuation
ValueCountFrequency (%)
( 83
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 18
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1232
59.5%
Common 838
40.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
191
15.5%
115
9.3%
111
9.0%
108
8.8%
91
 
7.4%
83
 
6.7%
83
 
6.7%
83
 
6.7%
81
 
6.6%
45
 
3.7%
Other values (30) 241
19.6%
Common
ValueCountFrequency (%)
344
41.1%
2 86
 
10.3%
) 83
 
9.9%
( 83
 
9.9%
1 55
 
6.6%
5 43
 
5.1%
6 35
 
4.2%
4 26
 
3.1%
3 20
 
2.4%
- 18
 
2.1%
Other values (5) 45
 
5.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1232
59.5%
ASCII 838
40.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
344
41.1%
2 86
 
10.3%
) 83
 
9.9%
( 83
 
9.9%
1 55
 
6.6%
5 43
 
5.1%
6 35
 
4.2%
4 26
 
3.1%
3 20
 
2.4%
- 18
 
2.1%
Other values (5) 45
 
5.4%
Hangul
ValueCountFrequency (%)
191
15.5%
115
9.3%
111
9.0%
108
8.8%
91
 
7.4%
83
 
6.7%
83
 
6.7%
83
 
6.7%
81
 
6.6%
45
 
3.7%
Other values (30) 241
19.6%

소재지전화
Text

MISSING 

Distinct79
Distinct (%)98.8%
Missing3
Missing (%)3.6%
Memory size796.0 B
2024-04-17T19:27:07.231895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters960
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique78 ?
Unique (%)97.5%

Sample

1st row053-765-7776
2nd row053-765-3888
3rd row053-742-0001
4th row053-768-6662
5th row053-763-4657
ValueCountFrequency (%)
053-765-3888 2
 
2.5%
053-766-7700 1
 
1.2%
053-765-9753 1
 
1.2%
053-766-1121 1
 
1.2%
053-764-1790 1
 
1.2%
053-762-6373 1
 
1.2%
053-763-1137 1
 
1.2%
053-764-1777 1
 
1.2%
053-768-3360 1
 
1.2%
053-761-1101 1
 
1.2%
Other values (69) 69
86.2%
2024-04-17T19:27:07.516924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 160
16.7%
0 146
15.2%
5 135
14.1%
3 120
12.5%
7 115
12.0%
6 109
11.4%
1 52
 
5.4%
8 44
 
4.6%
4 32
 
3.3%
2 29
 
3.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 800
83.3%
Dash Punctuation 160
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 146
18.2%
5 135
16.9%
3 120
15.0%
7 115
14.4%
6 109
13.6%
1 52
 
6.5%
8 44
 
5.5%
4 32
 
4.0%
2 29
 
3.6%
9 18
 
2.2%
Dash Punctuation
ValueCountFrequency (%)
- 160
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 960
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 160
16.7%
0 146
15.2%
5 135
14.1%
3 120
12.5%
7 115
12.0%
6 109
11.4%
1 52
 
5.4%
8 44
 
4.6%
4 32
 
3.3%
2 29
 
3.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 960
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 160
16.7%
0 146
15.2%
5 135
14.1%
3 120
12.5%
7 115
12.0%
6 109
11.4%
1 52
 
5.4%
8 44
 
4.6%
4 32
 
3.3%
2 29
 
3.0%

Correlations

2024-04-17T19:27:07.597887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종명업소명업소소재지(도로명)소재지전화
업종명1.0001.0001.0001.000
업소명1.0001.0001.0001.000
업소소재지(도로명)1.0001.0001.0001.000
소재지전화1.0001.0001.0001.000

Missing values

2024-04-17T19:27:05.793713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-17T19:27:05.856647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종명업소명업소소재지(도로명)소재지전화
0숙박업(일반)호텔아리아나대구광역시 수성구 동대구로 27 (두산동)053-765-7776
1숙박업(일반)오렌지모텔대구광역시 수성구 용학로25길 14 (두산동)053-765-3888
2숙박업(일반)(주)대구그랜드호텔대구광역시 수성구 동대구로 305 (범어동)053-742-0001
3숙박업(일반)송림장대구광역시 수성구 수성로 4 (상동)053-768-6662
4숙박업(일반)유림여관대구광역시 수성구 신천동로 268-1 (중동)053-763-4657
5숙박업(일반)리버사이드모텔대구광역시 수성구 신천동로 34 (상동)053-764-0466
6숙박업(일반)더썸모텔대구광역시 수성구 용학로 141 (두산동)053-782-8500
7숙박업(일반)보잉호텔수성대구광역시 수성구 희망로 221 (황금동)053-764-1155
8숙박업(일반)원빈장대구광역시 수성구 화랑로 202 (만촌동)053-954-9945
9숙박업(일반)석경탕여관대구광역시 수성구 수성로 224 (중동)053-765-2003
업종명업소명업소소재지(도로명)소재지전화
73숙박업(일반)황금호텔대구광역시 수성구 동대구로 115 (황금동 6층)053-766-8012
74숙박업(일반)대구광역시 수성구 청수로25길 22 (황금동)053-761-0856
75숙박업(일반)대구광역시 수성구 동대구로15길 30 (두산동)053-761-2273
76숙박업(일반)힙모텔대구광역시 수성구 청수로24길 87 (두산동)053-768-6660
77숙박업(일반)지(G)모텔대구광역시 수성구 동대구로15길 28 (두산동)053-765-0038
78숙박업(일반)호텔라온제나대구광역시 수성구 범어천로 73 10~14층 (범어동)053-756-6700
79숙박업(일반)애플모텔대구광역시 수성구 청수로26길 22 (두산동)<NA>
80숙박업(일반)지지모텔대구광역시 수성구 동대구로25길 24-6 (황금동)<NA>
81숙박업(일반)제이비프리미엄(JBPREMIUM)대구광역시 수성구 청수로26길 62 2 3층 (두산동)053-768-2202
82숙박업(생활)퀸즈텔대구광역시 수성구 청수로26길 56 (두산동)053-763-4266