Overview

Dataset statistics

Number of variables5
Number of observations216
Missing cells11
Missing cells (%)1.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.6 KiB
Average record size in memory40.6 B

Variable types

Categorical1
Text3
DateTime1

Dataset

Description대구광역시 수성구 관광업소현황_20180831
Author대구광역시 수성구
URLhttp://data.daegu.go.kr/open/data/dataView.do?dataSetId=3070206&dataSetDetailId=30702061c08092d10301_201909091328&provdMethod=FILE

Alerts

데이터기준일 has constant value ""Constant
전화번호 has 11 (5.1%) missing valuesMissing

Reproduction

Analysis started2024-04-19 05:52:15.894669
Analysis finished2024-04-19 05:52:16.297041
Duration0.4 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종
Categorical

Distinct4
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
숙박업(일반)
87 
국내여행업
53 
국외여행업
53 
일반여행업
23 

Length

Max length7
Median length5
Mean length5.8055556
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국내여행업
2nd row국내여행업
3rd row국내여행업
4th row국내여행업
5th row국내여행업

Common Values

ValueCountFrequency (%)
숙박업(일반) 87
40.3%
국내여행업 53
24.5%
국외여행업 53
24.5%
일반여행업 23
 
10.6%

Length

2024-04-19T14:52:16.391543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-19T14:52:16.544685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
숙박업(일반 87
40.3%
국내여행업 53
24.5%
국외여행업 53
24.5%
일반여행업 23
 
10.6%

상호
Text

Distinct182
Distinct (%)84.3%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
2024-04-19T14:52:16.826732image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length15
Mean length7.0925926
Min length1

Characters and Unicode

Total characters1532
Distinct characters248
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique148 ?
Unique (%)68.5%

Sample

1st row(주)유성관광여행사
2nd row(주)세일관광여행사
3rd row(주)여행연합
4th row(주)경일항공여행사
5th row동화항공여행주식회사
ValueCountFrequency (%)
주식회사 18
 
6.9%
골프 3
 
1.2%
여행사 3
 
1.2%
투어 3
 
1.2%
주)유성관광여행사 2
 
0.8%
아름다운 2
 
0.8%
사람과 2
 
0.8%
2
 
0.8%
주)허브여행 2
 
0.8%
주)우리들여행사 2
 
0.8%
Other values (188) 221
85.0%
2024-04-19T14:52:17.213696image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
101
 
6.6%
( 89
 
5.8%
) 89
 
5.8%
64
 
4.2%
56
 
3.7%
53
 
3.5%
51
 
3.3%
44
 
2.9%
42
 
2.7%
38
 
2.5%
Other values (238) 905
59.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1249
81.5%
Open Punctuation 89
 
5.8%
Close Punctuation 89
 
5.8%
Uppercase Letter 47
 
3.1%
Space Separator 44
 
2.9%
Decimal Number 8
 
0.5%
Other Symbol 2
 
0.1%
Lowercase Letter 2
 
0.1%
Dash Punctuation 1
 
0.1%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
101
 
8.1%
64
 
5.1%
56
 
4.5%
53
 
4.2%
51
 
4.1%
42
 
3.4%
38
 
3.0%
38
 
3.0%
37
 
3.0%
34
 
2.7%
Other values (205) 735
58.8%
Uppercase Letter
ValueCountFrequency (%)
O 5
 
10.6%
S 5
 
10.6%
T 4
 
8.5%
M 3
 
6.4%
U 3
 
6.4%
L 3
 
6.4%
A 2
 
4.3%
G 2
 
4.3%
J 2
 
4.3%
K 2
 
4.3%
Other values (11) 16
34.0%
Decimal Number
ValueCountFrequency (%)
2 5
62.5%
5 1
 
12.5%
6 1
 
12.5%
3 1
 
12.5%
Lowercase Letter
ValueCountFrequency (%)
o 1
50.0%
g 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 89
100.0%
Close Punctuation
ValueCountFrequency (%)
) 89
100.0%
Space Separator
ValueCountFrequency (%)
44
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1251
81.7%
Common 232
 
15.1%
Latin 49
 
3.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
101
 
8.1%
64
 
5.1%
56
 
4.5%
53
 
4.2%
51
 
4.1%
42
 
3.4%
38
 
3.0%
38
 
3.0%
37
 
3.0%
34
 
2.7%
Other values (206) 737
58.9%
Latin
ValueCountFrequency (%)
O 5
 
10.2%
S 5
 
10.2%
T 4
 
8.2%
M 3
 
6.1%
U 3
 
6.1%
L 3
 
6.1%
A 2
 
4.1%
G 2
 
4.1%
J 2
 
4.1%
K 2
 
4.1%
Other values (13) 18
36.7%
Common
ValueCountFrequency (%)
( 89
38.4%
) 89
38.4%
44
19.0%
2 5
 
2.2%
- 1
 
0.4%
5 1
 
0.4%
6 1
 
0.4%
3 1
 
0.4%
. 1
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1249
81.5%
ASCII 281
 
18.3%
None 2
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
101
 
8.1%
64
 
5.1%
56
 
4.5%
53
 
4.2%
51
 
4.1%
42
 
3.4%
38
 
3.0%
38
 
3.0%
37
 
3.0%
34
 
2.7%
Other values (205) 735
58.8%
ASCII
ValueCountFrequency (%)
( 89
31.7%
) 89
31.7%
44
15.7%
2 5
 
1.8%
O 5
 
1.8%
S 5
 
1.8%
T 4
 
1.4%
M 3
 
1.1%
U 3
 
1.1%
L 3
 
1.1%
Other values (22) 31
 
11.0%
None
ValueCountFrequency (%)
2
100.0%
Distinct177
Distinct (%)81.9%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
2024-04-19T14:52:17.616033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length51
Median length45
Mean length28.185185
Min length20

Characters and Unicode

Total characters6088
Distinct characters122
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique141 ?
Unique (%)65.3%

Sample

1st row대구광역시 수성구 국채보상로 826 (범어동)
2nd row대구광역시 수성구 지범로 192 (범물동)
3rd row대구광역시 수성구 달구벌대로 3297 (사월동)
4th row대구광역시 수성구 들안로 275-1 (수성동2가,2층)
5th row대구광역시 수성구 국채보상로 1049 (만촌동)
ValueCountFrequency (%)
수성구 216
 
18.3%
대구광역시 215
 
18.2%
두산동 37
 
3.1%
범어동 37
 
3.1%
달구벌대로 31
 
2.6%
황금동 28
 
2.4%
동대구로 21
 
1.8%
만촌동 18
 
1.5%
지산동 16
 
1.4%
수성로 15
 
1.3%
Other values (257) 546
46.3%
2024-04-19T14:52:18.073041image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
968
 
15.9%
515
 
8.5%
299
 
4.9%
297
 
4.9%
279
 
4.6%
269
 
4.4%
232
 
3.8%
217
 
3.6%
216
 
3.5%
( 214
 
3.5%
Other values (112) 2582
42.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3552
58.3%
Decimal Number 983
 
16.1%
Space Separator 968
 
15.9%
Open Punctuation 214
 
3.5%
Close Punctuation 214
 
3.5%
Other Punctuation 104
 
1.7%
Dash Punctuation 50
 
0.8%
Uppercase Letter 2
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
515
14.5%
299
 
8.4%
297
 
8.4%
279
 
7.9%
269
 
7.6%
232
 
6.5%
217
 
6.1%
216
 
6.1%
213
 
6.0%
76
 
2.1%
Other values (94) 939
26.4%
Decimal Number
ValueCountFrequency (%)
2 200
20.3%
1 197
20.0%
3 104
10.6%
6 88
9.0%
5 85
8.6%
0 85
8.6%
4 76
 
7.7%
8 57
 
5.8%
7 49
 
5.0%
9 42
 
4.3%
Uppercase Letter
ValueCountFrequency (%)
S 1
50.0%
K 1
50.0%
Space Separator
ValueCountFrequency (%)
968
100.0%
Open Punctuation
ValueCountFrequency (%)
( 214
100.0%
Close Punctuation
ValueCountFrequency (%)
) 214
100.0%
Other Punctuation
ValueCountFrequency (%)
, 104
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 50
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3552
58.3%
Common 2534
41.6%
Latin 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
515
14.5%
299
 
8.4%
297
 
8.4%
279
 
7.9%
269
 
7.6%
232
 
6.5%
217
 
6.1%
216
 
6.1%
213
 
6.0%
76
 
2.1%
Other values (94) 939
26.4%
Common
ValueCountFrequency (%)
968
38.2%
( 214
 
8.4%
) 214
 
8.4%
2 200
 
7.9%
1 197
 
7.8%
, 104
 
4.1%
3 104
 
4.1%
6 88
 
3.5%
5 85
 
3.4%
0 85
 
3.4%
Other values (6) 275
 
10.9%
Latin
ValueCountFrequency (%)
S 1
50.0%
K 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3552
58.3%
ASCII 2536
41.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
968
38.2%
( 214
 
8.4%
) 214
 
8.4%
2 200
 
7.9%
1 197
 
7.8%
, 104
 
4.1%
3 104
 
4.1%
6 88
 
3.5%
5 85
 
3.4%
0 85
 
3.4%
Other values (8) 277
 
10.9%
Hangul
ValueCountFrequency (%)
515
14.5%
299
 
8.4%
297
 
8.4%
279
 
7.9%
269
 
7.6%
232
 
6.5%
217
 
6.1%
216
 
6.1%
213
 
6.0%
76
 
2.1%
Other values (94) 939
26.4%

전화번호
Text

MISSING 

Distinct167
Distinct (%)81.5%
Missing11
Missing (%)5.1%
Memory size1.8 KiB
2024-04-19T14:52:18.303180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.995122
Min length9

Characters and Unicode

Total characters2459
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique132 ?
Unique (%)64.4%

Sample

1st row053-742-0333
2nd row053-784-6701
3rd row053-811-9111
4th row053-746-0022
5th row053-752-6200
ValueCountFrequency (%)
053-746-0022 4
 
2.0%
053-756-0010 3
 
1.5%
053-783-0044 2
 
1.0%
053-243-8866 2
 
1.0%
053-631-1777 2
 
1.0%
053-742-0333 2
 
1.0%
053-759-0990 2
 
1.0%
053-782-1738 2
 
1.0%
053-422-7555 2
 
1.0%
053-254-8700 2
 
1.0%
Other values (157) 182
88.8%
2024-04-19T14:52:18.677704image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 409
16.6%
0 382
15.5%
5 348
14.2%
3 304
12.4%
7 253
10.3%
6 173
7.0%
4 137
 
5.6%
1 133
 
5.4%
2 128
 
5.2%
8 119
 
4.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2050
83.4%
Dash Punctuation 409
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 382
18.6%
5 348
17.0%
3 304
14.8%
7 253
12.3%
6 173
8.4%
4 137
 
6.7%
1 133
 
6.5%
2 128
 
6.2%
8 119
 
5.8%
9 73
 
3.6%
Dash Punctuation
ValueCountFrequency (%)
- 409
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2459
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 409
16.6%
0 382
15.5%
5 348
14.2%
3 304
12.4%
7 253
10.3%
6 173
7.0%
4 137
 
5.6%
1 133
 
5.4%
2 128
 
5.2%
8 119
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2459
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 409
16.6%
0 382
15.5%
5 348
14.2%
3 304
12.4%
7 253
10.3%
6 173
7.0%
4 137
 
5.6%
1 133
 
5.4%
2 128
 
5.2%
8 119
 
4.8%

데이터기준일
Date

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
Minimum2018-08-31 00:00:00
Maximum2018-08-31 00:00:00
2024-04-19T14:52:18.798471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-19T14:52:18.881280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Missing values

2024-04-19T14:52:16.157444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-19T14:52:16.251748image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종상호소재지(도로명)전화번호데이터기준일
0국내여행업(주)유성관광여행사대구광역시 수성구 국채보상로 826 (범어동)053-742-03332018-08-31
1국내여행업(주)세일관광여행사대구광역시 수성구 지범로 192 (범물동)053-784-67012018-08-31
2국내여행업(주)여행연합대구광역시 수성구 달구벌대로 3297 (사월동)053-811-91112018-08-31
3국내여행업(주)경일항공여행사대구광역시 수성구 들안로 275-1 (수성동2가,2층)053-746-00222018-08-31
4국내여행업동화항공여행주식회사대구광역시 수성구 국채보상로 1049 (만촌동)053-752-62002018-08-31
5국내여행업(주)알프스여행사대구광역시 수성구 지범로 91 (지산동)053-784-90052018-08-31
6국내여행업(주)투어토탈대구광역시 수성구 고산로 101 (신매동, 이마트대구시지점)053-792-14142018-08-31
7국내여행업(주)즐거운여행사대구광역시 수성구 동원로 136 (만촌동)053-745-69662018-08-31
8국내여행업(주)동성여행사대구광역시 수성구 달구벌대로 3081, 103호 (시지동,은세계상가1층)053-792-89222018-08-31
9국내여행업(주)수은관광대구광역시 수성구 달구벌대로 3103 (시지동)053-792-28882018-08-31
업종상호소재지(도로명)전화번호데이터기준일
206숙박업(일반)볼보모텔대구광역시 수성구 동대구로25길 46 (황금동)053-764-55192018-08-31
207숙박업(일반)TALK대구광역시 수성구 청수로25길 16 (황금동)053-761-77122018-08-31
208숙박업(일반)황금호텔대구광역시 수성구 동대구로 115 (황금동,6층)053-766-80122018-08-31
209숙박업(일반)대구광역시 수성구 청수로25길 22 (황금동)053-761-08562018-08-31
210숙박업(일반)대구광역시 수성구 동대구로15길 30 (두산동)053-761-22732018-08-31
211숙박업(일반)힙모텔대구광역시 수성구 청수로24길 87 (두산동)053-768-66602018-08-31
212숙박업(일반)지(G)모텔대구광역시 수성구 동대구로15길 28 (두산동)053-765-00382018-08-31
213숙박업(일반)호텔라온제나대구광역시 수성구 범어천로 73, 10~14층 (범어동)053-756-67002018-08-31
214숙박업(일반)애플모텔대구광역시 수성구 청수로26길 22 (두산동)<NA>2018-08-31
215숙박업(일반)지지모텔대구광역시 수성구 동대구로25길 24-6 (황금동)<NA>2018-08-31