Overview

Dataset statistics

Number of variables4
Number of observations364
Missing cells89
Missing cells (%)6.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory11.5 KiB
Average record size in memory32.4 B

Variable types

Categorical1
Text3

Dataset

Description대구광역시_중구_여행업_20200408
Author대구광역시 중구
URLhttp://data.daegu.go.kr/open/data/dataView.do?dataSetId=15043287&dataSetDetailId=1504328718a9dce8b8f98&provdMethod=FILE

Alerts

전화번호 has 89 (24.5%) missing valuesMissing

Reproduction

Analysis started2024-04-21 15:39:55.660234
Analysis finished2024-04-21 15:39:56.279947
Duration0.62 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종
Categorical

Distinct3
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
국외여행업
178 
국내여행업
142 
일반여행업
44 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국내여행업
2nd row국내여행업
3rd row국내여행업
4th row국내여행업
5th row국내여행업

Common Values

ValueCountFrequency (%)
국외여행업 178
48.9%
국내여행업 142
39.0%
일반여행업 44
 
12.1%

Length

2024-04-22T00:39:56.391563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-22T00:39:56.574832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국외여행업 178
48.9%
국내여행업 142
39.0%
일반여행업 44
 
12.1%

상호
Text

Distinct230
Distinct (%)63.2%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
2024-04-22T00:39:57.374335image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length13
Mean length8.3928571
Min length2

Characters and Unicode

Total characters3055
Distinct characters240
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique96 ?
Unique (%)26.4%

Sample

1st row경상관광(주)
2nd row(주)로얄관광여행사
3rd row(주)신세계항공여행사
4th row(주)우방관광여행사
5th row(주)다모아관광여행사
ValueCountFrequency (%)
주식회사 13
 
3.2%
여행사 7
 
1.7%
주)여행하는 4
 
1.0%
투어 3
 
0.7%
3
 
0.7%
여행 3
 
0.7%
주)대구우리투어 2
 
0.5%
주)그루터기 2
 
0.5%
주)에프엠테마투어 2
 
0.5%
주)가야투어 2
 
0.5%
Other values (234) 366
89.9%
2024-04-22T00:39:58.431840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
313
 
10.2%
( 294
 
9.6%
) 294
 
9.6%
189
 
6.2%
189
 
6.2%
169
 
5.5%
108
 
3.5%
106
 
3.5%
53
 
1.7%
43
 
1.4%
Other values (230) 1297
42.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2405
78.7%
Open Punctuation 294
 
9.6%
Close Punctuation 294
 
9.6%
Space Separator 43
 
1.4%
Uppercase Letter 11
 
0.4%
Other Punctuation 4
 
0.1%
Decimal Number 3
 
0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
313
 
13.0%
189
 
7.9%
189
 
7.9%
169
 
7.0%
108
 
4.5%
106
 
4.4%
53
 
2.2%
43
 
1.8%
43
 
1.8%
41
 
1.7%
Other values (215) 1151
47.9%
Uppercase Letter
ValueCountFrequency (%)
A 2
18.2%
U 2
18.2%
O 2
18.2%
S 1
9.1%
N 1
9.1%
T 1
9.1%
R 1
9.1%
D 1
9.1%
Decimal Number
ValueCountFrequency (%)
1 2
66.7%
2 1
33.3%
Open Punctuation
ValueCountFrequency (%)
( 294
100.0%
Close Punctuation
ValueCountFrequency (%)
) 294
100.0%
Space Separator
ValueCountFrequency (%)
43
100.0%
Other Punctuation
ValueCountFrequency (%)
" 4
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2405
78.7%
Common 639
 
20.9%
Latin 11
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
313
 
13.0%
189
 
7.9%
189
 
7.9%
169
 
7.0%
108
 
4.5%
106
 
4.4%
53
 
2.2%
43
 
1.8%
43
 
1.8%
41
 
1.7%
Other values (215) 1151
47.9%
Latin
ValueCountFrequency (%)
A 2
18.2%
U 2
18.2%
O 2
18.2%
S 1
9.1%
N 1
9.1%
T 1
9.1%
R 1
9.1%
D 1
9.1%
Common
ValueCountFrequency (%)
( 294
46.0%
) 294
46.0%
43
 
6.7%
" 4
 
0.6%
1 2
 
0.3%
2 1
 
0.2%
- 1
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2405
78.7%
ASCII 650
 
21.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
313
 
13.0%
189
 
7.9%
189
 
7.9%
169
 
7.0%
108
 
4.5%
106
 
4.4%
53
 
2.2%
43
 
1.8%
43
 
1.8%
41
 
1.7%
Other values (215) 1151
47.9%
ASCII
ValueCountFrequency (%)
( 294
45.2%
) 294
45.2%
43
 
6.6%
" 4
 
0.6%
A 2
 
0.3%
1 2
 
0.3%
U 2
 
0.3%
O 2
 
0.3%
S 1
 
0.2%
N 1
 
0.2%
Other values (5) 5
 
0.8%
Distinct211
Distinct (%)58.0%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
2024-04-22T00:39:59.322122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length53
Median length41
Mean length30.087912
Min length18

Characters and Unicode

Total characters10952
Distinct characters162
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique93 ?
Unique (%)25.5%

Sample

1st row대구광역시 중구 태평로 177 (태평로1가)
2nd row대구광역시 중구 국채보상로131길 32 (동인동1가)
3rd row대구광역시 중구 국채보상로 627-1 (공평동)
4th row대구광역시 중구 태평로 242 (동인동1가)
5th row대구광역시 중구 중앙대로 432-1 (포정동)
ValueCountFrequency (%)
대구광역시 364
 
16.7%
중구 364
 
16.7%
국채보상로 82
 
3.8%
경상감영길 70
 
3.2%
2층 64
 
2.9%
동인동2가 59
 
2.7%
동덕로 57
 
2.6%
동인동1가 48
 
2.2%
삼덕동2가 36
 
1.7%
대봉동 36
 
1.7%
Other values (290) 999
45.8%
2024-04-22T00:40:00.474993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1815
 
16.6%
749
 
6.8%
585
 
5.3%
1 459
 
4.2%
441
 
4.0%
2 432
 
3.9%
373
 
3.4%
372
 
3.4%
368
 
3.4%
366
 
3.3%
Other values (152) 4992
45.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6177
56.4%
Decimal Number 1889
 
17.2%
Space Separator 1815
 
16.6%
Open Punctuation 363
 
3.3%
Close Punctuation 363
 
3.3%
Other Punctuation 285
 
2.6%
Dash Punctuation 49
 
0.4%
Uppercase Letter 10
 
0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
749
 
12.1%
585
 
9.5%
441
 
7.1%
373
 
6.0%
372
 
6.0%
368
 
6.0%
366
 
5.9%
323
 
5.2%
211
 
3.4%
196
 
3.2%
Other values (129) 2193
35.5%
Decimal Number
ValueCountFrequency (%)
1 459
24.3%
2 432
22.9%
5 180
 
9.5%
3 169
 
8.9%
0 141
 
7.5%
7 130
 
6.9%
6 128
 
6.8%
4 105
 
5.6%
8 84
 
4.4%
9 61
 
3.2%
Uppercase Letter
ValueCountFrequency (%)
D 2
20.0%
A 2
20.0%
H 2
20.0%
I 1
10.0%
C 1
10.0%
T 1
10.0%
Y 1
10.0%
Space Separator
ValueCountFrequency (%)
1815
100.0%
Open Punctuation
ValueCountFrequency (%)
( 363
100.0%
Close Punctuation
ValueCountFrequency (%)
) 363
100.0%
Other Punctuation
ValueCountFrequency (%)
, 285
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 49
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6177
56.4%
Common 4765
43.5%
Latin 10
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
749
 
12.1%
585
 
9.5%
441
 
7.1%
373
 
6.0%
372
 
6.0%
368
 
6.0%
366
 
5.9%
323
 
5.2%
211
 
3.4%
196
 
3.2%
Other values (129) 2193
35.5%
Common
ValueCountFrequency (%)
1815
38.1%
1 459
 
9.6%
2 432
 
9.1%
( 363
 
7.6%
) 363
 
7.6%
, 285
 
6.0%
5 180
 
3.8%
3 169
 
3.5%
0 141
 
3.0%
7 130
 
2.7%
Other values (6) 428
 
9.0%
Latin
ValueCountFrequency (%)
D 2
20.0%
A 2
20.0%
H 2
20.0%
I 1
10.0%
C 1
10.0%
T 1
10.0%
Y 1
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6177
56.4%
ASCII 4775
43.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1815
38.0%
1 459
 
9.6%
2 432
 
9.0%
( 363
 
7.6%
) 363
 
7.6%
, 285
 
6.0%
5 180
 
3.8%
3 169
 
3.5%
0 141
 
3.0%
7 130
 
2.7%
Other values (13) 438
 
9.2%
Hangul
ValueCountFrequency (%)
749
 
12.1%
585
 
9.5%
441
 
7.1%
373
 
6.0%
372
 
6.0%
368
 
6.0%
366
 
5.9%
323
 
5.2%
211
 
3.4%
196
 
3.2%
Other values (129) 2193
35.5%

전화번호
Text

MISSING 

Distinct172
Distinct (%)62.5%
Missing89
Missing (%)24.5%
Memory size3.0 KiB
2024-04-22T00:40:01.410724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length12.025455
Min length12

Characters and Unicode

Total characters3307
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique71 ?
Unique (%)25.8%

Sample

1st row053-425-8800
2nd row053-422-7701
3rd row053-424-1125
4th row053-421-6181
5th row053-425-1400
ValueCountFrequency (%)
053-427-1144 4
 
1.5%
053-425-5312 2
 
0.7%
053-474-2052 2
 
0.7%
053-254-6885 2
 
0.7%
053-421-2277 2
 
0.7%
053-425-3500 2
 
0.7%
053-427-1888 2
 
0.7%
053-555-0540 2
 
0.7%
053-423-8006 2
 
0.7%
053-421-4000 2
 
0.7%
Other values (162) 253
92.0%
2024-04-22T00:40:02.406754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 550
16.6%
0 530
16.0%
5 510
15.4%
3 404
12.2%
2 384
11.6%
4 287
8.7%
1 193
 
5.8%
7 160
 
4.8%
6 112
 
3.4%
8 108
 
3.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2757
83.4%
Dash Punctuation 550
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 530
19.2%
5 510
18.5%
3 404
14.7%
2 384
13.9%
4 287
10.4%
1 193
 
7.0%
7 160
 
5.8%
6 112
 
4.1%
8 108
 
3.9%
9 69
 
2.5%
Dash Punctuation
ValueCountFrequency (%)
- 550
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3307
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 550
16.6%
0 530
16.0%
5 510
15.4%
3 404
12.2%
2 384
11.6%
4 287
8.7%
1 193
 
5.8%
7 160
 
4.8%
6 112
 
3.4%
8 108
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3307
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 550
16.6%
0 530
16.0%
5 510
15.4%
3 404
12.2%
2 384
11.6%
4 287
8.7%
1 193
 
5.8%
7 160
 
4.8%
6 112
 
3.4%
8 108
 
3.3%

Missing values

2024-04-22T00:39:56.070718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-22T00:39:56.222239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종상호소재지(도로명)전화번호
0국내여행업경상관광(주)대구광역시 중구 태평로 177 (태평로1가)053-425-8800
1국내여행업(주)로얄관광여행사대구광역시 중구 국채보상로131길 32 (동인동1가)<NA>
2국내여행업(주)신세계항공여행사대구광역시 중구 국채보상로 627-1 (공평동)053-422-7701
3국내여행업(주)우방관광여행사대구광역시 중구 태평로 242 (동인동1가)053-424-1125
4국내여행업(주)다모아관광여행사대구광역시 중구 중앙대로 432-1 (포정동)053-421-6181
5국내여행업(주)남경여행사대구광역시 중구 경상감영길 281 (동인동1가)053-425-1400
6국내여행업(주)해동항공여행사대구광역시 중구 국채보상로 701-1 (동인동4가)053-427-0707
7국내여행업(주)대마관광여행사대구광역시 중구 국채보상로 571053-253-3111
8국내여행업보람관광(주)대구광역시 중구 공평로20길 32, 2층 (동인동1가)053-424-2420
9국내여행업(주)코스모스항공여행사대구광역시 중구 국채보상로131길 55, 18호 (동인동1가)053-427-2693
업종상호소재지(도로명)전화번호
354일반여행업(주)케이에스투어대구광역시 중구 태평로 177, 1층 5호 (태평로1가, 태평라이프아파트)053-425-8800
355일반여행업오렌지 여행사대구광역시 중구 국채보상로 586, 16층 1658호 (동성로2가)<NA>
356일반여행업주식회사 하나지니항공여행사대구광역시 중구 경상감영길 28, 202호 (서문로1가)<NA>
357일반여행업대구메이트대구광역시 중구 국채보상로 487, 3층 303호 (동산동)<NA>
358일반여행업주식회사 아는여행사대구광역시 중구 동덕로 54, 306호 (대봉동)<NA>
359일반여행업(주)문샷대구광역시 중구 국채보상로102길 34-21 (동산동)<NA>
360일반여행업제이앤에스대구광역시 중구 남산로 43-1, 3층 (남산동)<NA>
361일반여행업(주)퍼시픽투어대구광역시 중구 국채보상로 655, 국채보상공원 화성파크드림CITY 4층 404호 (동인동2가)053-427-0999
362일반여행업(주)머스트고대구광역시 중구 공평로 12, 미르치과병원 지하1층 (삼덕동2가)<NA>
363일반여행업대구경북문화관광협동조합대구광역시 중구 대봉로43안길 40, 2층 (대봉동)<NA>