Overview

Dataset statistics

Number of variables4
Number of observations59
Missing cells5
Missing cells (%)2.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.0 KiB
Average record size in memory34.2 B

Variable types

Categorical1
Text3

Dataset

Description2013년 12월 기준 대구지역 관광식당업 현황
Author대구광역시
URLhttp://data.daegu.go.kr/open/data/dataView.do?dataSetId=15054192&dataSetDetailId=150541921b5c332a1e3a4_201705301624&provdMethod=FILE

Alerts

시·도명 has constant value ""Constant
전화번호 has 5 (8.5%) missing valuesMissing
업체명 has unique valuesUnique
소재지 has unique valuesUnique

Reproduction

Analysis started2024-04-21 13:31:30.846881
Analysis finished2024-04-21 13:31:31.645035
Duration0.8 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시·도명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size600.0 B
대구
59 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대구
2nd row대구
3rd row대구
4th row대구
5th row대구

Common Values

ValueCountFrequency (%)
대구 59
100.0%

Length

2024-04-21T22:31:31.852261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T22:31:32.143435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대구 59
100.0%

업체명
Text

UNIQUE 

Distinct59
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size600.0 B
2024-04-21T22:31:32.920626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length10
Mean length6.0169492
Min length2

Characters and Unicode

Total characters355
Distinct characters128
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique59 ?
Unique (%)100.0%

Sample

1st row스타케밥
2nd row아웃백스테이크하우스(동성로점)
3rd row중화반점
4th row만리장성
5th row샤로니스
ValueCountFrequency (%)
스타케밥 1
 
1.7%
왕가네손짜장 1
 
1.7%
연경반점 1
 
1.7%
㈜드마리스(대구점 1
 
1.7%
리안 1
 
1.7%
아트리움 1
 
1.7%
안압정 1
 
1.7%
인화반점 1
 
1.7%
스위트인디아 1
 
1.7%
기찬낙지 1
 
1.7%
Other values (50) 50
83.3%
2024-04-21T22:31:34.222326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
20
 
5.6%
19
 
5.4%
( 17
 
4.8%
) 17
 
4.8%
9
 
2.5%
8
 
2.3%
8
 
2.3%
8
 
2.3%
7
 
2.0%
6
 
1.7%
Other values (118) 236
66.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 315
88.7%
Open Punctuation 17
 
4.8%
Close Punctuation 17
 
4.8%
Uppercase Letter 4
 
1.1%
Space Separator 1
 
0.3%
Other Symbol 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
20
 
6.3%
19
 
6.0%
9
 
2.9%
8
 
2.5%
8
 
2.5%
8
 
2.5%
7
 
2.2%
6
 
1.9%
5
 
1.6%
5
 
1.6%
Other values (111) 220
69.8%
Uppercase Letter
ValueCountFrequency (%)
A 2
50.0%
T 1
25.0%
R 1
25.0%
Open Punctuation
ValueCountFrequency (%)
( 17
100.0%
Close Punctuation
ValueCountFrequency (%)
) 17
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 316
89.0%
Common 35
 
9.9%
Latin 4
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
20
 
6.3%
19
 
6.0%
9
 
2.8%
8
 
2.5%
8
 
2.5%
8
 
2.5%
7
 
2.2%
6
 
1.9%
5
 
1.6%
5
 
1.6%
Other values (112) 221
69.9%
Common
ValueCountFrequency (%)
( 17
48.6%
) 17
48.6%
1
 
2.9%
Latin
ValueCountFrequency (%)
A 2
50.0%
T 1
25.0%
R 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 315
88.7%
ASCII 39
 
11.0%
None 1
 
0.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
20
 
6.3%
19
 
6.0%
9
 
2.9%
8
 
2.5%
8
 
2.5%
8
 
2.5%
7
 
2.2%
6
 
1.9%
5
 
1.6%
5
 
1.6%
Other values (111) 220
69.8%
ASCII
ValueCountFrequency (%)
( 17
43.6%
) 17
43.6%
A 2
 
5.1%
T 1
 
2.6%
R 1
 
2.6%
1
 
2.6%
None
ValueCountFrequency (%)
1
100.0%

소재지
Text

UNIQUE 

Distinct59
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size600.0 B
2024-04-21T22:31:35.263521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length37
Median length24
Mean length14.152542
Min length9

Characters and Unicode

Total characters835
Distinct characters72
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique59 ?
Unique (%)100.0%

Sample

1st row중구 공평동 63(동성로 3길 62, 1층)
2nd row중구 남일동 24-4
3rd row중구 남일동 92
4th row중구 대봉1동 55-39
5th row중구 동성로 3길 35
ValueCountFrequency (%)
수성구 27
 
14.0%
중구 10
 
5.2%
북구 10
 
5.2%
달서구 7
 
3.6%
두산동 6
 
3.1%
상동 4
 
2.1%
황금동 3
 
1.6%
산격동 3
 
1.6%
만촌동 2
 
1.0%
범어동 2
 
1.0%
Other values (109) 119
61.7%
2024-04-21T22:31:36.720648image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
134
16.0%
63
 
7.5%
61
 
7.3%
- 51
 
6.1%
1 41
 
4.9%
2 39
 
4.7%
38
 
4.6%
3 32
 
3.8%
29
 
3.5%
4 28
 
3.4%
Other values (62) 319
38.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 361
43.2%
Decimal Number 272
32.6%
Space Separator 134
 
16.0%
Dash Punctuation 51
 
6.1%
Open Punctuation 7
 
0.8%
Close Punctuation 7
 
0.8%
Other Punctuation 2
 
0.2%
Uppercase Letter 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
63
17.5%
61
16.9%
38
 
10.5%
29
 
8.0%
15
 
4.2%
12
 
3.3%
10
 
2.8%
10
 
2.8%
8
 
2.2%
8
 
2.2%
Other values (46) 107
29.6%
Decimal Number
ValueCountFrequency (%)
1 41
15.1%
2 39
14.3%
3 32
11.8%
4 28
10.3%
5 27
9.9%
0 26
9.6%
6 24
8.8%
9 22
8.1%
7 18
6.6%
8 15
 
5.5%
Space Separator
ValueCountFrequency (%)
134
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 51
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%
Uppercase Letter
ValueCountFrequency (%)
A 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 473
56.6%
Hangul 361
43.2%
Latin 1
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
63
17.5%
61
16.9%
38
 
10.5%
29
 
8.0%
15
 
4.2%
12
 
3.3%
10
 
2.8%
10
 
2.8%
8
 
2.2%
8
 
2.2%
Other values (46) 107
29.6%
Common
ValueCountFrequency (%)
134
28.3%
- 51
 
10.8%
1 41
 
8.7%
2 39
 
8.2%
3 32
 
6.8%
4 28
 
5.9%
5 27
 
5.7%
0 26
 
5.5%
6 24
 
5.1%
9 22
 
4.7%
Other values (5) 49
 
10.4%
Latin
ValueCountFrequency (%)
A 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 474
56.8%
Hangul 361
43.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
134
28.3%
- 51
 
10.8%
1 41
 
8.6%
2 39
 
8.2%
3 32
 
6.8%
4 28
 
5.9%
5 27
 
5.7%
0 26
 
5.5%
6 24
 
5.1%
9 22
 
4.6%
Other values (6) 50
 
10.5%
Hangul
ValueCountFrequency (%)
63
17.5%
61
16.9%
38
 
10.5%
29
 
8.0%
15
 
4.2%
12
 
3.3%
10
 
2.8%
10
 
2.8%
8
 
2.2%
8
 
2.2%
Other values (46) 107
29.6%

전화번호
Text

MISSING 

Distinct54
Distinct (%)100.0%
Missing5
Missing (%)8.5%
Memory size600.0 B
2024-04-21T22:31:37.634099image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.018519
Min length12

Characters and Unicode

Total characters649
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique54 ?
Unique (%)100.0%

Sample

1st row053-424-9951
2nd row053-257-8701
3rd row053-425-6839
4th row053-424-1169
5th row053-425-3242
ValueCountFrequency (%)
053-628-3337 1
 
1.9%
053-763-5988 1
 
1.9%
053-794-0291 1
 
1.9%
053-746-0230 1
 
1.9%
053-754-3111 1
 
1.9%
053-742-3369 1
 
1.9%
053-752-4192 1
 
1.9%
053-741-4624 1
 
1.9%
053-767-3232 1
 
1.9%
053-761-2611 1
 
1.9%
Other values (44) 44
81.5%
2024-04-21T22:31:38.931942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 108
16.6%
5 92
14.2%
0 90
13.9%
3 90
13.9%
2 50
7.7%
7 47
7.2%
8 42
 
6.5%
1 41
 
6.3%
6 34
 
5.2%
4 30
 
4.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 541
83.4%
Dash Punctuation 108
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 92
17.0%
0 90
16.6%
3 90
16.6%
2 50
9.2%
7 47
8.7%
8 42
7.8%
1 41
7.6%
6 34
 
6.3%
4 30
 
5.5%
9 25
 
4.6%
Dash Punctuation
ValueCountFrequency (%)
- 108
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 649
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 108
16.6%
5 92
14.2%
0 90
13.9%
3 90
13.9%
2 50
7.7%
7 47
7.2%
8 42
 
6.5%
1 41
 
6.3%
6 34
 
5.2%
4 30
 
4.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 649
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 108
16.6%
5 92
14.2%
0 90
13.9%
3 90
13.9%
2 50
7.7%
7 47
7.2%
8 42
 
6.5%
1 41
 
6.3%
6 34
 
5.2%
4 30
 
4.6%

Correlations

2024-04-21T22:31:39.191985image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업체명소재지전화번호
업체명1.0001.0001.000
소재지1.0001.0001.000
전화번호1.0001.0001.000

Missing values

2024-04-21T22:31:31.267968image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T22:31:31.539474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시·도명업체명소재지전화번호
0대구스타케밥중구 공평동 63(동성로 3길 62, 1층)053-424-9951
1대구아웃백스테이크하우스(동성로점)중구 남일동 24-4053-257-8701
2대구중화반점중구 남일동 92053-425-6839
3대구만리장성중구 대봉1동 55-39053-424-1169
4대구샤로니스중구 동성로 3길 35<NA>
5대구바라지레스토랑중구 동성로1가 6-3 (3층)053-425-3242
6대구사마르칸트중구 동성로2가 2-3 (3층)053-252-4021
7대구타라(TARA)중구 동성로3가 35-14 (2층)053-255-8050
8대구베니건스(동성로점)중구 사일동 59 동성프라자 2층053-424-8200
9대구신짜오(동성로점)중구 삼덕동1가 25-4053-428-6544
시·도명업체명소재지전화번호
49대구아웃백스테이크하우스(황금점)수성구 황금동 670-2053-768-7011
50대구누들볼수성구 황금동 676-1<NA>
51대구아서원수성구 황금동 690-2053-761-2223
52대구아웃백스테이크하우스(죽전점)달서구 감삼동 141-5053-571-1851
53대구길손짬뽕달서구 본동 962-1053-561-5757
54대구홍콩중국요리달서구 본리동 358-6053-527-2100
55대구와룡끼타달서구 성서로 72길 5070-8849-4013
56대구아방궁달서구 이곡동 1000-239053-591-0270
57대구자방차이나타운달서구 호산동 105-3053-593-1788
58대구촉향원달서구 호산동 709-10053-588-2111