Overview

Dataset statistics

Number of variables6
Number of observations33
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.7 KiB
Average record size in memory53.0 B

Variable types

Numeric1
Categorical2
Text3

Dataset

Description대구광역시_동구_동구5미전문음식점현황_20180901
Author대구광역시 동구
URLhttp://data.daegu.go.kr/open/data/dataView.do?dataSetId=15052652&dataSetDetailId=1505265218d4a74bc4559_201809211852&provdMethod=FILE

Alerts

주메뉴 is highly overall correlated with 구분High correlation
구분 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
연번 is highly overall correlated with 구분High correlation
구분 is highly imbalanced (51.4%)Imbalance
주메뉴 is highly imbalanced (51.1%)Imbalance
연번 has unique valuesUnique
업소명 has unique valuesUnique
전화번호 has unique valuesUnique

Reproduction

Analysis started2023-12-10 18:33:21.373455
Analysis finished2023-12-10 18:33:23.978231
Duration2.6 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct33
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean17
Minimum1
Maximum33
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size429.0 B
2023-12-11T03:33:24.108098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.6
Q19
median17
Q325
95-th percentile31.4
Maximum33
Range32
Interquartile range (IQR)16

Descriptive statistics

Standard deviation9.6695398
Coefficient of variation (CV)0.56879646
Kurtosis-1.2
Mean17
Median Absolute Deviation (MAD)8
Skewness0
Sum561
Variance93.5
MonotonicityStrictly increasing
2023-12-11T03:33:24.373194image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=33)
ValueCountFrequency (%)
1 1
 
3.0%
26 1
 
3.0%
20 1
 
3.0%
21 1
 
3.0%
22 1
 
3.0%
23 1
 
3.0%
24 1
 
3.0%
25 1
 
3.0%
27 1
 
3.0%
2 1
 
3.0%
Other values (23) 23
69.7%
ValueCountFrequency (%)
1 1
3.0%
2 1
3.0%
3 1
3.0%
4 1
3.0%
5 1
3.0%
6 1
3.0%
7 1
3.0%
8 1
3.0%
9 1
3.0%
10 1
3.0%
ValueCountFrequency (%)
33 1
3.0%
32 1
3.0%
31 1
3.0%
30 1
3.0%
29 1
3.0%
28 1
3.0%
27 1
3.0%
26 1
3.0%
25 1
3.0%
24 1
3.0%

구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)12.1%
Missing0
Missing (%)0.0%
Memory size396.0 B
닭요리
27 
연근요리
 
2
오리요리
 
2
산채요리
 
2

Length

Max length4
Median length3
Mean length3.1818182
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row연근요리
2nd row연근요리
3rd row오리요리
4th row오리요리
5th row산채요리

Common Values

ValueCountFrequency (%)
닭요리 27
81.8%
연근요리 2
 
6.1%
오리요리 2
 
6.1%
산채요리 2
 
6.1%

Length

2023-12-11T03:33:24.634899image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T03:33:24.792302image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
닭요리 27
81.8%
연근요리 2
 
6.1%
오리요리 2
 
6.1%
산채요리 2
 
6.1%

업소명
Text

UNIQUE 

Distinct33
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size396.0 B
2023-12-11T03:33:25.147047image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length10
Mean length5.969697
Min length3

Characters and Unicode

Total characters197
Distinct characters106
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique33 ?
Unique (%)100.0%

Sample

1st row연향이머무는뜨락
2nd row반야월연근사랑 협동조합
3rd row쌍쌍오리한마당
4th row하늘천따지식당
5th row고향차밭골식당
ValueCountFrequency (%)
연향이머무는뜨락 1
 
2.8%
반야월연근사랑 1
 
2.8%
무릉도원식당 1
 
2.8%
부산식당 1
 
2.8%
삼아통닭식당 1
 
2.8%
아가씨와건달들 1
 
2.8%
아로마 1
 
2.8%
오동나무식당 1
 
2.8%
운수좋은날 1
 
2.8%
은행나무식당 1
 
2.8%
Other values (26) 26
72.2%
2023-12-11T03:33:26.059996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
15
 
7.6%
14
 
7.1%
9
 
4.6%
8
 
4.1%
4
 
2.0%
4
 
2.0%
4
 
2.0%
4
 
2.0%
3
 
1.5%
3
 
1.5%
Other values (96) 129
65.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 194
98.5%
Space Separator 3
 
1.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
15
 
7.7%
14
 
7.2%
9
 
4.6%
8
 
4.1%
4
 
2.1%
4
 
2.1%
4
 
2.1%
4
 
2.1%
3
 
1.5%
3
 
1.5%
Other values (95) 126
64.9%
Space Separator
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 194
98.5%
Common 3
 
1.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
15
 
7.7%
14
 
7.2%
9
 
4.6%
8
 
4.1%
4
 
2.1%
4
 
2.1%
4
 
2.1%
4
 
2.1%
3
 
1.5%
3
 
1.5%
Other values (95) 126
64.9%
Common
ValueCountFrequency (%)
3
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 194
98.5%
ASCII 3
 
1.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
15
 
7.7%
14
 
7.2%
9
 
4.6%
8
 
4.1%
4
 
2.1%
4
 
2.1%
4
 
2.1%
4
 
2.1%
3
 
1.5%
3
 
1.5%
Other values (95) 126
64.9%
ASCII
ValueCountFrequency (%)
3
100.0%

주소
Text

Distinct26
Distinct (%)78.8%
Missing0
Missing (%)0.0%
Memory size396.0 B
2023-12-11T03:33:26.855064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length39
Median length38
Mean length23.030303
Min length20

Characters and Unicode

Total characters760
Distinct characters57
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique22 ?
Unique (%)66.7%

Sample

1st row대구광역시 동구 갓바위로 57(진인동)
2nd row대구광역시 동구 동호로2길 13-5(동호동)
3rd row대구광역시 동구 화랑로88길 76(방촌동)
4th row대구광역시 동구 서촌로 129(송정동)
5th row대구광역시 동구 파계로138길 12(중대동)
ValueCountFrequency (%)
대구광역시 33
24.3%
동구 33
24.3%
아양로9길 17
12.5%
아양로 5
 
3.7%
6-6(신암동 4
 
2.9%
10(신암동 3
 
2.2%
6-4(신암동 2
 
1.5%
3(신암동 2
 
1.5%
아양로7길 2
 
1.5%
12 2
 
1.5%
Other values (32) 33
24.3%
2023-12-11T03:33:27.517718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
103
 
13.6%
70
 
9.2%
66
 
8.7%
34
 
4.5%
( 33
 
4.3%
33
 
4.3%
33
 
4.3%
33
 
4.3%
33
 
4.3%
) 33
 
4.3%
Other values (47) 289
38.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 474
62.4%
Space Separator 103
 
13.6%
Decimal Number 102
 
13.4%
Open Punctuation 33
 
4.3%
Close Punctuation 33
 
4.3%
Dash Punctuation 13
 
1.7%
Other Punctuation 2
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
70
14.8%
66
13.9%
34
 
7.2%
33
 
7.0%
33
 
7.0%
33
 
7.0%
33
 
7.0%
28
 
5.9%
27
 
5.7%
25
 
5.3%
Other values (32) 92
19.4%
Decimal Number
ValueCountFrequency (%)
9 20
19.6%
6 15
14.7%
1 15
14.7%
5 13
12.7%
3 9
8.8%
2 8
 
7.8%
7 8
 
7.8%
4 6
 
5.9%
8 5
 
4.9%
0 3
 
2.9%
Space Separator
ValueCountFrequency (%)
103
100.0%
Open Punctuation
ValueCountFrequency (%)
( 33
100.0%
Close Punctuation
ValueCountFrequency (%)
) 33
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 474
62.4%
Common 286
37.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
70
14.8%
66
13.9%
34
 
7.2%
33
 
7.0%
33
 
7.0%
33
 
7.0%
33
 
7.0%
28
 
5.9%
27
 
5.7%
25
 
5.3%
Other values (32) 92
19.4%
Common
ValueCountFrequency (%)
103
36.0%
( 33
 
11.5%
) 33
 
11.5%
9 20
 
7.0%
6 15
 
5.2%
1 15
 
5.2%
- 13
 
4.5%
5 13
 
4.5%
3 9
 
3.1%
2 8
 
2.8%
Other values (5) 24
 
8.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 474
62.4%
ASCII 286
37.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
103
36.0%
( 33
 
11.5%
) 33
 
11.5%
9 20
 
7.0%
6 15
 
5.2%
1 15
 
5.2%
- 13
 
4.5%
5 13
 
4.5%
3 9
 
3.1%
2 8
 
2.8%
Other values (5) 24
 
8.4%
Hangul
ValueCountFrequency (%)
70
14.8%
66
13.9%
34
 
7.2%
33
 
7.0%
33
 
7.0%
33
 
7.0%
33
 
7.0%
28
 
5.9%
27
 
5.7%
25
 
5.3%
Other values (32) 92
19.4%

전화번호
Text

UNIQUE 

Distinct33
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size396.0 B
2023-12-11T03:33:27.948278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters396
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique33 ?
Unique (%)100.0%

Sample

1st row053-981-8200
2nd row053-964-0912
3rd row053-983-6689
4th row053-982-6190
5th row053-981-5883
ValueCountFrequency (%)
053-981-8200 1
 
3.0%
053-951-3450 1
 
3.0%
053-952-2240 1
 
3.0%
053-958-0816 1
 
3.0%
053-959-7986 1
 
3.0%
053-954-2802 1
 
3.0%
053-954-6580 1
 
3.0%
053-954-1911 1
 
3.0%
053-942-6660 1
 
3.0%
053-959-2759 1
 
3.0%
Other values (23) 23
69.7%
2023-12-11T03:33:28.586191image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 66
16.7%
5 62
15.7%
0 55
13.9%
9 48
12.1%
3 45
11.4%
4 26
 
6.6%
8 24
 
6.1%
1 20
 
5.1%
2 20
 
5.1%
6 20
 
5.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 330
83.3%
Dash Punctuation 66
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 62
18.8%
0 55
16.7%
9 48
14.5%
3 45
13.6%
4 26
7.9%
8 24
 
7.3%
1 20
 
6.1%
2 20
 
6.1%
6 20
 
6.1%
7 10
 
3.0%
Dash Punctuation
ValueCountFrequency (%)
- 66
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 396
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 66
16.7%
5 62
15.7%
0 55
13.9%
9 48
12.1%
3 45
11.4%
4 26
 
6.6%
8 24
 
6.1%
1 20
 
5.1%
2 20
 
5.1%
6 20
 
5.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 396
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 66
16.7%
5 62
15.7%
0 55
13.9%
9 48
12.1%
3 45
11.4%
4 26
 
6.6%
8 24
 
6.1%
1 20
 
5.1%
2 20
 
5.1%
6 20
 
5.1%

주메뉴
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct8
Distinct (%)24.2%
Missing0
Missing (%)0.0%
Memory size396.0 B
모듬똥집
25 
연근정식
 
2
한방오리불고기
 
1
생오리구이
 
1
차밭골정식
 
1
Other values (3)

Length

Max length7
Median length4
Mean length4.0606061
Min length2

Unique

Unique6 ?
Unique (%)18.2%

Sample

1st row연근정식
2nd row연근정식
3rd row한방오리불고기
4th row생오리구이
5th row차밭골정식

Common Values

ValueCountFrequency (%)
모듬똥집 25
75.8%
연근정식 2
 
6.1%
한방오리불고기 1
 
3.0%
생오리구이 1
 
3.0%
차밭골정식 1
 
3.0%
곤드레밥 1
 
3.0%
삼계탕 1
 
3.0%
옻닭 1
 
3.0%

Length

2023-12-11T03:33:28.868003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T03:33:29.112601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
모듬똥집 25
75.8%
연근정식 2
 
6.1%
한방오리불고기 1
 
3.0%
생오리구이 1
 
3.0%
차밭골정식 1
 
3.0%
곤드레밥 1
 
3.0%
삼계탕 1
 
3.0%
옻닭 1
 
3.0%

Interactions

2023-12-11T03:33:23.020457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T03:33:29.284557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번구분업소명주소전화번호주메뉴
연번1.0000.7931.0000.6361.0000.519
구분0.7931.0001.0001.0001.0001.000
업소명1.0001.0001.0001.0001.0001.000
주소0.6361.0001.0001.0001.0001.000
전화번호1.0001.0001.0001.0001.0001.000
주메뉴0.5191.0001.0001.0001.0001.000
2023-12-11T03:33:29.458428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
주메뉴구분
주메뉴1.0000.928
구분0.9281.000
2023-12-11T03:33:29.588813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번구분주메뉴
연번1.0000.5420.249
구분0.5421.0000.928
주메뉴0.2490.9281.000

Missing values

2023-12-11T03:33:23.318089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T03:33:23.901800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번구분업소명주소전화번호주메뉴
01연근요리연향이머무는뜨락대구광역시 동구 갓바위로 57(진인동)053-981-8200연근정식
12연근요리반야월연근사랑 협동조합대구광역시 동구 동호로2길 13-5(동호동)053-964-0912연근정식
23오리요리쌍쌍오리한마당대구광역시 동구 화랑로88길 76(방촌동)053-983-6689한방오리불고기
34오리요리하늘천따지식당대구광역시 동구 서촌로 129(송정동)053-982-6190생오리구이
45산채요리고향차밭골식당대구광역시 동구 파계로138길 12(중대동)053-981-5883차밭골정식
56산채요리산중식당대구광역시 동구 팔공산로185길 55(용수동)053-982-0077곤드레밥
67닭요리고산정삼계탕대구광역시 동구 안심로 375(신서동)053-962-6699삼계탕
78닭요리백림정대구광역시 동구 도평로 249(도동)053-986-0032옻닭
89닭요리고인돌식당대구광역시 동구 아양로9길 5(신암동)053-951-3238모듬똥집
910닭요리궁전통닭대구광역시 동구 아양로9길 6-4(신암동)053-957-4636모듬똥집
연번구분업소명주소전화번호주메뉴
2324닭요리오동나무식당대구광역시 동구 아양로 53-2(신암동)053-954-6800모듬똥집
2425닭요리운수좋은날대구광역시 동구 아양로9길 6-3(신암동)053-959-2759모듬똥집
2526닭요리은행나무식당대구광역시 동구 아양로9길 10(신암동)053-942-6660모듬똥집
2627닭요리제일통닭대구광역시 동구 아양로9길 6-6(신암동)053-954-1911모듬똥집
2728닭요리진미통닭식당대구광역시 동구 아양로9길 10(신암동)053-954-6580모듬똥집
2829닭요리타이타닉식당대구광역시 동구 아양로9길 3(신암동)053-954-2802모듬똥집
2930닭요리평강공주와 온달장군식당대구광역시 동구 아양로9길 6-1(신암동)053-959-7986모듬똥집
3031닭요리평화통닭식당대구광역시 동구 아양로9길 6-6(신암동)053-958-0816모듬똥집
3132닭요리포항통닭식당대구광역시 동구 아양로9길 6-6(신암동)053-952-2240모듬똥집
3233닭요리합천통닭대구광역시 동구 아양로 53-4(신암동)053-951-3190모듬똥집