Overview

Dataset statistics

Number of variables4
Number of observations10000
Missing cells1
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory390.6 KiB
Average record size in memory40.0 B

Variable types

Text3
Categorical1

Dataset

Description대구의 숨겨진 맛집, 일반음식점 관련 데이터입니다.구군별 업소명과 업태, 그리고 업소의 주소가 기록되어 있습니다.
Author대구광역시
URLhttps://www.data.go.kr/data/3056779/fileData.do

Alerts

연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 18:31:47.182762
Analysis finished2023-12-12 18:31:48.803779
Duration1.62 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Text

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T03:31:49.176617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length5
Mean length4.6532
Min length1

Characters and Unicode

Total characters46532
Distinct characters24
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10000 ?
Unique (%)100.0%

Sample

1st row4595
2nd row1342
3rd row17664
4th row15591
5th row20672
ValueCountFrequency (%)
4595 1
 
< 0.1%
12478 1
 
< 0.1%
7933 1
 
< 0.1%
17975 1
 
< 0.1%
23362 1
 
< 0.1%
14595 1
 
< 0.1%
7139 1
 
< 0.1%
1232 1
 
< 0.1%
30820 1
 
< 0.1%
22341 1
 
< 0.1%
Other values (9991) 9991
99.9%
2023-12-13T03:31:49.878596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 7307
15.7%
1 7167
15.4%
3 4415
9.5%
4 3981
8.6%
9 3979
8.6%
6 3963
8.5%
0 3961
8.5%
5 3952
8.5%
8 3926
8.4%
7 3865
8.3%
Other values (14) 16
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 46516
> 99.9%
Uppercase Letter 12
 
< 0.1%
Close Punctuation 1
 
< 0.1%
Space Separator 1
 
< 0.1%
Open Punctuation 1
 
< 0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 7307
15.7%
1 7167
15.4%
3 4415
9.5%
4 3981
8.6%
9 3979
8.6%
6 3963
8.5%
0 3961
8.5%
5 3952
8.5%
8 3926
8.4%
7 3865
8.3%
Uppercase Letter
ValueCountFrequency (%)
R 2
16.7%
E 2
16.7%
P 1
8.3%
B 1
8.3%
H 1
8.3%
A 1
8.3%
L 1
8.3%
O 1
8.3%
D 1
8.3%
N 1
8.3%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Other Punctuation
ValueCountFrequency (%)
" 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 46520
> 99.9%
Latin 12
 
< 0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
2 7307
15.7%
1 7167
15.4%
3 4415
9.5%
4 3981
8.6%
9 3979
8.6%
6 3963
8.5%
0 3961
8.5%
5 3952
8.5%
8 3926
8.4%
7 3865
8.3%
Other values (4) 4
 
< 0.1%
Latin
ValueCountFrequency (%)
R 2
16.7%
E 2
16.7%
P 1
8.3%
B 1
8.3%
H 1
8.3%
A 1
8.3%
L 1
8.3%
O 1
8.3%
D 1
8.3%
N 1
8.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 46532
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 7307
15.7%
1 7167
15.4%
3 4415
9.5%
4 3981
8.6%
9 3979
8.6%
6 3963
8.5%
0 3961
8.5%
5 3952
8.5%
8 3926
8.4%
7 3865
8.3%
Other values (14) 16
 
< 0.1%
Distinct9346
Distinct (%)93.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T03:31:50.311802image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length37
Median length31
Mean length6.3731
Min length1

Characters and Unicode

Total characters63731
Distinct characters1110
Distinct categories12 ?
Distinct scripts6 ?
Distinct blocks8 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8908 ?
Unique (%)89.1%

Sample

1st row밀면땡기네
2nd row삐떡
3rd row다전손칼국수
4th row장군식당
5th row짱구
ValueCountFrequency (%)
본점 24
 
0.2%
다사점 22
 
0.2%
식당 22
 
0.2%
칠곡점 20
 
0.2%
상인점 20
 
0.2%
성서점 18
 
0.2%
동성로점 18
 
0.2%
화원점 15
 
0.1%
대구 15
 
0.1%
침산점 13
 
0.1%
Other values (9868) 11700
98.4%
2023-12-13T03:31:50.877810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1998
 
3.1%
1888
 
3.0%
1397
 
2.2%
1277
 
2.0%
1183
 
1.9%
1046
 
1.6%
906
 
1.4%
( 812
 
1.3%
) 812
 
1.3%
768
 
1.2%
Other values (1100) 51644
81.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 56346
88.4%
Space Separator 1888
 
3.0%
Lowercase Letter 1483
 
2.3%
Uppercase Letter 1366
 
2.1%
Open Punctuation 812
 
1.3%
Close Punctuation 812
 
1.3%
Decimal Number 751
 
1.2%
Other Punctuation 256
 
0.4%
Dash Punctuation 12
 
< 0.1%
Connector Punctuation 3
 
< 0.1%
Other values (2) 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1998
 
3.5%
1397
 
2.5%
1277
 
2.3%
1183
 
2.1%
1046
 
1.9%
906
 
1.6%
768
 
1.4%
680
 
1.2%
652
 
1.2%
626
 
1.1%
Other values (1020) 45813
81.3%
Lowercase Letter
ValueCountFrequency (%)
e 195
13.1%
o 153
 
10.3%
a 136
 
9.2%
n 95
 
6.4%
i 89
 
6.0%
r 80
 
5.4%
s 78
 
5.3%
l 71
 
4.8%
u 70
 
4.7%
t 65
 
4.4%
Other values (16) 451
30.4%
Uppercase Letter
ValueCountFrequency (%)
B 131
 
9.6%
A 109
 
8.0%
O 107
 
7.8%
E 104
 
7.6%
C 82
 
6.0%
T 66
 
4.8%
S 66
 
4.8%
N 61
 
4.5%
I 57
 
4.2%
D 56
 
4.1%
Other values (16) 527
38.6%
Other Punctuation
ValueCountFrequency (%)
& 99
38.7%
, 59
23.0%
. 54
21.1%
' 18
 
7.0%
! 6
 
2.3%
6
 
2.3%
/ 5
 
2.0%
· 5
 
2.0%
# 2
 
0.8%
; 1
 
0.4%
Decimal Number
ValueCountFrequency (%)
1 124
16.5%
0 115
15.3%
3 94
12.5%
2 89
11.9%
9 88
11.7%
8 65
8.7%
7 57
7.6%
4 41
 
5.5%
5 41
 
5.5%
6 37
 
4.9%
Space Separator
ValueCountFrequency (%)
1888
100.0%
Open Punctuation
ValueCountFrequency (%)
( 812
100.0%
Close Punctuation
ValueCountFrequency (%)
) 812
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 56297
88.3%
Common 4535
 
7.1%
Latin 2850
 
4.5%
Han 43
 
0.1%
Hiragana 4
 
< 0.1%
Katakana 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1998
 
3.5%
1397
 
2.5%
1277
 
2.3%
1183
 
2.1%
1046
 
1.9%
906
 
1.6%
768
 
1.4%
680
 
1.2%
652
 
1.2%
626
 
1.1%
Other values (984) 45764
81.3%
Latin
ValueCountFrequency (%)
e 195
 
6.8%
o 153
 
5.4%
a 136
 
4.8%
B 131
 
4.6%
A 109
 
3.8%
O 107
 
3.8%
E 104
 
3.6%
n 95
 
3.3%
i 89
 
3.1%
C 82
 
2.9%
Other values (43) 1649
57.9%
Han
ValueCountFrequency (%)
4
 
9.3%
3
 
7.0%
3
 
7.0%
2
 
4.7%
2
 
4.7%
2
 
4.7%
2
 
4.7%
1
 
2.3%
1
 
2.3%
1
 
2.3%
Other values (22) 22
51.2%
Common
ValueCountFrequency (%)
1888
41.6%
( 812
17.9%
) 812
17.9%
1 124
 
2.7%
0 115
 
2.5%
& 99
 
2.2%
3 94
 
2.1%
2 89
 
2.0%
9 88
 
1.9%
8 65
 
1.4%
Other values (17) 349
 
7.7%
Hiragana
ValueCountFrequency (%)
2
50.0%
2
50.0%
Katakana
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 56297
88.3%
ASCII 7373
 
11.6%
CJK 42
 
0.1%
None 11
 
< 0.1%
Hiragana 4
 
< 0.1%
Katakana 2
 
< 0.1%
Number Forms 1
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1998
 
3.5%
1397
 
2.5%
1277
 
2.3%
1183
 
2.1%
1046
 
1.9%
906
 
1.6%
768
 
1.4%
680
 
1.2%
652
 
1.2%
626
 
1.1%
Other values (984) 45764
81.3%
ASCII
ValueCountFrequency (%)
1888
25.6%
( 812
 
11.0%
) 812
 
11.0%
e 195
 
2.6%
o 153
 
2.1%
a 136
 
1.8%
B 131
 
1.8%
1 124
 
1.7%
0 115
 
1.6%
A 109
 
1.5%
Other values (67) 2898
39.3%
None
ValueCountFrequency (%)
6
54.5%
· 5
45.5%
CJK
ValueCountFrequency (%)
4
 
9.5%
3
 
7.1%
3
 
7.1%
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
1
 
2.4%
1
 
2.4%
1
 
2.4%
Other values (21) 21
50.0%
Hiragana
ValueCountFrequency (%)
2
50.0%
2
50.0%
Number Forms
ValueCountFrequency (%)
1
100.0%
Katakana
ValueCountFrequency (%)
1
50.0%
1
50.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%

업태
Categorical

Distinct22
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
한식
4060 
기타
1652 
호프/통닭
954 
식육(숯불구이)
923 
경양식
527 
Other values (17)
1884 

Length

Max length26
Median length2
Mean length3.2894
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row기타
2nd row경양식
3rd row한식
4th row식육(숯불구이)
5th row기타

Common Values

ValueCountFrequency (%)
한식 4060
40.6%
기타 1652
16.5%
호프/통닭 954
 
9.5%
식육(숯불구이) 923
 
9.2%
경양식 527
 
5.3%
중국식 412
 
4.1%
분식 350
 
3.5%
일식 305
 
3.0%
정종/대포집/소주방 222
 
2.2%
회집 205
 
2.1%
Other values (12) 390
 
3.9%

Length

2023-12-13T03:31:51.063134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
한식 4060
40.6%
기타 1652
16.5%
호프/통닭 954
 
9.5%
식육(숯불구이 923
 
9.2%
경양식 527
 
5.3%
중국식 412
 
4.1%
분식 350
 
3.5%
일식 305
 
3.0%
정종/대포집/소주방 222
 
2.2%
회집 205
 
2.0%
Other values (16) 394
 
3.9%
Distinct9740
Distinct (%)97.4%
Missing1
Missing (%)< 0.1%
Memory size156.2 KiB
2023-12-13T03:31:51.440656image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length63
Median length57
Mean length27.523052
Min length2

Characters and Unicode

Total characters275203
Distinct characters474
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9518 ?
Unique (%)95.2%

Sample

1st row대구광역시 동구 팔공로 1172-1(1층 진인동)
2nd row대구광역시 중구 봉산문화2길 42-26(1,2층 봉산동)
3rd row대구광역시 수성구 지범로21길 15(상가동 1층 1호 지산동)
4th row대구광역시 북구 동북로 141(1층 산격동)
5th row대구광역시 수성구 수성로 291(B동 1층 수성동1가)
ValueCountFrequency (%)
대구광역시 9996
 
20.1%
달서구 2124
 
4.3%
북구 1583
 
3.2%
수성구 1462
 
2.9%
동구 1412
 
2.8%
달성군 983
 
2.0%
중구 917
 
1.8%
서구 759
 
1.5%
남구 644
 
1.3%
1층 557
 
1.1%
Other values (9602) 29173
58.8%
2023-12-13T03:31:52.029289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
39616
 
14.4%
20287
 
7.4%
1 14748
 
5.4%
12938
 
4.7%
12844
 
4.7%
10317
 
3.7%
10098
 
3.7%
10014
 
3.6%
9589
 
3.5%
( 9559
 
3.5%
Other values (464) 125193
45.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 164667
59.8%
Decimal Number 47246
 
17.2%
Space Separator 39616
 
14.4%
Open Punctuation 9559
 
3.5%
Close Punctuation 9559
 
3.5%
Dash Punctuation 2332
 
0.8%
Other Punctuation 1765
 
0.6%
Uppercase Letter 364
 
0.1%
Math Symbol 50
 
< 0.1%
Lowercase Letter 44
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
20287
 
12.3%
12938
 
7.9%
12844
 
7.8%
10317
 
6.3%
10098
 
6.1%
10014
 
6.1%
9589
 
5.8%
5826
 
3.5%
5501
 
3.3%
3974
 
2.4%
Other values (407) 63279
38.4%
Uppercase Letter
ValueCountFrequency (%)
A 109
29.9%
B 97
26.6%
S 24
 
6.6%
T 15
 
4.1%
C 12
 
3.3%
M 12
 
3.3%
K 11
 
3.0%
L 11
 
3.0%
O 10
 
2.7%
E 10
 
2.7%
Other values (13) 53
14.6%
Lowercase Letter
ValueCountFrequency (%)
e 12
27.3%
r 4
 
9.1%
i 4
 
9.1%
d 3
 
6.8%
o 3
 
6.8%
c 3
 
6.8%
w 3
 
6.8%
m 3
 
6.8%
a 3
 
6.8%
l 2
 
4.5%
Other values (3) 4
 
9.1%
Decimal Number
ValueCountFrequency (%)
1 14748
31.2%
2 6732
14.2%
3 4723
 
10.0%
4 3771
 
8.0%
0 3680
 
7.8%
5 3362
 
7.1%
6 3009
 
6.4%
7 2641
 
5.6%
8 2353
 
5.0%
9 2227
 
4.7%
Other Punctuation
ValueCountFrequency (%)
, 1749
99.1%
. 11
 
0.6%
/ 3
 
0.2%
@ 1
 
0.1%
& 1
 
0.1%
Space Separator
ValueCountFrequency (%)
39616
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9559
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9559
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2332
100.0%
Math Symbol
ValueCountFrequency (%)
~ 50
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 164667
59.8%
Common 110127
40.0%
Latin 409
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
20287
 
12.3%
12938
 
7.9%
12844
 
7.8%
10317
 
6.3%
10098
 
6.1%
10014
 
6.1%
9589
 
5.8%
5826
 
3.5%
5501
 
3.3%
3974
 
2.4%
Other values (407) 63279
38.4%
Latin
ValueCountFrequency (%)
A 109
26.7%
B 97
23.7%
S 24
 
5.9%
T 15
 
3.7%
e 12
 
2.9%
C 12
 
2.9%
M 12
 
2.9%
K 11
 
2.7%
L 11
 
2.7%
O 10
 
2.4%
Other values (27) 96
23.5%
Common
ValueCountFrequency (%)
39616
36.0%
1 14748
 
13.4%
( 9559
 
8.7%
) 9559
 
8.7%
2 6732
 
6.1%
3 4723
 
4.3%
4 3771
 
3.4%
0 3680
 
3.3%
5 3362
 
3.1%
6 3009
 
2.7%
Other values (10) 11368
 
10.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 164667
59.8%
ASCII 110535
40.2%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
39616
35.8%
1 14748
 
13.3%
( 9559
 
8.6%
) 9559
 
8.6%
2 6732
 
6.1%
3 4723
 
4.3%
4 3771
 
3.4%
0 3680
 
3.3%
5 3362
 
3.0%
6 3009
 
2.7%
Other values (46) 11776
 
10.7%
Hangul
ValueCountFrequency (%)
20287
 
12.3%
12938
 
7.9%
12844
 
7.8%
10317
 
6.3%
10098
 
6.1%
10014
 
6.1%
9589
 
5.8%
5826
 
3.5%
5501
 
3.3%
3974
 
2.4%
Other values (407) 63279
38.4%
Number Forms
ValueCountFrequency (%)
1
100.0%

Missing values

2023-12-13T03:31:48.649888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T03:31:48.755310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업소명업태업소주소
45944595밀면땡기네기타대구광역시 동구 팔공로 1172-1(1층 진인동)
13411342삐떡경양식대구광역시 중구 봉산문화2길 42-26(1,2층 봉산동)
1766317664다전손칼국수한식대구광역시 수성구 지범로21길 15(상가동 1층 1호 지산동)
1559015591장군식당식육(숯불구이)대구광역시 북구 동북로 141(1층 산격동)
2067120672짱구기타대구광역시 수성구 수성로 291(B동 1층 수성동1가)
1837218373맥주타운호프/통닭대구광역시 수성구 공경로 34(만촌동)
2842928429논공옛날국수한식대구광역시 달성군 논공읍 논공중앙로30길 5(1층)
121171211888홍스시일식대구광역시 북구 대천로18길 5(1층 122호 동천동)
1920319204소문난 안동 구시장 찜닭호프/통닭대구광역시 수성구 수성로62길 8(수성동2가)
1778117782대정옥한식대구광역시 수성구 달구벌대로 2336(2,3층 수성동3가)
연번업소명업태업소주소
2768227682향촌참숯양꼬치두류점기타대구광역시 달서구 달구벌대로344길 68(더영스퀘어 2층 201호 두류동)
739740뚱's 삼겹살식육(숯불구이)대구광역시 중구 대봉로 255(봉산동, 지상1층)
2589225892원조선산한식대구광역시 달서구 학산로7길 92(본동)
1835918360매실농장식당한식대구광역시 수성구 파동로14길 20(5층 파동)
1792017921돈카츠마켙대구시지점경양식대구광역시 수성구 천을로 105-5(1층 매호동)
2941529415성은 누릉지 백숙한식대구광역시 달성군 다사읍 달구벌대로115길 30
1085210853삼겹살에소주한잔호프/통닭대구광역시 남구 두류공원로 54(대명동)
1627916280큰맘할매순대국한식대구광역시 북구 칠성시장로 10(1층 107,108호 칠성동1가)
53735374스시류기타대구광역시 동구 동촌로48길 13(1층 방촌동)
93299330제이와이엠스피노이레스토(JYM'S PINOY RESTO)외국음식전문점(인도,태국등)대구광역시 서구 문화로 340(지하 1층 비산동)