Overview

Dataset statistics

Number of variables5
Number of observations43
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.8 KiB
Average record size in memory43.1 B

Variable types

Text4
Categorical1

Dataset

Description경상남도 창녕군 음식점에 대한 데이터를 포함하고 있습니다.(모범음식점 업소명, 주소, 전화번호, 주메뉴, 분류)
Author경상남도 창녕군
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15007289

Alerts

분류 is highly imbalanced (62.1%)Imbalance
업소명 has unique valuesUnique
주소 has unique valuesUnique
전화번호 has unique valuesUnique

Reproduction

Analysis started2023-12-11 00:33:18.895076
Analysis finished2023-12-11 00:33:19.341604
Duration0.45 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업소명
Text

UNIQUE 

Distinct43
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size476.0 B
2023-12-11T09:33:19.493586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length10
Mean length5.9767442
Min length2

Characters and Unicode

Total characters257
Distinct characters125
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique43 ?
Unique (%)100.0%

Sample

1st row북면자연농원
2nd row우포한우프라자
3rd row물망초횟집
4th row조선옥삼계탕
5th row화덕본가
ValueCountFrequency (%)
북면자연농원 1
 
2.2%
도리원 1
 
2.2%
선하회 1
 
2.2%
창녕한우프라자 1
 
2.2%
우포늪식당 1
 
2.2%
귀촌마을 1
 
2.2%
부생밀면고기집 1
 
2.2%
도천진짜순대창녕점 1
 
2.2%
하나비 1
 
2.2%
창녕대가 1
 
2.2%
Other values (36) 36
78.3%
2023-12-11T09:33:19.926003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
12
 
4.7%
8
 
3.1%
7
 
2.7%
6
 
2.3%
6
 
2.3%
5
 
1.9%
5
 
1.9%
4
 
1.6%
4
 
1.6%
4
 
1.6%
Other values (115) 196
76.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 253
98.4%
Space Separator 3
 
1.2%
Other Symbol 1
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12
 
4.7%
8
 
3.2%
7
 
2.8%
6
 
2.4%
6
 
2.4%
5
 
2.0%
5
 
2.0%
4
 
1.6%
4
 
1.6%
4
 
1.6%
Other values (113) 192
75.9%
Space Separator
ValueCountFrequency (%)
3
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 254
98.8%
Common 3
 
1.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12
 
4.7%
8
 
3.1%
7
 
2.8%
6
 
2.4%
6
 
2.4%
5
 
2.0%
5
 
2.0%
4
 
1.6%
4
 
1.6%
4
 
1.6%
Other values (114) 193
76.0%
Common
ValueCountFrequency (%)
3
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 253
98.4%
ASCII 3
 
1.2%
None 1
 
0.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
12
 
4.7%
8
 
3.2%
7
 
2.8%
6
 
2.4%
6
 
2.4%
5
 
2.0%
5
 
2.0%
4
 
1.6%
4
 
1.6%
4
 
1.6%
Other values (113) 192
75.9%
ASCII
ValueCountFrequency (%)
3
100.0%
None
ValueCountFrequency (%)
1
100.0%

주소
Text

UNIQUE 

Distinct43
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size476.0 B
2023-12-11T09:33:20.161693image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length41
Median length29
Mean length18.232558
Min length15

Characters and Unicode

Total characters784
Distinct characters85
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique43 ?
Unique (%)100.0%

Sample

1st row경상남도 계성면 영산계성로 397-5
2nd row경상남도 계성면 영산계성로 457
3rd row경상남도 남지읍 남지강변길 112
4th row경상남도 남지읍 동포7길 19-1
5th row경상남도 남지읍 남지중앙1길 17-1
ValueCountFrequency (%)
경상남도 43
24.3%
창녕읍 16
 
9.0%
부곡면 7
 
4.0%
영산면 6
 
3.4%
도천면 5
 
2.8%
화왕산로 5
 
2.8%
남지읍 5
 
2.8%
온천로 4
 
2.3%
우포2로 2
 
1.1%
계성면 2
 
1.1%
Other values (77) 82
46.3%
2023-12-11T09:33:20.554681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
134
 
17.1%
53
 
6.8%
49
 
6.2%
45
 
5.7%
43
 
5.5%
1 38
 
4.8%
23
 
2.9%
22
 
2.8%
21
 
2.7%
20
 
2.6%
Other values (75) 336
42.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 485
61.9%
Decimal Number 138
 
17.6%
Space Separator 134
 
17.1%
Dash Punctuation 13
 
1.7%
Open Punctuation 6
 
0.8%
Close Punctuation 6
 
0.8%
Other Punctuation 2
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
53
 
10.9%
49
 
10.1%
45
 
9.3%
43
 
8.9%
23
 
4.7%
22
 
4.5%
21
 
4.3%
20
 
4.1%
17
 
3.5%
17
 
3.5%
Other values (60) 175
36.1%
Decimal Number
ValueCountFrequency (%)
1 38
27.5%
6 14
 
10.1%
8 14
 
10.1%
2 14
 
10.1%
7 13
 
9.4%
9 12
 
8.7%
5 10
 
7.2%
4 9
 
6.5%
0 7
 
5.1%
3 7
 
5.1%
Space Separator
ValueCountFrequency (%)
134
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 485
61.9%
Common 299
38.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
53
 
10.9%
49
 
10.1%
45
 
9.3%
43
 
8.9%
23
 
4.7%
22
 
4.5%
21
 
4.3%
20
 
4.1%
17
 
3.5%
17
 
3.5%
Other values (60) 175
36.1%
Common
ValueCountFrequency (%)
134
44.8%
1 38
 
12.7%
6 14
 
4.7%
8 14
 
4.7%
2 14
 
4.7%
7 13
 
4.3%
- 13
 
4.3%
9 12
 
4.0%
5 10
 
3.3%
4 9
 
3.0%
Other values (5) 28
 
9.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 485
61.9%
ASCII 299
38.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
134
44.8%
1 38
 
12.7%
6 14
 
4.7%
8 14
 
4.7%
2 14
 
4.7%
7 13
 
4.3%
- 13
 
4.3%
9 12
 
4.0%
5 10
 
3.3%
4 9
 
3.0%
Other values (5) 28
 
9.4%
Hangul
ValueCountFrequency (%)
53
 
10.9%
49
 
10.1%
45
 
9.3%
43
 
8.9%
23
 
4.7%
22
 
4.5%
21
 
4.3%
20
 
4.1%
17
 
3.5%
17
 
3.5%
Other values (60) 175
36.1%

전화번호
Text

UNIQUE 

Distinct43
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size476.0 B
2023-12-11T09:33:20.811044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters516
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique43 ?
Unique (%)100.0%

Sample

1st row055-521-8858
2nd row055-536-4114
3rd row055-536-5510
4th row055-526-1044
5th row055-521-8592
ValueCountFrequency (%)
055-521-8858 1
 
2.3%
055-521-6116 1
 
2.3%
055-536-0475 1
 
2.3%
055-532-8649 1
 
2.3%
055-533-9600 1
 
2.3%
055-533-0392 1
 
2.3%
055-532-4388 1
 
2.3%
055-532-0074 1
 
2.3%
055-532-3301 1
 
2.3%
055-533-1180 1
 
2.3%
Other values (33) 33
76.7%
2023-12-11T09:33:21.499403image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 148
28.7%
- 86
16.7%
0 67
13.0%
3 59
 
11.4%
2 34
 
6.6%
6 29
 
5.6%
1 26
 
5.0%
9 26
 
5.0%
8 17
 
3.3%
4 13
 
2.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 430
83.3%
Dash Punctuation 86
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 148
34.4%
0 67
15.6%
3 59
 
13.7%
2 34
 
7.9%
6 29
 
6.7%
1 26
 
6.0%
9 26
 
6.0%
8 17
 
4.0%
4 13
 
3.0%
7 11
 
2.6%
Dash Punctuation
ValueCountFrequency (%)
- 86
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 516
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 148
28.7%
- 86
16.7%
0 67
13.0%
3 59
 
11.4%
2 34
 
6.6%
6 29
 
5.6%
1 26
 
5.0%
9 26
 
5.0%
8 17
 
3.3%
4 13
 
2.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 516
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 148
28.7%
- 86
16.7%
0 67
13.0%
3 59
 
11.4%
2 34
 
6.6%
6 29
 
5.6%
1 26
 
5.0%
9 26
 
5.0%
8 17
 
3.3%
4 13
 
2.5%
Distinct41
Distinct (%)95.3%
Missing0
Missing (%)0.0%
Memory size476.0 B
2023-12-11T09:33:21.773072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length10
Mean length8.2325581
Min length3

Characters and Unicode

Total characters354
Distinct characters106
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique39 ?
Unique (%)90.7%

Sample

1st row오리불고기, 오리탕
2nd row한우숯불구이 및 갈비탕
3rd row자연산활어회
4th row삼계탕,석쇠불고기
5th row돼지갈비
ValueCountFrequency (%)
바다회,회덮밥 2
 
3.3%
모듬순대 2
 
3.3%
순대전골 2
 
3.3%
돼지고기 2
 
3.3%
청국장 2
 
3.3%
갈비탕 2
 
3.3%
오리불고기 2
 
3.3%
2
 
3.3%
돼지갈비,돌솥밥 1
 
1.7%
전문점 1
 
1.7%
Other values (42) 42
70.0%
2023-12-11T09:33:22.184475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
, 33
 
9.3%
17
 
4.8%
13
 
3.7%
12
 
3.4%
11
 
3.1%
10
 
2.8%
10
 
2.8%
10
 
2.8%
9
 
2.5%
9
 
2.5%
Other values (96) 220
62.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 304
85.9%
Other Punctuation 33
 
9.3%
Space Separator 17
 
4.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
13
 
4.3%
12
 
3.9%
11
 
3.6%
10
 
3.3%
10
 
3.3%
10
 
3.3%
9
 
3.0%
9
 
3.0%
8
 
2.6%
8
 
2.6%
Other values (94) 204
67.1%
Other Punctuation
ValueCountFrequency (%)
, 33
100.0%
Space Separator
ValueCountFrequency (%)
17
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 304
85.9%
Common 50
 
14.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
13
 
4.3%
12
 
3.9%
11
 
3.6%
10
 
3.3%
10
 
3.3%
10
 
3.3%
9
 
3.0%
9
 
3.0%
8
 
2.6%
8
 
2.6%
Other values (94) 204
67.1%
Common
ValueCountFrequency (%)
, 33
66.0%
17
34.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 304
85.9%
ASCII 50
 
14.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
, 33
66.0%
17
34.0%
Hangul
ValueCountFrequency (%)
13
 
4.3%
12
 
3.9%
11
 
3.6%
10
 
3.3%
10
 
3.3%
10
 
3.3%
9
 
3.0%
9
 
3.0%
8
 
2.6%
8
 
2.6%
Other values (94) 204
67.1%

분류
Categorical

IMBALANCE 

Distinct4
Distinct (%)9.3%
Missing0
Missing (%)0.0%
Memory size476.0 B
한식
37 
회집
중식
 
1
일식
 
1

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique2 ?
Unique (%)4.7%

Sample

1st row한식
2nd row한식
3rd row회집
4th row한식
5th row한식

Common Values

ValueCountFrequency (%)
한식 37
86.0%
회집 4
 
9.3%
중식 1
 
2.3%
일식 1
 
2.3%

Length

2023-12-11T09:33:22.319737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:33:22.414968image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
한식 37
86.0%
회집 4
 
9.3%
중식 1
 
2.3%
일식 1
 
2.3%

Correlations

2023-12-11T09:33:22.486372image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업소명주소전화번호주메뉴분류
업소명1.0001.0001.0001.0001.000
주소1.0001.0001.0001.0001.000
전화번호1.0001.0001.0001.0001.000
주메뉴1.0001.0001.0001.0001.000
분류1.0001.0001.0001.0001.000

Missing values

2023-12-11T09:33:19.191208image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T09:33:19.301690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업소명주소전화번호주메뉴분류
0북면자연농원경상남도 계성면 영산계성로 397-5055-521-8858오리불고기, 오리탕한식
1우포한우프라자경상남도 계성면 영산계성로 457055-536-4114한우숯불구이 및 갈비탕한식
2물망초횟집경상남도 남지읍 남지강변길 112055-536-5510자연산활어회회집
3조선옥삼계탕경상남도 남지읍 동포7길 19-1055-526-1044삼계탕,석쇠불고기한식
4화덕본가경상남도 남지읍 남지중앙1길 17-1055-521-8592돼지갈비한식
5월남쌈샤브이야기경상남도 남지읍 남지중앙로 83-1055-526-4296월남쌈샤브한식
6연안바다횟집경상남도 남지읍 남포1길 2055-536-4252모듬회,회덮밥회집
7가현한우생고기전문점경상남도 대지면 경남대로 4897-51055-532-9259한우생고기한식
8향촌가든경상남도 도천면 가마골2길 5055-536-4450촌닭,오리백숙,송이백숙한식
9통일냉면경상남도 도천면 개울길 29-8055-521-8852냉면, 갈비탕한식
업소명주소전화번호주메뉴분류
33선하회경상남도 창녕읍 술정중앙길 18-6 (신우프라자(106,107,108호))055-533-0503바다회,회덮밥회집
34가얏골감자탕경상남도 창녕읍 우포2로 1179055-532-1770김치감자탕,시래기감자탕한식
35창녕한우경상남도 창녕읍 우포2로 1200055-533-3321한우쇠고기, 돼지고기한식
36화왕산풀향기식당경상남도 창녕읍 자하곡길 75055-533-9098수제비,버섯전골한식
37금별식당경상남도 창녕읍 창녕대동길 39055-532-9393아구찜,추어탕한식
38고궁경상남도 창녕읍 탑금당길 16055-532-0335청국장, 오리불고기한식
39주왕산삼계탕경상남도 창녕읍 화왕산1로 69055-533-9985삼계탕한식
40화왕산갈비마을경상남도 창녕읍 화왕산로 18055-532-9292돼지갈비,소갈비한식
41양반청국장경상남도 창녕읍 화왕산로 64055-533-0066청국장 전문점한식
42풀무원식품㈜영산휴게소 한식당경상남도 영산면 장척호수길 56-110055-521-2978장터국밥,김치찌개한식