Overview

Dataset statistics

Number of variables5
Number of observations26
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.1 KiB
Average record size in memory45.1 B

Variable types

Text4
Categorical1

Dataset

Description경상남도 양산시 모범음식점현황으로 업소명, 업수명 소재지, 업종, 주취급음식 등 양산의 모범음식점을 확인할 수 있습니다.
Author경상남도 양산시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=3040405

Alerts

업태 is highly imbalanced (76.5%)Imbalance
업소명 has unique valuesUnique
전화번호 has unique valuesUnique
업소 소재지 has unique valuesUnique

Reproduction

Analysis started2024-04-18 06:05:17.424188
Analysis finished2024-04-18 06:05:19.938235
Duration2.51 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업소명
Text

UNIQUE 

Distinct26
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size340.0 B
2024-04-18T15:05:20.070616image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length10.5
Mean length5.6923077
Min length2

Characters and Unicode

Total characters148
Distinct characters100
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique26 ?
Unique (%)100.0%

Sample

1st row윤동균한방쑥면본점
2nd row진송추어탕 덕계점
3rd row에이원CC점 클럽하우스
4th row종정헌
5th row대각정
ValueCountFrequency (%)
진송추어탕 2
 
6.9%
윤동균한방쑥면본점 1
 
3.4%
감포숯불 1
 
3.4%
경도갈비 1
 
3.4%
통도사휴게소(한.양식 1
 
3.4%
한우한마리 1
 
3.4%
돌담 1
 
3.4%
양산축협중부점식육셀프식당 1
 
3.4%
강변가든 1
 
3.4%
송강민물매운탕 1
 
3.4%
Other values (18) 18
62.1%
2024-04-18T15:05:20.379612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5
 
3.4%
5
 
3.4%
5
 
3.4%
4
 
2.7%
4
 
2.7%
3
 
2.0%
3
 
2.0%
3
 
2.0%
3
 
2.0%
3
 
2.0%
Other values (90) 110
74.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 140
94.6%
Space Separator 3
 
2.0%
Uppercase Letter 2
 
1.4%
Close Punctuation 1
 
0.7%
Other Punctuation 1
 
0.7%
Open Punctuation 1
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5
 
3.6%
5
 
3.6%
5
 
3.6%
4
 
2.9%
4
 
2.9%
3
 
2.1%
3
 
2.1%
3
 
2.1%
3
 
2.1%
3
 
2.1%
Other values (85) 102
72.9%
Space Separator
ValueCountFrequency (%)
3
100.0%
Uppercase Letter
ValueCountFrequency (%)
C 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 140
94.6%
Common 6
 
4.1%
Latin 2
 
1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5
 
3.6%
5
 
3.6%
5
 
3.6%
4
 
2.9%
4
 
2.9%
3
 
2.1%
3
 
2.1%
3
 
2.1%
3
 
2.1%
3
 
2.1%
Other values (85) 102
72.9%
Common
ValueCountFrequency (%)
3
50.0%
) 1
 
16.7%
. 1
 
16.7%
( 1
 
16.7%
Latin
ValueCountFrequency (%)
C 2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 140
94.6%
ASCII 8
 
5.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
5
 
3.6%
5
 
3.6%
5
 
3.6%
4
 
2.9%
4
 
2.9%
3
 
2.1%
3
 
2.1%
3
 
2.1%
3
 
2.1%
3
 
2.1%
Other values (85) 102
72.9%
ASCII
ValueCountFrequency (%)
3
37.5%
C 2
25.0%
) 1
 
12.5%
. 1
 
12.5%
( 1
 
12.5%

전화번호
Text

UNIQUE 

Distinct26
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size340.0 B
2024-04-18T15:05:20.568230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters312
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique26 ?
Unique (%)100.0%

Sample

1st row055-366-7599
2nd row055-365-0022
3rd row055-371-3500
4th row055-366-3373
5th row055-362-4788
ValueCountFrequency (%)
055-366-7599 1
 
3.8%
055-365-0022 1
 
3.8%
055-383-5215 1
 
3.8%
055-382-8051 1
 
3.8%
055-383-6147 1
 
3.8%
055-366-0092 1
 
3.8%
055-389-1995 1
 
3.8%
055-389-2275 1
 
3.8%
055-385-3000 1
 
3.8%
055-382-9282 1
 
3.8%
Other values (16) 16
61.5%
2024-04-18T15:05:20.863249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 68
21.8%
- 52
16.7%
0 43
13.8%
3 39
12.5%
8 26
 
8.3%
6 19
 
6.1%
2 16
 
5.1%
9 14
 
4.5%
7 13
 
4.2%
1 12
 
3.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 260
83.3%
Dash Punctuation 52
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 68
26.2%
0 43
16.5%
3 39
15.0%
8 26
 
10.0%
6 19
 
7.3%
2 16
 
6.2%
9 14
 
5.4%
7 13
 
5.0%
1 12
 
4.6%
4 10
 
3.8%
Dash Punctuation
ValueCountFrequency (%)
- 52
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 312
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 68
21.8%
- 52
16.7%
0 43
13.8%
3 39
12.5%
8 26
 
8.3%
6 19
 
6.1%
2 16
 
5.1%
9 14
 
4.5%
7 13
 
4.2%
1 12
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 312
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 68
21.8%
- 52
16.7%
0 43
13.8%
3 39
12.5%
8 26
 
8.3%
6 19
 
6.1%
2 16
 
5.1%
9 14
 
4.5%
7 13
 
4.2%
1 12
 
3.8%

업소 소재지
Text

UNIQUE 

Distinct26
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size340.0 B
2024-04-18T15:05:21.092032image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length23
Mean length21.615385
Min length17

Characters and Unicode

Total characters562
Distinct characters66
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique26 ?
Unique (%)100.0%

Sample

1st row경상남도 양산시 덕계5길 8(덕계동)
2nd row경상남도 양산시 덕계로 76, 2동 2층(덕계동)
3rd row경상남도 양산시 덕명로 190(매곡동)
4th row경상남도 양산시 동면 석산3길 12
5th row경상남도 양산시 매곡3길 12-3(매곡동)
ValueCountFrequency (%)
경상남도 26
22.2%
양산시 26
22.2%
하북면 4
 
3.4%
상북면 2
 
1.7%
12 2
 
1.7%
물금읍 2
 
1.7%
31-1 1
 
0.9%
경부고속도로 1
 
0.9%
초산6길 1
 
0.9%
양주2길 1
 
0.9%
Other values (51) 51
43.6%
2024-04-18T15:05:21.416466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
91
 
16.2%
31
 
5.5%
29
 
5.2%
28
 
5.0%
28
 
5.0%
27
 
4.8%
27
 
4.8%
26
 
4.6%
21
 
3.7%
1 18
 
3.2%
Other values (56) 236
42.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 352
62.6%
Space Separator 91
 
16.2%
Decimal Number 80
 
14.2%
Close Punctuation 15
 
2.7%
Open Punctuation 15
 
2.7%
Dash Punctuation 6
 
1.1%
Other Punctuation 3
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
31
 
8.8%
29
 
8.2%
28
 
8.0%
28
 
8.0%
27
 
7.7%
27
 
7.7%
26
 
7.4%
21
 
6.0%
16
 
4.5%
15
 
4.3%
Other values (41) 104
29.5%
Decimal Number
ValueCountFrequency (%)
1 18
22.5%
3 14
17.5%
2 14
17.5%
7 8
10.0%
4 5
 
6.2%
6 5
 
6.2%
8 4
 
5.0%
5 4
 
5.0%
9 4
 
5.0%
0 4
 
5.0%
Space Separator
ValueCountFrequency (%)
91
100.0%
Close Punctuation
ValueCountFrequency (%)
) 15
100.0%
Open Punctuation
ValueCountFrequency (%)
( 15
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%
Other Punctuation
ValueCountFrequency (%)
, 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 352
62.6%
Common 210
37.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
31
 
8.8%
29
 
8.2%
28
 
8.0%
28
 
8.0%
27
 
7.7%
27
 
7.7%
26
 
7.4%
21
 
6.0%
16
 
4.5%
15
 
4.3%
Other values (41) 104
29.5%
Common
ValueCountFrequency (%)
91
43.3%
1 18
 
8.6%
) 15
 
7.1%
( 15
 
7.1%
3 14
 
6.7%
2 14
 
6.7%
7 8
 
3.8%
- 6
 
2.9%
4 5
 
2.4%
6 5
 
2.4%
Other values (5) 19
 
9.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 352
62.6%
ASCII 210
37.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
91
43.3%
1 18
 
8.6%
) 15
 
7.1%
( 15
 
7.1%
3 14
 
6.7%
2 14
 
6.7%
7 8
 
3.8%
- 6
 
2.9%
4 5
 
2.4%
6 5
 
2.4%
Other values (5) 19
 
9.0%
Hangul
ValueCountFrequency (%)
31
 
8.8%
29
 
8.2%
28
 
8.0%
28
 
8.0%
27
 
7.7%
27
 
7.7%
26
 
7.4%
21
 
6.0%
16
 
4.5%
15
 
4.3%
Other values (41) 104
29.5%

업태
Categorical

IMBALANCE 

Distinct2
Distinct (%)7.7%
Missing0
Missing (%)0.0%
Memory size340.0 B
한식
25 
양식
 
1

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique1 ?
Unique (%)3.8%

Sample

1st row한식
2nd row한식
3rd row양식
4th row한식
5th row한식

Common Values

ValueCountFrequency (%)
한식 25
96.2%
양식 1
 
3.8%

Length

2024-04-18T15:05:21.539171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-18T15:05:21.633704image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
한식 25
96.2%
양식 1
 
3.8%
Distinct21
Distinct (%)80.8%
Missing0
Missing (%)0.0%
Memory size340.0 B
2024-04-18T15:05:21.778003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length4
Mean length3.7307692
Min length2

Characters and Unicode

Total characters97
Distinct characters53
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique17 ?
Unique (%)65.4%

Sample

1st row한방쑥면
2nd row추어탕
3rd row스테이크
4th row영양돌솥밥
5th row숯불갈비
ValueCountFrequency (%)
숯불갈비 3
 
11.5%
추어탕 2
 
7.7%
소고기구이 2
 
7.7%
복매운탕 2
 
7.7%
민물매운탕 1
 
3.8%
한방쑥면 1
 
3.8%
삼겹살 1
 
3.8%
갈비류 1
 
3.8%
한우국밥 1
 
3.8%
오리백숙 1
 
3.8%
Other values (11) 11
42.3%
2024-04-18T15:05:22.091270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7
 
7.2%
6
 
6.2%
5
 
5.2%
5
 
5.2%
4
 
4.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
Other values (43) 55
56.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 97
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7
 
7.2%
6
 
6.2%
5
 
5.2%
5
 
5.2%
4
 
4.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
Other values (43) 55
56.7%

Most occurring scripts

ValueCountFrequency (%)
Hangul 97
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7
 
7.2%
6
 
6.2%
5
 
5.2%
5
 
5.2%
4
 
4.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
Other values (43) 55
56.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 97
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
7
 
7.2%
6
 
6.2%
5
 
5.2%
5
 
5.2%
4
 
4.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
Other values (43) 55
56.7%

Correlations

2024-04-18T15:05:22.178044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업소명전화번호업소 소재지업태주취급음식
업소명1.0001.0001.0001.0001.000
전화번호1.0001.0001.0001.0001.000
업소 소재지1.0001.0001.0001.0001.000
업태1.0001.0001.0001.0001.000
주취급음식1.0001.0001.0001.0001.000

Missing values

2024-04-18T15:05:19.902338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업소명전화번호업소 소재지업태주취급음식
0윤동균한방쑥면본점055-366-7599경상남도 양산시 덕계5길 8(덕계동)한식한방쑥면
1진송추어탕 덕계점055-365-0022경상남도 양산시 덕계로 76, 2동 2층(덕계동)한식추어탕
2에이원CC점 클럽하우스055-371-3500경상남도 양산시 덕명로 190(매곡동)양식스테이크
3종정헌055-366-3373경상남도 양산시 동면 석산3길 12한식영양돌솥밥
4대각정055-362-4788경상남도 양산시 매곡3길 12-3(매곡동)한식숯불갈비
5수정원055-385-7117경상남도 양산시 명동7길3(명동)한식숯불갈비
6착한낙지 양산점055-363-5456경상남도 양산시 물금읍 동중5길 23, 1층한식낙지요리
7장수녹각삼계탕055-387-2800경상남도 양산시 물금읍 황산로 669한식삼계탕
8청해대복055-389-2345경상남도 양산시 북안남1길 31(북부동)한식복매운탕
9수영양곱창055-388-3413경상남도 양산시 북안남3길 42(북부동)한식양곱창
업소명전화번호업소 소재지업태주취급음식
16송강민물매운탕055-365-1497경상남도 양산시 용암길 17-8한식민물매운탕
17강변가든055-382-9282경상남도 양산시 원동면 원동로 1712한식갈비탕
18양산축협중부점식육셀프식당055-385-3000경상남도 양산시 장터2길 4(중부동)한식소고기구이
19돌담055-389-2275경상남도 양산시 주진3길 12 (주진동)한식오리백숙
20진송추어탕055-389-1995경상남도 양산시 중앙로 293-1, 1층(북정동)한식추어탕
21한우한마리055-366-0092경상남도 양산시 중앙우회로 148(북부동)한식소고기구이
22통도사휴게소(한.양식)055-383-6147경상남도 양산시 하북면 경부고속도로 31-1한식한우국밥
23경도갈비055-382-8051경상남도 양산시 하북면 신평남부길101-2한식갈비류
24감포숯불055-383-5215경상남도 양산시 하북면 초산3길 33-9한식불고기
25고향의봄055-384-8468경상남도 양산시 하북면 초산6길 15한식숯불갈비