Overview

Dataset statistics

Number of variables: 6
Number of observations: 41
Missing cells: 41
Missing cells (%): 16.7%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.1 KiB
Average record size in memory: 53.2 B
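The percentage and size figures above follow from the raw counts. A minimal sketch, assuming the missing-cell share is measured against all cells (variables times observations) and that the rounded average record size is accurate enough to recover the total; the variable names are illustrative only:

```python
# Counts taken from the dataset statistics above.
n_variables = 6
n_observations = 41
missing_cells = 41

# Share of missing cells over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
print(f"Missing cells (%): {missing_pct:.1f}%")

# Approximate total size implied by the average record size.
avg_record_bytes = 53.2
total_kib = avg_record_bytes * n_observations / 1024
print(f"Total size in memory: {total_kib:.1f} KiB")
```

One fully empty column out of six accounts for the whole missing share, which is why it equals one sixth.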

Variable types

Numeric: 1
Text: 3
Categorical: 1
Unsupported: 1

Dataset

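Description: Designated exemplary restaurants in Dangjin City (당진시). Columns: 연번 (serial number), 업소명 (business name), 소재지 (location), 업태 (business type), 주취급음식 (main dish), 비고 (remarks). Data reference date: 2023-04-24.
URL: https://www.data.go.kr/data/15052874/fileData.do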

Alerts

비고 has 41 (100.0%) missing values (Missing)
연번 has unique values (Unique)
업소명 has unique values (Unique)
비고 is an unsupported type, check if it needs cleaning or further analysis (Unsupported)

Reproduction

Analysis started: 2023-12-12 06:36:32.437221
Analysis finished: 2023-12-12 06:36:33.096200
Duration: 0.66 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct: 41
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 21
Minimum: 1
Maximum: 41
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 501.0 B

Quantile statistics

Minimum: 1
5-th percentile: 3
Q1: 11
Median: 21
Q3: 31
95-th percentile: 39
Maximum: 41
Range: 40
Interquartile range (IQR): 20

Descriptive statistics

Standard deviation: 11.979149
Coefficient of variation (CV): 0.57043565
Kurtosis: -1.2
Mean: 21
Median Absolute Deviation (MAD): 10
Skewness: 0
Sum: 861
Variance: 143.5
Monotonicity: Strictly increasing
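The quantile and descriptive statistics for 연번 can be checked by hand. The sketch below assumes the column is simply the serial numbers 1 to 41 (consistent with the reported minimum, maximum, and distinct count), uses the sample variance (n - 1 denominator), and interpolates percentiles linearly; the `percentile` helper is hypothetical and not part of any library:

```python
import statistics

# Assumption: 연번 holds the serial numbers 1..41.
values = list(range(1, 42))


def percentile(sorted_values, p):
    """Percentile using linear interpolation between neighbouring ranks."""
    position = (len(sorted_values) - 1) * p
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])


mean = statistics.mean(values)
median = statistics.median(values)
variance = statistics.variance(values)  # sample variance (n - 1)
std_dev = statistics.stdev(values)
cv = std_dev / mean                     # coefficient of variation
mad = statistics.median([abs(v - median) for v in values])
p5, q1, q3, p95 = (percentile(values, p) for p in (0.05, 0.25, 0.75, 0.95))

print(f"mean={mean} median={median} sum={sum(values)}")
print(f"variance={variance} std={std_dev:.6f} cv={cv:.8f} mad={mad}")
print(f"p5={p5:g} q1={q1:g} q3={q3:g} p95={p95:g} iqr={q3 - q1:g}")
```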
Histogram with fixed size bins (bins=41)

Common values

Value | Count | Frequency (%)
1 | 1 | 2.4%
32 | 1 | 2.4%
24 | 1 | 2.4%
25 | 1 | 2.4%
26 | 1 | 2.4%
27 | 1 | 2.4%
28 | 1 | 2.4%
29 | 1 | 2.4%
30 | 1 | 2.4%
31 | 1 | 2.4%
Other values (31) | 31 | 75.6%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 2.4%
2 | 1 | 2.4%
3 | 1 | 2.4%
4 | 1 | 2.4%
5 | 1 | 2.4%
6 | 1 | 2.4%
7 | 1 | 2.4%
8 | 1 | 2.4%
9 | 1 | 2.4%
10 | 1 | 2.4%

Maximum 10 values

Value | Count | Frequency (%)
41 | 1 | 2.4%
40 | 1 | 2.4%
39 | 1 | 2.4%
38 | 1 | 2.4%
37 | 1 | 2.4%
36 | 1 | 2.4%
35 | 1 | 2.4%
34 | 1 | 2.4%
33 | 1 | 2.4%
32 | 1 | 2.4%

업소명
Text

UNIQUE 

Distinct: 41
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 460.0 B

Length

Max length: 13
Median length: 9
Mean length: 5.2926829
Min length: 2

Characters and Unicode

Total characters: 217
Distinct characters: 129
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
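As an illustration of that kind of analysis, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. It only covers the five sample values listed for 업소명 in this report, so its counts are smaller than the full-column figures; the two-letter codes `Lo` and `Po` correspond to the report's "Other Letter" and "Other Punctuation" labels:

```python
import unicodedata
from collections import Counter

# The five sample values shown for 업소명 in this report.
names = ["본가건하은칼국수", "당진불고기", "숯불왕꼼장어", "굴세상", "미향.밤배"]

# Count the Unicode general category of every character.
category_counts = Counter(
    unicodedata.category(ch) for name in names for ch in name
)

for category, count in category_counts.most_common():
    print(category, count)
```

The standard library exposes the general category but not the script or block of a code point, so reproducing the script and block tables would need an additional data source.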

Unique

Unique: 41
Unique (%): 100.0%

Sample

1st row: 본가건하은칼국수
2nd row: 당진불고기
3rd row: 숯불왕꼼장어
4th row: 굴세상
5th row: 미향.밤배

Value | Count | Frequency (%)
본가건하은칼국수 | 1 | 2.4%
남원골추어탕 | 1 | 2.4%
소들곱창 | 1 | 2.4%
팔각정횟집 | 1 | 2.4%
원당풍천장어 | 1 | 2.4%
수참치 | 1 | 2.4%
담쟝동치미냉면 | 1 | 2.4%
원조전라도호남광주목포횟집 | 1 | 2.4%
장어가 | 1 | 2.4%
동가 | 1 | 2.4%
Other values (31) | 31 | 75.6%

Most occurring characters

ValueCountFrequency (%)
7
 
3.2%
6
 
2.8%
6
 
2.8%
5
 
2.3%
5
 
2.3%
5
 
2.3%
4
 
1.8%
4
 
1.8%
4
 
1.8%
4
 
1.8%
Other values (119) 167
77.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 214 | 98.6%
Close Punctuation | 1 | 0.5%
Other Punctuation | 1 | 0.5%
Open Punctuation | 1 | 0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7
 
3.3%
6
 
2.8%
6
 
2.8%
5
 
2.3%
5
 
2.3%
5
 
2.3%
4
 
1.9%
4
 
1.9%
4
 
1.9%
4
 
1.9%
Other values (116) 164
76.6%
Close Punctuation
Value | Count | Frequency (%)
) | 1 | 100.0%
Other Punctuation
Value | Count | Frequency (%)
. | 1 | 100.0%
Open Punctuation
Value | Count | Frequency (%)
( | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 214 | 98.6%
Common | 3 | 1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7
 
3.3%
6
 
2.8%
6
 
2.8%
5
 
2.3%
5
 
2.3%
5
 
2.3%
4
 
1.9%
4
 
1.9%
4
 
1.9%
4
 
1.9%
Other values (116) 164
76.6%
Common
Value | Count | Frequency (%)
) | 1 | 33.3%
. | 1 | 33.3%
( | 1 | 33.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 214 | 98.6%
ASCII | 3 | 1.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
7
 
3.3%
6
 
2.8%
6
 
2.8%
5
 
2.3%
5
 
2.3%
5
 
2.3%
4
 
1.9%
4
 
1.9%
4
 
1.9%
4
 
1.9%
Other values (116) 164
76.6%
ASCII
Value | Count | Frequency (%)
) | 1 | 33.3%
. | 1 | 33.3%
( | 1 | 33.3%

소재지
Text

Distinct: 40
Distinct (%): 97.6%
Missing: 0
Missing (%): 0.0%
Memory size: 460.0 B

Length

Max length: 35
Median length: 27
Mean length: 23.195122
Min length: 15

Characters and Unicode

Total characters: 951
Distinct characters: 82
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 39
Unique (%): 95.1%

Sample

1st row: 충청남도 당진시 먹거리길 120-28
2nd row: 충청남도 당진시 무수동1길 25-26
3rd row: 충청남도 당진시 신평길 91
4th row: 충청남도 당진시 당진중앙1로 50 (읍내동)
5th row: 충청남도 당진시 남부로 122 (채운동)

Value | Count | Frequency (%)
충청남도 | 41 | 20.0%
당진시 | 41 | 20.0%
읍내동 | 7 | 3.4%
채운동 | 5 | 2.4%
먹거리길 | 5 | 2.4%
남부로 | 4 | 2.0%
수청동 | 4 | 2.0%
대덕동 | 4 | 2.0%
송악읍 | 3 | 1.5%
신평면 | 3 | 1.5%
Other values (77) | 88 | 42.9%

Most occurring characters

ValueCountFrequency (%)
164
17.2%
46
 
4.8%
46
 
4.8%
45
 
4.7%
45
 
4.7%
1 44
 
4.6%
44
 
4.6%
41
 
4.3%
41
 
4.3%
2 33
 
3.5%
Other values (72) 402
42.3%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 548 | 57.6%
Space Separator | 164 | 17.2%
Decimal Number | 164 | 17.2%
Open Punctuation | 25 | 2.6%
Close Punctuation | 25 | 2.6%
Dash Punctuation | 19 | 2.0%
Other Punctuation | 6 | 0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
46
 
8.4%
46
 
8.4%
45
 
8.2%
45
 
8.2%
44
 
8.0%
41
 
7.5%
41
 
7.5%
32
 
5.8%
22
 
4.0%
21
 
3.8%
Other values (57) 165
30.1%
Decimal Number
Value | Count | Frequency (%)
1 | 44 | 26.8%
2 | 33 | 20.1%
0 | 15 | 9.1%
3 | 15 | 9.1%
5 | 14 | 8.5%
4 | 11 | 6.7%
7 | 11 | 6.7%
8 | 9 | 5.5%
9 | 6 | 3.7%
6 | 6 | 3.7%
Space Separator
Value | Count | Frequency (%)
(space) | 164 | 100.0%
Open Punctuation
Value | Count | Frequency (%)
( | 25 | 100.0%
Close Punctuation
Value | Count | Frequency (%)
) | 25 | 100.0%
Dash Punctuation
Value | Count | Frequency (%)
- | 19 | 100.0%
Other Punctuation
Value | Count | Frequency (%)
, | 6 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 548 | 57.6%
Common | 403 | 42.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
46
 
8.4%
46
 
8.4%
45
 
8.2%
45
 
8.2%
44
 
8.0%
41
 
7.5%
41
 
7.5%
32
 
5.8%
22
 
4.0%
21
 
3.8%
Other values (57) 165
30.1%
Common
Value | Count | Frequency (%)
(space) | 164 | 40.7%
1 | 44 | 10.9%
2 | 33 | 8.2%
( | 25 | 6.2%
) | 25 | 6.2%
- | 19 | 4.7%
0 | 15 | 3.7%
3 | 15 | 3.7%
5 | 14 | 3.5%
4 | 11 | 2.7%
Other values (5) | 38 | 9.4%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 548 | 57.6%
ASCII | 403 | 42.4%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 164 | 40.7%
1 | 44 | 10.9%
2 | 33 | 8.2%
( | 25 | 6.2%
) | 25 | 6.2%
- | 19 | 4.7%
0 | 15 | 3.7%
3 | 15 | 3.7%
5 | 14 | 3.5%
4 | 11 | 2.7%
Other values (5) | 38 | 9.4%
Hangul
ValueCountFrequency (%)
46
 
8.4%
46
 
8.4%
45
 
8.2%
45
 
8.2%
44
 
8.0%
41
 
7.5%
41
 
7.5%
32
 
5.8%
22
 
4.0%
21
 
3.8%
Other values (57) 165
30.1%

업태
Categorical

Distinct: 6
Distinct (%): 14.6%
Missing: 0
Missing (%): 0.0%
Memory size: 460.0 B

Length

Max length: 3
Median length: 2
Mean length: 2.0731707
Min length: 2

Unique

Unique: 1
Unique (%): 2.4%

Sample

1st row: 분식
2nd row: 한식
3rd row: 한식
4th row: 한식
5th row: 한식

Common Values

Value | Count | Frequency (%)
한식 | 30 | 73.2%
분식 | 3 | 7.3%
회집 | 3 | 7.3%
중국식 | 2 | 4.9%
일식 | 2 | 4.9%
뷔페식 | 1 | 2.4%
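The summary figures for 업태 can be derived from the value counts alone. A minimal sketch, assuming "Distinct" counts the different values, "Unique" counts values that occur exactly once, and the mean length is weighted by how often each value occurs:

```python
from collections import Counter

# Value counts as listed in the Common Values table above.
counts = Counter({"한식": 30, "분식": 3, "회집": 3, "중국식": 2, "일식": 2, "뷔페식": 1})

n_rows = sum(counts.values())
distinct = len(counts)                                  # different values
unique = sum(1 for c in counts.values() if c == 1)      # values seen once
frequency_pct = {value: round(100 * c / n_rows, 1) for value, c in counts.items()}
mean_length = sum(len(value) * c for value, c in counts.items()) / n_rows

print(f"rows={n_rows} distinct={distinct} unique={unique}")
print(frequency_pct)
print(f"mean length={mean_length:.7f}")
```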

Length

Histogram of lengths of the category

Common Values (Plot)

주취급음식
Text

Distinct: 32
Distinct (%): 78.0%
Missing: 0
Missing (%): 0.0%
Memory size: 460.0 B

Length

Max length: 10
Median length: 7
Mean length: 3.9268293
Min length: 2

Characters and Unicode

Total characters: 161
Distinct characters: 72
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 25
Unique (%): 61.0%

Sample

1st row: 칼국수
2nd row: 불고기
3rd row: 꼼장어
4th row: 굴국밥
5th row: 한정식

Value | Count | Frequency (%)
활어회 | 4 | 9.8%
한정식 | 2 | 4.9%
짜장면 | 2 | 4.9%
콩나물국밥 | 2 | 4.9%
추어탕 | 2 | 4.9%
칼국수 | 2 | 4.9%
갈비 | 2 | 4.9%
뼈다귀해장국,감자탕 | 1 | 2.4%
곱창전골 | 1 | 2.4%
삼겹살 | 1 | 2.4%
Other values (22) | 22 | 53.7%

Most occurring characters

ValueCountFrequency (%)
9
 
5.6%
7
 
4.3%
6
 
3.7%
, 6
 
3.7%
6
 
3.7%
5
 
3.1%
5
 
3.1%
5
 
3.1%
5
 
3.1%
4
 
2.5%
Other values (62) 103
64.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 154 | 95.7%
Other Punctuation | 6 | 3.7%
Space Separator | 1 | 0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9
 
5.8%
7
 
4.5%
6
 
3.9%
6
 
3.9%
5
 
3.2%
5
 
3.2%
5
 
3.2%
5
 
3.2%
4
 
2.6%
4
 
2.6%
Other values (60) 98
63.6%
Other Punctuation
Value | Count | Frequency (%)
, | 6 | 100.0%
Space Separator
Value | Count | Frequency (%)
(space) | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 154 | 95.7%
Common | 7 | 4.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9
 
5.8%
7
 
4.5%
6
 
3.9%
6
 
3.9%
5
 
3.2%
5
 
3.2%
5
 
3.2%
5
 
3.2%
4
 
2.6%
4
 
2.6%
Other values (60) 98
63.6%
Common
Value | Count | Frequency (%)
, | 6 | 85.7%
(space) | 1 | 14.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 154 | 95.7%
ASCII | 7 | 4.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
9
 
5.8%
7
 
4.5%
6
 
3.9%
6
 
3.9%
5
 
3.2%
5
 
3.2%
5
 
3.2%
5
 
3.2%
4
 
2.6%
4
 
2.6%
Other values (60) 98
63.6%
ASCII
Value | Count | Frequency (%)
, | 6 | 85.7%
(space) | 1 | 14.3%

비고
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 41
Missing (%): 100.0%
Memory size: 501.0 B

Interactions


Correlations

Variable | 연번 | 업소명 | 소재지 | 업태 | 주취급음식
연번 | 1.000 | 1.000 | 0.945 | 0.245 | 0.328
업소명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
소재지 | 0.945 | 1.000 | 1.000 | 0.000 | 0.984
업태 | 0.245 | 1.000 | 0.000 | 1.000 | 0.838
주취급음식 | 0.328 | 1.000 | 0.984 | 0.838 | 1.000

Variable | 연번 | 업태
연번 | 1.000 | 0.000
업태 | 0.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
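The same nullity information can be tabulated without a plot. The sketch below is a hedged illustration rather than the report's own method: it uses only three rows copied from the sample in this report, represents `<NA>` as `None`, and flags columns that are empty in every row:

```python
# Column names and a few rows from the sample in this report;
# None stands for the <NA> marker.
columns = ["연번", "업소명", "소재지", "업태", "주취급음식", "비고"]
rows = [
    (1, "본가건하은칼국수", "충청남도 당진시 먹거리길 120-28", "분식", "칼국수", None),
    (2, "당진불고기", "충청남도 당진시 무수동1길 25-26", "한식", "불고기", None),
    (3, "숯불왕꼼장어", "충청남도 당진시 신평길 91", "한식", "꼼장어", None),
]

# Count missing entries per column.
missing_per_column = {
    name: sum(1 for row in rows if row[i] is None)
    for i, name in enumerate(columns)
}

# Columns with no values at all are candidates for dropping.
fully_missing = [name for name, n in missing_per_column.items() if n == len(rows)]

print(missing_per_column)
print("Fully missing:", fully_missing)
```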

Sample

First rows

Index | 연번 | 업소명 | 소재지 | 업태 | 주취급음식 | 비고
0 | 1 | 본가건하은칼국수 | 충청남도 당진시 먹거리길 120-28 | 분식 | 칼국수 | <NA>
1 | 2 | 당진불고기 | 충청남도 당진시 무수동1길 25-26 | 한식 | 불고기 | <NA>
2 | 3 | 숯불왕꼼장어 | 충청남도 당진시 신평길 91 | 한식 | 꼼장어 | <NA>
3 | 4 | 굴세상 | 충청남도 당진시 당진중앙1로 50 (읍내동) | 한식 | 굴국밥 | <NA>
4 | 5 | 미향.밤배 | 충청남도 당진시 남부로 122 (채운동) | 한식 | 한정식 | <NA>
5 | 6 | 용왕회집 | 충청남도 당진시 석문면 장고항로 309 | 회집 | 활어회 | <NA>
6 | 7 | 합덕동천홍 | 충청남도 당진시 합덕읍 남부로 1875 | 중국식 | 짜장면 | <NA>
7 | 8 | 옹기촌 | 충청남도 당진시 무수동안길 13 (읍내동) | 한식 | 우렁쌈밥,갈비 | <NA>
8 | 9 | 착한코다리 | 충청남도 당진시 시곡로 352 (시곡동) | 분식 | 낙지볶음 | <NA>
9 | 10 | 국제가든 | 충청남도 당진시 합덕읍 조정이2길 55 | 한식 | 첨삼백숙 | <NA>

Last rows

Index | 연번 | 업소명 | 소재지 | 업태 | 주취급음식 | 비고
31 | 32 | 장수옥설렁탕 | 충청남도 당진시 서해로 6298-1 (시곡동) | 한식 | 설렁탕 | <NA>
32 | 33 | 현대옥당진시청점 | 충청남도 당진시 무수동1길 25-60 (읍내동) | 한식 | 콩나물국밥 | <NA>
33 | 34 | 신벌떼해장국 | 충청남도 당진시 무수동7길 123 (읍내동) | 한식 | 뼈다귀해장국,감자탕 | <NA>
34 | 35 | 고메샤브 | 충청남도 당진시 시청1로 33 (수청동) | 한식 | 소고기샤브 | <NA>
35 | 36 | 롯대정육식당 | 충청남도 당진시 대덕1로1길 20-30, 1층 (대덕동) | 한식 | 삼겹살 | <NA>
36 | 37 | 갈비명가 | 충청남도 당진시 면천면 성하로 271 | 한식 | 갈비 | <NA>
37 | 38 | 당진냉면갈비 | 충청남도 당진시 남부로 252 (수청동, 1,2층) | 한식 | 갈비 | <NA>
38 | 39 | 해화가든 | 충청남도 당진시 송산로 270-9 (원당동) | 한식 | 갈비,곱창 | <NA>
39 | 40 | 전주콩뿌리콩나물국밥 | 충청남도 당진시 북문로1길 41-17 (읍내동) | 한식 | 콩나물국밥 | <NA>
40 | 41 | 우렁이박사 | 충청남도 당진시 신평면 샛터로 7-1 | 한식 | 우렁쌈밥 | <NA>