Overview

Dataset statistics

Number of variables6
Number of observations371
Missing cells4
Missing cells (%)0.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory17.9 KiB
Average record size in memory49.4 B

Variable types

Numeric1
Categorical2
Text3

Dataset

Description인천광역시 모범음식점현황에 대한 항목( 번호/군구/업소명/주소/전화번호/주메뉴)에 관한 데이터를 제공하는 자료 입니다.
Author인천광역시
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15065207&srcSe=7661IVAWM27C61E190

Alerts

연번 is highly overall correlated with 군구High correlation
군구 is highly overall correlated with 연번High correlation
업태 is highly imbalanced (59.4%)Imbalance
전화번호 has 4 (1.1%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2024-03-18 02:28:03.064458
Analysis finished2024-03-18 02:28:04.885776
Duration1.82 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct371
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean186
Minimum1
Maximum371
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.4 KiB
2024-03-18T11:28:04.948444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile19.5
Q193.5
median186
Q3278.5
95-th percentile352.5
Maximum371
Range370
Interquartile range (IQR)185

Descriptive statistics

Standard deviation107.24272
Coefficient of variation (CV)0.57657374
Kurtosis-1.2
Mean186
Median Absolute Deviation (MAD)93
Skewness0
Sum69006
Variance11501
MonotonicityStrictly increasing
2024-03-18T11:28:05.065560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.3%
246 1
 
0.3%
255 1
 
0.3%
254 1
 
0.3%
253 1
 
0.3%
252 1
 
0.3%
251 1
 
0.3%
250 1
 
0.3%
249 1
 
0.3%
248 1
 
0.3%
Other values (361) 361
97.3%
ValueCountFrequency (%)
1 1
0.3%
2 1
0.3%
3 1
0.3%
4 1
0.3%
5 1
0.3%
6 1
0.3%
7 1
0.3%
8 1
0.3%
9 1
0.3%
10 1
0.3%
ValueCountFrequency (%)
371 1
0.3%
370 1
0.3%
369 1
0.3%
368 1
0.3%
367 1
0.3%
366 1
0.3%
365 1
0.3%
364 1
0.3%
363 1
0.3%
362 1
0.3%

군구
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)2.7%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
미추홀구
82 
부평구
63 
연수구
48 
남동구
45 
강화군
37 
Other values (5)
96 

Length

Max length4
Median length3
Mean length3.0566038
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row강화군
2nd row강화군
3rd row강화군
4th row강화군
5th row강화군

Common Values

ValueCountFrequency (%)
미추홀구 82
22.1%
부평구 63
17.0%
연수구 48
12.9%
남동구 45
12.1%
강화군 37
10.0%
중구 28
 
7.5%
계양구 26
 
7.0%
서구 19
 
5.1%
동구 14
 
3.8%
옹진군 9
 
2.4%

Length

2024-03-18T11:28:05.217047image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-18T11:28:05.348111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
미추홀구 82
22.1%
부평구 63
17.0%
연수구 48
12.9%
남동구 45
12.1%
강화군 37
10.0%
중구 28
 
7.5%
계양구 26
 
7.0%
서구 19
 
5.1%
동구 14
 
3.8%
옹진군 9
 
2.4%
Distinct362
Distinct (%)97.6%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
2024-03-18T11:28:05.567972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length15
Mean length5.9407008
Min length2

Characters and Unicode

Total characters2204
Distinct characters386
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique354 ?
Unique (%)95.4%

Sample

1st row왕자정
2nd row야콘냉면본가집
3rd row새벽해장국
4th row충남서산집
5th row동촌집
ValueCountFrequency (%)
착한낙지 3
 
0.7%
화로구이 3
 
0.7%
금산식당 2
 
0.5%
송도점 2
 
0.5%
삼산점 2
 
0.5%
채선당 2
 
0.5%
명품삼계탕 2
 
0.5%
해촌 2
 
0.5%
함흥관 2
 
0.5%
명가원설농탕 2
 
0.5%
Other values (396) 399
94.8%
2024-03-18T11:28:05.891697image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
51
 
2.3%
46
 
2.1%
41
 
1.9%
37
 
1.7%
34
 
1.5%
) 31
 
1.4%
( 31
 
1.4%
30
 
1.4%
29
 
1.3%
29
 
1.3%
Other values (376) 1845
83.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2057
93.3%
Space Separator 51
 
2.3%
Close Punctuation 31
 
1.4%
Open Punctuation 31
 
1.4%
Uppercase Letter 11
 
0.5%
Decimal Number 10
 
0.5%
Other Punctuation 6
 
0.3%
Lowercase Letter 5
 
0.2%
Other Symbol 1
 
< 0.1%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
46
 
2.2%
41
 
2.0%
37
 
1.8%
34
 
1.7%
30
 
1.5%
29
 
1.4%
29
 
1.4%
28
 
1.4%
26
 
1.3%
25
 
1.2%
Other values (351) 1732
84.2%
Uppercase Letter
ValueCountFrequency (%)
C 3
27.3%
K 2
18.2%
L 2
18.2%
V 1
 
9.1%
X 1
 
9.1%
N 1
 
9.1%
I 1
 
9.1%
Decimal Number
ValueCountFrequency (%)
1 2
20.0%
0 2
20.0%
5 2
20.0%
6 1
10.0%
8 1
10.0%
9 1
10.0%
4 1
10.0%
Lowercase Letter
ValueCountFrequency (%)
a 2
40.0%
i 1
20.0%
s 1
20.0%
t 1
20.0%
Other Punctuation
ValueCountFrequency (%)
. 4
66.7%
& 2
33.3%
Space Separator
ValueCountFrequency (%)
51
100.0%
Close Punctuation
ValueCountFrequency (%)
) 31
100.0%
Open Punctuation
ValueCountFrequency (%)
( 31
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2043
92.7%
Common 129
 
5.9%
Latin 17
 
0.8%
Han 15
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
46
 
2.3%
41
 
2.0%
37
 
1.8%
34
 
1.7%
30
 
1.5%
29
 
1.4%
29
 
1.4%
28
 
1.4%
26
 
1.3%
25
 
1.2%
Other values (339) 1718
84.1%
Han
ValueCountFrequency (%)
2
13.3%
2
13.3%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
Other values (3) 3
20.0%
Common
ValueCountFrequency (%)
51
39.5%
) 31
24.0%
( 31
24.0%
. 4
 
3.1%
1 2
 
1.6%
0 2
 
1.6%
5 2
 
1.6%
& 2
 
1.6%
6 1
 
0.8%
8 1
 
0.8%
Other values (2) 2
 
1.6%
Latin
ValueCountFrequency (%)
C 3
17.6%
K 2
11.8%
L 2
11.8%
a 2
11.8%
V 1
 
5.9%
X 1
 
5.9%
i 1
 
5.9%
s 1
 
5.9%
t 1
 
5.9%
N 1
 
5.9%
Other values (2) 2
11.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2042
92.6%
ASCII 145
 
6.6%
CJK 14
 
0.6%
None 1
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
51
35.2%
) 31
21.4%
( 31
21.4%
. 4
 
2.8%
C 3
 
2.1%
1 2
 
1.4%
K 2
 
1.4%
0 2
 
1.4%
L 2
 
1.4%
a 2
 
1.4%
Other values (13) 15
 
10.3%
Hangul
ValueCountFrequency (%)
46
 
2.3%
41
 
2.0%
37
 
1.8%
34
 
1.7%
30
 
1.5%
29
 
1.4%
29
 
1.4%
28
 
1.4%
26
 
1.3%
25
 
1.2%
Other values (338) 1717
84.1%
CJK
ValueCountFrequency (%)
2
14.3%
2
14.3%
1
7.1%
1
7.1%
1
7.1%
1
7.1%
1
7.1%
1
7.1%
1
7.1%
1
7.1%
Other values (2) 2
14.3%
None
ValueCountFrequency (%)
1
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%

전화번호
Text

MISSING 

Distinct366
Distinct (%)99.7%
Missing4
Missing (%)1.1%
Memory size3.0 KiB
2024-03-18T11:28:06.174471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length12
Mean length12.324251
Min length12

Characters and Unicode

Total characters4523
Distinct characters13
Distinct categories4 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique365 ?
Unique (%)99.5%

Sample

1st row032-933-7807
2nd row032-937-7771
3rd row032-934-8882
4th row032-937-3996
5th row032-937-8144
ValueCountFrequency (%)
032 47
 
10.5%
032-772-0062 2
 
0.4%
438 2
 
0.4%
442 2
 
0.4%
471 2
 
0.4%
421 2
 
0.4%
032-511 2
 
0.4%
461 2
 
0.4%
8800 2
 
0.4%
032-502-5802 1
 
0.2%
Other values (384) 384
85.7%
2024-03-18T11:28:06.506683image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 734
16.2%
3 661
14.6%
2 628
13.9%
0 607
13.4%
8 321
7.1%
5 299
6.6%
7 275
 
6.1%
6 230
 
5.1%
1 223
 
4.9%
4 220
 
4.9%
Other values (3) 325
7.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3671
81.2%
Dash Punctuation 734
 
16.2%
Space Separator 117
 
2.6%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 661
18.0%
2 628
17.1%
0 607
16.5%
8 321
8.7%
5 299
8.1%
7 275
7.5%
6 230
 
6.3%
1 223
 
6.1%
4 220
 
6.0%
9 207
 
5.6%
Dash Punctuation
ValueCountFrequency (%)
- 734
100.0%
Space Separator
ValueCountFrequency (%)
117
100.0%
Other Punctuation
ValueCountFrequency (%)
' 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4523
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 734
16.2%
3 661
14.6%
2 628
13.9%
0 607
13.4%
8 321
7.1%
5 299
6.6%
7 275
 
6.1%
6 230
 
5.1%
1 223
 
4.9%
4 220
 
4.9%
Other values (3) 325
7.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4523
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 734
16.2%
3 661
14.6%
2 628
13.9%
0 607
13.4%
8 321
7.1%
5 299
6.6%
7 275
 
6.1%
6 230
 
5.1%
1 223
 
4.9%
4 220
 
4.9%
Other values (3) 325
7.2%

업태
Categorical

IMBALANCE 

Distinct15
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
한식
279 
중국식
 
23
일식
 
17
탕류(보신용)
 
12
식육(숯불구이)
 
12
Other values (10)
28 

Length

Max length8
Median length2
Mean length2.458221
Min length2

Unique

Unique4 ?
Unique (%)1.1%

Sample

1st row한식
2nd row한식
3rd row한식
4th row한식
5th row한식

Common Values

ValueCountFrequency (%)
한식 279
75.2%
중국식 23
 
6.2%
일식 17
 
4.6%
탕류(보신용) 12
 
3.2%
식육(숯불구이) 12
 
3.2%
경양식 8
 
2.2%
뷔페식 5
 
1.3%
중식 3
 
0.8%
기타 3
 
0.8%
분식 3
 
0.8%
Other values (5) 6
 
1.6%

Length

2024-03-18T11:28:06.629467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
한식 279
75.2%
중국식 23
 
6.2%
일식 17
 
4.6%
탕류(보신용 12
 
3.2%
식육(숯불구이 12
 
3.2%
경양식 8
 
2.2%
뷔페식 5
 
1.3%
중식 3
 
0.8%
기타 3
 
0.8%
분식 3
 
0.8%
Other values (5) 6
 
1.6%
Distinct255
Distinct (%)68.7%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
2024-03-18T11:28:06.846574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length15
Mean length5.0296496
Min length1

Characters and Unicode

Total characters1866
Distinct characters236
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique206 ?
Unique (%)55.5%

Sample

1st row묵밥
2nd row야콘냉면
3rd row해장국
4th row간장게장
5th row꽃게
ValueCountFrequency (%)
삼계탕 13
 
2.9%
보쌈 10
 
2.2%
돼지갈비 9
 
2.0%
설렁탕 9
 
2.0%
삼겹살 9
 
2.0%
한정식 9
 
2.0%
갈비 7
 
1.6%
장어 7
 
1.6%
자장면 6
 
1.3%
탕수육 6
 
1.3%
Other values (239) 365
81.1%
2024-03-18T11:28:07.176483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
, 106
 
5.7%
105
 
5.6%
79
 
4.2%
58
 
3.1%
56
 
3.0%
48
 
2.6%
42
 
2.3%
34
 
1.8%
33
 
1.8%
30
 
1.6%
Other values (226) 1275
68.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1674
89.7%
Other Punctuation 109
 
5.8%
Space Separator 79
 
4.2%
Close Punctuation 2
 
0.1%
Open Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
105
 
6.3%
58
 
3.5%
56
 
3.3%
48
 
2.9%
42
 
2.5%
34
 
2.0%
33
 
2.0%
30
 
1.8%
29
 
1.7%
28
 
1.7%
Other values (220) 1211
72.3%
Other Punctuation
ValueCountFrequency (%)
, 106
97.2%
' 2
 
1.8%
· 1
 
0.9%
Space Separator
ValueCountFrequency (%)
79
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1674
89.7%
Common 192
 
10.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
105
 
6.3%
58
 
3.5%
56
 
3.3%
48
 
2.9%
42
 
2.5%
34
 
2.0%
33
 
2.0%
30
 
1.8%
29
 
1.7%
28
 
1.7%
Other values (220) 1211
72.3%
Common
ValueCountFrequency (%)
, 106
55.2%
79
41.1%
) 2
 
1.0%
( 2
 
1.0%
' 2
 
1.0%
· 1
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1674
89.7%
ASCII 191
 
10.2%
None 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
, 106
55.5%
79
41.4%
) 2
 
1.0%
( 2
 
1.0%
' 2
 
1.0%
Hangul
ValueCountFrequency (%)
105
 
6.3%
58
 
3.5%
56
 
3.3%
48
 
2.9%
42
 
2.5%
34
 
2.0%
33
 
2.0%
30
 
1.8%
29
 
1.7%
28
 
1.7%
Other values (220) 1211
72.3%
None
ValueCountFrequency (%)
· 1
100.0%

Interactions

2024-03-18T11:28:04.603978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-18T11:28:07.258162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번군구업태
연번1.0000.9840.330
군구0.9841.0000.321
업태0.3300.3211.000
2024-03-18T11:28:07.353776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업태군구
업태1.0000.124
군구0.1241.000
2024-03-18T11:28:07.449330image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번군구업태
연번1.0000.7660.130
군구0.7661.0000.124
업태0.1300.1241.000

Missing values

2024-03-18T11:28:04.766345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-18T11:28:04.848718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번군구업소명전화번호업태주메뉴
01강화군왕자정032-933-7807한식묵밥
12강화군야콘냉면본가집032-937-7771한식야콘냉면
23강화군새벽해장국032-934-8882한식해장국
34강화군충남서산집032-937-3996한식간장게장
45강화군동촌집032-937-8144한식꽃게
56강화군해안도시032-937-9506한식장어
67강화군돌캐032-932-3221한식꽃게
78강화군한자리한턱032-937-7889한식게장
89강화군국화호수032-933-8264한식참게
910강화군산촌032-933-0051한식백숙
연번군구업소명전화번호업태주메뉴
361362서구어부횟집032-568-2922일식활어회,참치회
362363서구씨제이프레시웨이㈜베어즈베스트클럽하우스032-560-2010한식통큰해물짬뽕
363364서구강선생막국수032-576-3939한식막국수
364365서구경복궁송림점(주)엔타스시스템032-574-7767한식소갈비
365366서구삿뽀로송림점(주)엔타스시스템032-583-7776일식참치스페셜
366367서구종가집032-562-0621한식한우등심
367368서구(주)촌장032-562-4343한식왕길비
368369서구교동찹쌀순대032-563-3838한식찹쌀순대,순대국
369370서구채선당 플러스032-562-7577한식즉석칼국수
370371서구숙성명작032-566-9298한식한우등심구이