Overview

Dataset statistics

Number of variables: 7
Number of observations: 103
Missing cells: 16
Missing cells (%): 2.2%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 5.9 KiB
Average record size in memory: 58.3 B
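
The percentages above are simple ratios. A minimal sketch, assuming (an inference from the reported numbers, not something the report states) that the total cell count is variables × observations; the helper name `pct` is hypothetical:

```python
def pct(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)


n_variables = 7
n_observations = 103
missing_cells = 16

# Assumed denominator: every variable has one cell per observation.
total_cells = n_variables * n_observations  # 7 * 103 = 721

print(pct(missing_cells, total_cells))     # 2.2, share of all cells
print(pct(missing_cells, n_observations))  # 15.5, share of rows in one column
```

The second figure matches the 전화번호 alert, which suggests all 16 missing cells sit in that one column.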

Variable types

Numeric: 1
Text: 4
Categorical: 1
DateTime: 1

Dataset

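Description: Restaurant data compiled to discover distinctive restaurants in Seo-gu, Incheon Metropolitan City, and to revitalize the local economy. It covers restaurants tucked away throughout Seo-gu that were recommended by Seo-gu residents (business name, location, phone number, signature menu, business type, etc.).
Author: Seo-gu, Incheon Metropolitan City (인천광역시 서구)
URL: https://data.incheon.go.kr/findData/publicDataDetail?dataId=15068557&srcSe=7661IVAWM27C61E190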

Alerts

데이터기준일자 has a constant value "2022-09-06" (Constant)
업태 is highly imbalanced (68.2%) (Imbalance)
전화번호 has 16 (15.5%) missing values (Missing)
연번 has unique values (Unique)
업소명 has unique values (Unique)

Reproduction

Analysis started: 2024-01-28 16:14:17.728063
Analysis finished: 2024-01-28 16:14:18.462163
Duration: 0.73 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번 (serial number)
Real number (ℝ)

UNIQUE 

Distinct: 103
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 52
Minimum: 1
Maximum: 103
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 1
5th percentile: 6.1
Q1: 26.5
Median: 52
Q3: 77.5
95th percentile: 97.9
Maximum: 103
Range: 102
Interquartile range (IQR): 51
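
Since 연번 holds the integers 1 through 103, each exactly once, these quantiles can be checked by hand. A sketch using the standard-library `statistics.quantiles` with `method="inclusive"`, which interpolates linearly between sorted values; that this is the same interpolation the profiler uses is an assumption, although the numbers agree:

```python
import statistics

# Assumed reconstruction of the column: the integers 1..103.
values = list(range(1, 104))

# Quartiles (n=4) and the 5th/95th percentiles (first and last of n=20 cuts).
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
ventiles = statistics.quantiles(values, n=20, method="inclusive")
p5, p95 = ventiles[0], ventiles[-1]

iqr = q3 - q1
value_range = max(values) - min(values)

print(p5, q1, median, q3, p95)  # 6.1 26.5 52.0 77.5 97.9
print(value_range, iqr)         # 102 51.0
```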

Descriptive statistics

Standard deviation: 29.877528
Coefficient of variation (CV): 0.57456784
Kurtosis: -1.2
Mean: 52
Median Absolute Deviation (MAD): 26
Skewness: 0
Sum: 5356
Variance: 892.66667
Monotonicity: Strictly increasing
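
Most of these figures can be reproduced from the integers 1 through 103 with the standard library alone. A sketch, assuming the reported variance and standard deviation are the sample (n − 1) versions, which is what the numbers are consistent with; kurtosis and skewness are left out because their exact estimator is not stated in the report:

```python
import statistics

# Assumed reconstruction of the column: the integers 1..103.
values = list(range(1, 104))

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (divides by n - 1)
std = statistics.stdev(values)          # sample standard deviation
cv = std / mean                         # coefficient of variation

# Median absolute deviation: median distance from the median.
center = statistics.median(values)
mad = statistics.median(abs(v - center) for v in values)

# Strictly increasing: every value is larger than its predecessor.
strictly_increasing = all(a < b for a, b in zip(values, values[1:]))

print(total, mean, mad, strictly_increasing)  # 5356 52 26 True
print(round(variance, 5), round(std, 6), round(cv, 8))
```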
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
1 | 1 | 1.0%
2 | 1 | 1.0%
77 | 1 | 1.0%
76 | 1 | 1.0%
75 | 1 | 1.0%
74 | 1 | 1.0%
73 | 1 | 1.0%
72 | 1 | 1.0%
71 | 1 | 1.0%
70 | 1 | 1.0%
Other values (93) | 93 | 90.3%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 1.0%
2 | 1 | 1.0%
3 | 1 | 1.0%
4 | 1 | 1.0%
5 | 1 | 1.0%
6 | 1 | 1.0%
7 | 1 | 1.0%
8 | 1 | 1.0%
9 | 1 | 1.0%
10 | 1 | 1.0%

Maximum 10 values
Value | Count | Frequency (%)
103 | 1 | 1.0%
102 | 1 | 1.0%
101 | 1 | 1.0%
100 | 1 | 1.0%
99 | 1 | 1.0%
98 | 1 | 1.0%
97 | 1 | 1.0%
96 | 1 | 1.0%
95 | 1 | 1.0%
94 | 1 | 1.0%

업소명 (business name)
Text

UNIQUE 

Distinct: 103
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 956.0 B

Length

Max length: 13
Median length: 10
Mean length: 5.9417476
Min length: 2

Characters and Unicode

Total characters: 612
Distinct characters: 229
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
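
As an illustration of those character properties, Python's standard `unicodedata` module reports the general category of each code point. The sketch below tallies categories for one address taken from this report's sample rows; the helper name `category_counts` is hypothetical, and the profiler's own implementation may differ:

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Tally the Unicode general category code of every character in text."""
    return Counter(unicodedata.category(ch) for ch in text)


address = "인천광역시 서구 가남로 401 (가정동)"
counts = category_counts(address)

# Lo = Other Letter (Hangul syllables), Zs = Space Separator,
# Nd = Decimal Number, Ps = Open Punctuation, Pe = Close Punctuation
print(counts)
```

The two-letter codes correspond to the category names used in the tables of this report, for example `Lo` for Other Letter and `Zs` for Space Separator.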

Unique

Unique: 103
Unique (%): 100.0%

Sample

1st row: 남도가든
2nd row: 미야스시
3rd row: 옛날황해도순대국
4th row: 평양옥
5th row: 가촌칡냉면
Value | Count | Frequency (%)
남원추어탕 | 2 | 1.7%
남도가든 | 1 | 0.9%
낙지세상 | 1 | 0.9%
돈스마마 | 1 | 0.9%
흥부왕족발 | 1 | 0.9%
황토식당 | 1 | 0.9%
화수분 | 1 | 0.9%
통돼지두루치기 | 1 | 0.9%
착한참치 | 1 | 0.9%
최태호의 | 1 | 0.9%
Other values (106) | 106 | 90.6%

Most occurring characters

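Value | Count | Frequency (%)
(not preserved) | 15 | 2.5%
(not preserved) | 15 | 2.5%
(not preserved) | 15 | 2.5%
(space) | 14 | 2.3%
(not preserved) | 12 | 2.0%
(not preserved) | 11 | 1.8%
(not preserved) | 10 | 1.6%
(not preserved) | 10 | 1.6%
(not preserved) | 9 | 1.5%
(not preserved) | 9 | 1.5%
Other values (219) | 492 | 80.4%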

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 590 | 96.4%
Space Separator | 14 | 2.3%
Open Punctuation | 2 | 0.3%
Close Punctuation | 2 | 0.3%
Lowercase Letter | 2 | 0.3%
Uppercase Letter | 1 | 0.2%
Decimal Number | 1 | 0.2%
0.2%

Most frequent character per category

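Other Letter
Value | Count | Frequency (%)
(not preserved) | 15 | 2.5%
(not preserved) | 15 | 2.5%
(not preserved) | 15 | 2.5%
(not preserved) | 12 | 2.0%
(not preserved) | 11 | 1.9%
(not preserved) | 10 | 1.7%
(not preserved) | 10 | 1.7%
(not preserved) | 9 | 1.5%
(not preserved) | 9 | 1.5%
(not preserved) | 8 | 1.4%
Other values (212) | 476 | 80.7%
Lowercase Letter
Value | Count | Frequency (%)
e | 1 | 50.0%
h | 1 | 50.0%
Space Separator
Value | Count | Frequency (%)
(space) | 14 | 100.0%
Open Punctuation
Value | Count | Frequency (%)
( | 2 | 100.0%
Close Punctuation
Value | Count | Frequency (%)
) | 2 | 100.0%
Uppercase Letter
Value | Count | Frequency (%)
T | 1 | 100.0%
Decimal Number
Value | Count | Frequency (%)
2 | 1 | 100.0%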

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 590 | 96.4%
Common | 19 | 3.1%
Latin | 3 | 0.5%

Most frequent character per script

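Hangul
Value | Count | Frequency (%)
(not preserved) | 15 | 2.5%
(not preserved) | 15 | 2.5%
(not preserved) | 15 | 2.5%
(not preserved) | 12 | 2.0%
(not preserved) | 11 | 1.9%
(not preserved) | 10 | 1.7%
(not preserved) | 10 | 1.7%
(not preserved) | 9 | 1.5%
(not preserved) | 9 | 1.5%
(not preserved) | 8 | 1.4%
Other values (212) | 476 | 80.7%
Common
Value | Count | Frequency (%)
(space) | 14 | 73.7%
( | 2 | 10.5%
) | 2 | 10.5%
2 | 1 | 5.3%
Latin
Value | Count | Frequency (%)
e | 1 | 33.3%
h | 1 | 33.3%
T | 1 | 33.3%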

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 590 | 96.4%
ASCII | 22 | 3.6%

Most frequent character per block

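Hangul
Value | Count | Frequency (%)
(not preserved) | 15 | 2.5%
(not preserved) | 15 | 2.5%
(not preserved) | 15 | 2.5%
(not preserved) | 12 | 2.0%
(not preserved) | 11 | 1.9%
(not preserved) | 10 | 1.7%
(not preserved) | 10 | 1.7%
(not preserved) | 9 | 1.5%
(not preserved) | 9 | 1.5%
(not preserved) | 8 | 1.4%
Other values (212) | 476 | 80.7%
ASCII
Value | Count | Frequency (%)
(space) | 14 | 63.6%
( | 2 | 9.1%
) | 2 | 9.1%
e | 1 | 4.5%
h | 1 | 4.5%
T | 1 | 4.5%
2 | 1 | 4.5%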

소재지 (location)
Text

Distinct: 102
Distinct (%): 99.0%
Missing: 0
Missing (%): 0.0%
Memory size: 956.0 B

Length

Max length: 44
Median length: 39
Mean length: 27.524272
Min length: 21

Characters and Unicode

Total characters: 2835
Distinct characters: 128
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2

Unique

Unique: 101
Unique (%): 98.1%

Sample

1st row: 인천광역시 서구 가남로 401 (가정동)
2nd row: 인천광역시 서구 염곡로498번안길 20, 104호 (가정동, 제이에스프라자)
3rd row: 인천광역시 서구 원창로 229 (가정동)
4th row: 인천광역시 서구 석곶로 23, 1층 (가정동)
5th row: 인천광역시 서구 원적로 79 (가좌동)
Value | Count | Frequency (%)
인천광역시 | 103 | 17.5%
서구 | 103 | 17.5%
석남동 | 27 | 4.6%
가좌동 | 26 | 4.4%
1층 | 19 | 3.2%
심곡동 | 14 | 2.4%
신현동 | 7 | 1.2%
가정로 | 7 | 1.2%
2층 | 7 | 1.2%
심곡로 | 6 | 1.0%
Other values (190) | 268 | 45.7%

Most occurring characters

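Value | Count | Frequency (%)
(space) | 484 | 17.1%
1 | 125 | 4.4%
(not preserved) | 110 | 3.9%
(not preserved) | 109 | 3.8%
(not preserved) | 107 | 3.8%
(not preserved) | 105 | 3.7%
(not preserved) | 104 | 3.7%
(not preserved) | 104 | 3.7%
(not preserved) | 103 | 3.6%
) | 103 | 3.6%
Other values (118) | 1381 | 48.7%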

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1585 | 55.9%
Space Separator | 484 | 17.1%
Decimal Number | 461 | 16.3%
Close Punctuation | 103 | 3.6%
Open Punctuation | 102 | 3.6%
Other Punctuation | 67 | 2.4%
Dash Punctuation | 19 | 0.7%
Math Symbol | 12 | 0.4%
Uppercase Letter | 2 | 0.1%

Most frequent character per category

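Other Letter
Value | Count | Frequency (%)
(not preserved) | 110 | 6.9%
(not preserved) | 109 | 6.9%
(not preserved) | 107 | 6.8%
(not preserved) | 105 | 6.6%
(not preserved) | 104 | 6.6%
(not preserved) | 104 | 6.6%
(not preserved) | 103 | 6.5%
(not preserved) | 103 | 6.5%
(not preserved) | 103 | 6.5%
(not preserved) | 47 | 3.0%
Other values (100) | 590 | 37.2%
Decimal Number
Value | Count | Frequency (%)
1 | 125 | 27.1%
2 | 83 | 18.0%
0 | 52 | 11.3%
4 | 37 | 8.0%
3 | 36 | 7.8%
5 | 31 | 6.7%
8 | 26 | 5.6%
9 | 25 | 5.4%
7 | 24 | 5.2%
6 | 22 | 4.8%
Uppercase Letter
Value | Count | Frequency (%)
B | 1 | 50.0%
A | 1 | 50.0%
Space Separator
Value | Count | Frequency (%)
(space) | 484 | 100.0%
Close Punctuation
Value | Count | Frequency (%)
) | 103 | 100.0%
Open Punctuation
Value | Count | Frequency (%)
( | 102 | 100.0%
Other Punctuation
Value | Count | Frequency (%)
, | 67 | 100.0%
Dash Punctuation
Value | Count | Frequency (%)
- | 19 | 100.0%
Math Symbol
Value | Count | Frequency (%)
~ | 12 | 100.0%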

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1585 | 55.9%
Common | 1248 | 44.0%
Latin | 2 | 0.1%

Most frequent character per script

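Hangul
Value | Count | Frequency (%)
(not preserved) | 110 | 6.9%
(not preserved) | 109 | 6.9%
(not preserved) | 107 | 6.8%
(not preserved) | 105 | 6.6%
(not preserved) | 104 | 6.6%
(not preserved) | 104 | 6.6%
(not preserved) | 103 | 6.5%
(not preserved) | 103 | 6.5%
(not preserved) | 103 | 6.5%
(not preserved) | 47 | 3.0%
Other values (100) | 590 | 37.2%
Common
Value | Count | Frequency (%)
(space) | 484 | 38.8%
1 | 125 | 10.0%
) | 103 | 8.3%
( | 102 | 8.2%
2 | 83 | 6.7%
, | 67 | 5.4%
0 | 52 | 4.2%
4 | 37 | 3.0%
3 | 36 | 2.9%
5 | 31 | 2.5%
Other values (6) | 128 | 10.3%
Latin
Value | Count | Frequency (%)
B | 1 | 50.0%
A | 1 | 50.0%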

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1585 | 55.9%
ASCII | 1250 | 44.1%

Most frequent character per block

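ASCII
Value | Count | Frequency (%)
(space) | 484 | 38.7%
1 | 125 | 10.0%
) | 103 | 8.2%
( | 102 | 8.2%
2 | 83 | 6.6%
, | 67 | 5.4%
0 | 52 | 4.2%
4 | 37 | 3.0%
3 | 36 | 2.9%
5 | 31 | 2.5%
Other values (8) | 130 | 10.4%
Hangul
Value | Count | Frequency (%)
(not preserved) | 110 | 6.9%
(not preserved) | 109 | 6.9%
(not preserved) | 107 | 6.8%
(not preserved) | 105 | 6.6%
(not preserved) | 104 | 6.6%
(not preserved) | 104 | 6.6%
(not preserved) | 103 | 6.5%
(not preserved) | 103 | 6.5%
(not preserved) | 103 | 6.5%
(not preserved) | 47 | 3.0%
Other values (100) | 590 | 37.2%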

전화번호 (phone number)
Text

MISSING 

Distinct: 86
Distinct (%): 98.9%
Missing: 16
Missing (%): 15.5%
Memory size: 956.0 B

Length

Max length: 14
Median length: 12
Mean length: 12.045977
Min length: 12

Characters and Unicode

Total characters: 1048
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 85
Unique (%): 97.7%
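
Distinct and Unique measure different things here: Distinct is the number of different values, while Unique is the number of values that occur exactly once. The percentages appear to be taken over the 87 non-missing rows (103 − 16), which is an inference from the numbers rather than something the report states. A small sketch with a hypothetical miniature column built from phone numbers shown in this report:

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (distinct, unique) counts, ignoring missing (None) entries."""
    counts = Counter(v for v in values if v is not None)
    distinct = len(counts)                                # different values
    unique = sum(1 for c in counts.values() if c == 1)    # seen exactly once
    return distinct, unique


# Hypothetical five-row column; None stands in for a missing phone number.
phones = ["032-581-7667", "032-581-6422", "032-581-6422", None, "032-582-5959"]
print(distinct_and_unique(phones))  # (3, 2)

# Reported figures, relative to the assumed non-missing row count.
non_missing = 103 - 16
distinct_pct = round(100 * 86 / non_missing, 1)
unique_pct = round(100 * 85 / non_missing, 1)
print(distinct_pct, unique_pct)  # 98.9 97.7
```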

Sample

1st row: 032-581-7667
2nd row: 032-581-6422
3rd row: 032-582-5959
4th row: 032-583-7664
5th row: 050-8389-8844
Value | Count | Frequency (%)
032-581-6422 | 2 | 2.3%
032-572-9115 | 1 | 1.1%
032-573-2821 | 1 | 1.1%
032-575-5005 | 1 | 1.1%
032-578-4747 | 1 | 1.1%
0503-5263-0966 | 1 | 1.1%
032-581-7799 | 1 | 1.1%
032-583-9933 | 1 | 1.1%
032-584-3331 | 1 | 1.1%
032-562-4343 | 1 | 1.1%
Other values (76) | 76 | 87.4%

Most occurring characters

ValueCountFrequency (%)
- 174
16.6%
5 139
13.3%
3 133
12.7%
0 129
12.3%
2 125
11.9%
6 78
7.4%
7 75
7.2%
8 68
 
6.5%
9 49
 
4.7%
1 47
 
4.5%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 874 | 83.4%
Dash Punctuation | 174 | 16.6%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
5 | 139 | 15.9%
3 | 133 | 15.2%
0 | 129 | 14.8%
2 | 125 | 14.3%
6 | 78 | 8.9%
7 | 75 | 8.6%
8 | 68 | 7.8%
9 | 49 | 5.6%
1 | 47 | 5.4%
4 | 31 | 3.5%
Dash Punctuation
Value | Count | Frequency (%)
- | 174 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 1048 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 174 | 16.6%
5 | 139 | 13.3%
3 | 133 | 12.7%
0 | 129 | 12.3%
2 | 125 | 11.9%
6 | 78 | 7.4%
7 | 75 | 7.2%
8 | 68 | 6.5%
9 | 49 | 4.7%
1 | 47 | 4.5%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 1048 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 174 | 16.6%
5 | 139 | 13.3%
3 | 133 | 12.7%
0 | 129 | 12.3%
2 | 125 | 11.9%
6 | 78 | 7.4%
7 | 75 | 7.2%
8 | 68 | 6.5%
9 | 49 | 4.7%
1 | 47 | 4.5%

대표메뉴 (signature menu)
Text

Distinct: 77
Distinct (%): 74.8%
Missing: 0
Missing (%): 0.0%
Memory size: 956.0 B

Length

Max length: 12
Median length: 11
Mean length: 4.5242718
Min length: 2

Characters and Unicode

Total characters: 466
Distinct characters: 139
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2

Unique

Unique: 61
Unique (%): 59.2%

Sample

1st row: 추어탕
2nd row: 초밥
3rd row: 순대국
4th row: 숭어매운탕
5th row: 냉면+설렁탕
Value | Count | Frequency (%)
순대국 | 5 | 4.8%
추어탕 | 4 | 3.8%
삼겹살 | 4 | 3.8%
칼국수 | 3 | 2.9%
돼지갈비 | 3 | 2.9%
돈가스 | 3 | 2.9%
순두부찌개 | 2 | 1.9%
초밥 | 2 | 1.9%
양꼬치 | 2 | 1.9%
우렁쌈밥정식 | 2 | 1.9%
Other values (69) | 75 | 71.4%

Most occurring characters

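Value | Count | Frequency (%)
(not preserved) | 20 | 4.3%
+ | 18 | 3.9%
(not preserved) | 17 | 3.6%
(not preserved) | 17 | 3.6%
(not preserved) | 15 | 3.2%
(not preserved) | 15 | 3.2%
(not preserved) | 10 | 2.1%
(not preserved) | 10 | 2.1%
(not preserved) | 10 | 2.1%
(not preserved) | 9 | 1.9%
Other values (129) | 325 | 69.7%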

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 445 | 95.5%
Math Symbol | 18 | 3.9%
Space Separator | 2 | 0.4%
Other Punctuation | 1 | 0.2%

Most frequent character per category

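Other Letter
Value | Count | Frequency (%)
(not preserved) | 20 | 4.5%
(not preserved) | 17 | 3.8%
(not preserved) | 17 | 3.8%
(not preserved) | 15 | 3.4%
(not preserved) | 15 | 3.4%
(not preserved) | 10 | 2.2%
(not preserved) | 10 | 2.2%
(not preserved) | 10 | 2.2%
(not preserved) | 9 | 2.0%
(not preserved) | 8 | 1.8%
Other values (126) | 314 | 70.6%
Math Symbol
Value | Count | Frequency (%)
+ | 18 | 100.0%
Space Separator
Value | Count | Frequency (%)
(space) | 2 | 100.0%
Other Punctuation
Value | Count | Frequency (%)
, | 1 | 100.0%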

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 445 | 95.5%
Common | 21 | 4.5%

Most frequent character per script

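Hangul
Value | Count | Frequency (%)
(not preserved) | 20 | 4.5%
(not preserved) | 17 | 3.8%
(not preserved) | 17 | 3.8%
(not preserved) | 15 | 3.4%
(not preserved) | 15 | 3.4%
(not preserved) | 10 | 2.2%
(not preserved) | 10 | 2.2%
(not preserved) | 10 | 2.2%
(not preserved) | 9 | 2.0%
(not preserved) | 8 | 1.8%
Other values (126) | 314 | 70.6%
Common
Value | Count | Frequency (%)
+ | 18 | 85.7%
(space) | 2 | 9.5%
, | 1 | 4.8%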

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 445 | 95.5%
ASCII | 21 | 4.5%

Most frequent character per block

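Hangul
Value | Count | Frequency (%)
(not preserved) | 20 | 4.5%
(not preserved) | 17 | 3.8%
(not preserved) | 17 | 3.8%
(not preserved) | 15 | 3.4%
(not preserved) | 15 | 3.4%
(not preserved) | 10 | 2.2%
(not preserved) | 10 | 2.2%
(not preserved) | 10 | 2.2%
(not preserved) | 9 | 2.0%
(not preserved) | 8 | 1.8%
Other values (126) | 314 | 70.6%
ASCII
Value | Count | Frequency (%)
+ | 18 | 85.7%
(space) | 2 | 9.5%
, | 1 | 4.8%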

업태 (business type)
Categorical

IMBALANCE 

Distinct: 5
Distinct (%): 4.9%
Missing: 0
Missing (%): 0.0%
Memory size: 956.0 B
한식 (Korean): 91
양식 (Western): 5
일식 (Japanese): 3
분식 (snack food): 2
중식 (Chinese): 2

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 한식
2nd row: 한식
3rd row: 한식
4th row: 한식
5th row: 한식

Common Values

Value | Count | Frequency (%)
한식 | 91 | 88.3%
양식 | 5 | 4.9%
일식 | 3 | 2.9%
분식 | 2 | 1.9%
중식 | 2 | 1.9%
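
The 68.2% imbalance figure in the alert for this variable is consistent with a normalized-entropy score: one minus the Shannon entropy of the class frequencies divided by the maximum possible entropy for that number of classes. The sketch below is an assumption about how such a score can be computed, not a statement of the profiler's code, and the function name `imbalance_score` is hypothetical:

```python
import math


def imbalance_score(counts):
    """Return 1 - H(p) / log2(k): 0 for a uniform split, near 1 when one class dominates."""
    k = len(counts)
    if k < 2:
        return 0.0  # a single class has no balance to measure
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)


# Class counts from the table above: 한식, 양식, 일식, 분식, 중식.
counts = [91, 5, 3, 2, 2]
score = imbalance_score(counts)
print(round(100 * score, 1))  # 68.2
```

A perfectly even split across the five classes would score 0 under this definition.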

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
한식 | 91 | 88.3%
양식 | 5 | 4.9%
일식 | 3 | 2.9%
분식 | 2 | 1.9%
중식 | 2 | 1.9%

데이터기준일자 (data reference date)
Date

CONSTANT 

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 956.0 B
Minimum: 2022-09-06 00:00:00
Maximum: 2022-09-06 00:00:00
Histogram with fixed size bins (bins=1)

Interactions


Correlations

 | 연번 | 전화번호 | 대표메뉴 | 업태
연번 | 1.000 | 0.949 | 0.675 | 0.000
전화번호 | 0.949 | 1.000 | 0.997 | 1.000
대표메뉴 | 0.675 | 0.997 | 1.000 | 0.880
업태 | 0.000 | 1.000 | 0.880 | 1.000

 | 연번 | 업태
연번 | 1.000 | 0.000
업태 | 0.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
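
The per-column nullity count behind such a chart can be sketched with plain Python. The rows below are three records from this report's sample tables, reduced to three columns, with `None` standing in for `<NA>`; the function name `missing_by_column` is hypothetical:

```python
def missing_by_column(rows):
    """Count missing (None) entries per column across a list of row dicts."""
    counts = {}
    for row in rows:
        for column, value in row.items():
            counts[column] = counts.get(column, 0) + (1 if value is None else 0)
    return counts


rows = [
    {"연번": 1, "업소명": "남도가든", "전화번호": "032-581-7667"},
    {"연번": 2, "업소명": "미야스시", "전화번호": None},
    {"연번": 98, "업소명": "장수골 남원추어탕", "전화번호": None},
]

print(missing_by_column(rows))  # {'연번': 0, '업소명': 0, '전화번호': 2}
```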

Sample

First rows
 | 연번 | 업소명 | 소재지 | 전화번호 | 대표메뉴 | 업태 | 데이터기준일자
0 | 1 | 남도가든 | 인천광역시 서구 가남로 401 (가정동) | 032-581-7667 | 추어탕 | 한식 | 2022-09-06
1 | 2 | 미야스시 | 인천광역시 서구 염곡로498번안길 20, 104호 (가정동, 제이에스프라자) | <NA> | 초밥 | 한식 | 2022-09-06
2 | 3 | 옛날황해도순대국 | 인천광역시 서구 원창로 229 (가정동) | 032-581-6422 | 순대국 | 한식 | 2022-09-06
3 | 4 | 평양옥 | 인천광역시 서구 석곶로 23, 1층 (가정동) | 032-582-5959 | 숭어매운탕 | 한식 | 2022-09-06
4 | 5 | 가촌칡냉면 | 인천광역시 서구 원적로 79 (가좌동) | 032-583-7664 | 냉면+설렁탕 | 한식 | 2022-09-06
5 | 6 | 강선생의 진짜로 명태촌 | 인천광역시 서구 건지로 405-1, 1층 (가좌동) | 050-8389-8844 | 명태 | 한식 | 2022-09-06
6 | 7 | 광어한마리 | 인천광역시 서구 신진말로 26, 202호 (가좌동, 진주프라자) | 032-573-8959 | 활어회 | 한식 | 2022-09-06
7 | 8 | 끼리막창 | 인천광역시 서구 원적로100번길 14 (가좌동) | 032-575-4163 | 막창 | 한식 | 2022-09-06
8 | 9 | 노다지삼겹살 | 인천광역시 서구 장고개로280번길 8 (가좌동) | 032-583-0068 | 삼겹살 | 한식 | 2022-09-06
9 | 10 | 다열 금강산추어탕 | 인천광역시 서구 장고개로 319, 9동 6~8호 (가좌동, 진주아파트상가동) | 032-573-9009 | 추어탕 | 한식 | 2022-09-06

Last rows
 | 연번 | 업소명 | 소재지 | 전화번호 | 대표메뉴 | 업태 | 데이터기준일자
93 | 94 | 본터랍스타 | 인천광역시 서구 심곡로 81 (심곡동) | 032-567-6151 | 랍스터 | 양식 | 2022-09-06
94 | 95 | 교동찹쌀순대 | 인천광역시 서구 심곡로55번길 5, A동 101~102호 (심곡동) | 032-563-3838 | 순대국 | 한식 | 2022-09-06
95 | 96 | 가좌동진천토종순대 | 인천광역시 서구 담지로104번길 17-1, 1층 (연희동) | 032-565-0798 | 순대국 | 한식 | 2022-09-06
96 | 97 | 유가네왕족발 | 인천광역시 서구 연희로 10, 1층 (연희동) | 032-563-3378 | 족발+뷔페식점심 | 한식 | 2022-09-06
97 | 98 | 장수골 남원추어탕 | 인천광역시 서구 담지로104번길 21, 1층 (연희동) | <NA> | 추어탕 | 한식 | 2022-09-06
98 | 99 | 진미아구탕 | 인천광역시 서구 원창로 238 (가정동) | <NA> | 아구찜 | 한식 | 2022-09-06
99 | 100 | 쭈꾸쭈꾸미 | 인천광역시 서구 청라에메랄드로149번길 4-7, 1층 (연희동) | 032-581-0903 | 쭈꾸미볶음 | 한식 | 2022-09-06
100 | 101 | 청라쭈꾸미마을 | 인천광역시 서구 청라라임로16번길 20, 1층 (연희동) | 032-565-9411 | 쭈꾸미볶음 | 한식 | 2022-09-06
101 | 102 | 정가며느리볼테기 | 인천광역시 서구 검단로459번길 7-3, 110~111호 (왕길동, 인천아트빌) | 032-564-6664 | 볼테기탕+찜 | 한식 | 2022-09-06
102 | 103 | 태백산 | 인천광역시 서구 완정로 151, 2~3층 (왕길동) | 032-567-3392 | 한우갈비살+꽃등심 | 한식 | 2022-09-06