Overview

Dataset statistics

Number of variables5
Number of observations444
Missing cells220
Missing cells (%)9.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory17.5 KiB
Average record size in memory40.3 B

Variable types

Categorical2
Text3

Dataset

Description인천광역시 중구 관내에 위치한 카페 및 커피숍 현황에 대한 데이터 입니다. 파일명 인천광역시_중구_카페 및 커피숍 현황 파일내용 사업장명, 소재지지번주소, 도로명주소 등
URLhttps://www.data.go.kr/data/15086876/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
업종명 is highly imbalanced (61.8%)Imbalance
소재지전화 has 220 (49.5%) missing valuesMissing

Reproduction

Analysis started2023-12-12 02:52:15.277266
Analysis finished2023-12-12 02:52:15.795954
Duration0.52 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size3.6 KiB
휴게음식점
411 
일반음식점
 
33

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반음식점
2nd row일반음식점
3rd row일반음식점
4th row일반음식점
5th row일반음식점

Common Values

ValueCountFrequency (%)
휴게음식점 411
92.6%
일반음식점 33
 
7.4%

Length

2023-12-12T11:52:15.848650image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:52:15.937916image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
휴게음식점 411
92.6%
일반음식점 33
 
7.4%
Distinct441
Distinct (%)99.3%
Missing0
Missing (%)0.0%
Memory size3.6 KiB
2023-12-12T11:52:16.185570image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length44
Median length25
Mean length9.277027
Min length1

Characters and Unicode

Total characters4119
Distinct characters480
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique439 ?
Unique (%)98.9%

Sample

1st row좋은예감
2nd row까페 꼰띠고 인천차이나타운점
3rd row
4th row카페스윗
5th row서니구락부
ValueCountFrequency (%)
카페 17
 
2.3%
영종하늘도시점 16
 
2.1%
커피 11
 
1.5%
메가엠지씨커피 9
 
1.2%
컴포즈커피 8
 
1.1%
빽다방 7
 
0.9%
cafe 7
 
0.9%
이디야 7
 
0.9%
coffee 7
 
0.9%
인천공항 7
 
0.9%
Other values (555) 650
87.1%
2023-12-12T11:52:16.612973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
302
 
7.3%
156
 
3.8%
97
 
2.4%
97
 
2.4%
96
 
2.3%
96
 
2.3%
93
 
2.3%
80
 
1.9%
( 76
 
1.8%
) 76
 
1.8%
Other values (470) 2950
71.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2996
72.7%
Lowercase Letter 328
 
8.0%
Space Separator 302
 
7.3%
Uppercase Letter 240
 
5.8%
Open Punctuation 76
 
1.8%
Close Punctuation 76
 
1.8%
Decimal Number 75
 
1.8%
Other Punctuation 22
 
0.5%
Dash Punctuation 2
 
< 0.1%
Connector Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
156
 
5.2%
97
 
3.2%
97
 
3.2%
96
 
3.2%
96
 
3.2%
93
 
3.1%
80
 
2.7%
75
 
2.5%
65
 
2.2%
61
 
2.0%
Other values (400) 2080
69.4%
Uppercase Letter
ValueCountFrequency (%)
E 25
 
10.4%
A 25
 
10.4%
O 22
 
9.2%
T 20
 
8.3%
F 17
 
7.1%
C 15
 
6.2%
S 12
 
5.0%
M 12
 
5.0%
L 11
 
4.6%
H 10
 
4.2%
Other values (15) 71
29.6%
Lowercase Letter
ValueCountFrequency (%)
e 60
18.3%
a 36
11.0%
f 30
 
9.1%
o 24
 
7.3%
c 23
 
7.0%
n 19
 
5.8%
s 17
 
5.2%
i 15
 
4.6%
y 12
 
3.7%
t 11
 
3.4%
Other values (14) 81
24.7%
Decimal Number
ValueCountFrequency (%)
1 19
25.3%
2 19
25.3%
3 11
14.7%
5 8
10.7%
0 6
 
8.0%
6 5
 
6.7%
9 4
 
5.3%
4 3
 
4.0%
Other Punctuation
ValueCountFrequency (%)
, 6
27.3%
& 5
22.7%
. 4
18.2%
' 2
 
9.1%
/ 2
 
9.1%
# 2
 
9.1%
? 1
 
4.5%
Space Separator
ValueCountFrequency (%)
302
100.0%
Open Punctuation
ValueCountFrequency (%)
( 76
100.0%
Close Punctuation
ValueCountFrequency (%)
) 76
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2996
72.7%
Latin 569
 
13.8%
Common 554
 
13.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
156
 
5.2%
97
 
3.2%
97
 
3.2%
96
 
3.2%
96
 
3.2%
93
 
3.1%
80
 
2.7%
75
 
2.5%
65
 
2.2%
61
 
2.0%
Other values (400) 2080
69.4%
Latin
ValueCountFrequency (%)
e 60
 
10.5%
a 36
 
6.3%
f 30
 
5.3%
E 25
 
4.4%
A 25
 
4.4%
o 24
 
4.2%
c 23
 
4.0%
O 22
 
3.9%
T 20
 
3.5%
n 19
 
3.3%
Other values (40) 285
50.1%
Common
ValueCountFrequency (%)
302
54.5%
( 76
 
13.7%
) 76
 
13.7%
1 19
 
3.4%
2 19
 
3.4%
3 11
 
2.0%
5 8
 
1.4%
, 6
 
1.1%
0 6
 
1.1%
6 5
 
0.9%
Other values (10) 26
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2996
72.7%
ASCII 1122
 
27.2%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
302
26.9%
( 76
 
6.8%
) 76
 
6.8%
e 60
 
5.3%
a 36
 
3.2%
f 30
 
2.7%
E 25
 
2.2%
A 25
 
2.2%
o 24
 
2.1%
c 23
 
2.0%
Other values (59) 445
39.7%
Hangul
ValueCountFrequency (%)
156
 
5.2%
97
 
3.2%
97
 
3.2%
96
 
3.2%
96
 
3.2%
93
 
3.1%
80
 
2.7%
75
 
2.5%
65
 
2.2%
61
 
2.0%
Other values (400) 2080
69.4%
Number Forms
ValueCountFrequency (%)
1
100.0%
Distinct430
Distinct (%)96.8%
Missing0
Missing (%)0.0%
Memory size3.6 KiB
2023-12-12T11:52:16.918758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length60
Median length52
Mean length35.475225
Min length19

Characters and Unicode

Total characters15751
Distinct characters297
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique423 ?
Unique (%)95.3%

Sample

1st row인천광역시 중구 제물량로206번길 33 (관동1가, 1층)
2nd row인천광역시 중구 제물량로 262-20, 2,3층 (선린동)
3rd row인천광역시 중구 차이나타운로51번길 36 (관동1가, 2층)
4th row인천광역시 중구 인중로146번길 28 (사동, 1층)
5th row인천광역시 중구 신포로23번길 80 (중앙동2가, 1층)
ValueCountFrequency (%)
인천광역시 444
 
14.3%
중구 444
 
14.3%
1층 248
 
8.0%
운서동 126
 
4.0%
중산동 73
 
2.3%
2층 38
 
1.2%
일부 31
 
1.0%
운남동 27
 
0.9%
공항로 24
 
0.8%
272 22
 
0.7%
Other values (718) 1636
52.6%
2023-12-12T11:52:17.360677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2674
 
17.0%
1 881
 
5.6%
557
 
3.5%
, 529
 
3.4%
526
 
3.3%
523
 
3.3%
500
 
3.2%
482
 
3.1%
455
 
2.9%
454
 
2.9%
Other values (287) 8170
51.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8737
55.5%
Decimal Number 2732
 
17.3%
Space Separator 2674
 
17.0%
Other Punctuation 531
 
3.4%
Close Punctuation 450
 
2.9%
Open Punctuation 450
 
2.9%
Dash Punctuation 117
 
0.7%
Uppercase Letter 48
 
0.3%
Math Symbol 12
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
557
 
6.4%
526
 
6.0%
523
 
6.0%
500
 
5.7%
482
 
5.5%
455
 
5.2%
454
 
5.2%
447
 
5.1%
425
 
4.9%
403
 
4.6%
Other values (252) 3965
45.4%
Uppercase Letter
ValueCountFrequency (%)
B 8
16.7%
A 6
12.5%
C 5
10.4%
K 4
8.3%
P 4
8.3%
L 3
 
6.2%
I 3
 
6.2%
J 2
 
4.2%
H 2
 
4.2%
F 2
 
4.2%
Other values (8) 9
18.8%
Decimal Number
ValueCountFrequency (%)
1 881
32.2%
2 454
16.6%
3 264
 
9.7%
0 252
 
9.2%
4 204
 
7.5%
6 170
 
6.2%
7 161
 
5.9%
5 131
 
4.8%
9 116
 
4.2%
8 99
 
3.6%
Other Punctuation
ValueCountFrequency (%)
, 529
99.6%
/ 2
 
0.4%
Space Separator
ValueCountFrequency (%)
2674
100.0%
Close Punctuation
ValueCountFrequency (%)
) 450
100.0%
Open Punctuation
ValueCountFrequency (%)
( 450
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 117
100.0%
Math Symbol
ValueCountFrequency (%)
~ 12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8737
55.5%
Common 6966
44.2%
Latin 48
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
557
 
6.4%
526
 
6.0%
523
 
6.0%
500
 
5.7%
482
 
5.5%
455
 
5.2%
454
 
5.2%
447
 
5.1%
425
 
4.9%
403
 
4.6%
Other values (252) 3965
45.4%
Latin
ValueCountFrequency (%)
B 8
16.7%
A 6
12.5%
C 5
10.4%
K 4
8.3%
P 4
8.3%
L 3
 
6.2%
I 3
 
6.2%
J 2
 
4.2%
H 2
 
4.2%
F 2
 
4.2%
Other values (8) 9
18.8%
Common
ValueCountFrequency (%)
2674
38.4%
1 881
 
12.6%
, 529
 
7.6%
2 454
 
6.5%
) 450
 
6.5%
( 450
 
6.5%
3 264
 
3.8%
0 252
 
3.6%
4 204
 
2.9%
6 170
 
2.4%
Other values (7) 638
 
9.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8737
55.5%
ASCII 7014
44.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2674
38.1%
1 881
 
12.6%
, 529
 
7.5%
2 454
 
6.5%
) 450
 
6.4%
( 450
 
6.4%
3 264
 
3.8%
0 252
 
3.6%
4 204
 
2.9%
6 170
 
2.4%
Other values (25) 686
 
9.8%
Hangul
ValueCountFrequency (%)
557
 
6.4%
526
 
6.0%
523
 
6.0%
500
 
5.7%
482
 
5.5%
455
 
5.2%
454
 
5.2%
447
 
5.1%
425
 
4.9%
403
 
4.6%
Other values (252) 3965
45.4%

소재지전화
Text

MISSING 

Distinct215
Distinct (%)96.0%
Missing220
Missing (%)49.5%
Memory size3.6 KiB
2023-12-12T11:52:17.614957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length14
Mean length13.991071
Min length13

Characters and Unicode

Total characters3134
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique209 ?
Unique (%)93.3%

Sample

1st row032 -766 -5627
2nd row032 -761 -1250
3rd row070 -8116-4572
4th row032 -772 -1104
5th row032 -773 -3632
ValueCountFrequency (%)
032 178
30.8%
751 22
 
3.8%
746 20
 
3.5%
752 19
 
3.3%
070 18
 
3.1%
02 13
 
2.3%
773 8
 
1.4%
777 8
 
1.4%
747 6
 
1.0%
772 6
 
1.0%
Other values (240) 279
48.4%
2023-12-12T11:52:17.971250image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 447
14.3%
428
13.7%
0 406
13.0%
7 362
11.6%
2 338
10.8%
3 297
9.5%
5 176
 
5.6%
1 163
 
5.2%
8 154
 
4.9%
4 142
 
4.5%
Other values (2) 221
7.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2259
72.1%
Dash Punctuation 447
 
14.3%
Space Separator 428
 
13.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 406
18.0%
7 362
16.0%
2 338
15.0%
3 297
13.1%
5 176
7.8%
1 163
7.2%
8 154
 
6.8%
4 142
 
6.3%
6 132
 
5.8%
9 89
 
3.9%
Dash Punctuation
ValueCountFrequency (%)
- 447
100.0%
Space Separator
ValueCountFrequency (%)
428
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3134
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 447
14.3%
428
13.7%
0 406
13.0%
7 362
11.6%
2 338
10.8%
3 297
9.5%
5 176
 
5.6%
1 163
 
5.2%
8 154
 
4.9%
4 142
 
4.5%
Other values (2) 221
7.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3134
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 447
14.3%
428
13.7%
0 406
13.0%
7 362
11.6%
2 338
10.8%
3 297
9.5%
5 176
 
5.6%
1 163
 
5.2%
8 154
 
4.9%
4 142
 
4.5%
Other values (2) 221
7.1%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size3.6 KiB
2023-07-06
444 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-07-06
2nd row2023-07-06
3rd row2023-07-06
4th row2023-07-06
5th row2023-07-06

Common Values

ValueCountFrequency (%)
2023-07-06 444
100.0%

Length

2023-12-12T11:52:18.095687image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:52:18.198141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-07-06 444
100.0%

Missing values

2023-12-12T11:52:15.674874image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T11:52:15.762245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종명업소명소재지소재지전화데이터기준일자
0일반음식점좋은예감인천광역시 중구 제물량로206번길 33 (관동1가, 1층)<NA>2023-07-06
1일반음식점까페 꼰띠고 인천차이나타운점인천광역시 중구 제물량로 262-20, 2,3층 (선린동)032 -766 -56272023-07-06
2일반음식점인천광역시 중구 차이나타운로51번길 36 (관동1가, 2층)<NA>2023-07-06
3일반음식점카페스윗인천광역시 중구 인중로146번길 28 (사동, 1층)<NA>2023-07-06
4일반음식점서니구락부인천광역시 중구 신포로23번길 80 (중앙동2가, 1층)032 -761 -12502023-07-06
5일반음식점노유민코페인천광역시 중구 차이나타운로 15 (송월동3가, 1층)070 -8116-45722023-07-06
6일반음식점팥누리인천광역시 중구 신포로27번길 44 (관동2가, 1층)032 -772 -11042023-07-06
7일반음식점카페 연화당인천광역시 중구 신포로27번길 97 (중앙동1가, 1층)<NA>2023-07-06
8일반음식점잭슨빌인천광역시 중구 제물량로166번길 1-8 (신생동, 1층)<NA>2023-07-06
9일반음식점작은오븐인천광역시 중구 신포로31번길 11 (관동3가, 1층,2층)032 -773 -36322023-07-06
업종명업소명소재지소재지전화데이터기준일자
434휴게음식점담쟁이넝쿨인천광역시 중구 자유공원남로 12 (송학동1가, 9번지 3층)032 - 772-01542023-07-06
435휴게음식점까로치아인천광역시 중구 공항로 272 (운서동, 화물터미널C 운송대리점 1층)032 -744 -38552023-07-06
436휴게음식점인천브릿지 커피인천광역시 중구 인천대교고속도로 3, 1층 (운남동, 인천대교기념관)032 -751 -94222023-07-06
437휴게음식점카페오라인천광역시 중구 용유서로 380 (을왕동, 지하1층,지상1층)032 -752 -08882023-07-06
438휴게음식점카페 잔피인천광역시 중구 송학로 13-1, 2층 (송학동2가)032 -567 -92552023-07-06
439휴게음식점스타벅스인천공항랜드점인천광역시 중구 공항로 272, 일반동 지하1층 (운서동, 인천국제공항여객터미널)032 -743 -82572023-07-06
440휴게음식점동인천커피숍인천광역시 중구 큰우물로 29 (인현동, 2층)032- 762-89222023-07-06
441휴게음식점커피올레인천광역시 중구 연안부두로 16, 227호 (항동7가, 해양센타)032 - 882-58582023-07-06
442휴게음식점카페미투인천광역시 중구 월미문화로 39, 1,2~옥상층 (북성동1가)032 - 772-71312023-07-06
443휴게음식점역마차커피숍인천광역시 중구 참외전로 131-6 (인현동)032 - 772-77262023-07-06