Overview

Dataset statistics

Number of variables: 5
Number of observations: 134
Missing cells: 86
Missing cells (%): 12.8%
Duplicate rows: 1
Duplicate rows (%): 0.7%
Total size in memory: 5.4 KiB
Average record size in memory: 41.0 B
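
The two percentages above follow from the counts. A minimal sketch of that arithmetic, assuming the missing-cell share is taken over all cells (observations × variables) and the duplicate share over the number of observations; the variable names are illustrative only:

```python
# Counts copied from the dataset statistics above.
n_variables = 5
n_observations = 134
missing_cells = 86
duplicate_rows = 1

# Assumption: the missing-cell share is computed over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = round(100 * missing_cells / total_cells, 1)

# Assumption: the duplicate share is computed over the number of rows.
duplicate_pct = round(100 * duplicate_rows / n_observations, 1)

print(f"Missing cells (%): {missing_pct}%")
print(f"Duplicate rows (%): {duplicate_pct}%")
```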

Variable types

Categorical: 1
Text: 3
DateTime: 1

Dataset

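Description: Information on tourist accommodation establishments in Jung-gu, Incheon Metropolitan City. File name: 인천광역시 중구 관광숙박업소 현황 (Status of tourist accommodation establishments in Jung-gu, Incheon). Contents: business name, location, telephone number, etc.
URL: https://www.data.go.kr/data/15028007/fileData.do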

Alerts

업종명 has constant value "관광숙박업" (Constant)
데이터기준일 has constant value "2023-07-20" (Constant)
Dataset has 1 (0.7%) duplicate rows (Duplicates)
소재지전화 has 86 (64.2%) missing values (Missing)
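
Alerts of this kind can be approximated with simple per-column checks. The sketch below is not ydata-profiling's implementation: `column_alerts` is a hypothetical helper, it flags any column with at least one missing value (an assumption, since the tool's actual thresholds are not stated here), and it runs on three rows and four columns taken from the sample in this report, with `None` standing in for `<NA>`.

```python
def column_alerts(rows):
    """Return {column: [alert labels]} for constant or incomplete columns."""
    alerts = {}
    for column in rows[0]:
        values = [row[column] for row in rows]
        present = [v for v in values if v is not None]
        labels = []
        # Constant: every non-missing value is the same.
        if present and len(set(present)) == 1:
            labels.append("Constant")
        # Missing: assumption that any missing value is worth flagging.
        n_missing = len(values) - len(present)
        if n_missing:
            labels.append(f"Missing ({100 * n_missing / len(values):.1f}%)")
        if labels:
            alerts[column] = labels
    return alerts


# Rows 0, 1 and 6 of the sample table (address column left out for brevity).
rows = [
    {"업종명": "관광숙박업", "업소명": "JN PARK HOTEL(제이앤파크호텔)",
     "소재지전화": "032-752-9892", "데이터기준일": "2023-07-20"},
    {"업종명": "관광숙박업", "업소명": "M(엠)관광호텔",
     "소재지전화": "032-889-0245", "데이터기준일": "2023-07-20"},
    {"업종명": "관광숙박업", "업소명": "나폴리 호스텔",
     "소재지전화": None, "데이터기준일": "2023-07-20"},
]

for column, labels in column_alerts(rows).items():
    print(column, labels)
```

On the full dataset the same missing-value ratio would be 86 / 134, which matches the 64.2% reported for 소재지전화.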

Reproduction

Analysis started: 2023-12-12 14:35:44.407992
Analysis finished: 2023-12-12 14:35:44.855347
Duration: 0.45 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
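
The reported duration is the difference between the two timestamps. A small sketch using only the Python standard library, assuming both timestamps are in the same time zone:

```python
from datetime import datetime

# Timestamps copied from the reproduction details above.
started = datetime.fromisoformat("2023-12-12 14:35:44.407992")
finished = datetime.fromisoformat("2023-12-12 14:35:44.855347")

# Elapsed wall-clock time in seconds.
duration_seconds = (finished - started).total_seconds()
print(f"Duration: {duration_seconds:.2f} seconds")
```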

Variables

업종명 (business category)
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB

| Value | Count |
| 관광숙박업 | 134 |

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 관광숙박업
2nd row: 관광숙박업
3rd row: 관광숙박업
4th row: 관광숙박업
5th row: 관광숙박업

Common Values

| Value | Count | Frequency (%) |
| 관광숙박업 | 134 | 100.0% |

Length

Histogram of lengths of the category


업소명 (business name)
Text

Distinct: 133
Distinct (%): 99.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB

Length

Max length: 24
Median length: 16
Mean length: 7.0149254
Min length: 1

Characters and Unicode

Total characters: 940
Distinct characters: 213
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
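
As an illustration of such character properties, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. It is not the profiler's own code: `category_counts` and `CATEGORY_NAMES` are hypothetical names, and the input is limited to the five sample values listed for this variable, so the totals will not match the full-column figures.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that appear in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
}


def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for char in value:
            code = unicodedata.category(char)
            # Fall back to the two-letter code for categories not listed above.
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


# The five sample values listed for this variable.
sample = [
    "JN PARK HOTEL(제이앤파크호텔)",
    "M(엠)관광호텔",
    "MK프리미어스 호텔",
    "THEWEEEK&",
    "W hotel",
]

for name, count in category_counts(sample).most_common():
    print(name, count)
```

Hangul syllables fall under "Other Letter" because they have no upper or lower case, which is why that category dominates the full-column statistics.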

Unique

Unique: 132
Unique (%): 98.5%

Sample

1st row: JN PARK HOTEL(제이앤파크호텔)
2nd row: M(엠)관광호텔
3rd row: MK프리미어스 호텔
4th row: THEWEEEK&
5th row: W hotel

| Value | Count | Frequency (%) |
| 호스텔 | 25 | 12.1% |
| 관광호텔 | 11 | 5.3% |
| 호텔 | 6 | 2.9% |
| 비치힐 | 4 | 1.9% |
| 백운호스텔 | 2 | 1.0% |
| 펜션 | 2 | 1.0% |
| 베니키아 | 2 | 1.0% |
|  | 2 | 1.0% |
| 어느멋진날에 | 2 | 1.0% |
| 월미도 | 2 | 1.0% |
| Other values (147) | 148 | 71.8% |

Most occurring characters

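| Value | Count | Frequency (%) |
|  | 91 | 9.7% |
|  | 90 | 9.6% |
|  | 76 | 8.1% |
| (space) | 72 | 7.7% |
|  | 24 | 2.6% |
|  | 23 | 2.4% |
|  | 21 | 2.2% |
|  | 16 | 1.7% |
|  | 15 | 1.6% |
|  | 14 | 1.5% |
| Other values (203) | 498 | 53.0% |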

Most occurring categories

| Value | Count | Frequency (%) |
| Other Letter | 811 | 86.3% |
| Space Separator | 72 | 7.7% |
| Uppercase Letter | 28 | 3.0% |
| Decimal Number | 11 | 1.2% |
| Open Punctuation | 5 | 0.5% |
| Close Punctuation | 5 | 0.5% |
| Lowercase Letter | 5 | 0.5% |
| Other Punctuation | 2 | 0.2% |
| Dash Punctuation | 1 | 0.1% |

Most frequent character per category

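Other Letter
| Value | Count | Frequency (%) |
|  | 91 | 11.2% |
|  | 90 | 11.1% |
|  | 76 | 9.4% |
|  | 24 | 3.0% |
|  | 23 | 2.8% |
|  | 21 | 2.6% |
|  | 16 | 2.0% |
|  | 15 | 1.8% |
|  | 14 | 1.7% |
|  | 12 | 1.5% |
| Other values (173) | 429 | 52.9% |

Uppercase Letter
| Value | Count | Frequency (%) |
| E | 5 | 17.9% |
| K | 3 | 10.7% |
| A | 3 | 10.7% |
| R | 2 | 7.1% |
| O | 2 | 7.1% |
| H | 2 | 7.1% |
| W | 2 | 7.1% |
| T | 2 | 7.1% |
| M | 2 | 7.1% |
| L | 1 | 3.6% |
| Other values (4) | 4 | 14.3% |

Decimal Number
| Value | Count | Frequency (%) |
| 2 | 4 | 36.4% |
| 9 | 2 | 18.2% |
| 1 | 2 | 18.2% |
| 5 | 1 | 9.1% |
| 3 | 1 | 9.1% |
| 0 | 1 | 9.1% |

Lowercase Letter
| Value | Count | Frequency (%) |
| h | 1 | 20.0% |
| o | 1 | 20.0% |
| t | 1 | 20.0% |
| e | 1 | 20.0% |
| l | 1 | 20.0% |

Space Separator
| Value | Count | Frequency (%) |
| (space) | 72 | 100.0% |

Open Punctuation
| Value | Count | Frequency (%) |
| ( | 5 | 100.0% |

Close Punctuation
| Value | Count | Frequency (%) |
| ) | 5 | 100.0% |

Other Punctuation
| Value | Count | Frequency (%) |
| & | 2 | 100.0% |

Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 | 100.0% |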

Most occurring scripts

| Value | Count | Frequency (%) |
| Hangul | 811 | 86.3% |
| Common | 96 | 10.2% |
| Latin | 33 | 3.5% |

Most frequent character per script

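Hangul
| Value | Count | Frequency (%) |
|  | 91 | 11.2% |
|  | 90 | 11.1% |
|  | 76 | 9.4% |
|  | 24 | 3.0% |
|  | 23 | 2.8% |
|  | 21 | 2.6% |
|  | 16 | 2.0% |
|  | 15 | 1.8% |
|  | 14 | 1.7% |
|  | 12 | 1.5% |
| Other values (173) | 429 | 52.9% |

Latin
| Value | Count | Frequency (%) |
| E | 5 | 15.2% |
| K | 3 | 9.1% |
| A | 3 | 9.1% |
| R | 2 | 6.1% |
| O | 2 | 6.1% |
| H | 2 | 6.1% |
| W | 2 | 6.1% |
| T | 2 | 6.1% |
| M | 2 | 6.1% |
| h | 1 | 3.0% |
| Other values (9) | 9 | 27.3% |

Common
| Value | Count | Frequency (%) |
| (space) | 72 | 75.0% |
| ( | 5 | 5.2% |
| ) | 5 | 5.2% |
| 2 | 4 | 4.2% |
| 9 | 2 | 2.1% |
| & | 2 | 2.1% |
| 1 | 2 | 2.1% |
| - | 1 | 1.0% |
| 5 | 1 | 1.0% |
| 3 | 1 | 1.0% |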

Most occurring blocks

| Value | Count | Frequency (%) |
| Hangul | 811 | 86.3% |
| ASCII | 129 | 13.7% |

Most frequent character per block

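Hangul
| Value | Count | Frequency (%) |
|  | 91 | 11.2% |
|  | 90 | 11.1% |
|  | 76 | 9.4% |
|  | 24 | 3.0% |
|  | 23 | 2.8% |
|  | 21 | 2.6% |
|  | 16 | 2.0% |
|  | 15 | 1.8% |
|  | 14 | 1.7% |
|  | 12 | 1.5% |
| Other values (173) | 429 | 52.9% |

ASCII
| Value | Count | Frequency (%) |
| (space) | 72 | 55.8% |
| ( | 5 | 3.9% |
| ) | 5 | 3.9% |
| E | 5 | 3.9% |
| 2 | 4 | 3.1% |
| K | 3 | 2.3% |
| A | 3 | 2.3% |
| 9 | 2 | 1.6% |
| R | 2 | 1.6% |
| O | 2 | 1.6% |
| Other values (20) | 26 | 20.2% |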

소재지 (location)
Text

Distinct: 124
Distinct (%): 92.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB

Length

Max length: 47
Median length: 34
Mean length: 27.619403
Min length: 18

Characters and Unicode

Total characters: 3701
Distinct characters: 153
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 116
Unique (%): 86.6%

Sample

1st row: 인천광역시 중구 용유서로 262-15 (을왕동)
2nd row: 인천광역시 중구 연안부두로43번길 12 (항동7가)
3rd row: 인천광역시 중구 월미로260번길 12 (북성동1가)
4th row: 인천광역시 중구 용유서로 379, 더위크앤리조트 (을왕동)
5th row: 인천광역시 중구 월미로 211, W호텔 (북성동1가)

| Value | Count | Frequency (%) |
| 인천광역시 | 134 | 19.1% |
| 중구 | 134 | 19.1% |
| 을왕동 | 92 | 13.1% |
| 을왕로58번길 | 15 | 2.1% |
| 북성동1가 | 15 | 2.1% |
| 용유서로302번길 | 11 | 1.6% |
| 용유서로 | 11 | 1.6% |
| 무의동 | 8 | 1.1% |
| 12 | 8 | 1.1% |
| 왕산로 | 7 | 1.0% |
| Other values (186) | 267 | 38.0% |

Most occurring characters

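| Value | Count | Frequency (%) |
| (space) | 584 | 15.8% |
|  | 139 | 3.8% |
|  | 138 | 3.7% |
|  | 137 | 3.7% |
|  | 136 | 3.7% |
|  | 135 | 3.6% |
|  | 134 | 3.6% |
|  | 134 | 3.6% |
|  | 134 | 3.6% |
|  | 126 | 3.4% |
| Other values (143) | 1904 | 51.4% |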

Most occurring categories

| Value | Count | Frequency (%) |
| Other Letter | 2200 | 59.4% |
| Space Separator | 584 | 15.8% |
| Decimal Number | 584 | 15.8% |
| Open Punctuation | 118 | 3.2% |
| Close Punctuation | 118 | 3.2% |
| Dash Punctuation | 47 | 1.3% |
| Other Punctuation | 46 | 1.2% |
| Uppercase Letter | 4 | 0.1% |

Most frequent character per category

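Other Letter
| Value | Count | Frequency (%) |
|  | 139 | 6.3% |
|  | 138 | 6.3% |
|  | 137 | 6.2% |
|  | 136 | 6.2% |
|  | 135 | 6.1% |
|  | 134 | 6.1% |
|  | 134 | 6.1% |
|  | 134 | 6.1% |
|  | 126 | 5.7% |
|  | 118 | 5.4% |
| Other values (125) | 869 | 39.5% |

Decimal Number
| Value | Count | Frequency (%) |
| 1 | 101 | 17.3% |
| 2 | 95 | 16.3% |
| 3 | 68 | 11.6% |
| 5 | 66 | 11.3% |
| 8 | 58 | 9.9% |
| 4 | 58 | 9.9% |
| 0 | 50 | 8.6% |
| 7 | 38 | 6.5% |
| 6 | 28 | 4.8% |
| 9 | 22 | 3.8% |

Uppercase Letter
| Value | Count | Frequency (%) |
| W | 2 | 50.0% |
| T | 1 | 25.0% |
| O | 1 | 25.0% |

Space Separator
| Value | Count | Frequency (%) |
| (space) | 584 | 100.0% |

Open Punctuation
| Value | Count | Frequency (%) |
| ( | 118 | 100.0% |

Close Punctuation
| Value | Count | Frequency (%) |
| ) | 118 | 100.0% |

Dash Punctuation
| Value | Count | Frequency (%) |
| - | 47 | 100.0% |

Other Punctuation
| Value | Count | Frequency (%) |
| , | 46 | 100.0% |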

Most occurring scripts

| Value | Count | Frequency (%) |
| Hangul | 2200 | 59.4% |
| Common | 1497 | 40.4% |
| Latin | 4 | 0.1% |

Most frequent character per script

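Hangul
| Value | Count | Frequency (%) |
|  | 139 | 6.3% |
|  | 138 | 6.3% |
|  | 137 | 6.2% |
|  | 136 | 6.2% |
|  | 135 | 6.1% |
|  | 134 | 6.1% |
|  | 134 | 6.1% |
|  | 134 | 6.1% |
|  | 126 | 5.7% |
|  | 118 | 5.4% |
| Other values (125) | 869 | 39.5% |

Common
| Value | Count | Frequency (%) |
| (space) | 584 | 39.0% |
| ( | 118 | 7.9% |
| ) | 118 | 7.9% |
| 1 | 101 | 6.7% |
| 2 | 95 | 6.3% |
| 3 | 68 | 4.5% |
| 5 | 66 | 4.4% |
| 8 | 58 | 3.9% |
| 4 | 58 | 3.9% |
| 0 | 50 | 3.3% |
| Other values (5) | 181 | 12.1% |

Latin
| Value | Count | Frequency (%) |
| W | 2 | 50.0% |
| T | 1 | 25.0% |
| O | 1 | 25.0% |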

Most occurring blocks

| Value | Count | Frequency (%) |
| Hangul | 2200 | 59.4% |
| ASCII | 1501 | 40.6% |

Most frequent character per block

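ASCII
| Value | Count | Frequency (%) |
| (space) | 584 | 38.9% |
| ( | 118 | 7.9% |
| ) | 118 | 7.9% |
| 1 | 101 | 6.7% |
| 2 | 95 | 6.3% |
| 3 | 68 | 4.5% |
| 5 | 66 | 4.4% |
| 8 | 58 | 3.9% |
| 4 | 58 | 3.9% |
| 0 | 50 | 3.3% |
| Other values (8) | 185 | 12.3% |

Hangul
| Value | Count | Frequency (%) |
|  | 139 | 6.3% |
|  | 138 | 6.3% |
|  | 137 | 6.2% |
|  | 136 | 6.2% |
|  | 135 | 6.1% |
|  | 134 | 6.1% |
|  | 134 | 6.1% |
|  | 134 | 6.1% |
|  | 126 | 5.7% |
|  | 118 | 5.4% |
| Other values (125) | 869 | 39.5% |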

소재지전화 (location telephone number)
Text

MISSING 

Distinct: 44
Distinct (%): 91.7%
Missing: 86
Missing (%): 64.2%
Memory size: 1.2 KiB

Length

Max length: 12
Median length: 12
Mean length: 11.979167
Min length: 11

Characters and Unicode

Total characters: 575
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 40
Unique (%): 83.3%

Sample

1st row: 032-752-9892
2nd row: 032-889-0245
3rd row: 032-777-7272
4th row: 032-745-0000
5th row: 032-772-6300
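
The sample values all have 12 characters, while the minimum length reported for this variable is 11, so at least one number deviates from that shape. A format check can surface such entries. The sketch below is only one possible check: `PHONE_PATTERN` encodes an assumed landline layout (area code, exchange, line number separated by hyphens) and `phone_status` is a hypothetical helper; neither comes from the dataset's documentation.

```python
import re

# Assumed layout: 2-3 digit area code starting with 0, 3-4 digit exchange,
# 4 digit line number, separated by hyphens.
PHONE_PATTERN = re.compile(r"0\d{1,2}-\d{3,4}-\d{4}")


def phone_status(value):
    """Classify a phone field as 'missing', 'ok' or 'malformed'."""
    if value is None:
        return "missing"
    return "ok" if PHONE_PATTERN.fullmatch(value) else "malformed"


# The five sample values listed above, plus None for a missing entry.
samples = [
    "032-752-9892",
    "032-889-0245",
    "032-777-7272",
    "032-745-0000",
    "032-772-6300",
    None,
]

for value in samples:
    print(value, phone_status(value))
```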

| Value | Count | Frequency (%) |
| 032-765-7142 | 2 | 4.2% |
| 032-746-8600 | 2 | 4.2% |
| 032-751-9700 | 2 | 4.2% |
| 032-752-2000 | 2 | 4.2% |
| 032-760-7822 | 1 | 2.1% |
| 032-752-9892 | 1 | 2.1% |
| 032-777-5633 | 1 | 2.1% |
| 032-747-0015 | 1 | 2.1% |
| 032-764-3003 | 1 | 2.1% |
| 032-747-0170 | 1 | 2.1% |
| Other values (34) | 34 | 70.8% |

Most occurring characters

| Value | Count | Frequency (%) |
| 0 | 116 | 20.2% |
| - | 96 | 16.7% |
| 2 | 82 | 14.3% |
| 7 | 68 | 11.8% |
| 3 | 66 | 11.5% |
| 5 | 32 | 5.6% |
| 6 | 27 | 4.7% |
| 4 | 26 | 4.5% |
| 8 | 26 | 4.5% |
| 1 | 22 | 3.8% |

Most occurring categories

| Value | Count | Frequency (%) |
| Decimal Number | 479 | 83.3% |
| Dash Punctuation | 96 | 16.7% |

Most frequent character per category

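Decimal Number
| Value | Count | Frequency (%) |
| 0 | 116 | 24.2% |
| 2 | 82 | 17.1% |
| 7 | 68 | 14.2% |
| 3 | 66 | 13.8% |
| 5 | 32 | 6.7% |
| 6 | 27 | 5.6% |
| 4 | 26 | 5.4% |
| 8 | 26 | 5.4% |
| 1 | 22 | 4.6% |
| 9 | 14 | 2.9% |

Dash Punctuation
| Value | Count | Frequency (%) |
| - | 96 | 100.0% |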

Most occurring scripts

| Value | Count | Frequency (%) |
| Common | 575 | 100.0% |

Most frequent character per script

Common
| Value | Count | Frequency (%) |
| 0 | 116 | 20.2% |
| - | 96 | 16.7% |
| 2 | 82 | 14.3% |
| 7 | 68 | 11.8% |
| 3 | 66 | 11.5% |
| 5 | 32 | 5.6% |
| 6 | 27 | 4.7% |
| 4 | 26 | 4.5% |
| 8 | 26 | 4.5% |
| 1 | 22 | 3.8% |

Most occurring blocks

| Value | Count | Frequency (%) |
| ASCII | 575 | 100.0% |

Most frequent character per block

ASCII
| Value | Count | Frequency (%) |
| 0 | 116 | 20.2% |
| - | 96 | 16.7% |
| 2 | 82 | 14.3% |
| 7 | 68 | 11.8% |
| 3 | 66 | 11.5% |
| 5 | 32 | 5.6% |
| 6 | 27 | 4.7% |
| 4 | 26 | 4.5% |
| 8 | 26 | 4.5% |
| 1 | 22 | 3.8% |

데이터기준일 (data reference date)
Date

CONSTANT 

Distinct: 1
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB
Minimum: 2023-07-20 00:00:00
Maximum: 2023-07-20 00:00:00
Histogram with fixed size bins (bins=1)

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

|  | 업종명 | 업소명 | 소재지 | 소재지전화 | 데이터기준일 |
| 0 | 관광숙박업 | JN PARK HOTEL(제이앤파크호텔) | 인천광역시 중구 용유서로 262-15 (을왕동) | 032-752-9892 | 2023-07-20 |
| 1 | 관광숙박업 | M(엠)관광호텔 | 인천광역시 중구 연안부두로43번길 12 (항동7가) | 032-889-0245 | 2023-07-20 |
| 2 | 관광숙박업 | MK프리미어스 호텔 | 인천광역시 중구 월미로260번길 12 (북성동1가) | 032-777-7272 | 2023-07-20 |
| 3 | 관광숙박업 | THEWEEEK& | 인천광역시 중구 용유서로 379, 더위크앤리조트 (을왕동) | 032-745-0000 | 2023-07-20 |
| 4 | 관광숙박업 | W hotel | 인천광역시 중구 월미로 211, W호텔 (북성동1가) | 032-772-6300 | 2023-07-20 |
| 5 | 관광숙박업 |  | 인천광역시 중구 용유서로 380-18 (을왕동) | <NA> | 2023-07-20 |
| 6 | 관광숙박업 | 나폴리 호스텔 | 인천광역시 중구 용유서로 340 (을왕동) | <NA> | 2023-07-20 |
| 7 | 관광숙박업 | 뉴욕파크 호스텔 | 인천광역시 중구 을왕로40번길 5 (을왕동) | 032-752-8631 | 2023-07-20 |
| 8 | 관광숙박업 | 다이아몬드 호텔 | 인천광역시 중구 월미로 299 (북성동1가, 다이아몬드호텔) | 032-777-9909 | 2023-07-20 |
| 9 | 관광숙박업 | 당나귀호스텔 | 인천광역시 중구 왕산로48번길 7 (을왕동) | <NA> | 2023-07-20 |

|  | 업종명 | 업소명 | 소재지 | 소재지전화 | 데이터기준일 |
| 124 | 관광숙박업 | 해처럼 | 인천광역시 중구 왕산로 72, 해처럼팬션 (을왕동) | <NA> | 2023-07-20 |
| 125 | 관광숙박업 | 호텔 그랜드 스위트 | 인천광역시 중구 월미로248번길 2 (북성동1가) | 032-777-5633 | 2023-07-20 |
| 126 | 관광숙박업 | 호텔ORA | 인천광역시 중구 공항서로 345 (남북동, 트윈TWO관광호텔) | 032-752-8080 | 2023-07-20 |
| 127 | 관광숙박업 | 호텔오션뷰 | 인천광역시 중구 선녀바위로55번길 13, 호텔오션뷰 (을왕동) | <NA> | 2023-07-20 |
| 128 | 관광숙박업 | 호텔휴로프트 | 인천광역시 중구 마시란로 51-29 (덕교동) | 032-751-3800 | 2023-07-20 |
| 129 | 관광숙박업 | 홀리랜드A 호스텔 | 인천광역시 중구 을왕로58번길 9-9 (을왕동, 홀리랜드) | 032-746-8600 | 2023-07-20 |
| 130 | 관광숙박업 | 홀리랜드B 호스텔 | 인천광역시 중구 을왕로58번길 9-9 (을왕동, 홀리랜드) | 032-746-8600 | 2023-07-20 |
| 131 | 관광숙박업 | 화이트 | 인천광역시 중구 왕산로52번길 14 (을왕동) | <NA> | 2023-07-20 |
| 132 | 관광숙박업 | 휴 호스텔 | 인천광역시 중구 선녀바위로55번길 35 (을왕동) | <NA> | 2023-07-20 |
| 133 | 관광숙박업 | 힛포 | 인천광역시 중구 선녀바위로55번길 10-4, 을왕선녀펜션 (을왕동) | <NA> | 2023-07-20 |

Duplicate rows

Most frequently occurring

|  | 업종명 | 업소명 | 소재지 | 소재지전화 | 데이터기준일 | # duplicates |
| 0 | 관광숙박업 | 백운호스텔 | 인천광역시 중구 을왕로58번길 14 (을왕동) | <NA> | 2023-07-20 | 2 |
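
Exact duplicates like the one listed here can be found by counting identical rows. The sketch below uses only the Python standard library and is not the profiler's implementation; `duplicate_rows` is a hypothetical helper, rows are modelled as tuples, and `None` stands in for `<NA>` (so two missing phone numbers compare as equal, which is an assumption about how missing values should be treated).

```python
from collections import Counter


def duplicate_rows(rows):
    """Return {row: occurrences} for rows that appear more than once."""
    counts = Counter(rows)
    return {row: n for row, n in counts.items() if n > 1}


# The duplicated record from this report, entered twice, plus one
# unrelated row from the sample table.
rows = [
    ("관광숙박업", "백운호스텔", "인천광역시 중구 을왕로58번길 14 (을왕동)", None, "2023-07-20"),
    ("관광숙박업", "백운호스텔", "인천광역시 중구 을왕로58번길 14 (을왕동)", None, "2023-07-20"),
    ("관광숙박업", "W hotel", "인천광역시 중구 월미로 211, W호텔 (북성동1가)", "032-772-6300", "2023-07-20"),
]

for row, n in duplicate_rows(rows).items():
    print(n, row)
```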