Overview

Dataset statistics

Number of variables: 4
Number of observations: 1390
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 43.6 KiB
Average record size in memory: 32.1 B
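
As a quick sanity check, the average record size can be recomputed from the total memory size and the number of observations. The sketch below only uses the figures printed in the table above; the variable names are illustrative and not part of the report.

```python
# Figures copied from the Dataset statistics table.
total_size_kib = 43.6   # "Total size in memory", in KiB
n_observations = 1390   # "Number of observations"

# 1 KiB is 1024 bytes; divide the total by the number of rows.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"{avg_record_bytes:.1f} B")  # prints "32.1 B"
```

The result agrees with the reported 32.1 B once rounded to one decimal place.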

Variable types

Categorical: 2
Text: 2

Dataset

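Description: Status of tourism businesses in Incheon Metropolitan City (classification/category/business name/region/location/registration date). * Data from the Incheon Metropolitan City tourism business statistics system (인천광역시 관광사업체통계시스템)
Author: Incheon Metropolitan City (인천광역시)
URL: https://data.incheon.go.kr/findData/publicDataDetail?dataId=15066564&srcSe=7661IVAWM27C61E190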

Reproduction

Analysis started: 2024-04-17 23:06:35.892841
Analysis finished: 2024-04-17 23:06:37.030567
Duration: 1.14 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
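
The reported duration follows directly from the two timestamps above. A minimal check using only the standard library (the variable names are illustrative):

```python
from datetime import datetime

# Timestamps copied from the Reproduction block.
started = datetime.fromisoformat("2024-04-17 23:06:35.892841")
finished = datetime.fromisoformat("2024-04-17 23:06:37.030567")

# Subtracting two datetimes yields a timedelta.
duration_s = (finished - started).total_seconds()

print(f"{duration_s:.2f} seconds")  # prints "1.14 seconds"
```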

Variables

군구
Categorical

Distinct: 10
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Memory size: 11.0 KiB

Value | Count
중구 | 382
연수구 | 194
서구 | 181
남동구 | 167
부평구 | 127
Other values (5) | 339

Length

Max length: 4
Median length: 3
Mean length: 2.6374101
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 연수구
2nd row: 중구
3rd row: 강화군
4th row: 계양구
5th row: 남동구

Common Values

Value | Count | Frequency (%)
중구 | 382 | 27.5%
연수구 | 194 | 14.0%
서구 | 181 | 13.0%
남동구 | 167 | 12.0%
부평구 | 127 | 9.1%
강화군 | 121 | 8.7%
미추홀구 | 74 | 5.3%
계양구 | 72 | 5.2%
옹진군 | 57 | 4.1%
동구 | 15 | 1.1%
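
The frequency percentages and the mean length reported for 군구 can be reproduced from the counts alone. The sketch below copies the counts from the Common Values table; the dictionary and variable names are illustrative, and the string lengths assume precomposed (NFC) Hangul syllables.

```python
# Counts per district, copied from the Common Values table for 군구.
counts = {
    "중구": 382, "연수구": 194, "서구": 181, "남동구": 167, "부평구": 127,
    "강화군": 121, "미추홀구": 74, "계양구": 72, "옹진군": 57, "동구": 15,
}

total = sum(counts.values())

# Share of each value, rounded to one decimal place as in the report.
freq_pct = {value: round(100 * n / total, 1) for value, n in counts.items()}

# Mean string length, weighting each value's length by its count.
mean_length = sum(len(value) * n for value, n in counts.items()) / total

print(total)
print(freq_pct["중구"])
print(round(mean_length, 7))
```

The total comes to 1390 observations, matching the dataset statistics, and the weighted mean length matches the reported 2.6374101.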

Length

Histogram of lengths of the category

Common Values (Plot)

[Figure: bar chart of the most common values; the counts match the Common Values table above.]

분류
Categorical

Distinct: 28
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Memory size: 11.0 KiB

Value | Count
종합여행업 | 294
국내외여행업 | 255
기타유원시설업 | 143
국내여행업 | 136
호스텔업 | 91
Other values (23) | 471

Length

Max length: 12
Median length: 10
Mean length: 5.8517986
Min length: 4

Unique

Unique: 2
Unique (%): 0.1%

Sample

1st row: 가족호텔업
2nd row: 가족호텔업
3rd row: 관광궤도업
4th row: 관광극장유흥업
5th row: 관광극장유흥업
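
The Distinct and Unique figures are easy to confuse. The sketch below assumes the usual ydata-profiling reading, which the report itself does not spell out: Distinct is the number of different values, and Unique is the number of values that occur exactly once. It is applied to the five sample rows above, not to the full column.

```python
from collections import Counter

# The five sample rows shown for 분류.
sample = ["가족호텔업", "가족호텔업", "관광궤도업", "관광극장유흥업", "관광극장유흥업"]

counts = Counter(sample)

# Distinct: how many different values appear at all.
distinct = len(counts)

# Unique: how many values appear exactly once.
unique = sum(1 for n in counts.values() if n == 1)

print(distinct, unique)  # prints "3 1"
```

Under the same reading, the full 분류 column has 28 different categories, of which only 2 occur in a single row.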

Common Values

Value | Count | Frequency (%)
종합여행업 | 294 | 21.2%
국내외여행업 | 255 | 18.3%
기타유원시설업 | 143 | 10.3%
국내여행업 | 136 | 9.8%
호스텔업 | 91 | 6.5%
일반야영장업 | 86 | 6.2%
외국인관광도시민박업 | 86 | 6.2%
관광호텔업 | 85 | 6.1%
관광식당업 | 59 | 4.2%
관광펜션업 | 28 | 2.0%
Other values (18) | 127 | 9.1%

Length

Histogram of lengths of the category

Common Values (Plot)

[Figure: bar chart of the most common values; the counts match the Common Values table above.]

업체명
Text

Distinct: 1315
Distinct (%): 94.6%
Missing: 0
Missing (%): 0.0%
Memory size: 11.0 KiB

Length

Max length: 39
Median length: 32
Mean length: 8.3546763
Min length: 3

Characters and Unicode

Total characters: 11613
Distinct characters: 618
Distinct categories: 12
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
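
To illustrate what such character properties look like, the sketch below counts the Unicode general category of each character in one business name from the sample rows, using Python's standard `unicodedata` module. This is only an illustration of the idea; it is not necessarily how the report computes its own tables.

```python
import unicodedata
from collections import Counter

# One business name taken from the sample rows of 업체명.
name = "디퍼(Differ) 7080 라이브"

# unicodedata.category returns a two-letter general category code,
# e.g. "Lo" (Other Letter), "Lu"/"Ll" (Uppercase/Lowercase Letter),
# "Nd" (Decimal Number), "Zs" (Space Separator),
# "Ps"/"Pe" (Open/Close Punctuation).
category_counts = Counter(unicodedata.category(ch) for ch in name)

for code, n in category_counts.most_common():
    print(code, n)
```

Hangul syllables fall under "Lo", which is why Other Letter dominates the category tables for the Korean text columns.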

Unique

Unique: 1242
Unique (%): 89.4%

Sample

1st row: 오크우드프리미어 인천호텔
2nd row: 오션스카이 가족호텔
3rd row: 해강개발 주식회사 강화리조트
4th row: 아라비안관광나이트
5th row: 디퍼(Differ) 7080 라이브

Value | Count | Frequency (%)
주식회사 | 40 | 2.2%
호스텔 | 29 | 1.6%
호텔 | 11 | 0.6%
여행사 | 10 | 0.6%
관광호텔 | 9 | 0.5%
키즈카페 | 8 | 0.4%
게스트하우스 | 8 | 0.4%
투어 | 6 | 0.3%
하우스 | 6 | 0.3%
캠핑장 | 5 | 0.3%
Other values (1505) | 1680 | 92.7%

Most occurring characters

ValueCountFrequency (%)
1812
 
15.6%
341
 
2.9%
276
 
2.4%
258
 
2.2%
241
 
2.1%
236
 
2.0%
199
 
1.7%
192
 
1.7%
188
 
1.6%
( 172
 
1.5%
Other values (608) 7698
66.3%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 8422 | 72.5%
Space Separator | 1812 | 15.6%
Lowercase Letter | 354 | 3.0%
Uppercase Letter | 311 | 2.7%
Other Symbol | 276 | 2.4%
Open Punctuation | 174 | 1.5%
Close Punctuation | 173 | 1.5%
Decimal Number | 44 | 0.4%
Other Punctuation | 38 | 0.3%
Dash Punctuation | 7 | 0.1%
Other values (2) | 2 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
341
 
4.0%
258
 
3.1%
241
 
2.9%
236
 
2.8%
199
 
2.4%
192
 
2.3%
188
 
2.2%
161
 
1.9%
154
 
1.8%
150
 
1.8%
Other values (535) 6302
74.8%
Uppercase Letter
ValueCountFrequency (%)
T 35
 
11.3%
O 26
 
8.4%
A 25
 
8.0%
E 24
 
7.7%
S 21
 
6.8%
B 21
 
6.8%
R 19
 
6.1%
C 16
 
5.1%
H 15
 
4.8%
L 12
 
3.9%
Other values (15) 97
31.2%
Lowercase Letter
ValueCountFrequency (%)
e 50
14.1%
a 40
11.3%
o 33
 
9.3%
r 25
 
7.1%
t 25
 
7.1%
s 24
 
6.8%
p 18
 
5.1%
m 18
 
5.1%
u 16
 
4.5%
n 14
 
4.0%
Other values (14) 91
25.7%
Decimal Number
ValueCountFrequency (%)
2 9
20.5%
1 8
18.2%
0 6
13.6%
4 5
11.4%
3 4
9.1%
9 4
9.1%
7 4
9.1%
8 2
 
4.5%
5 1
 
2.3%
6 1
 
2.3%
Other Punctuation
ValueCountFrequency (%)
& 16
42.1%
; 15
39.5%
, 4
 
10.5%
. 2
 
5.3%
' 1
 
2.6%
Open Punctuation
ValueCountFrequency (%)
( 172
98.9%
[ 2
 
1.1%
Close Punctuation
ValueCountFrequency (%)
) 171
98.8%
] 2
 
1.2%
Space Separator
ValueCountFrequency (%)
1812
100.0%
Other Symbol
ValueCountFrequency (%)
276
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 8698 | 74.9%
Common | 2250 | 19.4%
Latin | 665 | 5.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
341
 
3.9%
276
 
3.2%
258
 
3.0%
241
 
2.8%
236
 
2.7%
199
 
2.3%
192
 
2.2%
188
 
2.2%
161
 
1.9%
154
 
1.8%
Other values (536) 6452
74.2%
Latin
ValueCountFrequency (%)
e 50
 
7.5%
a 40
 
6.0%
T 35
 
5.3%
o 33
 
5.0%
O 26
 
3.9%
A 25
 
3.8%
r 25
 
3.8%
t 25
 
3.8%
E 24
 
3.6%
s 24
 
3.6%
Other values (39) 358
53.8%
Common
ValueCountFrequency (%)
1812
80.5%
( 172
 
7.6%
) 171
 
7.6%
& 16
 
0.7%
; 15
 
0.7%
2 9
 
0.4%
1 8
 
0.4%
- 7
 
0.3%
0 6
 
0.3%
4 5
 
0.2%
Other values (13) 29
 
1.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 8422 | 72.5%
ASCII | 2914 | 25.1%
None | 276 | 2.4%
Punctuation | 1 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1812
62.2%
( 172
 
5.9%
) 171
 
5.9%
e 50
 
1.7%
a 40
 
1.4%
T 35
 
1.2%
o 33
 
1.1%
O 26
 
0.9%
A 25
 
0.9%
r 25
 
0.9%
Other values (61) 525
 
18.0%
Hangul
ValueCountFrequency (%)
341
 
4.0%
258
 
3.1%
241
 
2.9%
236
 
2.8%
199
 
2.4%
192
 
2.3%
188
 
2.2%
161
 
1.9%
154
 
1.8%
150
 
1.8%
Other values (535) 6302
74.8%
None
ValueCountFrequency (%)
276
100.0%
Punctuation
ValueCountFrequency (%)
1
100.0%

주소(도로명)
Text

Distinct: 1331
Distinct (%): 95.8%
Missing: 0
Missing (%): 0.0%
Memory size: 11.0 KiB

Length

Max length: 55
Median length: 42
Mean length: 25.371223
Min length: 8

Characters and Unicode

Total characters: 35266
Distinct characters: 425
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1274
Unique (%): 91.7%

Sample

1st row: 연수구 컨벤시아대로 165
2nd row: 인천광역시 중구 선녀바위로 35
3rd row: 인천시 강화군 길상면 장흥로 217
4th row: 인천광역시 계양구 도두리로 14 (작전동) 4~7층
5th row: 인천광역시 남동구 앵고개로948번길 54 (논현동,굿모닝타워 10층)

Value | Count | Frequency (%)
중구 | 373 | 5.7%
연수구 | 194 | 3.0%
서구 | 179 | 2.7%
남동구 | 164 | 2.5%
인천광역시 | 152 | 2.3%
부평구 | 126 | 1.9%
강화군 | 114 | 1.7%
인천 | 100 | 1.5%
미추홀구 | 73 | 1.1%
계양구 | 66 | 1.0%
Other values (2593) | 5014 | 76.5%

Most occurring characters

ValueCountFrequency (%)
5164
 
14.6%
1 1664
 
4.7%
1303
 
3.7%
1298
 
3.7%
1288
 
3.7%
2 1224
 
3.5%
, 1151
 
3.3%
0 912
 
2.6%
3 911
 
2.6%
) 881
 
2.5%
Other values (415) 19470
55.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 18550 | 52.6%
Decimal Number | 7989 | 22.7%
Space Separator | 5165 | 14.6%
Other Punctuation | 1164 | 3.3%
Close Punctuation | 881 | 2.5%
Open Punctuation | 879 | 2.5%
Dash Punctuation | 420 | 1.2%
Uppercase Letter | 184 | 0.5%
Math Symbol | 18 | 0.1%
Lowercase Letter | 12 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1303
 
7.0%
1298
 
7.0%
1288
 
6.9%
602
 
3.2%
575
 
3.1%
521
 
2.8%
436
 
2.4%
400
 
2.2%
365
 
2.0%
359
 
1.9%
Other values (364) 11403
61.5%
Uppercase Letter
ValueCountFrequency (%)
B 60
32.6%
A 24
 
13.0%
C 19
 
10.3%
I 17
 
9.2%
D 15
 
8.2%
S 10
 
5.4%
T 7
 
3.8%
E 5
 
2.7%
R 4
 
2.2%
H 4
 
2.2%
Other values (11) 19
 
10.3%
Decimal Number
ValueCountFrequency (%)
1 1664
20.8%
2 1224
15.3%
0 912
11.4%
3 911
11.4%
4 721
9.0%
5 606
 
7.6%
6 580
 
7.3%
7 517
 
6.5%
8 477
 
6.0%
9 377
 
4.7%
Lowercase Letter
ValueCountFrequency (%)
v 2
16.7%
i 2
16.7%
c 2
16.7%
e 1
8.3%
r 1
8.3%
u 1
8.3%
o 1
8.3%
t 1
8.3%
g 1
8.3%
Other Punctuation
ValueCountFrequency (%)
, 1151
98.9%
. 9
 
0.8%
: 2
 
0.2%
/ 2
 
0.2%
Space Separator
ValueCountFrequency (%)
5164
> 99.9%
  1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 881
100.0%
Open Punctuation
ValueCountFrequency (%)
( 879
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 420
100.0%
Math Symbol
ValueCountFrequency (%)
~ 18
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 18550 | 52.6%
Common | 16520 | 46.8%
Latin | 196 | 0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1303
 
7.0%
1298
 
7.0%
1288
 
6.9%
602
 
3.2%
575
 
3.1%
521
 
2.8%
436
 
2.4%
400
 
2.2%
365
 
2.0%
359
 
1.9%
Other values (364) 11403
61.5%
Latin
ValueCountFrequency (%)
B 60
30.6%
A 24
 
12.2%
C 19
 
9.7%
I 17
 
8.7%
D 15
 
7.7%
S 10
 
5.1%
T 7
 
3.6%
E 5
 
2.6%
R 4
 
2.0%
H 4
 
2.0%
Other values (20) 31
15.8%
Common
ValueCountFrequency (%)
5164
31.3%
1 1664
 
10.1%
2 1224
 
7.4%
, 1151
 
7.0%
0 912
 
5.5%
3 911
 
5.5%
) 881
 
5.3%
( 879
 
5.3%
4 721
 
4.4%
5 606
 
3.7%
Other values (11) 2407
14.6%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 18550 | 52.6%
ASCII | 16711 | 47.4%
CJK Compat | 4 | < 0.1%
None | 1 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5164
30.9%
1 1664
 
10.0%
2 1224
 
7.3%
, 1151
 
6.9%
0 912
 
5.5%
3 911
 
5.5%
) 881
 
5.3%
( 879
 
5.3%
4 721
 
4.3%
5 606
 
3.6%
Other values (39) 2598
15.5%
Hangul
ValueCountFrequency (%)
1303
 
7.0%
1298
 
7.0%
1288
 
6.9%
602
 
3.2%
575
 
3.1%
521
 
2.8%
436
 
2.4%
400
 
2.2%
365
 
2.0%
359
 
1.9%
Other values (364) 11403
61.5%
CJK Compat
ValueCountFrequency (%)
4
100.0%
None
ValueCountFrequency (%)
  1
100.0%

Correlations

(variable) | 군구 | 분류
군구 | 1.000 | 0.673
분류 | 0.673 | 1.000

(variable) | 군구 | 분류
군구 | 1.000 | 0.312
분류 | 0.312 | 1.000

(variable) | 군구 | 분류
군구 | 1.000 | 0.312
분류 | 0.312 | 1.000
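
The report does not name the coefficient behind each matrix. For two categorical columns such as 군구 and 분류, one common association measure is Cramér's V, which ranges from 0 (no association) to 1 (perfect association) and is symmetric, like the matrices above. The sketch below is a plain, uncorrected Cramér's V on hypothetical toy data; the function name and the inputs are assumptions for illustration, and profiling tools may apply a bias correction or use a different coefficient entirely.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys):
    """Uncorrected Cramér's V for two equal-length categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))  # observed count of each (x, y) pair
    row = Counter(xs)             # marginal counts of x
    col = Counter(ys)             # marginal counts of y

    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for x, rx in row.items():
        for y, cy in col.items():
            expected = rx * cy / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected

    # Normalise by n * (min(#rows, #cols) - 1).
    k = min(len(row), len(col)) - 1
    return sqrt(chi2 / (n * k)) if k > 0 else 0.0


# Hypothetical toy data, not taken from the dataset.
perfect = cramers_v(["a", "a", "b", "b"], ["x", "x", "y", "y"])
independent = cramers_v(["a", "a", "b", "b"], ["x", "y", "x", "y"])
print(perfect, independent)
```

With the toy inputs, the first pair of columns determines each other completely and the second pair is independent, which gives the two extremes of the scale.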

Missing values

The nullity matrix is a data-dense display that lets you quickly spot patterns in data completeness.

Sample

First rows

(index) | 군구 | 분류 | 업체명 | 주소(도로명)
0 | 연수구 | 가족호텔업 | 오크우드프리미어 인천호텔 | 연수구 컨벤시아대로 165
1 | 중구 | 가족호텔업 | 오션스카이 가족호텔 | 인천광역시 중구 선녀바위로 35
2 | 강화군 | 관광궤도업 | 해강개발 주식회사 강화리조트 | 인천시 강화군 길상면 장흥로 217
3 | 계양구 | 관광극장유흥업 | 아라비안관광나이트 | 인천광역시 계양구 도두리로 14 (작전동) 4~7층
4 | 남동구 | 관광극장유흥업 | 디퍼(Differ) 7080 라이브 | 인천광역시 남동구 앵고개로948번길 54 (논현동,굿모닝타워 10층)
5 | 미추홀구 | 관광극장유흥업 | 리버관광나이트 | 인천광역시 미추홀구 주안중로 19 (주안동)
6 | 미추홀구 | 관광극장유흥업 | 코리아관광나이트 | 미추홀구 경인로 392(주안동)
7 | 미추홀구 | 관광극장유흥업 | 동경중년관광나이트 | 미추홀구 주안로 116
8 | 미추홀구 | 관광극장유흥업 | 백악관관광나이트 | 미추홀구 주안동로 4 (주안동)
9 | 미추홀구 | 관광극장유흥업 | 뉴월드관광나이트클럽 | 미추홀구 주안로 136, 3층 (주안동)

Last rows

(index) | 군구 | 분류 | 업체명 | 주소(도로명)
1380 | 중구 | 호스텔업 | 꿈호스텔 | 중구 용유서로 380-18
1381 | 중구 | 호스텔업 | 원더풀호스텔 | 중구 왕산로 68-16
1382 | 중구 | 호스텔업 | 레드호스텔 | 중구 왕산로71
1383 | 중구 | 호스텔업 | 큐하우스 | 중구 용유서로302번길 36(을왕동 741)
1384 | 중구 | 호스텔업 | 오아시스 | 중구 용유서로 262-5
1385 | 중구 | 호스텔업 | 플레이하운드 | 중구 용유서로 450번길 41-21,22
1386 | 중구 | 호스텔업 | 어울림 펜션 | 중구 을왕동 783-1 외 2필지(용유서로 348번길 11-8)
1387 | 중구 | 호스텔업 | 선녀바위로 5 | 인천시 중구 을왕동 703-14
1388 | 옹진군 | 휴양콘도미니엄업 | 마린프라자㈜ | 인천광역시 옹진군 자월면 승봉로29번길 17-41
1389 | 중구 | 휴양콘도미니엄업 | TheWeek&amp; (더위크앤 리조트) | 중구 용유서로 379(을왕동 773)