Overview

Dataset statistics

Number of variables4
Number of observations72
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.4 KiB
Average record size in memory33.8 B

Variable types

Text2
Categorical2

Dataset

Description인천광역시 연수구 관내 숙박업소 공중위생서비스 현황에 관한 데이터로 녹색, 황색, 백색등급 및 최우수, 우수업소 현황 자료입니다.
Author인천광역시 연수구
URLhttps://www.data.go.kr/data/15039741/fileData.do

Alerts

등급 is highly overall correlated with 구분High correlation
구분 is highly overall correlated with 등급High correlation
업소명 has unique valuesUnique
업소소재지(도로명) has unique valuesUnique

Reproduction

Analysis started2024-04-06 08:01:15.000397
Analysis finished2024-04-06 08:01:18.296973
Duration3.3 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업소명
Text

UNIQUE 

Distinct72
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size708.0 B
2024-04-06T17:01:18.628346image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length13
Mean length6.7916667
Min length2

Characters and Unicode

Total characters489
Distinct characters151
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique72 ?
Unique (%)100.0%

Sample

1st row호텔리오
2nd row호텔메이
3rd row이채모텔
4th row포시즌 모텔
5th row줌모텔
ValueCountFrequency (%)
모텔 6
 
5.7%
인천송도점 5
 
4.7%
호텔 4
 
3.8%
스테이 3
 
2.8%
랜드마크 2
 
1.9%
인천 2
 
1.9%
송도 2
 
1.9%
리치호텔 1
 
0.9%
둥지모텔 1
 
0.9%
송도점 1
 
0.9%
Other values (79) 79
74.5%
2024-04-06T17:01:19.340064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
53
 
10.8%
35
 
7.2%
34
 
7.0%
19
 
3.9%
18
 
3.7%
17
 
3.5%
17
 
3.5%
14
 
2.9%
13
 
2.7%
11
 
2.2%
Other values (141) 258
52.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 410
83.8%
Space Separator 34
 
7.0%
Uppercase Letter 16
 
3.3%
Decimal Number 9
 
1.8%
Open Punctuation 7
 
1.4%
Close Punctuation 7
 
1.4%
Lowercase Letter 6
 
1.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
53
 
12.9%
35
 
8.5%
19
 
4.6%
18
 
4.4%
17
 
4.1%
17
 
4.1%
14
 
3.4%
13
 
3.2%
11
 
2.7%
8
 
2.0%
Other values (114) 205
50.0%
Uppercase Letter
ValueCountFrequency (%)
E 4
25.0%
H 2
12.5%
V 2
12.5%
A 1
 
6.2%
N 1
 
6.2%
Y 1
 
6.2%
S 1
 
6.2%
I 1
 
6.2%
W 1
 
6.2%
Q 1
 
6.2%
Decimal Number
ValueCountFrequency (%)
2 2
22.2%
0 2
22.2%
5 1
11.1%
8 1
11.1%
9 1
11.1%
4 1
11.1%
3 1
11.1%
Lowercase Letter
ValueCountFrequency (%)
l 1
16.7%
e 1
16.7%
t 1
16.7%
m 1
16.7%
a 1
16.7%
o 1
16.7%
Space Separator
ValueCountFrequency (%)
34
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 410
83.8%
Common 57
 
11.7%
Latin 22
 
4.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
53
 
12.9%
35
 
8.5%
19
 
4.6%
18
 
4.4%
17
 
4.1%
17
 
4.1%
14
 
3.4%
13
 
3.2%
11
 
2.7%
8
 
2.0%
Other values (114) 205
50.0%
Latin
ValueCountFrequency (%)
E 4
18.2%
H 2
 
9.1%
V 2
 
9.1%
A 1
 
4.5%
l 1
 
4.5%
e 1
 
4.5%
N 1
 
4.5%
t 1
 
4.5%
m 1
 
4.5%
a 1
 
4.5%
Other values (7) 7
31.8%
Common
ValueCountFrequency (%)
34
59.6%
( 7
 
12.3%
) 7
 
12.3%
2 2
 
3.5%
0 2
 
3.5%
5 1
 
1.8%
8 1
 
1.8%
9 1
 
1.8%
4 1
 
1.8%
3 1
 
1.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 410
83.8%
ASCII 79
 
16.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
53
 
12.9%
35
 
8.5%
19
 
4.6%
18
 
4.4%
17
 
4.1%
17
 
4.1%
14
 
3.4%
13
 
3.2%
11
 
2.7%
8
 
2.0%
Other values (114) 205
50.0%
ASCII
ValueCountFrequency (%)
34
43.0%
( 7
 
8.9%
) 7
 
8.9%
E 4
 
5.1%
2 2
 
2.5%
0 2
 
2.5%
H 2
 
2.5%
V 2
 
2.5%
5 1
 
1.3%
A 1
 
1.3%
Other values (17) 17
21.5%
Distinct72
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size708.0 B
2024-04-06T17:01:19.745026image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length86
Median length49
Mean length33
Min length21

Characters and Unicode

Total characters2376
Distinct characters87
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique72 ?
Unique (%)100.0%

Sample

1st row인천광역시 연수구 인권로 16, 1~4층 (옥련동)
2nd row인천광역시 연수구 인권로9번길 10 (옥련동)
3rd row인천광역시 연수구 대암로 35 (옥련동)
4th row인천광역시 연수구 대암로 7 (옥련동)
5th row인천광역시 연수구 능허대로 185 (옥련동, 1~4층)
ValueCountFrequency (%)
인천광역시 72
17.1%
연수구 72
17.1%
옥련동 50
 
11.8%
송도동 14
 
3.3%
대암로 10
 
2.4%
능허대로 9
 
2.1%
능허대로179번길 8
 
1.9%
능허대로191번길 7
 
1.7%
아트센터대로168번길 7
 
1.7%
인권로 5
 
1.2%
Other values (110) 168
39.8%
2024-04-06T17:01:20.695550image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
350
 
14.7%
1 137
 
5.8%
83
 
3.5%
79
 
3.3%
78
 
3.3%
75
 
3.2%
75
 
3.2%
73
 
3.1%
72
 
3.0%
( 72
 
3.0%
Other values (77) 1282
54.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1323
55.7%
Decimal Number 434
 
18.3%
Space Separator 350
 
14.7%
Open Punctuation 72
 
3.0%
Close Punctuation 72
 
3.0%
Other Punctuation 69
 
2.9%
Math Symbol 33
 
1.4%
Dash Punctuation 19
 
0.8%
Uppercase Letter 4
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
83
 
6.3%
79
 
6.0%
78
 
5.9%
75
 
5.7%
75
 
5.7%
73
 
5.5%
72
 
5.4%
72
 
5.4%
72
 
5.4%
72
 
5.4%
Other values (58) 572
43.2%
Decimal Number
ValueCountFrequency (%)
1 137
31.6%
2 43
 
9.9%
9 37
 
8.5%
4 35
 
8.1%
3 35
 
8.1%
5 34
 
7.8%
7 32
 
7.4%
6 31
 
7.1%
8 26
 
6.0%
0 24
 
5.5%
Uppercase Letter
ValueCountFrequency (%)
A 2
50.0%
C 1
25.0%
B 1
25.0%
Space Separator
ValueCountFrequency (%)
350
100.0%
Open Punctuation
ValueCountFrequency (%)
( 72
100.0%
Close Punctuation
ValueCountFrequency (%)
) 72
100.0%
Other Punctuation
ValueCountFrequency (%)
, 69
100.0%
Math Symbol
ValueCountFrequency (%)
~ 33
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 19
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1323
55.7%
Common 1049
44.1%
Latin 4
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
83
 
6.3%
79
 
6.0%
78
 
5.9%
75
 
5.7%
75
 
5.7%
73
 
5.5%
72
 
5.4%
72
 
5.4%
72
 
5.4%
72
 
5.4%
Other values (58) 572
43.2%
Common
ValueCountFrequency (%)
350
33.4%
1 137
 
13.1%
( 72
 
6.9%
) 72
 
6.9%
, 69
 
6.6%
2 43
 
4.1%
9 37
 
3.5%
4 35
 
3.3%
3 35
 
3.3%
5 34
 
3.2%
Other values (6) 165
15.7%
Latin
ValueCountFrequency (%)
A 2
50.0%
C 1
25.0%
B 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1323
55.7%
ASCII 1053
44.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
350
33.2%
1 137
 
13.0%
( 72
 
6.8%
) 72
 
6.8%
, 69
 
6.6%
2 43
 
4.1%
9 37
 
3.5%
4 35
 
3.3%
3 35
 
3.3%
5 34
 
3.2%
Other values (9) 169
16.0%
Hangul
ValueCountFrequency (%)
83
 
6.3%
79
 
6.0%
78
 
5.9%
75
 
5.7%
75
 
5.7%
73
 
5.5%
72
 
5.4%
72
 
5.4%
72
 
5.4%
72
 
5.4%
Other values (58) 572
43.2%

구분
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size708.0 B
최우수
53 
우수
15 
일반관리
 
4

Length

Max length4
Median length3
Mean length2.8472222
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row최우수
2nd row최우수
3rd row최우수
4th row우수
5th row우수

Common Values

ValueCountFrequency (%)
최우수 53
73.6%
우수 15
 
20.8%
일반관리 4
 
5.6%

Length

2024-04-06T17:01:21.072677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:01:21.328283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
최우수 53
73.6%
우수 15
 
20.8%
일반관리 4
 
5.6%

등급
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size708.0 B
녹색
53 
황색
15 
백색
 
4

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row녹색
2nd row녹색
3rd row녹색
4th row황색
5th row황색

Common Values

ValueCountFrequency (%)
녹색 53
73.6%
황색 15
 
20.8%
백색 4
 
5.6%

Length

2024-04-06T17:01:21.592477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:01:21.794161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
녹색 53
73.6%
황색 15
 
20.8%
백색 4
 
5.6%

Correlations

2024-04-06T17:01:21.938255image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업소명업소소재지(도로명)구분등급
업소명1.0001.0001.0001.000
업소소재지(도로명)1.0001.0001.0001.000
구분1.0001.0001.0001.000
등급1.0001.0001.0001.000
2024-04-06T17:01:22.201238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
등급구분
등급1.0001.000
구분1.0001.000
2024-04-06T17:01:22.347599image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분등급
구분1.0001.000
등급1.0001.000

Missing values

2024-04-06T17:01:18.068988image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-06T17:01:18.233536image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업소명업소소재지(도로명)구분등급
0호텔리오인천광역시 연수구 인권로 16, 1~4층 (옥련동)최우수녹색
1호텔메이인천광역시 연수구 인권로9번길 10 (옥련동)최우수녹색
2이채모텔인천광역시 연수구 대암로 35 (옥련동)최우수녹색
3포시즌 모텔인천광역시 연수구 대암로 7 (옥련동)우수황색
4줌모텔인천광역시 연수구 능허대로 185 (옥련동, 1~4층)우수황색
5팔레스모텔인천광역시 연수구 대암로 19 (옥련동)최우수녹색
6가빈인천광역시 연수구 인권로 17 (옥련동)우수황색
7삼미장여관인천광역시 연수구 능허대로167번길 6 (옥련동)우수황색
8아이엔지 모텔인천광역시 연수구 능허대로104번길 51 (옥련동, 지하1~지상7층)최우수녹색
9호텔로제인천광역시 연수구 능허대로191번길 32 (옥련동)최우수녹색
업소명업소소재지(도로명)구분등급
62더 스테이 송도인천광역시 연수구 아트센터대로168번길 100, 한라 웨스턴파크 송도 5~10,12,14,15,17~19,21~24,27~29,31~33,37층 (송도동)최우수녹색
63더노벰버스테이인천광역시 연수구 아트센터대로168번길 101, 송도랜드마크푸르지오시티 A-B동 4-14, 16-36층 (송도동)최우수녹색
64랜드마크 송도스테이 투인천광역시 연수구 아트센터대로168번길 101, 송도랜드마크푸르지오시티 1,4~14,16~20,22~29,31~34,36층 (송도동)최우수녹색
65호텔로뎀인천광역시 연수구 능허대로179번길 18-10 (옥련동, 지하1층, 지상1,2,3,4층)최우수녹색
66브라운도트호텔인천송도인천광역시 연수구 능허대로 227-7 (옥련동)최우수녹색
67어반스테이 인천송도점인천광역시 연수구 아트센터대로168번길 101, 송도랜드마크푸르지오시티 1,4~14,16~35층 (송도동)최우수녹색
68다온호텔인천광역시 연수구 대암로 11-1, 1~5층 (옥련동)최우수녹색
69더휴식 아늑 인천송도점인천광역시 연수구 인권로9번길 9 (옥련동)우수황색
70자우리호텔 인천송도점인천광역시 연수구 능허대로179번길 56 (옥련동)일반관리백색
71에이치에비뉴(H AVENUE)인천광역시 연수구 능허대로 207 (옥련동)최우수녹색