Overview

Dataset statistics

Number of variables4
Number of observations740
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory24.0 KiB
Average record size in memory33.2 B

Variable types

Text3
Numeric1

Dataset

Description인천광역시 옹진군 관내에 소재한 호텔, 모텔, 펜션, 민박 등 숙박업소에 대한 데이터로 상호명, 대표자명, 업체전화번호, 소재지 주소, 객실수 데이터가 있음
Author인천광역시 옹진군
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15127508&srcSe=7661IVAWM27C61E190

Reproduction

Analysis started2024-04-14 03:10:03.594997
Analysis finished2024-04-14 03:10:04.421998
Duration0.83 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct704
Distinct (%)95.1%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
2024-04-14T12:10:04.562923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length18
Mean length5.7527027
Min length1

Characters and Unicode

Total characters4257
Distinct characters453
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique675 ?
Unique (%)91.2%

Sample

1st row엉클조
2nd row추장민박
3rd row시도민션민박
4th row영화속풍경
5th row노루메기
ValueCountFrequency (%)
민박 42
 
4.6%
펜션 31
 
3.4%
아일랜드 6
 
0.7%
바다향기 4
 
0.4%
호텔 4
 
0.4%
독채 3
 
0.3%
펜션민박 3
 
0.3%
해변민박 3
 
0.3%
3
 
0.3%
하우스 3
 
0.3%
Other values (741) 805
88.8%
2024-04-14T12:10:04.865006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
377
 
8.9%
376
 
8.8%
180
 
4.2%
175
 
4.1%
168
 
3.9%
92
 
2.2%
80
 
1.9%
70
 
1.6%
57
 
1.3%
54
 
1.3%
Other values (443) 2628
61.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3865
90.8%
Space Separator 168
 
3.9%
Uppercase Letter 73
 
1.7%
Decimal Number 59
 
1.4%
Lowercase Letter 50
 
1.2%
Open Punctuation 16
 
0.4%
Close Punctuation 16
 
0.4%
Other Punctuation 9
 
0.2%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
377
 
9.8%
376
 
9.7%
180
 
4.7%
175
 
4.5%
92
 
2.4%
80
 
2.1%
70
 
1.8%
57
 
1.5%
54
 
1.4%
52
 
1.3%
Other values (388) 2352
60.9%
Lowercase Letter
ValueCountFrequency (%)
e 9
18.0%
t 8
16.0%
s 4
 
8.0%
n 3
 
6.0%
o 3
 
6.0%
l 3
 
6.0%
i 3
 
6.0%
y 3
 
6.0%
a 2
 
4.0%
h 2
 
4.0%
Other values (9) 10
20.0%
Uppercase Letter
ValueCountFrequency (%)
A 14
19.2%
B 10
13.7%
N 8
11.0%
O 7
9.6%
I 5
 
6.8%
E 4
 
5.5%
M 4
 
5.5%
P 3
 
4.1%
L 3
 
4.1%
G 3
 
4.1%
Other values (8) 12
16.4%
Decimal Number
ValueCountFrequency (%)
2 10
16.9%
9 10
16.9%
1 9
15.3%
5 7
11.9%
8 6
10.2%
4 4
 
6.8%
3 4
 
6.8%
6 3
 
5.1%
7 3
 
5.1%
0 3
 
5.1%
Other Punctuation
ValueCountFrequency (%)
& 5
55.6%
, 2
 
22.2%
. 1
 
11.1%
1
 
11.1%
Space Separator
ValueCountFrequency (%)
168
100.0%
Open Punctuation
ValueCountFrequency (%)
( 16
100.0%
Close Punctuation
ValueCountFrequency (%)
) 16
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3864
90.8%
Common 269
 
6.3%
Latin 123
 
2.9%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
377
 
9.8%
376
 
9.7%
180
 
4.7%
175
 
4.5%
92
 
2.4%
80
 
2.1%
70
 
1.8%
57
 
1.5%
54
 
1.4%
52
 
1.3%
Other values (387) 2351
60.8%
Latin
ValueCountFrequency (%)
A 14
 
11.4%
B 10
 
8.1%
e 9
 
7.3%
t 8
 
6.5%
N 8
 
6.5%
O 7
 
5.7%
I 5
 
4.1%
E 4
 
3.3%
M 4
 
3.3%
s 4
 
3.3%
Other values (27) 50
40.7%
Common
ValueCountFrequency (%)
168
62.5%
( 16
 
5.9%
) 16
 
5.9%
2 10
 
3.7%
9 10
 
3.7%
1 9
 
3.3%
5 7
 
2.6%
8 6
 
2.2%
& 5
 
1.9%
4 4
 
1.5%
Other values (8) 18
 
6.7%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3864
90.8%
ASCII 391
 
9.2%
None 1
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
377
 
9.8%
376
 
9.7%
180
 
4.7%
175
 
4.5%
92
 
2.4%
80
 
2.1%
70
 
1.8%
57
 
1.5%
54
 
1.4%
52
 
1.3%
Other values (387) 2351
60.8%
ASCII
ValueCountFrequency (%)
168
43.0%
( 16
 
4.1%
) 16
 
4.1%
A 14
 
3.6%
2 10
 
2.6%
9 10
 
2.6%
B 10
 
2.6%
e 9
 
2.3%
1 9
 
2.3%
t 8
 
2.0%
Other values (44) 121
30.9%
None
ValueCountFrequency (%)
1
100.0%
CJK
ValueCountFrequency (%)
1
100.0%
Distinct686
Distinct (%)92.7%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
2024-04-14T12:10:05.127102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length3
Mean length3.0648649
Min length2

Characters and Unicode

Total characters2268
Distinct characters196
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique643 ?
Unique (%)86.9%

Sample

1st row조일권
2nd row김건추
3rd row백나영
4th row박상근
5th row장금란
ValueCountFrequency (%)
장혜선 8
 
1.1%
1명 6
 
0.8%
6
 
0.8%
김현중 3
 
0.4%
김순자 3
 
0.4%
오국진 3
 
0.4%
이명숙 3
 
0.4%
윤경숙 3
 
0.4%
이중화 2
 
0.3%
장태숙 2
 
0.3%
Other values (680) 715
94.8%
2024-04-14T12:10:05.495406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
173
 
7.6%
111
 
4.9%
90
 
4.0%
62
 
2.7%
52
 
2.3%
51
 
2.2%
46
 
2.0%
46
 
2.0%
44
 
1.9%
43
 
1.9%
Other values (186) 1550
68.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2239
98.7%
Space Separator 15
 
0.7%
Decimal Number 7
 
0.3%
Close Punctuation 3
 
0.1%
Open Punctuation 3
 
0.1%
Other Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
173
 
7.7%
111
 
5.0%
90
 
4.0%
62
 
2.8%
52
 
2.3%
51
 
2.3%
46
 
2.1%
46
 
2.1%
44
 
2.0%
43
 
1.9%
Other values (181) 1521
67.9%
Space Separator
ValueCountFrequency (%)
15
100.0%
Decimal Number
ValueCountFrequency (%)
1 7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2240
98.8%
Common 28
 
1.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
173
 
7.7%
111
 
5.0%
90
 
4.0%
62
 
2.8%
52
 
2.3%
51
 
2.3%
46
 
2.1%
46
 
2.1%
44
 
2.0%
43
 
1.9%
Other values (182) 1522
67.9%
Common
ValueCountFrequency (%)
15
53.6%
1 7
25.0%
) 3
 
10.7%
( 3
 
10.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2239
98.7%
ASCII 28
 
1.2%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
173
 
7.7%
111
 
5.0%
90
 
4.0%
62
 
2.8%
52
 
2.3%
51
 
2.3%
46
 
2.1%
46
 
2.1%
44
 
2.0%
43
 
1.9%
Other values (181) 1521
67.9%
ASCII
ValueCountFrequency (%)
15
53.6%
1 7
25.0%
) 3
 
10.7%
( 3
 
10.7%
None
ValueCountFrequency (%)
1
100.0%

주소
Text

Distinct737
Distinct (%)99.6%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
2024-04-14T12:10:05.739192image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length42
Median length37
Mean length26.145946
Min length19

Characters and Unicode

Total characters19348
Distinct characters145
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique734 ?
Unique (%)99.2%

Sample

1st row인천광역시 옹진군 북도면 모도로78번길 16(가동, 나동)
2nd row인천광역시 옹진군 북도면 시도로104번길 37-14
3rd row인천광역시 옹진군 북도면 시도로104번길 151, 나동
4th row인천광역시 옹진군 북도면 시도로86번길 167, 2동
5th row인천광역시 옹진군 북도면 시도로 176
ValueCountFrequency (%)
인천광역시 740
18.9%
옹진군 740
18.9%
영흥면 333
 
8.5%
자월면 127
 
3.2%
북도면 95
 
2.4%
백령면 66
 
1.7%
덕적면 61
 
1.6%
선재로 45
 
1.1%
1동 40
 
1.0%
대청면 34
 
0.9%
Other values (851) 1633
41.7%
2024-04-14T12:10:06.111842image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3187
 
16.5%
756
 
3.9%
746
 
3.9%
740
 
3.8%
740
 
3.8%
740
 
3.8%
740
 
3.8%
740
 
3.8%
740
 
3.8%
740
 
3.8%
Other values (135) 9479
49.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 11850
61.2%
Decimal Number 3678
 
19.0%
Space Separator 3187
 
16.5%
Dash Punctuation 360
 
1.9%
Other Punctuation 189
 
1.0%
Close Punctuation 27
 
0.1%
Open Punctuation 27
 
0.1%
Math Symbol 22
 
0.1%
Uppercase Letter 7
 
< 0.1%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
756
 
6.4%
746
 
6.3%
740
 
6.2%
740
 
6.2%
740
 
6.2%
740
 
6.2%
740
 
6.2%
740
 
6.2%
740
 
6.2%
720
 
6.1%
Other values (114) 4448
37.5%
Decimal Number
ValueCountFrequency (%)
1 716
19.5%
2 550
15.0%
3 398
10.8%
7 345
9.4%
4 344
9.4%
6 332
9.0%
5 311
8.5%
0 230
 
6.3%
9 227
 
6.2%
8 225
 
6.1%
Uppercase Letter
ValueCountFrequency (%)
A 4
57.1%
B 2
28.6%
C 1
 
14.3%
Other Punctuation
ValueCountFrequency (%)
, 188
99.5%
. 1
 
0.5%
Space Separator
ValueCountFrequency (%)
3187
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 360
100.0%
Close Punctuation
ValueCountFrequency (%)
) 27
100.0%
Open Punctuation
ValueCountFrequency (%)
( 27
100.0%
Math Symbol
ValueCountFrequency (%)
~ 22
100.0%
Lowercase Letter
ValueCountFrequency (%)
a 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 11850
61.2%
Common 7490
38.7%
Latin 8
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
756
 
6.4%
746
 
6.3%
740
 
6.2%
740
 
6.2%
740
 
6.2%
740
 
6.2%
740
 
6.2%
740
 
6.2%
740
 
6.2%
720
 
6.1%
Other values (114) 4448
37.5%
Common
ValueCountFrequency (%)
3187
42.6%
1 716
 
9.6%
2 550
 
7.3%
3 398
 
5.3%
- 360
 
4.8%
7 345
 
4.6%
4 344
 
4.6%
6 332
 
4.4%
5 311
 
4.2%
0 230
 
3.1%
Other values (7) 717
 
9.6%
Latin
ValueCountFrequency (%)
A 4
50.0%
B 2
25.0%
C 1
 
12.5%
a 1
 
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 11850
61.2%
ASCII 7498
38.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3187
42.5%
1 716
 
9.5%
2 550
 
7.3%
3 398
 
5.3%
- 360
 
4.8%
7 345
 
4.6%
4 344
 
4.6%
6 332
 
4.4%
5 311
 
4.1%
0 230
 
3.1%
Other values (11) 725
 
9.7%
Hangul
ValueCountFrequency (%)
756
 
6.4%
746
 
6.3%
740
 
6.2%
740
 
6.2%
740
 
6.2%
740
 
6.2%
740
 
6.2%
740
 
6.2%
740
 
6.2%
720
 
6.1%
Other values (114) 4448
37.5%

객실수
Real number (ℝ)

Distinct33
Distinct (%)4.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.2067568
Minimum1
Maximum50
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size6.6 KiB
2024-04-14T12:10:06.248307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median5
Q37
95-th percentile18
Maximum50
Range49
Interquartile range (IQR)4

Descriptive statistics

Standard deviation5.7509623
Coefficient of variation (CV)0.92656479
Kurtosis12.268667
Mean6.2067568
Median Absolute Deviation (MAD)2
Skewness3.0121968
Sum4593
Variance33.073567
MonotonicityNot monotonic
2024-04-14T12:10:06.380005image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=33)
ValueCountFrequency (%)
4 134
18.1%
3 104
14.1%
5 88
11.9%
6 84
11.4%
2 76
10.3%
7 61
8.2%
1 49
 
6.6%
8 26
 
3.5%
11 16
 
2.2%
10 14
 
1.9%
Other values (23) 88
11.9%
ValueCountFrequency (%)
1 49
 
6.6%
2 76
10.3%
3 104
14.1%
4 134
18.1%
5 88
11.9%
6 84
11.4%
7 61
8.2%
8 26
 
3.5%
9 8
 
1.1%
10 14
 
1.9%
ValueCountFrequency (%)
50 1
 
0.1%
42 1
 
0.1%
39 2
0.3%
33 2
0.3%
32 1
 
0.1%
29 2
0.3%
28 2
0.3%
27 1
 
0.1%
25 4
0.5%
24 3
0.4%

Interactions

2024-04-14T12:10:04.224087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2024-04-14T12:10:04.325977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-14T12:10:04.390512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

상호명대표자명주소객실수
0엉클조조일권인천광역시 옹진군 북도면 모도로78번길 16(가동, 나동)4
1추장민박김건추인천광역시 옹진군 북도면 시도로104번길 37-143
2시도민션민박백나영인천광역시 옹진군 북도면 시도로104번길 151, 나동4
3영화속풍경박상근인천광역시 옹진군 북도면 시도로86번길 167, 2동6
4노루메기장금란인천광역시 옹진군 북도면 시도로 1764
5섬사랑굴사랑민박강석화인천광역시 옹진군 북도면 모도로50번길 10, 1동6
6모도민박최광선인천광역시 옹진군 북도면 모도로50번길 56-6, 2동3
7배미꾸미최경혜인천광역시 옹진군 북도면 모도로140번길 41, 2동7
8토속점민박이재철인천광역시 옹진군 북도면 장봉로 176(가동,나동,다동)7
9신촌민박김대식인천광역시 옹진군 북도면 장봉로238번길 21, 1동7
상호명대표자명주소객실수
730하와이비치임훈혁인천광역시 옹진군 영흥면 내리 1327-1617
731퀸스비치조성원인천광역시 옹진군 영흥면 선재리 54321
732오후엔이강혁인천광역시 옹진군 북도면 장봉리 12-418
733영흥도관광펜션김교욱인천광역시 옹진군 영흥면 내리 8-25616
734네이쳐펜션방현복인천광역시 옹진군 영흥면 내리 738-512
735블루힐펜션김현중인천광역시 옹진군 북도면 장봉리 1302-54
736바다로가는 길목이성자인천광역시 옹진군 자월면 승봉리 60411
737블랙트리하우스이중화인천광역시 옹진군 영흥면 내리 1527-624
738메종 드 아일랜드(주)빌드인천광역시 옹진군 영흥면 내리 16244
739전통한옥펜션보경김동인인천광역시 옹진군 영흥면 내리 169-35