Overview

Dataset statistics

Number of variables: 14
Number of observations: 1373
Missing cells: 11042
Missing cells (%): 57.4%
Duplicate rows: 2
Duplicate rows (%): 0.1%
Total size in memory: 155.7 KiB
Average record size in memory: 116.1 B
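
The percentages and the average record size follow from the raw counts. A minimal sketch that re-derives them, assuming the missing-cell share is taken over all cells (variables × observations) and that 1 KiB = 1024 B; the variable names are illustrative only:

```python
# Counts copied from the dataset statistics above.
n_variables = 14
n_observations = 1373
missing_cells = 11042
duplicate_rows = 2
total_size_kib = 155.7

total_cells = n_variables * n_observations              # every cell in the table
missing_pct = 100 * missing_cells / total_cells         # share of empty cells
duplicate_pct = 100 * duplicate_rows / n_observations   # share of duplicated rows
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
print(f"Average record size in memory: {avg_record_bytes:.1f} B")
```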

Variable types

Text: 4
Numeric: 2
Boolean: 4
Categorical: 4

Dataset

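Description: Accommodation content from the Jeju tourism information system (VISITJEJU). It provides the content name (콘텐츠명), number of rooms (룸개수), whether pets are allowed (애완동물동반허용여부), whether breakfast is provided (조식제공여부), non-smoking status (금연여부), airport transportation (공항교통편), shuttle bus operation (셔틀버스운행여부), grade (등급), check-in time (체크인시간), check-out time (체크아웃시간), late check-in availability (LATE체크인여부), accessible rooms for guests with disabilities (장애인전용객실여부), other facilities (부대시설기타), other infant services (유아서비스기타), and related information.
Author: Jeju Tourism Organization (제주관광공사)
URL: https://www.data.go.kr/data/15041985/fileData.do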

Alerts

Dataset has 2 (0.1%) duplicate rows [Duplicates]
룸개수 is highly overall correlated with 등급 and 2 other fields [High correlation]
등급 is highly overall correlated with 룸개수 and 1 other fields [High correlation]
셔틀버스운행여부 is highly overall correlated with 체크인시간 [High correlation]
체크인시간 is highly overall correlated with 룸개수 and 2 other fields [High correlation]
체크아웃시간 is highly overall correlated with 룸개수 and 1 other fields [High correlation]
장애인전용객실여부 is highly overall correlated with 등급 [High correlation]
조식제공여부 is highly imbalanced (59.1%) [Imbalance]
금연여부 is highly imbalanced (62.7%) [Imbalance]
체크인시간 is highly imbalanced (61.2%) [Imbalance]
체크아웃시간 is highly imbalanced (61.2%) [Imbalance]
룸개수 has 1162 (84.6%) missing values [Missing]
애완동물동반허용여부 has 1134 (82.6%) missing values [Missing]
공항교통편 has 1300 (94.7%) missing values [Missing]
셔틀버스운행여부 has 1215 (88.5%) missing values [Missing]
등급 has 1161 (84.6%) missing values [Missing]
LATE체크인여부 has 1196 (87.1%) missing values [Missing]
장애인전용객실여부 has 1225 (89.2%) missing values [Missing]
부대시설기타 has 1304 (95.0%) missing values [Missing]
유아서비스기타 has 1345 (98.0%) missing values [Missing]
등급 has 146 (10.6%) zeros [Zeros]
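
The report does not say how the imbalance percentages are computed. A minimal sketch, assuming the score is one minus the Shannon entropy of the class distribution divided by its maximum, log2(k) for k classes; this formula is an assumption, although it gives values close to the reported 59.1% and 62.7% for the two variables used here. The function name `imbalance_score` is hypothetical:

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy in bits of the
    class distribution and k is the number of classes (assumed formula)."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)

# Value counts from the Common Values tables of the two variables;
# the <NA> level is counted as a class, as in the report.
breakfast_counts = [1172, 87, 80, 34]    # 조식제공여부
no_smoking_counts = [1174, 110, 88, 1]   # 금연여부

print(f"조식제공여부: {imbalance_score(breakfast_counts):.1%}")
print(f"금연여부: {imbalance_score(no_smoking_counts):.1%}")
```

A score of 0 means all classes are equally frequent; a score near 1 means a single class dominates, which here is the `<NA>` level.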

Reproduction

Analysis started: 2024-03-23 06:58:51.096948
Analysis finished: 2024-03-23 06:58:58.153540
Duration: 7.06 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
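
The reported duration can be checked against the two timestamps. A minimal sketch using only the standard library:

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
started = datetime.fromisoformat("2024-03-23 06:58:51.096948")
finished = datetime.fromisoformat("2024-03-23 06:58:58.153540")

duration_seconds = (finished - started).total_seconds()
print(f"Duration: {duration_seconds:.2f} seconds")
```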

Variables

콘텐츠명
Text

Distinct: 1362
Distinct (%): 99.2%
Missing: 0
Missing (%): 0.0%
Memory size: 10.9 KiB

Length

Max length: 33
Median length: 29
Mean length: 6.9650401
Min length: 1

Characters and Unicode

Total characters: 9563
Distinct characters: 635
Distinct categories: 9
Distinct scripts: 4
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
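
A minimal sketch of how such character properties can be read with Python's standard `unicodedata` module. It covers only the general category (for example `Lo` = Other Letter, `Zs` = Space Separator); script and block lookups are not available in the standard library. The helper name `category_counts` is hypothetical:

```python
import unicodedata
from collections import Counter

def category_counts(values):
    """Count Unicode general categories over every character in the values."""
    return Counter(unicodedata.category(ch) for text in values for ch in text)

# First two sample rows of this variable.
sample = ["나쿠펜다 제주", "제주엘루이호텔"]
counts = category_counts(sample)

for code, n in counts.most_common():
    print(code, n)

# Each code point also carries a name, e.g. for the first character:
print(unicodedata.name(sample[0][0]))
```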

Unique

Unique: 1351
Unique (%): 98.4%

Sample

1st row: 나쿠펜다 제주
2nd row: 제주엘루이호텔
3rd row: 아인스호텔
4th row: 씨스테이호텔
5th row: 썬라이즈호텔

Words

Value | Count | Frequency (%)
제주 | 50 | 2.8%
게스트하우스 | 39 | 2.2%
펜션 | 38 | 2.1%
호텔 | 36 | 2.0%
민박 | 14 | 0.8%
신신호텔 | 8 | 0.4%
서귀포 | 8 | 0.4%
하우스 | 6 | 0.3%
리조트 | 6 | 0.3%
호텔앤리조트 | 6 | 0.3%
Other values (1493) | 1568 | 88.1%
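
The word counts for this variable look like counts of whitespace-separated tokens. A minimal sketch, assuming plain whitespace splitting (the report's exact tokenisation is not stated), applied to the five sample rows:

```python
from collections import Counter

# The five sample rows of 콘텐츠명.
names = ["나쿠펜다 제주", "제주엘루이호텔", "아인스호텔", "씨스테이호텔", "썬라이즈호텔"]

# Split each name on whitespace and count the resulting tokens.
word_counts = Counter(word for name in names for word in name.split())

for word, n in word_counts.most_common():
    print(word, n)
```

Under this assumption a name written without spaces, such as 제주엘루이호텔, is a single token, so it adds nothing to the counts for 제주 or 호텔. That would explain why 호텔 appears only 36 times as a word even though many names contain it.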

Most occurring characters

ValueCountFrequency (%)
567
 
5.9%
416
 
4.4%
315
 
3.3%
310
 
3.2%
293
 
3.1%
282
 
2.9%
277
 
2.9%
274
 
2.9%
271
 
2.8%
250
 
2.6%
Other values (625) 6308
66.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 8730 | 91.3%
Space Separator | 416 | 4.4%
Uppercase Letter | 126 | 1.3%
Decimal Number | 110 | 1.2%
Lowercase Letter | 102 | 1.1%
Other Punctuation | 22 | 0.2%
Open Punctuation | 21 | 0.2%
Close Punctuation | 21 | 0.2%
Connector Punctuation | 15 | 0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
567
 
6.5%
315
 
3.6%
310
 
3.6%
293
 
3.4%
282
 
3.2%
277
 
3.2%
274
 
3.1%
271
 
3.1%
250
 
2.9%
243
 
2.8%
Other values (560) 5648
64.7%
Uppercase Letter
ValueCountFrequency (%)
O 11
 
8.7%
S 10
 
7.9%
B 10
 
7.9%
A 9
 
7.1%
E 9
 
7.1%
U 9
 
7.1%
J 7
 
5.6%
T 6
 
4.8%
C 6
 
4.8%
K 5
 
4.0%
Other values (14) 44
34.9%
Lowercase Letter
ValueCountFrequency (%)
n 14
13.7%
e 11
10.8%
o 10
9.8%
a 9
 
8.8%
i 9
 
8.8%
d 7
 
6.9%
s 5
 
4.9%
l 5
 
4.9%
b 5
 
4.9%
g 4
 
3.9%
Other values (11) 23
22.5%
Decimal Number
ValueCountFrequency (%)
2 26
23.6%
3 20
18.2%
1 14
12.7%
4 12
10.9%
0 9
 
8.2%
9 9
 
8.2%
5 8
 
7.3%
6 6
 
5.5%
7 4
 
3.6%
8 2
 
1.8%
Other Punctuation
ValueCountFrequency (%)
& 16
72.7%
! 2
 
9.1%
/ 1
 
4.5%
, 1
 
4.5%
? 1
 
4.5%
. 1
 
4.5%
Space Separator
ValueCountFrequency (%)
416
100.0%
Open Punctuation
ValueCountFrequency (%)
( 21
100.0%
Close Punctuation
ValueCountFrequency (%)
) 21
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 15
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 8729 | 91.3%
Common | 605 | 6.3%
Latin | 228 | 2.4%
Han | 1 | < 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
567
 
6.5%
315
 
3.6%
310
 
3.6%
293
 
3.4%
282
 
3.2%
277
 
3.2%
274
 
3.1%
271
 
3.1%
250
 
2.9%
243
 
2.8%
Other values (559) 5647
64.7%
Latin
ValueCountFrequency (%)
n 14
 
6.1%
e 11
 
4.8%
O 11
 
4.8%
S 10
 
4.4%
B 10
 
4.4%
o 10
 
4.4%
a 9
 
3.9%
A 9
 
3.9%
i 9
 
3.9%
E 9
 
3.9%
Other values (35) 126
55.3%
Common
ValueCountFrequency (%)
416
68.8%
2 26
 
4.3%
( 21
 
3.5%
) 21
 
3.5%
3 20
 
3.3%
& 16
 
2.6%
_ 15
 
2.5%
1 14
 
2.3%
4 12
 
2.0%
0 9
 
1.5%
Other values (10) 35
 
5.8%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 8729 | 91.3%
ASCII | 833 | 8.7%
CJK | 1 | < 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
567
 
6.5%
315
 
3.6%
310
 
3.6%
293
 
3.4%
282
 
3.2%
277
 
3.2%
274
 
3.1%
271
 
3.1%
250
 
2.9%
243
 
2.8%
Other values (559) 5647
64.7%
ASCII
ValueCountFrequency (%)
416
49.9%
2 26
 
3.1%
( 21
 
2.5%
) 21
 
2.5%
3 20
 
2.4%
& 16
 
1.9%
_ 15
 
1.8%
1 14
 
1.7%
n 14
 
1.7%
4 12
 
1.4%
Other values (55) 258
31.0%
CJK
ValueCountFrequency (%)
1
100.0%

룸개수
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct: 66
Distinct (%): 31.3%
Missing: 1162
Missing (%): 84.6%
Infinite: 0
Infinite (%): 0.0%
Mean: 203.85308
Minimum: 1
Maximum: 32423
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 12.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 2
Q1: 3
median: 6
Q3: 39
95-th percentile: 251.5
Maximum: 32423
Range: 32422
Interquartile range (IQR): 36

Descriptive statistics

Standard deviation: 2232.5776
Coefficient of variation (CV): 10.951896
Kurtosis: 209.48698
Mean: 203.85308
Median Absolute Deviation (MAD): 4
Skewness: 14.449678
Sum: 43013
Variance: 4984402.9
Monotonicity: Not monotonic
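
Several of these statistics are tied together by simple identities. A minimal sketch that re-derives the mean, CV, and IQR from the reported figures, and shows how much the single maximum of 32423 pulls the mean upward; whether that value is a data-entry error is not stated in the report:

```python
# Figures copied from the statistics of 룸개수.
n_total = 1373
n_missing = 1162
value_sum = 43013
std_dev = 2232.5776
q1, q3 = 3, 39
maximum = 32423

n_present = n_total - n_missing        # rows that actually have a room count
mean = value_sum / n_present           # Sum / non-missing count
cv = std_dev / mean                    # coefficient of variation
iqr = q3 - q1                          # interquartile range

# Mean after dropping the single largest value, to show its influence.
mean_without_max = (value_sum - maximum) / (n_present - 1)

print(f"n = {n_present}, mean = {mean:.5f}, CV = {cv:.6f}, IQR = {iqr}")
print(f"mean without the maximum = {mean_without_max:.2f}")
```

The median (6) and MAD (4) are barely affected by the extreme value, which is why they describe the typical property better than the mean does here.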
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
3 | 29 | 2.1%
2 | 23 | 1.7%
4 | 18 | 1.3%
5 | 17 | 1.2%
6 | 16 | 1.2%
7 | 15 | 1.1%
1 | 7 | 0.5%
8 | 5 | 0.4%
204 | 4 | 0.3%
12 | 4 | 0.3%
Other values (56) | 73 | 5.3%
(Missing) | 1162 | 84.6%

Minimum 10 values

Value | Count | Frequency (%)
1 | 7 | 0.5%
2 | 23 | 1.7%
3 | 29 | 2.1%
4 | 18 | 1.3%
5 | 17 | 1.2%
6 | 16 | 1.2%
7 | 15 | 1.1%
8 | 5 | 0.4%
9 | 4 | 0.3%
10 | 2 | 0.1%

Maximum 10 values

Value | Count | Frequency (%)
32423 | 1 | 0.1%
1600 | 1 | 0.1%
342 | 1 | 0.1%
324 | 1 | 0.1%
310 | 1 | 0.1%
307 | 1 | 0.1%
305 | 1 | 0.1%
302 | 2 | 0.1%
280 | 1 | 0.1%
255 | 1 | 0.1%

애완동물동반허용여부
Boolean

MISSING

Distinct: 2
Distinct (%): 0.8%
Missing: 1134
Missing (%): 82.6%
Memory size: 2.8 KiB

Value | Count | Frequency (%)
False | 167 | 12.2%
True | 72 | 5.2%
(Missing) | 1134 | 82.6%

조식제공여부
Categorical

IMBALANCE 

Distinct: 4
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 10.9 KiB

Length

Max length: 4
Median length: 4
Mean length: 3.5608157
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
<NA> | 1172 | 85.4%
3 | 87 | 6.3%
1 | 80 | 5.8%
2 | 34 | 2.5%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
na | 1172 | 85.4%
3 | 87 | 6.3%
1 | 80 | 5.8%
2 | 34 | 2.5%

금연여부
Categorical

IMBALANCE 

Distinct: 4
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 10.9 KiB

Length

Max length: 4
Median length: 4
Mean length: 3.5651857
Min length: 1

Unique

Unique: 1
Unique (%): 0.1%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
<NA> | 1174 | 85.5%
1 | 110 | 8.0%
2 | 88 | 6.4%
3 | 1 | 0.1%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
na | 1174 | 85.5%
1 | 110 | 8.0%
2 | 88 | 6.4%
3 | 1 | 0.1%

공항교통편
Text

MISSING 

Distinct: 66
Distinct (%): 90.4%
Missing: 1300
Missing (%): 94.7%
Memory size: 10.9 KiB

Length

Max length: 116
Median length: 37
Mean length: 18.876712
Min length: 2

Characters and Unicode

Total characters: 1378
Distinct characters: 170
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 5
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 59
Unique (%): 80.8%

Sample

1st row: 4번게이트 102번 탑승후 애월리정류소에서 202번 환승
2nd row: 제주공항 GATE 1에서 111번 또는 112번 탑승 → 성산환승정류장(고성리 회전교차로) 하차 → 201번 탑승 후 고성오일시장 정류장 하차
3rd row: 131번, 132번 직통버스
4th row: 버스, 렌트카
5th row: 공항버스 제주관광대정류장 이용

Words

Value | Count | Frequency (%)
버스 | 22 | 7.0%
하차 | 14 | 4.5%
 | 13 | 4.1%
101번 | 7 | 2.2%
600번 | 7 | 2.2%
탑승 | 6 | 1.9%
 | 6 | 1.9%
택시 | 6 | 1.9%
201번 | 5 | 1.6%
202번 | 5 | 1.6%
Other values (159) | 223 | 71.0%

Most occurring characters

ValueCountFrequency (%)
249
 
18.1%
1 79
 
5.7%
0 70
 
5.1%
51
 
3.7%
2 43
 
3.1%
41
 
3.0%
40
 
2.9%
, 31
 
2.2%
25
 
1.8%
21
 
1.5%
Other values (160) 728
52.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 751 | 54.5%
Decimal Number | 264 | 19.2%
Space Separator | 249 | 18.1%
Other Punctuation | 48 | 3.5%
Open Punctuation | 17 | 1.2%
Close Punctuation | 17 | 1.2%
Dash Punctuation | 15 | 1.1%
Math Symbol | 11 | 0.8%
Uppercase Letter | 4 | 0.3%
Lowercase Letter | 1 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
51
 
6.8%
41
 
5.5%
40
 
5.3%
25
 
3.3%
21
 
2.8%
21
 
2.8%
20
 
2.7%
19
 
2.5%
19
 
2.5%
19
 
2.5%
Other values (133) 475
63.2%
Decimal Number
ValueCountFrequency (%)
1 79
29.9%
0 70
26.5%
2 43
16.3%
3 17
 
6.4%
6 14
 
5.3%
5 13
 
4.9%
4 12
 
4.5%
8 8
 
3.0%
7 6
 
2.3%
9 2
 
0.8%
Other Punctuation
ValueCountFrequency (%)
, 31
64.6%
/ 7
 
14.6%
. 6
 
12.5%
* 2
 
4.2%
' 2
 
4.2%
Uppercase Letter
ValueCountFrequency (%)
G 1
25.0%
A 1
25.0%
T 1
25.0%
E 1
25.0%
Math Symbol
ValueCountFrequency (%)
> 7
63.6%
4
36.4%
Space Separator
ValueCountFrequency (%)
249
100.0%
Open Punctuation
ValueCountFrequency (%)
( 17
100.0%
Close Punctuation
ValueCountFrequency (%)
) 17
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 15
100.0%
Lowercase Letter
ValueCountFrequency (%)
m 1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 751 | 54.5%
Common | 622 | 45.1%
Latin | 5 | 0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
51
 
6.8%
41
 
5.5%
40
 
5.3%
25
 
3.3%
21
 
2.8%
21
 
2.8%
20
 
2.7%
19
 
2.5%
19
 
2.5%
19
 
2.5%
Other values (133) 475
63.2%
Common
ValueCountFrequency (%)
249
40.0%
1 79
 
12.7%
0 70
 
11.3%
2 43
 
6.9%
, 31
 
5.0%
( 17
 
2.7%
) 17
 
2.7%
3 17
 
2.7%
- 15
 
2.4%
6 14
 
2.3%
Other values (12) 70
 
11.3%
Latin
ValueCountFrequency (%)
m 1
20.0%
G 1
20.0%
A 1
20.0%
T 1
20.0%
E 1
20.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 748 | 54.3%
ASCII | 622 | 45.1%
Arrows | 4 | 0.3%
Compat Jamo | 3 | 0.2%
Geometric Shapes | 1 | 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
249
40.0%
1 79
 
12.7%
0 70
 
11.3%
2 43
 
6.9%
, 31
 
5.0%
( 17
 
2.7%
) 17
 
2.7%
3 17
 
2.7%
- 15
 
2.4%
6 14
 
2.3%
Other values (15) 70
 
11.3%
Hangul
ValueCountFrequency (%)
51
 
6.8%
41
 
5.5%
40
 
5.3%
25
 
3.3%
21
 
2.8%
21
 
2.8%
20
 
2.7%
19
 
2.5%
19
 
2.5%
19
 
2.5%
Other values (132) 472
63.1%
Arrows
ValueCountFrequency (%)
4
100.0%
Compat Jamo
ValueCountFrequency (%)
3
100.0%
Geometric Shapes
ValueCountFrequency (%)
1
100.0%

셔틀버스운행여부
Boolean

HIGH CORRELATION  MISSING 

Distinct: 2
Distinct (%): 1.3%
Missing: 1215
Missing (%): 88.5%
Memory size: 2.8 KiB

Value | Count | Frequency (%)
False | 121 | 8.8%
True | 37 | 2.7%
(Missing) | 1215 | 88.5%

등급
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct: 6
Distinct (%): 2.8%
Missing: 1161
Missing (%): 84.6%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.0424528
Minimum: 0
Maximum: 5
Zeros: 146
Zeros (%): 10.6%
Negative: 0
Negative (%): 0.0%
Memory size: 12.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 0
Q3: 2
95-th percentile: 5
Maximum: 5
Range: 5
Interquartile range (IQR): 2

Descriptive statistics

Standard deviation: 1.7011537
Coefficient of variation (CV): 1.6318759
Kurtosis: -0.024990668
Mean: 1.0424528
Median Absolute Deviation (MAD): 0
Skewness: 1.2626558
Sum: 221
Variance: 2.8939238
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=6)
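
Common values

Value | Count | Frequency (%)
0 | 146 | 10.6%
4 | 21 | 1.5%
3 | 14 | 1.0%
5 | 13 | 0.9%
2 | 12 | 0.9%
1 | 6 | 0.4%
(Missing) | 1161 | 84.6%

Minimum 10 values

Value | Count | Frequency (%)
0 | 146 | 10.6%
1 | 6 | 0.4%
2 | 12 | 0.9%
3 | 14 | 1.0%
4 | 21 | 1.5%
5 | 13 | 0.9%

Maximum 10 values

Value | Count | Frequency (%)
5 | 13 | 0.9%
4 | 21 | 1.5%
3 | 14 | 1.0%
2 | 12 | 0.9%
1 | 6 | 0.4%
0 | 146 | 10.6%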

체크인시간
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 17
Distinct (%): 1.2%
Missing: 0
Missing (%): 0.0%
Memory size: 10.9 KiB

Length

Max length: 5
Median length: 4
Mean length: 4.3415878
Min length: 4

Unique

Unique: 6
Unique (%): 0.4%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: 15:00
5th row: 15:00

Common Values

Value | Count | Frequency (%)
<NA> | 904 | 65.8%
15:00 | 265 | 19.3%
16:00 | 109 | 7.9%
14:00 | 40 | 2.9%
17:00 | 26 | 1.9%
00:00 | 10 | 0.7%
03:00 | 4 | 0.3%
15:30 | 3 | 0.2%
11:00 | 2 | 0.1%
13:00 | 2 | 0.1%
Other values (7) | 8 | 0.6%

Length

[Figure: Histogram of lengths of the category]

Value | Count | Frequency (%)
na | 904 | 65.8%
15:00 | 265 | 19.3%
16:00 | 109 | 7.9%
14:00 | 40 | 2.9%
17:00 | 26 | 1.9%
00:00 | 10 | 0.7%
03:00 | 4 | 0.3%
15:30 | 3 | 0.2%
04:00 | 2 | 0.1%
11:00 | 2 | 0.1%
Other values (7) | 8 | 0.6%
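
Values such as 00:00, 03:00, and 04:00 are unusual for a check-in time and may be placeholders rather than real times; the report itself does not say. A minimal sketch that parses the `HH:MM` strings and flags values earlier than a cut-off. The 11:00 cut-off and the names `parse_hhmm` and `EARLIEST_PLAUSIBLE` are assumptions made for illustration:

```python
from datetime import time

def parse_hhmm(text):
    """Parse an 'HH:MM' string into a datetime.time."""
    hours, minutes = text.split(":")
    return time(int(hours), int(minutes))

# Counts copied from the value tables of 체크인시간 (<NA> omitted).
checkin_counts = {
    "15:00": 265, "16:00": 109, "14:00": 40, "17:00": 26, "00:00": 10,
    "03:00": 4, "15:30": 3, "04:00": 2, "11:00": 2, "13:00": 2,
}

EARLIEST_PLAUSIBLE = time(11, 0)  # hypothetical threshold, not from the report

# Keep only the values that fall before the cut-off.
suspicious = {
    value: count
    for value, count in checkin_counts.items()
    if parse_hhmm(value) < EARLIEST_PLAUSIBLE
}
print(suspicious)
```

Rows flagged this way would need a manual check against the source before being treated as missing.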

체크아웃시간
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 12
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Memory size: 10.9 KiB

Length

Max length: 5
Median length: 4
Mean length: 4.3430444
Min length: 4

Unique

Unique: 4
Unique (%): 0.3%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: 11:00
5th row: 11:00

Common Values

Value | Count | Frequency (%)
<NA> | 902 | 65.7%
11:00 | 353 | 25.7%
10:00 | 44 | 3.2%
12:00 | 44 | 3.2%
00:00 | 10 | 0.7%
10:30 | 10 | 0.7%
13:00 | 3 | 0.2%
11:30 | 3 | 0.2%
02:30 | 1 | 0.1%
09:30 | 1 | 0.1%
Other values (2) | 2 | 0.1%

Length

[Figure: Histogram of lengths of the category]

Value | Count | Frequency (%)
na | 902 | 65.7%
11:00 | 353 | 25.7%
10:00 | 44 | 3.2%
12:00 | 44 | 3.2%
00:00 | 10 | 0.7%
10:30 | 10 | 0.7%
13:00 | 3 | 0.2%
11:30 | 3 | 0.2%
02:30 | 1 | 0.1%
09:30 | 1 | 0.1%
Other values (2) | 2 | 0.1%

LATE체크인여부
Boolean

MISSING 

Distinct: 2
Distinct (%): 1.1%
Missing: 1196
Missing (%): 87.1%
Memory size: 2.8 KiB

Value | Count | Frequency (%)
True | 143 | 10.4%
False | 34 | 2.5%
(Missing) | 1196 | 87.1%

장애인전용객실여부
Boolean

HIGH CORRELATION  MISSING 

Distinct: 2
Distinct (%): 1.4%
Missing: 1225
Missing (%): 89.2%
Memory size: 2.8 KiB

Value | Count | Frequency (%)
False | 111 | 8.1%
True | 37 | 2.7%
(Missing) | 1225 | 89.2%

부대시설기타
Text

MISSING 

Distinct: 63
Distinct (%): 91.3%
Missing: 1304
Missing (%): 95.0%
Memory size: 10.9 KiB

Length

Max length: 86
Median length: 34
Mean length: 13.869565
Min length: 1

Characters and Unicode

Total characters: 957
Distinct characters: 238
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 58
Unique (%): 84.1%

Sample

1st row: 급속 전기차 충전소
2nd row: 제주도모퉁이스낵연회장
3rd row: 노래방
4th row: 200여 평 유럽풍 정원
5th row: 베란다 / 화장실 / TV / 냉장고 / 옷걸이 / 거울 / 선풍기 / 에어컨 / 샴푸 / 바디샴푸 / 수건 / 생수 / 비상벨 / 비상전등 / 소화기 등

Words

Value | Count | Frequency (%)
 | 14 | 6.6%
노래방 | 7 | 3.3%
카페 | 5 | 2.3%
주차장 | 4 | 1.9%
편의점 | 4 | 1.9%
1층 | 3 | 1.4%
 | 3 | 1.4%
에어컨 | 3 | 1.4%
 | 3 | 1.4%
게임장 | 2 | 0.9%
Other values (149) | 165 | 77.5%

Most occurring characters

ValueCountFrequency (%)
147
 
15.4%
, 76
 
7.9%
32
 
3.3%
17
 
1.8%
17
 
1.8%
/ 17
 
1.8%
16
 
1.7%
12
 
1.3%
12
 
1.3%
12
 
1.3%
Other values (228) 599
62.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 686 | 71.7%
Space Separator | 147 | 15.4%
Other Punctuation | 95 | 9.9%
Decimal Number | 16 | 1.7%
Uppercase Letter | 5 | 0.5%
Open Punctuation | 4 | 0.4%
Close Punctuation | 4 | 0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
32
 
4.7%
17
 
2.5%
17
 
2.5%
16
 
2.3%
12
 
1.7%
12
 
1.7%
12
 
1.7%
11
 
1.6%
10
 
1.5%
10
 
1.5%
Other values (211) 537
78.3%
Decimal Number
ValueCountFrequency (%)
0 4
25.0%
1 4
25.0%
5 3
18.8%
2 2
12.5%
7 1
 
6.2%
3 1
 
6.2%
4 1
 
6.2%
Uppercase Letter
ValueCountFrequency (%)
B 2
40.0%
V 1
20.0%
T 1
20.0%
Q 1
20.0%
Other Punctuation
ValueCountFrequency (%)
, 76
80.0%
/ 17
 
17.9%
' 2
 
2.1%
Space Separator
ValueCountFrequency (%)
147
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 686 | 71.7%
Common | 266 | 27.8%
Latin | 5 | 0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
32
 
4.7%
17
 
2.5%
17
 
2.5%
16
 
2.3%
12
 
1.7%
12
 
1.7%
12
 
1.7%
11
 
1.6%
10
 
1.5%
10
 
1.5%
Other values (211) 537
78.3%
Common
ValueCountFrequency (%)
147
55.3%
, 76
28.6%
/ 17
 
6.4%
( 4
 
1.5%
0 4
 
1.5%
1 4
 
1.5%
) 4
 
1.5%
5 3
 
1.1%
' 2
 
0.8%
2 2
 
0.8%
Other values (3) 3
 
1.1%
Latin
ValueCountFrequency (%)
B 2
40.0%
V 1
20.0%
T 1
20.0%
Q 1
20.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 686 | 71.7%
ASCII | 271 | 28.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
147
54.2%
, 76
28.0%
/ 17
 
6.3%
( 4
 
1.5%
0 4
 
1.5%
1 4
 
1.5%
) 4
 
1.5%
5 3
 
1.1%
B 2
 
0.7%
' 2
 
0.7%
Other values (7) 8
 
3.0%
Hangul
ValueCountFrequency (%)
32
 
4.7%
17
 
2.5%
17
 
2.5%
16
 
2.3%
12
 
1.7%
12
 
1.7%
12
 
1.7%
11
 
1.6%
10
 
1.5%
10
 
1.5%
Other values (211) 537
78.3%

유아서비스기타
Text

MISSING 

Distinct: 27
Distinct (%): 96.4%
Missing: 1345
Missing (%): 98.0%
Memory size: 10.9 KiB

Length

Max length: 42
Median length: 19.5
Mean length: 15.25
Min length: 2

Characters and Unicode

Total characters: 427
Distinct characters: 126
Distinct categories: 8
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 26
Unique (%): 92.9%

Sample

1st row: 침대 안전 가드
2nd row: 23423
3rd row: 불가
4th row: 침대에 설치 가능한 베이비가드 보유 및 사전신청 시 설치가능
5th row: 요청 시 유아 용품 대여

Words

Value | Count | Frequency (%)
대여 | 6 | 5.6%
유아 | 5 | 4.7%
서비스 | 5 | 4.7%
어린이 | 4 | 3.7%
유모차 | 4 | 3.7%
키즈 | 4 | 3.7%
침대 | 2 | 1.9%
유료 | 2 | 1.9%
‘플레이타임’ | 2 | 1.9%
클럽 | 2 | 1.9%
Other values (63) | 71 | 66.4%

Most occurring characters

ValueCountFrequency (%)
79
 
18.5%
21
 
4.9%
, 17
 
4.0%
13
 
3.0%
12
 
2.8%
11
 
2.6%
10
 
2.3%
9
 
2.1%
8
 
1.9%
7
 
1.6%
Other values (116) 240
56.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 306 | 71.7%
Space Separator | 79 | 18.5%
Other Punctuation | 17 | 4.0%
Decimal Number | 13 | 3.0%
Open Punctuation | 4 | 0.9%
Close Punctuation | 4 | 0.9%
Final Punctuation | 2 | 0.5%
Initial Punctuation | 2 | 0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
21
 
6.9%
13
 
4.2%
12
 
3.9%
11
 
3.6%
10
 
3.3%
9
 
2.9%
8
 
2.6%
7
 
2.3%
6
 
2.0%
6
 
2.0%
Other values (106) 203
66.3%
Decimal Number
ValueCountFrequency (%)
2 5
38.5%
3 4
30.8%
4 3
23.1%
6 1
 
7.7%
Space Separator
ValueCountFrequency (%)
79
100.0%
Other Punctuation
ValueCountFrequency (%)
, 17
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Final Punctuation
ValueCountFrequency (%)
2
100.0%
Initial Punctuation
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 306 | 71.7%
Common | 121 | 28.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
21
 
6.9%
13
 
4.2%
12
 
3.9%
11
 
3.6%
10
 
3.3%
9
 
2.9%
8
 
2.6%
7
 
2.3%
6
 
2.0%
6
 
2.0%
Other values (106) 203
66.3%
Common
ValueCountFrequency (%)
79
65.3%
, 17
 
14.0%
2 5
 
4.1%
( 4
 
3.3%
) 4
 
3.3%
3 4
 
3.3%
4 3
 
2.5%
2
 
1.7%
2
 
1.7%
6 1
 
0.8%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 306 | 71.7%
ASCII | 117 | 27.4%
Punctuation | 4 | 0.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
79
67.5%
, 17
 
14.5%
2 5
 
4.3%
( 4
 
3.4%
) 4
 
3.4%
3 4
 
3.4%
4 3
 
2.6%
6 1
 
0.9%
Hangul
ValueCountFrequency (%)
21
 
6.9%
13
 
4.2%
12
 
3.9%
11
 
3.6%
10
 
3.3%
9
 
2.9%
8
 
2.6%
7
 
2.3%
6
 
2.0%
6
 
2.0%
Other values (106) 203
66.3%
Punctuation
ValueCountFrequency (%)
2
50.0%
2
50.0%

Interactions

[Figures: four interaction plots]

Correlations

Variable | 룸개수 | 애완동물동반허용여부 | 조식제공여부 | 금연여부 | 공항교통편 | 셔틀버스운행여부 | 등급 | 체크인시간 | 체크아웃시간 | LATE체크인여부 | 장애인전용객실여부 | 부대시설기타 | 유아서비스기타
룸개수 | 1.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.314 | 1.000 | 1.000 | 0.000 | 0.000 | 1.000 | 1.000
애완동물동반허용여부 | 0.000 | 1.000 | 0.026 | 0.087 | 0.000 | 0.532 | 0.000 | 0.000 | 0.130 | 0.000 | 0.000 | 0.694 | 1.000
조식제공여부 | 0.000 | 0.026 | 1.000 | 0.000 | 0.977 | 0.227 | 0.420 | 0.708 | 0.547 | 0.088 | 0.227 | 1.000 | 1.000
금연여부 | 0.000 | 0.087 | 0.000 | 1.000 | 0.961 | 0.126 | 0.178 | 0.000 | 0.000 | 0.000 | 0.062 | 1.000 | 1.000
공항교통편 | 1.000 | 0.000 | 0.977 | 0.961 | 1.000 | 1.000 | 0.705 | 0.000 | 0.000 | 0.625 | 0.894 | 1.000 | 1.000
셔틀버스운행여부 | 0.000 | 0.532 | 0.227 | 0.126 | 1.000 | 1.000 | 0.244 | 0.709 | 0.477 | 0.292 | 0.022 | 1.000 | 1.000
등급 | 0.314 | 0.000 | 0.420 | 0.178 | 0.705 | 0.244 | 1.000 | 0.316 | 0.353 | 0.458 | 0.705 | 1.000 | 1.000
체크인시간 | 1.000 | 0.000 | 0.708 | 0.000 | 0.000 | 0.709 | 0.316 | 1.000 | 0.874 | 0.000 | 0.512 | 0.990 | 1.000
체크아웃시간 | 1.000 | 0.130 | 0.547 | 0.000 | 0.000 | 0.477 | 0.353 | 0.874 | 1.000 | 0.000 | 0.295 | 0.000 | 1.000
LATE체크인여부 | 0.000 | 0.000 | 0.088 | 0.000 | 0.625 | 0.292 | 0.458 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 1.000
장애인전용객실여부 | 0.000 | 0.000 | 0.227 | 0.062 | 0.894 | 0.022 | 0.705 | 0.512 | 0.295 | 0.000 | 1.000 | 1.000 | 1.000
부대시설기타 | 1.000 | 0.694 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.990 | 0.000 | 0.000 | 1.000 | 1.000 | 1.000
유아서비스기타 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000

Variable | 셔틀버스운행여부 | 애완동물동반허용여부 | 장애인전용객실여부 | 조식제공여부 | 금연여부 | 체크아웃시간 | LATE체크인여부 | 체크인시간
셔틀버스운행여부 | 1.000 | 0.357 | 0.012 | 0.370 | 0.207 | 0.464 | 0.189 | 0.537
애완동물동반허용여부 | 0.357 | 1.000 | 0.000 | 0.042 | 0.144 | 0.127 | 0.000 | 0.000
장애인전용객실여부 | 0.012 | 0.000 | 1.000 | 0.370 | 0.103 | 0.285 | 0.000 | 0.380
조식제공여부 | 0.370 | 0.042 | 0.370 | 1.000 | 0.000 | 0.281 | 0.146 | 0.418
금연여부 | 0.207 | 0.144 | 0.103 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000
체크아웃시간 | 0.464 | 0.127 | 0.285 | 0.281 | 0.000 | 1.000 | 0.000 | 0.575
LATE체크인여부 | 0.189 | 0.000 | 0.000 | 0.146 | 0.000 | 0.000 | 1.000 | 0.000
체크인시간 | 0.537 | 0.000 | 0.380 | 0.418 | 0.000 | 0.575 | 0.000 | 1.000

Variable | 룸개수 | 등급 | 애완동물동반허용여부 | 조식제공여부 | 금연여부 | 셔틀버스운행여부 | 체크인시간 | 체크아웃시간 | LATE체크인여부 | 장애인전용객실여부
룸개수 | 1.000 | 0.539 | 0.000 | 0.000 | 0.000 | 0.000 | 0.962 | 0.980 | 0.000 | 0.000
등급 | 0.539 | 1.000 | 0.000 | 0.189 | 0.073 | 0.172 | 0.151 | 0.181 | 0.325 | 0.511
애완동물동반허용여부 | 0.000 | 0.000 | 1.000 | 0.042 | 0.144 | 0.357 | 0.000 | 0.127 | 0.000 | 0.000
조식제공여부 | 0.000 | 0.189 | 0.042 | 1.000 | 0.000 | 0.370 | 0.418 | 0.281 | 0.146 | 0.370
금연여부 | 0.000 | 0.073 | 0.144 | 0.000 | 1.000 | 0.207 | 0.000 | 0.000 | 0.000 | 0.103
셔틀버스운행여부 | 0.000 | 0.172 | 0.357 | 0.370 | 0.207 | 1.000 | 0.537 | 0.464 | 0.189 | 0.012
체크인시간 | 0.962 | 0.151 | 0.000 | 0.418 | 0.000 | 0.537 | 1.000 | 0.575 | 0.000 | 0.380
체크아웃시간 | 0.980 | 0.181 | 0.127 | 0.281 | 0.000 | 0.464 | 0.575 | 1.000 | 0.000 | 0.285
LATE체크인여부 | 0.000 | 0.325 | 0.000 | 0.146 | 0.000 | 0.189 | 0.000 | 0.000 | 1.000 | 0.000
장애인전용객실여부 | 0.000 | 0.511 | 0.000 | 0.370 | 0.103 | 0.012 | 0.380 | 0.285 | 0.000 | 1.000
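
The report does not label which association measure each matrix uses. For pairs of categorical or boolean variables a common choice is Cramér's V, and the sketch below shows the plain, uncorrected form computed from a contingency table. Treat it as an illustration of the idea only: the profiling tool may apply a bias correction or a different measure, and the example tables are made up rather than taken from this dataset.

```python
import math

def cramers_v(table):
    """Uncorrected Cramér's V for a contingency table given as a list of rows."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]

    # Pearson chi-squared statistic: sum of (observed - expected)^2 / expected.
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            if expected > 0:
                chi2 += (observed - expected) ** 2 / expected

    k = min(len(row_totals), len(col_totals)) - 1
    if n == 0 or k <= 0:
        return 0.0
    return math.sqrt(chi2 / (n * k))

# Hypothetical 2x2 tables (rows: variable A yes/no, columns: variable B yes/no).
print(cramers_v([[10, 0], [0, 10]]))  # perfectly associated
print(cramers_v([[8, 2], [2, 8]]))    # partly associated
print(cramers_v([[5, 5], [5, 5]]))    # independent
```

With most columns more than 80% missing, each coefficient rests on a small number of complete pairs, so values of exactly 0.000 or 1.000 in the matrices above deserve caution.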

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
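
A minimal sketch of nullity correlation, assuming it is the Pearson correlation between per-column presence indicators (1 = value present, 0 = missing). The indicators below are read from the first ten sample rows of this report; the helper name `pearson` is hypothetical.

```python
import math

def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences, or None if either
    sequence is constant (the correlation is undefined in that case)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        return None
    return sxy / math.sqrt(sxx * syy)

# Presence indicators for sample rows 0 to 9.
checkin_present = [0, 0, 0, 1, 1, 0, 1, 1, 1, 0]    # 체크인시간
checkout_present = [0, 0, 0, 1, 1, 0, 1, 1, 1, 0]   # 체크아웃시간
rooms_present = [0] * 10                            # 룸개수 (all missing in these rows)

print(pearson(checkin_present, checkout_present))
print(pearson(rooms_present, checkin_present))
```

In these ten rows the two time columns are always present or missing together, so their nullity correlation is 1. A column that is entirely missing (or entirely present) has no variance and therefore no defined nullity correlation.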

Sample

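First rows

Index | 콘텐츠명 | 룸개수 | 애완동물동반허용여부 | 조식제공여부 | 금연여부 | 공항교통편 | 셔틀버스운행여부 | 등급 | 체크인시간 | 체크아웃시간 | LATE체크인여부 | 장애인전용객실여부 | 부대시설기타 | 유아서비스기타
0 | 나쿠펜다 제주 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
1 | 제주엘루이호텔 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
2 | 아인스호텔 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
3 | 씨스테이호텔 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 15:00 | 11:00 | <NA> | <NA> | <NA> | <NA>
4 | 썬라이즈호텔 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 15:00 | 11:00 | <NA> | <NA> | <NA> | <NA>
5 | 하도리민박 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
6 | 갤러리 호텔 비앤비 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 14:00 | 11:00 | <NA> | <NA> | <NA> | <NA>
7 | 베스트웨스턴 제주 호텔 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 15:00 | 11:00 | <NA> | <NA> | <NA> | <NA>
8 | 라마다제주시티홀 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 14:00 | 11:00 | <NA> | <NA> | <NA> | <NA>
9 | 배배게스트하우스 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>

Last rows

Index | 콘텐츠명 | 룸개수 | 애완동물동반허용여부 | 조식제공여부 | 금연여부 | 공항교통편 | 셔틀버스운행여부 | 등급 | 체크인시간 | 체크아웃시간 | LATE체크인여부 | 장애인전용객실여부 | 부대시설기타 | 유아서비스기타
1363 | 휴일기록 | 2 | <NA> | <NA> | <NA> | <NA> | <NA> | 0 | 16:00 | 11:00 | <NA> | <NA> | <NA> | <NA>
1364 | 제주와일드 | <NA> | <NA> | 2 | 2 | <NA> | <NA> | 0 | 16:00 | 12:00 | <NA> | <NA> | <NA> | <NA>
1365 | 신신호텔 제주공항 | 204 | n | 1 | 1 | <NA> | n | 4 | 15:00 | 11:00 | y | <NA> | <NA> | <NA>
1366 | 신신호텔 제주월드컵 | 243 | n | 1 | 1 | <NA> | n | 4 | 15:00 | 11:00 | y | <NA> | <NA> | <NA>
1367 | 신신호텔 천지연 | 302 | n | 1 | 1 | <NA> | n | 4 | 15:00 | 11:00 | <NA> | <NA> | <NA> | <NA>
1368 | 신신호텔 천지연 | 302 | n | 1 | 1 | <NA> | n | 4 | 15:00 | 11:00 | <NA> | <NA> | <NA> | <NA>
1369 | 제주은빌레 휴양펜션 | 4 | <NA> | 1 | <NA> | <NA> | <NA> | 0 | 15:00 | 11:00 | <NA> | n | <NA> | <NA>
1370 | 신신호텔 제주월드컵 | 243 | n | 1 | 1 | <NA> | n | 4 | 15:00 | 11:00 | <NA> | <NA> | <NA> | <NA>
1371 | 애월하타 | 1 | <NA> | <NA> | 1 | <NA> | <NA> | 0 | 15:00 | 11:00 | <NA> | <NA> | <NA> | <NA>
1372 | 훈데르트힐즈 | 48 | <NA> | 1 | <NA> | <NA> | <NA> | 0 | 15:00 | 11:00 | <NA> | <NA> | <NA> | <NA>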

Duplicate rows

Most frequently occurring

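Index | 콘텐츠명 | 룸개수 | 애완동물동반허용여부 | 조식제공여부 | 금연여부 | 공항교통편 | 셔틀버스운행여부 | 등급 | 체크인시간 | 체크아웃시간 | LATE체크인여부 | 장애인전용객실여부 | 부대시설기타 | 유아서비스기타 | # duplicates
0 | 신신호텔 천지연 | 302 | n | 1 | 1 | <NA> | n | 4 | 15:00 | 11:00 | <NA> | <NA> | <NA> | <NA> | 2
1 | 웰컴모텔 | <NA> | y | 1 | 2 | <NA> | y | 0 | 03:00 | 12:00 | y | y | <NA> | <NA> | 2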
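
A minimal sketch of how fully duplicated rows can be found by counting identical row tuples. The four rows are sample rows 1366, 1367, 1368, and 1370 of this report, with `None` standing in for `<NA>`; treating a row as a duplicate only when every field matches is an assumption about how the report counts duplicates.

```python
from collections import Counter

# Field order follows the 14 dataset columns, starting with 콘텐츠명.
rows = [
    ("신신호텔 제주월드컵", 243, "n", "1", "1", None, "n", 4, "15:00", "11:00", "y", None, None, None),   # row 1366
    ("신신호텔 천지연", 302, "n", "1", "1", None, "n", 4, "15:00", "11:00", None, None, None, None),      # row 1367
    ("신신호텔 천지연", 302, "n", "1", "1", None, "n", 4, "15:00", "11:00", None, None, None, None),      # row 1368
    ("신신호텔 제주월드컵", 243, "n", "1", "1", None, "n", 4, "15:00", "11:00", None, None, None, None),  # row 1370
]

# Count identical tuples and keep those that occur more than once.
row_counts = Counter(rows)
duplicates = {row: n for row, n in row_counts.items() if n > 1}

for row, n in duplicates.items():
    print(row[0], n)
```

Rows 1366 and 1370 share a name and most fields but differ in LATE체크인여부 (`y` versus missing), so they are not exact duplicates; only rows 1367 and 1368 match in every field.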