Overview

Dataset statistics

Number of variables12
Number of observations100
Missing cells75
Missing cells (%)6.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.7 KiB
Average record size in memory99.3 B

Variable types

Categorical6
Text4
Numeric2

Alerts

eng_lang_area_nm has constant value ""Constant
kor_lang_area_nm has constant value ""Constant
jan_lang_area_nm has constant value ""Constant
chg_lang_area_nm has constant value ""Constant
BASE_YMD has constant value ""Constant
se_nm is highly imbalanced (52.5%)Imbalance
kor_lang_hotel_nm has 7 (7.0%) missing valuesMissing
tel_no has 68 (68.0%) missing valuesMissing
eng_lang_hotel_nm has unique valuesUnique

Reproduction

Analysis started2023-12-10 10:16:31.889281
Analysis finished2023-12-10 10:16:33.275383
Duration1.39 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

se_nm
Categorical

IMBALANCE 

Distinct4
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
호텔
82 
게스트하우스
에어비앤비
 
5
기타
 
4

Length

Max length6
Median length2
Mean length2.51
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row호텔
2nd row호텔
3rd row게스트하우스
4th row호텔
5th row에어비앤비

Common Values

ValueCountFrequency (%)
호텔 82
82.0%
게스트하우스 9
 
9.0%
에어비앤비 5
 
5.0%
기타 4
 
4.0%

Length

2023-12-10T19:16:33.361660image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:16:33.473064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
호텔 82
82.0%
게스트하우스 9
 
9.0%
에어비앤비 5
 
5.0%
기타 4
 
4.0%

eng_lang_hotel_nm
Text

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T19:16:33.830092image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length47
Median length29.5
Mean length22.51
Min length10

Characters and Unicode

Total characters2251
Distinct characters69
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique100 ?
Unique (%)100.0%

Sample

1st rowHanoi Gortage Hotel & TRAVEL
2nd rowVinpeal da nang ocean resort & villas
3rd row1812 Boutique Hostel
4th row1BR Apartment at The 5 stars Resort H Danang
5th row2 Ton Duc Thang Street, Ho Chi Minh City 700000
ValueCountFrequency (%)
hotel 57
 
13.8%
19
 
4.6%
an 16
 
3.9%
resort 14
 
3.4%
spa 12
 
2.9%
boutique 11
 
2.7%
beach 8
 
1.9%
hoi 8
 
1.9%
trang 8
 
1.9%
nha 8
 
1.9%
Other values (192) 251
60.9%
2023-12-10T19:16:34.455592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
313
13.9%
a 205
 
9.1%
e 168
 
7.5%
o 153
 
6.8%
t 143
 
6.4%
n 139
 
6.2%
l 106
 
4.7%
A 103
 
4.6%
H 93
 
4.1%
i 92
 
4.1%
Other values (59) 736
32.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1446
64.2%
Uppercase Letter 439
 
19.5%
Space Separator 313
 
13.9%
Other Punctuation 25
 
1.1%
Decimal Number 25
 
1.1%
Dash Punctuation 3
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 205
14.2%
e 168
11.6%
o 153
10.6%
t 143
9.9%
n 139
9.6%
l 106
7.3%
i 92
 
6.4%
r 85
 
5.9%
s 57
 
3.9%
u 49
 
3.4%
Other values (19) 249
17.2%
Uppercase Letter
ValueCountFrequency (%)
A 103
23.5%
H 93
21.2%
B 44
10.0%
R 27
 
6.2%
S 26
 
5.9%
T 23
 
5.2%
N 18
 
4.1%
L 13
 
3.0%
V 13
 
3.0%
C 11
 
2.5%
Other values (15) 68
15.5%
Decimal Number
ValueCountFrequency (%)
0 5
20.0%
7 4
16.0%
2 4
16.0%
1 3
12.0%
4 3
12.0%
3 2
 
8.0%
5 1
 
4.0%
9 1
 
4.0%
8 1
 
4.0%
6 1
 
4.0%
Other Punctuation
ValueCountFrequency (%)
& 19
76.0%
, 3
 
12.0%
' 3
 
12.0%
Space Separator
ValueCountFrequency (%)
313
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1885
83.7%
Common 366
 
16.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 205
 
10.9%
e 168
 
8.9%
o 153
 
8.1%
t 143
 
7.6%
n 139
 
7.4%
l 106
 
5.6%
A 103
 
5.5%
H 93
 
4.9%
i 92
 
4.9%
r 85
 
4.5%
Other values (44) 598
31.7%
Common
ValueCountFrequency (%)
313
85.5%
& 19
 
5.2%
0 5
 
1.4%
7 4
 
1.1%
2 4
 
1.1%
1 3
 
0.8%
4 3
 
0.8%
- 3
 
0.8%
, 3
 
0.8%
' 3
 
0.8%
Other values (5) 6
 
1.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2245
99.7%
Latin Ext Additional 3
 
0.1%
None 3
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
313
13.9%
a 205
 
9.1%
e 168
 
7.5%
o 153
 
6.8%
t 143
 
6.4%
n 139
 
6.2%
l 106
 
4.7%
A 103
 
4.6%
H 93
 
4.1%
i 92
 
4.1%
Other values (53) 730
32.5%
Latin Ext Additional
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
None
ValueCountFrequency (%)
ư 1
33.3%
Đ 1
33.3%
à 1
33.3%

kor_lang_hotel_nm
Text

MISSING 

Distinct93
Distinct (%)100.0%
Missing7
Missing (%)7.0%
Memory size932.0 B
2023-12-10T19:16:34.883776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length17
Mean length11.11828
Min length4

Characters and Unicode

Total characters1034
Distinct characters150
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique93 ?
Unique (%)100.0%

Sample

1st row하노이 고르티지 호텔
2nd row다낭 빈펄 오션 리조트
3rd row1812 부티크 호스텔
4th row22 레지던스 하노이
5th row4 시즌스 호스텔
ValueCountFrequency (%)
호텔 48
 
15.6%
리조트 12
 
3.9%
부티크 11
 
3.6%
9
 
2.9%
비치 8
 
2.6%
스파 8
 
2.6%
8
 
2.6%
하노이 7
 
2.3%
호이안 7
 
2.3%
나트랑 7
 
2.3%
Other values (144) 183
59.4%
2023-12-10T19:16:35.482461image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
215
20.8%
67
 
6.5%
60
 
5.8%
47
 
4.5%
40
 
3.9%
37
 
3.6%
36
 
3.5%
35
 
3.4%
24
 
2.3%
21
 
2.0%
Other values (140) 452
43.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 796
77.0%
Space Separator 215
 
20.8%
Decimal Number 9
 
0.9%
Uppercase Letter 7
 
0.7%
Other Punctuation 5
 
0.5%
Dash Punctuation 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
67
 
8.4%
60
 
7.5%
47
 
5.9%
40
 
5.0%
37
 
4.6%
36
 
4.5%
35
 
4.4%
24
 
3.0%
21
 
2.6%
19
 
2.4%
Other values (127) 410
51.5%
Decimal Number
ValueCountFrequency (%)
2 3
33.3%
7 2
22.2%
1 2
22.2%
8 1
 
11.1%
4 1
 
11.1%
Uppercase Letter
ValueCountFrequency (%)
S 2
28.6%
A 2
28.6%
Z 1
14.3%
M 1
14.3%
E 1
14.3%
Space Separator
ValueCountFrequency (%)
215
100.0%
Other Punctuation
ValueCountFrequency (%)
& 5
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 796
77.0%
Common 231
 
22.3%
Latin 7
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
67
 
8.4%
60
 
7.5%
47
 
5.9%
40
 
5.0%
37
 
4.6%
36
 
4.5%
35
 
4.4%
24
 
3.0%
21
 
2.6%
19
 
2.4%
Other values (127) 410
51.5%
Common
ValueCountFrequency (%)
215
93.1%
& 5
 
2.2%
2 3
 
1.3%
7 2
 
0.9%
- 2
 
0.9%
1 2
 
0.9%
8 1
 
0.4%
4 1
 
0.4%
Latin
ValueCountFrequency (%)
S 2
28.6%
A 2
28.6%
Z 1
14.3%
M 1
14.3%
E 1
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 796
77.0%
ASCII 238
 
23.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
215
90.3%
& 5
 
2.1%
2 3
 
1.3%
7 2
 
0.8%
- 2
 
0.8%
S 2
 
0.8%
A 2
 
0.8%
1 2
 
0.8%
Z 1
 
0.4%
8 1
 
0.4%
Other values (3) 3
 
1.3%
Hangul
ValueCountFrequency (%)
67
 
8.4%
60
 
7.5%
47
 
5.9%
40
 
5.0%
37
 
4.6%
36
 
4.5%
35
 
4.4%
24
 
3.0%
21
 
2.6%
19
 
2.4%
Other values (127) 410
51.5%

eng_lang_area_nm
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
Vietnam
100 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowVietnam
2nd rowVietnam
3rd rowVietnam
4th rowVietnam
5th rowVietnam

Common Values

ValueCountFrequency (%)
Vietnam 100
100.0%

Length

2023-12-10T19:16:35.671417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:16:35.789567image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
vietnam 100
100.0%

kor_lang_area_nm
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
베트남
100 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row베트남
2nd row베트남
3rd row베트남
4th row베트남
5th row베트남

Common Values

ValueCountFrequency (%)
베트남 100
100.0%

Length

2023-12-10T19:16:35.931686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:16:36.063087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
베트남 100
100.0%

jan_lang_area_nm
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
ベトナム
100 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowベトナム
2nd rowベトナム
3rd rowベトナム
4th rowベトナム
5th rowベトナム

Common Values

ValueCountFrequency (%)
ベトナム 100
100.0%

Length

2023-12-10T19:16:36.197504image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:16:36.312732image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
ベトナム 100
100.0%

chg_lang_area_nm
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
越南
100 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row越南
2nd row越南
3rd row越南
4th row越南
5th row越南

Common Values

ValueCountFrequency (%)
越南 100
100.0%

Length

2023-12-10T19:16:36.457320image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:16:36.589695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
越南 100
100.0%
Distinct99
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T19:16:36.863932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length144
Median length64
Mean length52.99
Min length9

Characters and Unicode

Total characters5299
Distinct characters118
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique98 ?
Unique (%)98.0%

Sample

1st row60 Ấu Triệu, Hàng Trống, Hoàn Kiếm, Hà Nội, 베트남
2nd rowTrường Sa, Hòa Hải, Ngũ Hành Sơn, Đà Nẵng, 베트남
3rd rowNguyễn Cao Luyện, Sơn Trà, Đà Nẵng, 베트남
4th rowTruong Sa
5th rowTòa nhà Mê Linh Ponit, Phòng 606, Lầu 6, 2, Đường Ngô Đức Kế, Phường Bến Nghé, Quận 1, Thành phố Hồ Chí Minh, Bến Nghé, Quận 1, Hồ Chí Minh, 베트남
ValueCountFrequency (%)
베트남 92
 
8.1%
an 38
 
3.3%
đà 23
 
2.0%
nẵng 22
 
1.9%
thành 21
 
1.8%
nam 20
 
1.8%
hội 19
 
1.7%
1 19
 
1.7%
sơn 19
 
1.7%
18
 
1.6%
Other values (301) 847
74.4%
2023-12-10T19:16:37.400417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1038
19.6%
n 430
 
8.1%
, 370
 
7.0%
h 315
 
5.9%
g 178
 
3.4%
T 164
 
3.1%
i 147
 
2.8%
H 137
 
2.6%
à 124
 
2.3%
a 123
 
2.3%
Other values (108) 2273
42.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 2371
44.7%
Space Separator 1038
19.6%
Uppercase Letter 897
 
16.9%
Other Punctuation 398
 
7.5%
Decimal Number 304
 
5.7%
Other Letter 282
 
5.3%
Dash Punctuation 5
 
0.1%
Open Punctuation 1
 
< 0.1%
Connector Punctuation 1
 
< 0.1%
Nonspacing Mark 1
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n 430
18.1%
h 315
13.3%
g 178
 
7.5%
i 147
 
6.2%
à 124
 
5.2%
a 123
 
5.2%
u 117
 
4.9%
m 62
 
2.6%
r 58
 
2.4%
47
 
2.0%
Other values (59) 770
32.5%
Uppercase Letter
ValueCountFrequency (%)
T 164
18.3%
H 137
15.3%
N 118
13.2%
P 61
 
6.8%
Đ 49
 
5.5%
L 48
 
5.4%
C 47
 
5.2%
B 46
 
5.1%
Q 44
 
4.9%
A 42
 
4.7%
Other values (12) 141
15.7%
Decimal Number
ValueCountFrequency (%)
0 78
25.7%
1 56
18.4%
2 34
11.2%
5 29
 
9.5%
6 27
 
8.9%
3 24
 
7.9%
8 18
 
5.9%
9 17
 
5.6%
4 11
 
3.6%
7 10
 
3.3%
Other Letter
ValueCountFrequency (%)
93
33.0%
92
32.6%
92
32.6%
1
 
0.4%
1
 
0.4%
1
 
0.4%
1
 
0.4%
1
 
0.4%
Other Punctuation
ValueCountFrequency (%)
, 370
93.0%
. 21
 
5.3%
/ 7
 
1.8%
Space Separator
ValueCountFrequency (%)
1038
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%
Nonspacing Mark
ValueCountFrequency (%)
̀ 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 3268
61.7%
Common 1748
33.0%
Hangul 282
 
5.3%
Inherited 1
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
n 430
 
13.2%
h 315
 
9.6%
g 178
 
5.4%
T 164
 
5.0%
i 147
 
4.5%
H 137
 
4.2%
à 124
 
3.8%
a 123
 
3.8%
N 118
 
3.6%
u 117
 
3.6%
Other values (81) 1415
43.3%
Common
ValueCountFrequency (%)
1038
59.4%
, 370
 
21.2%
0 78
 
4.5%
1 56
 
3.2%
2 34
 
1.9%
5 29
 
1.7%
6 27
 
1.5%
3 24
 
1.4%
. 21
 
1.2%
8 18
 
1.0%
Other values (8) 53
 
3.0%
Hangul
ValueCountFrequency (%)
93
33.0%
92
32.6%
92
32.6%
1
 
0.4%
1
 
0.4%
1
 
0.4%
1
 
0.4%
1
 
0.4%
Inherited
ValueCountFrequency (%)
̀ 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4252
80.2%
None 398
 
7.5%
Latin Ext Additional 366
 
6.9%
Hangul 282
 
5.3%
Diacriticals 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1038
24.4%
n 430
 
10.1%
, 370
 
8.7%
h 315
 
7.4%
g 178
 
4.2%
T 164
 
3.9%
i 147
 
3.5%
H 137
 
3.2%
a 123
 
2.9%
N 118
 
2.8%
Other values (46) 1232
29.0%
None
ValueCountFrequency (%)
à 124
31.2%
Đ 49
 
12.3%
ư 35
 
8.8%
á 25
 
6.3%
ơ 23
 
5.8%
â 21
 
5.3%
ò 17
 
4.3%
ê 15
 
3.8%
í 14
 
3.5%
ũ 13
 
3.3%
Other values (12) 62
15.6%
Hangul
ValueCountFrequency (%)
93
33.0%
92
32.6%
92
32.6%
1
 
0.4%
1
 
0.4%
1
 
0.4%
1
 
0.4%
1
 
0.4%
Latin Ext Additional
ValueCountFrequency (%)
47
12.8%
ế 40
 
10.9%
31
 
8.5%
25
 
6.8%
24
 
6.6%
22
 
6.0%
22
 
6.0%
21
 
5.7%
18
 
4.9%
14
 
3.8%
Other values (21) 102
27.9%
Diacriticals
ValueCountFrequency (%)
̀ 1
100.0%

lo
Real number (ℝ)

Distinct98
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean107.77369
Minimum103.83817
Maximum115.82647
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T19:16:37.618369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum103.83817
5-th percentile105.84201
Q1106.695
median108.23865
Q3108.32843
95-th percentile109.19669
Maximum115.82647
Range11.988298
Interquartile range (IQR)1.6334334

Descriptive statistics

Standard deviation1.4342186
Coefficient of variation (CV)0.013307688
Kurtosis9.0099472
Mean107.77369
Median Absolute Deviation (MAD)0.233658
Skewness1.2691867
Sum10777.369
Variance2.0569831
MonotonicityNot monotonic
2023-12-10T19:16:37.841161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
105.8483697 2
 
2.0%
108.3406489 2
 
2.0%
108.2444789 1
 
1.0%
109.1944885 1
 
1.0%
106.6928555 1
 
1.0%
106.6955829 1
 
1.0%
109.1957608 1
 
1.0%
105.8504515 1
 
1.0%
108.177594 1
 
1.0%
105.8595791 1
 
1.0%
Other values (88) 88
88.0%
ValueCountFrequency (%)
103.838167 1
1.0%
105.7799211 1
1.0%
105.7882652 1
1.0%
105.7920081 1
1.0%
105.8260476 1
1.0%
105.8428511 1
1.0%
105.8472338 1
1.0%
105.8483697 2
2.0%
105.8504515 1
1.0%
105.8508811 1
1.0%
ValueCountFrequency (%)
115.8264655 1
1.0%
109.2776855 1
1.0%
109.2028649 1
1.0%
109.198435 1
1.0%
109.1967132 1
1.0%
109.1966894 1
1.0%
109.1962584 1
1.0%
109.1957608 1
1.0%
109.1953621 1
1.0%
109.1949069 1
1.0%

la
Real number (ℝ)

Distinct98
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15.365599
Minimum10.33318
Maximum22.334111
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T19:16:38.046751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10.33318
5-th percentile10.771346
Q112.233587
median15.900851
Q316.073119
95-th percentile21.033146
Maximum22.334111
Range12.000931
Interquartile range (IQR)3.8395315

Descriptive statistics

Standard deviation3.5687246
Coefficient of variation (CV)0.23225417
Kurtosis-0.94065343
Mean15.365599
Median Absolute Deviation (MAD)3.6632417
Skewness0.30652048
Sum1536.5599
Variance12.735795
MonotonicityNot monotonic
2023-12-10T19:16:38.271953image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
21.0286357 2
 
2.0%
15.9136072 2
 
2.0%
16.0520532 1
 
1.0%
12.2385317 1
 
1.0%
10.7697515 1
 
1.0%
10.7716823 1
 
1.0%
12.2368224 1
 
1.0%
21.0314326 1
 
1.0%
10.9381822 1
 
1.0%
21.0224131 1
 
1.0%
Other values (88) 88
88.0%
ValueCountFrequency (%)
10.3331797 1
1.0%
10.723282 1
1.0%
10.7673315 1
1.0%
10.767885 1
1.0%
10.7697515 1
1.0%
10.7714294 1
1.0%
10.7716365 1
1.0%
10.7716823 1
1.0%
10.7716992 1
1.0%
10.7718346 1
1.0%
ValueCountFrequency (%)
22.3341111 1
1.0%
21.20219 1
1.0%
21.0666149 1
1.0%
21.0378804 1
1.0%
21.0336365 1
1.0%
21.0331202 1
1.0%
21.0324403 1
1.0%
21.0321988 1
1.0%
21.0320506 1
1.0%
21.0314326 1
1.0%

tel_no
Text

MISSING 

Distinct32
Distinct (%)100.0%
Missing68
Missing (%)68.0%
Memory size932.0 B
2023-12-10T19:16:38.558837image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length16
Mean length15.40625
Min length12

Characters and Unicode

Total characters493
Distinct characters16
Distinct categories7 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique32 ?
Unique (%)100.0%

Sample

1st row+84 439377666
2nd row(+84) 43 974 9999
3rd row+84 28 3837 3536
4th row+84 236 6269 888
5th row+84 235 3926 969
ValueCountFrequency (%)
84 28
24.3%
888 6
 
5.2%
236 4
 
3.5%
3741 3
 
2.6%
62 3
 
2.6%
235 3
 
2.6%
3926 2
 
1.7%
28 2
 
1.7%
252 2
 
1.7%
258 2
 
1.7%
Other values (58) 60
52.2%
2023-12-10T19:16:39.023160image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
83
16.8%
8 76
15.4%
3 54
11.0%
4 45
9.1%
2 38
7.7%
6 35
7.1%
5 34
6.9%
9 31
 
6.3%
+ 30
 
6.1%
7 25
 
5.1%
Other values (6) 42
8.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 370
75.1%
Space Separator 83
 
16.8%
Math Symbol 30
 
6.1%
Dash Punctuation 6
 
1.2%
Other Punctuation 2
 
0.4%
Open Punctuation 1
 
0.2%
Close Punctuation 1
 
0.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
8 76
20.5%
3 54
14.6%
4 45
12.2%
2 38
10.3%
6 35
9.5%
5 34
9.2%
9 31
8.4%
7 25
 
6.8%
1 17
 
4.6%
0 15
 
4.1%
Space Separator
ValueCountFrequency (%)
83
100.0%
Math Symbol
ValueCountFrequency (%)
+ 30
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 493
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
83
16.8%
8 76
15.4%
3 54
11.0%
4 45
9.1%
2 38
7.7%
6 35
7.1%
5 34
6.9%
9 31
 
6.3%
+ 30
 
6.1%
7 25
 
5.1%
Other values (6) 42
8.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 493
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
83
16.8%
8 76
15.4%
3 54
11.0%
4 45
9.1%
2 38
7.7%
6 35
7.1%
5 34
6.9%
9 31
 
6.3%
+ 30
 
6.1%
7 25
 
5.1%
Other values (6) 42
8.5%

BASE_YMD
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2020-12-09
100 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020-12-09
2nd row2020-12-09
3rd row2020-12-09
4th row2020-12-09
5th row2020-12-09

Common Values

ValueCountFrequency (%)
2020-12-09 100
100.0%

Length

2023-12-10T19:16:39.172298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:16:39.266703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020-12-09 100
100.0%

Interactions

2023-12-10T19:16:32.636041image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:16:32.377683image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:16:32.756798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:16:32.502332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T19:16:39.350622image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
se_nmeng_lang_hotel_nmkor_lang_hotel_nmrn_adreslolatel_no
se_nm1.0001.0001.0001.0000.0000.2911.000
eng_lang_hotel_nm1.0001.0001.0001.0001.0001.0001.000
kor_lang_hotel_nm1.0001.0001.0001.0001.0001.0001.000
rn_adres1.0001.0001.0001.0001.0001.0001.000
lo0.0001.0001.0001.0001.0000.9701.000
la0.2911.0001.0001.0000.9701.0001.000
tel_no1.0001.0001.0001.0001.0001.0001.000
2023-12-10T19:16:39.502997image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
lolase_nm
lo1.000-0.4180.000
la-0.4181.0000.188
se_nm0.0000.1881.000

Missing values

2023-12-10T19:16:32.914541image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T19:16:33.103375image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-10T19:16:33.222340image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

se_nmeng_lang_hotel_nmkor_lang_hotel_nmeng_lang_area_nmkor_lang_area_nmjan_lang_area_nmchg_lang_area_nmrn_adreslolatel_noBASE_YMD
0호텔Hanoi Gortage Hotel & TRAVEL하노이 고르티지 호텔Vietnam베트남ベトナム越南60 Ấu Triệu, Hàng Trống, Hoàn Kiếm, Hà Nội, 베트남105.8483721.028636+84 4393776662020-12-09
1호텔Vinpeal da nang ocean resort & villas다낭 빈펄 오션 리조트Vietnam베트남ベトナム越南Trường Sa, Hòa Hải, Ngũ Hành Sơn, Đà Nẵng, 베트남108.26937315.992632(+84) 43 974 99992020-12-09
2게스트하우스1812 Boutique Hostel1812 부티크 호스텔Vietnam베트남ベトナム越南Nguyễn Cao Luyện, Sơn Trà, Đà Nẵng, 베트남108.24218916.069429<NA>2020-12-09
3호텔1BR Apartment at The 5 stars Resort H Danang<NA>Vietnam베트남ベトナム越南Truong Sa115.82646510.723282<NA>2020-12-09
4에어비앤비2 Ton Duc Thang Street, Ho Chi Minh City 700000<NA>Vietnam베트남ベトナム越南Tòa nhà Mê Linh Ponit, Phòng 606, Lầu 6, 2, Đường Ngô Đức Kế, Phường Bến Nghé, Quận 1, Thành phố Hồ Chí Minh, Bến Nghé, Quận 1, Hồ Chí Minh, 베트남106.70574810.774574<NA>2020-12-09
5호텔22 Residence Hanoi22 레지던스 하노이Vietnam베트남ベトナム越南18 Nam Ngư, Cửa Nam, Hoàn Kiếm, Hà Nội, 베트남105.84285121.026548<NA>2020-12-09
6호텔3BR Green & Artistic Home<NA>Vietnam베트남ベトナム越南250 Duong Dinh Nghe105.79200821.024528<NA>2020-12-09
7게스트하우스4 SEASONS HOSTEL4 시즌스 호스텔Vietnam베트남ベトナム越南71b Châu Thị Vĩnh tế, Mỹ An, Ngũ Hành Sơn, Đà Nẵng, 베트남108.24051316.050799<NA>2020-12-09
8기타46 Tay Ho Street, QuangAn<NA>Vietnam베트남ベトナム越南2 Tây Hồ, Quảng An, Tây Hồ, Hà Nội, 베트남105.82604821.066615<NA>2020-12-09
9호텔7 studio -Sunrise City<NA>Vietnam베트남ベトナム越南Nguyen Huu Tho106.7046910.767331<NA>2020-12-09
se_nmeng_lang_hotel_nmkor_lang_hotel_nmeng_lang_area_nmkor_lang_area_nmjan_lang_area_nmchg_lang_area_nmrn_adreslolatel_noBASE_YMD
90호텔Balcona Hotel & Spa발코나 호텔 다낭Vietnam베트남ベトナム越南288 Võ Nguyên Giáp, Bắc Mỹ Phú, Ngũ Hành Sơn, Đà Nẵng 550000 베트남108.24808516.051576+84 868 187 3732020-12-09
91호텔Bali Boutique Hotel발리 부티크 벤 탄 호텔Vietnam베트남ベトナム越南28 Trương Định, Phường Bến Thành, Quận 1, Hồ Chí Minh, 베트남106.69620710.771973<NA>2020-12-09
92호텔Bamboo Green Central Hotel뱀부 그린 센트럴Vietnam베트남ベトナム越南158 Phan Châu Trinh, Phước Ninh, Đà Nẵng, 베트남108.21963416.063167<NA>2020-12-09
93호텔Bamboo Green Riverside Hotel뱀부 그린 리버사이드 호텔Vietnam베트남ベトナム越南68 Bạch Đằng, Hải Châu 1, Q. Hải Châu, Đà Nẵng 550000 베트남108.22428916.072668<NA>2020-12-09
94호텔Bamboo Village Beach Resort뱀부 빌리지 비치 리조트Vietnam베트남ベトナム越南38 Nguyễn Đình Chiểu, Phường Hàm Tiến, Thành phố Phan Thiết, Bình Thuận, 베트남108.19881310.945825+84 62 3847 0072020-12-09
95게스트하우스Banana Homestay바나나 홈스테이Vietnam베트남ベトナム越南Lương Như Bích, Cẩm Nam, Tp. Hội An, Quảng Nam, 베트남108.3362315.873383+84-121-936-79502020-12-09
96호텔Banyan Tree Lang Co반얀트리 랑코 리조트Vietnam베트남ベトナム越南Beach Villa, Lộc Vĩnh, Phú Lộc, Thừa Thiên Huế, 베트남107.95451416.336276+84 234 3695 8882020-12-09
97에어비앤비Bao Anh Boutique Hotel바오 안 부티크 호텔Vietnam베트남ベトナム越南Đỗ Bá, Mỹ An, Ngũ Hành Sơn, Đà Nẵng, 베트남108.24271116.050695<NA>2020-12-09
98호텔Bao Quynh Bungalow바오쿠인방갈로Vietnam베트남ベトナム越南45d Nguyễn Đình Chiểu, khu phố 1, Hàm Tiến, Tp. Phan Thiết, Bình Thuận, 베트남108.19535510.944445+84 252 3741 0072020-12-09
99게스트하우스Barney's Danang Backpackers House바니스 다낭 백패커스Vietnam베트남ベトナム越南129 Trần Hưng Đạo, Nại Hiên Đông, Sơn Trà, Đà Nẵng, 베트남108.22910416.080126+84 126 520 61032020-12-09