Overview

Dataset statistics

Number of variables: 13
Number of observations: 100
Missing cells: 245
Missing cells (%): 18.8%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 10.6 KiB
Average record size in memory: 108.3 B
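
The missing-cell figures can be cross-checked against the per-variable counts listed under Alerts. A minimal sketch, assuming the four variables with a Missing alert are the only ones that contain missing values (their counts do add up to the reported total):

```python
# Per-variable missing counts as reported in the Alerts section.
missing_by_variable = {
    "ENG_LANG_HOTEL_NM": 15,
    "KOR_LANG_HOTEL_NM": 50,
    "TEL_NO": 80,
    "REGIST_DE": 100,
}
n_variables = 13
n_observations = 100

missing_cells = sum(missing_by_variable.values())
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(missing_cells)          # 245
print(f"{missing_pct:.1f}%")  # 18.8%
```

The percentage is taken over all cells (variables times observations), not over rows.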

Variable types

Categorical: 6
Text: 4
Numeric: 2
Unsupported: 1

Alerts

ENG_LANG_AREA_NM has constant value "" (Constant)
KOR_LANG_AREA_NM has constant value "" (Constant)
JAN_LANG_AREA_NM has constant value "" (Constant)
CHG_LANG_AREA_NM has constant value "" (Constant)
BASE_YMD has constant value "" (Constant)
LO is highly overall correlated with LA (High correlation)
LA is highly overall correlated with LO (High correlation)
SE_NM is highly imbalanced (82.2%) (Imbalance)
ENG_LANG_HOTEL_NM has 15 (15.0%) missing values (Missing)
KOR_LANG_HOTEL_NM has 50 (50.0%) missing values (Missing)
TEL_NO has 80 (80.0%) missing values (Missing)
REGIST_DE has 100 (100.0%) missing values (Missing)
REGIST_DE is an unsupported type, check if it needs cleaning or further analysis (Unsupported)
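
The 82.2% imbalance figure for SE_NM is consistent with an entropy-based score: one minus the Shannon entropy of the class shares, divided by the maximum entropy for that number of classes. The sketch below assumes that definition (the report itself does not state it) and uses the SE_NM counts 96, 2 and 2 from the Variables section. The function name is illustrative.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / H_max, where H is the Shannon entropy of the class shares."""
    total = sum(counts)
    shares = [c / total for c in counts if c > 0]
    if len(shares) < 2:
        return 0.0  # assumption: a single class is reported as score 0
    entropy = -sum(p * math.log2(p) for p in shares)
    return 1 - entropy / math.log2(len(shares))

# SE_NM class counts as reported: 96, 2, 2
score = imbalance_score([96, 2, 2])
print(f"{score:.1%}")  # 82.2%
```

A perfectly balanced variable scores 0 under this definition, and the score approaches 1 as one class dominates.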

Reproduction

Analysis started: 2023-12-10 10:10:03.103121
Analysis finished: 2023-12-10 10:10:06.612475
Duration: 3.51 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
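
A report of this kind is typically produced with a few lines of ydata-profiling. The sketch below is an assumption about how that could look, not the configuration actually used for this run (that lives in config.json). The function name, file paths and title are hypothetical placeholders.

```python
def build_report(csv_path, html_path, title="Dataset profile"):
    """Profile a CSV file and write the HTML report to html_path."""
    # Imported lazily so the function can be defined without the libraries installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    profile = ProfileReport(df, title=title)
    profile.to_file(html_path)
    return profile

# Hypothetical usage:
# build_report("hotels.csv", "hotels_profile.html")
```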

Variables

SE_NM
Categorical

IMBALANCE 

Distinct: 3
Distinct (%): 3.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B
호텔 (hotel): 96
게스트하우스 (guesthouse): 2
기타 (other): 2

Length

Max length: 6
Median length: 2
Mean length: 2.08
Min length: 2
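
These length statistics follow directly from the category counts. A small sketch, assuming length is measured in Unicode code points (which is what Python's `len` returns for these precomposed Hangul strings):

```python
import statistics

# SE_NM categories with their reported counts.
counts = {"호텔": 96, "게스트하우스": 2, "기타": 2}

# Expand to one length per observation.
lengths = [len(value) for value, n in counts.items() for _ in range(n)]

print(max(lengths))                # 6
print(statistics.median(lengths))  # 2.0
print(statistics.mean(lengths))    # 2.08
print(min(lengths))                # 2
```

The mean works out as (96 × 2 + 2 × 6 + 2 × 2) / 100 = 2.08.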

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 호텔
2nd row: 호텔
3rd row: 호텔
4th row: 호텔
5th row: 호텔

Common Values

Value | Count | Frequency (%)
호텔 | 96 | 96.0%
게스트하우스 | 2 | 2.0%
기타 | 2 | 2.0%

Length

Histogram of lengths of the category

Common Values (Plot)

[Figure: bar chart of the common values: 호텔 96 (96.0%), 게스트하우스 2 (2.0%), 기타 2 (2.0%)]

ENG_LANG_HOTEL_NM
Text

MISSING 

Distinct: 84
Distinct (%): 98.8%
Missing: 15
Missing (%): 15.0%
Memory size: 932.0 B

Length

Max length: 48
Median length: 34
Mean length: 26.329412
Min length: 12

Characters and Unicode

Total characters: 2238
Distinct characters: 57
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
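
As an illustration of these character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories for one of the sample hotel names. It covers categories only, because the standard library has no direct lookup for scripts or blocks. The two-letter codes map to the names used in this report: Ll is Lowercase Letter, Lu is Uppercase Letter, Zs is Space Separator, Pd is Dash Punctuation and Po is Other Punctuation.

```python
import unicodedata
from collections import Counter

name = "Jumeirah Mina A'Salam - Madinat Jumeirah"  # one of the sample rows

# General category of each character in the name.
categories = Counter(unicodedata.category(ch) for ch in name)

for code, count in categories.most_common():
    print(code, count)
```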

Unique

Unique: 83
Unique (%): 97.6%

Sample

1st row: Ramee Guestline Hotel
2nd row: Armani Hotel Dubai
3rd row: Premier Inn Dubai International Airport
4th row: Asiana Hotel
5th row: Jumeirah Mina A'Salam - Madinat Jumeirah

Value | Count | Frequency (%)
dubai | 49 | 13.4%
hotel | 43 | 11.7%
(not shown) | 12 | 3.3%
the | 10 | 2.7%
jumeirah | 7 | 1.9%
apartments | 7 | 1.9%
al | 7 | 1.9%
downtown | 7 | 1.9%
by | 6 | 1.6%
inn | 6 | 1.6%
Other values (144) | 212 | 57.9%

Most occurring characters

Value | Count | Frequency (%)
(space) | 281 | 12.6%
a | 205 | 9.2%
e | 175 | 7.8%
t | 142 | 6.3%
i | 139 | 6.2%
o | 132 | 5.9%
n | 117 | 5.2%
r | 113 | 5.0%
l | 106 | 4.7%
u | 85 | 3.8%
Other values (47) | 743 | 33.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1592
71.1%
Uppercase Letter 343
 
15.3%
Space Separator 281
 
12.6%
Dash Punctuation 11
 
0.5%
Other Punctuation 11
 
0.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 205
12.9%
e 175
11.0%
t 142
8.9%
i 139
8.7%
o 132
8.3%
n 117
 
7.3%
r 113
 
7.1%
l 106
 
6.7%
u 85
 
5.3%
b 64
 
4.0%
Other values (16) 314
19.7%
Uppercase Letter
ValueCountFrequency (%)
D 62
18.1%
H 57
16.6%
A 33
9.6%
M 26
 
7.6%
R 21
 
6.1%
P 17
 
5.0%
C 16
 
4.7%
S 15
 
4.4%
I 13
 
3.8%
J 11
 
3.2%
Other values (15) 72
21.0%
Other Punctuation
ValueCountFrequency (%)
& 5
45.5%
, 5
45.5%
' 1
 
9.1%
Dash Punctuation
ValueCountFrequency (%)
- 10
90.9%
1
 
9.1%
Space Separator
ValueCountFrequency (%)
281
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1935
86.5%
Common 303
 
13.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 205
 
10.6%
e 175
 
9.0%
t 142
 
7.3%
i 139
 
7.2%
o 132
 
6.8%
n 117
 
6.0%
r 113
 
5.8%
l 106
 
5.5%
u 85
 
4.4%
b 64
 
3.3%
Other values (41) 657
34.0%
Common
ValueCountFrequency (%)
281
92.7%
- 10
 
3.3%
& 5
 
1.7%
, 5
 
1.7%
' 1
 
0.3%
1
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2237
> 99.9%
Punctuation 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
281
 
12.6%
a 205
 
9.2%
e 175
 
7.8%
t 142
 
6.3%
i 139
 
6.2%
o 132
 
5.9%
n 117
 
5.2%
r 113
 
5.1%
l 106
 
4.7%
u 85
 
3.8%
Other values (46) 742
33.2%
Punctuation
ValueCountFrequency (%)
1
100.0%

KOR_LANG_HOTEL_NM
Text

MISSING 

Distinct: 50
Distinct (%): 100.0%
Missing: 50
Missing (%): 50.0%
Memory size: 932.0 B

Length

Max length: 25
Median length: 21
Mean length: 13.92
Min length: 6

Characters and Unicode

Total characters: 696
Distinct characters: 126
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 50
Unique (%): 100.0%

Sample

1st row: 아르마니 호텔 두바이
2nd row: 프리미어 인 두바이
3rd row: 아시아나 호텔
4th row: 주메이라 미나 아살람 - 마디낫 주메이라
5th row: 주메이라 알 카스르 - 마디낫 주메이라
ValueCountFrequency (%)
두바이 25
 
12.4%
호텔 21
 
10.4%
9
 
4.5%
주메이라 8
 
4.0%
다운타운 6
 
3.0%
바이 5
 
2.5%
5
 
2.5%
5
 
2.5%
4
 
2.0%
힐튼 4
 
2.0%
Other values (88) 109
54.2%

Most occurring characters

ValueCountFrequency (%)
151
21.7%
54
 
7.8%
31
 
4.5%
26
 
3.7%
23
 
3.3%
21
 
3.0%
17
 
2.4%
15
 
2.2%
14
 
2.0%
13
 
1.9%
Other values (116) 331
47.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 534
76.7%
Space Separator 151
 
21.7%
Dash Punctuation 7
 
1.0%
Other Punctuation 2
 
0.3%
Uppercase Letter 1
 
0.1%
Decimal Number 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
54
 
10.1%
31
 
5.8%
26
 
4.9%
23
 
4.3%
21
 
3.9%
17
 
3.2%
15
 
2.8%
14
 
2.6%
13
 
2.4%
13
 
2.4%
Other values (110) 307
57.5%
Other Punctuation
ValueCountFrequency (%)
& 1
50.0%
, 1
50.0%
Space Separator
ValueCountFrequency (%)
151
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%
Uppercase Letter
ValueCountFrequency (%)
W 1
100.0%
Decimal Number
ValueCountFrequency (%)
2 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 534
76.7%
Common 161
 
23.1%
Latin 1
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
54
 
10.1%
31
 
5.8%
26
 
4.9%
23
 
4.3%
21
 
3.9%
17
 
3.2%
15
 
2.8%
14
 
2.6%
13
 
2.4%
13
 
2.4%
Other values (110) 307
57.5%
Common
ValueCountFrequency (%)
151
93.8%
- 7
 
4.3%
& 1
 
0.6%
2 1
 
0.6%
, 1
 
0.6%
Latin
ValueCountFrequency (%)
W 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 534
76.7%
ASCII 162
 
23.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
151
93.2%
- 7
 
4.3%
W 1
 
0.6%
& 1
 
0.6%
2 1
 
0.6%
, 1
 
0.6%
Hangul
ValueCountFrequency (%)
54
 
10.1%
31
 
5.8%
26
 
4.9%
23
 
4.3%
21
 
3.9%
17
 
3.2%
15
 
2.8%
14
 
2.6%
13
 
2.4%
13
 
2.4%
Other values (110) 307
57.5%

ENG_LANG_AREA_NM
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B
Dubai: 100

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Dubai
2nd row: Dubai
3rd row: Dubai
4th row: Dubai
5th row: Dubai

Common Values

Value | Count | Frequency (%)
Dubai | 100 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: bar chart of the common values: Dubai 100 (100.0%)]

KOR_LANG_AREA_NM
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B
두바이 (Dubai): 100

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 두바이
2nd row: 두바이
3rd row: 두바이
4th row: 두바이
5th row: 두바이

Common Values

Value | Count | Frequency (%)
두바이 | 100 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: bar chart of the common values: 두바이 100 (100.0%)]

JAN_LANG_AREA_NM
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B
ドバイ (Dubai): 100

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: ドバイ
2nd row: ドバイ
3rd row: ドバイ
4th row: ドバイ
5th row: ドバイ

Common Values

Value | Count | Frequency (%)
ドバイ | 100 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: bar chart of the common values: ドバイ 100 (100.0%)]

CHG_LANG_AREA_NM
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B
迪拜 (Dubai): 100

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 迪拜
2nd row: 迪拜
3rd row: 迪拜
4th row: 迪拜
5th row: 迪拜

Common Values

Value | Count | Frequency (%)
迪拜 | 100 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: bar chart of the common values: 迪拜 100 (100.0%)]

RN_ADDR
Text

Distinct: 96
Distinct (%): 96.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 90
Median length: 55
Mean length: 33.05
Min length: 14

Characters and Unicode

Total characters: 3305
Distinct characters: 102
Distinct categories: 7
Distinct scripts: 4
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 93
Unique (%): 93.0%

Sample

1st row: 45 Al Rigga Road - Dubai - 아랍에미레이트
2nd row: Unnamed Road - Dubai - 아랍에미리트
3rd row: 26 52b St - Dubai - 아랍에미레이트
4th row: Kutta Residence, Salahuddin Rd - Dubai - 아랍에미레이트
5th row: Unnamed Road - Dubai - 아랍에미리트
ValueCountFrequency (%)
161
22.4%
아랍에미레이트 68
 
9.5%
dubai 58
 
8.1%
두바이 31
 
4.3%
road 22
 
3.1%
st 22
 
3.1%
al 19
 
2.6%
아랍에미리트 18
 
2.5%
street 17
 
2.4%
sheikh 12
 
1.7%
Other values (168) 291
40.5%

Most occurring characters

ValueCountFrequency (%)
619
 
18.7%
a 203
 
6.1%
- 162
 
4.9%
i 135
 
4.1%
e 131
 
4.0%
116
 
3.5%
t 96
 
2.9%
u 93
 
2.8%
91
 
2.8%
87
 
2.6%
Other values (92) 1572
47.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1254
37.9%
Other Letter 773
23.4%
Space Separator 619
18.7%
Uppercase Letter 336
 
10.2%
Dash Punctuation 162
 
4.9%
Decimal Number 144
 
4.4%
Other Punctuation 17
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
116
15.0%
91
11.8%
87
11.3%
86
11.1%
86
11.1%
86
11.1%
68
8.8%
31
 
4.0%
31
 
4.0%
21
 
2.7%
Other values (29) 70
9.1%
Lowercase Letter
ValueCountFrequency (%)
a 203
16.2%
i 135
10.8%
e 131
10.4%
t 96
 
7.7%
u 93
 
7.4%
d 86
 
6.9%
b 76
 
6.1%
r 68
 
5.4%
o 56
 
4.5%
n 55
 
4.4%
Other values (16) 255
20.3%
Uppercase Letter
ValueCountFrequency (%)
S 73
21.7%
D 62
18.5%
R 40
11.9%
A 35
10.4%
B 27
 
8.0%
M 13
 
3.9%
P 11
 
3.3%
Z 9
 
2.7%
C 9
 
2.7%
O 6
 
1.8%
Other values (12) 51
15.2%
Decimal Number
ValueCountFrequency (%)
1 35
24.3%
2 23
16.0%
5 16
11.1%
3 14
 
9.7%
4 11
 
7.6%
9 10
 
6.9%
8 10
 
6.9%
7 9
 
6.2%
0 8
 
5.6%
6 8
 
5.6%
Other Punctuation
ValueCountFrequency (%)
, 12
70.6%
. 4
 
23.5%
' 1
 
5.9%
Space Separator
ValueCountFrequency (%)
619
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 162
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1590
48.1%
Common 942
28.5%
Hangul 770
23.3%
Arabic 3
 
0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 203
 
12.8%
i 135
 
8.5%
e 131
 
8.2%
t 96
 
6.0%
u 93
 
5.8%
d 86
 
5.4%
b 76
 
4.8%
S 73
 
4.6%
r 68
 
4.3%
D 62
 
3.9%
Other values (38) 567
35.7%
Hangul
ValueCountFrequency (%)
116
15.1%
91
11.8%
87
11.3%
86
11.2%
86
11.2%
86
11.2%
68
8.8%
31
 
4.0%
31
 
4.0%
21
 
2.7%
Other values (26) 67
8.7%
Common
ValueCountFrequency (%)
619
65.7%
- 162
 
17.2%
1 35
 
3.7%
2 23
 
2.4%
5 16
 
1.7%
3 14
 
1.5%
, 12
 
1.3%
4 11
 
1.2%
9 10
 
1.1%
8 10
 
1.1%
Other values (5) 30
 
3.2%
Arabic
ValueCountFrequency (%)
ي 1
33.3%
ب 1
33.3%
د 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2532
76.6%
Hangul 770
 
23.3%
Arabic 3
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
619
24.4%
a 203
 
8.0%
- 162
 
6.4%
i 135
 
5.3%
e 131
 
5.2%
t 96
 
3.8%
u 93
 
3.7%
d 86
 
3.4%
b 76
 
3.0%
S 73
 
2.9%
Other values (53) 858
33.9%
Hangul
ValueCountFrequency (%)
116
15.1%
91
11.8%
87
11.3%
86
11.2%
86
11.2%
86
11.2%
68
8.8%
31
 
4.0%
31
 
4.0%
21
 
2.7%
Other values (26) 67
8.7%
Arabic
ValueCountFrequency (%)
ي 1
33.3%
ب 1
33.3%
د 1
33.3%

LO
Real number (ℝ)

HIGH CORRELATION 

Distinct: 80
Distinct (%): 80.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 53.740157
Minimum: -95.937253
Maximum: 55.426124
Zeros: 0
Zeros (%): 0.0%
Negative: 1
Negative (%): 1.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: -95.937253
5-th percentile: 55.137719
Q1: 55.213099
Median: 55.276089
Q3: 55.298639
95-th percentile: 55.339598
Maximum: 55.426124
Range: 151.36338
Interquartile range (IQR): 0.0855395
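
The gap between the minimum and the rest of the quantiles suggests an outlier. One conventional way to make that concrete is Tukey's 1.5 × IQR rule. The report does not apply this rule itself, so the sketch below is an added illustration that uses only the quartiles and extremes reported for LO.

```python
# Quartiles of LO as reported in the quantile statistics.
q1, q3 = 55.213099, 55.298639
iqr = q3 - q1

lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

def is_outlier(value):
    """True if value lies outside the 1.5 * IQR fences."""
    return value < lower_fence or value > upper_fence

print(is_outlier(-95.937253))  # True: the reported minimum
print(is_outlier(55.426124))   # False: the reported maximum
```

Under this rule the negative minimum is flagged, while the maximum stays just inside the upper fence.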

Descriptive statistics

Standard deviation: 15.119319
Coefficient of variation (CV): 0.28134117
Kurtosis: 99.989406
Mean: 53.740157
Median Absolute Deviation (MAD): 0.03253675
Skewness: -9.9992154
Sum: 5374.0157
Variance: 228.5938
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Common Values

Value | Count | Frequency (%)
55.2707828 | 7 | 7.0%
55.2760885 | 6 | 6.0%
55.2796053 | 4 | 4.0%
55.1377696 | 3 | 3.0%
55.3130663 | 2 | 2.0%
55.2986385 | 2 | 2.0%
55.2057449 | 2 | 2.0%
55.2646596 | 2 | 2.0%
55.3224579 | 1 | 1.0%
55.2554388 | 1 | 1.0%
Other values (70) | 70 | 70.0%

Minimum 10 values

Value | Count | Frequency (%)
-95.9372532 | 1 | 1.0%
54.3810499 | 1 | 1.0%
55.1336921 | 1 | 1.0%
55.1346326 | 1 | 1.0%
55.1367517 | 1 | 1.0%
55.1377696 | 3 | 3.0%
55.14039 | 1 | 1.0%
55.1418193 | 1 | 1.0%
55.1437296 | 1 | 1.0%
55.144181 | 1 | 1.0%

Maximum 10 values

Value | Count | Frequency (%)
55.4261239 | 1 | 1.0%
55.3656728 | 1 | 1.0%
55.3615364 | 1 | 1.0%
55.3602314 | 1 | 1.0%
55.343626 | 1 | 1.0%
55.3393857 | 1 | 1.0%
55.3305198 | 1 | 1.0%
55.3300478 | 1 | 1.0%
55.3292847 | 1 | 1.0%
55.3279941 | 1 | 1.0%

LA
Real number (ℝ)

HIGH CORRELATION 

Distinct: 80
Distinct (%): 80.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 25.305002
Minimum: 24.437595
Maximum: 36.743435
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 24.437595
5-th percentile: 25.077376
Q1: 25.155509
Median: 25.211084
Q3: 25.25036
95-th percentile: 25.271629
Maximum: 36.743435
Range: 12.30584
Interquartile range (IQR): 0.094851225

Descriptive statistics

Standard deviation: 1.1596066
Coefficient of variation (CV): 0.045825192
Kurtosis: 98.514782
Mean: 25.305002
Median Absolute Deviation (MAD): 0.04181925
Skewness: 9.8864201
Sum: 2530.5002
Variance: 1.3446875
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Common Values

Value | Count | Frequency (%)
25.2048493 | 7 | 7.0%
25.2140133 | 6 | 6.0%
25.198765 | 4 | 4.0%
25.0703577 | 3 | 3.0%
25.2627284 | 2 | 2.0%
25.2498113 | 2 | 2.0%
25.1555087 | 2 | 2.0%
25.2335173 | 2 | 2.0%
25.2643982 | 1 | 1.0%
25.1891247 | 1 | 1.0%
Other values (70) | 70 | 70.0%

Minimum 10 values

Value | Count | Frequency (%)
24.4375949 | 1 | 1.0%
25.0691132 | 1 | 1.0%
25.0703577 | 3 | 3.0%
25.0777452 | 1 | 1.0%
25.0779214 | 1 | 1.0%
25.0781911 | 1 | 1.0%
25.080494 | 1 | 1.0%
25.083827 | 1 | 1.0%
25.0857958 | 1 | 1.0%
25.0898792 | 1 | 1.0%

Maximum 10 values

Value | Count | Frequency (%)
36.7434349 | 1 | 1.0%
25.284755 | 1 | 1.0%
25.2808801 | 1 | 1.0%
25.2763552 | 1 | 1.0%
25.2743987 | 1 | 1.0%
25.2714828 | 1 | 1.0%
25.2706957 | 1 | 1.0%
25.2691294 | 1 | 1.0%
25.2691002 | 1 | 1.0%
25.2683571 | 1 | 1.0%

TEL_NO
Text

MISSING 

Distinct: 19
Distinct (%): 95.0%
Missing: 80
Missing (%): 80.0%
Memory size: 932.0 B

Length

Max length: 17
Median length: 15
Mean length: 14.55
Min length: 11

Characters and Unicode

Total characters: 291
Distinct characters: 13
Distinct categories: 4
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 18
Unique (%): 90.0%

Sample

1st row: +971 4 260 4000
2nd row: +971-4-238-7777
3rd row: +971-4-366-8888
4th row: +971-4-366-8888
5th row: +971 4 561 9000
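
The sample rows mix space-separated and dash-separated formats, so the same number can appear under different spellings. A minimal normalisation sketch, assuming the desired canonical form is an optional leading plus sign followed by digits only. The function name is hypothetical, and numbers written in local form without a country code are left for separate handling.

```python
import re

def normalize_phone(raw):
    """Keep a leading '+' and the digits; drop spaces, dashes and other separators."""
    raw = raw.strip()
    prefix = "+" if raw.startswith("+") else ""
    return prefix + re.sub(r"\D", "", raw)

print(normalize_phone("+971 4 260 4000"))  # +97142604000
print(normalize_phone("+971-4-238-7777"))  # +97142387777
```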
ValueCountFrequency (%)
4 14
20.9%
971 13
19.4%
971-4-366-8888 2
 
3.0%
1111 2
 
3.0%
1234 2
 
3.0%
04 2
 
3.0%
2000 2
 
3.0%
00 2
 
3.0%
9000 2
 
3.0%
3000 1
 
1.5%
Other values (25) 25
37.3%

Most occurring characters

ValueCountFrequency (%)
47
16.2%
4 33
11.3%
1 31
10.7%
0 31
10.7%
7 27
9.3%
9 22
7.6%
8 20
6.9%
+ 17
 
5.8%
6 16
 
5.5%
3 15
 
5.2%
Other values (3) 32
11.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 218
74.9%
Space Separator 47
 
16.2%
Math Symbol 17
 
5.8%
Dash Punctuation 9
 
3.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 33
15.1%
1 31
14.2%
0 31
14.2%
7 27
12.4%
9 22
10.1%
8 20
9.2%
6 16
7.3%
3 15
6.9%
2 13
 
6.0%
5 10
 
4.6%
Space Separator
ValueCountFrequency (%)
47
100.0%
Math Symbol
ValueCountFrequency (%)
+ 17
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 291
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
47
16.2%
4 33
11.3%
1 31
10.7%
0 31
10.7%
7 27
9.3%
9 22
7.6%
8 20
6.9%
+ 17
 
5.8%
6 16
 
5.5%
3 15
 
5.2%
Other values (3) 32
11.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 291
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
47
16.2%
4 33
11.3%
1 31
10.7%
0 31
10.7%
7 27
9.3%
9 22
7.6%
8 20
6.9%
+ 17
 
5.8%
6 16
 
5.5%
3 15
 
5.2%
Other values (3) 32
11.0%

REGIST_DE
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 100
Missing (%): 100.0%
Memory size: 1.0 KiB

BASE_YMD
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B
2020-12-09: 100

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2020-12-09
2nd row: 2020-12-09
3rd row: 2020-12-09
4th row: 2020-12-09
5th row: 2020-12-09

Common Values

Value | Count | Frequency (%)
2020-12-09 | 100 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: bar chart of the common values: 2020-12-09 100 (100.0%)]

Interactions

[Figures: four interaction plots; images not included in this text extract]

Correlations

Variable | SE_NM | ENG_LANG_HOTEL_NM | KOR_LANG_HOTEL_NM | RN_ADDR | LO | LA | TEL_NO
SE_NM | 1.000 | 0.000 | 1.000 | 1.000 | 0.000 | 0.000 | NaN
ENG_LANG_HOTEL_NM | 0.000 | 1.000 | 1.000 | 0.998 | 1.000 | 1.000 | 1.000
KOR_LANG_HOTEL_NM | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
RN_ADDR | 1.000 | 0.998 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
LO | 0.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.693 | NaN
LA | 0.000 | 1.000 | 1.000 | 1.000 | 0.693 | 1.000 | NaN
TEL_NO | NaN | 1.000 | 1.000 | 1.000 | NaN | NaN | 1.000

Variable | LO | LA | SE_NM
LO | 1.000 | 0.832 | 0.000
LA | 0.832 | 1.000 | 0.000
SE_NM | 0.000 | 0.000 | 1.000

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out visual patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
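
Nullity correlation can be read as the ordinary Pearson correlation between the missing-value indicators of two columns. The sketch below assumes that reading and uses toy columns with hypothetical values, not rows from this dataset. A result of -1 means one column is missing exactly when the other is present, and +1 means they are always missing together.

```python
import math

def nullity_correlation(col_a, col_b):
    """Pearson correlation between the missing-value indicators of two columns."""
    a = [1.0 if v is None else 0.0 for v in col_a]
    b = [1.0 if v is None else 0.0 for v in col_b]
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        # Undefined when a column is fully present or fully missing.
        return float("nan")
    return cov / math.sqrt(var_a * var_b)

# Toy columns: None marks a missing cell.
english = ["a", "b", None, None]
korean = [None, None, "c", "d"]
print(nullity_correlation(english, korean))  # -1.0
```

A column that is entirely missing, as REGIST_DE is here, has no variance in its indicator, so its nullity correlation with any other column is undefined.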

Sample

First rows

Row | SE_NM | ENG_LANG_HOTEL_NM | KOR_LANG_HOTEL_NM | ENG_LANG_AREA_NM | KOR_LANG_AREA_NM | JAN_LANG_AREA_NM | CHG_LANG_AREA_NM | RN_ADDR | LO | LA | TEL_NO | REGIST_DE | BASE_YMD
0 | 호텔 | Ramee Guestline Hotel | <NA> | Dubai | 두바이 | ドバイ | 迪拜 | 45 Al Rigga Road - Dubai - 아랍에미레이트 | 55.322458 | 25.264398 | <NA> | <NA> | 2020-12-09
1 | 호텔 | Armani Hotel Dubai | 아르마니 호텔 두바이 | Dubai | 두바이 | ドバイ | 迪拜 | Unnamed Road - Dubai - 아랍에미리트 | 55.279605 | 25.198765 | <NA> | <NA> | 2020-12-09
2 | 호텔 | Premier Inn Dubai International Airport | 프리미어 인 두바이 | Dubai | 두바이 | ドバイ | 迪拜 | 26 52b St - Dubai - 아랍에미레이트 | 55.426124 | 25.227555 | +971 4 260 4000 | <NA> | 2020-12-09
3 | 호텔 | Asiana Hotel | 아시아나 호텔 | Dubai | 두바이 | ドバイ | 迪拜 | Kutta Residence, Salahuddin Rd - Dubai - 아랍에미레이트 | 55.32696 | 25.270696 | +971-4-238-7777 | <NA> | 2020-12-09
4 | 호텔 | Jumeirah Mina A'Salam - Madinat Jumeirah | 주메이라 미나 아살람 - 마디낫 주메이라 | Dubai | 두바이 | ドバイ | 迪拜 | Unnamed Road - Dubai - 아랍에미리트 | 55.279605 | 25.198765 | +971-4-366-8888 | <NA> | 2020-12-09
5 | 호텔 | Jumeirah Al Qasr - Madinat Jumeirah | 주메이라 알 카스르 - 마디낫 주메이라 | Dubai | 두바이 | ドバイ | 迪拜 | Madinat Jumeirah King Salman bin Abdulaziz Al Saud Street - Umm Suqeim St - Dubai - 아랍에미리트 | 55.184919 | 25.132513 | +971-4-366-8888 | <NA> | 2020-12-09
6 | 호텔 | Holiday Inn Express Dubai Airport | 홀리데이 인 익스프레스 두바이 에어포트 | Dubai | 두바이 | ドバイ | 迪拜 | 48 54 St - Dubai - 아랍에미리트 | 55.360231 | 25.241881 | <NA> | <NA> | 2020-12-09
7 | 게스트하우스 | At the top hostel - Marina Crown | <NA> | Dubai | 두바이 | ドバイ | 迪拜 | Marina Crown - King Salman bin Abdulaziz Al Saud St - Dubai - 아랍에미레이트 | 55.148232 | 25.089879 | <NA> | <NA> | 2020-12-09
8 | 호텔 | Copthorne Hotel Dubai | <NA> | Dubai | 두바이 | ドバイ | 迪拜 | 11 34 Street - 두바이 - 아랍에미레이트 | 55.269887 | 25.234499 | <NA> | <NA> | 2020-12-09
9 | 호텔 | Rove Downtown Dubai | 로브 다운타운 두바이 | Dubai | 두바이 | ドバイ | 迪拜 | 2 43 St - Dubai - 아랍에미리트 | 55.20615 | 25.098093 | +971 4 561 9000 | <NA> | 2020-12-09

Last rows

Row | SE_NM | ENG_LANG_HOTEL_NM | KOR_LANG_HOTEL_NM | ENG_LANG_AREA_NM | KOR_LANG_AREA_NM | JAN_LANG_AREA_NM | CHG_LANG_AREA_NM | RN_ADDR | LO | LA | TEL_NO | REGIST_DE | BASE_YMD
90 | 호텔 | Radisson bllu Dubai | <NA> | Dubai | 두바이 | ドバイ | 迪拜 | Al Falak St - Dubai - 아랍에미레이트 | 55.160297 | 25.0963 | <NA> | <NA> | 2020-12-09
91 | 기타 | Golden sends Hotel Apartments | <NA> | Dubai | 두바이 | ドバイ | 迪拜 | 11 15 A St - Dubai - 아랍에미레이트 | 55.205745 | 25.155509 | <NA> | <NA> | 2020-12-09
92 | 호텔 | Holiday Inn Express Dubai Jumeirah | <NA> | Dubai | 두바이 | ドバイ | 迪拜 | Jumeirah Road - 두바이 - 아랍에미레이트 | 55.26466 | 25.233517 | <NA> | <NA> | 2020-12-09
93 | 호텔 | <NA> | 퀸 엘리자베스 2 호텔 | Dubai | 두바이 | ドバイ | 迪拜 | Port Rashid - Unnamed Rd - Dubai - 아랍에미리트 | 55.27565 | 25.284755 | +97145268888 | <NA> | 2020-12-09
94 | 호텔 | <NA> | 산마르코 호텔 | Dubai | 두바이 | ドバイ | 迪拜 | Frij Murar, Street 2 | 55.309511 | 25.276355 | <NA> | <NA> | 2020-12-09
95 | 호텔 | <NA> | 힐튼 가든 인 두바이 알 무라카밧 | Dubai | 두바이 | ドバイ | 迪拜 | Abu Baker Al Siddique Road | 55.329285 | 25.274399 | <NA> | <NA> | 2020-12-09
96 | 호텔 | <NA> | 샹그릴라 두바이 아파트먼트 | Dubai | 두바이 | ドバイ | 迪拜 | Sheikh Zayed Road, Trade Center Area | 55.286934 | 25.223346 | <NA> | <NA> | 2020-12-09
97 | 호텔 | <NA> | 노보텔 두바이 월드 트레이드 센터 | Dubai | 두바이 | ドバイ | 迪拜 | Al Saada Street | 54.38105 | 24.437595 | <NA> | <NA> | 2020-12-09
98 | 호텔 | <NA> | W 두바이 알 합투르 시티 | Dubai | 두바이 | ドバイ | 迪拜 | Al Habtoor City | 55.254316 | 25.184179 | <NA> | <NA> | 2020-12-09
99 | 호텔 | <NA> | 칼튼 플레이스 호텔 | Dubai | 두바이 | ドバイ | 迪拜 | Al Maktoum Street | 55.320794 | 25.26082 | <NA> | <NA> | 2020-12-09
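
Because every row is labelled Dubai, a rough geographic sanity check on LO (longitude) and LA (latitude) can surface rows whose coordinates look out of place. The sketch below uses a hypothetical bounding box chosen only for illustration, not an authoritative boundary, and tests two of the sample rows.

```python
# Hypothetical rough bounding box around Dubai, for illustration only.
LON_MIN, LON_MAX = 54.8, 55.7
LAT_MIN, LAT_MAX = 24.7, 25.5

def inside_box(lon, lat):
    """True if the point falls inside the assumed bounding box."""
    return LON_MIN <= lon <= LON_MAX and LAT_MIN <= lat <= LAT_MAX

# (LO, LA) pairs for sample rows 0 and 97.
rows = {
    0: (55.322458, 25.264398),
    97: (54.38105, 24.437595),
}

suspects = [idx for idx, (lon, lat) in rows.items() if not inside_box(lon, lat)]
print(suspects)  # [97]
```

The same check would also flag the rows that hold the LO minimum of -95.937253 and the LA maximum of 36.743435.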