Overview

Dataset statistics

Number of variables12
Number of observations10000
Missing cells8450
Missing cells (%)7.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.0 MiB
Average record size in memory107.0 B

Variable types

DateTime1
Categorical4
Numeric3
Text4

Dataset

Description2019년~2023년6월30일 까지 7Luck 카지노의 3개 영업점(서울강남, 강북, 부산)의 F&B 파트에서 고객에게 제공한 식음료를 일별, 메뉴별, 국적별, 성별로 주문 수량을 집계한 데이터입니다. 더 상세한 데이터를 요청하시면 검토후 개방하겠습니다.
URLhttps://www.data.go.kr/data/15048172/fileData.do

Alerts

영업점코드 is highly overall correlated with 영업점명High correlation
영업점명 is highly overall correlated with 영업점코드High correlation
식음메뉴그룹코드 is highly overall correlated with 식음메뉴코드 and 1 other fieldsHigh correlation
식음메뉴코드 is highly overall correlated with 식음메뉴그룹코드High correlation
식음메뉴그룹명 is highly overall correlated with 식음메뉴그룹코드High correlation
식음메뉴명(영문) has 6067 (60.7%) missing valuesMissing
고객여권발급국가코드 has 1191 (11.9%) missing valuesMissing
고객여권발급국가명 has 1192 (11.9%) missing valuesMissing

Reproduction

Analysis started2023-12-12 22:13:54.988616
Analysis finished2023-12-12 22:13:58.390210
Duration3.4 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct1492
Distinct (%)14.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2019-01-01 00:00:00
Maximum2023-06-30 00:00:00
2023-12-13T07:13:58.454921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:13:58.614750image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

영업점코드
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
CX
4021 
HT
3428 
LT
2551 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowHT
2nd rowHT
3rd rowHT
4th rowCX
5th rowLT

Common Values

ValueCountFrequency (%)
CX 4021
40.2%
HT 3428
34.3%
LT 2551
25.5%

Length

2023-12-13T07:13:58.763554image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:13:58.857245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
cx 4021
40.2%
ht 3428
34.3%
lt 2551
25.5%

영업점명
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
코엑스
4021 
드래곤
3428 
롯데
2551 

Length

Max length3
Median length3
Mean length2.7449
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row드래곤
2nd row드래곤
3rd row드래곤
4th row코엑스
5th row롯데

Common Values

ValueCountFrequency (%)
코엑스 4021
40.2%
드래곤 3428
34.3%
롯데 2551
25.5%

Length

2023-12-13T07:13:58.972005image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:13:59.066430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
코엑스 4021
40.2%
드래곤 3428
34.3%
롯데 2551
25.5%

식음메뉴그룹코드
Real number (ℝ)

HIGH CORRELATION 

Distinct19
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3030.2641
Minimum1001
Maximum9002
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T07:13:59.153548image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1001
5-th percentile1001
Q11002
median2009
Q32012
95-th percentile9001
Maximum9002
Range8001
Interquartile range (IQR)1010

Descriptive statistics

Standard deviation3030.5289
Coefficient of variation (CV)1.0000874
Kurtosis-0.18540277
Mean3030.2641
Median Absolute Deviation (MAD)1007
Skewness1.2930426
Sum30302641
Variance9184105.1
MonotonicityNot monotonic
2023-12-13T07:13:59.290531image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=19)
ValueCountFrequency (%)
1002 2604
26.0%
2011 1527
15.3%
9001 1275
12.8%
2012 767
 
7.7%
8000 766
 
7.7%
1001 721
 
7.2%
1004 579
 
5.8%
1003 501
 
5.0%
2009 414
 
4.1%
2007 226
 
2.3%
Other values (9) 620
 
6.2%
ValueCountFrequency (%)
1001 721
 
7.2%
1002 2604
26.0%
1003 501
 
5.0%
1004 579
 
5.8%
2001 11
 
0.1%
2002 80
 
0.8%
2003 24
 
0.2%
2004 2
 
< 0.1%
2005 36
 
0.4%
2006 55
 
0.5%
ValueCountFrequency (%)
9002 163
 
1.6%
9001 1275
12.8%
8000 766
7.7%
2012 767
7.7%
2011 1527
15.3%
2010 115
 
1.1%
2009 414
 
4.1%
2008 134
 
1.3%
2007 226
 
2.3%
2006 55
 
0.5%

식음메뉴그룹명
Categorical

HIGH CORRELATION 

Distinct19
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
라이스류
2604 
음료
1527 
기타
1275 
커피
767 
담배
766 
Other values (14)
3061 

Length

Max length7
Median length2
Mean length3.3275
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row숲/샌드위치
2nd row라이스류
3rd rowVIP안주
4th row라이스류
5th row라이스류

Common Values

ValueCountFrequency (%)
라이스류 2604
26.0%
음료 1527
15.3%
기타 1275
12.8%
커피 767
 
7.7%
담배 766
 
7.7%
숲/샌드위치 721
 
7.2%
VIP안주 579
 
5.8%
면류 501
 
5.0%
주류(일반용) 414
 
4.1%
생과일주스 226
 
2.3%
Other values (9) 620
 
6.2%

Length

2023-12-13T07:13:59.417708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
라이스류 2604
26.0%
음료 1527
15.3%
기타 1275
12.8%
커피 767
 
7.7%
담배 766
 
7.7%
숲/샌드위치 721
 
7.2%
vip안주 579
 
5.8%
면류 501
 
5.0%
주류(일반용 414
 
4.1%
생과일주스 226
 
2.3%
Other values (9) 620
 
6.2%

식음메뉴코드
Real number (ℝ)

HIGH CORRELATION 

Distinct755
Distinct (%)7.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3364.8886
Minimum70
Maximum48373
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T07:13:59.562587image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum70
5-th percentile1031
Q11331
median2065
Q33009
95-th percentile9030
Maximum48373
Range48303
Interquartile range (IQR)1678

Descriptive statistics

Standard deviation4252.0715
Coefficient of variation (CV)1.2636589
Kurtosis55.724542
Mean3364.8886
Median Absolute Deviation (MAD)756
Skewness6.0713098
Sum33648886
Variance18080112
MonotonicityNot monotonic
2023-12-13T07:13:59.728783image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1028 223
 
2.2%
2057 127
 
1.3%
2124 104
 
1.0%
2100 103
 
1.0%
9004 100
 
1.0%
3010 99
 
1.0%
1062 95
 
0.9%
2074 94
 
0.9%
2121 93
 
0.9%
2063 87
 
0.9%
Other values (745) 8875
88.8%
ValueCountFrequency (%)
70 1
 
< 0.1%
1001 2
 
< 0.1%
1002 2
 
< 0.1%
1003 49
0.5%
1006 37
0.4%
1007 14
 
0.1%
1009 14
 
0.1%
1011 12
 
0.1%
1012 4
 
< 0.1%
1018 4
 
< 0.1%
ValueCountFrequency (%)
48373 25
0.2%
45211 27
0.3%
32754 6
 
0.1%
23434 2
 
< 0.1%
13242 1
 
< 0.1%
12524 4
 
< 0.1%
10202 20
0.2%
10010 1
 
< 0.1%
10006 6
 
0.1%
10003 46
0.5%
Distinct875
Distinct (%)8.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T07:13:59.986627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length16
Mean length7.5736
Min length1

Characters and Unicode

Total characters75736
Distinct characters470
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique129 ?
Unique (%)1.3%

Sample

1st row후라이드 에그(2알)
2nd row장어덮밥
3rd row계절과일(VIP)
4th row봄새싹회덮밥정식
5th row쇠고기미역국&생선구이
ValueCountFrequency (%)
에쎄 260
 
2.2%
계절과일(vip 223
 
1.9%
생맥주 177
 
1.5%
아이스 174
 
1.5%
위스키 139
 
1.2%
말보로 138
 
1.2%
블랙커피 108
 
0.9%
미에로화이바 100
 
0.9%
에비앙워터(vip 99
 
0.8%
펩시콜라 98
 
0.8%
Other values (883) 10158
87.0%
2023-12-13T07:14:00.547313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 4377
 
5.8%
) 4377
 
5.8%
2273
 
3.0%
I 1926
 
2.5%
P 1915
 
2.5%
V 1906
 
2.5%
1762
 
2.3%
1692
 
2.2%
990
 
1.3%
984
 
1.3%
Other values (460) 53534
70.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 52229
69.0%
Uppercase Letter 9618
 
12.7%
Open Punctuation 4377
 
5.8%
Close Punctuation 4377
 
5.8%
Decimal Number 2957
 
3.9%
Space Separator 1692
 
2.2%
Other Punctuation 382
 
0.5%
Dash Punctuation 52
 
0.1%
Math Symbol 42
 
0.1%
Connector Punctuation 10
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2273
 
4.4%
1762
 
3.4%
990
 
1.9%
984
 
1.9%
909
 
1.7%
859
 
1.6%
766
 
1.5%
762
 
1.5%
761
 
1.5%
754
 
1.4%
Other values (420) 41409
79.3%
Uppercase Letter
ValueCountFrequency (%)
I 1926
20.0%
P 1915
19.9%
V 1906
19.8%
M 793
8.2%
G 751
 
7.8%
O 480
 
5.0%
T 398
 
4.1%
L 354
 
3.7%
H 175
 
1.8%
C 140
 
1.5%
Other values (12) 780
8.1%
Decimal Number
ValueCountFrequency (%)
0 926
31.3%
5 616
20.8%
1 379
12.8%
3 299
 
10.1%
4 292
 
9.9%
6 204
 
6.9%
2 94
 
3.2%
8 86
 
2.9%
7 61
 
2.1%
Other Punctuation
ValueCountFrequency (%)
. 216
56.5%
& 146
38.2%
, 20
 
5.2%
Open Punctuation
ValueCountFrequency (%)
( 4377
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4377
100.0%
Space Separator
ValueCountFrequency (%)
1692
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 52
100.0%
Math Symbol
ValueCountFrequency (%)
+ 42
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 52192
68.9%
Common 13889
 
18.3%
Latin 9618
 
12.7%
Han 37
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2273
 
4.4%
1762
 
3.4%
990
 
1.9%
984
 
1.9%
909
 
1.7%
859
 
1.6%
766
 
1.5%
762
 
1.5%
761
 
1.5%
754
 
1.4%
Other values (419) 41372
79.3%
Latin
ValueCountFrequency (%)
I 1926
20.0%
P 1915
19.9%
V 1906
19.8%
M 793
8.2%
G 751
 
7.8%
O 480
 
5.0%
T 398
 
4.1%
L 354
 
3.7%
H 175
 
1.8%
C 140
 
1.5%
Other values (12) 780
8.1%
Common
ValueCountFrequency (%)
( 4377
31.5%
) 4377
31.5%
1692
 
12.2%
0 926
 
6.7%
5 616
 
4.4%
1 379
 
2.7%
3 299
 
2.2%
4 292
 
2.1%
. 216
 
1.6%
6 204
 
1.5%
Other values (8) 511
 
3.7%
Han
ValueCountFrequency (%)
37
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 52192
68.9%
ASCII 23507
31.0%
CJK 37
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 4377
18.6%
) 4377
18.6%
I 1926
8.2%
P 1915
8.1%
V 1906
8.1%
1692
 
7.2%
0 926
 
3.9%
M 793
 
3.4%
G 751
 
3.2%
5 616
 
2.6%
Other values (30) 4228
18.0%
Hangul
ValueCountFrequency (%)
2273
 
4.4%
1762
 
3.4%
990
 
1.9%
984
 
1.9%
909
 
1.7%
859
 
1.6%
766
 
1.5%
762
 
1.5%
761
 
1.5%
754
 
1.4%
Other values (419) 41372
79.3%
CJK
ValueCountFrequency (%)
37
100.0%
Distinct488
Distinct (%)12.4%
Missing6067
Missing (%)60.7%
Memory size156.2 KiB
2023-12-13T07:14:00.865727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length63
Median length50
Mean length25.643021
Min length3

Characters and Unicode

Total characters100854
Distinct characters273
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique83 ?
Unique (%)2.1%

Sample

1st rowFried Egg
2nd rowRice with Grilled Eel(장어덮밥정식)
3rd rowSeasonal Fresh Fruits
4th rowMixed Vegetable and Sashimi with Rice
5th rowseaweed soup with beef & Grilled Fish
ValueCountFrequency (%)
with 1281
 
8.4%
rice 734
 
4.8%
beef 524
 
3.4%
soup 427
 
2.8%
fried 325
 
2.1%
pork 322
 
2.1%
stir-fried 321
 
2.1%
and 300
 
2.0%
grilled 271
 
1.8%
stew 228
 
1.5%
Other values (555) 10487
68.9%
2023-12-13T07:14:01.368190image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
11639
 
11.5%
e 9843
 
9.8%
i 6689
 
6.6%
o 4837
 
4.8%
a 4451
 
4.4%
t 4081
 
4.0%
r 4068
 
4.0%
S 3476
 
3.4%
d 3225
 
3.2%
n 3041
 
3.0%
Other values (263) 45504
45.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 65010
64.5%
Uppercase Letter 12530
 
12.4%
Space Separator 11649
 
11.6%
Other Letter 7832
 
7.8%
Open Punctuation 1428
 
1.4%
Close Punctuation 1426
 
1.4%
Dash Punctuation 478
 
0.5%
Other Punctuation 330
 
0.3%
Decimal Number 171
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
343
 
4.4%
280
 
3.6%
280
 
3.6%
203
 
2.6%
186
 
2.4%
186
 
2.4%
180
 
2.3%
178
 
2.3%
177
 
2.3%
161
 
2.1%
Other values (199) 5658
72.2%
Lowercase Letter
ValueCountFrequency (%)
e 9843
15.1%
i 6689
 
10.3%
o 4837
 
7.4%
a 4451
 
6.8%
t 4081
 
6.3%
r 4068
 
6.3%
d 3225
 
5.0%
n 3041
 
4.7%
l 2947
 
4.5%
c 2844
 
4.4%
Other values (17) 18984
29.2%
Uppercase Letter
ValueCountFrequency (%)
S 3476
27.7%
B 1700
13.6%
R 1397
11.1%
P 868
 
6.9%
F 851
 
6.8%
C 622
 
5.0%
G 476
 
3.8%
M 441
 
3.5%
L 377
 
3.0%
N 369
 
2.9%
Other values (16) 1953
15.6%
Other Punctuation
ValueCountFrequency (%)
& 261
79.1%
, 41
 
12.4%
' 28
 
8.5%
Space Separator
ValueCountFrequency (%)
11639
99.9%
  10
 
0.1%
Dash Punctuation
ValueCountFrequency (%)
- 466
97.5%
12
 
2.5%
Decimal Number
ValueCountFrequency (%)
7 170
99.4%
2 1
 
0.6%
Open Punctuation
ValueCountFrequency (%)
( 1428
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1426
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 77540
76.9%
Common 15482
 
15.4%
Hangul 7832
 
7.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
343
 
4.4%
280
 
3.6%
280
 
3.6%
203
 
2.6%
186
 
2.4%
186
 
2.4%
180
 
2.3%
178
 
2.3%
177
 
2.3%
161
 
2.1%
Other values (199) 5658
72.2%
Latin
ValueCountFrequency (%)
e 9843
 
12.7%
i 6689
 
8.6%
o 4837
 
6.2%
a 4451
 
5.7%
t 4081
 
5.3%
r 4068
 
5.2%
S 3476
 
4.5%
d 3225
 
4.2%
n 3041
 
3.9%
l 2947
 
3.8%
Other values (43) 30882
39.8%
Common
ValueCountFrequency (%)
11639
75.2%
( 1428
 
9.2%
) 1426
 
9.2%
- 466
 
3.0%
& 261
 
1.7%
7 170
 
1.1%
, 41
 
0.3%
' 28
 
0.2%
12
 
0.1%
  10
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 92999
92.2%
Hangul 7832
 
7.8%
Punctuation 12
 
< 0.1%
None 11
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
11639
 
12.5%
e 9843
 
10.6%
i 6689
 
7.2%
o 4837
 
5.2%
a 4451
 
4.8%
t 4081
 
4.4%
r 4068
 
4.4%
S 3476
 
3.7%
d 3225
 
3.5%
n 3041
 
3.3%
Other values (51) 37649
40.5%
Hangul
ValueCountFrequency (%)
343
 
4.4%
280
 
3.6%
280
 
3.6%
203
 
2.6%
186
 
2.4%
186
 
2.4%
180
 
2.3%
178
 
2.3%
177
 
2.3%
161
 
2.1%
Other values (199) 5658
72.2%
Punctuation
ValueCountFrequency (%)
12
100.0%
None
ValueCountFrequency (%)
  10
90.9%
é 1
 
9.1%

성별
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
M
6604 
F
2205 
<NA>
1191 

Length

Max length4
Median length1
Mean length1.3573
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd rowF
3rd rowM
4th rowM
5th rowM

Common Values

ValueCountFrequency (%)
M 6604
66.0%
F 2205
 
22.1%
<NA> 1191
 
11.9%

Length

2023-12-13T07:14:01.537261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:14:01.678918image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
m 6604
66.0%
f 2205
 
22.1%
na 1191
 
11.9%
Distinct54
Distinct (%)0.6%
Missing1191
Missing (%)11.9%
Memory size156.2 KiB
2023-12-13T07:14:01.850190image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length2.9988648
Min length1

Characters and Unicode

Total characters26417
Distinct characters26
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique13 ?
Unique (%)0.1%

Sample

1st rowJPN
2nd rowTWN
3rd rowCHN
4th rowBGD
5th rowAUS
ValueCountFrequency (%)
chn 2212
25.1%
jpn 1149
13.0%
usa 1069
12.1%
kor 1052
11.9%
twn 786
 
8.9%
vnm 436
 
4.9%
mng 354
 
4.0%
can 288
 
3.3%
rus 163
 
1.9%
uzb 151
 
1.7%
Other values (44) 1149
13.0%
2023-12-13T07:14:02.230388image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
N 5525
20.9%
C 2508
9.5%
H 2451
9.3%
A 1791
 
6.8%
U 1531
 
5.8%
P 1499
 
5.7%
S 1489
 
5.6%
K 1346
 
5.1%
R 1264
 
4.8%
J 1154
 
4.4%
Other values (16) 5859
22.2%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 26417
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
N 5525
20.9%
C 2508
9.5%
H 2451
9.3%
A 1791
 
6.8%
U 1531
 
5.8%
P 1499
 
5.7%
S 1489
 
5.6%
K 1346
 
5.1%
R 1264
 
4.8%
J 1154
 
4.4%
Other values (16) 5859
22.2%

Most occurring scripts

ValueCountFrequency (%)
Latin 26417
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
N 5525
20.9%
C 2508
9.5%
H 2451
9.3%
A 1791
 
6.8%
U 1531
 
5.8%
P 1499
 
5.7%
S 1489
 
5.6%
K 1346
 
5.1%
R 1264
 
4.8%
J 1154
 
4.4%
Other values (16) 5859
22.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 26417
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
N 5525
20.9%
C 2508
9.5%
H 2451
9.3%
A 1791
 
6.8%
U 1531
 
5.8%
P 1499
 
5.7%
S 1489
 
5.6%
K 1346
 
5.1%
R 1264
 
4.8%
J 1154
 
4.4%
Other values (16) 5859
22.2%
Distinct53
Distinct (%)0.6%
Missing1192
Missing (%)11.9%
Memory size156.2 KiB
2023-12-13T07:14:02.460260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length2
Mean length3.2954133
Min length2

Characters and Unicode

Total characters29026
Distinct characters103
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique12 ?
Unique (%)0.1%

Sample

1st row일본
2nd row대만
3rd row중국
4th row방글라데시
5th row호주
ValueCountFrequency (%)
중국 2212
25.1%
일본 1149
13.0%
미국 1069
12.1%
대한민국(영주권자 1052
11.9%
대만 786
 
8.9%
베트남 436
 
5.0%
몽골 354
 
4.0%
캐나다 288
 
3.3%
러시아 163
 
1.9%
우즈베키스탄 151
 
1.7%
Other values (43) 1148
13.0%
2023-12-13T07:14:02.861564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4415
 
15.2%
2212
 
7.6%
1838
 
6.3%
1184
 
4.1%
1154
 
4.0%
1149
 
4.0%
1086
 
3.7%
1083
 
3.7%
1057
 
3.6%
) 1052
 
3.6%
Other values (93) 12796
44.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 26922
92.8%
Close Punctuation 1052
 
3.6%
Open Punctuation 1052
 
3.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4415
16.4%
2212
 
8.2%
1838
 
6.8%
1184
 
4.4%
1154
 
4.3%
1149
 
4.3%
1086
 
4.0%
1083
 
4.0%
1057
 
3.9%
1052
 
3.9%
Other values (91) 10692
39.7%
Close Punctuation
ValueCountFrequency (%)
) 1052
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1052
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 26922
92.8%
Common 2104
 
7.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4415
16.4%
2212
 
8.2%
1838
 
6.8%
1184
 
4.4%
1154
 
4.3%
1149
 
4.3%
1086
 
4.0%
1083
 
4.0%
1057
 
3.9%
1052
 
3.9%
Other values (91) 10692
39.7%
Common
ValueCountFrequency (%)
) 1052
50.0%
( 1052
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 26922
92.8%
ASCII 2104
 
7.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4415
16.4%
2212
 
8.2%
1838
 
6.8%
1184
 
4.4%
1154
 
4.3%
1149
 
4.3%
1086
 
4.0%
1083
 
4.0%
1057
 
3.9%
1052
 
3.9%
Other values (91) 10692
39.7%
ASCII
ValueCountFrequency (%)
) 1052
50.0%
( 1052
50.0%

주문수량
Real number (ℝ)

Distinct187
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8.1794
Minimum0
Maximum1152
Zeros39
Zeros (%)0.4%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T07:14:03.021696image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q11
median2
Q34
95-th percentile24
Maximum1152
Range1152
Interquartile range (IQR)3

Descriptive statistics

Standard deviation35.691408
Coefficient of variation (CV)4.3635729
Kurtosis245.32943
Mean8.1794
Median Absolute Deviation (MAD)1
Skewness13.105136
Sum81794
Variance1273.8766
MonotonicityNot monotonic
2023-12-13T07:14:03.223649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 4225
42.2%
2 2102
21.0%
3 839
 
8.4%
4 537
 
5.4%
5 387
 
3.9%
6 256
 
2.6%
7 172
 
1.7%
10 139
 
1.4%
8 122
 
1.2%
9 89
 
0.9%
Other values (177) 1132
 
11.3%
ValueCountFrequency (%)
0 39
 
0.4%
1 4225
42.2%
2 2102
21.0%
3 839
 
8.4%
4 537
 
5.4%
5 387
 
3.9%
6 256
 
2.6%
7 172
 
1.7%
8 122
 
1.2%
9 89
 
0.9%
ValueCountFrequency (%)
1152 1
< 0.1%
864 1
< 0.1%
721 1
< 0.1%
639 1
< 0.1%
627 1
< 0.1%
613 1
< 0.1%
607 1
< 0.1%
603 1
< 0.1%
551 1
< 0.1%
489 1
< 0.1%

Interactions

2023-12-13T07:13:57.506902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:13:56.537819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:13:56.881230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:13:57.658047image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:13:56.652296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:13:57.282662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:13:57.774585image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:13:56.770925image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:13:57.401289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T07:14:03.336436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
영업점코드영업점명식음메뉴그룹코드식음메뉴그룹명식음메뉴코드성별고객여권발급국가코드고객여권발급국가명주문수량
영업점코드1.0001.0000.1650.4660.2270.0000.4780.4610.075
영업점명1.0001.0000.1650.4660.2270.0000.4780.4610.075
식음메뉴그룹코드0.1650.1651.0001.0000.6410.0550.3010.3160.107
식음메뉴그룹명0.4660.4661.0001.0000.6530.1010.2880.2710.288
식음메뉴코드0.2270.2270.6410.6531.0000.0150.1370.1330.000
성별0.0000.0000.0550.1010.0151.0000.2290.2210.000
고객여권발급국가코드0.4780.4780.3010.2880.1370.2291.0001.0000.000
고객여권발급국가명0.4610.4610.3160.2710.1330.2211.0001.0000.000
주문수량0.0750.0750.1070.2880.0000.0000.0000.0001.000
2023-12-13T07:14:03.484297image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
성별식음메뉴그룹명영업점코드영업점명
성별1.0000.0800.0000.000
식음메뉴그룹명0.0801.0000.2810.281
영업점코드0.0000.2811.0001.000
영업점명0.0000.2811.0001.000
2023-12-13T07:14:03.610208image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
식음메뉴그룹코드식음메뉴코드주문수량영업점코드영업점명식음메뉴그룹명성별
식음메뉴그룹코드1.0000.7510.1620.1460.1460.9990.042
식음메뉴코드0.7511.0000.1490.0960.0960.3730.010
주문수량0.1620.1491.0000.0330.0330.1170.000
영업점코드0.1460.0960.0331.0001.0000.2810.000
영업점명0.1460.0960.0331.0001.0000.2810.000
식음메뉴그룹명0.9990.3730.1170.2810.2811.0000.080
성별0.0420.0100.0000.0000.0000.0801.000

Missing values

2023-12-13T07:13:57.940816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T07:13:58.167787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T07:13:58.310031image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

지급일자영업점코드영업점명식음메뉴그룹코드식음메뉴그룹명식음메뉴코드식음메뉴명(한글)식음메뉴명(영문)성별고객여권발급국가코드고객여권발급국가명주문수량
139072020-11-17HT드래곤1001숲/샌드위치1032후라이드 에그(2알)Fried Egg<NA><NA><NA>1
479242023-05-29HT드래곤1002라이스류1304장어덮밥Rice with Grilled Eel(장어덮밥정식)FJPN일본1
919032022-12-01HT드래곤1004VIP안주1028계절과일(VIP)Seasonal Fresh FruitsMTWN대만2
889632022-02-25CX코엑스1002라이스류1040봄새싹회덮밥정식Mixed Vegetable and Sashimi with RiceMCHN중국3
804932019-11-17LT롯데1002라이스류1251쇠고기미역국&생선구이seaweed soup with beef & Grilled FishMBGD방글라데시1
549132019-11-30CX코엑스1003면류1253물냉면Set Menu with Chilled Buckwheat NoodlesMAUS호주1
493392019-03-23LT롯데9002옵션58012인상<NA>MVNM베트남1
568032019-12-08CX코엑스1002라이스류1374소갈비덮밥Rice with Beef RibsFMNG몽골3
869822021-09-05HT드래곤1002라이스류1304장어덮밥Rice with Grilled Eel(장어덮밥정식)MKOR대한민국(영주권자)1
693872023-04-03HT드래곤2012커피3004아메리칸 커피(설탕)<NA><NA><NA><NA>33
지급일자영업점코드영업점명식음메뉴그룹코드식음메뉴그룹명식음메뉴코드식음메뉴명(한글)식음메뉴명(영문)성별고객여권발급국가코드고객여권발급국가명주문수량
413102021-12-05HT드래곤1002라이스류1589두부김치Bean Curd with Stir-fried Kimchi(두부김치)FVNM베트남1
42442019-05-09HT드래곤2012커피2122아메리칸 커피(블랙)<NA>MJPN일본1
652302022-04-06LT롯데2011음료2057펩시콜라<NA>FUSA미국2
261282019-02-16LT롯데1002라이스류5716한방갈비탕Ginseng Beef Rib StewMLKA스리랑카1
205682022-08-17HT드래곤1002라이스류1422훈제오리가지볶음Stir-fried Smoked Duck with Eggplant(훈제오리가지볶음)MKOR대한민국(영주권자)1
480192023-04-01CX코엑스2009주류(일반용)2034생맥주<NA><NA><NA><NA>1
463122023-03-11CX코엑스1002라이스류1062흑후추스테이크Black Pepper SteakFCAN캐나다3
566902019-11-07HT드래곤9001기타9022광동쌍화탕(VIP,COLD)<NA>MTWN대만1
256712019-03-30CX코엑스9001기타9003땅콩(믹스넛)<NA><NA><NA><NA>1
334272020-01-31HT드래곤1002라이스류1486순두부찌개Soft Bean Curd Stew(순두부찌개)FCHN중국4