Overview

Dataset statistics

Number of variables8
Number of observations10000
Missing cells7962
Missing cells (%)10.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory722.7 KiB
Average record size in memory74.0 B

Variable types

DateTime1
Categorical3
Text2
Numeric2

Dataset

Description그랜드코리아레저에서 운영하는 세븐럭카지노 강북 힐튼점에서 고객에게 제공하는 식음료데 대한 데이터로 월별, 식음료메뉴, 주문 건수, 수량 데이터를 제공합니다.
Author그랜드코리아레저(주)
URLhttps://www.data.go.kr/data/15048173/fileData.do

Alerts

영업장 has constant value ""Constant
주문건수 is highly overall correlated with 제공수량High correlation
제공수량 is highly overall correlated with 주문건수High correlation
구분 is highly overall correlated with 제공대상High correlation
제공대상 is highly overall correlated with 구분High correlation
메뉴명_영문 has 7962 (79.6%) missing valuesMissing

Reproduction

Analysis started2023-12-12 03:46:15.977211
Analysis finished2023-12-12 03:46:17.968812
Duration1.99 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

일자
Date

Distinct77
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2015-01-31 00:00:00
Maximum2021-09-30 00:00:00
2023-12-12T12:46:18.067299image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:46:18.271332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

영업장
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
힐튼
10000 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row힐튼
2nd row힐튼
3rd row힐튼
4th row힐튼
5th row힐튼

Common Values

ValueCountFrequency (%)
힐튼 10000
100.0%

Length

2023-12-12T12:46:18.451206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:46:18.597906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
힐튼 10000
100.0%

구분
Categorical

HIGH CORRELATION 

Distinct16
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
음료
2199 
기타
1761 
라이스류
1220 
커피
802 
주류(일반용)
802 
Other values (11)
3216 

Length

Max length7
Median length2
Mean length3.2274
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row기타
2nd rowVIP안주
3rd row음료
4th row라이스류
5th row커피

Common Values

ValueCountFrequency (%)
음료 2199
22.0%
기타 1761
17.6%
라이스류 1220
12.2%
커피 802
 
8.0%
주류(일반용) 802
 
8.0%
VIP안주 633
 
6.3%
칵테일 626
 
6.3%
위스키 492
 
4.9%
생과일주스 264
 
2.6%
면류 262
 
2.6%
Other values (6) 939
9.4%

Length

2023-12-12T12:46:18.765211image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
음료 2199
22.0%
기타 1761
17.6%
라이스류 1220
12.2%
커피 802
 
8.0%
주류(일반용 802
 
8.0%
vip안주 633
 
6.3%
칵테일 626
 
6.3%
위스키 492
 
4.9%
생과일주스 264
 
2.6%
면류 262
 
2.6%
Other values (6) 939
9.4%
Distinct509
Distinct (%)5.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T12:46:19.137539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length12
Mean length7.3026
Min length1

Characters and Unicode

Total characters73026
Distinct characters388
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique53 ?
Unique (%)0.5%

Sample

1st row레스토랑에서식사
2nd row복숭아(VIP용)
3rd row보이차(주전자)
4th row전가복(VIP)
5th row아메리칸 커피(블랙)
ValueCountFrequency (%)
아이스 548
 
4.3%
온더락 296
 
2.3%
스트레이트 292
 
2.3%
위스키 246
 
1.9%
발렌타인17년 217
 
1.7%
조니워커블루 193
 
1.5%
소주 191
 
1.5%
vip용 185
 
1.4%
워터 184
 
1.4%
소다 151
 
1.2%
Other values (496) 10340
80.5%
2023-12-12T12:46:19.770381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 3550
 
4.9%
) 3550
 
4.9%
3330
 
4.6%
2843
 
3.9%
2353
 
3.2%
I 1591
 
2.2%
P 1484
 
2.0%
V 1460
 
2.0%
1431
 
2.0%
1290
 
1.8%
Other values (378) 50144
68.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 51365
70.3%
Uppercase Letter 9952
 
13.6%
Open Punctuation 3550
 
4.9%
Close Punctuation 3550
 
4.9%
Space Separator 2843
 
3.9%
Decimal Number 1269
 
1.7%
Dash Punctuation 202
 
0.3%
Other Punctuation 180
 
0.2%
Math Symbol 89
 
0.1%
Connector Punctuation 26
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3330
 
6.5%
2353
 
4.6%
1431
 
2.8%
1290
 
2.5%
1151
 
2.2%
1018
 
2.0%
955
 
1.9%
889
 
1.7%
882
 
1.7%
837
 
1.6%
Other values (343) 37229
72.5%
Uppercase Letter
ValueCountFrequency (%)
I 1591
16.0%
P 1484
14.9%
V 1460
14.7%
O 1028
10.3%
T 669
 
6.7%
L 557
 
5.6%
C 379
 
3.8%
A 368
 
3.7%
D 354
 
3.6%
H 329
 
3.3%
Other values (10) 1733
17.4%
Decimal Number
ValueCountFrequency (%)
1 330
26.0%
0 287
22.6%
7 217
17.1%
3 200
15.8%
2 105
 
8.3%
8 67
 
5.3%
5 63
 
5.0%
Other Punctuation
ValueCountFrequency (%)
, 108
60.0%
& 72
40.0%
Open Punctuation
ValueCountFrequency (%)
( 3550
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3550
100.0%
Space Separator
ValueCountFrequency (%)
2843
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 202
100.0%
Math Symbol
ValueCountFrequency (%)
+ 89
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 26
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 51365
70.3%
Common 11709
 
16.0%
Latin 9952
 
13.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3330
 
6.5%
2353
 
4.6%
1431
 
2.8%
1290
 
2.5%
1151
 
2.2%
1018
 
2.0%
955
 
1.9%
889
 
1.7%
882
 
1.7%
837
 
1.6%
Other values (343) 37229
72.5%
Latin
ValueCountFrequency (%)
I 1591
16.0%
P 1484
14.9%
V 1460
14.7%
O 1028
10.3%
T 669
 
6.7%
L 557
 
5.6%
C 379
 
3.8%
A 368
 
3.7%
D 354
 
3.6%
H 329
 
3.3%
Other values (10) 1733
17.4%
Common
ValueCountFrequency (%)
( 3550
30.3%
) 3550
30.3%
2843
24.3%
1 330
 
2.8%
0 287
 
2.5%
7 217
 
1.9%
- 202
 
1.7%
3 200
 
1.7%
, 108
 
0.9%
2 105
 
0.9%
Other values (5) 317
 
2.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 51365
70.3%
ASCII 21661
29.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 3550
16.4%
) 3550
16.4%
2843
13.1%
I 1591
 
7.3%
P 1484
 
6.9%
V 1460
 
6.7%
O 1028
 
4.7%
T 669
 
3.1%
L 557
 
2.6%
C 379
 
1.7%
Other values (25) 4550
21.0%
Hangul
ValueCountFrequency (%)
3330
 
6.5%
2353
 
4.6%
1431
 
2.8%
1290
 
2.5%
1151
 
2.2%
1018
 
2.0%
955
 
1.9%
889
 
1.7%
882
 
1.7%
837
 
1.6%
Other values (343) 37229
72.5%

메뉴명_영문
Text

MISSING 

Distinct315
Distinct (%)15.5%
Missing7962
Missing (%)79.6%
Memory size156.2 KiB
2023-12-12T12:46:20.100596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length74
Median length55
Mean length27.791953
Min length3

Characters and Unicode

Total characters56640
Distinct characters297
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique41 ?
Unique (%)2.0%

Sample

1st rowStewed Assorted Delicacies (전가복)
2nd rowSeasonal Fresh Fruits
3rd rowBeef Jerky
4th rowTraditional Korean Sweets
5th rowKorean Ramen(Shin Ramen,신라면)
ValueCountFrequency (%)
with 635
 
8.3%
beef 268
 
3.5%
rice 264
 
3.5%
grilled 202
 
2.6%
stir-fried 196
 
2.6%
and 188
 
2.5%
fried 152
 
2.0%
soup 146
 
1.9%
spicy 108
 
1.4%
pork 105
 
1.4%
Other values (488) 5384
70.4%
2023-12-12T12:46:20.578567image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5680
 
10.0%
e 5123
 
9.0%
i 3513
 
6.2%
a 2380
 
4.2%
o 2356
 
4.2%
r 2166
 
3.8%
t 2002
 
3.5%
d 1843
 
3.3%
S 1828
 
3.2%
n 1520
 
2.7%
Other values (287) 28229
49.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 32914
58.1%
Other Letter 8135
 
14.4%
Uppercase Letter 6513
 
11.5%
Space Separator 5680
 
10.0%
Open Punctuation 1462
 
2.6%
Close Punctuation 1458
 
2.6%
Dash Punctuation 255
 
0.5%
Other Punctuation 181
 
0.3%
Decimal Number 42
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
330
 
4.1%
330
 
4.1%
319
 
3.9%
198
 
2.4%
198
 
2.4%
186
 
2.3%
172
 
2.1%
172
 
2.1%
159
 
2.0%
152
 
1.9%
Other values (229) 5919
72.8%
Lowercase Letter
ValueCountFrequency (%)
e 5123
15.6%
i 3513
 
10.7%
a 2380
 
7.2%
o 2356
 
7.2%
r 2166
 
6.6%
t 2002
 
6.1%
d 1843
 
5.6%
n 1520
 
4.6%
l 1511
 
4.6%
h 1373
 
4.2%
Other values (15) 9127
27.7%
Uppercase Letter
ValueCountFrequency (%)
S 1828
28.1%
R 761
11.7%
B 639
 
9.8%
C 387
 
5.9%
F 355
 
5.5%
P 342
 
5.3%
A 288
 
4.4%
G 280
 
4.3%
M 266
 
4.1%
T 239
 
3.7%
Other values (15) 1128
17.3%
Other Punctuation
ValueCountFrequency (%)
& 116
64.1%
, 54
29.8%
' 11
 
6.1%
Space Separator
ValueCountFrequency (%)
5680
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1462
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1458
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 255
100.0%
Decimal Number
ValueCountFrequency (%)
7 42
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 39427
69.6%
Common 9078
 
16.0%
Hangul 8135
 
14.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
330
 
4.1%
330
 
4.1%
319
 
3.9%
198
 
2.4%
198
 
2.4%
186
 
2.3%
172
 
2.1%
172
 
2.1%
159
 
2.0%
152
 
1.9%
Other values (229) 5919
72.8%
Latin
ValueCountFrequency (%)
e 5123
 
13.0%
i 3513
 
8.9%
a 2380
 
6.0%
o 2356
 
6.0%
r 2166
 
5.5%
t 2002
 
5.1%
d 1843
 
4.7%
S 1828
 
4.6%
n 1520
 
3.9%
l 1511
 
3.8%
Other values (40) 15185
38.5%
Common
ValueCountFrequency (%)
5680
62.6%
( 1462
 
16.1%
) 1458
 
16.1%
- 255
 
2.8%
& 116
 
1.3%
, 54
 
0.6%
7 42
 
0.5%
' 11
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 48505
85.6%
Hangul 8135
 
14.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5680
 
11.7%
e 5123
 
10.6%
i 3513
 
7.2%
a 2380
 
4.9%
o 2356
 
4.9%
r 2166
 
4.5%
t 2002
 
4.1%
d 1843
 
3.8%
S 1828
 
3.8%
n 1520
 
3.1%
Other values (48) 20094
41.4%
Hangul
ValueCountFrequency (%)
330
 
4.1%
330
 
4.1%
319
 
3.9%
198
 
2.4%
198
 
2.4%
186
 
2.3%
172
 
2.1%
172
 
2.1%
159
 
2.0%
152
 
1.9%
Other values (229) 5919
72.8%

제공대상
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
일반
8234 
VIP
1766 

Length

Max length3
Median length2
Mean length2.1766
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반
2nd rowVIP
3rd rowVIP
4th rowVIP
5th row일반

Common Values

ValueCountFrequency (%)
일반 8234
82.3%
VIP 1766
 
17.7%

Length

2023-12-12T12:46:20.747616image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:46:20.845102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반 8234
82.3%
vip 1766
 
17.7%

주문건수
Real number (ℝ)

HIGH CORRELATION 

Distinct1520
Distinct (%)15.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean297.5191
Minimum1
Maximum7030
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T12:46:20.968467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q120
median73
Q3237
95-th percentile1512
Maximum7030
Range7029
Interquartile range (IQR)217

Descriptive statistics

Standard deviation635.18142
Coefficient of variation (CV)2.1349265
Kurtosis22.752232
Mean297.5191
Median Absolute Deviation (MAD)66
Skewness4.2143538
Sum2975191
Variance403455.43
MonotonicityNot monotonic
2023-12-12T12:46:21.140633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 376
 
3.8%
2 257
 
2.6%
3 176
 
1.8%
4 176
 
1.8%
5 137
 
1.4%
6 132
 
1.3%
7 122
 
1.2%
8 121
 
1.2%
11 99
 
1.0%
16 97
 
1.0%
Other values (1510) 8307
83.1%
ValueCountFrequency (%)
1 376
3.8%
2 257
2.6%
3 176
1.8%
4 176
1.8%
5 137
 
1.4%
6 132
 
1.3%
7 122
 
1.2%
8 121
 
1.2%
9 95
 
0.9%
10 95
 
0.9%
ValueCountFrequency (%)
7030 1
< 0.1%
6885 1
< 0.1%
6872 1
< 0.1%
6295 1
< 0.1%
6051 1
< 0.1%
5884 1
< 0.1%
5870 1
< 0.1%
5700 1
< 0.1%
5576 1
< 0.1%
5570 1
< 0.1%

제공수량
Real number (ℝ)

HIGH CORRELATION 

Distinct2293
Distinct (%)22.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean836.9407
Minimum1
Maximum46754
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T12:46:21.332558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3
Q132
median138
Q3487.5
95-th percentile4090.4
Maximum46754
Range46753
Interquartile range (IQR)455.5

Descriptive statistics

Standard deviation2282.961
Coefficient of variation (CV)2.7277452
Kurtosis55.705897
Mean836.9407
Median Absolute Deviation (MAD)128
Skewness6.0231656
Sum8369407
Variance5211910.9
MonotonicityNot monotonic
2023-12-12T12:46:21.487356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 272
 
2.7%
2 205
 
2.1%
4 152
 
1.5%
3 151
 
1.5%
5 106
 
1.1%
7 100
 
1.0%
6 92
 
0.9%
8 85
 
0.9%
10 78
 
0.8%
9 76
 
0.8%
Other values (2283) 8683
86.8%
ValueCountFrequency (%)
1 272
2.7%
2 205
2.1%
3 151
1.5%
4 152
1.5%
5 106
 
1.1%
6 92
 
0.9%
7 100
 
1.0%
8 85
 
0.9%
9 76
 
0.8%
10 78
 
0.8%
ValueCountFrequency (%)
46754 1
< 0.1%
38187 1
< 0.1%
36257 1
< 0.1%
31428 1
< 0.1%
30511 1
< 0.1%
26558 1
< 0.1%
25357 1
< 0.1%
20383 1
< 0.1%
20263 1
< 0.1%
19903 1
< 0.1%

Interactions

2023-12-12T12:46:17.341710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:46:17.036683image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:46:17.517289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:46:17.186968image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T12:46:21.595708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일자구분제공대상주문건수제공수량
일자1.0000.0000.0610.0680.000
구분0.0001.0000.8780.4600.185
제공대상0.0610.8781.0000.1350.084
주문건수0.0680.4600.1351.0000.449
제공수량0.0000.1850.0840.4491.000
2023-12-12T12:46:21.730983image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
제공대상구분
제공대상1.0000.732
구분0.7321.000
2023-12-12T12:46:21.863773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
주문건수제공수량구분제공대상
주문건수1.0000.9040.1990.103
제공수량0.9041.0000.0730.064
구분0.1990.0731.0000.732
제공대상0.1030.0640.7321.000

Missing values

2023-12-12T12:46:17.691055image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T12:46:17.882478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

일자영업장구분메뉴명_한글메뉴명_영문제공대상주문건수제공수량
83652018-11-30힐튼기타레스토랑에서식사<NA>일반1516
126212021-08-31힐튼VIP안주복숭아(VIP용)<NA>VIP4545
85132018-12-31힐튼음료보이차(주전자)<NA>VIP1720
73222018-06-30힐튼라이스류전가복(VIP)Stewed Assorted Delicacies (전가복)VIP274277
8172015-05-31힐튼커피아메리칸 커피(블랙)<NA>일반372379
115422020-07-31힐튼주류(일반용)소주 스트레이트<NA>일반68
107082019-12-31힐튼기타광동쌍화탕(VIP,COLD)<NA>일반2653
84982018-12-31힐튼음료아이스 초코<NA>일반15115
57942017-09-30힐튼음료소다수<NA>일반1390
45312017-02-28힐튼음료생수(BOTTLE)<NA>일반51715228
일자영업장구분메뉴명_한글메뉴명_영문제공대상주문건수제공수량
108312020-01-31힐튼음료비락식혜(VIP)<NA>일반3643
49982017-05-31힐튼브랜디헤네시XO온더락<NA>일반3464
52572017-06-30힐튼음료토마토주스<NA>일반55528
87992019-02-28힐튼맥주아사히<NA>일반284322
81132018-10-31힐튼주류(일반용)소주 스트레이트<NA>일반39
59082017-10-31힐튼위스키발렌타인17년 소다<NA>일반1120
87752019-02-28힐튼VIP안주과일치즈(VIP용)<NA>VIP810
92652019-04-30힐튼기타초콜릿(스니커즈)<NA>일반143433
16672015-10-31힐튼음료자스민차(주전자)<NA>VIP1620
35402016-09-30힐튼라이스류갈치구이정식(VIP용)Table D'hote Set with Grilled Hairtail Fish(갈치구이정식)VIP352367