Overview

Dataset statistics

Number of variables: 8
Number of observations: 10000
Missing cells: 8109
Missing cells (%): 10.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 722.7 KiB
Average record size in memory: 74.0 B
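
The two derived figures in this table can be recomputed from the others. The sketch below is only a sanity check on the reported numbers, assuming that the missing-cell percentage is taken over all cells (variables times observations) and that 1 KiB is 1024 bytes; the variable names are illustrative.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 8
n_observations = 10_000
missing_cells = 8_109
total_size_kib = 722.7

# Share of missing cells over the whole table (assumed definition).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Average bytes per row, assuming 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Average record size in memory: {avg_record_bytes:.1f} B")
```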

Variable types

DateTime: 1
Categorical: 3
Text: 2
Numeric: 2

Dataset

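Description: Food and beverage data for customers of the Seven Luck Casino Busan Lotte branch, operated by Grand Korea Leisure. The data covers, by month, the food and beverage menu items, the number of orders, and the quantity served.
Author: Grand Korea Leisure Co., Ltd. (그랜드코리아레저(주))
URL: https://www.data.go.kr/data/15048174/fileData.do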

Alerts

영업장 has constant value "롯데" (Constant)
주문건수 is highly overall correlated with 제공수량 (High correlation)
제공수량 is highly overall correlated with 주문건수 (High correlation)
구분 is highly overall correlated with 제공대상 (High correlation)
제공대상 is highly overall correlated with 구분 (High correlation)
메뉴명_영문 has 8109 (81.1%) missing values (Missing)

Reproduction

Analysis started: 2023-12-12 14:04:03.798877
Analysis finished: 2023-12-12 14:04:05.272062
Duration: 1.47 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
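
The reported duration follows from the two timestamps. A minimal check using only the standard library (the format string is an assumption about how the timestamps are written):

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" block.
FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2023-12-12 14:04:03.798877", FMT)
finished = datetime.strptime("2023-12-12 14:04:05.272062", FMT)

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"Duration: {duration:.2f} seconds")
```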

Variables

일자
Date

Distinct: 78
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2015-01-31 00:00:00
Maximum: 2021-09-30 00:00:00
Histogram with fixed size bins (bins=50)
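
As an illustration of what "fixed size bins" means for a date column, the sketch below assigns a date to one of 50 equal-width bins between the reported minimum and maximum. It is not the plotting code used by the report; `bin_index` is a hypothetical helper, and placing the maximum in the last bin is an assumed convention.

```python
from datetime import date


def bin_index(value: date, lo: date, hi: date, bins: int = 50) -> int:
    """Return the 0-based index of the equal-width bin that contains value."""
    span = (hi - lo).days
    if span == 0:
        return 0
    position = (value - lo).days / span  # 0.0 at lo, 1.0 at hi
    # Assumed convention: the maximum falls into the last bin.
    return min(int(position * bins), bins - 1)


# Minimum and Maximum of 일자 as reported above.
lo, hi = date(2015, 1, 31), date(2021, 9, 30)
print(bin_index(date(2017, 4, 30), lo, hi))
```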

영업장
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
롯데: 10000

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 롯데
2nd row: 롯데
3rd row: 롯데
4th row: 롯데
5th row: 롯데

Common Values

Value | Count | Frequency (%)
롯데 | 10000 | 100.0%

Length

Histogram of lengths of the category


구분
Categorical

HIGH CORRELATION 

Distinct: 16
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
음료: 3144
기타: 1413
라이스류: 1161
커피: 1037
주류(일반용): 822
Other values (11): 2423

Length

Max length: 7
Median length: 2
Mean length: 2.9822
Min length: 2

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 음료
2nd row: 커피
3rd row: 기타
4th row: 음료
5th row: 커피

Common Values

Value | Count | Frequency (%)
음료 | 3144 | 31.4%
기타 | 1413 | 14.1%
라이스류 | 1161 | 11.6%
커피 | 1037 | 10.4%
주류(일반용) | 822 | 8.2%
위스키 | 673 | 6.7%
칵테일 | 647 | 6.5%
숲/샌드위치 | 279 | 2.8%
면류 | 222 | 2.2%
아이스크림 | 174 | 1.7%
Other values (6) | 428 | 4.3%
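
The "Other values" row and the frequency column can be rebuilt from the ten listed counts. The sketch below assumes frequencies are counts divided by the 10000 observations and rounded to one decimal; the dictionary simply restates the table.

```python
# Counts copied from the "Common Values" table for 구분.
n_observations = 10_000
top_counts = {
    "음료": 3144,
    "기타": 1413,
    "라이스류": 1161,
    "커피": 1037,
    "주류(일반용)": 822,
    "위스키": 673,
    "칵테일": 647,
    "숲/샌드위치": 279,
    "면류": 222,
    "아이스크림": 174,
}

# Everything not in the top ten is lumped into "Other values".
other = n_observations - sum(top_counts.values())

# Frequency (%) per category, rounded as in the report.
percentages = {
    name: round(100 * count / n_observations, 1)
    for name, count in top_counts.items()
}

print("Other values:", other)
print(percentages)
```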

Length

Histogram of lengths of the category

메뉴명_한글
Text

Distinct: 438
Distinct (%): 4.4%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 17
Median length: 13
Mean length: 7.8441
Min length: 1

Characters and Unicode

Total characters: 78441
Distinct characters: 398
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
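
To illustrate what the category counts in this section measure, the sketch below classifies the characters of one sample value with Python's standard `unicodedata` module. It is an illustration, not the profiler's own code; `category_counts` and the `NAMES` mapping from two-letter category codes to the report's labels are written here for the example.

```python
import unicodedata
from collections import Counter

# Two-letter Unicode general categories and the labels used in this report.
NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Zs": "Space Separator",
    "Nd": "Decimal Number",
    "Po": "Other Punctuation",
    "Sm": "Math Symbol",
}


def category_counts(text: str) -> Counter:
    """Count characters of text by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)


counts = category_counts("꿀차(VIP)")  # 4th sample row of 메뉴명_한글
for code, n in counts.items():
    print(NAMES.get(code, code), n)
```

Hangul syllables fall under "Other Letter" because the script has no upper or lower case.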

Unique

Unique: 38
Unique (%): 0.4%

Sample

1st row: 녹차
2nd row: 블랙커피
3rd row: 초콜릿(페레레로쉐)VIP
4th row: 꿀차(VIP)
5th row: 아이스커피(크림)
Value | Count | Frequency (%)
온더락 | 509 | 3.8%
아이스 | 485 | 3.6%
스트레이트 | 427 | 3.2%
위스키 | 409 | 3.0%
워터 | 316 | 2.3%
소주 | 248 | 1.8%
커피 | 214 | 1.6%
발렌타인21년 | 211 | 1.6%
발렌타인17년 | 209 | 1.5%
소다 | 206 | 1.5%
Other values (432) | 10304 | 76.1%

Most occurring characters

Value | Count | Frequency (%)
) | 4529 | 5.8%
( | 4529 | 5.8%
(space) | 3538 | 4.5%
 | 3077 | 3.9%
 | 2407 | 3.1%
I | 2328 | 3.0%
V | 2271 | 2.9%
P | 2242 | 2.9%
 | 1308 | 1.7%
 | 1245 | 1.6%
Other values (388) | 50967 | 65.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 52794 | 67.3%
Uppercase Letter | 11177 | 14.2%
Close Punctuation | 4555 | 5.8%
Open Punctuation | 4555 | 5.8%
Space Separator | 3538 | 4.5%
Decimal Number | 1469 | 1.9%
Other Punctuation | 208 | 0.3%
Math Symbol | 145 | 0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3077
 
5.8%
2407
 
4.6%
1308
 
2.5%
1245
 
2.4%
1220
 
2.3%
1050
 
2.0%
1009
 
1.9%
983
 
1.9%
982
 
1.9%
924
 
1.8%
Other values (355) 38589
73.1%
Uppercase Letter
Value | Count | Frequency (%)
I | 2328 | 20.8%
V | 2271 | 20.3%
P | 2242 | 20.1%
O | 1019 | 9.1%
L | 478 | 4.3%
T | 457 | 4.1%
C | 354 | 3.2%
D | 321 | 2.9%
E | 298 | 2.7%
H | 278 | 2.5%
Other values (9) | 1131 | 10.1%
Decimal Number
Value | Count | Frequency (%)
1 | 464 | 31.6%
2 | 322 | 21.9%
0 | 277 | 18.9%
7 | 209 | 14.2%
3 | 157 | 10.7%
5 | 40 | 2.7%
Close Punctuation
Value | Count | Frequency (%)
) | 4529 | 99.4%
] | 26 | 0.6%
Open Punctuation
Value | Count | Frequency (%)
( | 4529 | 99.4%
[ | 26 | 0.6%
Other Punctuation
Value | Count | Frequency (%)
& | 175 | 84.1%
, | 33 | 15.9%
Space Separator
Value | Count | Frequency (%)
(space) | 3538 | 100.0%
Math Symbol
Value | Count | Frequency (%)
+ | 145 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 52794 | 67.3%
Common | 14470 | 18.4%
Latin | 11177 | 14.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3077
 
5.8%
2407
 
4.6%
1308
 
2.5%
1245
 
2.4%
1220
 
2.3%
1050
 
2.0%
1009
 
1.9%
983
 
1.9%
982
 
1.9%
924
 
1.8%
Other values (355) 38589
73.1%
Latin
Value | Count | Frequency (%)
I | 2328 | 20.8%
V | 2271 | 20.3%
P | 2242 | 20.1%
O | 1019 | 9.1%
L | 478 | 4.3%
T | 457 | 4.1%
C | 354 | 3.2%
D | 321 | 2.9%
E | 298 | 2.7%
H | 278 | 2.5%
Other values (9) | 1131 | 10.1%
Common
Value | Count | Frequency (%)
) | 4529 | 31.3%
( | 4529 | 31.3%
(space) | 3538 | 24.5%
1 | 464 | 3.2%
2 | 322 | 2.2%
0 | 277 | 1.9%
7 | 209 | 1.4%
& | 175 | 1.2%
3 | 157 | 1.1%
+ | 145 | 1.0%
Other values (4) | 125 | 0.9%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 52794 | 67.3%
ASCII | 25647 | 32.7%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
) | 4529 | 17.7%
( | 4529 | 17.7%
(space) | 3538 | 13.8%
I | 2328 | 9.1%
V | 2271 | 8.9%
P | 2242 | 8.7%
O | 1019 | 4.0%
L | 478 | 1.9%
1 | 464 | 1.8%
T | 457 | 1.8%
Other values (23) | 3792 | 14.8%
Hangul
ValueCountFrequency (%)
3077
 
5.8%
2407
 
4.6%
1308
 
2.5%
1245
 
2.4%
1220
 
2.3%
1050
 
2.0%
1009
 
1.9%
983
 
1.9%
982
 
1.9%
924
 
1.8%
Other values (355) 38589
73.1%

메뉴명_영문
Text

MISSING 

Distinct: 222
Distinct (%): 11.7%
Missing: 8109
Missing (%): 81.1%
Memory size: 156.2 KiB

Length

Max length: 55
Median length: 39
Mean length: 21.891063
Min length: 3

Characters and Unicode

Total characters: 41396
Distinct characters: 64
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 30
Unique (%): 1.6%
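
The percentages for this column are only consistent with one another if Distinct (%) and Unique (%) are taken over the non-missing rows rather than over all 10000 observations. The sketch below checks that reading against the reported figures; the choice of denominator is an inference from the numbers, not something the report states.

```python
# Figures copied from the 메뉴명_영문 summary.
n_observations = 10_000
missing = 8_109
distinct = 222
unique = 30
total_characters = 41_396

# Rows that actually carry an English menu name.
present = n_observations - missing

missing_pct = 100 * missing / n_observations
distinct_pct = 100 * distinct / present   # assumed denominator: non-missing rows
unique_pct = 100 * unique / present       # assumed denominator: non-missing rows
mean_length = total_characters / present

print(present, f"{missing_pct:.1f}%", f"{distinct_pct:.1f}%",
      f"{unique_pct:.1f}%", f"{mean_length:.6f}")
```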

Sample

1st row: Roast walleyed pollack
2nd row: Assorted Cheese
3rd row: Bulgogi with Side dishes
4th row: Seasonal sprout salad & Wonton soup
5th row: kimchi stew set
Value | Count | Frequency (%)
with | 346 | 5.2%
rice | 317 | 4.8%
beef | 241 | 3.6%
grilled | 192 | 2.9%
soup | 187 | 2.8%
 | 165 | 2.5%
stew | 144 | 2.2%
and | 101 | 1.5%
ramen | 100 | 1.5%
cheese | 97 | 1.5%
Other values (271) | 4731 | 71.5%
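
The word table above lists lowercase tokens ("cheese", although the sample shows "Assorted Cheese"), which suggests the names are lowercased and split into words before counting. A minimal sketch of that idea, assuming plain whitespace tokenization; the profiler's exact tokenization rules are not shown in this report, and `token_counts` is a hypothetical helper.

```python
from collections import Counter


def token_counts(values):
    """Count lowercase whitespace-separated tokens, skipping missing values."""
    counts = Counter()
    for value in values:
        if value is None:  # missing menu name
            continue
        counts.update(value.lower().split())
    return counts


# The five sample rows of 메뉴명_영문 plus one missing value.
sample = [
    "Roast walleyed pollack",
    "Assorted Cheese",
    "Bulgogi with Side dishes",
    "Seasonal sprout salad & Wonton soup",
    "kimchi stew set",
    None,
]
counts = token_counts(sample)
print(counts.most_common(3))
```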

Most occurring characters

Value | Count | Frequency (%)
(space) | 5043 | 12.2%
e | 4763 | 11.5%
i | 2922 | 7.1%
a | 2257 | 5.5%
o | 2081 | 5.0%
r | 1989 | 4.8%
t | 1807 | 4.4%
s | 1738 | 4.2%
l | 1597 | 3.9%
n | 1514 | 3.7%
Other values (54) | 15685 | 37.9%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 30761 | 74.3%
Space Separator | 5043 | 12.2%
Uppercase Letter | 4719 | 11.4%
Other Punctuation | 332 | 0.8%
Other Letter | 237 | 0.6%
Dash Punctuation | 159 | 0.4%
Close Punctuation | 66 | 0.2%
Open Punctuation | 66 | 0.2%
Decimal Number | 13 | < 0.1%

Most frequent character per category

Lowercase Letter
Value | Count | Frequency (%)
e | 4763 | 15.5%
i | 2922 | 9.5%
a | 2257 | 7.3%
o | 2081 | 6.8%
r | 1989 | 6.5%
t | 1807 | 5.9%
s | 1738 | 5.7%
l | 1597 | 5.2%
n | 1514 | 4.9%
d | 1491 | 4.8%
Other values (15) | 8602 | 28.0%
Uppercase Letter
Value | Count | Frequency (%)
S | 1208 | 25.6%
R | 491 | 10.4%
B | 424 | 9.0%
C | 381 | 8.1%
F | 310 | 6.6%
G | 307 | 6.5%
T | 222 | 4.7%
P | 211 | 4.5%
A | 187 | 4.0%
D | 174 | 3.7%
Other values (15) | 804 | 17.0%
Other Letter
ValueCountFrequency (%)
68
28.7%
68
28.7%
68
28.7%
11
 
4.6%
11
 
4.6%
11
 
4.6%
Other Punctuation
Value | Count | Frequency (%)
& | 190 | 57.2%
' | 94 | 28.3%
, | 48 | 14.5%
Space Separator
Value | Count | Frequency (%)
(space) | 5043 | 100.0%
Dash Punctuation
Value | Count | Frequency (%)
- | 159 | 100.0%
Close Punctuation
Value | Count | Frequency (%)
) | 66 | 100.0%
Open Punctuation
Value | Count | Frequency (%)
( | 66 | 100.0%
Decimal Number
Value | Count | Frequency (%)
7 | 13 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 35480 | 85.7%
Common | 5679 | 13.7%
Hangul | 237 | 0.6%

Most frequent character per script

Latin
Value | Count | Frequency (%)
e | 4763 | 13.4%
i | 2922 | 8.2%
a | 2257 | 6.4%
o | 2081 | 5.9%
r | 1989 | 5.6%
t | 1807 | 5.1%
s | 1738 | 4.9%
l | 1597 | 4.5%
n | 1514 | 4.3%
d | 1491 | 4.2%
Other values (40) | 13321 | 37.5%
Common
Value | Count | Frequency (%)
(space) | 5043 | 88.8%
& | 190 | 3.3%
- | 159 | 2.8%
' | 94 | 1.7%
) | 66 | 1.2%
( | 66 | 1.2%
, | 48 | 0.8%
7 | 13 | 0.2%
Hangul
ValueCountFrequency (%)
68
28.7%
68
28.7%
68
28.7%
11
 
4.6%
11
 
4.6%
11
 
4.6%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 41159 | 99.4%
Hangul | 237 | 0.6%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 5043 | 12.3%
e | 4763 | 11.6%
i | 2922 | 7.1%
a | 2257 | 5.5%
o | 2081 | 5.1%
r | 1989 | 4.8%
t | 1807 | 4.4%
s | 1738 | 4.2%
l | 1597 | 3.9%
n | 1514 | 3.7%
Other values (48) | 15448 | 37.5%
Hangul
ValueCountFrequency (%)
68
28.7%
68
28.7%
68
28.7%
11
 
4.6%
11
 
4.6%
11
 
4.6%

제공대상
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
일반: 6131
VIP: 3869

Length

Max length: 3
Median length: 2
Mean length: 2.3869
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 일반
2nd row: 일반
3rd row: VIP
4th row: VIP
5th row: 일반

Common Values

Value | Count | Frequency (%)
일반 | 6131 | 61.3%
VIP | 3869 | 38.7%

Length

Histogram of lengths of the category


주문건수
Real number (ℝ)

HIGH CORRELATION 

Distinct: 1325
Distinct (%): 13.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 247.487
Minimum: 1
Maximum: 6260
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 2
Q1: 20
Median: 91
Q3: 250
95-th percentile: 1019.1
Maximum: 6260
Range: 6259
Interquartile range (IQR): 230

Descriptive statistics

Standard deviation: 495.18392
Coefficient of variation (CV): 2.0008482
Kurtosis: 44.920631
Mean: 247.487
Median Absolute Deviation (MAD): 82
Skewness: 5.6440412
Sum: 2474870
Variance: 245207.12
Monotonicity: Not monotonic
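
Several of these statistics are simple functions of the others, which gives a quick consistency check on the 주문건수 figures. The sketch below uses the standard definitions (CV as standard deviation over mean, variance as the squared standard deviation, IQR as Q3 minus Q1); small differences in the last digits are expected because the inputs are already rounded.

```python
# Figures copied from the 주문건수 summary.
n_observations = 10_000
total = 2_474_870
std = 495.18392
q1, q3 = 20, 250
minimum, maximum = 1, 6260

mean = total / n_observations      # reported: 247.487
cv = std / mean                    # reported: 2.0008482
variance = std ** 2                # reported: 245207.12
iqr = q3 - q1                      # reported: 230
value_range = maximum - minimum    # reported: 6259

print(mean, round(cv, 7), round(variance, 2), iqr, value_range)
```

A coefficient of variation near 2, together with a mean (247.487) well above the median (91), is consistent with the strong right skew reported above.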
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 393 | 3.9%
2 | 264 | 2.6%
3 | 213 | 2.1%
4 | 175 | 1.8%
6 | 158 | 1.6%
5 | 156 | 1.6%
7 | 152 | 1.5%
8 | 109 | 1.1%
9 | 103 | 1.0%
11 | 97 | 1.0%
Other values (1315) | 8180 | 81.8%

Minimum 10 values

Value | Count | Frequency (%)
1 | 393 | 3.9%
2 | 264 | 2.6%
3 | 213 | 2.1%
4 | 175 | 1.8%
5 | 156 | 1.6%
6 | 158 | 1.6%
7 | 152 | 1.5%
8 | 109 | 1.1%
9 | 103 | 1.0%
10 | 86 | 0.9%

Maximum 10 values

Value | Count | Frequency (%)
6260 | 1 | < 0.1%
5766 | 1 | < 0.1%
5763 | 1 | < 0.1%
5706 | 1 | < 0.1%
5573 | 1 | < 0.1%
5569 | 1 | < 0.1%
5524 | 1 | < 0.1%
5386 | 1 | < 0.1%
5344 | 1 | < 0.1%
5338 | 1 | < 0.1%

제공수량
Real number (ℝ)

HIGH CORRELATION 

Distinct: 1555
Distinct (%): 15.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 419.294
Minimum: 1
Maximum: 26950
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 3
Q1: 30
Median: 121
Q3: 339
95-th percentile: 1356
Maximum: 26950
Range: 26949
Interquartile range (IQR): 309

Descriptive statistics

Standard deviation: 1500.2126
Coefficient of variation (CV): 3.5779491
Kurtosis: 158.5998
Mean: 419.294
Median Absolute Deviation (MAD): 107
Skewness: 11.504012
Sum: 4192940
Variance: 2250637.8
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 251 | 2.5%
2 | 225 | 2.2%
3 | 153 | 1.5%
4 | 131 | 1.3%
6 | 129 | 1.3%
5 | 127 | 1.3%
8 | 123 | 1.2%
7 | 113 | 1.1%
9 | 87 | 0.9%
14 | 75 | 0.8%
Other values (1545) | 8586 | 85.9%

Minimum 10 values

Value | Count | Frequency (%)
1 | 251 | 2.5%
2 | 225 | 2.2%
3 | 153 | 1.5%
4 | 131 | 1.3%
5 | 127 | 1.3%
6 | 129 | 1.3%
7 | 113 | 1.1%
8 | 123 | 1.2%
9 | 87 | 0.9%
10 | 61 | 0.6%

Maximum 10 values

Value | Count | Frequency (%)
26950 | 1 | < 0.1%
25927 | 1 | < 0.1%
25662 | 1 | < 0.1%
25561 | 1 | < 0.1%
25031 | 1 | < 0.1%
24920 | 1 | < 0.1%
24575 | 1 | < 0.1%
24567 | 1 | < 0.1%
24299 | 1 | < 0.1%
23570 | 1 | < 0.1%

Correlations

 | 일자 | 구분 | 제공대상 | 주문건수 | 제공수량
일자 | 1.000 | 0.000 | 0.000 | 0.082 | 0.000
구분 | 0.000 | 1.000 | 0.792 | 0.321 | 0.167
제공대상 | 0.000 | 0.792 | 1.000 | 0.292 | 0.128
주문건수 | 0.082 | 0.321 | 0.292 | 1.000 | 0.776
제공수량 | 0.000 | 0.167 | 0.128 | 0.776 | 1.000

 | 제공대상 | 구분
제공대상 | 1.000 | 0.644
구분 | 0.644 | 1.000

 | 주문건수 | 제공수량 | 구분 | 제공대상
주문건수 | 1.000 | 0.933 | 0.131 | 0.224
제공수량 | 0.933 | 1.000 | 0.066 | 0.098
구분 | 0.131 | 0.066 | 1.000 | 0.644
제공대상 | 0.224 | 0.098 | 0.644 | 1.000
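
The "High correlation" alerts near the top of the report line up with the first matrix if pairs are flagged once their coefficient passes a cut-off. The sketch below reproduces that with an assumed threshold of 0.5; the report itself does not state the threshold, and `highly_correlated` is a hypothetical helper.

```python
# Coefficients copied from the first correlation matrix in this section.
columns = ["일자", "구분", "제공대상", "주문건수", "제공수량"]
matrix = [
    [1.000, 0.000, 0.000, 0.082, 0.000],
    [0.000, 1.000, 0.792, 0.321, 0.167],
    [0.000, 0.792, 1.000, 0.292, 0.128],
    [0.082, 0.321, 0.292, 1.000, 0.776],
    [0.000, 0.167, 0.128, 0.776, 1.000],
]

THRESHOLD = 0.5  # assumed cut-off, not stated in the report


def highly_correlated(columns, matrix, threshold):
    """Return (column_a, column_b, coefficient) for pairs at or above threshold."""
    pairs = []
    for i in range(len(columns)):
        for j in range(i + 1, len(columns)):  # upper triangle only
            if abs(matrix[i][j]) >= threshold:
                pairs.append((columns[i], columns[j], matrix[i][j]))
    return pairs


pairs = highly_correlated(columns, matrix, THRESHOLD)
for a, b, value in pairs:
    print(f"{a} is highly correlated with {b} ({value:.3f})")
```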

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

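 | 일자 | 영업장 | 구분 | 메뉴명_한글 | 메뉴명_영문 | 제공대상 | 주문건수제공수량
4339 | 2017-04-30 | 롯데 | 음료 | 녹차 | <NA> | 일반 | 567708
8086 | 2019-05-31 | 롯데 | 커피 | 블랙커피 | <NA> | 일반 | 5291392
10518 | 2021-04-30 | 롯데 | 기타 | 초콜릿(페레레로쉐)VIP | <NA> | VIP | 109286
8451 | 2019-08-31 | 롯데 | 음료 | 꿀차(VIP) | <NA> | VIP | 127199
7804 | 2019-03-31 | 롯데 | 커피 | 아이스커피(크림) | <NA> | 일반 | 2227
6817 | 2018-08-31 | 롯데 | 기타 | 먹태(VIP) | Roast walleyed pollack | VIP | 3637
4317 | 2017-04-30 | 롯데 | 칵테일 | 위스키 콜라(버번 콕) | <NA> | 일반 | 12261465
1598 | 2015-11-30 | 롯데 | 칵테일 | 캄파리 소다 | <NA> | 일반 | 2223
1266 | 2015-09-30 | 롯데 | 맥주 | 생맥주 | <NA> | VIP | 826
7160 | 2018-11-30 | 롯데 | 위스키 | 조니워커 블루 소다 | <NA> | VIP | 3039

 | 일자 | 영업장 | 구분 | 메뉴명_한글 | 메뉴명_영문 | 제공대상 | 주문건수제공수량
5141 | 2017-09-30 | 롯데 | 커피 | 카페라테(카페오레) | <NA> | 일반 | 349374
6504 | 2018-06-30 | 롯데 | 기타 | 쥐포&한치(VIP) | <NA> | VIP | 5258
4718 | 2017-07-31 | 롯데 | 라이스류 | 갈비탕 | Ginseng beef rib stew | 일반 | 914948
10917 | 2021-08-31 | 롯데 | 음료 | 녹차 | <NA> | 일반 | 2584
3817 | 2017-01-31 | 롯데 | 라이스류 | (2인상) | <NA> | 일반 | 238285
8700 | 2019-10-31 | 롯데 | 음료 | 우유(COLD) | <NA> | 일반 | 130185
6540 | 2018-07-31 | 롯데 | 면류 | 해운대라면 | Seafood Ramen | 일반 | 618659
6716 | 2018-08-31 | 롯데 | 주류(일반용) | 브랜디 스트레이트 | <NA> | 일반 | 773
5434 | 2017-11-30 | 롯데 | 음료 | 우유(HOT) | <NA> | 일반 | 602677
5848 | 2018-02-28 | 롯데 | 음료 | 아이스 우롱차 | <NA> | 일반 | 442503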