Overview

Dataset statistics

Number of variables9
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory810.5 KiB
Average record size in memory83.0 B

Variable types

Categorical4
DateTime1
Text1
Numeric3

Dataset

Description경상북도 봉화군 봉화사랑상품권 가맹점 중 포스(POS)를 활용하고 있는 업소의 포스 데이터 기반의 판매 데이터로
Author경상북도 봉화군
URLhttps://www.data.go.kr/data/15097851/fileData.do

Alerts

가맹점업종 is highly overall correlated with 가맹점명 and 1 other fieldsHigh correlation
가맹점명 is highly overall correlated with 가맹점업종 and 1 other fieldsHigh correlation
가맹점소재지 is highly overall correlated with 가맹점명 and 2 other fieldsHigh correlation
단가 is highly overall correlated with 총결제금액High correlation
총결제금액 is highly overall correlated with 단가High correlation
결제시간 is highly overall correlated with 가맹점소재지High correlation

Reproduction

Analysis started2023-12-12 06:18:06.778494
Analysis finished2023-12-12 06:18:08.882029
Duration2.1 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

가맹점명
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
더카페브레드유
6142 
달달이치킨
1210 
땅땅치킨
1158 
꿀밤
724 
동아식육점
 
324
Other values (2)
 
442

Length

Max length7
Median length7
Mean length5.8277
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row더카페브레드유
2nd row땅땅치킨
3rd row더카페브레드유
4th row대부도
5th row더카페브레드유

Common Values

ValueCountFrequency (%)
더카페브레드유 6142
61.4%
달달이치킨 1210
 
12.1%
땅땅치킨 1158
 
11.6%
꿀밤 724
 
7.2%
동아식육점 324
 
3.2%
대부도 235
 
2.4%
구마식당 207
 
2.1%

Length

2023-12-12T15:18:08.967702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:18:09.091621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
더카페브레드유 6142
61.4%
달달이치킨 1210
 
12.1%
땅땅치킨 1158
 
11.6%
꿀밤 724
 
7.2%
동아식육점 324
 
3.2%
대부도 235
 
2.4%
구마식당 207
 
2.1%

가맹점업종
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
커피 전문점
6142 
치킨 전문점
2368 
한식 일반 음식점업
931 
육류 소매업
 
324
한식 해산물 요리 전문점
 
235

Length

Max length13
Median length6
Mean length6.5369
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row커피 전문점
2nd row치킨 전문점
3rd row커피 전문점
4th row한식 해산물 요리 전문점
5th row커피 전문점

Common Values

ValueCountFrequency (%)
커피 전문점 6142
61.4%
치킨 전문점 2368
 
23.7%
한식 일반 음식점업 931
 
9.3%
육류 소매업 324
 
3.2%
한식 해산물 요리 전문점 235
 
2.4%

Length

2023-12-12T15:18:09.244947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:18:09.368639image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전문점 8745
40.9%
커피 6142
28.7%
치킨 2368
 
11.1%
한식 1166
 
5.4%
일반 931
 
4.4%
음식점업 931
 
4.4%
육류 324
 
1.5%
소매업 324
 
1.5%
해산물 235
 
1.1%
요리 235
 
1.1%

가맹점소재지
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
춘양면
6349 
봉화읍
3651 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row춘양면
2nd row봉화읍
3rd row춘양면
4th row봉화읍
5th row춘양면

Common Values

ValueCountFrequency (%)
춘양면 6349
63.5%
봉화읍 3651
36.5%

Length

2023-12-12T15:18:09.503023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:18:09.600511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
춘양면 6349
63.5%
봉화읍 3651
36.5%
Distinct861
Distinct (%)8.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2019-06-02 00:00:00
Maximum2021-10-31 00:00:00
2023-12-12T15:18:09.728471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:18:09.913860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

결제시간
Categorical

HIGH CORRELATION 

Distinct22
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
23:00
804 
15:00
790 
12:00
782 
13:00
764 
14:00
746 
Other values (17)
6114 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row14:00
2nd row23:00
3rd row08:00
4th row19:00
5th row15:00

Common Values

ValueCountFrequency (%)
23:00 804
 
8.0%
15:00 790
 
7.9%
12:00 782
 
7.8%
13:00 764
 
7.6%
14:00 746
 
7.5%
16:00 674
 
6.7%
10:00 568
 
5.7%
11:00 554
 
5.5%
09:00 549
 
5.5%
00:00 507
 
5.1%
Other values (12) 3262
32.6%

Length

2023-12-12T15:18:10.083409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
23:00 804
 
8.0%
15:00 790
 
7.9%
12:00 782
 
7.8%
13:00 764
 
7.6%
14:00 746
 
7.5%
16:00 674
 
6.7%
10:00 568
 
5.7%
11:00 554
 
5.5%
09:00 549
 
5.5%
00:00 507
 
5.1%
Other values (12) 3262
32.6%
Distinct412
Distinct (%)4.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T15:18:10.338386image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length16
Mean length5.9251
Min length2

Characters and Unicode

Total characters59251
Distinct characters356
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique63 ?
Unique (%)0.6%

Sample

1st row라떼(ICE)
2nd row매콤떡볶이
3rd row커스타드크림빵
4th row우럭
5th row카라멜마끼아또(ICE)
ValueCountFrequency (%)
l 518
 
4.6%
아메리카노(ice 442
 
4.0%
아메리카노 421
 
3.8%
큐브쌀식빵 383
 
3.4%
커스타드크림빵 373
 
3.3%
단팥빵 372
 
3.3%
쌀식빵(우유/밤 322
 
2.9%
블루베리쌀빵 244
 
2.2%
봉투 239
 
2.1%
소주 226
 
2.0%
Other values (411) 7611
68.3%
2023-12-12T15:18:10.806588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2281
 
3.8%
( 2029
 
3.4%
) 2029
 
3.4%
1468
 
2.5%
1460
 
2.5%
1443
 
2.4%
1209
 
2.0%
1180
 
2.0%
1022
 
1.7%
1004
 
1.7%
Other values (346) 44126
74.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 47860
80.8%
Uppercase Letter 3037
 
5.1%
Open Punctuation 2029
 
3.4%
Close Punctuation 2029
 
3.4%
Decimal Number 1263
 
2.1%
Space Separator 1180
 
2.0%
Other Punctuation 551
 
0.9%
Lowercase Letter 537
 
0.9%
Math Symbol 428
 
0.7%
Dash Punctuation 337
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2281
 
4.8%
1468
 
3.1%
1460
 
3.1%
1443
 
3.0%
1209
 
2.5%
1022
 
2.1%
1004
 
2.1%
969
 
2.0%
932
 
1.9%
907
 
1.9%
Other values (313) 35165
73.5%
Uppercase Letter
ValueCountFrequency (%)
I 751
24.7%
C 751
24.7%
E 751
24.7%
L 537
17.7%
S 175
 
5.8%
H 20
 
0.7%
T 20
 
0.7%
O 20
 
0.7%
D 12
 
0.4%
Decimal Number
ValueCountFrequency (%)
0 561
44.4%
5 237
18.8%
1 162
 
12.8%
3 138
 
10.9%
4 62
 
4.9%
7 53
 
4.2%
2 50
 
4.0%
Lowercase Letter
ValueCountFrequency (%)
c 478
89.0%
s 21
 
3.9%
j 12
 
2.2%
m 12
 
2.2%
l 12
 
2.2%
g 1
 
0.2%
k 1
 
0.2%
Other Punctuation
ValueCountFrequency (%)
/ 487
88.4%
* 38
 
6.9%
: 18
 
3.3%
, 6
 
1.1%
2
 
0.4%
Open Punctuation
ValueCountFrequency (%)
( 2029
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2029
100.0%
Space Separator
ValueCountFrequency (%)
1180
100.0%
Math Symbol
ValueCountFrequency (%)
+ 428
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 337
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 47832
80.7%
Common 7817
 
13.2%
Latin 3574
 
6.0%
Han 28
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2281
 
4.8%
1468
 
3.1%
1460
 
3.1%
1443
 
3.0%
1209
 
2.5%
1022
 
2.1%
1004
 
2.1%
969
 
2.0%
932
 
1.9%
907
 
1.9%
Other values (312) 35137
73.5%
Common
ValueCountFrequency (%)
( 2029
26.0%
) 2029
26.0%
1180
15.1%
0 561
 
7.2%
/ 487
 
6.2%
+ 428
 
5.5%
- 337
 
4.3%
5 237
 
3.0%
1 162
 
2.1%
3 138
 
1.8%
Other values (7) 229
 
2.9%
Latin
ValueCountFrequency (%)
I 751
21.0%
C 751
21.0%
E 751
21.0%
L 537
15.0%
c 478
13.4%
S 175
 
4.9%
s 21
 
0.6%
H 20
 
0.6%
T 20
 
0.6%
O 20
 
0.6%
Other values (6) 50
 
1.4%
Han
ValueCountFrequency (%)
28
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 47832
80.7%
ASCII 11389
 
19.2%
CJK 28
 
< 0.1%
None 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2281
 
4.8%
1468
 
3.1%
1460
 
3.1%
1443
 
3.0%
1209
 
2.5%
1022
 
2.1%
1004
 
2.1%
969
 
2.0%
932
 
1.9%
907
 
1.9%
Other values (312) 35137
73.5%
ASCII
ValueCountFrequency (%)
( 2029
17.8%
) 2029
17.8%
1180
10.4%
I 751
 
6.6%
C 751
 
6.6%
E 751
 
6.6%
0 561
 
4.9%
L 537
 
4.7%
/ 487
 
4.3%
c 478
 
4.2%
Other values (22) 1835
16.1%
CJK
ValueCountFrequency (%)
28
100.0%
None
ValueCountFrequency (%)
2
100.0%

주문개수
Real number (ℝ)

Distinct29
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.9062
Minimum1
Maximum50
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T15:18:10.955400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q32
95-th percentile5
Maximum50
Range49
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.978533
Coefficient of variation (CV)1.0379462
Kurtosis122.88092
Mean1.9062
Median Absolute Deviation (MAD)0
Skewness7.7962057
Sum19062
Variance3.914593
MonotonicityNot monotonic
2023-12-12T15:18:11.110277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=29)
ValueCountFrequency (%)
1 6008
60.1%
2 2082
 
20.8%
3 854
 
8.5%
4 437
 
4.4%
5 234
 
2.3%
6 119
 
1.2%
7 87
 
0.9%
8 59
 
0.6%
9 34
 
0.3%
12 17
 
0.2%
Other values (19) 69
 
0.7%
ValueCountFrequency (%)
1 6008
60.1%
2 2082
 
20.8%
3 854
 
8.5%
4 437
 
4.4%
5 234
 
2.3%
6 119
 
1.2%
7 87
 
0.9%
8 59
 
0.6%
9 34
 
0.3%
10 15
 
0.1%
ValueCountFrequency (%)
50 1
< 0.1%
46 1
< 0.1%
45 1
< 0.1%
38 1
< 0.1%
34 1
< 0.1%
32 1
< 0.1%
30 1
< 0.1%
28 1
< 0.1%
24 1
< 0.1%
22 1
< 0.1%

단가
Real number (ℝ)

HIGH CORRELATION 

Distinct331
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6743.0172
Minimum50
Maximum200000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T15:18:11.257784image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum50
5-th percentile1000
Q12200
median4000
Q38000
95-th percentile19500
Maximum200000
Range199950
Interquartile range (IQR)5800

Descriptive statistics

Standard deviation8402.6919
Coefficient of variation (CV)1.2461324
Kurtosis68.673047
Mean6743.0172
Median Absolute Deviation (MAD)2000
Skewness5.5962378
Sum67430172
Variance70605232
MonotonicityNot monotonic
2023-12-12T15:18:11.398242image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4000 1267
 
12.7%
3500 1207
 
12.1%
1500 934
 
9.3%
4500 709
 
7.1%
3000 683
 
6.8%
1800 503
 
5.0%
5000 417
 
4.2%
2000 388
 
3.9%
10000 340
 
3.4%
17000 256
 
2.6%
Other values (321) 3296
33.0%
ValueCountFrequency (%)
50 93
 
0.9%
100 223
 
2.2%
500 128
 
1.3%
525 1
 
< 0.1%
700 15
 
0.1%
800 30
 
0.3%
1000 78
 
0.8%
1500 934
9.3%
1700 101
 
1.0%
1800 503
5.0%
ValueCountFrequency (%)
200000 1
 
< 0.1%
150000 1
 
< 0.1%
143300 1
 
< 0.1%
120590 1
 
< 0.1%
100870 1
 
< 0.1%
100190 1
 
< 0.1%
100000 7
0.1%
86820 1
 
< 0.1%
81840 1
 
< 0.1%
79560 1
 
< 0.1%

총결제금액
Real number (ℝ)

HIGH CORRELATION 

Distinct421
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11204.282
Minimum50
Maximum336000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T15:18:11.548133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum50
5-th percentile1500
Q13500
median6000
Q315000
95-th percentile37000
Maximum336000
Range335950
Interquartile range (IQR)11500

Descriptive statistics

Standard deviation16120.685
Coefficient of variation (CV)1.4387969
Kurtosis53.85466
Mean11204.282
Median Absolute Deviation (MAD)4000
Skewness5.5662758
Sum1.1204282 × 108
Variance2.598765 × 108
MonotonicityNot monotonic
2023-12-12T15:18:11.694752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3500 706
 
7.1%
3000 685
 
6.9%
4000 651
 
6.5%
4500 644
 
6.4%
1500 395
 
4.0%
5000 351
 
3.5%
6000 340
 
3.4%
8000 339
 
3.4%
16000 328
 
3.3%
10000 315
 
3.1%
Other values (411) 5246
52.5%
ValueCountFrequency (%)
50 69
0.7%
100 168
1.7%
150 6
 
0.1%
200 41
 
0.4%
300 23
 
0.2%
400 4
 
< 0.1%
500 81
0.8%
600 1
 
< 0.1%
700 7
 
0.1%
800 18
 
0.2%
ValueCountFrequency (%)
336000 1
 
< 0.1%
240000 1
 
< 0.1%
223998 1
 
< 0.1%
207590 1
 
< 0.1%
204000 1
 
< 0.1%
200000 4
< 0.1%
194700 1
 
< 0.1%
180000 1
 
< 0.1%
176000 1
 
< 0.1%
168000 3
< 0.1%

Interactions

2023-12-12T15:18:08.303330image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:18:07.613101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:18:07.925203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:18:08.397550image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:18:07.694857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:18:08.051769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:18:08.497909image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:18:07.800037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:18:08.182423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T15:18:11.794490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
가맹점명가맹점업종가맹점소재지결제시간주문개수단가총결제금액
가맹점명1.0001.0001.0000.7570.1580.3990.337
가맹점업종1.0001.0000.8470.7260.1810.4390.322
가맹점소재지1.0000.8471.0000.9610.0760.2050.259
결제시간0.7570.7260.9611.0000.1440.0620.208
주문개수0.1580.1810.0760.1441.0000.0000.697
단가0.3990.4390.2050.0620.0001.0000.723
총결제금액0.3370.3220.2590.2080.6970.7231.000
2023-12-12T15:18:11.908138image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
결제시간가맹점업종가맹점명가맹점소재지
결제시간1.0000.4650.4480.851
가맹점업종0.4651.0001.0000.964
가맹점명0.4481.0001.0001.000
가맹점소재지0.8510.9641.0001.000
2023-12-12T15:18:12.007522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
주문개수단가총결제금액가맹점명가맹점업종가맹점소재지결제시간
주문개수1.000-0.1630.3970.0800.0760.0590.053
단가-0.1631.0000.8010.2230.2720.2050.024
총결제금액0.3970.8011.0000.1850.1920.2580.082
가맹점명0.0800.2230.1851.0001.0001.0000.448
가맹점업종0.0760.2720.1921.0001.0000.9640.465
가맹점소재지0.0590.2050.2581.0000.9641.0000.851
결제시간0.0530.0240.0820.4480.4650.8511.000

Missing values

2023-12-12T15:18:08.620827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:18:08.784021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

가맹점명가맹점업종가맹점소재지결제년월일결제시간주문상품명주문개수단가총결제금액
80105더카페브레드유커피 전문점춘양면2021-10-1914:00라떼(ICE)140004000
92611땅땅치킨치킨 전문점봉화읍2020-03-2623:00매콤떡볶이140004000
44679더카페브레드유커피 전문점춘양면2020-05-2508:00커스타드크림빵215003000
22298대부도한식 해산물 요리 전문점봉화읍2020-07-1819:00우럭12500025000
41333더카페브레드유커피 전문점춘양면2020-04-0915:00카라멜마끼아또(ICE)145004500
51826더카페브레드유커피 전문점춘양면2020-09-0313:00우유한잔120002000
62322더카페브레드유커피 전문점춘양면2021-02-1116:00쌀식빵(우유/밤)245009000
21958대부도한식 해산물 요리 전문점봉화읍2020-07-0511:00연어초밥11000010000
23084더카페브레드유커피 전문점춘양면2019-06-0609:00아메리카노 S325007500
26496더카페브레드유커피 전문점춘양면2019-07-3017:00사과쌀빵135003500
가맹점명가맹점업종가맹점소재지결제년월일결제시간주문상품명주문개수단가총결제금액
33516더카페브레드유커피 전문점춘양면2019-11-2708:00아메리카노(ICE)135003500
67515더카페브레드유커피 전문점춘양면2021-04-2814:00콜라215003000
10026달달이치킨치킨 전문점봉화읍2020-01-0822:00500cc9400036000
47951더카페브레드유커피 전문점춘양면2020-07-1209:00블루베리쌀빵135003500
67774더카페브레드유커피 전문점춘양면2021-05-0212:00쌀쿠키(초코/호두)318005400
69902더카페브레드유커피 전문점춘양면2021-05-3117:00통밀곡물식빵160006000
93989땅땅치킨치킨 전문점봉화읍2020-05-2119:00땅땅양념구이(독도愛촌닭)11750017500
44824더카페브레드유커피 전문점춘양면2020-05-2618:00초코머핀117001700
76065더카페브레드유커피 전문점춘양면2021-08-2411:00단팥빵7180012600
56816더카페브레드유커피 전문점춘양면2020-11-0910:00사과쌀빵235007000