Overview

Dataset statistics

Number of variables: 18
Number of observations: 30
Missing cells: 40
Missing cells (%): 7.4%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 4.6 KiB
Average record size in memory: 156.4 B
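
The missing-cell percentage follows from the counts in this block: 40 missing cells out of 30 observations times 18 variables. A minimal sketch of that arithmetic; the function name is hypothetical and not part of ydata-profiling:

```python
def missing_cell_percentage(n_missing: int, n_rows: int, n_columns: int) -> float:
    """Share of missing cells, in percent, over all cells of the table."""
    total_cells = n_rows * n_columns
    return 100.0 * n_missing / total_cells


# Counts taken from the dataset statistics above.
pct = missing_cell_percentage(n_missing=40, n_rows=30, n_columns=18)
print(f"{pct:.1f}%")
```

The 40 missing cells are the 20 missing values in 가맹점업종명 plus the 20 in 가맹점우편번호 listed under Alerts.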

Variable types

DateTime: 2
Text: 2
Numeric: 5
Categorical: 8
Boolean: 1

Dataset

Description: Sample data
Author: 코나아이㈜
URL: https://bigdata-region.kr/#/dataset/633c7e45-8b46-4845-aed6-bb1ef62af6a7

Alerts

일반주간결제시작일자 has constant value "" [Constant]
일반주간결제종료일자 has constant value "" [Constant]
사용여부 is highly overall correlated with 회원코드 and 7 other fields [High correlation]
시군구명 is highly overall correlated with 회원코드 and 12 other fields [High correlation]
경도 is highly overall correlated with 가맹점번호 and 6 other fields [High correlation]
위도 is highly overall correlated with 가맹점번호 and 6 other fields [High correlation]
시도명 is highly overall correlated with 회원코드 and 12 other fields [High correlation]
성별코드 is highly overall correlated with 회원코드 and 4 other fields [High correlation]
읍면동명 is highly overall correlated with 회원코드 and 12 other fields [High correlation]
결제상품ID is highly overall correlated with 회원코드 and 6 other fields [High correlation]
결제상품명 is highly overall correlated with 회원코드 and 6 other fields [High correlation]
회원코드 is highly overall correlated with 연령대코드 and 8 other fields [High correlation]
가맹점번호 is highly overall correlated with 결제금액 and 6 other fields [High correlation]
연령대코드 is highly overall correlated with 회원코드 and 6 other fields [High correlation]
가맹점우편번호 is highly overall correlated with 회원코드 and 8 other fields [High correlation]
결제금액 is highly overall correlated with 가맹점번호 and 5 other fields [High correlation]
위도 is highly imbalanced (58.2%) [Imbalance]
경도 is highly imbalanced (58.2%) [Imbalance]
가맹점업종명 has 20 (66.7%) missing values [Missing]
가맹점우편번호 has 20 (66.7%) missing values [Missing]
결제금액 has 20 (66.7%) zeros [Zeros]

Reproduction

Analysis started: 2023-12-10 14:12:41.992240
Analysis finished: 2023-12-10 14:12:48.753331
Duration: 6.76 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
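
The reported duration can be checked from the two timestamps in this block. A small sketch using only the standard library:

```python
from datetime import datetime

# Timestamps copied from the Reproduction block above.
started = datetime.fromisoformat("2023-12-10 14:12:41.992240")
finished = datetime.fromisoformat("2023-12-10 14:12:48.753331")

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```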

Variables

일반주간결제시작일자
DateTime

CONSTANT 

Distinct: 1
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B
Minimum: 2020-04-27 00:00:00
Maximum: 2020-04-27 00:00:00
Histogram with fixed size bins (bins=1)

일반주간결제종료일자
DateTime

CONSTANT 

Distinct: 1
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B
Minimum: 2020-05-03 00:00:00
Maximum: 2020-05-03 00:00:00
Histogram with fixed size bins (bins=1)

카드번호
Text

Distinct: 25
Distinct (%): 83.3%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 44
Median length: 44
Mean length: 44
Min length: 44

Characters and Unicode

Total characters: 1320
Distinct characters: 65
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
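
As an illustration of those character properties, the sketch below counts Unicode general categories for the first 카드번호 sample value using Python's standard `unicodedata` module. It also decodes the value as Base64, on the assumption (not stated in the report) that the 44-character strings ending in `=` are Base64 text.

```python
import base64
import unicodedata
from collections import Counter

# First sample value of the 카드번호 column, copied from the report.
sample = "0u/25LGjbIQX0oTiqd79fLowjq31KB42TJqx9+VrnjE="

# Count characters per Unicode general category:
# Lu = uppercase letter, Ll = lowercase letter, Nd = decimal number,
# Sm = math symbol ('+', '='), Po = other punctuation ('/').
categories = Counter(unicodedata.category(ch) for ch in sample)

# Assumption: the value is Base64; decode it to see how many raw bytes it holds.
raw = base64.b64decode(sample)

print(dict(categories))
print(len(sample), len(raw))
```

Under that assumption each value carries 32 raw bytes, which is the size of a SHA-256 digest, so the column may contain hashed card numbers. The report itself does not confirm this.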

Unique

Unique: 24
Unique (%): 80.0%

Sample

1st row: 0u/25LGjbIQX0oTiqd79fLowjq31KB42TJqx9+VrnjE=
2nd row: 1K56osTohSaArYgtvLvUGKa/VmB+Rn3UclJY7X8dizU=
3rd row: 2BiBIc7MndH0e9FjNEowiXw2nG+OVoVPWVoweXPeJRE=
4th row: 4+KfDtCVLKlySytN+UxCnLWIUdqouuaGgl+gis3RLww=
5th row: 4+KfDtCVLKlySytN+UxCnLWIUdqouuaGgl+gis3RLww=
Value | Count | Frequency (%)
4+kfdtcvlklysytn+uxcnlwiudqouuaggl+gis3rlww | 6 | 20.0%
0u/25lgjbiqx0otiqd79flowjq31kb42tjqx9+vrnje | 1 | 3.3%
58g87takdwnzw8jpmy7tshj4u8ntlc+zlbrzhbhjlcc | 1 | 3.3%
c+atetv+edy54/xcxhs1u9esyzmrgo2ntz6uomkap6c | 1 | 3.3%
bj+07wo+plog/wo0ydecll2esrb3uqo6p2sw1wz2lri | 1 | 3.3%
atgvbcykjkkgg3mh19fnxvdxswzthklafrat8/x23ue | 1 | 3.3%
a2jlkweelcw78u41/ihakib6+ezoeqco2zicxiysbug | 1 | 3.3%
a1kv9tuklez9zlyzmijvxoxlkdvfs8ul5vgs/9l4eyg | 1 | 3.3%
9sivrjsig96x+7b2opgbfp7e7czcj2pqdatsiwflmwq | 1 | 3.3%
80moits/8qx6n1gsmbeqgdcxibdo9eddgj7grh2dwuo | 1 | 3.3%
Other values (15) | 15 | 50.0%

Most occurring characters

Value | Count | Frequency (%)
+ | 37 | 2.8%
l | 36 | 2.7%
K | 34 | 2.6%
L | 34 | 2.6%
4 | 33 | 2.5%
w | 33 | 2.5%
U | 33 | 2.5%
C | 32 | 2.4%
g | 31 | 2.3%
= | 30 | 2.3%
Other values (55) | 987 | 74.8%

Most occurring categories

Value | Count | Frequency (%)
Uppercase Letter | 530 | 40.2%
Lowercase Letter | 511 | 38.7%
Decimal Number | 192 | 14.5%
Math Symbol | 67 | 5.1%
Other Punctuation | 20 | 1.5%

Most frequent character per category

Lowercase Letter
Value | Count | Frequency (%)
l | 36 | 7.0%
w | 33 | 6.5%
g | 31 | 6.1%
u | 28 | 5.5%
s | 26 | 5.1%
o | 25 | 4.9%
t | 24 | 4.7%
a | 24 | 4.7%
c | 22 | 4.3%
y | 22 | 4.3%
Other values (16) | 240 | 47.0%

Uppercase Letter
Value | Count | Frequency (%)
K | 34 | 6.4%
L | 34 | 6.4%
U | 33 | 6.2%
C | 32 | 6.0%
E | 27 | 5.1%
D | 24 | 4.5%
I | 24 | 4.5%
B | 22 | 4.2%
V | 21 | 4.0%
W | 21 | 4.0%
Other values (16) | 258 | 48.7%

Decimal Number
Value | Count | Frequency (%)
4 | 33 | 17.2%
2 | 25 | 13.0%
9 | 22 | 11.5%
3 | 22 | 11.5%
7 | 22 | 11.5%
0 | 16 | 8.3%
8 | 15 | 7.8%
1 | 13 | 6.8%
6 | 13 | 6.8%
5 | 11 | 5.7%

Math Symbol
Value | Count | Frequency (%)
+ | 37 | 55.2%
= | 30 | 44.8%

Other Punctuation
Value | Count | Frequency (%)
/ | 20 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 1041 | 78.9%
Common | 279 | 21.1%

Most frequent character per script

Latin
Value | Count | Frequency (%)
l | 36 | 3.5%
K | 34 | 3.3%
L | 34 | 3.3%
w | 33 | 3.2%
U | 33 | 3.2%
C | 32 | 3.1%
g | 31 | 3.0%
u | 28 | 2.7%
E | 27 | 2.6%
s | 26 | 2.5%
Other values (42) | 727 | 69.8%

Common
Value | Count | Frequency (%)
+ | 37 | 13.3%
4 | 33 | 11.8%
= | 30 | 10.8%
2 | 25 | 9.0%
9 | 22 | 7.9%
3 | 22 | 7.9%
7 | 22 | 7.9%
/ | 20 | 7.2%
0 | 16 | 5.7%
8 | 15 | 5.4%
Other values (3) | 37 | 13.3%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 1320 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
+ | 37 | 2.8%
l | 36 | 2.7%
K | 34 | 2.6%
L | 34 | 2.6%
4 | 33 | 2.5%
w | 33 | 2.5%
U | 33 | 2.5%
C | 32 | 2.4%
g | 31 | 2.3%
= | 30 | 2.3%
Other values (55) | 987 | 74.8%

회원코드
Real number (ℝ)

HIGH CORRELATION 

Distinct: 25
Distinct (%): 83.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3.0097667 × 10^9
Minimum: 3.001943 × 10^9
Maximum: 3.0193212 × 10^9
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 402.0 B

Quantile statistics

Minimum: 3.001943 × 10^9
5-th percentile: 3.0021099 × 10^9
Q1: 3.0033966 × 10^9
Median: 3.0115602 × 10^9
Q3: 3.0135681 × 10^9
95-th percentile: 3.0148591 × 10^9
Maximum: 3.0193212 × 10^9
Range: 17378153
Interquartile range (IQR): 10171526

Descriptive statistics

Standard deviation: 5287346.5
Coefficient of variation (CV): 0.0017567297
Kurtosis: -1.1127143
Mean: 3.0097667 × 10^9
Median Absolute Deviation (MAD): 3298922
Skewness: -0.43499842
Sum: 9.0293001 × 10^10
Variance: 2.7956033 × 10^13
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=25)
Common values
Value | Count | Frequency (%)
3014859103 | 6 | 20.0%
3019321162 | 1 | 3.3%
3013683145 | 1 | 3.3%
3012668156 | 1 | 3.3%
3007243149 | 1 | 3.3%
3002110711 | 1 | 3.3%
3007890125 | 1 | 3.3%
3011347170 | 1 | 3.3%
3002114442 | 1 | 3.3%
3012833140 | 1 | 3.3%
Other values (15) | 15 | 50.0%

Minimum 10 values
Value | Count | Frequency (%)
3001943009 | 1 | 3.3%
3002109642 | 1 | 3.3%
3002110323 | 1 | 3.3%
3002110711 | 1 | 3.3%
3002110736 | 1 | 3.3%
3002111088 | 1 | 3.3%
3002112047 | 1 | 3.3%
3002114442 | 1 | 3.3%
3007243149 | 1 | 3.3%
3007890125 | 1 | 3.3%

Maximum 10 values
Value | Count | Frequency (%)
3019321162 | 1 | 3.3%
3014859103 | 6 | 20.0%
3013683145 | 1 | 3.3%
3013223142 | 1 | 3.3%
3012833140 | 1 | 3.3%
3012668193 | 1 | 3.3%
3012668156 | 1 | 3.3%
3012625133 | 1 | 3.3%
3012607104 | 1 | 3.3%
3011744117 | 1 | 3.3%

가맹점번호
Real number (ℝ)

HIGH CORRELATION 

Distinct: 11
Distinct (%): 36.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6.6666692 × 10^14
Minimum: 7.0784354 × 10^8
Maximum: 1 × 10^15
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 402.0 B

Quantile statistics

Minimum: 7.0784354 × 10^8
5-th percentile: 7.1309002 × 10^8
Q1: 7.9064383 × 10^8
Median: 1 × 10^15
Q3: 1 × 10^15
95-th percentile: 1 × 10^15
Maximum: 1 × 10^15
Range: 9.9999929 × 10^14
Interquartile range (IQR): 9.9999921 × 10^14

Descriptive statistics

Standard deviation: 4.7946294 × 10^14
Coefficient of variation (CV): 0.71919415
Kurtosis: -1.5535714
Mean: 6.6666692 × 10^14
Median Absolute Deviation (MAD): 0
Skewness: -0.74488049
Sum: 2.0000007 × 10^16
Variance: 2.2988471 × 10^29
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=11)
Common values
Value | Count | Frequency (%)
999999999999999 | 20 | 66.7%
713079425 | 1 | 3.3%
713102962 | 1 | 3.3%
715026043 | 1 | 3.3%
761314525 | 1 | 3.3%
791837729 | 1 | 3.3%
732340966 | 1 | 3.3%
707843541 | 1 | 3.3%
793925679 | 1 | 3.3%
749835951 | 1 | 3.3%

Minimum 10 values
Value | Count | Frequency (%)
707843541 | 1 | 3.3%
713079425 | 1 | 3.3%
713102962 | 1 | 3.3%
715026043 | 1 | 3.3%
732340966 | 1 | 3.3%
749835951 | 1 | 3.3%
761314525 | 1 | 3.3%
790245863 | 1 | 3.3%
791837729 | 1 | 3.3%
793925679 | 1 | 3.3%

Maximum 10 values
Value | Count | Frequency (%)
999999999999999 | 20 | 66.7%
793925679 | 1 | 3.3%
791837729 | 1 | 3.3%
790245863 | 1 | 3.3%
761314525 | 1 | 3.3%
749835951 | 1 | 3.3%
732340966 | 1 | 3.3%
715026043 | 1 | 3.3%
713102962 | 1 | 3.3%
713079425 | 1 | 3.3%
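
The value 999999999999999 occurs in 20 of the 30 rows, and in the sample rows it always coincides with 사용여부 = N and 결제금액 = 0. That pattern suggests a placeholder for "no merchant", although the report does not say so. The sketch below, under that assumption, shows how strongly the placeholder drives the reported mean. The `SENTINEL` name is illustrative only.

```python
from statistics import mean

# Assumption: this value is a placeholder, not a real merchant number.
SENTINEL = 999_999_999_999_999

# Values and counts copied from the frequency tables above.
merchant_numbers = [SENTINEL] * 20 + [
    707843541, 713079425, 713102962, 715026043, 732340966,
    749835951, 761314525, 790245863, 791837729, 793925679,
]

# Mean over all 30 rows, as the report computes it.
raw_mean = sum(merchant_numbers) / len(merchant_numbers)

# Mean over the 10 rows that hold a non-placeholder merchant number.
real_numbers = [m for m in merchant_numbers if m != SENTINEL]
masked_mean = mean(real_numbers)

print(f"{raw_mean:.7e}")
print(f"{masked_mean:.1f}")
```

Merchant numbers are identifiers, so neither mean is meaningful in itself. The comparison only illustrates why the numeric statistics and correlations for this column should be read with the placeholder in mind.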

성별코드
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 6.7%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: F
2nd row: M
3rd row: F
4th row: M
5th row: M

Common Values

Value | Count | Frequency (%)
M | 22 | 73.3%
F | 8 | 26.7%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

Value | Count | Frequency (%)
m | 22 | 73.3%
f | 8 | 26.7%

연령대코드
Real number (ℝ)

HIGH CORRELATION 

Distinct: 7
Distinct (%): 23.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 35.333333
Minimum: 20
Maximum: 80
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 402.0 B

Quantile statistics

Minimum: 20
5-th percentile: 20
Q1: 20
Median: 25
Q3: 50
95-th percentile: 70
Maximum: 80
Range: 60
Interquartile range (IQR): 30

Descriptive statistics

Standard deviation: 18.519949
Coefficient of variation (CV): 0.5241495
Kurtosis: -0.32678498
Mean: 35.333333
Median Absolute Deviation (MAD): 5
Skewness: 0.88583353
Sum: 1060
Variance: 342.98851
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=7)
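Common values
Value | Count | Frequency (%)
20 | 15 | 50.0%
50 | 6 | 20.0%
40 | 3 | 10.0%
70 | 2 | 6.7%
30 | 2 | 6.7%
80 | 1 | 3.3%
60 | 1 | 3.3%

Minimum 10 values
Value | Count | Frequency (%)
20 | 15 | 50.0%
30 | 2 | 6.7%
40 | 3 | 10.0%
50 | 6 | 20.0%
60 | 1 | 3.3%
70 | 2 | 6.7%
80 | 1 | 3.3%

Maximum 10 values
Value | Count | Frequency (%)
80 | 1 | 3.3%
70 | 2 | 6.7%
60 | 1 | 3.3%
50 | 6 | 20.0%
40 | 3 | 10.0%
30 | 2 | 6.7%
20 | 15 | 50.0%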
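
Because the frequency table for 연령대코드 lists every value, the descriptive statistics above can be recomputed from it. A minimal sketch with the standard `statistics` module, assuming the report uses the sample (n − 1) variance and a MAD taken around the median, which is consistent with the reported figures:

```python
import statistics

# Age-band code -> count, copied from the frequency table above.
counts = {20: 15, 30: 2, 40: 3, 50: 6, 60: 1, 70: 2, 80: 1}

# Expand the table back into the 30 individual observations.
values = [code for code, n in counts.items() for _ in range(n)]

mean = statistics.mean(values)
median = statistics.median(values)
stdev = statistics.stdev(values)        # sample standard deviation (n - 1)
variance = statistics.variance(values)  # sample variance (n - 1)
mad = statistics.median([abs(v - median) for v in values])
cv = stdev / mean                       # coefficient of variation

print(sum(values), round(mean, 6), median, mad)
print(round(stdev, 6), round(variance, 5), round(cv, 7))
```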

결제상품ID
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 6.7%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 140000146000
2nd row: 140000146000
3rd row: 140000146000
4th row: 140000134000
5th row: 140000134000

Common Values

Value | Count | Frequency (%)
140000146000 | 17 | 56.7%
140000134000 | 13 | 43.3%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

Value | Count | Frequency (%)
140000146000 | 17 | 56.7%
140000134000 | 13 | 43.3%

결제상품명
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 6.7%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 17
Median length: 17
Mean length: 14.833333
Min length: 12

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 양평농협 창립50주년 기프트카드
2nd row: 양평농협 창립50주년 기프트카드
3rd row: 양평농협 창립50주년 기프트카드
4th row: 가평사랑상품권(장병용)
5th row: 가평사랑상품권(장병용)

Common Values

Value | Count | Frequency (%)
양평농협 창립50주년 기프트카드 | 17 | 56.7%
가평사랑상품권(장병용) | 13 | 43.3%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

Value | Count | Frequency (%)
양평농협 | 17 | 26.6%
창립50주년 | 17 | 26.6%
기프트카드 | 17 | 26.6%
가평사랑상품권(장병용 | 13 | 20.3%

가맹점업종명
Text

MISSING 

Distinct: 5
Distinct (%): 50.0%
Missing: 20
Missing (%): 66.7%
Memory size: 372.0 B

Length

Max length: 7
Median length: 6.5
Mean length: 5.6
Min length: 4

Characters and Unicode

Total characters: 56
Distinct characters: 19
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1
Unique (%): 10.0%

Sample

1st row: 일반휴게음식
2nd row: 일반휴게음식
3rd row: 신변잡화
4th row: 신변잡화
5th row: 음료식품

Value | Count | Frequency (%)
유통업 | 4 | 28.6%
일반휴게음식 | 3 | 21.4%
신변잡화 | 2 | 14.3%
비영리 | 2 | 14.3%
영리 | 2 | 14.3%
음료식품 | 1 | 7.1%

Most occurring characters

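Value | Count | Frequency (%)
(character lost) | 4 | 7.1%
(character lost) | 4 | 7.1%
(character lost) | 4 | 7.1%
(character lost) | 4 | 7.1%
(character lost) | 4 | 7.1%
(character lost) | 4 | 7.1%
(character lost) | 4 | 7.1%
(character lost) | 4 | 7.1%
(character lost) | 3 | 5.4%
(character lost) | 3 | 5.4%
Other values (9) | 18 | 32.1%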

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 52 | 92.9%
Space Separator | 4 | 7.1%

Most frequent character per category

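Other Letter
Value | Count | Frequency (%)
(character lost) | 4 | 7.7%
(character lost) | 4 | 7.7%
(character lost) | 4 | 7.7%
(character lost) | 4 | 7.7%
(character lost) | 4 | 7.7%
(character lost) | 4 | 7.7%
(character lost) | 4 | 7.7%
(character lost) | 3 | 5.8%
(character lost) | 3 | 5.8%
(character lost) | 3 | 5.8%
Other values (8) | 15 | 28.8%

Space Separator
Value | Count | Frequency (%)
(space) | 4 | 100.0%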

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 52 | 92.9%
Common | 4 | 7.1%

Most frequent character per script

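Hangul
Value | Count | Frequency (%)
(character lost) | 4 | 7.7%
(character lost) | 4 | 7.7%
(character lost) | 4 | 7.7%
(character lost) | 4 | 7.7%
(character lost) | 4 | 7.7%
(character lost) | 4 | 7.7%
(character lost) | 4 | 7.7%
(character lost) | 3 | 5.8%
(character lost) | 3 | 5.8%
(character lost) | 3 | 5.8%
Other values (8) | 15 | 28.8%

Common
Value | Count | Frequency (%)
(space) | 4 | 100.0%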

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 52 | 92.9%
ASCII | 4 | 7.1%

Most frequent character per block

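ASCII
Value | Count | Frequency (%)
(space) | 4 | 100.0%

Hangul
Value | Count | Frequency (%)
(character lost) | 4 | 7.7%
(character lost) | 4 | 7.7%
(character lost) | 4 | 7.7%
(character lost) | 4 | 7.7%
(character lost) | 4 | 7.7%
(character lost) | 4 | 7.7%
(character lost) | 4 | 7.7%
(character lost) | 3 | 5.8%
(character lost) | 3 | 5.8%
(character lost) | 3 | 5.8%
Other values (8) | 15 | 28.8%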

가맹점우편번호
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct: 8
Distinct (%): 80.0%
Missing: 20
Missing (%): 66.7%
Infinite: 0
Infinite (%): 0.0%
Mean: 12447.3
Minimum: 12408
Maximum: 12561
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 402.0 B

Quantile statistics

Minimum: 12408
5-th percentile: 12411.6
Q1: 12416.5
Median: 12419
Q3: 12432.75
95-th percentile: 12560.1
Maximum: 12561
Range: 153
Interquartile range (IQR): 16.25

Descriptive statistics

Standard deviation: 59.833194
Coefficient of variation (CV): 0.0048069215
Kurtosis: 1.2750527
Mean: 12447.3
Median Absolute Deviation (MAD): 3
Skewness: 1.7179903
Sum: 124473
Variance: 3580.0111
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=8)
Common values
Value | Count | Frequency (%)
12419 | 2 | 6.7%
12416 | 2 | 6.7%
12418 | 1 | 3.3%
12408 | 1 | 3.3%
12561 | 1 | 3.3%
12437 | 1 | 3.3%
12559 | 1 | 3.3%
12420 | 1 | 3.3%
(Missing) | 20 | 66.7%

Minimum 10 values
Value | Count | Frequency (%)
12408 | 1 | 3.3%
12416 | 2 | 6.7%
12418 | 1 | 3.3%
12419 | 2 | 6.7%
12420 | 1 | 3.3%
12437 | 1 | 3.3%
12559 | 1 | 3.3%
12561 | 1 | 3.3%

Maximum 10 values
Value | Count | Frequency (%)
12561 | 1 | 3.3%
12559 | 1 | 3.3%
12437 | 1 | 3.3%
12420 | 1 | 3.3%
12419 | 2 | 6.7%
12418 | 1 | 3.3%
12416 | 2 | 6.7%
12408 | 1 | 3.3%

시도명
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 6.7%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 4
Median length: 4
Mean length: 3.8333333
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: 경기도
5th row: 경기도

Common Values

Value | Count | Frequency (%)
<NA> | 25 | 83.3%
경기도 | 5 | 16.7%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

Value | Count | Frequency (%)
na | 25 | 83.3%
경기도 | 5 | 16.7%
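
The profile counts 0 missing values for 시도명 even though `<NA>` is its most common value, with a length of 4. That is consistent with the column storing the literal four-character string `<NA>` rather than a true null. A small sketch of how such tokens could be counted as missing; `count_missing` and `NA_TOKEN` are hypothetical names used for illustration:

```python
# Assumption: the literal string "<NA>" stands for a missing value.
NA_TOKEN = "<NA>"


def count_missing(values, na_token=NA_TOKEN):
    """Count entries that are None or equal to the literal NA token."""
    return sum(1 for v in values if v is None or v == na_token)


# 시도명 as summarised above: 25 literal "<NA>" strings and 5 real values.
sido = ["<NA>"] * 25 + ["경기도"] * 5

missing = count_missing(sido)
mean_length = sum(len(v) for v in sido) / len(sido)

print(missing, round(mean_length, 7))
```

The recomputed mean length, (25 × 4 + 5 × 3) / 30, matches the reported 3.8333333, which supports the reading that `<NA>` is stored as text. The same pattern appears in 시군구명 and 읍면동명.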

시군구명
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 6.7%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 4
Median length: 4
Mean length: 3.8333333
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: 가평군
5th row: 가평군

Common Values

Value | Count | Frequency (%)
<NA> | 25 | 83.3%
가평군 | 5 | 16.7%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

Value | Count | Frequency (%)
na | 25 | 83.3%
가평군 | 5 | 16.7%

읍면동명
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 6.7%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 4
Median length: 4
Mean length: 3.8333333
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: 가평읍
5th row: 가평읍

Common Values

Value | Count | Frequency (%)
<NA> | 25 | 83.3%
가평읍 | 5 | 16.7%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

Value | Count | Frequency (%)
na | 25 | 83.3%
가평읍 | 5 | 16.7%

위도
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 5
Distinct (%): 16.7%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 6
Median length: 3
Mean length: 3.4333333
Min length: 3

Unique

Unique: 3
Unique (%): 10.0%

Sample

1st row: 0.0
2nd row: 0.0
3rd row: 0.0
4th row: 37.828
5th row: 37.825

Common Values

Value | Count | Frequency (%)
0.0 | 25 | 83.3%
37.83 | 2 | 6.7%
37.828 | 1 | 3.3%
37.825 | 1 | 3.3%
37.842 | 1 | 3.3%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

Value | Count | Frequency (%)
0.0 | 25 | 83.3%
37.83 | 2 | 6.7%
37.828 | 1 | 3.3%
37.825 | 1 | 3.3%
37.842 | 1 | 3.3%
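
The 58.2% imbalance reported for 위도 in the Alerts can be reproduced from these counts if the score is taken to be one minus the Shannon entropy of the value distribution, normalized by the maximum entropy for that number of classes. That definition is an assumption here, chosen because it matches the reported figure. A minimal sketch:

```python
import math


def imbalance_score(counts):
    """1 - normalized Shannon entropy.

    0 means perfectly balanced classes; values near 1 mean one class dominates.
    Assumes at least two classes.
    """
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(len(counts))


# 위도 value counts from the table above: 0.0, 37.83, 37.828, 37.825, 37.842.
score = imbalance_score([25, 2, 1, 1, 1])
print(f"{score:.1%}")
```

경도 has the same count profile (25, 2, 1, 1, 1), which would explain why both columns carry an identical imbalance value.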

경도
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 5
Distinct (%): 16.7%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 7
Median length: 3
Mean length: 3.6666667
Min length: 3

Unique

Unique: 3
Unique (%): 10.0%

Sample

1st row: 0.0
2nd row: 0.0
3rd row: 0.0
4th row: 127.514
5th row: 127.512

Common Values

Value | Count | Frequency (%)
0.0 | 25 | 83.3%
127.513 | 2 | 6.7%
127.514 | 1 | 3.3%
127.512 | 1 | 3.3%
127.505 | 1 | 3.3%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

Value | Count | Frequency (%)
0.0 | 25 | 83.3%
127.513 | 2 | 6.7%
127.514 | 1 | 3.3%
127.512 | 1 | 3.3%
127.505 | 1 | 3.3%

사용여부
Boolean

HIGH CORRELATION 

Distinct: 2
Distinct (%): 6.7%
Missing: 0
Missing (%): 0.0%
Memory size: 162.0 B

Value | Count | Frequency (%)
False | 20 | 66.7%
True | 10 | 33.3%

결제금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 11
Distinct (%): 36.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3337
Minimum: 0
Maximum: 54000
Zeros: 20
Zeros (%): 66.7%
Negative: 0
Negative (%): 0.0%
Memory size: 402.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 2957.5
95-th percentile: 9785
Maximum: 54000
Range: 54000
Interquartile range (IQR): 2957.5

Descriptive statistics

Standard deviation: 10025.132
Coefficient of variation (CV): 3.004235
Kurtosis: 24.3602
Mean: 3337
Median Absolute Deviation (MAD): 0
Skewness: 4.7652249
Sum: 100110
Variance: 1.0050328 × 10^8
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=11)
Common values
Value | Count | Frequency (%)
0 | 20 | 66.7%
4000 | 1 | 3.3%
8300 | 1 | 3.3%
1000 | 1 | 3.3%
11000 | 1 | 3.3%
8200 | 1 | 3.3%
500 | 1 | 3.3%
3610 | 1 | 3.3%
54000 | 1 | 3.3%
5000 | 1 | 3.3%

Minimum 10 values
Value | Count | Frequency (%)
0 | 20 | 66.7%
500 | 1 | 3.3%
1000 | 1 | 3.3%
3610 | 1 | 3.3%
4000 | 1 | 3.3%
4500 | 1 | 3.3%
5000 | 1 | 3.3%
8200 | 1 | 3.3%
8300 | 1 | 3.3%
11000 | 1 | 3.3%

Maximum 10 values
Value | Count | Frequency (%)
54000 | 1 | 3.3%
11000 | 1 | 3.3%
8300 | 1 | 3.3%
8200 | 1 | 3.3%
5000 | 1 | 3.3%
4500 | 1 | 3.3%
4000 | 1 | 3.3%
3610 | 1 | 3.3%
1000 | 1 | 3.3%
500 | 1 | 3.3%

Interactions

[Figure: 25 pairwise interaction plots]

Correlations

[Figure: correlation heatmap]
Variable | 카드번호 | 회원코드 | 가맹점번호 | 성별코드 | 연령대코드 | 결제상품ID | 결제상품명 | 가맹점업종명 | 가맹점우편번호 | 위도 | 경도 | 사용여부 | 결제금액
카드번호 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.328 | 1.000 | 0.000 | 0.000 | 1.000 | 0.000
회원코드 | 1.000 | 1.000 | 0.591 | 0.490 | 0.874 | 1.000 | 1.000 | 0.632 | 0.971 | 0.000 | 0.000 | 0.623 | 0.000
가맹점번호 | 1.000 | 0.591 | 1.000 | 0.000 | 0.126 | 0.567 | 0.567 | NaN | NaN | 0.506 | 0.506 | 0.991 | 0.737
성별코드 | 1.000 | 0.490 | 0.000 | 1.000 | 0.292 | 0.613 | 0.613 | 0.223 | 0.328 | 0.000 | 0.000 | 0.061 | 0.000
연령대코드 | 1.000 | 0.874 | 0.126 | 0.292 | 1.000 | 0.775 | 0.775 | 0.475 | 0.890 | 0.000 | 0.000 | 0.209 | 0.000
결제상품ID | 1.000 | 1.000 | 0.567 | 0.613 | 0.775 | 1.000 | 0.994 | 1.000 | 1.000 | 0.315 | 0.315 | 0.613 | 0.495
결제상품명 | 1.000 | 1.000 | 0.567 | 0.613 | 0.775 | 0.994 | 1.000 | 1.000 | 1.000 | 0.315 | 0.315 | 0.613 | 0.495
가맹점업종명 | 0.328 | 0.632 | NaN | 0.223 | 0.475 | 1.000 | 1.000 | 1.000 | 0.596 | 0.625 | 0.625 | NaN | 0.292
가맹점우편번호 | 1.000 | 0.971 | NaN | 0.328 | 0.890 | 1.000 | 1.000 | 0.596 | 1.000 | 0.000 | 0.000 | NaN | 0.920
위도 | 0.000 | 0.000 | 0.506 | 0.000 | 0.000 | 0.315 | 0.315 | 0.625 | 0.000 | 1.000 | 1.000 | 0.456 | 0.767
경도 | 0.000 | 0.000 | 0.506 | 0.000 | 0.000 | 0.315 | 0.315 | 0.625 | 0.000 | 1.000 | 1.000 | 0.456 | 0.767
사용여부 | 1.000 | 0.623 | 0.991 | 0.061 | 0.209 | 0.613 | 0.613 | NaN | NaN | 0.456 | 0.456 | 1.000 | 0.681
결제금액 | 0.000 | 0.000 | 0.737 | 0.000 | 0.000 | 0.495 | 0.495 | 0.292 | 0.920 | 0.767 | 0.767 | 0.681 | 1.000

[Figure: correlation heatmap]
Variable | 사용여부 | 시군구명 | 경도 | 위도 | 시도명 | 성별코드 | 읍면동명 | 결제상품ID | 결제상품명
사용여부 | 1.000 | 1.000 | 0.521 | 0.521 | 1.000 | 0.018 | 1.000 | 0.419 | 0.419
시군구명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
경도 | 0.521 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000 | 1.000 | 0.358 | 0.358
위도 | 0.521 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000 | 1.000 | 0.358 | 0.358
시도명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
성별코드 | 0.018 | 1.000 | 0.000 | 0.000 | 1.000 | 1.000 | 1.000 | 0.419 | 0.419
읍면동명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
결제상품ID | 0.419 | 1.000 | 0.358 | 0.358 | 1.000 | 0.419 | 1.000 | 1.000 | 0.930
결제상품명 | 0.419 | 1.000 | 0.358 | 0.358 | 1.000 | 0.419 | 1.000 | 0.930 | 1.000
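
The value 0.930 between 결제상품ID and 결제상품명 is lower than 1 even though the sample rows and value counts (17 and 13 for both columns) suggest a one-to-one mapping between the two. It is consistent with a bias-corrected Cramér's V computed from a chi-squared statistic with Yates continuity correction. The sketch below works under that assumption, which is inferred from the number and not stated in the report; the function is a hand-written illustration, not a library API.

```python
import math


def cramers_v_2x2(table, yates=True):
    """Bias-corrected Cramér's V for a 2x2 contingency table.

    Assumes every row and column total is non-zero.
    """
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]

    chi2 = 0.0
    for i in range(2):
        for j in range(2):
            expected = row_totals[i] * col_totals[j] / n
            diff = abs(table[i][j] - expected)
            if yates:
                diff = max(diff - 0.5, 0.0)  # Yates continuity correction
            chi2 += diff ** 2 / expected

    r = k = 2  # number of rows and columns
    phi2 = chi2 / n
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    return math.sqrt(phi2_corr / min(k_corr - 1, r_corr - 1))


# Assumed crosstab of 결제상품ID vs 결제상품명: one-to-one, counts 17 and 13.
v = cramers_v_2x2([[17, 0], [0, 13]])
print(round(v, 3))
```

With only 30 rows, both corrections pull a perfect association below 1, so values just under 1 in this matrix need not indicate any real disagreement between the columns.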

[Figure: correlation heatmap]
Variable | 회원코드 | 가맹점번호 | 연령대코드 | 가맹점우편번호 | 결제금액 | 성별코드 | 결제상품ID | 결제상품명 | 시도명 | 시군구명 | 읍면동명 | 위도 | 경도 | 사용여부
회원코드 | 1.000 | -0.458 | -0.505 | -0.866 | 0.458 | 0.505 | 0.906 | 0.906 | 1.000 | 1.000 | 1.000 | 0.000 | 0.000 | 0.605
가맹점번호 | -0.458 | 1.000 | 0.438 | 0.378 | -0.918 | 0.018 | 0.419 | 0.419 | 1.000 | 1.000 | 1.000 | 0.521 | 0.521 | 0.922
연령대코드 | -0.505 | 0.438 | 1.000 | 0.688 | -0.453 | 0.272 | 0.760 | 0.760 | 1.000 | 1.000 | 1.000 | 0.000 | 0.000 | 0.185
가맹점우편번호 | -0.866 | 0.378 | 0.688 | 1.000 | -0.226 | 0.500 | 0.935 | 0.935 | 1.000 | 1.000 | 1.000 | 0.000 | 0.000 | 1.000
결제금액 | 0.458 | -0.918 | -0.453 | -0.226 | 1.000 | 0.000 | 0.318 | 0.318 | 1.000 | 1.000 | 1.000 | 0.701 | 0.701 | 0.460
성별코드 | 0.505 | 0.018 | 0.272 | 0.500 | 0.000 | 1.000 | 0.419 | 0.419 | 1.000 | 1.000 | 1.000 | 0.000 | 0.000 | 0.018
결제상품ID | 0.906 | 0.419 | 0.760 | 0.935 | 0.318 | 0.419 | 1.000 | 0.930 | 1.000 | 1.000 | 1.000 | 0.358 | 0.358 | 0.419
결제상품명 | 0.906 | 0.419 | 0.760 | 0.935 | 0.318 | 0.419 | 0.930 | 1.000 | 1.000 | 1.000 | 1.000 | 0.358 | 0.358 | 0.419
시도명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
시군구명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
읍면동명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
위도 | 0.000 | 0.521 | 0.000 | 0.000 | 0.701 | 0.000 | 0.358 | 0.358 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.521
경도 | 0.000 | 0.521 | 0.000 | 0.000 | 0.701 | 0.000 | 0.358 | 0.358 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.521
사용여부 | 0.605 | 0.922 | 0.185 | 1.000 | 0.460 | 0.018 | 0.419 | 0.419 | 1.000 | 1.000 | 1.000 | 0.521 | 0.521 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

First rows

Index | 일반주간결제시작일자 | 일반주간결제종료일자 | 카드번호 | 회원코드 | 가맹점번호 | 성별코드 | 연령대코드 | 결제상품ID | 결제상품명 | 가맹점업종명 | 가맹점우편번호 | 시도명 | 시군구명 | 읍면동명 | 위도 | 경도 | 사용여부 | 결제금액
0 | 2020-04-27 | 2020-05-03 | 0u/25LGjbIQX0oTiqd79fLowjq31KB42TJqx9+VrnjE= | 3019321162 | 999999999999999 | F | 50 | 140000146000 | 양평농협 창립50주년 기프트카드 | <NA> | <NA> | <NA> | <NA> | <NA> | 0.0 | 0.0 | N | 0
1 | 2020-04-27 | 2020-05-03 | 1K56osTohSaArYgtvLvUGKa/VmB+Rn3UclJY7X8dizU= | 3002110323 | 999999999999999 | M | 40 | 140000146000 | 양평농협 창립50주년 기프트카드 | <NA> | <NA> | <NA> | <NA> | <NA> | 0.0 | 0.0 | N | 0
2 | 2020-04-27 | 2020-05-03 | 2BiBIc7MndH0e9FjNEowiXw2nG+OVoVPWVoweXPeJRE= | 3011376245 | 999999999999999 | F | 70 | 140000146000 | 양평농협 창립50주년 기프트카드 | <NA> | <NA> | <NA> | <NA> | <NA> | 0.0 | 0.0 | N | 0
3 | 2020-04-27 | 2020-05-03 | 4+KfDtCVLKlySytN+UxCnLWIUdqouuaGgl+gis3RLww= | 3014859103 | 713079425 | M | 20 | 140000134000 | 가평사랑상품권(장병용) | 일반휴게음식 | 12419 | 경기도 | 가평군 | 가평읍 | 37.828 | 127.514 | Y | 4000
4 | 2020-04-27 | 2020-05-03 | 4+KfDtCVLKlySytN+UxCnLWIUdqouuaGgl+gis3RLww= | 3014859103 | 713102962 | M | 20 | 140000134000 | 가평사랑상품권(장병용) | 일반휴게음식 | 12416 | 경기도 | 가평군 | 가평읍 | 37.825 | 127.512 | Y | 8300
5 | 2020-04-27 | 2020-05-03 | 4+KfDtCVLKlySytN+UxCnLWIUdqouuaGgl+gis3RLww= | 3014859103 | 715026043 | M | 20 | 140000134000 | 가평사랑상품권(장병용) | 신변잡화 | 12418 | 경기도 | 가평군 | 가평읍 | 37.83 | 127.513 | Y | 1000
6 | 2020-04-27 | 2020-05-03 | 4+KfDtCVLKlySytN+UxCnLWIUdqouuaGgl+gis3RLww= | 3014859103 | 761314525 | M | 20 | 140000134000 | 가평사랑상품권(장병용) | 신변잡화 | 12408 | 경기도 | 가평군 | 가평읍 | 37.842 | 127.505 | Y | 11000
7 | 2020-04-27 | 2020-05-03 | 4+KfDtCVLKlySytN+UxCnLWIUdqouuaGgl+gis3RLww= | 3014859103 | 791837729 | M | 20 | 140000134000 | 가평사랑상품권(장병용) | 음료식품 | 12419 | 경기도 | 가평군 | 가평읍 | 37.83 | 127.513 | Y | 8200
8 | 2020-04-27 | 2020-05-03 | 4+MP7VP03NBjPZjBH8aNW7cr3+IwB4v4Cs1UDhG4UsU= | 3002112047 | 732340966 | M | 30 | 140000146000 | 양평농협 창립50주년 기프트카드 | 유통업 비영리 | 12561 | <NA> | <NA> | <NA> | 0.0 | 0.0 | Y | 500
9 | 2020-04-27 | 2020-05-03 | 4GdOEb1yTMd9qEcH47R/Vd/rgm9UCTQbrgea4IzTX/g= | 3002110736 | 999999999999999 | F | 20 | 140000146000 | 양평농협 창립50주년 기프트카드 | <NA> | <NA> | <NA> | <NA> | <NA> | 0.0 | 0.0 | N | 0

Last rows

Index | 일반주간결제시작일자 | 일반주간결제종료일자 | 카드번호 | 회원코드 | 가맹점번호 | 성별코드 | 연령대코드 | 결제상품ID | 결제상품명 | 가맹점업종명 | 가맹점우편번호 | 시도명 | 시군구명 | 읍면동명 | 위도 | 경도 | 사용여부 | 결제금액
20 | 2020-04-27 | 2020-05-03 | 7m2/909UPqiXo0amRETf0HAmwHQIASxCrlMk4O/8c/Y= | 3002109642 | 999999999999999 | F | 40 | 140000146000 | 양평농협 창립50주년 기프트카드 | <NA> | <NA> | <NA> | <NA> | <NA> | 0.0 | 0.0 | N | 0
21 | 2020-04-27 | 2020-05-03 | 7prkrrYnlXk+UWDVcy2Ecz5aYPN99EvUFDF08ZQBugE= | 3008136102 | 999999999999999 | M | 60 | 140000146000 | 양평농협 창립50주년 기프트카드 | <NA> | <NA> | <NA> | <NA> | <NA> | 0.0 | 0.0 | N | 0
22 | 2020-04-27 | 2020-05-03 | 80MoIts/8QX6N1gSmBEQGDCXIbdO9EDDgj7GrH2DWUo= | 3011744117 | 999999999999999 | M | 50 | 140000146000 | 양평농협 창립50주년 기프트카드 | <NA> | <NA> | <NA> | <NA> | <NA> | 0.0 | 0.0 | N | 0
23 | 2020-04-27 | 2020-05-03 | 9SIVrjsIg96x+7B2OPGBFP7E7CzCj2PqDaTsiwFlMWQ= | 3012833140 | 999999999999999 | M | 20 | 140000134000 | 가평사랑상품권(장병용) | <NA> | <NA> | <NA> | <NA> | <NA> | 0.0 | 0.0 | N | 0
24 | 2020-04-27 | 2020-05-03 | A1Kv9tuKlEz9ZLyZmiJvXoXlKDvFS8UL5vgS/9l4eYg= | 3002114442 | 999999999999999 | M | 40 | 140000146000 | 양평농협 창립50주년 기프트카드 | <NA> | <NA> | <NA> | <NA> | <NA> | 0.0 | 0.0 | N | 0
25 | 2020-04-27 | 2020-05-03 | A2jLkwEELCW78U41/IHakiB6+EZoeQCO2ZiCXiYsBug= | 3011347170 | 999999999999999 | M | 50 | 140000146000 | 양평농협 창립50주년 기프트카드 | <NA> | <NA> | <NA> | <NA> | <NA> | 0.0 | 0.0 | N | 0
26 | 2020-04-27 | 2020-05-03 | AtGVBcYkJKKgG3mh19FnxvDXSWzTHKLAfRat8/x23UE= | 3007890125 | 999999999999999 | M | 30 | 140000146000 | 양평농협 창립50주년 기프트카드 | <NA> | <NA> | <NA> | <NA> | <NA> | 0.0 | 0.0 | N | 0
27 | 2020-04-27 | 2020-05-03 | BJ+07wo+pLog/Wo0YdECLL2esrb3uQO6p2sw1WZ2lrI= | 3002110711 | 749835951 | F | 50 | 140000146000 | 양평농협 창립50주년 기프트카드 | 유통업 비영리 | 12559 | <NA> | <NA> | <NA> | 0.0 | 0.0 | Y | 5000
28 | 2020-04-27 | 2020-05-03 | C+ATeTV+EdY54/xCXhs1U9ESyZMRGO2NTZ6uoMkAp6c= | 3007243149 | 999999999999999 | F | 20 | 140000146000 | 양평농협 창립50주년 기프트카드 | <NA> | <NA> | <NA> | <NA> | <NA> | 0.0 | 0.0 | N | 0
29 | 2020-04-27 | 2020-05-03 | CZneKs7lB4ga9aNBg3bxA1Zzzp2FvZBmPuF2hjYzy2Q= | 3012668156 | 790245863 | M | 20 | 140000134000 | 가평사랑상품권(장병용) | 유통업 영리 | 12420 | <NA> | <NA> | <NA> | 0.0 | 0.0 | Y | 4500