Overview

Dataset statistics

Number of variables10
Number of observations500
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory43.1 KiB
Average record size in memory88.3 B

Variable types

Text2
Categorical2
Numeric6

Dataset

Description샘플 데이터
Author롯데멤버스
URLhttps://bigdata.seoul.go.kr/data/selectSampleData.do?sample_data_seq=56

Reproduction

Analysis started2023-12-10 14:58:31.161603
Analysis finished2023-12-10 14:58:40.633900
Duration9.47 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct172
Distinct (%)34.4%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
2023-12-10T23:58:41.257713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters1500
Distinct characters35
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique60 ?
Unique (%)12.0%

Sample

1st rowK07
2nd rowR13
3rd rowC07
4th rowK11
5th rowR13
ValueCountFrequency (%)
d12 11
 
2.2%
e13 10
 
2.0%
i02 10
 
2.0%
i04 10
 
2.0%
j02 9
 
1.8%
r07 9
 
1.8%
k07 8
 
1.6%
r13 8
 
1.6%
c01 7
 
1.4%
d05 7
 
1.4%
Other values (162) 411
82.2%
2023-12-10T23:58:42.173768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 397
26.5%
1 154
 
10.3%
2 116
 
7.7%
3 81
 
5.4%
4 57
 
3.8%
5 53
 
3.5%
R 52
 
3.5%
7 42
 
2.8%
D 39
 
2.6%
6 39
 
2.6%
Other values (25) 470
31.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1000
66.7%
Uppercase Letter 500
33.3%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
R 52
 
10.4%
D 39
 
7.8%
I 31
 
6.2%
T 27
 
5.4%
M 24
 
4.8%
K 24
 
4.8%
Q 23
 
4.6%
A 22
 
4.4%
C 20
 
4.0%
G 20
 
4.0%
Other values (15) 218
43.6%
Decimal Number
ValueCountFrequency (%)
0 397
39.7%
1 154
 
15.4%
2 116
 
11.6%
3 81
 
8.1%
4 57
 
5.7%
5 53
 
5.3%
7 42
 
4.2%
6 39
 
3.9%
8 38
 
3.8%
9 23
 
2.3%

Most occurring scripts

ValueCountFrequency (%)
Common 1000
66.7%
Latin 500
33.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
R 52
 
10.4%
D 39
 
7.8%
I 31
 
6.2%
T 27
 
5.4%
M 24
 
4.8%
K 24
 
4.8%
Q 23
 
4.6%
A 22
 
4.4%
C 20
 
4.0%
G 20
 
4.0%
Other values (15) 218
43.6%
Common
ValueCountFrequency (%)
0 397
39.7%
1 154
 
15.4%
2 116
 
11.6%
3 81
 
8.1%
4 57
 
5.7%
5 53
 
5.3%
7 42
 
4.2%
6 39
 
3.9%
8 38
 
3.8%
9 23
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1500
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 397
26.5%
1 154
 
10.3%
2 116
 
7.7%
3 81
 
5.4%
4 57
 
3.8%
5 53
 
3.5%
R 52
 
3.5%
7 42
 
2.8%
D 39
 
2.6%
6 39
 
2.6%
Other values (25) 470
31.3%
Distinct122
Distinct (%)24.4%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
2023-12-10T23:58:42.776245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters1500
Distinct characters35
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique43 ?
Unique (%)8.6%

Sample

1st rowX41
2nd rowX16
3rd rowX11
4th rowI02
5th rowT09
ValueCountFrequency (%)
r09 35
 
7.0%
r13 26
 
5.2%
t09 22
 
4.4%
x11 21
 
4.2%
k07 21
 
4.2%
i02 18
 
3.6%
o04 15
 
3.0%
i04 14
 
2.8%
d08 13
 
2.6%
f02 11
 
2.2%
Other values (112) 304
60.8%
2023-12-10T23:58:43.575023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 372
24.8%
1 182
12.1%
2 114
 
7.6%
R 87
 
5.8%
3 67
 
4.5%
9 66
 
4.4%
4 62
 
4.1%
5 45
 
3.0%
7 39
 
2.6%
I 37
 
2.5%
Other values (25) 429
28.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1000
66.7%
Uppercase Letter 500
33.3%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
R 87
17.4%
I 37
 
7.4%
T 35
 
7.0%
X 31
 
6.2%
K 28
 
5.6%
D 28
 
5.6%
O 27
 
5.4%
F 24
 
4.8%
A 23
 
4.6%
U 22
 
4.4%
Other values (15) 158
31.6%
Decimal Number
ValueCountFrequency (%)
0 372
37.2%
1 182
18.2%
2 114
 
11.4%
3 67
 
6.7%
9 66
 
6.6%
4 62
 
6.2%
5 45
 
4.5%
7 39
 
3.9%
6 30
 
3.0%
8 23
 
2.3%

Most occurring scripts

ValueCountFrequency (%)
Common 1000
66.7%
Latin 500
33.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
R 87
17.4%
I 37
 
7.4%
T 35
 
7.0%
X 31
 
6.2%
K 28
 
5.6%
D 28
 
5.6%
O 27
 
5.4%
F 24
 
4.8%
A 23
 
4.6%
U 22
 
4.4%
Other values (15) 158
31.6%
Common
ValueCountFrequency (%)
0 372
37.2%
1 182
18.2%
2 114
 
11.4%
3 67
 
6.7%
9 66
 
6.6%
4 62
 
6.2%
5 45
 
4.5%
7 39
 
3.9%
6 30
 
3.0%
8 23
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1500
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 372
24.8%
1 182
12.1%
2 114
 
7.6%
R 87
 
5.8%
3 67
 
4.5%
9 66
 
4.4%
4 62
 
4.1%
5 45
 
3.0%
7 39
 
2.6%
I 37
 
2.5%
Other values (25) 429
28.6%

성별(GENDER)
Categorical

Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
2
319 
1
181 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row2
3rd row2
4th row2
5th row1

Common Values

ValueCountFrequency (%)
2 319
63.8%
1 181
36.2%

Length

2023-12-10T23:58:43.861742image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:58:44.057549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2 319
63.8%
1 181
36.2%

연령(AGE_GR_CD)
Real number (ℝ)

Distinct12
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.074
Minimum1
Maximum12
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.5 KiB
2023-12-10T23:58:44.222388image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q14
median6
Q38
95-th percentile11
Maximum12
Range11
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.5118006
Coefficient of variation (CV)0.41353319
Kurtosis-0.54296587
Mean6.074
Median Absolute Deviation (MAD)2
Skewness0.29835335
Sum3037
Variance6.3091423
MonotonicityNot monotonic
2023-12-10T23:58:44.440440image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
6 75
15.0%
5 71
14.2%
4 65
13.0%
8 63
12.6%
7 58
11.6%
3 44
8.8%
2 38
7.6%
9 35
7.0%
10 24
 
4.8%
11 14
 
2.8%
Other values (2) 13
 
2.6%
ValueCountFrequency (%)
1 1
 
0.2%
2 38
7.6%
3 44
8.8%
4 65
13.0%
5 71
14.2%
6 75
15.0%
7 58
11.6%
8 63
12.6%
9 35
7.0%
10 24
 
4.8%
ValueCountFrequency (%)
12 12
 
2.4%
11 14
 
2.8%
10 24
 
4.8%
9 35
7.0%
8 63
12.6%
7 58
11.6%
6 75
15.0%
5 71
14.2%
4 65
13.0%
3 44
8.8%

이용년월일(DATE)
Real number (ℝ)

Distinct401
Distinct (%)80.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20179799
Minimum20170101
Maximum20191230
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.5 KiB
2023-12-10T23:58:44.752639image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20170101
5-th percentile20170223
Q120170828
median20180509
Q320190304
95-th percentile20191109
Maximum20191230
Range21129
Interquartile range (IQR)19475.5

Descriptive statistics

Standard deviation8122.2174
Coefficient of variation (CV)0.00040249248
Kurtosis-1.4541222
Mean20179799
Median Absolute Deviation (MAD)9697
Skewness0.16590089
Sum1.0089899 × 1010
Variance65970415
MonotonicityNot monotonic
2023-12-10T23:58:45.064786image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20170120 4
 
0.8%
20180902 4
 
0.8%
20170928 4
 
0.8%
20180303 3
 
0.6%
20170619 3
 
0.6%
20171103 3
 
0.6%
20180518 3
 
0.6%
20180221 3
 
0.6%
20171220 3
 
0.6%
20170116 3
 
0.6%
Other values (391) 467
93.4%
ValueCountFrequency (%)
20170101 1
 
0.2%
20170103 1
 
0.2%
20170111 1
 
0.2%
20170113 2
0.4%
20170114 1
 
0.2%
20170116 3
0.6%
20170120 4
0.8%
20170121 1
 
0.2%
20170125 1
 
0.2%
20170126 1
 
0.2%
ValueCountFrequency (%)
20191230 1
0.2%
20191228 1
0.2%
20191226 2
0.4%
20191223 2
0.4%
20191221 1
0.2%
20191218 1
0.2%
20191216 2
0.4%
20191210 2
0.4%
20191209 1
0.2%
20191207 1
0.2%
Distinct4
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
4
281 
2
77 
1
76 
3
66 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row4
3rd row1
4th row3
5th row4

Common Values

ValueCountFrequency (%)
4 281
56.2%
2 77
 
15.4%
1 76
 
15.2%
3 66
 
13.2%

Length

2023-12-10T23:58:45.305587image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:58:45.503948image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
4 281
56.2%
2 77
 
15.4%
1 76
 
15.2%
3 66
 
13.2%
Distinct18
Distinct (%)3.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10.842
Minimum1
Maximum20
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.5 KiB
2023-12-10T23:58:45.700059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q16
median12
Q315
95-th percentile20
Maximum20
Range19
Interquartile range (IQR)9

Descriptive statistics

Standard deviation5.4985001
Coefficient of variation (CV)0.50714813
Kurtosis-0.90000453
Mean10.842
Median Absolute Deviation (MAD)4
Skewness-0.17370823
Sum5421
Variance30.233503
MonotonicityNot monotonic
2023-12-10T23:58:46.288993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
6 68
13.6%
16 63
12.6%
12 59
11.8%
9 47
9.4%
20 40
8.0%
15 38
7.6%
1 36
7.2%
11 28
5.6%
2 27
 
5.4%
13 25
 
5.0%
Other values (8) 69
13.8%
ValueCountFrequency (%)
1 36
7.2%
2 27
 
5.4%
4 5
 
1.0%
5 3
 
0.6%
6 68
13.6%
7 24
 
4.8%
8 2
 
0.4%
9 47
9.4%
10 1
 
0.2%
11 28
5.6%
ValueCountFrequency (%)
20 40
8.0%
19 4
 
0.8%
18 14
 
2.8%
16 63
12.6%
15 38
7.6%
14 16
 
3.2%
13 25
 
5.0%
12 59
11.8%
11 28
5.6%
10 1
 
0.2%
Distinct16
Distinct (%)3.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.694
Minimum1
Maximum46
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.5 KiB
2023-12-10T23:58:46.509602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q31
95-th percentile4
Maximum46
Range45
Interquartile range (IQR)0

Descriptive statistics

Standard deviation2.9519132
Coefficient of variation (CV)1.7425698
Kurtosis121.99183
Mean1.694
Median Absolute Deviation (MAD)0
Skewness9.7180733
Sum847
Variance8.7137916
MonotonicityNot monotonic
2023-12-10T23:58:46.740543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%)
1 404
80.8%
2 41
 
8.2%
3 19
 
3.8%
4 14
 
2.8%
5 6
 
1.2%
6 4
 
0.8%
16 2
 
0.4%
9 2
 
0.4%
46 1
 
0.2%
17 1
 
0.2%
Other values (6) 6
 
1.2%
ValueCountFrequency (%)
1 404
80.8%
2 41
 
8.2%
3 19
 
3.8%
4 14
 
2.8%
5 6
 
1.2%
6 4
 
0.8%
7 1
 
0.2%
8 1
 
0.2%
9 2
 
0.4%
11 1
 
0.2%
ValueCountFrequency (%)
46 1
0.2%
30 1
0.2%
17 1
0.2%
16 2
0.4%
13 1
0.2%
12 1
0.2%
11 1
0.2%
9 2
0.4%
8 1
0.2%
7 1
0.2%

이용건수(USE_CNT)
Real number (ℝ)

Distinct20
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.344
Minimum1
Maximum66
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.5 KiB
2023-12-10T23:58:46.965304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q32
95-th percentile7
Maximum66
Range65
Interquartile range (IQR)1

Descriptive statistics

Standard deviation4.1477792
Coefficient of variation (CV)1.7695304
Kurtosis118.54133
Mean2.344
Median Absolute Deviation (MAD)0
Skewness9.0516586
Sum1172
Variance17.204072
MonotonicityNot monotonic
2023-12-10T23:58:47.189625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=20)
ValueCountFrequency (%)
1 320
64.0%
2 75
 
15.0%
3 41
 
8.2%
4 14
 
2.8%
5 13
 
2.6%
6 9
 
1.8%
7 6
 
1.2%
13 3
 
0.6%
15 3
 
0.6%
9 3
 
0.6%
Other values (10) 13
 
2.6%
ValueCountFrequency (%)
1 320
64.0%
2 75
 
15.0%
3 41
 
8.2%
4 14
 
2.8%
5 13
 
2.6%
6 9
 
1.8%
7 6
 
1.2%
8 2
 
0.4%
9 3
 
0.6%
10 1
 
0.2%
ValueCountFrequency (%)
66 1
 
0.2%
32 1
 
0.2%
22 1
 
0.2%
20 1
 
0.2%
19 1
 
0.2%
16 2
0.4%
15 3
0.6%
13 3
0.6%
12 1
 
0.2%
11 2
0.4%

이용금액(USE_AMT)
Real number (ℝ)

Distinct56
Distinct (%)11.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11954
Minimum1000
Maximum202000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.5 KiB
2023-12-10T23:58:47.451874image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1000
5-th percentile1000
Q14000
median7000
Q315000
95-th percentile36050
Maximum202000
Range201000
Interquartile range (IQR)11000

Descriptive statistics

Standard deviation15560.467
Coefficient of variation (CV)1.3016954
Kurtosis47.99704
Mean11954
Median Absolute Deviation (MAD)5000
Skewness5.1888316
Sum5977000
Variance2.4212814 × 108
MonotonicityNot monotonic
2023-12-10T23:58:47.746239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2000 52
 
10.4%
5000 48
 
9.6%
1000 39
 
7.8%
4000 35
 
7.0%
6000 32
 
6.4%
3000 29
 
5.8%
10000 28
 
5.6%
8000 26
 
5.2%
9000 20
 
4.0%
15000 17
 
3.4%
Other values (46) 174
34.8%
ValueCountFrequency (%)
1000 39
7.8%
2000 52
10.4%
3000 29
5.8%
4000 35
7.0%
5000 48
9.6%
6000 32
6.4%
7000 16
 
3.2%
8000 26
5.2%
9000 20
 
4.0%
10000 28
5.6%
ValueCountFrequency (%)
202000 1
0.2%
90000 1
0.2%
88000 1
0.2%
73000 1
0.2%
72000 1
0.2%
69000 1
0.2%
66000 2
0.4%
64000 1
0.2%
63000 1
0.2%
61000 1
0.2%

Interactions

2023-12-10T23:58:38.851685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:58:32.919481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:58:34.038171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:58:35.173507image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:58:36.395350image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:58:37.497123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:58:39.042287image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:58:33.095746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:58:34.231059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:58:35.359977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:58:36.625972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:58:37.674987image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:58:39.258493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:58:33.291790image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:58:34.427417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:58:35.537173image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:58:36.805477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:58:37.852676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:58:39.484582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:58:33.492366image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:58:34.627411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:58:35.728903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:58:37.000898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:58:38.103100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:58:39.668547image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:58:33.670278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:58:34.793582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:58:35.876923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:58:37.157567image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:58:38.393405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:58:39.893483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:58:33.844214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:58:34.971985image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:58:36.165693image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:58:37.315248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:58:38.634013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T23:58:47.917061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
성별(GENDER)연령(AGE_GR_CD)이용년월일(DATE)업종(UPJONG_CD)품목_및_가맹점(ITEM_CD)이용고객수(USE_CUSTM_CNT)이용건수(USE_CNT)이용금액(USE_AMT)
성별(GENDER)1.0000.0000.0000.0000.1110.0970.0000.000
연령(AGE_GR_CD)0.0001.0000.0820.0000.0000.0860.0000.000
이용년월일(DATE)0.0000.0821.0000.0000.0000.1080.0000.059
업종(UPJONG_CD)0.0000.0000.0001.0000.1140.0710.1260.000
품목_및_가맹점(ITEM_CD)0.1110.0000.0000.1141.0000.0000.0000.088
이용고객수(USE_CUSTM_CNT)0.0970.0860.1080.0710.0001.0000.0000.000
이용건수(USE_CNT)0.0000.0000.0000.1260.0000.0001.0000.370
이용금액(USE_AMT)0.0000.0000.0590.0000.0880.0000.3701.000
2023-12-10T23:58:48.187133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
성별(GENDER)업종(UPJONG_CD)
성별(GENDER)1.0000.000
업종(UPJONG_CD)0.0001.000
2023-12-10T23:58:48.374830image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연령(AGE_GR_CD)이용년월일(DATE)품목_및_가맹점(ITEM_CD)이용고객수(USE_CUSTM_CNT)이용건수(USE_CNT)이용금액(USE_AMT)성별(GENDER)업종(UPJONG_CD)
연령(AGE_GR_CD)1.000-0.0340.042-0.0070.019-0.1290.0000.000
이용년월일(DATE)-0.0341.000-0.010-0.051-0.040-0.0640.0000.000
품목_및_가맹점(ITEM_CD)0.042-0.0101.0000.063-0.0240.0830.0840.068
이용고객수(USE_CUSTM_CNT)-0.007-0.0510.0631.0000.036-0.0160.0690.045
이용건수(USE_CNT)0.019-0.040-0.0240.0361.000-0.0460.0000.081
이용금액(USE_AMT)-0.129-0.0640.083-0.016-0.0461.0000.0000.000
성별(GENDER)0.0000.0000.0840.0690.0000.0001.0000.000
업종(UPJONG_CD)0.0000.0000.0680.0450.0810.0000.0001.000

Missing values

2023-12-10T23:58:40.184304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T23:58:40.491331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

주거지역(RES_AREA_CD)소비지역(USE_AREA_CD)성별(GENDER)연령(AGE_GR_CD)이용년월일(DATE)업종(UPJONG_CD)품목_및_가맹점(ITEM_CD)이용고객수(USE_CUSTM_CNT)이용건수(USE_CNT)이용금액(USE_AMT)
0K07X411420180324111420000
1R13X162420180725415139000
2C07X1122201910251162124000
3K11I022720190128311444000
4R13T091720190927412126000
5D09V1215201707314131116000
6I04B0627201805043163136000
7L01O04262018080712138000
8P10T061620190515116124000
9V09Y031820170207411121000
주거지역(RES_AREA_CD)소비지역(USE_AREA_CD)성별(GENDER)연령(AGE_GR_CD)이용년월일(DATE)업종(UPJONG_CD)품목_및_가맹점(ITEM_CD)이용고객수(USE_CUSTM_CNT)이용건수(USE_CNT)이용금액(USE_AMT)
490L02O02262019021946916000
491W64I02252019090431511523000
492R14R091320190920415314000
493A12X202122017061549131000
494Q03B0725201903254151220000
495V12C01242019120749116000
496O03H022620180715311211000
497P02I04252018102849115000
498I02R091520170415411122000
499M06Q041420180717314116000