Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 500 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 34.3 KiB |
Average record size in memory | 70.3 B |
Variable types
Numeric | 6 |
---|---|
Categorical | 2 |
Dataset
Description | 샘플 데이터 |
---|---|
Author | 신한카드 |
URL | https://bigdata.seoul.go.kr/data/selectSampleData.do?sample_data_seq=51 |
기준년월(TS_YM) is highly overall correlated with 일별(TS_YMD) | High correlation |
일별(TS_YMD) is highly overall correlated with 기준년월(TS_YM) | High correlation |
시간대(TM) has 8 (1.6%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-10 14:53:59.778403 |
---|---|
Analysis finished | 2023-12-10 14:54:08.075614 |
Duration | 8.3 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
가맹점블록코드(BLCK_CD)
Real number (ℝ)
Distinct | 491 |
---|---|
Distinct (%) | 98.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 196907.08 |
Minimum | 66 |
---|---|
Maximum | 502478 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 66 |
---|---|
5-th percentile | 14213.8 |
Q1 | 37391.5 |
median | 214211 |
Q3 | 271625.75 |
95-th percentile | 415451.65 |
Maximum | 502478 |
Range | 502412 |
Interquartile range (IQR) | 234234.25 |
Descriptive statistics
Standard deviation | 128900.51 |
---|---|
Coefficient of variation (CV) | 0.65462607 |
Kurtosis | -0.848883 |
Mean | 196907.08 |
Median Absolute Deviation (MAD) | 67705.5 |
Skewness | 0.053874145 |
Sum | 98453539 |
Variance | 1.6615341 × 1010 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
14226 | 2 | 0.4% |
417221 | 2 | 0.4% |
363482 | 2 | 0.4% |
274501 | 2 | 0.4% |
339231 | 2 | 0.4% |
17869 | 2 | 0.4% |
206706 | 2 | 0.4% |
24276 | 2 | 0.4% |
206974 | 2 | 0.4% |
171889 | 1 | 0.2% |
Other values (481) | 481 |
Value | Count | Frequency (%) |
66 | 1 | |
8287 | 1 | |
8649 | 1 | |
8671 | 1 | |
9013 | 1 | |
9101 | 1 | |
9323 | 1 | |
9328 | 1 | |
10311 | 1 | |
10447 | 1 |
Value | Count | Frequency (%) |
502478 | 1 | |
502471 | 1 | |
501942 | 1 | |
501402 | 1 | |
421289 | 1 | |
420604 | 1 | |
420315 | 1 | |
420249 | 1 | |
420182 | 1 | |
419478 | 1 |
내국인업종코드(SB_UPJONG_CD)
Categorical
Distinct | 47 |
---|---|
Distinct (%) | 9.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
SB016 | |
---|---|
SB001 | |
SB008 | |
SB013 | |
SB020 | |
Other values (42) |
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 5 |
Min length | 5 |
Unique
Unique | 11 ? |
---|---|
Unique (%) | 2.2% |
Sample
1st row | SB001 |
---|---|
2nd row | SB007 |
3rd row | SB054 |
4th row | SB039 |
5th row | SB016 |
Common Values
Value | Count | Frequency (%) |
SB016 | 103 | |
SB001 | 62 | |
SB008 | 56 | 11.2% |
SB013 | 27 | 5.4% |
SB020 | 24 | 4.8% |
SB054 | 22 | 4.4% |
SB006 | 21 | 4.2% |
SB005 | 21 | 4.2% |
SB039 | 10 | 2.0% |
SB007 | 10 | 2.0% |
Other values (37) | 144 |
Length
Value | Count | Frequency (%) |
sb016 | 103 | |
sb001 | 62 | |
sb008 | 56 | 11.2% |
sb013 | 27 | 5.4% |
sb020 | 24 | 4.8% |
sb054 | 22 | 4.4% |
sb006 | 21 | 4.2% |
sb005 | 21 | 4.2% |
sb039 | 10 | 2.0% |
sb007 | 10 | 2.0% |
Other values (37) | 144 |
기준년월(TS_YM)
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 55 |
---|---|
Distinct (%) | 11.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 201894.99 |
Minimum | 201701 |
---|---|
Maximum | 202107 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 201701 |
---|---|
5-th percentile | 201704 |
Q1 | 201805 |
median | 201905 |
Q3 | 202006 |
95-th percentile | 202104.05 |
Maximum | 202107 |
Range | 406 |
Interquartile range (IQR) | 201 |
Descriptive statistics
Standard deviation | 131.00168 |
---|---|
Coefficient of variation (CV) | 0.0006488605 |
Kurtosis | -1.1621621 |
Mean | 201894.99 |
Median Absolute Deviation (MAD) | 101 |
Skewness | 0.044812573 |
Sum | 1.0094749 × 108 |
Variance | 17161.441 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
201806 | 17 | 3.4% |
202004 | 17 | 3.4% |
202003 | 16 | 3.2% |
201710 | 15 | 3.0% |
201908 | 14 | 2.8% |
201905 | 14 | 2.8% |
201811 | 14 | 2.8% |
202104 | 13 | 2.6% |
201903 | 12 | 2.4% |
202102 | 12 | 2.4% |
Other values (45) | 356 |
Value | Count | Frequency (%) |
201701 | 6 | 1.2% |
201702 | 9 | |
201703 | 7 | |
201704 | 5 | 1.0% |
201705 | 9 | |
201706 | 3 | 0.6% |
201707 | 12 | |
201708 | 10 | |
201709 | 3 | 0.6% |
201710 | 15 |
Value | Count | Frequency (%) |
202107 | 9 | |
202106 | 5 | 1.0% |
202105 | 11 | |
202104 | 13 | |
202103 | 8 | |
202102 | 12 | |
202101 | 8 | |
202012 | 5 | 1.0% |
202011 | 10 | |
202010 | 9 |
일별(TS_YMD)
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 425 |
---|---|
Distinct (%) | 85.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20189514 |
Minimum | 20170101 |
---|---|
Maximum | 20210729 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 20170101 |
---|---|
5-th percentile | 20170422 |
Q1 | 20180508 |
median | 20190528 |
Q3 | 20200618 |
95-th percentile | 20210428 |
Maximum | 20210729 |
Range | 40628 |
Interquartile range (IQR) | 20110.25 |
Descriptive statistics
Standard deviation | 13100.021 |
---|---|
Coefficient of variation (CV) | 0.00064885271 |
Kurtosis | -1.1621859 |
Mean | 20189514 |
Median Absolute Deviation (MAD) | 10080.5 |
Skewness | 0.044670815 |
Sum | 1.0094757 × 1010 |
Variance | 1.7161055 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20180804 | 3 | 0.6% |
20210417 | 3 | 0.6% |
20170707 | 3 | 0.6% |
20200424 | 3 | 0.6% |
20181109 | 3 | 0.6% |
20191202 | 3 | 0.6% |
20210401 | 3 | 0.6% |
20180609 | 2 | 0.4% |
20190827 | 2 | 0.4% |
20210218 | 2 | 0.4% |
Other values (415) | 473 |
Value | Count | Frequency (%) |
20170101 | 1 | |
20170102 | 1 | |
20170108 | 1 | |
20170115 | 1 | |
20170125 | 1 | |
20170130 | 1 | |
20170206 | 1 | |
20170211 | 1 | |
20170213 | 1 | |
20170214 | 1 |
Value | Count | Frequency (%) |
20210729 | 1 | |
20210726 | 1 | |
20210722 | 1 | |
20210720 | 1 | |
20210719 | 1 | |
20210715 | 1 | |
20210710 | 1 | |
20210709 | 1 | |
20210706 | 1 | |
20210630 | 1 |
요일(DAW)
Categorical
Distinct | 7 |
---|---|
Distinct (%) | 1.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
토요일 | |
---|---|
목요일 | |
금요일 | |
수요일 | |
월요일 | |
Other values (2) |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 토요일 |
---|---|
2nd row | 금요일 |
3rd row | 금요일 |
4th row | 월요일 |
5th row | 일요일 |
Common Values
Value | Count | Frequency (%) |
토요일 | 82 | |
목요일 | 81 | |
금요일 | 79 | |
수요일 | 73 | |
월요일 | 71 | |
화요일 | 61 | |
일요일 | 53 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
토요일 | 82 | |
목요일 | 81 | |
금요일 | 79 | |
수요일 | 73 | |
월요일 | 71 | |
화요일 | 61 | |
일요일 | 53 |
시간대(TM)
Real number (ℝ)
ZEROS
 
Distinct | 24 |
---|---|
Distinct (%) | 4.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 14.124 |
Minimum | 0 |
---|---|
Maximum | 23 |
Zeros | 8 |
Zeros (%) | 1.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 5 |
Q1 | 11 |
median | 14 |
Q3 | 18 |
95-th percentile | 21 |
Maximum | 23 |
Range | 23 |
Interquartile range (IQR) | 7 |
Descriptive statistics
Standard deviation | 5.038591 |
---|---|
Coefficient of variation (CV) | 0.35673966 |
Kurtosis | 0.1290635 |
Mean | 14.124 |
Median Absolute Deviation (MAD) | 4 |
Skewness | -0.59157679 |
Sum | 7062 |
Variance | 25.387399 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
12 | 49 | 9.8% |
18 | 42 | 8.4% |
13 | 41 | 8.2% |
19 | 40 | 8.0% |
14 | 37 | 7.4% |
15 | 32 | 6.4% |
17 | 31 | 6.2% |
16 | 29 | 5.8% |
11 | 26 | 5.2% |
9 | 24 | 4.8% |
Other values (14) | 149 |
Value | Count | Frequency (%) |
0 | 8 | 1.6% |
1 | 6 | 1.2% |
2 | 3 | 0.6% |
3 | 3 | 0.6% |
4 | 4 | 0.8% |
5 | 3 | 0.6% |
6 | 8 | 1.6% |
7 | 12 | |
8 | 17 | |
9 | 24 |
Value | Count | Frequency (%) |
23 | 7 | 1.4% |
22 | 15 | 3.0% |
21 | 23 | |
20 | 24 | |
19 | 40 | |
18 | 42 | |
17 | 31 | |
16 | 29 | |
15 | 32 | |
14 | 37 |
카드이용금액계(AMT_CORR)
Real number (ℝ)
Distinct | 339 |
---|---|
Distinct (%) | 67.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 286747.82 |
Minimum | 3018 |
---|---|
Maximum | 11058746 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 3018 |
---|---|
5-th percentile | 15064.85 |
Q1 | 38605.25 |
median | 90791.5 |
Q3 | 247350.25 |
95-th percentile | 1153881.5 |
Maximum | 11058746 |
Range | 11055728 |
Interquartile range (IQR) | 208745 |
Descriptive statistics
Standard deviation | 802798.64 |
---|---|
Coefficient of variation (CV) | 2.7996678 |
Kurtosis | 117.058 |
Mean | 286747.82 |
Median Absolute Deviation (MAD) | 65641.5 |
Skewness | 9.6932445 |
Sum | 1.4337391 × 108 |
Variance | 6.4448566 × 1011 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
15090.0 | 11 | 2.2% |
25150.0 | 10 | 2.0% |
35210.0 | 8 | 1.6% |
22635.0 | 8 | 1.6% |
60360.0 | 7 | 1.4% |
45270.0 | 7 | 1.4% |
32695.0 | 6 | 1.2% |
50300.0 | 6 | 1.2% |
30180.0 | 6 | 1.2% |
181080.0 | 5 | 1.0% |
Other values (329) | 426 |
Value | Count | Frequency (%) |
3018.0 | 1 | 0.2% |
4527.0 | 1 | 0.2% |
5030.0 | 4 | |
7042.0 | 1 | 0.2% |
7545.0 | 4 | |
8551.0 | 2 | |
9054.0 | 1 | 0.2% |
10060.0 | 2 | |
11820.5 | 1 | 0.2% |
12575.0 | 4 |
Value | Count | Frequency (%) |
11058746.5 | 1 | |
10378951.6 | 1 | |
5030000.0 | 1 | |
3431626.8 | 1 | |
2947580.0 | 1 | |
2590450.0 | 1 | |
2210685.0 | 1 | |
1926842.1 | 1 | |
1912988.3 | 1 | |
1911400.0 | 1 |
카드이용건수(USECT_CORR)
Real number (ℝ)
Distinct | 35 |
---|---|
Distinct (%) | 7.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 15.64114 |
Minimum | 5.03 |
---|---|
Maximum | 241.44 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 5.03 |
---|---|
5-th percentile | 5.03 |
Q1 | 5.03 |
median | 10.06 |
Q3 | 15.09 |
95-th percentile | 46.604 |
Maximum | 241.44 |
Range | 236.41 |
Interquartile range (IQR) | 10.06 |
Descriptive statistics
Standard deviation | 22.199805 |
---|---|
Coefficient of variation (CV) | 1.4193214 |
Kurtosis | 32.978572 |
Mean | 15.64114 |
Median Absolute Deviation (MAD) | 5.03 |
Skewness | 4.895071 |
Sum | 7820.57 |
Variance | 492.83132 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
5.03 | 224 | |
10.06 | 95 | |
15.09 | 44 | 8.8% |
20.12 | 28 | 5.6% |
25.15 | 22 | 4.4% |
30.18 | 12 | 2.4% |
14.13 | 11 | 2.2% |
35.21 | 8 | 1.6% |
9.1 | 8 | 1.6% |
19.16 | 6 | 1.2% |
Other values (25) | 42 | 8.4% |
Value | Count | Frequency (%) |
5.03 | 224 | |
9.1 | 8 | 1.6% |
10.06 | 95 | |
14.13 | 11 | 2.2% |
15.09 | 44 | 8.8% |
19.16 | 6 | 1.2% |
20.12 | 28 | 5.6% |
23.23 | 1 | 0.2% |
24.19 | 2 | 0.4% |
25.15 | 22 | 4.4% |
Value | Count | Frequency (%) |
241.44 | 1 | 0.2% |
174.13 | 1 | 0.2% |
136.04 | 1 | 0.2% |
130.78 | 2 | |
120.72 | 1 | 0.2% |
110.66 | 1 | 0.2% |
110.16 | 1 | 0.2% |
94.61 | 2 | |
85.51 | 4 | |
70.42 | 3 |
가맹점블록코드(BLCK_CD) | 내국인업종코드(SB_UPJONG_CD) | 기준년월(TS_YM) | 일별(TS_YMD) | 요일(DAW) | 시간대(TM) | 카드이용금액계(AMT_CORR) | 카드이용건수(USECT_CORR) | |
---|---|---|---|---|---|---|---|---|
가맹점블록코드(BLCK_CD) | 1.000 | 0.447 | 0.000 | 0.000 | 0.000 | 0.000 | 0.514 | 0.302 |
내국인업종코드(SB_UPJONG_CD) | 0.447 | 1.000 | 0.250 | 0.237 | 0.000 | 0.000 | 0.526 | 0.000 |
기준년월(TS_YM) | 0.000 | 0.250 | 1.000 | 1.000 | 0.043 | 0.154 | 0.073 | 0.111 |
일별(TS_YMD) | 0.000 | 0.237 | 1.000 | 1.000 | 0.054 | 0.158 | 0.075 | 0.114 |
요일(DAW) | 0.000 | 0.000 | 0.043 | 0.054 | 1.000 | 0.000 | 0.078 | 0.000 |
시간대(TM) | 0.000 | 0.000 | 0.154 | 0.158 | 0.000 | 1.000 | 0.000 | 0.000 |
카드이용금액계(AMT_CORR) | 0.514 | 0.526 | 0.073 | 0.075 | 0.078 | 0.000 | 1.000 | 0.410 |
카드이용건수(USECT_CORR) | 0.302 | 0.000 | 0.111 | 0.114 | 0.000 | 0.000 | 0.410 | 1.000 |
내국인업종코드(SB_UPJONG_CD) | 요일(DAW) | |
---|---|---|
내국인업종코드(SB_UPJONG_CD) | 1.000 | 0.000 |
요일(DAW) | 0.000 | 1.000 |
가맹점블록코드(BLCK_CD) | 기준년월(TS_YM) | 일별(TS_YMD) | 시간대(TM) | 카드이용금액계(AMT_CORR) | 카드이용건수(USECT_CORR) | 내국인업종코드(SB_UPJONG_CD) | 요일(DAW) | |
---|---|---|---|---|---|---|---|---|
가맹점블록코드(BLCK_CD) | 1.000 | 0.020 | 0.020 | -0.013 | 0.011 | -0.003 | 0.162 | 0.000 |
기준년월(TS_YM) | 0.020 | 1.000 | 1.000 | -0.078 | -0.012 | -0.009 | 0.104 | 0.026 |
일별(TS_YMD) | 0.020 | 1.000 | 1.000 | -0.078 | -0.013 | -0.011 | 0.097 | 0.033 |
시간대(TM) | -0.013 | -0.078 | -0.078 | 1.000 | 0.022 | 0.120 | 0.000 | 0.000 |
카드이용금액계(AMT_CORR) | 0.011 | -0.012 | -0.013 | 0.022 | 1.000 | 0.040 | 0.244 | 0.046 |
카드이용건수(USECT_CORR) | -0.003 | -0.009 | -0.011 | 0.120 | 0.040 | 1.000 | 0.000 | 0.000 |
내국인업종코드(SB_UPJONG_CD) | 0.162 | 0.104 | 0.097 | 0.000 | 0.244 | 0.000 | 1.000 | 0.000 |
요일(DAW) | 0.000 | 0.026 | 0.033 | 0.000 | 0.046 | 0.000 | 0.000 | 1.000 |
가맹점블록코드(BLCK_CD) | 내국인업종코드(SB_UPJONG_CD) | 기준년월(TS_YM) | 일별(TS_YMD) | 요일(DAW) | 시간대(TM) | 카드이용금액계(AMT_CORR) | 카드이용건수(USECT_CORR) | |
---|---|---|---|---|---|---|---|---|
0 | 231793 | SB001 | 201906 | 20190612 | 토요일 | 18 | 58851.0 | 20.12 |
1 | 11694 | SB007 | 201711 | 20171127 | 금요일 | 1 | 21629.0 | 10.06 |
2 | 420182 | SB054 | 201904 | 20190407 | 금요일 | 19 | 25150.0 | 60.36 |
3 | 158487 | SB039 | 201712 | 20171225 | 월요일 | 18 | 60360.0 | 10.06 |
4 | 33587 | SB016 | 201704 | 20170422 | 일요일 | 18 | 111666.0 | 5.03 |
5 | 17869 | SB006 | 201706 | 20170601 | 토요일 | 19 | 181080.0 | 5.03 |
6 | 151716 | SB049 | 202012 | 20201221 | 목요일 | 18 | 45270.0 | 10.06 |
7 | 274538 | SB005 | 202105 | 20210521 | 화요일 | 20 | 264578.0 | 174.13 |
8 | 209856 | SB006 | 201708 | 20170816 | 월요일 | 12 | 326044.6 | 5.03 |
9 | 168334 | SB013 | 201912 | 20191204 | 일요일 | 13 | 294154.4 | 10.06 |
가맹점블록코드(BLCK_CD) | 내국인업종코드(SB_UPJONG_CD) | 기준년월(TS_YM) | 일별(TS_YMD) | 요일(DAW) | 시간대(TM) | 카드이용금액계(AMT_CORR) | 카드이용건수(USECT_CORR) | |
---|---|---|---|---|---|---|---|---|
490 | 224440 | SB054 | 202008 | 20200831 | 화요일 | 17 | 35210.0 | 15.09 |
491 | 19855 | SB013 | 202107 | 20210726 | 금요일 | 17 | 32695.0 | 59.4 |
492 | 269079 | SB016 | 201811 | 20181124 | 월요일 | 14 | 145870.0 | 5.03 |
493 | 218418 | SB008 | 201811 | 20181105 | 수요일 | 9 | 45270.0 | 14.13 |
494 | 365651 | SB020 | 201909 | 20190920 | 금요일 | 15 | 15090.0 | 23.23 |
495 | 23342 | SB016 | 201703 | 20170314 | 토요일 | 19 | 87360.0 | 5.03 |
496 | 21173 | SB019 | 201804 | 20180414 | 월요일 | 14 | 219811.0 | 5.03 |
497 | 47892 | SB016 | 202106 | 20210621 | 금요일 | 15 | 364000.0 | 20.12 |
498 | 28521 | SB001 | 201811 | 20181106 | 목요일 | 19 | 251500.0 | 9.1 |
499 | 11707 | SB054 | 202004 | 20200423 | 수요일 | 15 | 313950.0 | 25.15 |