Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 500 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 26.0 KiB |
Average record size in memory | 53.3 B |
Variable types
Categorical | 1 |
---|---|
Numeric | 5 |
Dataset
Description | Sample data |
---|---|
Author | Shinhan Card |
URL | https://bigdata.seoul.go.kr/data/selectSampleData.do?sample_data_seq=318 |
Reproduction
Analysis started | 2023-12-10 14:59:03.764340 |
---|---|
Analysis finished | 2023-12-10 14:59:10.681982 |
Duration | 6.92 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
Industry major category (UPJONG_CLASS1)
Categorical
Distinct | 14 |
---|---|
Distinct (%) | 2.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
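Value | Count |
---|---|
유통 (Retail) | 88 |
전자상거래 (E-commerce) | 79 |
요식/유흥 (Food service/Entertainment) | 63 |
의료 (Medical) | 43 |
주유 (Fuel) | 40 |
Other values (9) | 187 |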
Length
Max length | 9 |
---|---|
Median length | 8 |
Mean length | 4.314 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 요식/유흥 (Food service/Entertainment) |
---|---|
2nd row | 유통 (Retail) |
3rd row | 유통 (Retail) |
4th row | 스포츠/문화/레저 (Sports/Culture/Leisure) |
5th row | 의료 (Medical) |
Common Values
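Value | Count | Frequency (%) |
---|---|---|
유통 (Retail) | 88 | 17.6% |
전자상거래 (E-commerce) | 79 | 15.8% |
요식/유흥 (Food service/Entertainment) | 63 | 12.6% |
의료 (Medical) | 43 | 8.6% |
주유 (Fuel) | 40 | 8.0% |
가정생활/서비스 (Household/Services) | 39 | 7.8% |
음/식료품 (Food/Beverages) | 34 | 6.8% |
스포츠/문화/레저 (Sports/Culture/Leisure) | 30 | 6.0% |
여행/교통 (Travel/Transportation) | 29 | 5.8% |
미용 (Beauty) | 15 | 3.0% |
Other values (4) | 40 | 8.0% |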
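Each Frequency (%) entry is the row's count divided by the 500 observations. A minimal sketch of that arithmetic, using the counts from the table above; the helper name `frequency_pct` is only illustrative and is not part of ydata-profiling:

```python
# Counts copied from the Common Values table; 500 is the number of observations.
N_OBSERVATIONS = 500

counts = {
    "유통": 88,
    "전자상거래": 79,
    "요식/유흥": 63,
    "의료": 43,
    "주유": 40,
    "가정생활/서비스": 39,
    "음/식료품": 34,
    "스포츠/문화/레저": 30,
    "여행/교통": 29,
    "미용": 15,
    "Other values (4)": 40,
}


def frequency_pct(count: int, total: int = N_OBSERVATIONS) -> float:
    """Share of rows holding a value, as a percentage rounded to one decimal."""
    return round(100 * count / total, 1)


# Hypothetical helper applied to every row of the table.
frequencies = {value: frequency_pct(count) for value, count in counts.items()}

for value, pct in frequencies.items():
    print(f"{value}: {pct}%")
```

The counts add up to 500, so the percentages cover the whole column.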
Reference date (YMD)
Real number (ℝ)
Distinct | 444 |
---|---|
Distinct (%) | 88.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20183282 |
Minimum | 20160101 |
---|---|
Maximum | 20210731 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 20160101 |
---|---|
5-th percentile | 20160409 |
Q1 | 20170528 |
median | 20180908 |
Q3 | 20200208 |
95-th percentile | 20210318 |
Maximum | 20210731 |
Range | 50630 |
Interquartile range (IQR) | 29680.75 |
Descriptive statistics
Standard deviation | 16132.927 |
---|---|
Coefficient of variation (CV) | 0.0007993213 |
Kurtosis | -1.1825464 |
Mean | 20183282 |
Median Absolute Deviation (MAD) | 10502.5 |
Skewness | 0.11523271 |
Sum | 1.0091641 × 10^10 |
Variance | 2.6027133 × 10^8 |
Monotonicity | Not monotonic |
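Because YMD stores dates as YYYYMMDD integers, summaries such as the mean or the range describe the integer encoding rather than elapsed time. A hedged sketch, assuming every value is a valid YYYYMMDD date (the report does not confirm this), that converts the reported minimum and maximum to calendar dates and measures the span in days; `parse_ymd` is a hypothetical helper:

```python
from datetime import date, datetime


def parse_ymd(value: int) -> date:
    """Interpret an integer such as 20160101 as the date 2016-01-01."""
    return datetime.strptime(str(value), "%Y%m%d").date()


# Minimum and maximum taken from the quantile statistics.
first = parse_ymd(20160101)
last = parse_ymd(20210731)

span_days = (last - first).days       # elapsed calendar days
integer_range = 20210731 - 20160101   # what the report lists as "Range"

print(first, last, span_days, integer_range)
```

The reported range is a difference of encoded integers, which is why it does not equal the number of days between the two dates.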
Common Values
Value | Count | Frequency (%) |
---|---|---|
20160604 | 3 | 0.6% |
20190708 | 3 | 0.6% |
20180607 | 3 | 0.6% |
20200831 | 3 | 0.6% |
20200802 | 3 | 0.6% |
20170303 | 2 | 0.4% |
20171002 | 2 | 0.4% |
20160212 | 2 | 0.4% |
20190629 | 2 | 0.4% |
20191105 | 2 | 0.4% |
Other values (434) | 475 | 95.0% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
20160101 | 1 | 0.2% |
20160102 | 2 | 0.4% |
20160106 | 1 | 0.2% |
20160110 | 1 | 0.2% |
20160117 | 1 | 0.2% |
20160120 | 1 | 0.2% |
20160128 | 1 | 0.2% |
20160201 | 1 | 0.2% |
20160203 | 1 | 0.2% |
20160212 | 2 | 0.4% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
20210731 | 2 | 0.4% |
20210728 | 1 | 0.2% |
20210723 | 2 | 0.4% |
20210721 | 1 | 0.2% |
20210714 | 1 | 0.2% |
20210709 | 1 | 0.2% |
20210704 | 1 | 0.2% |
20210701 | 1 | 0.2% |
20210629 | 1 | 0.2% |
20210628 | 1 | 0.2% |
Time-of-day band (TIME)
Real number (ℝ)
Distinct | 6 |
---|---|
Distinct (%) | 1.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.802 |
Minimum | 1 |
---|---|
Maximum | 6 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 4 |
Q3 | 5 |
95-th percentile | 6 |
Maximum | 6 |
Range | 5 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 1.4543129 |
---|---|
Coefficient of variation (CV) | 0.3825126 |
Kurtosis | -0.91117675 |
Mean | 3.802 |
Median Absolute Deviation (MAD) | 1 |
Skewness | -0.15451449 |
Sum | 1901 |
Variance | 2.1150261 |
Monotonicity | Not monotonic |
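The descriptive statistics are tied together by simple identities: the mean is the sum divided by the number of observations, the variance is the square of the standard deviation, and the coefficient of variation is the standard deviation divided by the mean. A small sketch that checks the reported TIME figures against those identities, allowing for rounding in the printed values:

```python
import math

# Figures copied from the TIME statistics and the dataset overview.
n_observations = 500
total = 1901
mean = 3.802
std = 1.4543129
variance = 2.1150261
cv = 0.3825126

# Each check compares a derived value with the reported one, within rounding.
mean_ok = math.isclose(total / n_observations, mean, rel_tol=1e-6)
variance_ok = math.isclose(std ** 2, variance, rel_tol=1e-6)
cv_ok = math.isclose(std / mean, cv, rel_tol=1e-6)

print(mean_ok, variance_ok, cv_ok)
```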
Common Values
Value | Count | Frequency (%) |
---|---|---|
3 | 111 | 22.2% |
5 | 108 | 21.6% |
4 | 107 | 21.4% |
6 | 71 | 14.2% |
2 | 71 | 14.2% |
1 | 32 | 6.4% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
1 | 32 | 6.4% |
2 | 71 | 14.2% |
3 | 111 | 22.2% |
4 | 107 | 21.4% |
5 | 108 | 21.6% |
6 | 71 | 14.2% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
6 | 71 | 14.2% |
5 | 108 | 21.6% |
4 | 107 | 21.4% |
3 | 111 | 22.2% |
2 | 71 | 14.2% |
1 | 32 | 6.4% |
Customer address census output area (TOT_REG_CD)
Real number (ℝ)
Distinct | 496 |
---|---|
Distinct (%) | 99.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.1147393 × 10^12 |
Minimum | 1.101053 × 10^12 |
---|---|
Maximum | 1.125072 × 10^12 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 1.101053 × 10^12 |
---|---|
5-th percentile | 1.1040031 × 10^12 |
Q1 | 1.1090638 × 10^12 |
median | 1.115066 × 10^12 |
Q3 | 1.1210713 × 10^12 |
95-th percentile | 1.124077 × 10^12 |
Maximum | 1.125072 × 10^12 |
Range | 2.4019 × 10^10 |
Interquartile range (IQR) | 1.2007489 × 10^10 |
Descriptive statistics
Standard deviation | 6.7868108 × 10^9 |
---|---|
Coefficient of variation (CV) | 0.0060882493 |
Kurtosis | -1.1166061 |
Mean | 1.1147393 × 10^12 |
Median Absolute Deviation (MAD) | 6.003525 × 10^9 |
Skewness | -0.19786845 |
Sum | 5.5736965 × 10^14 |
Variance | 4.6060801 × 10^19 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
---|---|---|
1122055020804 | 2 | 0.4% |
1112072010025 | 2 | 0.4% |
1108083020102 | 2 | 0.4% |
1124059030119 | 2 | 0.4% |
1108059020407 | 1 | 0.2% |
1115071030104 | 1 | 0.2% |
1114071010011 | 1 | 0.2% |
1124080020102 | 1 | 0.2% |
1106086010107 | 1 | 0.2% |
1123076010009 | 1 | 0.2% |
Other values (486) | 486 | 97.2% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
1101053020002 | 1 | 0.2% |
1101054010002 | 1 | 0.2% |
1101055020005 | 1 | 0.2% |
1101056020002 | 1 | 0.2% |
1101061030201 | 1 | 0.2% |
1101067010102 | 1 | 0.2% |
1101068010002 | 1 | 0.2% |
1101072010019 | 1 | 0.2% |
1102067020001 | 1 | 0.2% |
1102069010002 | 1 | 0.2% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
1125072020311 | 1 | 0.2% |
1125072010002 | 1 | 0.2% |
1125071020030 | 1 | 0.2% |
1125071020027 | 1 | 0.2% |
1125071020026 | 1 | 0.2% |
1125065022601 | 1 | 0.2% |
1125065010504 | 1 | 0.2% |
1125063020301 | 1 | 0.2% |
1125061020016 | 1 | 0.2% |
1125061020008 | 1 | 0.2% |
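TOT_REG_CD is profiled as a real number, but its 13-digit values look like identifiers, so the mean, variance, and scientific-notation display carry little meaning. The sketch below assumes the column is better handled as text. It shows that the rounded display drops the trailing digits, then counts codes by their leading digits. The 4-digit prefix length is purely illustrative and is not a documented structure of the code:

```python
from collections import Counter

# The ten smallest codes listed in the report.
codes = [
    1101053020002, 1101054010002, 1101055020005, 1101056020002,
    1101061030201, 1101067010102, 1101068010002, 1101072010019,
    1102067020001, 1102069010002,
]

# Formatting to seven significant digits mirrors the report's display
# ("1.101053 x 10^12") and cannot be reversed to the original code.
displayed = f"{codes[0]:.6e}"
recovered = int(float(displayed))

# Hypothetical prefix length, chosen only to demonstrate string handling.
PREFIX_LEN = 4
prefix_counts = Counter(str(code)[:PREFIX_LEN] for code in codes)

print(displayed, recovered, dict(prefix_counts))
```

Keeping the codes as strings (or integers) preserves every digit, which the floating-point summary does not.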
Total card spending amount (AMT_CORR)
Real number (ℝ)
Distinct | 418 |
---|---|
Distinct (%) | 83.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 657846.89 |
Minimum | 5 |
---|---|
Maximum | 40847926 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 5 |
---|---|
5-th percentile | 15064.85 |
Q1 | 78505.5 |
median | 269834.5 |
Q3 | 632651 |
95-th percentile | 1745415.3 |
Maximum | 40847926 |
Range | 40847921 |
Interquartile range (IQR) | 554145.5 |
Descriptive statistics
Standard deviation | 2221250.6 |
---|---|
Coefficient of variation (CV) | 3.3765465 |
Kurtosis | 221.90479 |
Mean | 657846.89 |
Median Absolute Deviation (MAD) | 219283 |
Skewness | 13.317176 |
Sum | 3.2892345 × 10^8 |
Variance | 4.9339544 × 10^12 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
---|---|---|
50300 | 7 | 1.4% |
150900 | 5 | 1.0% |
20120 | 4 | 0.8% |
251500 | 4 | 0.8% |
10060 | 4 | 0.8% |
30180 | 4 | 0.8% |
60360 | 4 | 0.8% |
301800 | 4 | 0.8% |
5030 | 4 | 0.8% |
100600 | 4 | 0.8% |
Other values (408) | 456 | 91.2% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
5 | 1 | 0.2% |
2012 | 2 | 0.4% |
5030 | 4 | 0.8% |
5533 | 1 | 0.2% |
6036 | 1 | 0.2% |
6539 | 1 | 0.2% |
8048 | 1 | 0.2% |
8551 | 1 | 0.2% |
9054 | 1 | 0.2% |
9557 | 2 | 0.4% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
40847926 | 1 | 0.2% |
15536815 | 1 | 0.2% |
11531979 | 1 | 0.2% |
11394962 | 1 | 0.2% |
10559831 | 1 | 0.2% |
7728042 | 1 | 0.2% |
5954202 | 1 | 0.2% |
5111989 | 1 | 0.2% |
5098006 | 1 | 0.2% |
4939158 | 1 | 0.2% |
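With a skewness of about 13.3 and a maximum roughly 62 times the mean, AMT_CORR has a heavy right tail. One common convention, which the report itself does not apply, is Tukey's rule: flag values above Q3 + 1.5 × IQR. A sketch under that assumption, using the reported quartiles and the ten largest values listed above:

```python
# Quartiles copied from the AMT_CORR quantile statistics.
q1 = 78505.5
q3 = 632651.0

iqr = q3 - q1                  # matches the reported IQR of 554145.5
upper_fence = q3 + 1.5 * iqr   # Tukey's upper fence (an assumed convention)

# The ten largest AMT_CORR values listed in the report.
largest = [
    40847926, 15536815, 11531979, 11394962, 10559831,
    7728042, 5954202, 5111989, 5098006, 4939158,
]

flagged = [value for value in largest if value > upper_fence]

print(iqr, upper_fence, len(flagged))
```

Even the reported 95th percentile (1745415.3) lies above this fence, so more than 5% of rows would be flagged. A threshold this aggressive may be unsuitable for spending data, and a log transform is another option to consider.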
Total card transaction count (USECT_CORR)
Real number (ℝ)
Distinct | 22 |
---|---|
Distinct (%) | 4.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 19.002 |
Minimum | 5 |
---|---|
Maximum | 236 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 5 |
---|---|
5-th percentile | 5 |
Q1 | 5 |
median | 10 |
Q3 | 25 |
95-th percentile | 55 |
Maximum | 236 |
Range | 231 |
Interquartile range (IQR) | 20 |
Descriptive statistics
Standard deviation | 23.37989 |
---|---|
Coefficient of variation (CV) | 1.230391 |
Kurtosis | 28.241506 |
Mean | 19.002 |
Median Absolute Deviation (MAD) | 5 |
Skewness | 4.2234093 |
Sum | 9501 |
Variance | 546.61923 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
---|---|---|
5 | 202 | 40.4% |
10 | 79 | 15.8% |
15 | 48 | 9.6% |
25 | 35 | 7.0% |
20 | 27 | 5.4% |
30 | 22 | 4.4% |
40 | 17 | 3.4% |
35 | 16 | 3.2% |
45 | 16 | 3.2% |
50 | 10 | 2.0% |
Other values (12) | 28 | 5.6% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
5 | 202 | 40.4% |
10 | 79 | 15.8% |
15 | 48 | 9.6% |
20 | 27 | 5.4% |
25 | 35 | 7.0% |
30 | 22 | 4.4% |
35 | 16 | 3.2% |
40 | 17 | 3.4% |
45 | 16 | 3.2% |
50 | 10 | 2.0% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
236 | 1 | 0.2% |
211 | 1 | 0.2% |
176 | 1 | 0.2% |
106 | 3 | 0.6% |
101 | 1 | 0.2% |
96 | 1 | 0.2% |
91 | 2 | 0.4% |
86 | 1 | 0.2% |
80 | 1 | 0.2% |
70 | 4 | 0.8% |
Variable | Industry major category (UPJONG_CLASS1) | Reference date (YMD) | Time-of-day band (TIME) | Customer address census output area (TOT_REG_CD) | Total card spending amount (AMT_CORR) | Total card transaction count (USECT_CORR) |
---|---|---|---|---|---|---|
Industry major category (UPJONG_CLASS1) | 1.000 | 0.087 | 0.000 | 0.123 | 0.079 | 0.077 |
Reference date (YMD) | 0.087 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 |
Time-of-day band (TIME) | 0.000 | 0.000 | 1.000 | 0.075 | 0.000 | 0.000 |
Customer address census output area (TOT_REG_CD) | 0.123 | 0.000 | 0.075 | 1.000 | 0.057 | 0.089 |
Total card spending amount (AMT_CORR) | 0.079 | 0.000 | 0.000 | 0.057 | 1.000 | 0.097 |
Total card transaction count (USECT_CORR) | 0.077 | 0.000 | 0.000 | 0.089 | 0.097 | 1.000 |
Variable | Reference date (YMD) | Time-of-day band (TIME) | Customer address census output area (TOT_REG_CD) | Total card spending amount (AMT_CORR) | Total card transaction count (USECT_CORR) | Industry major category (UPJONG_CLASS1) |
---|---|---|---|---|---|---|
Reference date (YMD) | 1.000 | 0.018 | 0.063 | -0.023 | -0.007 | 0.036 |
Time-of-day band (TIME) | 0.018 | 1.000 | -0.035 | -0.005 | 0.050 | 0.000 |
Customer address census output area (TOT_REG_CD) | 0.063 | -0.035 | 1.000 | 0.071 | -0.091 | 0.043 |
Total card spending amount (AMT_CORR) | -0.023 | -0.005 | 0.071 | 1.000 | 0.022 | 0.040 |
Total card transaction count (USECT_CORR) | -0.007 | 0.050 | -0.091 | 0.022 | 1.000 | 0.033 |
Industry major category (UPJONG_CLASS1) | 0.036 | 0.000 | 0.043 | 0.040 | 0.033 | 1.000 |
First rows
Index | Industry major category (UPJONG_CLASS1) | Reference date (YMD) | Time-of-day band (TIME) | Customer address census output area (TOT_REG_CD) | Total card spending amount (AMT_CORR) | Total card transaction count (USECT_CORR) |
---|---|---|---|---|---|---|
0 | 요식/유흥 (Food service/Entertainment) | 20161004 | 6 | 1123076010009 | 104624 | 10 |
1 | 유통 (Retail) | 20210303 | 5 | 1122060030003 | 382592 | 20 |
2 | 유통 (Retail) | 20170606 | 6 | 1123066022301 | 341688 | 5 |
3 | 스포츠/문화/레저 (Sports/Culture/Leisure) | 20171206 | 5 | 1124075020103 | 925118 | 5 |
4 | 의료 (Medical) | 20160909 | 6 | 1113075030002 | 217985 | 50 |
5 | 스포츠/문화/레저 (Sports/Culture/Leisure) | 20161121 | 3 | 1123073010108 | 592031 | 10 |
6 | 의료 (Medical) | 20191020 | 3 | 1116051010006 | 34959 | 5 |
7 | 주유 (Fuel) | 20170625 | 3 | 1121052030002 | 5030 | 20 |
8 | 음/식료품 (Food/Beverages) | 20180427 | 6 | 1108068010501 | 331980 | 5 |
9 | 스포츠/문화/레저 (Sports/Culture/Leisure) | 20180118 | 2 | 1122068040202 | 19617 | 5 |
Last rows
Index | Industry major category (UPJONG_CLASS1) | Reference date (YMD) | Time-of-day band (TIME) | Customer address census output area (TOT_REG_CD) | Total card spending amount (AMT_CORR) | Total card transaction count (USECT_CORR) |
---|---|---|---|---|---|---|
490 | 전자상거래 (E-commerce) | 20170808 | 3 | 1105062030506 | 301800 | 5 |
491 | 주유 (Fuel) | 20180531 | 5 | 1114077050301 | 2157870 | 35 |
492 | 여행/교통 (Travel/Transportation) | 20160731 | 4 | 1103071040006 | 48791 | 50 |
493 | 요식/유흥 (Food service/Entertainment) | 20160326 | 1 | 1123065010801 | 716825 | 5 |
494 | 전자상거래 (E-commerce) | 20180619 | 4 | 1125072020311 | 65390 | 10 |
495 | 전자상거래 (E-commerce) | 20170727 | 2 | 1120069010006 | 90540 | 25 |
496 | 유통 (Retail) | 20190406 | 3 | 1123072010303 | 1211325 | 20 |
497 | 교육/학원 (Education/Academies) | 20210107 | 3 | 1119072030101 | 266590 | 15 |
498 | 전자상거래 (E-commerce) | 20200426 | 4 | 1105063030203 | 547264 | 25 |
499 | 여행/교통 (Travel/Transportation) | 20160625 | 4 | 1103072030001 | 10559831 | 5 |