Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 500 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 26.0 KiB |
Average record size in memory | 53.3 B |
Variable types
Text | 1 |
---|---|
Numeric | 5 |
Dataset
Description | 샘플 데이터 |
---|---|
Author | 신한카드 |
URL | https://bigdata.seoul.go.kr/data/selectSampleData.do?sample_data_seq=318 |
소액결제건수(MICRO_PYM) has 115 (23.0%) zeros | Zeros |
Reproduction
Analysis started | 2024-04-20 14:17:35.368234 |
---|---|
Analysis finished | 2024-04-20 14:17:41.546379 |
Duration | 6.18 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
Distinct | 63 |
---|---|
Distinct (%) | 12.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
Value | Count | Frequency (%) |
ss016 | 23 | 4.6% |
ss068 | 22 | 4.4% |
ss066 | 19 | 3.8% |
ss048 | 18 | 3.6% |
ss055 | 17 | 3.4% |
ss069 | 17 | 3.4% |
ss030 | 16 | 3.2% |
ss006 | 16 | 3.2% |
ss008 | 15 | 3.0% |
ss013 | 15 | 3.0% |
Other values (53) | 322 |
Most occurring characters
Value | Count | Frequency (%) |
S | 1000 | |
0 | 646 | |
6 | 162 | 6.5% |
1 | 126 | 5.0% |
4 | 126 | 5.0% |
5 | 113 | 4.5% |
3 | 95 | 3.8% |
8 | 85 | 3.4% |
2 | 61 | 2.4% |
9 | 44 | 1.8% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 1500 | |
Uppercase Letter | 1000 |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 646 | |
6 | 162 | 10.8% |
1 | 126 | 8.4% |
4 | 126 | 8.4% |
5 | 113 | 7.5% |
3 | 95 | 6.3% |
8 | 85 | 5.7% |
2 | 61 | 4.1% |
9 | 44 | 2.9% |
7 | 42 | 2.8% |
Uppercase Letter
Value | Count | Frequency (%) |
S | 1000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 1500 | |
Latin | 1000 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 646 | |
6 | 162 | 10.8% |
1 | 126 | 8.4% |
4 | 126 | 8.4% |
5 | 113 | 7.5% |
3 | 95 | 6.3% |
8 | 85 | 5.7% |
2 | 61 | 4.1% |
9 | 44 | 2.9% |
7 | 42 | 2.8% |
Latin
Value | Count | Frequency (%) |
S | 1000 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 2500 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
S | 1000 | |
0 | 646 | |
6 | 162 | 6.5% |
1 | 126 | 5.0% |
4 | 126 | 5.0% |
5 | 113 | 4.5% |
3 | 95 | 3.8% |
8 | 85 | 3.4% |
2 | 61 | 2.4% |
9 | 44 | 1.8% |
기준년월(YM)
Real number (ℝ)
Distinct | 67 |
---|---|
Distinct (%) | 13.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 201840.73 |
Minimum | 201601 |
---|---|
Maximum | 202107 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 201601 |
---|---|
5-th percentile | 201604 |
Q1 | 201707 |
median | 201811.5 |
Q3 | 202002 |
95-th percentile | 202104 |
Maximum | 202107 |
Range | 506 |
Interquartile range (IQR) | 295 |
Descriptive statistics
Standard deviation | 159.91152 |
---|---|
Coefficient of variation (CV) | 0.00079226586 |
Kurtosis | -1.113749 |
Mean | 201840.73 |
Median Absolute Deviation (MAD) | 106 |
Skewness | 0.039627492 |
Sum | 1.0092036 × 108 |
Variance | 25571.693 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
202101 | 15 | 3.0% |
201812 | 13 | 2.6% |
201902 | 13 | 2.6% |
201708 | 12 | 2.4% |
201903 | 12 | 2.4% |
201805 | 12 | 2.4% |
201707 | 12 | 2.4% |
202009 | 12 | 2.4% |
201603 | 11 | 2.2% |
201901 | 11 | 2.2% |
Other values (57) | 377 |
Value | Count | Frequency (%) |
201601 | 7 | |
201602 | 5 | |
201603 | 11 | |
201604 | 6 | |
201605 | 6 | |
201606 | 8 | |
201607 | 5 | |
201608 | 10 | |
201609 | 5 | |
201610 | 8 |
Value | Count | Frequency (%) |
202107 | 10 | |
202106 | 4 | 0.8% |
202105 | 9 | |
202104 | 6 | 1.2% |
202103 | 8 | |
202102 | 3 | 0.6% |
202101 | 15 | |
202012 | 9 | |
202011 | 6 | 1.2% |
202010 | 8 |
시간대구간(TIME)
Real number (ℝ)
Distinct | 6 |
---|---|
Distinct (%) | 1.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.704 |
Minimum | 1 |
---|---|
Maximum | 6 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 4 |
Q3 | 5 |
95-th percentile | 6 |
Maximum | 6 |
Range | 5 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 1.4284484 |
---|---|
Coefficient of variation (CV) | 0.38565023 |
Kurtosis | -0.8720676 |
Mean | 3.704 |
Median Absolute Deviation (MAD) | 1 |
Skewness | -0.11519838 |
Sum | 1852 |
Variance | 2.0404649 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
4 | 118 | |
3 | 109 | |
5 | 103 | |
2 | 78 | |
6 | 58 | |
1 | 34 | 6.8% |
Value | Count | Frequency (%) |
1 | 34 | 6.8% |
2 | 78 | |
3 | 109 | |
4 | 118 | |
5 | 103 | |
6 | 58 |
Value | Count | Frequency (%) |
6 | 58 | |
5 | 103 | |
4 | 118 | |
3 | 109 | |
2 | 78 | |
1 | 34 | 6.8% |
고객주소블록코드(BLOCK_CD)
Real number (ℝ)
Distinct | 496 |
---|---|
Distinct (%) | 99.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 205688.84 |
Minimum | 8529 |
---|---|
Maximum | 502813 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 8529 |
---|---|
5-th percentile | 12907.1 |
Q1 | 148305 |
median | 217353 |
Q3 | 278712.75 |
95-th percentile | 416782.75 |
Maximum | 502813 |
Range | 494284 |
Interquartile range (IQR) | 130407.75 |
Descriptive statistics
Standard deviation | 129936.98 |
---|---|
Coefficient of variation (CV) | 0.63171623 |
Kurtosis | -0.78792106 |
Mean | 205688.84 |
Median Absolute Deviation (MAD) | 64891 |
Skewness | 0.035625156 |
Sum | 1.0284442 × 108 |
Variance | 1.6883619 × 1010 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
210343 | 2 | 0.4% |
219289 | 2 | 0.4% |
210784 | 2 | 0.4% |
413323 | 2 | 0.4% |
230873 | 1 | 0.2% |
224245 | 1 | 0.2% |
24568 | 1 | 0.2% |
27268 | 1 | 0.2% |
22652 | 1 | 0.2% |
11063 | 1 | 0.2% |
Other values (486) | 486 |
Value | Count | Frequency (%) |
8529 | 1 | |
8573 | 1 | |
8651 | 1 | |
9105 | 1 | |
9152 | 1 | |
9336 | 1 | |
9467 | 1 | |
9556 | 1 | |
9623 | 1 | |
9760 | 1 |
Value | Count | Frequency (%) |
502813 | 1 | |
502754 | 1 | |
500680 | 1 | |
499436 | 1 | |
499426 | 1 | |
422985 | 1 | |
422018 | 1 | |
421912 | 1 | |
421697 | 1 | |
421579 | 1 |
카드이용금액계(AMT_CORR)
Real number (ℝ)
Distinct | 55 |
---|---|
Distinct (%) | 11.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 41.166 |
Minimum | 5 |
---|---|
Maximum | 573 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 5 |
---|---|
5-th percentile | 5 |
Q1 | 5 |
median | 15 |
Q3 | 35 |
95-th percentile | 191.25 |
Maximum | 573 |
Range | 568 |
Interquartile range (IQR) | 30 |
Descriptive statistics
Standard deviation | 72.566896 |
---|---|
Coefficient of variation (CV) | 1.7627871 |
Kurtosis | 15.655285 |
Mean | 41.166 |
Median Absolute Deviation (MAD) | 10 |
Skewness | 3.561874 |
Sum | 20583 |
Variance | 5265.9544 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
5 | 171 | |
10 | 77 | |
15 | 45 | 9.0% |
30 | 28 | 5.6% |
25 | 24 | 4.8% |
20 | 18 | 3.6% |
35 | 13 | 2.6% |
40 | 10 | 2.0% |
55 | 10 | 2.0% |
45 | 8 | 1.6% |
Other values (45) | 96 |
Value | Count | Frequency (%) |
5 | 171 | |
10 | 77 | |
15 | 45 | 9.0% |
20 | 18 | 3.6% |
25 | 24 | 4.8% |
30 | 28 | 5.6% |
35 | 13 | 2.6% |
40 | 10 | 2.0% |
45 | 8 | 1.6% |
50 | 4 | 0.8% |
Value | Count | Frequency (%) |
573 | 1 | |
538 | 1 | |
402 | 1 | |
397 | 1 | |
392 | 1 | |
372 | 1 | |
362 | 1 | |
322 | 1 | |
302 | 2 | |
287 | 1 |
소액결제건수(MICRO_PYM)
Real number (ℝ)
ZEROS
 
Distinct | 40 |
---|---|
Distinct (%) | 8.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 25.34 |
Minimum | 0 |
---|---|
Maximum | 483 |
Zeros | 115 |
Zeros (%) | 23.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 5 |
median | 10 |
Q3 | 25 |
95-th percentile | 96 |
Maximum | 483 |
Range | 483 |
Interquartile range (IQR) | 20 |
Descriptive statistics
Standard deviation | 52.024162 |
---|---|
Coefficient of variation (CV) | 2.0530451 |
Kurtosis | 27.991625 |
Mean | 25.34 |
Median Absolute Deviation (MAD) | 10 |
Skewness | 4.7337291 |
Sum | 12670 |
Variance | 2706.5134 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
5 | 125 | |
0 | 115 | |
10 | 57 | |
15 | 33 | 6.6% |
20 | 32 | 6.4% |
25 | 25 | 5.0% |
30 | 18 | 3.6% |
35 | 15 | 3.0% |
70 | 9 | 1.8% |
55 | 9 | 1.8% |
Other values (30) | 62 |
Value | Count | Frequency (%) |
0 | 115 | |
5 | 125 | |
10 | 57 | |
15 | 33 | 6.6% |
20 | 32 | 6.4% |
25 | 25 | 5.0% |
30 | 18 | 3.6% |
35 | 15 | 3.0% |
40 | 8 | 1.6% |
45 | 7 | 1.4% |
Value | Count | Frequency (%) |
483 | 1 | |
402 | 1 | |
362 | 1 | |
357 | 1 | |
307 | 1 | |
292 | 1 | |
257 | 1 | |
226 | 1 | |
221 | 1 | |
211 | 1 |
서울시민업종코드(UPJONG_CD) | 기준년월(YM) | 시간대구간(TIME) | 고객주소블록코드(BLOCK_CD) | 카드이용금액계(AMT_CORR) | 소액결제건수(MICRO_PYM) | |
---|---|---|---|---|---|---|
서울시민업종코드(UPJONG_CD) | 1.000 | 0.000 | 0.000 | 0.211 | 0.296 | 0.478 |
기준년월(YM) | 0.000 | 1.000 | 0.028 | 0.032 | 0.000 | 0.248 |
시간대구간(TIME) | 0.000 | 0.028 | 1.000 | 0.000 | 0.136 | 0.000 |
고객주소블록코드(BLOCK_CD) | 0.211 | 0.032 | 0.000 | 1.000 | 0.161 | 0.137 |
카드이용금액계(AMT_CORR) | 0.296 | 0.000 | 0.136 | 0.161 | 1.000 | 0.000 |
소액결제건수(MICRO_PYM) | 0.478 | 0.248 | 0.000 | 0.137 | 0.000 | 1.000 |
기준년월(YM) | 시간대구간(TIME) | 고객주소블록코드(BLOCK_CD) | 카드이용금액계(AMT_CORR) | 소액결제건수(MICRO_PYM) | |
---|---|---|---|---|---|
기준년월(YM) | 1.000 | 0.008 | -0.044 | 0.018 | -0.035 |
시간대구간(TIME) | 0.008 | 1.000 | 0.035 | 0.003 | -0.052 |
고객주소블록코드(BLOCK_CD) | -0.044 | 0.035 | 1.000 | -0.040 | -0.046 |
카드이용금액계(AMT_CORR) | 0.018 | 0.003 | -0.040 | 1.000 | -0.078 |
소액결제건수(MICRO_PYM) | -0.035 | -0.052 | -0.046 | -0.078 | 1.000 |
서울시민업종코드(UPJONG_CD) | 기준년월(YM) | 시간대구간(TIME) | 고객주소블록코드(BLOCK_CD) | 카드이용금액계(AMT_CORR) | 소액결제건수(MICRO_PYM) | |
---|---|---|---|---|---|---|
0 | SS017 | 202006 | 6 | 11063 | 257 | 25 |
1 | SS013 | 201608 | 6 | 216137 | 86 | 10 |
2 | SS016 | 201802 | 3 | 224819 | 35 | 15 |
3 | SS055 | 201906 | 2 | 19869 | 10 | 0 |
4 | SS038 | 201612 | 5 | 28382 | 10 | 5 |
5 | SS043 | 201606 | 1 | 362903 | 20 | 35 |
6 | SS054 | 202101 | 6 | 17521 | 10 | 25 |
7 | SS054 | 202011 | 2 | 152539 | 75 | 5 |
8 | SS006 | 201608 | 2 | 157228 | 10 | 10 |
9 | SS008 | 202102 | 6 | 16708 | 15 | 5 |
서울시민업종코드(UPJONG_CD) | 기준년월(YM) | 시간대구간(TIME) | 고객주소블록코드(BLOCK_CD) | 카드이용금액계(AMT_CORR) | 소액결제건수(MICRO_PYM) | |
---|---|---|---|---|---|---|
490 | SS068 | 201806 | 1 | 364774 | 55 | 65 |
491 | SS003 | 201709 | 3 | 22044 | 5 | 5 |
492 | SS015 | 201701 | 1 | 15279 | 5 | 10 |
493 | SS044 | 202009 | 5 | 20378 | 10 | 45 |
494 | SS003 | 201602 | 4 | 210795 | 5 | 10 |
495 | SS017 | 201911 | 5 | 366735 | 5 | 5 |
496 | SS016 | 201803 | 2 | 155750 | 10 | 30 |
497 | SS081 | 202103 | 3 | 366426 | 5 | 5 |
498 | SS069 | 201707 | 3 | 214935 | 70 | 10 |
499 | SS069 | 201909 | 4 | 225316 | 30 | 5 |