Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 500 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 26.0 KiB |
Average record size in memory | 53.3 B |
Variable types
Text | 1 |
---|---|
Categorical | 1 |
Numeric | 4 |
Dataset
Description | 샘플 데이터 |
---|---|
Author | 서울시, 신한카드, KCB(코리아크레딧뷰로) |
URL | https://bigdata.seoul.go.kr/data/selectSampleData.do?sample_data_seq=321 |
홈쇼핑_결재(지출)건수(INDEX05_CNT) has 286 (57.2%) zeros | Zeros |
백화점_할인점_결제(지출)건수(INDEX05_CNT2) has 7 (1.4%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-10 15:02:13.136449 |
---|---|
Analysis finished | 2023-12-10 15:02:16.735168 |
Duration | 3.6 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
서울시_블록ID(BLK_CD)
Text
Distinct | 313 |
---|---|
Distinct (%) | 62.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
Value | Count | Frequency (%) |
1*3*1 | 8 | 1.6% |
2*9*1 | 8 | 1.6% |
2*3*9 | 6 | 1.2% |
2*4*6 | 5 | 1.0% |
2*7*7 | 5 | 1.0% |
3*5*9 | 5 | 1.0% |
2*2*5 | 5 | 1.0% |
2*3*7 | 5 | 1.0% |
2*7*3 | 5 | 1.0% |
3*3*5 | 5 | 1.0% |
Other values (253) | 443 |
Most occurring characters
Value | Count | Frequency (%) |
* | 1378 | |
2 | 342 | 11.9% |
1 | 223 | 7.8% |
3 | 212 | 7.4% |
4 | 138 | 4.8% |
9 | 108 | 3.8% |
5 | 105 | 3.7% |
6 | 92 | 3.2% |
0 | 91 | 3.2% |
8 | 87 | 3.0% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 1484 | |
Other Punctuation | 1378 |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
2 | 342 | |
1 | 223 | |
3 | 212 | |
4 | 138 | |
9 | 108 | 7.3% |
5 | 105 | 7.1% |
6 | 92 | 6.2% |
0 | 91 | 6.1% |
8 | 87 | 5.9% |
7 | 86 | 5.8% |
Other Punctuation
Value | Count | Frequency (%) |
* | 1378 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 2862 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
* | 1378 | |
2 | 342 | 11.9% |
1 | 223 | 7.8% |
3 | 212 | 7.4% |
4 | 138 | 4.8% |
9 | 108 | 3.8% |
5 | 105 | 3.7% |
6 | 92 | 3.2% |
0 | 91 | 3.2% |
8 | 87 | 3.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 2862 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
* | 1378 | |
2 | 342 | 11.9% |
1 | 223 | 7.8% |
3 | 212 | 7.4% |
4 | 138 | 4.8% |
9 | 108 | 3.8% |
5 | 105 | 3.7% |
6 | 92 | 3.2% |
0 | 91 | 3.2% |
8 | 87 | 3.0% |
성별(GENDER)
Categorical
Distinct | 2 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
2 | |
---|---|
1 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2 |
---|---|
2nd row | 2 |
3rd row | 2 |
4th row | 2 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
2 | 261 | |
1 | 239 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2 | 261 | |
1 | 239 |
연령대(AGE)
Real number (ℝ)
Distinct | 7 |
---|---|
Distinct (%) | 1.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.068 |
Minimum | 1 |
---|---|
Maximum | 7 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 4 |
Q3 | 6 |
95-th percentile | 7 |
Maximum | 7 |
Range | 6 |
Interquartile range (IQR) | 3 |
Descriptive statistics
Standard deviation | 1.7915902 |
---|---|
Coefficient of variation (CV) | 0.44041058 |
Kurtosis | -1.0558941 |
Mean | 4.068 |
Median Absolute Deviation (MAD) | 1 |
Skewness | -0.010112195 |
Sum | 2034 |
Variance | 3.2097956 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
3 | 89 | |
4 | 86 | |
6 | 81 | |
5 | 80 | |
2 | 73 | |
7 | 50 | |
1 | 41 |
Value | Count | Frequency (%) |
1 | 41 | |
2 | 73 | |
3 | 89 | |
4 | 86 | |
5 | 80 | |
6 | 81 | |
7 | 50 |
Value | Count | Frequency (%) |
7 | 50 | |
6 | 81 | |
5 | 80 | |
4 | 86 | |
3 | 89 | |
2 | 73 | |
1 | 41 |
홈쇼핑_결재(지출)건수(INDEX05_CNT)
Real number (ℝ)
ZEROS
 
Distinct | 101 |
---|---|
Distinct (%) | 20.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 28.9164 |
Minimum | 0 |
---|---|
Maximum | 1125.5 |
Zeros | 286 |
Zeros (%) | 57.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 14.7 |
95-th percentile | 143.93 |
Maximum | 1125.5 |
Range | 1125.5 |
Interquartile range (IQR) | 14.7 |
Descriptive statistics
Standard deviation | 92.172013 |
---|---|
Coefficient of variation (CV) | 3.1875342 |
Kurtosis | 53.324804 |
Mean | 28.9164 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 6.3033117 |
Sum | 14458.2 |
Variance | 8495.68 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0 | 286 | |
4.9 | 31 | 6.2% |
5.0 | 26 | 5.2% |
9.9 | 11 | 2.2% |
5.1 | 10 | 2.0% |
14.9 | 7 | 1.4% |
15.0 | 5 | 1.0% |
15.1 | 4 | 0.8% |
24.7 | 4 | 0.8% |
14.8 | 4 | 0.8% |
Other values (91) | 112 | 22.4% |
Value | Count | Frequency (%) |
0.0 | 286 | |
4.9 | 31 | 6.2% |
5.0 | 26 | 5.2% |
5.1 | 10 | 2.0% |
9.8 | 3 | 0.6% |
9.9 | 11 | 2.2% |
10.0 | 3 | 0.6% |
10.1 | 3 | 0.6% |
14.7 | 4 | 0.8% |
14.8 | 4 | 0.8% |
Value | Count | Frequency (%) |
1125.5 | 1 | |
679.8 | 1 | |
578.2 | 1 | |
540.6 | 1 | |
517.4 | 1 | |
507.5 | 1 | |
493.7 | 1 | |
387.4 | 1 | |
376.6 | 1 | |
337.8 | 1 |
백화점_할인점_결제(지출)건수(INDEX05_CNT2)
Real number (ℝ)
ZEROS
 
Distinct | 466 |
---|---|
Distinct (%) | 93.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1378.2662 |
Minimum | 0 |
---|---|
Maximum | 11744.8 |
Zeros | 7 |
Zeros (%) | 1.4% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 10.1 |
Q1 | 178.125 |
median | 651 |
Q3 | 1840.525 |
95-th percentile | 5020.71 |
Maximum | 11744.8 |
Range | 11744.8 |
Interquartile range (IQR) | 1662.4 |
Descriptive statistics
Standard deviation | 1778.8555 |
---|---|
Coefficient of variation (CV) | 1.2906472 |
Kurtosis | 5.5302476 |
Mean | 1378.2662 |
Median Absolute Deviation (MAD) | 586.2 |
Skewness | 2.1600106 |
Sum | 689133.1 |
Variance | 3164326.8 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
5.0 | 9 | 1.8% |
0.0 | 7 | 1.4% |
14.9 | 6 | 1.2% |
4.9 | 4 | 0.8% |
20.2 | 3 | 0.6% |
39.9 | 3 | 0.6% |
10.0 | 2 | 0.4% |
74.5 | 2 | 0.4% |
14.7 | 2 | 0.4% |
74.3 | 2 | 0.4% |
Other values (456) | 460 |
Value | Count | Frequency (%) |
0.0 | 7 | |
4.9 | 4 | |
5.0 | 9 | |
5.1 | 1 | 0.2% |
9.9 | 1 | 0.2% |
10.0 | 2 | 0.4% |
10.1 | 2 | 0.4% |
14.7 | 2 | 0.4% |
14.8 | 1 | 0.2% |
14.9 | 6 |
Value | Count | Frequency (%) |
11744.8 | 1 | |
9090.9 | 1 | |
8647.7 | 1 | |
8553.5 | 1 | |
8427.4 | 1 | |
8386.5 | 1 | |
8344.1 | 1 | |
7684.5 | 1 | |
7471.2 | 1 | |
7380.4 | 1 |
홈쇼핑_지수(INDEX05)
Real number (ℝ)
Distinct | 10 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.87 |
Minimum | 0 |
---|---|
Maximum | 9 |
Zeros | 5 |
Zeros (%) | 1.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1 |
Q1 | 2 |
median | 3 |
Q3 | 6 |
95-th percentile | 9 |
Maximum | 9 |
Range | 9 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 2.5569858 |
---|---|
Coefficient of variation (CV) | 0.66071984 |
Kurtosis | -0.93148931 |
Mean | 3.87 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 0.57727 |
Sum | 1935 |
Variance | 6.5381764 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 99 | |
3 | 92 | |
2 | 90 | |
5 | 46 | |
8 | 45 | |
6 | 36 | 7.2% |
7 | 32 | 6.4% |
4 | 29 | 5.8% |
9 | 26 | 5.2% |
0 | 5 | 1.0% |
Value | Count | Frequency (%) |
0 | 5 | 1.0% |
1 | 99 | |
2 | 90 | |
3 | 92 | |
4 | 29 | 5.8% |
5 | 46 | |
6 | 36 | 7.2% |
7 | 32 | 6.4% |
8 | 45 | |
9 | 26 | 5.2% |
Value | Count | Frequency (%) |
9 | 26 | 5.2% |
8 | 45 | |
7 | 32 | 6.4% |
6 | 36 | 7.2% |
5 | 46 | |
4 | 29 | 5.8% |
3 | 92 | |
2 | 90 | |
1 | 99 | |
0 | 5 | 1.0% |
성별(GENDER) | 연령대(AGE) | 홈쇼핑_결재(지출)건수(INDEX05_CNT) | 백화점_할인점_결제(지출)건수(INDEX05_CNT2) | 홈쇼핑_지수(INDEX05) | |
---|---|---|---|---|---|
성별(GENDER) | 1.000 | 0.000 | 0.000 | 0.079 | 0.127 |
연령대(AGE) | 0.000 | 1.000 | 0.000 | 0.058 | 0.000 |
홈쇼핑_결재(지출)건수(INDEX05_CNT) | 0.000 | 0.000 | 1.000 | 0.000 | 0.045 |
백화점_할인점_결제(지출)건수(INDEX05_CNT2) | 0.079 | 0.058 | 0.000 | 1.000 | 0.000 |
홈쇼핑_지수(INDEX05) | 0.127 | 0.000 | 0.045 | 0.000 | 1.000 |
연령대(AGE) | 홈쇼핑_결재(지출)건수(INDEX05_CNT) | 백화점_할인점_결제(지출)건수(INDEX05_CNT2) | 홈쇼핑_지수(INDEX05) | 성별(GENDER) | |
---|---|---|---|---|---|
연령대(AGE) | 1.000 | -0.015 | -0.033 | 0.018 | 0.000 |
홈쇼핑_결재(지출)건수(INDEX05_CNT) | -0.015 | 1.000 | -0.012 | 0.055 | 0.000 |
백화점_할인점_결제(지출)건수(INDEX05_CNT2) | -0.033 | -0.012 | 1.000 | -0.026 | 0.078 |
홈쇼핑_지수(INDEX05) | 0.018 | 0.055 | -0.026 | 1.000 | 0.097 |
성별(GENDER) | 0.000 | 0.000 | 0.078 | 0.097 | 1.000 |
서울시_블록ID(BLK_CD) | 성별(GENDER) | 연령대(AGE) | 홈쇼핑_결재(지출)건수(INDEX05_CNT) | 백화점_할인점_결제(지출)건수(INDEX05_CNT2) | 홈쇼핑_지수(INDEX05) | |
---|---|---|---|---|---|---|
0 | 2*2*1* | 2 | 5 | 0.0 | 125.0 | 8 |
1 | 2*1*1* | 2 | 1 | 0.0 | 427.2 | 6 |
2 | 1*9*0* | 2 | 2 | 0.0 | 363.0 | 6 |
3 | 2*4*5* | 2 | 4 | 0.0 | 14.9 | 3 |
4 | 4*9*5* | 1 | 5 | 113.8 | 130.2 | 8 |
5 | 2*0*8* | 1 | 6 | 578.2 | 89.7 | 1 |
6 | 2*9*5* | 2 | 5 | 24.7 | 5135.2 | 8 |
7 | 2*1* | 2 | 3 | 0.0 | 562.1 | 2 |
8 | 3*3*9* | 2 | 4 | 0.0 | 4191.9 | 2 |
9 | 2*5*1* | 1 | 6 | 0.0 | 20.1 | 3 |
서울시_블록ID(BLK_CD) | 성별(GENDER) | 연령대(AGE) | 홈쇼핑_결재(지출)건수(INDEX05_CNT) | 백화점_할인점_결제(지출)건수(INDEX05_CNT2) | 홈쇼핑_지수(INDEX05) | |
---|---|---|---|---|---|---|
490 | 3*1*5* | 2 | 3 | 4.9 | 5446.9 | 5 |
491 | 3*0*6* | 2 | 6 | 5.0 | 273.1 | 3 |
492 | 2*9*1 | 1 | 6 | 0.0 | 854.7 | 3 |
493 | 1*2*1* | 2 | 4 | 0.0 | 2762.8 | 3 |
494 | 1*3*1* | 1 | 1 | 0.0 | 90.2 | 3 |
495 | 4*9*6* | 2 | 5 | 0.0 | 9.9 | 2 |
496 | 2*8*0* | 1 | 7 | 5.0 | 5.0 | 3 |
497 | 1*3*1* | 2 | 4 | 15.1 | 3323.9 | 2 |
498 | 2*3*7 | 1 | 6 | 0.0 | 178.8 | 3 |
499 | 2*7*7* | 2 | 2 | 5.1 | 9090.9 | 1 |