Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 500 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 38.7 KiB |
Average record size in memory | 79.3 B |
Variable types
Numeric | 6 |
---|---|
Text | 1 |
Categorical | 2 |
Dataset
Description | 샘플 데이터 |
---|---|
Author | 서울시(신용보증재단) |
URL | https://bigdata.seoul.go.kr/data/selectSampleData.do?sample_data_seq=324 |
Reproduction
Analysis started | 2024-04-16 19:18:43.565525 |
---|---|
Analysis finished | 2024-04-16 19:18:47.326198 |
Duration | 3.76 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
기준년월(STD_YM)
Real number (ℝ)
Distinct | 24 |
---|---|
Distinct (%) | 4.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 201862.08 |
Minimum | 201801 |
---|---|
Maximum | 201912 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 201801 |
---|---|
5-th percentile | 201802 |
Q1 | 201807 |
median | 201902 |
Q3 | 201907 |
95-th percentile | 201911 |
Maximum | 201912 |
Range | 111 |
Interquartile range (IQR) | 100 |
Descriptive statistics
Standard deviation | 49.837467 |
---|---|
Coefficient of variation (CV) | 0.0002468887 |
Kurtosis | -1.9404641 |
Mean | 201862.08 |
Median Absolute Deviation (MAD) | 9 |
Skewness | -0.22440154 |
Sum | 1.0093104 × 108 |
Variance | 2483.7731 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
201909 | 31 | 6.2% |
201905 | 30 | 6.0% |
201907 | 29 | 5.8% |
201903 | 27 | 5.4% |
201806 | 26 | 5.2% |
201908 | 26 | 5.2% |
201807 | 23 | 4.6% |
201910 | 22 | 4.4% |
201808 | 22 | 4.4% |
201810 | 21 | 4.2% |
Other values (14) | 243 |
Value | Count | Frequency (%) |
201801 | 15 | |
201802 | 16 | |
201803 | 19 | |
201804 | 16 | |
201805 | 18 | |
201806 | 26 | |
201807 | 23 | |
201808 | 22 | |
201809 | 19 | |
201810 | 21 |
Value | Count | Frequency (%) |
201912 | 17 | |
201911 | 19 | |
201910 | 22 | |
201909 | 31 | |
201908 | 26 | |
201907 | 29 | |
201906 | 18 | |
201905 | 30 | |
201904 | 20 | |
201903 | 27 |
블록코드(BLCK_CD)
Text
Distinct | 302 |
---|---|
Distinct (%) | 60.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
Value | Count | Frequency (%) |
2*1*9 | 9 | 1.8% |
2*4*5 | 7 | 1.4% |
2*0*4 | 6 | 1.2% |
2*7*3 | 6 | 1.2% |
2*6*6 | 6 | 1.2% |
2*9*1 | 6 | 1.2% |
2*9*7 | 6 | 1.2% |
2*6*3 | 5 | 1.0% |
2*3*1 | 5 | 1.0% |
2*1*2 | 5 | 1.0% |
Other values (247) | 439 |
Most occurring characters
Value | Count | Frequency (%) |
* | 1383 | |
2 | 343 | 11.9% |
3 | 213 | 7.4% |
1 | 197 | 6.8% |
4 | 145 | 5.0% |
9 | 115 | 4.0% |
5 | 105 | 3.7% |
0 | 96 | 3.3% |
7 | 95 | 3.3% |
8 | 93 | 3.2% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 1493 | |
Other Punctuation | 1383 |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
2 | 343 | |
3 | 213 | |
1 | 197 | |
4 | 145 | |
9 | 115 | 7.7% |
5 | 105 | 7.0% |
0 | 96 | 6.4% |
7 | 95 | 6.4% |
8 | 93 | 6.2% |
6 | 91 | 6.1% |
Other Punctuation
Value | Count | Frequency (%) |
* | 1383 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 2876 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
* | 1383 | |
2 | 343 | 11.9% |
3 | 213 | 7.4% |
1 | 197 | 6.8% |
4 | 145 | 5.0% |
9 | 115 | 4.0% |
5 | 105 | 3.7% |
0 | 96 | 3.3% |
7 | 95 | 3.3% |
8 | 93 | 3.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 2876 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
* | 1383 | |
2 | 343 | 11.9% |
3 | 213 | 7.4% |
1 | 197 | 6.8% |
4 | 145 | 5.0% |
9 | 115 | 4.0% |
5 | 105 | 3.7% |
0 | 96 | 3.3% |
7 | 95 | 3.3% |
8 | 93 | 3.2% |
통계청상품코드(STAT_CD)
Categorical
Distinct | 9 |
---|---|
Distinct (%) | 1.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
A | |
---|---|
B | |
E | |
L | |
F | 14 |
Other values (4) |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | L |
---|---|
2nd row | B |
3rd row | A |
4th row | A |
5th row | B |
Common Values
Value | Count | Frequency (%) |
A | 201 | |
B | 139 | |
E | 80 | 16.0% |
L | 35 | 7.0% |
F | 14 | 2.8% |
J | 11 | 2.2% |
C | 11 | 2.2% |
G | 5 | 1.0% |
I | 4 | 0.8% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
a | 201 | |
b | 139 | |
e | 80 | 16.0% |
l | 35 | 7.0% |
f | 14 | 2.8% |
j | 11 | 2.2% |
c | 11 | 2.2% |
g | 5 | 1.0% |
i | 4 | 0.8% |
성별코드(SEX_CD)
Categorical
Distinct | 2 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
2 | |
---|---|
1 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2 |
---|---|
2nd row | 2 |
3rd row | 2 |
4th row | 2 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
2 | 262 | |
1 | 238 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2 | 262 | |
1 | 238 |
연령대코드(AGE_CD)
Real number (ℝ)
Distinct | 7 |
---|---|
Distinct (%) | 1.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.802 |
Minimum | 1 |
---|---|
Maximum | 7 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 3 |
median | 4 |
Q3 | 5 |
95-th percentile | 6 |
Maximum | 7 |
Range | 6 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 1.4693925 |
---|---|
Coefficient of variation (CV) | 0.38647882 |
Kurtosis | -0.73796547 |
Mean | 3.802 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 0.21617085 |
Sum | 1901 |
Variance | 2.1591142 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
3 | 111 | |
4 | 105 | |
5 | 105 | |
2 | 100 | |
6 | 46 | |
7 | 19 | 3.8% |
1 | 14 | 2.8% |
Value | Count | Frequency (%) |
1 | 14 | 2.8% |
2 | 100 | |
3 | 111 | |
4 | 105 | |
5 | 105 | |
6 | 46 | |
7 | 19 | 3.8% |
Value | Count | Frequency (%) |
7 | 19 | 3.8% |
6 | 46 | |
5 | 105 | |
4 | 105 | |
3 | 111 | |
2 | 100 | |
1 | 14 | 2.8% |
시간대코드(TIME_CD)
Real number (ℝ)
Distinct | 6 |
---|---|
Distinct (%) | 1.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.758 |
Minimum | 1 |
---|---|
Maximum | 6 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 2 |
median | 4 |
Q3 | 5 |
95-th percentile | 6 |
Maximum | 6 |
Range | 5 |
Interquartile range (IQR) | 3 |
Descriptive statistics
Standard deviation | 1.6076707 |
---|---|
Coefficient of variation (CV) | 0.42779956 |
Kurtosis | -1.1862673 |
Mean | 3.758 |
Median Absolute Deviation (MAD) | 1 |
Skewness | -0.13344248 |
Sum | 1879 |
Variance | 2.5846052 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
5 | 106 | |
3 | 91 | |
6 | 89 | |
2 | 88 | |
4 | 80 | |
1 | 46 |
Value | Count | Frequency (%) |
1 | 46 | |
2 | 88 | |
3 | 91 | |
4 | 80 | |
5 | 106 | |
6 | 89 |
Value | Count | Frequency (%) |
6 | 89 | |
5 | 106 | |
4 | 80 | |
3 | 91 | |
2 | 88 | |
1 | 46 |
구매고객수(ACC_CNT)
Real number (ℝ)
Distinct | 66 |
---|---|
Distinct (%) | 13.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 26.276 |
Minimum | 1 |
---|---|
Maximum | 3970 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 2 |
Q3 | 8 |
95-th percentile | 56.2 |
Maximum | 3970 |
Range | 3969 |
Interquartile range (IQR) | 7 |
Descriptive statistics
Standard deviation | 193.1218 |
---|---|
Coefficient of variation (CV) | 7.349741 |
Kurtosis | 351.59852 |
Mean | 26.276 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 17.644547 |
Sum | 13138 |
Variance | 37296.028 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 173 | |
2 | 86 | |
3 | 36 | 7.2% |
4 | 27 | 5.4% |
5 | 18 | 3.6% |
6 | 17 | 3.4% |
8 | 16 | 3.2% |
7 | 15 | 3.0% |
9 | 11 | 2.2% |
13 | 8 | 1.6% |
Other values (56) | 93 |
Value | Count | Frequency (%) |
1 | 173 | |
2 | 86 | |
3 | 36 | 7.2% |
4 | 27 | 5.4% |
5 | 18 | 3.6% |
6 | 17 | 3.4% |
7 | 15 | 3.0% |
8 | 16 | 3.2% |
9 | 11 | 2.2% |
10 | 6 | 1.2% |
Value | Count | Frequency (%) |
3970 | 1 | |
873 | 1 | |
758 | 1 | |
706 | 1 | |
704 | 1 | |
514 | 1 | |
356 | 1 | |
274 | 1 | |
258 | 1 | |
233 | 1 |
구매건수(PURH_CNT)
Real number (ℝ)
Distinct | 87 |
---|---|
Distinct (%) | 17.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 91.26 |
Minimum | 1 |
---|---|
Maximum | 12467 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 2 |
median | 5 |
Q3 | 13 |
95-th percentile | 125.6 |
Maximum | 12467 |
Range | 12466 |
Interquartile range (IQR) | 11 |
Descriptive statistics
Standard deviation | 793.62579 |
---|---|
Coefficient of variation (CV) | 8.6963159 |
Kurtosis | 188.65869 |
Mean | 91.26 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 13.329312 |
Sum | 45630 |
Variance | 629841.89 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 85 | |
1 | 83 | |
4 | 37 | 7.4% |
3 | 36 | 7.2% |
5 | 29 | 5.8% |
6 | 26 | 5.2% |
8 | 22 | 4.4% |
7 | 21 | 4.2% |
11 | 10 | 2.0% |
9 | 8 | 1.6% |
Other values (77) | 143 |
Value | Count | Frequency (%) |
1 | 83 | |
2 | 85 | |
3 | 36 | |
4 | 37 | |
5 | 29 | 5.8% |
6 | 26 | 5.2% |
7 | 21 | 4.2% |
8 | 22 | 4.4% |
9 | 8 | 1.6% |
10 | 7 | 1.4% |
Value | Count | Frequency (%) |
12467 | 1 | |
10781 | 1 | |
5566 | 1 | |
2798 | 1 | |
2252 | 1 | |
761 | 1 | |
565 | 1 | |
513 | 1 | |
449 | 1 | |
435 | 1 |
구매금액(PURH_AMT)
Real number (ℝ)
Distinct | 142 |
---|---|
Distinct (%) | 28.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 652258 |
Minimum | 1000 |
---|---|
Maximum | 51244000 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 1000 |
---|---|
5-th percentile | 1000 |
Q1 | 5000 |
median | 13000 |
Q3 | 38500 |
95-th percentile | 2060700 |
Maximum | 51244000 |
Range | 51243000 |
Interquartile range (IQR) | 33500 |
Descriptive statistics
Standard deviation | 3979954.8 |
---|---|
Coefficient of variation (CV) | 6.1018106 |
Kurtosis | 106.08013 |
Mean | 652258 |
Median Absolute Deviation (MAD) | 10000 |
Skewness | 9.7775636 |
Sum | 3.26129 × 108 |
Variance | 1.584004 × 1013 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
5000 | 41 | 8.2% |
1000 | 30 | 6.0% |
9000 | 29 | 5.8% |
3000 | 26 | 5.2% |
4000 | 24 | 4.8% |
2000 | 20 | 4.0% |
6000 | 18 | 3.6% |
8000 | 13 | 2.6% |
14000 | 13 | 2.6% |
11000 | 12 | 2.4% |
Other values (132) | 274 |
Value | Count | Frequency (%) |
1000 | 30 | |
2000 | 20 | |
3000 | 26 | |
4000 | 24 | |
5000 | 41 | |
6000 | 18 | |
7000 | 10 | 2.0% |
8000 | 13 | 2.6% |
9000 | 29 | |
10000 | 12 | 2.4% |
Value | Count | Frequency (%) |
51244000 | 1 | |
44478000 | 1 | |
41526000 | 1 | |
24636000 | 1 | |
17272000 | 1 | |
11908000 | 1 | |
11016000 | 1 | |
10910000 | 1 | |
10392000 | 1 | |
8286000 | 1 |
기준년월(STD_YM) | 통계청상품코드(STAT_CD) | 성별코드(SEX_CD) | 연령대코드(AGE_CD) | 시간대코드(TIME_CD) | 구매고객수(ACC_CNT) | 구매건수(PURH_CNT) | 구매금액(PURH_AMT) | |
---|---|---|---|---|---|---|---|---|
기준년월(STD_YM) | 1.000 | 0.045 | 0.000 | 0.028 | 0.000 | 0.000 | 0.000 | 0.000 |
통계청상품코드(STAT_CD) | 0.045 | 1.000 | 0.000 | 0.000 | 0.040 | 0.052 | 0.000 | 0.131 |
성별코드(SEX_CD) | 0.000 | 0.000 | 1.000 | 0.068 | 0.101 | 0.010 | 0.000 | 0.000 |
연령대코드(AGE_CD) | 0.028 | 0.000 | 0.068 | 1.000 | 0.000 | 0.000 | 0.000 | 0.293 |
시간대코드(TIME_CD) | 0.000 | 0.040 | 0.101 | 0.000 | 1.000 | 0.123 | 0.000 | 0.082 |
구매고객수(ACC_CNT) | 0.000 | 0.052 | 0.010 | 0.000 | 0.123 | 1.000 | 0.404 | 0.000 |
구매건수(PURH_CNT) | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.404 | 1.000 | 0.000 |
구매금액(PURH_AMT) | 0.000 | 0.131 | 0.000 | 0.293 | 0.082 | 0.000 | 0.000 | 1.000 |
통계청상품코드(STAT_CD) | 성별코드(SEX_CD) | |
---|---|---|
통계청상품코드(STAT_CD) | 1.000 | 0.000 |
성별코드(SEX_CD) | 0.000 | 1.000 |
기준년월(STD_YM) | 연령대코드(AGE_CD) | 시간대코드(TIME_CD) | 구매고객수(ACC_CNT) | 구매건수(PURH_CNT) | 구매금액(PURH_AMT) | 통계청상품코드(STAT_CD) | 성별코드(SEX_CD) | |
---|---|---|---|---|---|---|---|---|
기준년월(STD_YM) | 1.000 | -0.030 | 0.072 | 0.027 | 0.006 | 0.040 | 0.022 | 0.000 |
연령대코드(AGE_CD) | -0.030 | 1.000 | -0.032 | 0.009 | -0.011 | -0.109 | 0.000 | 0.072 |
시간대코드(TIME_CD) | 0.072 | -0.032 | 1.000 | -0.021 | -0.008 | 0.005 | 0.019 | 0.073 |
구매고객수(ACC_CNT) | 0.027 | 0.009 | -0.021 | 1.000 | -0.030 | 0.006 | 0.033 | 0.005 |
구매건수(PURH_CNT) | 0.006 | -0.011 | -0.008 | -0.030 | 1.000 | 0.057 | 0.000 | 0.000 |
구매금액(PURH_AMT) | 0.040 | -0.109 | 0.005 | 0.006 | 0.057 | 1.000 | 0.055 | 0.000 |
통계청상품코드(STAT_CD) | 0.022 | 0.000 | 0.019 | 0.033 | 0.000 | 0.055 | 1.000 | 0.000 |
성별코드(SEX_CD) | 0.000 | 0.072 | 0.073 | 0.005 | 0.000 | 0.000 | 0.000 | 1.000 |
기준년월(STD_YM) | 블록코드(BLCK_CD) | 통계청상품코드(STAT_CD) | 성별코드(SEX_CD) | 연령대코드(AGE_CD) | 시간대코드(TIME_CD) | 구매고객수(ACC_CNT) | 구매건수(PURH_CNT) | 구매금액(PURH_AMT) | |
---|---|---|---|---|---|---|---|---|---|
0 | 201901 | 2*8*9* | L | 2 | 6 | 2 | 11 | 2 | 2000 |
1 | 201811 | 2*5*1* | B | 2 | 3 | 5 | 1 | 2 | 24000 |
2 | 201811 | 4*2*7 | A | 2 | 3 | 5 | 6 | 29 | 69000 |
3 | 201809 | 2*7*0* | A | 2 | 2 | 6 | 2 | 6 | 11000 |
4 | 201910 | 2*1*4* | B | 1 | 2 | 6 | 10 | 7 | 18000 |
5 | 201806 | 3*9*3* | B | 1 | 6 | 3 | 9 | 1 | 5000 |
6 | 201906 | 3*7*3* | B | 1 | 6 | 5 | 9 | 13 | 5000 |
7 | 201901 | 3*0*9* | L | 1 | 3 | 1 | 2 | 7 | 18000 |
8 | 201911 | 4*4*4* | E | 2 | 4 | 6 | 35 | 4 | 5000 |
9 | 201812 | 1*6*1 | A | 1 | 3 | 3 | 7 | 1 | 3000 |
기준년월(STD_YM) | 블록코드(BLCK_CD) | 통계청상품코드(STAT_CD) | 성별코드(SEX_CD) | 연령대코드(AGE_CD) | 시간대코드(TIME_CD) | 구매고객수(ACC_CNT) | 구매건수(PURH_CNT) | 구매금액(PURH_AMT) | |
---|---|---|---|---|---|---|---|---|---|
490 | 201905 | 3*6*1* | A | 1 | 7 | 5 | 233 | 14 | 41000 |
491 | 201811 | 2*0*0* | A | 1 | 2 | 4 | 13 | 24 | 96000 |
492 | 201808 | 3*8*6* | A | 1 | 2 | 2 | 5 | 11 | 47000 |
493 | 201907 | 2*6*0* | E | 1 | 5 | 2 | 1 | 9 | 1000 |
494 | 201807 | 4*0*0* | B | 2 | 4 | 2 | 18 | 2 | 6000 |
495 | 201907 | 1*5*0* | A | 2 | 6 | 1 | 2 | 6 | 32000 |
496 | 201912 | 3*2*3* | A | 2 | 2 | 4 | 17 | 3 | 188000 |
497 | 201801 | 3*5*6* | A | 2 | 3 | 1 | 8 | 4 | 237000 |
498 | 201901 | 1*5*7 | B | 1 | 4 | 4 | 40 | 50 | 9000 |
499 | 201909 | 2*0*8* | L | 1 | 4 | 4 | 1 | 1 | 5000 |