Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 500 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 38.7 KiB |
Average record size in memory | 79.3 B |
Variable types
Numeric | 6 |
---|---|
Text | 1 |
Categorical | 2 |
Dataset
Description | Sample data |
---|---|
Author | Seoul Metropolitan Government (Credit Guarantee Foundation) |
URL | https://bigdata.seoul.go.kr/data/selectSampleData.do?sample_data_seq=324 |
Reproduction
Analysis started | 2024-04-16 19:17:48.311118 |
---|---|
Analysis finished | 2024-04-16 19:17:53.964341 |
Duration | 5.65 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
Reference year-month (STD_YM)
Real number (ℝ)
Distinct | 24 |
---|---|
Distinct (%) | 4.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 201856.14 |
Minimum | 201801 |
---|---|
Maximum | 201912 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 201801 |
---|---|
5-th percentile | 201802 |
Q1 | 201806 |
median | 201812 |
Q3 | 201906 |
95-th percentile | 201911 |
Maximum | 201912 |
Range | 111 |
Interquartile range (IQR) | 100 |
Descriptive statistics
Standard deviation | 50.265497 |
---|---|
Coefficient of variation (CV) | 0.00024901644 |
Kurtosis | -1.9888102 |
Mean | 201856.14 |
Median Absolute Deviation (MAD) | 11 |
Skewness | 0.0082646058 |
Sum | 1.0092807 × 10⁸ |
Variance | 2526.6202 |
Monotonicity | Not monotonic |
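Several of the figures above are tied together by simple identities, which gives a quick way to sanity-check the extracted numbers. The sketch below uses only values reported for STD_YM in this section; the variable names are illustrative, and it assumes the usual definitions (CV = standard deviation / mean, variance = standard deviation squared, sum = mean × number of observations).

```python
import math

# Values copied from the STD_YM tables above; the names are illustrative only.
n = 500
mean = 201856.14
std = 50.265497
minimum, maximum = 201801, 201912
q1, q3 = 201806, 201906

# Quantities recomputed from the inputs above.
derived = {
    "range": maximum - minimum,
    "iqr": q3 - q1,
    "cv": std / mean,
    "variance": std ** 2,
    "sum": mean * n,
}

# Quantities as printed in the report.
reported = {
    "range": 111,
    "iqr": 100,
    "cv": 0.00024901644,
    "variance": 2526.6202,
    "sum": 1.0092807e8,
}

for name, value in derived.items():
    ok = math.isclose(value, reported[name], rel_tol=1e-6)
    print(f"{name}: derived={value:.8g} reported={reported[name]:.8g} match={ok}")
```

The relative tolerance allows for the report rounding its figures to about eight significant digits.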
Value | Count | Frequency (%) |
201906 | 28 | 5.6% |
201810 | 27 | 5.4% |
201910 | 25 | 5.0% |
201803 | 25 | 5.0% |
201903 | 24 | 4.8% |
201802 | 23 | 4.6% |
201804 | 23 | 4.6% |
201909 | 23 | 4.6% |
201806 | 23 | 4.6% |
201901 | 23 | 4.6% |
Other values (14) | 256 | 51.2% |
Value | Count | Frequency (%) |
201801 | 21 | 4.2% |
201802 | 23 | 4.6% |
201803 | 25 | 5.0% |
201804 | 23 | 4.6% |
201805 | 22 | 4.4% |
201806 | 23 | 4.6% |
201807 | 21 | 4.2% |
201808 | 16 | 3.2% |
201809 | 16 | 3.2% |
201810 | 27 | 5.4% |
Value | Count | Frequency (%) |
201912 | 21 | 4.2% |
201911 | 18 | 3.6% |
201910 | 25 | 5.0% |
201909 | 23 | 4.6% |
201908 | 14 | 2.8% |
201907 | 19 | 3.8% |
201906 | 28 | 5.6% |
201905 | 13 | 2.6% |
201904 | 20 | 4.0% |
201903 | 24 | 4.8% |
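The STD_YM values run from 201801 to 201912 with 24 distinct values, which suggests they are YYYYMM codes rather than a continuous quantity. That reading is an assumption; the report itself only types the column as a real number. Under it, the reported range (111) and IQR (100) are differences between codes, not month counts. A minimal sketch of converting the codes to calendar spans, with hypothetical helper names:

```python
def yyyymm_to_month_index(code: int) -> int:
    """Convert a YYYYMM integer (assumed encoding) to a running month count."""
    year, month = divmod(code, 100)
    if not 1 <= month <= 12:
        raise ValueError(f"not a valid YYYYMM code: {code}")
    return year * 12 + (month - 1)


def months_between(start: int, end: int) -> int:
    """Number of calendar months from start to end, both given as YYYYMM."""
    return yyyymm_to_month_index(end) - yyyymm_to_month_index(start)


# Minimum to maximum, and Q1 to Q3, using the values reported above.
print(months_between(201801, 201912))
print(months_between(201806, 201906))
```

With this conversion the minimum-to-maximum span is 23 months and the Q1-to-Q3 span is 12 months.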
Block code (BLCK_CD)
Text
Distinct | 332 |
---|---|
Distinct (%) | 66.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
Value | Count | Frequency (%) |
4*9*4 | 5 | 1.0% |
2*9*4 | 5 | 1.0% |
2*7*5 | 5 | 1.0% |
2*2*3 | 5 | 1.0% |
2*0*2 | 5 | 1.0% |
2*1*2 | 4 | 0.8% |
1*4*1 | 4 | 0.8% |
2*1*3 | 4 | 0.8% |
2*2*2 | 4 | 0.8% |
1*4*4 | 4 | 0.8% |
Other values (275) | 455 | 91.0% |
Most occurring characters
Value | Count | Frequency (%) |
* | 1378 | 48.1% |
2 | 300 | 10.5% |
1 | 234 | 8.2% |
3 | 202 | 7.1% |
4 | 188 | 6.6% |
9 | 108 | 3.8% |
5 | 102 | 3.6% |
8 | 92 | 3.2% |
0 | 88 | 3.1% |
7 | 87 | 3.0% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 1484 | 51.9% |
Other Punctuation | 1378 | 48.1% |
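The category names in this table match Unicode general categories: "Decimal Number" is `Nd` and "Other Punctuation" is `Po`. The sketch below shows one way such counts could be produced with the Python standard library. It is not taken from the profiling tool's source; the function name is hypothetical, and the three block codes are copied from the sample rows of this report.

```python
import unicodedata
from collections import Counter

# Only the two general categories that occur in BLCK_CD are given long names here.
CATEGORY_NAMES = {"Nd": "Decimal Number", "Po": "Other Punctuation"}


def char_and_category_counts(values):
    """Count individual characters and their Unicode general categories."""
    chars = Counter()
    categories = Counter()
    for value in values:
        for ch in value:
            chars[ch] += 1
            code = unicodedata.category(ch)
            categories[CATEGORY_NAMES.get(code, code)] += 1
    return chars, categories


# Three masked block codes taken from the first rows of the dataset.
sample_codes = ["2*1*2*", "3*5*8*", "4*9*0*"]
chars, categories = char_and_category_counts(sample_codes)
print(chars.most_common(3))
print(dict(categories))
```

Dividing each count by the total number of characters gives the Frequency (%) column.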
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
2 | 300 | 20.2% |
1 | 234 | 15.8% |
3 | 202 | 13.6% |
4 | 188 | 12.7% |
9 | 108 | 7.3% |
5 | 102 | 6.9% |
8 | 92 | 6.2% |
0 | 88 | 5.9% |
7 | 87 | 5.9% |
6 | 83 | 5.6% |
Other Punctuation
Value | Count | Frequency (%) |
* | 1378 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 2862 | 100.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
* | 1378 | 48.1% |
2 | 300 | 10.5% |
1 | 234 | 8.2% |
3 | 202 | 7.1% |
4 | 188 | 6.6% |
9 | 108 | 3.8% |
5 | 102 | 3.6% |
8 | 92 | 3.2% |
0 | 88 | 3.1% |
7 | 87 | 3.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 2862 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
* | 1378 | 48.1% |
2 | 300 | 10.5% |
1 | 234 | 8.2% |
3 | 202 | 7.1% |
4 | 188 | 6.6% |
9 | 108 | 3.8% |
5 | 102 | 3.6% |
8 | 92 | 3.2% |
0 | 88 | 3.1% |
7 | 87 | 3.0% |
Statistics Korea product code (STAT_CD)
Categorical
Distinct | 9 |
---|---|
Distinct (%) | 1.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
A | 210 |
---|---|
E | 105 |
B | 83 |
L | 53 |
I | 17 |
Other values (4) | 32 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 1 |
---|---|
Unique (%) | 0.2% |
Sample
1st row | A |
---|---|
2nd row | B |
3rd row | A |
4th row | L |
5th row | A |
Common Values
Value | Count | Frequency (%) |
A | 210 | 42.0% |
E | 105 | 21.0% |
B | 83 | 16.6% |
L | 53 | 10.6% |
I | 17 | 3.4% |
J | 16 | 3.2% |
C | 10 | 2.0% |
G | 5 | 1.0% |
F | 1 | 0.2% |
Common Values (Plot)
Value | Count | Frequency (%) |
a | 210 | 42.0% |
e | 105 | 21.0% |
b | 83 | 16.6% |
l | 53 | 10.6% |
i | 17 | 3.4% |
j | 16 | 3.2% |
c | 10 | 2.0% |
g | 5 | 1.0% |
f | 1 | 0.2% |
Sex code (SEX_CD)
Categorical
Distinct | 2 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
2 | 343 |
---|---|
1 | 157 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2 |
---|---|
2nd row | 2 |
3rd row | 2 |
4th row | 2 |
5th row | 2 |
Common Values
Value | Count | Frequency (%) |
2 | 343 | 68.6% |
1 | 157 | 31.4% |
Common Values (Plot)
Value | Count | Frequency (%) |
2 | 343 | 68.6% |
1 | 157 | 31.4% |
Age group code (AGE_CD)
Real number (ℝ)
Distinct | 7 |
---|---|
Distinct (%) | 1.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.146 |
Minimum | 1 |
---|---|
Maximum | 7 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 3 |
median | 4 |
Q3 | 5 |
95-th percentile | 7 |
Maximum | 7 |
Range | 6 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 1.3756704 |
---|---|
Coefficient of variation (CV) | 0.33180665 |
Kurtosis | -0.63200686 |
Mean | 4.146 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 0.29200102 |
Sum | 2073 |
Variance | 1.8924689 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
4 | 135 | 27.0% |
3 | 121 | 24.2% |
5 | 99 | 19.8% |
6 | 59 | 11.8% |
2 | 55 | 11.0% |
7 | 30 | 6.0% |
1 | 1 | 0.2% |
Value | Count | Frequency (%) |
1 | 1 | 0.2% |
2 | 55 | 11.0% |
3 | 121 | 24.2% |
4 | 135 | 27.0% |
5 | 99 | 19.8% |
6 | 59 | 11.8% |
7 | 30 | 6.0% |
Value | Count | Frequency (%) |
7 | 30 | 6.0% |
6 | 59 | 11.8% |
5 | 99 | 19.8% |
4 | 135 | 27.0% |
3 | 121 | 24.2% |
2 | 55 | 11.0% |
1 | 1 | 0.2% |
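Because AGE_CD has only seven distinct values, its frequency table fully determines the summary statistics reported above. The sketch below recomputes them from the counts. It assumes the report uses the sample variance (denominator n − 1); the reported figure is consistent with that, but the convention is not stated in this section.

```python
import math

# Frequency table for AGE_CD copied from the report: value -> count.
age_counts = {1: 1, 2: 55, 3: 121, 4: 135, 5: 99, 6: 59, 7: 30}

n = sum(age_counts.values())
total = sum(value * count for value, count in age_counts.items())
mean = total / n

# Sample variance with n - 1 in the denominator (assumed convention).
variance = sum(count * (value - mean) ** 2 for value, count in age_counts.items()) / (n - 1)
std = math.sqrt(variance)

print(f"n={n} sum={total} mean={mean:.3f} variance={variance:.7f} std={std:.7f}")
```

The recomputed observation count, sum, and mean agree with the reported 500, 2073, and 4.146, and the variance and standard deviation agree with the reported values to within rounding.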
Time slot code (TIME_CD)
Real number (ℝ)
Distinct | 6 |
---|---|
Distinct (%) | 1.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.226 |
Minimum | 1 |
---|---|
Maximum | 6 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 3 |
median | 4 |
Q3 | 5 |
95-th percentile | 6 |
Maximum | 6 |
Range | 5 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 1.1686126 |
---|---|
Coefficient of variation (CV) | 0.27652924 |
Kurtosis | -0.43670259 |
Mean | 4.226 |
Median Absolute Deviation (MAD) | 1 |
Skewness | -0.34141426 |
Sum | 2113 |
Variance | 1.3656553 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
5 | 159 | 31.8% |
4 | 131 | 26.2% |
3 | 109 | 21.8% |
6 | 68 | 13.6% |
2 | 26 | 5.2% |
1 | 7 | 1.4% |
Value | Count | Frequency (%) |
1 | 7 | 1.4% |
2 | 26 | 5.2% |
3 | 109 | 21.8% |
4 | 131 | 26.2% |
5 | 159 | 31.8% |
6 | 68 | 13.6% |
Value | Count | Frequency (%) |
6 | 68 | 13.6% |
5 | 159 | 31.8% |
4 | 131 | 26.2% |
3 | 109 | 21.8% |
2 | 26 | 5.2% |
1 | 7 | 1.4% |
Number of purchasing customers (ACC_CNT)
Real number (ℝ)
Distinct | 41 |
---|---|
Distinct (%) | 8.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6.106 |
Minimum | 1 |
---|---|
Maximum | 151 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 2 |
Q3 | 6 |
95-th percentile | 23 |
Maximum | 151 |
Range | 150 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 11.02442 |
---|---|
Coefficient of variation (CV) | 1.8055061 |
Kurtosis | 63.695516 |
Mean | 6.106 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 6.1656901 |
Sum | 3053 |
Variance | 121.53784 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 184 | 36.8% |
2 | 90 | 18.0% |
3 | 55 | 11.0% |
5 | 22 | 4.4% |
4 | 20 | 4.0% |
7 | 12 | 2.4% |
9 | 9 | 1.8% |
11 | 9 | 1.8% |
6 | 9 | 1.8% |
10 | 8 | 1.6% |
Other values (31) | 82 | 16.4% |
Value | Count | Frequency (%) |
1 | 184 | 36.8% |
2 | 90 | 18.0% |
3 | 55 | 11.0% |
4 | 20 | 4.0% |
5 | 22 | 4.4% |
6 | 9 | 1.8% |
7 | 12 | 2.4% |
8 | 7 | 1.4% |
9 | 9 | 1.8% |
10 | 8 | 1.6% |
Value | Count | Frequency (%) |
151 | 1 | 0.2% |
65 | 1 | 0.2% |
62 | 1 | 0.2% |
52 | 1 | 0.2% |
46 | 1 | 0.2% |
44 | 2 | 0.4% |
43 | 1 | 0.2% |
41 | 1 | 0.2% |
40 | 1 | 0.2% |
39 | 2 | 0.4% |
Number of purchases (PURH_CNT)
Real number (ℝ)
Distinct | 54 |
---|---|
Distinct (%) | 10.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 8.876 |
Minimum | 1 |
---|---|
Maximum | 390 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 3 |
Q3 | 8 |
95-th percentile | 33.05 |
Maximum | 390 |
Range | 389 |
Interquartile range (IQR) | 7 |
Descriptive statistics
Standard deviation | 22.537802 |
---|---|
Coefficient of variation (CV) | 2.5391846 |
Kurtosis | 168.58853 |
Mean | 8.876 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 10.983416 |
Sum | 4438 |
Variance | 507.95253 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 130 | 26.0% |
2 | 101 | 20.2% |
3 | 52 | 10.4% |
4 | 27 | 5.4% |
5 | 25 | 5.0% |
9 | 17 | 3.4% |
6 | 15 | 3.0% |
7 | 14 | 2.8% |
8 | 12 | 2.4% |
12 | 9 | 1.8% |
Other values (44) | 98 | 19.6% |
Value | Count | Frequency (%) |
1 | 130 | 26.0% |
2 | 101 | 20.2% |
3 | 52 | 10.4% |
4 | 27 | 5.4% |
5 | 25 | 5.0% |
6 | 15 | 3.0% |
7 | 14 | 2.8% |
8 | 12 | 2.4% |
9 | 17 | 3.4% |
10 | 6 | 1.2% |
Value | Count | Frequency (%) |
390 | 1 | 0.2% |
151 | 1 | 0.2% |
110 | 1 | 0.2% |
104 | 1 | 0.2% |
78 | 1 | 0.2% |
77 | 1 | 0.2% |
74 | 1 | 0.2% |
73 | 1 | 0.2% |
71 | 1 | 0.2% |
62 | 1 | 0.2% |
Purchase amount (PURH_AMT)
Real number (ℝ)
Distinct | 143 |
---|---|
Distinct (%) | 28.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 48250 |
Minimum | 1000 |
---|---|
Maximum | 1244000 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 1000 |
---|---|
5-th percentile | 2000 |
Q1 | 6000 |
median | 17000 |
Q3 | 54000 |
95-th percentile | 195050 |
Maximum | 1244000 |
Range | 1243000 |
Interquartile range (IQR) | 48000 |
Descriptive statistics
Standard deviation | 92966.279 |
---|---|
Coefficient of variation (CV) | 1.9267623 |
Kurtosis | 67.884392 |
Mean | 48250 |
Median Absolute Deviation (MAD) | 13000 |
Skewness | 6.6770884 |
Sum | 24125000 |
Variance | 8.642729 × 10⁹ |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
5000 | 29 | 5.8% |
4000 | 28 | 5.6% |
1000 | 24 | 4.8% |
2000 | 22 | 4.4% |
10000 | 21 | 4.2% |
3000 | 17 | 3.4% |
6000 | 16 | 3.2% |
9000 | 16 | 3.2% |
8000 | 15 | 3.0% |
18000 | 12 | 2.4% |
Other values (133) | 300 | 60.0% |
Value | Count | Frequency (%) |
1000 | 24 | 4.8% |
2000 | 22 | 4.4% |
3000 | 17 | 3.4% |
4000 | 28 | 5.6% |
5000 | 29 | 5.8% |
6000 | 16 | 3.2% |
7000 | 9 | 1.8% |
8000 | 15 | 3.0% |
9000 | 16 | 3.2% |
10000 | 21 | 4.2% |
Value | Count | Frequency (%) |
1244000 | 1 | 0.2% |
825000 | 1 | 0.2% |
652000 | 1 | 0.2% |
395000 | 1 | 0.2% |
361000 | 1 | 0.2% |
359000 | 1 | 0.2% |
334000 | 1 | 0.2% |
312000 | 1 | 0.2% |
281000 | 1 | 0.2% |
272000 | 1 | 0.2% |
Variable | Reference year-month (STD_YM) | Statistics Korea product code (STAT_CD) | Sex code (SEX_CD) | Age group code (AGE_CD) | Time slot code (TIME_CD) | Number of purchasing customers (ACC_CNT) | Number of purchases (PURH_CNT) | Purchase amount (PURH_AMT) |
---|---|---|---|---|---|---|---|---|
Reference year-month (STD_YM) | 1.000 | 0.038 | 0.030 | 0.024 | 0.013 | 0.000 | 0.000 | 0.000 |
Statistics Korea product code (STAT_CD) | 0.038 | 1.000 | 0.000 | 0.081 | 0.000 | 0.297 | 0.000 | 0.207 |
Sex code (SEX_CD) | 0.030 | 0.000 | 1.000 | 0.083 | 0.000 | 0.066 | 0.024 | 0.054 |
Age group code (AGE_CD) | 0.024 | 0.081 | 0.083 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 |
Time slot code (TIME_CD) | 0.013 | 0.000 | 0.000 | 0.000 | 1.000 | 0.083 | 0.000 | 0.053 |
Number of purchasing customers (ACC_CNT) | 0.000 | 0.297 | 0.066 | 0.000 | 0.083 | 1.000 | 0.000 | 0.000 |
Number of purchases (PURH_CNT) | 0.000 | 0.000 | 0.024 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 |
Purchase amount (PURH_AMT) | 0.000 | 0.207 | 0.054 | 0.000 | 0.053 | 0.000 | 0.000 | 1.000 |
Variable | Statistics Korea product code (STAT_CD) | Sex code (SEX_CD) |
---|---|---|
Statistics Korea product code (STAT_CD) | 1.000 | 0.000 |
Sex code (SEX_CD) | 0.000 | 1.000 |
Variable | Reference year-month (STD_YM) | Age group code (AGE_CD) | Time slot code (TIME_CD) | Number of purchasing customers (ACC_CNT) | Number of purchases (PURH_CNT) | Purchase amount (PURH_AMT) | Statistics Korea product code (STAT_CD) | Sex code (SEX_CD) |
---|---|---|---|---|---|---|---|---|
Reference year-month (STD_YM) | 1.000 | 0.056 | -0.009 | 0.084 | 0.020 | 0.046 | 0.034 | 0.000 |
Age group code (AGE_CD) | 0.056 | 1.000 | 0.052 | -0.021 | -0.027 | -0.022 | 0.042 | 0.088 |
Time slot code (TIME_CD) | -0.009 | 0.052 | 1.000 | 0.025 | 0.019 | -0.016 | 0.000 | 0.000 |
Number of purchasing customers (ACC_CNT) | 0.084 | -0.021 | 0.025 | 1.000 | 0.015 | 0.001 | 0.179 | 0.080 |
Number of purchases (PURH_CNT) | 0.020 | -0.027 | 0.019 | 0.015 | 1.000 | -0.063 | 0.000 | 0.029 |
Purchase amount (PURH_AMT) | 0.046 | -0.022 | -0.016 | 0.001 | -0.063 | 1.000 | 0.112 | 0.058 |
Statistics Korea product code (STAT_CD) | 0.034 | 0.042 | 0.000 | 0.179 | 0.000 | 0.112 | 1.000 | 0.000 |
Sex code (SEX_CD) | 0.000 | 0.088 | 0.000 | 0.080 | 0.029 | 0.058 | 0.000 | 1.000 |
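Row | Reference year-month (STD_YM) | Block code (BLCK_CD) | Statistics Korea product code (STAT_CD) | Sex code (SEX_CD) | Age group code (AGE_CD) | Time slot code (TIME_CD) | Number of purchasing customers (ACC_CNT) | Number of purchases (PURH_CNT) | Purchase amount (PURH_AMT) |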
---|---|---|---|---|---|---|---|---|---|
0 | 201910 | 2*1*2* | A | 2 | 4 | 3 | 25 | 2 | 15000 |
1 | 201905 | 3*5*8* | B | 2 | 4 | 2 | 1 | 2 | 14000 |
2 | 201906 | 4*9*0* | A | 2 | 4 | 4 | 22 | 2 | 42000 |
3 | 201809 | 2*4*8 | L | 2 | 4 | 3 | 23 | 3 | 20000 |
4 | 201810 | 4*1*0* | A | 2 | 3 | 4 | 2 | 8 | 2000 |
5 | 201802 | 2*1*2* | B | 1 | 4 | 6 | 1 | 1 | 29000 |
6 | 201801 | 2*9*1 | A | 2 | 6 | 6 | 1 | 1 | 4000 |
7 | 201804 | 1*8*4* | E | 1 | 6 | 5 | 1 | 9 | 123000 |
8 | 201808 | 3*3*7* | A | 2 | 3 | 3 | 2 | 3 | 17000 |
9 | 201812 | 1*9*0 | I | 1 | 3 | 6 | 2 | 4 | 15000 |
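Row | Reference year-month (STD_YM) | Block code (BLCK_CD) | Statistics Korea product code (STAT_CD) | Sex code (SEX_CD) | Age group code (AGE_CD) | Time slot code (TIME_CD) | Number of purchasing customers (ACC_CNT) | Number of purchases (PURH_CNT) | Purchase amount (PURH_AMT) |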
---|---|---|---|---|---|---|---|---|---|
490 | 201906 | 4*9*4* | A | 2 | 6 | 6 | 5 | 9 | 53000 |
491 | 201903 | 2*9*7* | B | 2 | 3 | 5 | 15 | 390 | 10000 |
492 | 201906 | 2*0*8 | A | 1 | 5 | 5 | 13 | 16 | 22000 |
493 | 201905 | 1*3*7* | A | 2 | 6 | 6 | 2 | 1 | 140000 |
494 | 201801 | 2*2*8* | A | 1 | 3 | 3 | 5 | 5 | 100000 |
495 | 201810 | 2*0*4* | B | 2 | 3 | 5 | 3 | 24 | 16000 |
496 | 201905 | 4*8*1* | A | 2 | 6 | 5 | 1 | 3 | 21000 |
497 | 201910 | 9*5* | A | 2 | 7 | 5 | 10 | 8 | 4000 |
498 | 201903 | 2*6*9* | A | 2 | 4 | 3 | 1 | 1 | 116000 |
499 | 201909 | 3*8*4* | A | 2 | 3 | 5 | 3 | 3 | 49000 |