Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 500 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 26.0 KiB |
Average record size in memory | 53.3 B |
Variable types
Text | 1 |
---|---|
Categorical | 1 |
Numeric | 4 |
Dataset
Description | 샘플 데이터 |
---|---|
Author | 서울시, 신한카드, KCB(코리아크레딧뷰로) |
URL | https://bigdata.seoul.go.kr/data/selectSampleData.do?sample_data_seq=321 |
Reproduction
Analysis started | 2023-12-10 15:02:01.721471 |
---|---|
Analysis finished | 2023-12-10 15:02:06.749674 |
Duration | 5.03 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
서울시_블록ID(BLK_CD)
Text
Distinct | 327 |
---|---|
Distinct (%) | 65.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
Value | Count | Frequency (%) |
2*6*6 | 6 | 1.2% |
2*3*7 | 6 | 1.2% |
2*6*2 | 6 | 1.2% |
2*7*4 | 5 | 1.0% |
1*3*5 | 5 | 1.0% |
2*0*4 | 5 | 1.0% |
2*0*5 | 4 | 0.8% |
1*3*3 | 4 | 0.8% |
2*2*2 | 4 | 0.8% |
2*0*6 | 4 | 0.8% |
Other values (267) | 451 |
Most occurring characters
Value | Count | Frequency (%) |
* | 1370 | |
2 | 313 | 10.9% |
1 | 231 | 8.1% |
3 | 196 | 6.9% |
4 | 156 | 5.5% |
6 | 108 | 3.8% |
5 | 107 | 3.7% |
8 | 104 | 3.6% |
0 | 100 | 3.5% |
9 | 93 | 3.3% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 1490 | |
Other Punctuation | 1370 |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
2 | 313 | |
1 | 231 | |
3 | 196 | |
4 | 156 | |
6 | 108 | 7.2% |
5 | 107 | 7.2% |
8 | 104 | 7.0% |
0 | 100 | 6.7% |
9 | 93 | 6.2% |
7 | 82 | 5.5% |
Other Punctuation
Value | Count | Frequency (%) |
* | 1370 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 2860 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
* | 1370 | |
2 | 313 | 10.9% |
1 | 231 | 8.1% |
3 | 196 | 6.9% |
4 | 156 | 5.5% |
6 | 108 | 3.8% |
5 | 107 | 3.7% |
8 | 104 | 3.6% |
0 | 100 | 3.5% |
9 | 93 | 3.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 2860 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
* | 1370 | |
2 | 313 | 10.9% |
1 | 231 | 8.1% |
3 | 196 | 6.9% |
4 | 156 | 5.5% |
6 | 108 | 3.8% |
5 | 107 | 3.7% |
8 | 104 | 3.6% |
0 | 100 | 3.5% |
9 | 93 | 3.3% |
성별(GENDER)
Categorical
Distinct | 2 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
2 | |
---|---|
1 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2 |
---|---|
2nd row | 2 |
3rd row | 2 |
4th row | 1 |
5th row | 2 |
Common Values
Value | Count | Frequency (%) |
2 | 289 | |
1 | 211 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2 | 289 | |
1 | 211 |
연령대(AGE)
Real number (ℝ)
Distinct | 7 |
---|---|
Distinct (%) | 1.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.678 |
Minimum | 1 |
---|---|
Maximum | 7 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 3 |
median | 4 |
Q3 | 4 |
95-th percentile | 5 |
Maximum | 7 |
Range | 6 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 1.1405779 |
---|---|
Coefficient of variation (CV) | 0.31010817 |
Kurtosis | -0.58492961 |
Mean | 3.678 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 0.036313814 |
Sum | 1839 |
Variance | 1.3009178 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
4 | 175 | |
3 | 106 | |
5 | 102 | |
2 | 96 | |
6 | 16 | 3.2% |
7 | 3 | 0.6% |
1 | 2 | 0.4% |
Value | Count | Frequency (%) |
1 | 2 | 0.4% |
2 | 96 | |
3 | 106 | |
4 | 175 | |
5 | 102 | |
6 | 16 | 3.2% |
7 | 3 | 0.6% |
Value | Count | Frequency (%) |
7 | 3 | 0.6% |
6 | 16 | 3.2% |
5 | 102 | |
4 | 175 | |
3 | 106 | |
2 | 96 | |
1 | 2 | 0.4% |
학원비_인당_평균_지출액(INDEX03_AMT)
Real number (ℝ)
Distinct | 411 |
---|---|
Distinct (%) | 82.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1076646 |
Minimum | 0 |
---|---|
Maximum | 15109000 |
Zeros | 1 |
Zeros (%) | 0.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 20950 |
Q1 | 113000 |
median | 315000 |
Q3 | 1158250 |
95-th percentile | 4818750 |
Maximum | 15109000 |
Range | 15109000 |
Interquartile range (IQR) | 1045250 |
Descriptive statistics
Standard deviation | 1844792.8 |
---|---|
Coefficient of variation (CV) | 1.7134628 |
Kurtosis | 14.30725 |
Mean | 1076646 |
Median Absolute Deviation (MAD) | 271500 |
Skewness | 3.3470287 |
Sum | 5.38323 × 108 |
Variance | 3.4032606 × 1012 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
127000 | 4 | 0.8% |
4000 | 3 | 0.6% |
63000 | 3 | 0.6% |
87000 | 3 | 0.6% |
38000 | 3 | 0.6% |
66000 | 3 | 0.6% |
52000 | 3 | 0.6% |
121000 | 3 | 0.6% |
278000 | 3 | 0.6% |
17000 | 3 | 0.6% |
Other values (401) | 469 |
Value | Count | Frequency (%) |
0 | 1 | 0.2% |
1000 | 1 | 0.2% |
2000 | 1 | 0.2% |
3000 | 1 | 0.2% |
4000 | 3 | |
6000 | 1 | 0.2% |
7000 | 3 | |
8000 | 1 | 0.2% |
9000 | 1 | 0.2% |
11000 | 1 | 0.2% |
Value | Count | Frequency (%) |
15109000 | 1 | |
11428000 | 1 | |
10748000 | 1 | |
10271000 | 1 | |
9500000 | 1 | |
9267000 | 1 | |
9031000 | 1 | |
8935000 | 1 | |
8509000 | 1 | |
7689000 | 1 |
인당_평균_소득대비_평균_학원비_지출_비율(INDEX03_RT)
Real number (ℝ)
Distinct | 414 |
---|---|
Distinct (%) | 82.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.0995474 |
Minimum | 0.0003 |
---|---|
Maximum | 1.2877 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 0.0003 |
---|---|
5-th percentile | 0.003195 |
Q1 | 0.01295 |
median | 0.0377 |
Q3 | 0.103825 |
95-th percentile | 0.409735 |
Maximum | 1.2877 |
Range | 1.2874 |
Interquartile range (IQR) | 0.090875 |
Descriptive statistics
Standard deviation | 0.15641927 |
---|---|
Coefficient of variation (CV) | 1.5713044 |
Kurtosis | 15.565182 |
Mean | 0.0995474 |
Median Absolute Deviation (MAD) | 0.03155 |
Skewness | 3.3705283 |
Sum | 49.7737 |
Variance | 0.024466988 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0152 | 4 | 0.8% |
0.0077 | 4 | 0.8% |
0.0007 | 4 | 0.8% |
0.0058 | 3 | 0.6% |
0.0116 | 3 | 0.6% |
0.0048 | 3 | 0.6% |
0.0056 | 3 | 0.6% |
0.0082 | 3 | 0.6% |
0.0005 | 3 | 0.6% |
0.013 | 3 | 0.6% |
Other values (404) | 467 |
Value | Count | Frequency (%) |
0.0003 | 2 | |
0.0004 | 1 | 0.2% |
0.0005 | 3 | |
0.0007 | 4 | |
0.0008 | 1 | 0.2% |
0.001 | 1 | 0.2% |
0.0011 | 1 | 0.2% |
0.0012 | 3 | |
0.0013 | 1 | 0.2% |
0.0015 | 2 |
Value | Count | Frequency (%) |
1.2877 | 1 | |
1.1764 | 1 | |
1.0011 | 1 | |
0.877 | 1 | |
0.7363 | 1 | |
0.6917 | 1 | |
0.6706 | 1 | |
0.6553 | 1 | |
0.6551 | 1 | |
0.6111 | 1 |
학원비지수(INDEX03)
Real number (ℝ)
Distinct | 9 |
---|---|
Distinct (%) | 1.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.854 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 5 |
Q3 | 7 |
95-th percentile | 9 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 2.5988829 |
---|---|
Coefficient of variation (CV) | 0.53541057 |
Kurtosis | -1.2230164 |
Mean | 4.854 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 0.098436179 |
Sum | 2427 |
Variance | 6.7541924 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 62 | |
4 | 60 | |
1 | 59 | |
3 | 58 | |
6 | 57 | |
9 | 57 | |
7 | 51 | |
5 | 50 | |
8 | 46 |
Value | Count | Frequency (%) |
1 | 59 | |
2 | 62 | |
3 | 58 | |
4 | 60 | |
5 | 50 | |
6 | 57 | |
7 | 51 | |
8 | 46 | |
9 | 57 |
Value | Count | Frequency (%) |
9 | 57 | |
8 | 46 | |
7 | 51 | |
6 | 57 | |
5 | 50 | |
4 | 60 | |
3 | 58 | |
2 | 62 | |
1 | 59 |
성별(GENDER) | 연령대(AGE) | 학원비_인당_평균_지출액(INDEX03_AMT) | 인당_평균_소득대비_평균_학원비_지출_비율(INDEX03_RT) | 학원비지수(INDEX03) | |
---|---|---|---|---|---|
성별(GENDER) | 1.000 | 0.045 | 0.000 | 0.010 | 0.139 |
연령대(AGE) | 0.045 | 1.000 | 0.000 | 0.000 | 0.075 |
학원비_인당_평균_지출액(INDEX03_AMT) | 0.000 | 0.000 | 1.000 | 0.275 | 0.079 |
인당_평균_소득대비_평균_학원비_지출_비율(INDEX03_RT) | 0.010 | 0.000 | 0.275 | 1.000 | 0.000 |
학원비지수(INDEX03) | 0.139 | 0.075 | 0.079 | 0.000 | 1.000 |
연령대(AGE) | 학원비_인당_평균_지출액(INDEX03_AMT) | 인당_평균_소득대비_평균_학원비_지출_비율(INDEX03_RT) | 학원비지수(INDEX03) | 성별(GENDER) | |
---|---|---|---|---|---|
연령대(AGE) | 1.000 | 0.058 | -0.035 | 0.045 | 0.048 |
학원비_인당_평균_지출액(INDEX03_AMT) | 0.058 | 1.000 | 0.062 | 0.015 | 0.000 |
인당_평균_소득대비_평균_학원비_지출_비율(INDEX03_RT) | -0.035 | 0.062 | 1.000 | -0.123 | 0.008 |
학원비지수(INDEX03) | 0.045 | 0.015 | -0.123 | 1.000 | 0.138 |
성별(GENDER) | 0.048 | 0.000 | 0.008 | 0.138 | 1.000 |
서울시_블록ID(BLK_CD) | 성별(GENDER) | 연령대(AGE) | 학원비_인당_평균_지출액(INDEX03_AMT) | 인당_평균_소득대비_평균_학원비_지출_비율(INDEX03_RT) | 학원비지수(INDEX03) | |
---|---|---|---|---|---|---|
0 | 2*0*7* | 2 | 3 | 504000 | 0.0031 | 8 |
1 | 5*2*1* | 2 | 2 | 242000 | 0.0817 | 5 |
2 | 4*8*6* | 2 | 3 | 4995000 | 0.0712 | 6 |
3 | 5*2*8* | 1 | 5 | 149000 | 0.0054 | 2 |
4 | 1*6*2* | 2 | 5 | 127000 | 0.0457 | 9 |
5 | 3*4*8* | 2 | 7 | 115000 | 0.0008 | 8 |
6 | 4*7*0* | 2 | 4 | 2181000 | 0.0701 | 2 |
7 | 2*0*4* | 1 | 4 | 1334000 | 0.0092 | 4 |
8 | 2*9*1* | 2 | 4 | 101000 | 0.0777 | 4 |
9 | 2*1*4* | 2 | 3 | 148000 | 0.0033 | 9 |
서울시_블록ID(BLK_CD) | 성별(GENDER) | 연령대(AGE) | 학원비_인당_평균_지출액(INDEX03_AMT) | 인당_평균_소득대비_평균_학원비_지출_비율(INDEX03_RT) | 학원비지수(INDEX03) | |
---|---|---|---|---|---|---|
490 | 1*3*7 | 1 | 3 | 4044000 | 0.1061 | 3 |
491 | 2*4*6* | 2 | 2 | 646000 | 0.0847 | 5 |
492 | 2*1*9 | 2 | 4 | 8935000 | 0.0771 | 6 |
493 | 3*5*8* | 2 | 4 | 328000 | 0.0495 | 9 |
494 | 2*4*1* | 1 | 4 | 7000 | 0.1084 | 2 |
495 | 3*6*9* | 1 | 1 | 2160000 | 0.489 | 1 |
496 | 2*2*4* | 2 | 2 | 17000 | 0.0918 | 6 |
497 | 3*3*0* | 2 | 5 | 3050000 | 0.1346 | 8 |
498 | 3*0*8* | 1 | 5 | 195000 | 0.0267 | 6 |
499 | 2*1*2* | 1 | 2 | 1903000 | 0.0136 | 9 |