Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 500 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 26.0 KiB |
Average record size in memory | 53.3 B |
Variable types
Text | 1 |
---|---|
Categorical | 1 |
Numeric | 4 |
Dataset
Description | 샘플 데이터 |
---|---|
Author | 서울시, 신한카드, KCB(코리아크레딧뷰로) |
URL | https://bigdata.seoul.go.kr/data/selectSampleData.do?sample_data_seq=321 |
자동차_보유자수(INDEX04_CNT2) has 125 (25.0%) zeros | Zeros |
자가용이용_지수(INDEX04) has 122 (24.4%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-10 15:01:52.368688 |
---|---|
Analysis finished | 2023-12-10 15:01:56.061750 |
Duration | 3.69 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
서울시_블록ID(BLK_CD)
Text
Distinct | 321 |
---|---|
Distinct (%) | 64.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
Value | Count | Frequency (%) |
2*2*7 | 7 | 1.4% |
2*2*2 | 5 | 1.0% |
3*5*2 | 5 | 1.0% |
1*2*9 | 5 | 1.0% |
2*2*8 | 5 | 1.0% |
2*3*3 | 5 | 1.0% |
2*0*9 | 5 | 1.0% |
2*9*3 | 5 | 1.0% |
2*2*1 | 5 | 1.0% |
2*1*9 | 4 | 0.8% |
Other values (249) | 449 |
Most occurring characters
Value | Count | Frequency (%) |
* | 1368 | |
2 | 347 | 12.1% |
1 | 246 | 8.6% |
3 | 190 | 6.6% |
4 | 150 | 5.2% |
7 | 99 | 3.5% |
5 | 98 | 3.4% |
6 | 94 | 3.3% |
8 | 92 | 3.2% |
0 | 90 | 3.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 1494 | |
Other Punctuation | 1368 |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
2 | 347 | |
1 | 246 | |
3 | 190 | |
4 | 150 | |
7 | 99 | 6.6% |
5 | 98 | 6.6% |
6 | 94 | 6.3% |
8 | 92 | 6.2% |
0 | 90 | 6.0% |
9 | 88 | 5.9% |
Other Punctuation
Value | Count | Frequency (%) |
* | 1368 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 2862 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
* | 1368 | |
2 | 347 | 12.1% |
1 | 246 | 8.6% |
3 | 190 | 6.6% |
4 | 150 | 5.2% |
7 | 99 | 3.5% |
5 | 98 | 3.4% |
6 | 94 | 3.3% |
8 | 92 | 3.2% |
0 | 90 | 3.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 2862 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
* | 1368 | |
2 | 347 | 12.1% |
1 | 246 | 8.6% |
3 | 190 | 6.6% |
4 | 150 | 5.2% |
7 | 99 | 3.5% |
5 | 98 | 3.4% |
6 | 94 | 3.3% |
8 | 92 | 3.2% |
0 | 90 | 3.1% |
성별(GENDER)
Categorical
Distinct | 2 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
1 | |
---|---|
2 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 2 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 287 | |
2 | 213 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1 | 287 | |
2 | 213 |
연령대(AGE)
Real number (ℝ)
Distinct | 7 |
---|---|
Distinct (%) | 1.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.206 |
Minimum | 1 |
---|---|
Maximum | 7 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 3 |
median | 4 |
Q3 | 5 |
95-th percentile | 7 |
Maximum | 7 |
Range | 6 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 1.3825291 |
---|---|
Coefficient of variation (CV) | 0.32870402 |
Kurtosis | -0.74738805 |
Mean | 4.206 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 0.15623462 |
Sum | 2103 |
Variance | 1.9113868 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
5 | 120 | |
4 | 117 | |
3 | 114 | |
6 | 64 | |
2 | 56 | |
7 | 28 | 5.6% |
1 | 1 | 0.2% |
Value | Count | Frequency (%) |
1 | 1 | 0.2% |
2 | 56 | |
3 | 114 | |
4 | 117 | |
5 | 120 | |
6 | 64 | |
7 | 28 | 5.6% |
Value | Count | Frequency (%) |
7 | 28 | 5.6% |
6 | 64 | |
5 | 120 | |
4 | 117 | |
3 | 114 | |
2 | 56 | |
1 | 1 | 0.2% |
주유소_결재(지출)건수(INDEX04_CNT)
Real number (ℝ)
Distinct | 86 |
---|---|
Distinct (%) | 17.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 72.578 |
Minimum | 5 |
---|---|
Maximum | 981 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 5 |
---|---|
5-th percentile | 5 |
Q1 | 15 |
median | 45 |
Q3 | 96 |
95-th percentile | 251.55 |
Maximum | 981 |
Range | 976 |
Interquartile range (IQR) | 81 |
Descriptive statistics
Standard deviation | 89.48501 |
---|---|
Coefficient of variation (CV) | 1.2329495 |
Kurtosis | 23.989135 |
Mean | 72.578 |
Median Absolute Deviation (MAD) | 35 |
Skewness | 3.6053462 |
Sum | 36289 |
Variance | 8007.5671 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
5 | 62 | 12.4% |
10 | 35 | 7.0% |
15 | 32 | 6.4% |
20 | 30 | 6.0% |
25 | 25 | 5.0% |
45 | 24 | 4.8% |
35 | 23 | 4.6% |
30 | 22 | 4.4% |
55 | 15 | 3.0% |
40 | 14 | 2.8% |
Other values (76) | 218 |
Value | Count | Frequency (%) |
5 | 62 | |
10 | 35 | |
15 | 32 | |
20 | 30 | |
25 | 25 | |
30 | 22 | 4.4% |
35 | 23 | 4.6% |
40 | 14 | 2.8% |
41 | 1 | 0.2% |
45 | 24 | 4.8% |
Value | Count | Frequency (%) |
981 | 1 | |
548 | 1 | |
433 | 1 | |
406 | 1 | |
388 | 1 | |
382 | 1 | |
362 | 1 | |
357 | 1 | |
352 | 1 | |
332 | 1 |
자동차_보유자수(INDEX04_CNT2)
Real number (ℝ)
ZEROS
 
Distinct | 11 |
---|---|
Distinct (%) | 2.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.652 |
Minimum | 0 |
---|---|
Maximum | 11 |
Zeros | 125 |
Zeros (%) | 25.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0.75 |
median | 1 |
Q3 | 2 |
95-th percentile | 5 |
Maximum | 11 |
Range | 11 |
Interquartile range (IQR) | 1.25 |
Descriptive statistics
Standard deviation | 1.7183718 |
---|---|
Coefficient of variation (CV) | 1.0401766 |
Kurtosis | 3.5787407 |
Mean | 1.652 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 1.6723553 |
Sum | 826 |
Variance | 2.9528016 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 179 | |
0 | 125 | |
2 | 88 | |
3 | 38 | 7.6% |
4 | 32 | 6.4% |
5 | 17 | 3.4% |
6 | 14 | 2.8% |
8 | 3 | 0.6% |
9 | 2 | 0.4% |
7 | 1 | 0.2% |
Value | Count | Frequency (%) |
0 | 125 | |
1 | 179 | |
2 | 88 | |
3 | 38 | 7.6% |
4 | 32 | 6.4% |
5 | 17 | 3.4% |
6 | 14 | 2.8% |
7 | 1 | 0.2% |
8 | 3 | 0.6% |
9 | 2 | 0.4% |
Value | Count | Frequency (%) |
11 | 1 | 0.2% |
9 | 2 | 0.4% |
8 | 3 | 0.6% |
7 | 1 | 0.2% |
6 | 14 | 2.8% |
5 | 17 | 3.4% |
4 | 32 | 6.4% |
3 | 38 | 7.6% |
2 | 88 | |
1 | 179 |
자가용이용_지수(INDEX04)
Real number (ℝ)
ZEROS
 
Distinct | 10 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.112 |
Minimum | 0 |
---|---|
Maximum | 9 |
Zeros | 122 |
Zeros (%) | 24.4% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 1 |
median | 2 |
Q3 | 5 |
95-th percentile | 9 |
Maximum | 9 |
Range | 9 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 2.9128036 |
---|---|
Coefficient of variation (CV) | 0.93599088 |
Kurtosis | -0.79110439 |
Mean | 3.112 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 0.6937214 |
Sum | 1556 |
Variance | 8.4844248 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 122 | |
2 | 78 | |
1 | 65 | |
3 | 61 | |
8 | 33 | 6.6% |
9 | 32 | 6.4% |
7 | 31 | 6.2% |
5 | 31 | 6.2% |
4 | 27 | 5.4% |
6 | 20 | 4.0% |
Value | Count | Frequency (%) |
0 | 122 | |
1 | 65 | |
2 | 78 | |
3 | 61 | |
4 | 27 | 5.4% |
5 | 31 | 6.2% |
6 | 20 | 4.0% |
7 | 31 | 6.2% |
8 | 33 | 6.6% |
9 | 32 | 6.4% |
Value | Count | Frequency (%) |
9 | 32 | 6.4% |
8 | 33 | 6.6% |
7 | 31 | 6.2% |
6 | 20 | 4.0% |
5 | 31 | 6.2% |
4 | 27 | 5.4% |
3 | 61 | |
2 | 78 | |
1 | 65 | |
0 | 122 |
성별(GENDER) | 연령대(AGE) | 주유소_결재(지출)건수(INDEX04_CNT) | 자동차_보유자수(INDEX04_CNT2) | 자가용이용_지수(INDEX04) | |
---|---|---|---|---|---|
성별(GENDER) | 1.000 | 0.000 | 0.000 | 0.050 | 0.000 |
연령대(AGE) | 0.000 | 1.000 | 0.138 | 0.000 | 0.070 |
주유소_결재(지출)건수(INDEX04_CNT) | 0.000 | 0.138 | 1.000 | 0.164 | 0.000 |
자동차_보유자수(INDEX04_CNT2) | 0.050 | 0.000 | 0.164 | 1.000 | 0.000 |
자가용이용_지수(INDEX04) | 0.000 | 0.070 | 0.000 | 0.000 | 1.000 |
연령대(AGE) | 주유소_결재(지출)건수(INDEX04_CNT) | 자동차_보유자수(INDEX04_CNT2) | 자가용이용_지수(INDEX04) | 성별(GENDER) | |
---|---|---|---|---|---|
연령대(AGE) | 1.000 | 0.026 | -0.004 | -0.001 | 0.000 |
주유소_결재(지출)건수(INDEX04_CNT) | 0.026 | 1.000 | 0.032 | -0.033 | 0.000 |
자동차_보유자수(INDEX04_CNT2) | -0.004 | 0.032 | 1.000 | -0.005 | 0.037 |
자가용이용_지수(INDEX04) | -0.001 | -0.033 | -0.005 | 1.000 | 0.000 |
성별(GENDER) | 0.000 | 0.000 | 0.037 | 0.000 | 1.000 |
서울시_블록ID(BLK_CD) | 성별(GENDER) | 연령대(AGE) | 주유소_결재(지출)건수(INDEX04_CNT) | 자동차_보유자수(INDEX04_CNT2) | 자가용이용_지수(INDEX04) | |
---|---|---|---|---|---|---|
0 | 3*9*5* | 2 | 3 | 10 | 2 | 0 |
1 | 2*7*7* | 1 | 4 | 65 | 1 | 3 |
2 | 2*2*8* | 1 | 6 | 10 | 1 | 4 |
3 | 2*3*3* | 2 | 5 | 45 | 0 | 1 |
4 | 1*4*7 | 1 | 4 | 201 | 2 | 1 |
5 | 2*3*1* | 2 | 3 | 327 | 1 | 8 |
6 | 3*4*7* | 2 | 4 | 266 | 1 | 7 |
7 | 3*7*9* | 2 | 6 | 15 | 2 | 0 |
8 | 2*4*8 | 2 | 6 | 292 | 2 | 7 |
9 | 2*5*8* | 2 | 5 | 30 | 2 | 0 |
서울시_블록ID(BLK_CD) | 성별(GENDER) | 연령대(AGE) | 주유소_결재(지출)건수(INDEX04_CNT) | 자동차_보유자수(INDEX04_CNT2) | 자가용이용_지수(INDEX04) | |
---|---|---|---|---|---|---|
490 | 3*2*2* | 2 | 5 | 30 | 1 | 1 |
491 | 1*0*1 | 1 | 3 | 15 | 2 | 0 |
492 | 2*1*4* | 2 | 3 | 50 | 1 | 6 |
493 | 2*9*3* | 1 | 5 | 10 | 1 | 1 |
494 | 3*5*9 | 2 | 6 | 45 | 0 | 4 |
495 | 1*6*7 | 1 | 5 | 141 | 0 | 4 |
496 | 1*5*7 | 1 | 2 | 80 | 0 | 4 |
497 | 1*6*4 | 2 | 5 | 166 | 1 | 2 |
498 | 1*9*6 | 2 | 7 | 10 | 1 | 8 |
499 | 3*3*4* | 2 | 5 | 50 | 1 | 0 |