Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 99 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 5.3 KiB |
Average record size in memory | 55.3 B |
Variable types
Numeric | 5 |
---|---|
Categorical | 1 |
Dataset
Description | Sample data |
---|---|
Author | Korea Credit Bureau (코리아크레딧뷰로) / 장윤상 |
URL | https://www.bigdata-transportation.kr/frn/prdt/detail?prdtId=PRDTNUM_000000020198 |
Alerts
ADM_CD is highly overall correlated with POP_CNT and 1 other field | High correlation |
POP_CNT is highly overall correlated with ADM_CD and 1 other field | High correlation |
ECO_BH_CNT is highly overall correlated with ADM_CD and 1 other field | High correlation |
Reproduction
Analysis started | 2023-12-11 22:34:42.040423 |
---|---|
Analysis finished | 2023-12-11 22:34:44.201640 |
Duration | 2.16 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
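A report like this one can be regenerated from the source table with ydata-profiling. The following is a minimal sketch, not the configuration actually used here: the function name `build_report`, the CSV path, and the output file name are hypothetical, and the `dataset` metadata only mirrors the fields shown in the Dataset table above.

```python
def build_report(csv_path: str, out_path: str = "report.html") -> None:
    """Profile a CSV file and write an HTML report (illustrative sketch)."""
    # Imported lazily so this function can be defined even where the
    # libraries are not installed; both are required to actually run it.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the data is available as a CSV with the six columns
    # listed in this report (BS_YR_MON, ADM_CD, GENDER, AGE_CD, POP_CNT,
    # ECO_BH_CNT).
    df = pd.read_csv(csv_path)

    report = ProfileReport(
        df,
        title="Sample data",  # hypothetical title
        dataset={
            "description": "Sample data",
            "url": "https://www.bigdata-transportation.kr/frn/prdt/detail?prdtId=PRDTNUM_000000020198",
        },
    )
    report.to_file(out_path)
```

Calling `build_report("sample.csv")` (a hypothetical file name) would write `report.html` next to the script. Timings and memory sizes will differ between runs and machines.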
BS_YR_MON
Real number (ℝ)
Distinct | 10 |
---|---|
Distinct (%) | 10.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 202057.78 |
Minimum | 201912 |
---|---|
Maximum | 202203 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1023.0 B |
Quantile statistics
Minimum | 201912 |
---|---|
5-th percentile | 201912 |
Q1 | 202006 |
median | 202012 |
Q3 | 202109 |
95-th percentile | 202203 |
Maximum | 202203 |
Range | 291 |
Interquartile range (IQR) | 103 |
Descriptive statistics
Standard deviation | 78.626287 |
---|---|
Coefficient of variation (CV) | 0.00038912774 |
Kurtosis | -0.52972509 |
Mean | 202057.78 |
Median Absolute Deviation (MAD) | 91 |
Skewness | 0.1640678 |
Sum | 20003720 |
Variance | 6182.093 |
Monotonicity | Not monotonic |
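The descriptive statistics above are related by simple identities (mean = sum / n, range = max - min, IQR = Q3 - Q1, variance = std², CV = std / mean). The sketch below re-derives them from the reported values as a consistency check; the numbers are copied from the BS_YR_MON tables, and the tolerances are an arbitrary choice that allows for rounding in the report.

```python
import math

# Values copied from the BS_YR_MON statistics above.
n = 99                      # number of observations
total = 20003720            # Sum
std = 78.626287             # Standard deviation
variance = 6182.093         # Variance
minimum, maximum = 201912, 202203
q1, q3 = 202006, 202109

mean = total / n

# Each entry is True when the reported figure matches the identity.
checks = {
    "mean": math.isclose(mean, 202057.78, abs_tol=0.01),
    "range": maximum - minimum == 291,
    "iqr": q3 - q1 == 103,
    "variance": math.isclose(std ** 2, variance, rel_tol=1e-5),
    "cv": math.isclose(std / mean, 0.00038912774, rel_tol=1e-5),
}

for name, ok in checks.items():
    print(f"{name}: {'consistent' if ok else 'MISMATCH'}")
```

The same check applies to the other numeric variables by swapping in their reported values.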
Common values
Value | Count | Frequency (%) |
---|---|---|
202003 | 13 | 13.1% |
202112 | 11 | 11.1% |
202006 | 11 | 11.1% |
202009 | 11 | 11.1% |
202203 | 11 | 11.1% |
202106 | 10 | 10.1% |
202012 | 9 | 9.1% |
202103 | 8 | 8.1% |
201912 | 8 | 8.1% |
202109 | 7 | 7.1% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
201912 | 8 | 8.1% |
202003 | 13 | 13.1% |
202006 | 11 | 11.1% |
202009 | 11 | 11.1% |
202012 | 9 | 9.1% |
202103 | 8 | 8.1% |
202106 | 10 | 10.1% |
202109 | 7 | 7.1% |
202112 | 11 | 11.1% |
202203 | 11 | 11.1% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
202203 | 11 | 11.1% |
202112 | 11 | 11.1% |
202109 | 7 | 7.1% |
202106 | 10 | 10.1% |
202103 | 8 | 8.1% |
202012 | 9 | 9.1% |
202009 | 11 | 11.1% |
202006 | 11 | 11.1% |
202003 | 13 | 13.1% |
201912 | 8 | 8.1% |
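BS_YR_MON is profiled as a real number, but its values look like YYYYMM period codes, so the reported range (291) and standard deviation are not measured in months. The sketch below, which assumes the YYYYMM reading is correct, converts each code to a running month index to get the actual time span and spacing. The helper name `yyyymm_to_month_index` is introduced here for illustration.

```python
def yyyymm_to_month_index(code: int) -> int:
    """Convert a YYYYMM integer to a running month count (year * 12 + month - 1)."""
    year, month = divmod(code, 100)
    if not 1 <= month <= 12:
        raise ValueError(f"not a valid YYYYMM code: {code}")
    return year * 12 + (month - 1)


# The ten distinct values listed in the tables above, in ascending order.
periods = [201912, 202003, 202006, 202009, 202012,
           202103, 202106, 202109, 202112, 202203]

indices = [yyyymm_to_month_index(p) for p in periods]

# Span between the first and last period, in months.
span_months = indices[-1] - indices[0]

# Distinct gaps between consecutive periods, in months.
steps = sorted({b - a for a, b in zip(indices, indices[1:])})

print(span_months, steps)
```

Under this reading the data covers 27 months in 3-month steps, which suggests quarterly snapshots rather than a continuous numeric scale.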
ADM_CD
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 98 |
---|---|
Distinct (%) | 99.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 37171433 |
Minimum | 11110690 |
---|---|
Maximum | 50110560 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1023.0 B |
Quantile statistics
Minimum | 11110690 |
---|---|
5-th percentile | 11440600 |
Q1 | 28200620 |
median | 42110620 |
Q3 | 47170400 |
95-th percentile | 48750379 |
Maximum | 50110560 |
Range | 38999870 |
Interquartile range (IQR) | 18969780 |
Descriptive statistics
Standard deviation | 12656844 |
---|---|
Coefficient of variation (CV) | 0.34049923 |
Kurtosis | -0.22471058 |
Mean | 37171433 |
Median Absolute Deviation (MAD) | 6000105 |
Skewness | -1.0557503 |
Sum | 3.6799718 × 10⁹ |
Variance | 1.6019571 × 10¹⁴ |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
---|---|---|
42130330 | 2 | 2.0% |
11620630 | 1 | 1.0% |
50110560 | 1 | 1.0% |
41800310 | 1 | 1.0% |
45140646 | 1 | 1.0% |
46790395 | 1 | 1.0% |
47210610 | 1 | 1.0% |
44150570 | 1 | 1.0% |
41480530 | 1 | 1.0% |
47130250 | 1 | 1.0% |
Other values (88) | 88 | 88.9% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
11110690 | 1 | 1.0% |
11140590 | 1 | 1.0% |
11170530 | 1 | 1.0% |
11170650 | 1 | 1.0% |
11170690 | 1 | 1.0% |
11470590 | 1 | 1.0% |
11470611 | 1 | 1.0% |
11530560 | 1 | 1.0% |
11545510 | 1 | 1.0% |
11560650 | 1 | 1.0% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
50110560 | 1 | 1.0% |
48890430 | 1 | 1.0% |
48880410 | 1 | 1.0% |
48850410 | 1 | 1.0% |
48840370 | 1 | 1.0% |
48740380 | 1 | 1.0% |
48740320 | 1 | 1.0% |
48720330 | 1 | 1.0% |
48310600 | 1 | 1.0% |
48250320 | 1 | 1.0% |
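ADM_CD appears to be an identifier rather than a measurement, so its mean, variance, and skewness carry little meaning. The sketch below treats the code as a string and groups by its leading digits. It assumes, without confirmation from the source, that ADM_CD is an 8-digit hierarchical administrative code whose first two digits identify the top-level region; the 2+3+3 split and the helper name `split_adm_code` are illustrative only.

```python
from collections import Counter

# The ten most common ADM_CD values listed above.
adm_codes = [42130330, 11620630, 50110560, 41800310, 45140646,
             46790395, 47210610, 44150570, 41480530, 47130250]


def split_adm_code(code: int) -> tuple[str, str, str]:
    """Split an 8-digit code into (2, 3, 3)-digit segments (assumed layout)."""
    text = str(code)
    if len(text) != 8:
        raise ValueError(f"expected an 8-digit code, got {code}")
    return text[:2], text[2:5], text[5:]


# Count how many of the listed codes share each leading two-digit prefix.
prefix_counts = Counter(split_adm_code(c)[0] for c in adm_codes)

for prefix, count in sorted(prefix_counts.items()):
    print(prefix, count)
```

If the assumption holds, loading the column as text (or as a categorical type) before profiling would avoid the numeric summary altogether.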
GENDER
Categorical
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 924.0 B |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2 |
---|---|
2nd row | 2 |
3rd row | 1 |
4th row | 1 |
5th row | 2 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
2 | 50 | 50.5% |
1 | 49 | 49.5% |
AGE_CD
Real number (ℝ)
Distinct | 11 |
---|---|
Distinct (%) | 11.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 48 |
Minimum | 25 |
---|---|
Maximum | 71 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1023.0 B |
Quantile statistics
Minimum | 25 |
---|---|
5-th percentile | 25 |
Q1 | 35 |
median | 50 |
Q3 | 60 |
95-th percentile | 71 |
Maximum | 71 |
Range | 46 |
Interquartile range (IQR) | 25 |
Descriptive statistics
Standard deviation | 14.937284 |
---|---|
Coefficient of variation (CV) | 0.31119341 |
Kurtosis | -1.155448 |
Mean | 48 |
Median Absolute Deviation (MAD) | 15 |
Skewness | -0.00018748811 |
Sum | 4752 |
Variance | 223.12245 |
Monotonicity | Not monotonic |
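Common values
Value | Count | Frequency (%) |
---|---|---|
45 | 12 | 12.1% |
25 | 12 | 12.1% |
55 | 11 | 11.1% |
35 | 11 | 11.1% |
50 | 11 | 11.1% |
60 | 9 | 9.1% |
70 | 8 | 8.1% |
71 | 7 | 7.1% |
30 | 7 | 7.1% |
40 | 6 | 6.1% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
25 | 12 | 12.1% |
30 | 7 | 7.1% |
35 | 11 | 11.1% |
40 | 6 | 6.1% |
45 | 12 | 12.1% |
50 | 11 | 11.1% |
55 | 11 | 11.1% |
60 | 9 | 9.1% |
65 | 5 | 5.1% |
70 | 8 | 8.1% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
71 | 7 | 7.1% |
70 | 8 | 8.1% |
65 | 5 | 5.1% |
60 | 9 | 9.1% |
55 | 11 | 11.1% |
50 | 11 | 11.1% |
45 | 12 | 12.1% |
40 | 6 | 6.1% |
35 | 11 | 11.1% |
30 | 7 | 7.1% |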
POP_CNT
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 92 |
---|---|
Distinct (%) | 92.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 493.35354 |
Minimum | 12 |
---|---|
Maximum | 1784 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1023.0 B |
Quantile statistics
Minimum | 12 |
---|---|
5-th percentile | 26.7 |
Q1 | 111 |
median | 378 |
Q3 | 744 |
95-th percentile | 1373.2 |
Maximum | 1784 |
Range | 1772 |
Interquartile range (IQR) | 633 |
Descriptive statistics
Standard deviation | 447.61145 |
---|---|
Coefficient of variation (CV) | 0.90728334 |
Kurtosis | 0.37086166 |
Mean | 493.35354 |
Median Absolute Deviation (MAD) | 292 |
Skewness | 1.0259837 |
Sum | 48842 |
Variance | 200356.01 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
---|---|---|
216 | 3 | 3.0% |
359 | 2 | 2.0% |
1611 | 2 | 2.0% |
210 | 2 | 2.0% |
65 | 2 | 2.0% |
561 | 2 | 2.0% |
1187 | 1 | 1.0% |
197 | 1 | 1.0% |
748 | 1 | 1.0% |
1091 | 1 | 1.0% |
Other values (82) | 82 | 82.8% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
12 | 1 | 1.0% |
13 | 1 | 1.0% |
17 | 1 | 1.0% |
20 | 1 | 1.0% |
24 | 1 | 1.0% |
27 | 1 | 1.0% |
29 | 1 | 1.0% |
34 | 1 | 1.0% |
49 | 1 | 1.0% |
54 | 1 | 1.0% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
1784 | 1 | 1.0% |
1706 | 1 | 1.0% |
1611 | 2 | 2.0% |
1564 | 1 | 1.0% |
1352 | 1 | 1.0% |
1297 | 1 | 1.0% |
1187 | 1 | 1.0% |
1164 | 1 | 1.0% |
1137 | 1 | 1.0% |
1102 | 1 | 1.0% |
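POP_CNT is right-skewed (mean 493.35 against a median of 378, skewness about 1.03), so it is worth checking how far the largest values sit from the bulk of the data. The sketch below applies Tukey's 1.5 × IQR fences to the quartiles and the largest values reported above. The 1.5 multiplier is the conventional choice, not something the report itself uses.

```python
# Quartiles and extremes copied from the POP_CNT statistics above.
q1, q3 = 111, 744
minimum = 12

iqr = q3 - q1                      # 633, matches the reported IQR
lower_fence = q1 - 1.5 * iqr       # below this, a value counts as a low outlier
upper_fence = q3 + 1.5 * iqr       # above this, a value counts as a high outlier

# The ten largest distinct values listed above.
largest_values = [1784, 1706, 1611, 1564, 1352, 1297, 1187, 1164, 1137, 1102]

high_outliers = [v for v in largest_values if v > upper_fence]
has_low_outliers = minimum < lower_fence

print(f"fences: [{lower_fence}, {upper_fence}]")
print(f"high outliers: {high_outliers}")
print(f"any low outliers: {has_low_outliers}")
```

Only the two largest values fall outside the upper fence, and the lower fence is negative, so no small value is flagged.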
ECO_BH_CNT
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 93 |
---|---|
Distinct (%) | 93.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 414.41414 |
Minimum | 10 |
---|---|
Maximum | 1580 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1023.0 B |
Quantile statistics
Minimum | 10 |
---|---|
5-th percentile | 16.6 |
Q1 | 82.5 |
median | 272 |
Q3 | 670 |
95-th percentile | 1163.9 |
Maximum | 1580 |
Range | 1570 |
Interquartile range (IQR) | 587.5 |
Descriptive statistics
Standard deviation | 392.70698 |
---|---|
Coefficient of variation (CV) | 0.94761965 |
Kurtosis | 0.43659719 |
Mean | 414.41414 |
Median Absolute Deviation (MAD) | 225 |
Skewness | 1.0607633 |
Sum | 41027 |
Variance | 154218.78 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
---|---|---|
139 | 3 | 3.0% |
247 | 2 | 2.0% |
54 | 2 | 2.0% |
13 | 2 | 2.0% |
277 | 2 | 2.0% |
598 | 1 | 1.0% |
161 | 1 | 1.0% |
12 | 1 | 1.0% |
176 | 1 | 1.0% |
666 | 1 | 1.0% |
Other values (83) | 83 | 83.8% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
10 | 1 | 1.0% |
11 | 1 | 1.0% |
12 | 1 | 1.0% |
13 | 2 | 2.0% |
17 | 1 | 1.0% |
21 | 1 | 1.0% |
23 | 1 | 1.0% |
26 | 1 | 1.0% |
39 | 1 | 1.0% |
40 | 1 | 1.0% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
1580 | 1 | 1.0% |
1567 | 1 | 1.0% |
1421 | 1 | 1.0% |
1291 | 1 | 1.0% |
1226 | 1 | 1.0% |
1157 | 1 | 1.0% |
1156 | 1 | 1.0% |
1103 | 1 | 1.0% |
1040 | 1 | 1.0% |
924 | 1 | 1.0% |
Variable | BS_YR_MON | ADM_CD | GENDER | AGE_CD | POP_CNT | ECO_BH_CNT |
---|---|---|---|---|---|---|
BS_YR_MON | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
ADM_CD | 0.000 | 1.000 | 0.293 | 0.260 | 0.477 | 0.591 |
GENDER | 0.000 | 0.293 | 1.000 | 0.000 | 0.433 | 0.000 |
AGE_CD | 0.000 | 0.260 | 0.000 | 1.000 | 0.254 | 0.267 |
POP_CNT | 0.000 | 0.477 | 0.433 | 0.254 | 1.000 | 0.971 |
ECO_BH_CNT | 0.000 | 0.591 | 0.000 | 0.267 | 0.971 | 1.000 |
Variable | BS_YR_MON | ADM_CD | AGE_CD | POP_CNT | ECO_BH_CNT | GENDER |
---|---|---|---|---|---|---|
BS_YR_MON | 1.000 | -0.055 | 0.064 | 0.057 | 0.038 | 0.000 |
ADM_CD | -0.055 | 1.000 | 0.019 | -0.529 | -0.513 | 0.221 |
AGE_CD | 0.064 | 0.019 | 1.000 | -0.085 | -0.142 | 0.000 |
POP_CNT | 0.057 | -0.529 | -0.085 | 1.000 | 0.990 | 0.317 |
ECO_BH_CNT | 0.038 | -0.513 | -0.142 | 0.990 | 1.000 | 0.000 |
GENDER | 0.000 | 0.221 | 0.000 | 0.317 | 0.000 | 1.000 |
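The three high-correlation alerts near the top of the report can be reproduced from the matrix with signed coefficients directly above. The sketch below flags every pair whose absolute coefficient reaches a cutoff. The cutoff of 0.5 is an assumption chosen because it reproduces the alerts; the report does not state the threshold, and `high_correlations` is a helper name introduced here.

```python
names = ["BS_YR_MON", "ADM_CD", "AGE_CD", "POP_CNT", "ECO_BH_CNT", "GENDER"]

# Coefficients copied row by row from the matrix above.
matrix = [
    [1.000, -0.055, 0.064, 0.057, 0.038, 0.000],
    [-0.055, 1.000, 0.019, -0.529, -0.513, 0.221],
    [0.064, 0.019, 1.000, -0.085, -0.142, 0.000],
    [0.057, -0.529, -0.085, 1.000, 0.990, 0.317],
    [0.038, -0.513, -0.142, 0.990, 1.000, 0.000],
    [0.000, 0.221, 0.000, 0.317, 0.000, 1.000],
]

THRESHOLD = 0.5  # assumed cutoff, not taken from the report


def high_correlations(names, matrix, threshold):
    """Map each variable to the other variables with |coefficient| >= threshold."""
    flagged = {}
    for i, row in enumerate(matrix):
        partners = [names[j] for j, r in enumerate(row)
                    if j != i and abs(r) >= threshold]
        if partners:
            flagged[names[i]] = partners
    return flagged


flagged = high_correlations(names, matrix, THRESHOLD)
for name, partners in flagged.items():
    print(f"{name} is highly correlated with {', '.join(partners)}")
```

With this cutoff, ADM_CD, POP_CNT, and ECO_BH_CNT are each flagged against the other two, matching the alerts, while BS_YR_MON, AGE_CD, and GENDER are not flagged.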
First rows
Index | BS_YR_MON | ADM_CD | GENDER | AGE_CD | POP_CNT | ECO_BH_CNT |
---|---|---|---|---|---|---|
0 | 202112 | 11620630 | 2 | 65 | 692 | 598 |
1 | 202006 | 45130400 | 2 | 55 | 57 | 40 |
2 | 202012 | 42820250 | 1 | 71 | 378 | 247 |
3 | 202009 | 48740320 | 1 | 70 | 60 | 47 |
4 | 202112 | 48127545 | 2 | 45 | 548 | 501 |
5 | 202203 | 41173510 | 2 | 25 | 865 | 833 |
6 | 202106 | 48740380 | 2 | 35 | 90 | 43 |
7 | 202203 | 45740320 | 1 | 35 | 20 | 13 |
8 | 202103 | 27170600 | 1 | 30 | 270 | 258 |
9 | 202009 | 30110590 | 1 | 45 | 394 | 277 |
Last rows
Index | BS_YR_MON | ADM_CD | GENDER | AGE_CD | POP_CNT | ECO_BH_CNT |
---|---|---|---|---|---|---|
89 | 202012 | 29200550 | 2 | 45 | 160 | 139 |
90 | 202203 | 45140410 | 1 | 30 | 139 | 86 |
91 | 202006 | 48890430 | 2 | 25 | 65 | 63 |
92 | 202003 | 28177620 | 2 | 25 | 802 | 733 |
93 | 202112 | 42770320 | 2 | 45 | 77 | 71 |
94 | 202106 | 48850410 | 1 | 71 | 420 | 277 |
95 | 202109 | 48220665 | 2 | 25 | 259 | 253 |
96 | 202109 | 41220330 | 2 | 30 | 777 | 747 |
97 | 202003 | 46720370 | 2 | 35 | 54 | 41 |
98 | 202003 | 41220600 | 1 | 30 | 561 | 522 |
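The near-perfect correlation between POP_CNT and ECO_BH_CNT is visible even in the 20 sample rows shown above. The sketch below computes a plain Pearson coefficient and the ECO_BH_CNT / POP_CNT ratio for those rows only. It is an illustration on a subset, so the result will not equal the full-data coefficients in the correlation tables, and the reading of ECO_BH_CNT as a subset of POP_CNT is an inference from the numbers, not something the source states.

```python
import math

# (POP_CNT, ECO_BH_CNT) pairs from the first and last ten rows above.
rows = [
    (692, 598), (57, 40), (378, 247), (60, 47), (548, 501),
    (865, 833), (90, 43), (20, 13), (270, 258), (394, 277),
    (160, 139), (139, 86), (65, 63), (802, 733), (77, 71),
    (420, 277), (259, 253), (777, 747), (54, 41), (561, 522),
]


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


pop = [p for p, _ in rows]
eco = [e for _, e in rows]

r = pearson(pop, eco)
ratios = [e / p for p, e in rows]
eco_never_exceeds_pop = all(e <= p for p, e in rows)

print(f"Pearson r on 20 sample rows: {r:.3f}")
print(f"ECO_BH_CNT / POP_CNT ratio: min {min(ratios):.2f}, max {max(ratios):.2f}")
print(f"ECO_BH_CNT <= POP_CNT in every row: {eco_never_exceeds_pop}")
```

Because the two columns move together so closely, a downstream model would gain little from using both as independent features; the ratio may be the more informative quantity.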