Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 99 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 5.3 KiB |
Average record size in memory | 55.3 B |
Variable types
Categorical | 3 |
---|---|
Numeric | 3 |
Dataset
Description | Sample data |
---|---|
Author | Korea Credit Bureau (코리아크레딧뷰로) / 장윤상 |
URL | https://www.bigdata-transportation.kr/frn/prdt/detail?prdtId=PRDTNUM_000000020210 |
BS_YR_MON has constant value "201912" | Constant |
AGE_CD is highly overall correlated with SUM_LN_BAL | High correlation |
POP_CNT is highly overall correlated with SUM_LN_BAL | High correlation |
SUM_LN_BAL is highly overall correlated with AGE_CD and POP_CNT | High correlation |
SUM_LN_BAL has unique values | Unique |
Reproduction
Analysis started | 2023-12-11 22:34:59.269903 |
---|---|
Analysis finished | 2023-12-11 22:35:00.491371 |
Duration | 1.22 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
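The reported duration follows directly from the two timestamps above. A minimal Python sketch of that arithmetic, using only the standard library (the variable names are illustrative and not part of the report):

```python
from datetime import datetime

# Timestamps copied from the Reproduction table above.
started = datetime.fromisoformat("2023-12-11 22:34:59.269903")
finished = datetime.fromisoformat("2023-12-11 22:35:00.491371")

# Elapsed wall-clock time in seconds, rounded the way the report shows it.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```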
BS_YR_MON
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 924.0 B |
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 6 |
Min length | 6 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 201912 |
---|---|
2nd row | 201912 |
3rd row | 201912 |
4th row | 201912 |
5th row | 201912 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
201912 | 99 | 100.0% |
PRV_CD
Categorical
Distinct | 5 |
---|---|
Distinct (%) | 5.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 924.0 B |
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 5 |
Min length | 5 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 11110 |
---|---|
2nd row | 11110 |
3rd row | 11110 |
4th row | 11110 |
5th row | 11110 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
11110 | 22 | 22.2% |
11140 | 22 | 22.2% |
11170 | 22 | 22.2% |
11200 | 22 | 22.2% |
11215 | 11 | 11.1% |
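The Distinct (%) figure and the per-value frequencies for a categorical variable can be derived from the counts alone. The following is a minimal sketch using the PRV_CD counts above; the helper name `summarize` is hypothetical and is not taken from the profiling tool.

```python
# Counts copied from the PRV_CD "Common Values" table above.
prv_cd_counts = {"11110": 22, "11140": 22, "11170": 22, "11200": 22, "11215": 11}


def summarize(counts):
    """Return (n_rows, distinct, distinct_pct, {value: frequency_pct})."""
    n_rows = sum(counts.values())
    distinct = len(counts)
    # Percentages are rounded to one decimal place, as in the report.
    freq_pct = {value: round(100 * c / n_rows, 1) for value, c in counts.items()}
    return n_rows, distinct, round(100 * distinct / n_rows, 1), freq_pct


n_rows, distinct, distinct_pct, freq_pct = summarize(prv_cd_counts)
print(n_rows, distinct, distinct_pct)
for value, pct in freq_pct.items():
    print(value, f"{pct}%")
```

The same helper applies to GENDER (counts 55 and 44) or to any other categorical column in the report.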
GENDER
Categorical
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 924.0 B |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
1 | 55 | 55.6% |
2 | 44 | 44.4% |
AGE_CD
Real number (ℝ)
HIGH CORRELATION
Distinct | 11 |
---|---|
Distinct (%) | 11.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 49.636364 |
Minimum | 25 |
---|---|
Maximum | 71 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1023.0 B |
Quantile statistics
Minimum | 25 |
---|---|
5-th percentile | 25 |
Q1 | 35 |
median | 50 |
Q3 | 65 |
95-th percentile | 71 |
Maximum | 71 |
Range | 46 |
Interquartile range (IQR) | 30 |
Descriptive statistics
Standard deviation | 15.346644 |
---|---|
Coefficient of variation (CV) | 0.30918147 |
Kurtosis | -1.2980204 |
Mean | 49.636364 |
Median Absolute Deviation (MAD) | 15 |
Skewness | -0.092468713 |
Sum | 4914 |
Variance | 235.51948 |
Monotonicity | Not monotonic |
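Common Values
Value | Count | Frequency (%) |
---|---|---|
25 | 9 | 9.1% |
30 | 9 | 9.1% |
35 | 9 | 9.1% |
40 | 9 | 9.1% |
45 | 9 | 9.1% |
50 | 9 | 9.1% |
55 | 9 | 9.1% |
60 | 9 | 9.1% |
65 | 9 | 9.1% |
70 | 9 | 9.1% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
25 | 9 | 9.1% |
30 | 9 | 9.1% |
35 | 9 | 9.1% |
40 | 9 | 9.1% |
45 | 9 | 9.1% |
50 | 9 | 9.1% |
55 | 9 | 9.1% |
60 | 9 | 9.1% |
65 | 9 | 9.1% |
70 | 9 | 9.1% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
71 | 9 | 9.1% |
70 | 9 | 9.1% |
65 | 9 | 9.1% |
60 | 9 | 9.1% |
55 | 9 | 9.1% |
50 | 9 | 9.1% |
45 | 9 | 9.1% |
40 | 9 | 9.1% |
35 | 9 | 9.1% |
30 | 9 | 9.1% |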
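Because each of the 11 AGE_CD codes appears exactly 9 times, the summary statistics above can be re-derived from the value counts. The following is a minimal sketch with Python's standard `statistics` module. It assumes the report uses the sample (n - 1) variance, which is what reproduces the published figures; the variable names are illustrative.

```python
import statistics

# Every AGE_CD code appears 9 times (11 codes x 9 = 99 rows), per the tables above.
age_codes = [25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 71]
values = [code for code in age_codes for _ in range(9)]

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std = statistics.stdev(values)          # square root of the sample variance
cv = std / mean                         # coefficient of variation
median = statistics.median(values)
# Median absolute deviation: median of the distances from the median.
mad = statistics.median([abs(v - median) for v in values])

print(f"sum={total} mean={mean:.6f} variance={variance:.5f}")
print(f"std={std:.6f} cv={cv:.8f} median={median} mad={mad}")
```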
POP_CNT
Real number (ℝ)
HIGH CORRELATION
Distinct | 96 |
---|---|
Distinct (%) | 97.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 854.41414 |
Minimum | 35 |
---|---|
Maximum | 2186 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1023.0 B |
Quantile statistics
Minimum | 35 |
---|---|
5-th percentile | 71.8 |
Q1 | 498.5 |
median | 811 |
Q3 | 1193 |
95-th percentile | 1858 |
Maximum | 2186 |
Range | 2151 |
Interquartile range (IQR) | 694.5 |
Descriptive statistics
Standard deviation | 518.8596 |
---|---|
Coefficient of variation (CV) | 0.60726944 |
Kurtosis | -0.2896325 |
Mean | 854.41414 |
Median Absolute Deviation (MAD) | 346 |
Skewness | 0.50519806 |
Sum | 84587 |
Variance | 269215.29 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
---|---|---|
1858 | 2 | 2.0% |
777 | 2 | 2.0% |
567 | 2 | 2.0% |
47 | 1 | 1.0% |
1232 | 1 | 1.0% |
1930 | 1 | 1.0% |
1602 | 1 | 1.0% |
1691 | 1 | 1.0% |
872 | 1 | 1.0% |
411 | 1 | 1.0% |
Other values (86) | 86 | 86.9% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
35 | 1 | 1.0% |
43 | 1 | 1.0% |
44 | 1 | 1.0% |
47 | 1 | 1.0% |
52 | 1 | 1.0% |
74 | 1 | 1.0% |
80 | 1 | 1.0% |
81 | 1 | 1.0% |
87 | 1 | 1.0% |
204 | 1 | 1.0% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
2186 | 1 | 1.0% |
2040 | 1 | 1.0% |
2012 | 1 | 1.0% |
1930 | 1 | 1.0% |
1858 | 2 | 2.0% |
1829 | 1 | 1.0% |
1691 | 1 | 1.0% |
1610 | 1 | 1.0% |
1602 | 1 | 1.0% |
1555 | 1 | 1.0% |
SUM_LN_BAL
Real number (ℝ)
HIGH CORRELATION
UNIQUE
Distinct | 99 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 96633077 |
Minimum | 167279 |
---|---|
Maximum | 2.855001 × 10⁸ |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1023.0 B |
Quantile statistics
Minimum | 167279 |
---|---|
5-th percentile | 723402.6 |
Q1 | 35609735 |
median | 84018977 |
Q3 | 1.4272992 × 10⁸ |
95-th percentile | 2.3738833 × 10⁸ |
Maximum | 2.855001 × 10⁸ |
Range | 2.8533283 × 10⁸ |
Interquartile range (IQR) | 1.0712018 × 10⁸ |
Descriptive statistics
Standard deviation | 74253447 |
---|---|
Coefficient of variation (CV) | 0.76840611 |
Kurtosis | -0.53720982 |
Mean | 96633077 |
Median Absolute Deviation (MAD) | 54274017 |
Skewness | 0.58621078 |
Sum | 9.5666746 × 10⁹ |
Variance | 5.5135745 × 10¹⁵ |
Monotonicity | Not monotonic |
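Several of the SUM_LN_BAL figures above are derived from others: the range from the extremes, the IQR from the quartiles, the CV from the standard deviation and mean, and the variance from the standard deviation. The following is a minimal sketch of that arithmetic. The inputs are the rounded values printed in the report, so the results agree with it only up to rounding.

```python
# Summary values copied from the SUM_LN_BAL tables above.
minimum, maximum = 167_279, 285_500_105
q1, q3 = 35_609_735, 1.4272992e8  # Q3 is only reported to 8 significant digits
mean, std = 96_633_077, 74_253_447

value_range = maximum - minimum  # Range
iqr = q3 - q1                    # Interquartile range
cv = std / mean                  # Coefficient of variation
variance = std ** 2              # Variance is the squared standard deviation

print(f"range={value_range:.7e} iqr={iqr:.6e}")
print(f"cv={cv:.8f} variance={variance:.6e}")
```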
Common Values
Value | Count | Frequency (%) |
---|---|---|
757588 | 1 | 1.0% |
155302422 | 1 | 1.0% |
202615164 | 1 | 1.0% |
237962154 | 1 | 1.0% |
234092241 | 1 | 1.0% |
205325186 | 1 | 1.0% |
226508212 | 1 | 1.0% |
84941146 | 1 | 1.0% |
13626140 | 1 | 1.0% |
723627 | 1 | 1.0% |
Other values (89) | 89 | 89.9% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
167279 | 1 | 1.0% |
197251 | 1 | 1.0% |
214065 | 1 | 1.0% |
446743 | 1 | 1.0% |
721383 | 1 | 1.0% |
723627 | 1 | 1.0% |
757588 | 1 | 1.0% |
932432 | 1 | 1.0% |
2211873 | 1 | 1.0% |
4230108 | 1 | 1.0% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
285500105 | 1 | 1.0% |
255503820 | 1 | 1.0% |
242748400 | 1 | 1.0% |
242484604 | 1 | 1.0% |
237962154 | 1 | 1.0% |
237324568 | 1 | 1.0% |
234640756 | 1 | 1.0% |
234092241 | 1 | 1.0% |
226508212 | 1 | 1.0% |
205325186 | 1 | 1.0% |
Correlations
Variable | PRV_CD | GENDER | AGE_CD | POP_CNT | SUM_LN_BAL |
---|---|---|---|---|---|
PRV_CD | 1.000 | 0.204 | 0.000 | 0.689 | 0.538 |
GENDER | 0.204 | 1.000 | 0.000 | 0.397 | 0.509 |
AGE_CD | 0.000 | 0.000 | 1.000 | 0.744 | 0.720 |
POP_CNT | 0.689 | 0.397 | 0.744 | 1.000 | 0.873 |
SUM_LN_BAL | 0.538 | 0.509 | 0.720 | 0.873 | 1.000 |
Variable | PRV_CD | GENDER |
---|---|---|
PRV_CD | 1.000 | 0.245 |
GENDER | 0.245 | 1.000 |
Variable | AGE_CD | POP_CNT | SUM_LN_BAL | PRV_CD | GENDER |
---|---|---|---|---|---|
AGE_CD | 1.000 | 0.361 | 0.574 | 0.000 | 0.000 |
POP_CNT | 0.361 | 1.000 | 0.893 | 0.344 | 0.291 |
SUM_LN_BAL | 0.574 | 0.893 | 1.000 | 0.241 | 0.370 |
PRV_CD | 0.000 | 0.344 | 0.241 | 1.000 | 0.245 |
GENDER | 0.000 | 0.291 | 0.370 | 0.245 | 1.000 |
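The "High correlation" alerts near the top of the report can be cross-checked against the last correlation matrix above. The following is a minimal sketch. The 0.5 cutoff is an assumption, since the report does not state the threshold or which matrix drives the alerts, and `high_correlations` is a hypothetical helper rather than part of the profiling tool. With these inputs the flagged pairs match the three alerts listed.

```python
# Matrix copied from the last correlation table above (same row and column order).
names = ["AGE_CD", "POP_CNT", "SUM_LN_BAL", "PRV_CD", "GENDER"]
matrix = [
    [1.000, 0.361, 0.574, 0.000, 0.000],
    [0.361, 1.000, 0.893, 0.344, 0.291],
    [0.574, 0.893, 1.000, 0.241, 0.370],
    [0.000, 0.344, 0.241, 1.000, 0.245],
    [0.000, 0.291, 0.370, 0.245, 1.000],
]
THRESHOLD = 0.5  # assumed cutoff; not stated in the report


def high_correlations(names, matrix, threshold):
    """Map each variable to the other variables whose coefficient meets the threshold."""
    flagged = {}
    for i, name in enumerate(names):
        partners = [
            names[j]
            for j, value in enumerate(matrix[i])
            if j != i and abs(value) >= threshold
        ]
        if partners:
            flagged[name] = partners
    return flagged


alerts = high_correlations(names, matrix, THRESHOLD)
for name, partners in alerts.items():
    print(f"{name} is highly correlated with {', '.join(partners)}")
```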
First rows
Row | BS_YR_MON | PRV_CD | GENDER | AGE_CD | POP_CNT | SUM_LN_BAL |
---|---|---|---|---|---|---|
0 | 201912 | 11110 | 1 | 25 | 47 | 757588 |
1 | 201912 | 11110 | 1 | 30 | 225 | 5436771 |
2 | 201912 | 11110 | 1 | 35 | 521 | 27710484 |
3 | 201912 | 11110 | 1 | 40 | 734 | 62684726 |
4 | 201912 | 11110 | 1 | 45 | 777 | 129186547 |
5 | 201912 | 11110 | 1 | 50 | 997 | 105383136 |
6 | 201912 | 11110 | 1 | 55 | 1022 | 116259168 |
7 | 201912 | 11110 | 1 | 60 | 1120 | 144205047 |
8 | 201912 | 11110 | 1 | 65 | 923 | 138292994 |
9 | 201912 | 11110 | 1 | 70 | 567 | 79484684 |
Last rows
Row | BS_YR_MON | PRV_CD | GENDER | AGE_CD | POP_CNT | SUM_LN_BAL |
---|---|---|---|---|---|---|
89 | 201912 | 11215 | 1 | 30 | 535 | 13366457 |
90 | 201912 | 11215 | 1 | 35 | 1049 | 55815531 |
91 | 201912 | 11215 | 1 | 40 | 1531 | 119324740 |
92 | 201912 | 11215 | 1 | 45 | 1610 | 144190873 |
93 | 201912 | 11215 | 1 | 50 | 2012 | 198909694 |
94 | 201912 | 11215 | 1 | 55 | 2040 | 194101186 |
95 | 201912 | 11215 | 1 | 60 | 2186 | 237324568 |
96 | 201912 | 11215 | 1 | 65 | 1829 | 189403270 |
97 | 201912 | 11215 | 1 | 70 | 966 | 141268958 |
98 | 201912 | 11215 | 1 | 71 | 954 | 123926181 |