Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 99 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 5.3 KiB |
Average record size in memory | 55.3 B |
Variable types
Categorical | 2 |
---|---|
Numeric | 4 |
Dataset
Description | 샘플 데이터 |
---|---|
Author | 코리아크레딧뷰로 / 장윤상 |
URL | https://www.bigdata-transportation.kr/frn/prdt/detail?prdtId=PRDTNUM_000000020213 |
BS_YR_MON has constant value "" | Constant |
POP_CNT is highly overall correlated with SUM_LN_BAL and 1 other fields | High correlation |
SUM_LN_BAL is highly overall correlated with POP_CNT | High correlation |
GENDER is highly overall correlated with POP_CNT | High correlation |
SUM_LN_BAL has 3 (3.0%) zeros | Zeros |
Reproduction
Analysis started | 2024-04-21 01:30:23.609869 |
---|---|
Analysis finished | 2024-04-21 01:30:27.824625 |
Duration | 4.21 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
BS_YR_MON
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 920.0 B |
201912 |
---|
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 6 |
Min length | 6 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 201912 |
---|---|
2nd row | 201912 |
3rd row | 201912 |
4th row | 201912 |
5th row | 201912 |
Common Values
Value | Count | Frequency (%) |
201912 | 99 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
201912 | 99 |
PRV_CD
Real number (ℝ)
Distinct | 7 |
---|---|
Distinct (%) | 7.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 11185.455 |
Minimum | 11110 |
---|---|
Maximum | 11260 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1019.0 B |
Quantile statistics
Minimum | 11110 |
---|---|
5-th percentile | 11110 |
Q1 | 11140 |
median | 11200 |
Q3 | 11215 |
95-th percentile | 11233 |
Maximum | 11260 |
Range | 150 |
Interquartile range (IQR) | 75 |
Descriptive statistics
Standard deviation | 44.199749 |
---|---|
Coefficient of variation (CV) | 0.003951538 |
Kurtosis | -0.96669026 |
Mean | 11185.455 |
Median Absolute Deviation (MAD) | 30 |
Skewness | -0.38196219 |
Sum | 1107360 |
Variance | 1953.6178 |
Monotonicity | Increasing |
Value | Count | Frequency (%) |
11230 | 18 | |
11200 | 17 | |
11170 | 16 | |
11215 | 16 | |
11110 | 14 | |
11140 | 13 | |
11260 | 5 | 5.1% |
Value | Count | Frequency (%) |
11110 | 14 | |
11140 | 13 | |
11170 | 16 | |
11200 | 17 | |
11215 | 16 | |
11230 | 18 | |
11260 | 5 | 5.1% |
Value | Count | Frequency (%) |
11260 | 5 | 5.1% |
11230 | 18 | |
11215 | 16 | |
11200 | 17 | |
11170 | 16 | |
11140 | 13 | |
11110 | 14 |
GENDER
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 920.0 B |
1 | |
---|---|
2 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 64 | |
2 | 35 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1 | 64 | |
2 | 35 |
AGE_CD
Real number (ℝ)
Distinct | 10 |
---|---|
Distinct (%) | 10.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 53.070707 |
Minimum | 30 |
---|---|
Maximum | 71 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1019.0 B |
Quantile statistics
Minimum | 30 |
---|---|
5-th percentile | 30 |
Q1 | 45 |
median | 55 |
Q3 | 65 |
95-th percentile | 71 |
Maximum | 71 |
Range | 41 |
Interquartile range (IQR) | 20 |
Descriptive statistics
Standard deviation | 13.097166 |
---|---|
Coefficient of variation (CV) | 0.24678711 |
Kurtosis | -1.1315423 |
Mean | 53.070707 |
Median Absolute Deviation (MAD) | 10 |
Skewness | -0.19934397 |
Sum | 5254 |
Variance | 171.53577 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
50 | 13 | |
45 | 11 | |
60 | 11 | |
65 | 11 | |
55 | 10 | |
70 | 10 | |
40 | 9 | |
71 | 9 | |
30 | 8 | |
35 | 7 |
Value | Count | Frequency (%) |
30 | 8 | |
35 | 7 | |
40 | 9 | |
45 | 11 | |
50 | 13 | |
55 | 10 | |
60 | 11 | |
65 | 11 | |
70 | 10 | |
71 | 9 |
Value | Count | Frequency (%) |
71 | 9 | |
70 | 10 | |
65 | 11 | |
60 | 11 | |
55 | 10 | |
50 | 13 | |
45 | 11 | |
40 | 9 | |
35 | 7 | |
30 | 8 |
POP_CNT
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 53 |
---|---|
Distinct (%) | 53.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 36.070707 |
Minimum | 3 |
---|---|
Maximum | 185 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1019.0 B |
Quantile statistics
Minimum | 3 |
---|---|
5-th percentile | 3 |
Q1 | 5.5 |
median | 18 |
Q3 | 45.5 |
95-th percentile | 131.8 |
Maximum | 185 |
Range | 182 |
Interquartile range (IQR) | 40 |
Descriptive statistics
Standard deviation | 44.41209 |
---|---|
Coefficient of variation (CV) | 1.2312509 |
Kurtosis | 2.2997154 |
Mean | 36.070707 |
Median Absolute Deviation (MAD) | 14 |
Skewness | 1.7280239 |
Sum | 3571 |
Variance | 1972.4337 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
3 | 10 | 10.1% |
4 | 9 | 9.1% |
6 | 6 | 6.1% |
5 | 6 | 6.1% |
11 | 5 | 5.1% |
9 | 3 | 3.0% |
18 | 3 | 3.0% |
24 | 2 | 2.0% |
130 | 2 | 2.0% |
45 | 2 | 2.0% |
Other values (43) | 51 |
Value | Count | Frequency (%) |
3 | 10 | |
4 | 9 | |
5 | 6 | |
6 | 6 | |
7 | 2 | 2.0% |
8 | 2 | 2.0% |
9 | 3 | 3.0% |
10 | 1 | 1.0% |
11 | 5 | |
13 | 2 | 2.0% |
Value | Count | Frequency (%) |
185 | 1 | |
184 | 1 | |
159 | 1 | |
150 | 1 | |
148 | 1 | |
130 | 2 | |
129 | 1 | |
117 | 1 | |
110 | 1 | |
105 | 1 |
SUM_LN_BAL
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 97 |
---|---|
Distinct (%) | 98.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1928966.4 |
Minimum | 0 |
---|---|
Maximum | 10091411 |
Zeros | 3 |
Zeros (%) | 3.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1019.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 29750.3 |
Q1 | 309323 |
median | 818092 |
Q3 | 2749857 |
95-th percentile | 6765305.4 |
Maximum | 10091411 |
Range | 10091411 |
Interquartile range (IQR) | 2440534 |
Descriptive statistics
Standard deviation | 2281677.3 |
---|---|
Coefficient of variation (CV) | 1.1828497 |
Kurtosis | 2.203342 |
Mean | 1928966.4 |
Median Absolute Deviation (MAD) | 767478 |
Skewness | 1.6172294 |
Sum | 1.9096767 × 108 |
Variance | 5.2060511 × 1012 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 3 | 3.0% |
36196 | 1 | 1.0% |
1389355 | 1 | 1.0% |
91072 | 1 | 1.0% |
594322 | 1 | 1.0% |
386455 | 1 | 1.0% |
33509 | 1 | 1.0% |
5149079 | 1 | 1.0% |
3732660 | 1 | 1.0% |
7335830 | 1 | 1.0% |
Other values (87) | 87 |
Value | Count | Frequency (%) |
0 | 3 | |
2470 | 1 | 1.0% |
28646 | 1 | 1.0% |
29873 | 1 | 1.0% |
33509 | 1 | 1.0% |
36196 | 1 | 1.0% |
50614 | 1 | 1.0% |
60211 | 1 | 1.0% |
73914 | 1 | 1.0% |
79270 | 1 | 1.0% |
Value | Count | Frequency (%) |
10091411 | 1 | |
9020465 | 1 | |
8773742 | 1 | |
7335830 | 1 | |
6892794 | 1 | |
6751140 | 1 | |
6613142 | 1 | |
5953590 | 1 | |
5922112 | 1 | |
5739670 | 1 |
PRV_CD | GENDER | AGE_CD | POP_CNT | SUM_LN_BAL | |
---|---|---|---|---|---|
PRV_CD | 1.000 | 0.000 | 0.000 | 0.139 | 0.000 |
GENDER | 0.000 | 1.000 | 0.000 | 0.782 | 0.515 |
AGE_CD | 0.000 | 0.000 | 1.000 | 0.060 | 0.000 |
POP_CNT | 0.139 | 0.782 | 0.060 | 1.000 | 0.883 |
SUM_LN_BAL | 0.000 | 0.515 | 0.000 | 0.883 | 1.000 |
PRV_CD | AGE_CD | POP_CNT | SUM_LN_BAL | GENDER | |
---|---|---|---|---|---|
PRV_CD | 1.000 | -0.105 | 0.315 | 0.214 | 0.000 |
AGE_CD | -0.105 | 1.000 | 0.236 | 0.310 | 0.000 |
POP_CNT | 0.315 | 0.236 | 1.000 | 0.901 | 0.591 |
SUM_LN_BAL | 0.214 | 0.310 | 0.901 | 1.000 | 0.497 |
GENDER | 0.000 | 0.000 | 0.591 | 0.497 | 1.000 |
BS_YR_MON | PRV_CD | GENDER | AGE_CD | POP_CNT | SUM_LN_BAL | |
---|---|---|---|---|---|---|
0 | 201912 | 11110 | 1 | 35 | 4 | 36196 |
1 | 201912 | 11110 | 1 | 40 | 6 | 623165 |
2 | 201912 | 11110 | 1 | 45 | 16 | 520401 |
3 | 201912 | 11110 | 1 | 50 | 19 | 818092 |
4 | 201912 | 11110 | 1 | 55 | 21 | 890756 |
5 | 201912 | 11110 | 1 | 60 | 44 | 2479834 |
6 | 201912 | 11110 | 1 | 65 | 31 | 1932908 |
7 | 201912 | 11110 | 1 | 70 | 26 | 1828442 |
8 | 201912 | 11110 | 1 | 71 | 16 | 789489 |
9 | 201912 | 11110 | 2 | 45 | 3 | 0 |
BS_YR_MON | PRV_CD | GENDER | AGE_CD | POP_CNT | SUM_LN_BAL | |
---|---|---|---|---|---|---|
89 | 201912 | 11230 | 2 | 55 | 11 | 629331 |
90 | 201912 | 11230 | 2 | 60 | 11 | 266834 |
91 | 201912 | 11230 | 2 | 65 | 9 | 150167 |
92 | 201912 | 11230 | 2 | 70 | 5 | 109524 |
93 | 201912 | 11230 | 2 | 71 | 3 | 318748 |
94 | 201912 | 11260 | 1 | 30 | 18 | 1567929 |
95 | 201912 | 11260 | 1 | 35 | 24 | 818006 |
96 | 201912 | 11260 | 1 | 40 | 67 | 3860045 |
97 | 201912 | 11260 | 1 | 45 | 88 | 5085995 |
98 | 201912 | 11260 | 1 | 50 | 159 | 6751140 |