Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 1000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 79.2 KiB |
Average record size in memory | 81.1 B |
Variable types
Categorical | 5 |
---|---|
Numeric | 4 |
Dataset
Description | 내지역주택연금RAWDATA에 대한 데이터로, 주택지역도시구분코드, 고객번호, 연령구간, 감정금액 등의 항목을 제공합니다. |
---|---|
Author | 한국주택금융공사 |
URL | https://www.data.go.kr/data/15073023/fileData.do |
BASIS_DY has constant value "" | Constant |
AGE_SECTN has constant value "" | Constant |
JUDGE_AMT is highly overall correlated with PNSN_PAYFORM_CD and 1 other fields | High correlation |
PNSN_PAYFORM_CD is highly overall correlated with JUDGE_AMT | High correlation |
GUARNT_EXEC_AMT is highly overall correlated with JUDGE_AMT | High correlation |
PNSN_PROD_DVCD is highly imbalanced (50.1%) | Imbalance |
GUARNT_ISSUE_CNT is highly imbalanced (80.6%) | Imbalance |
GUARNT_EXEC_AMT is highly skewed (γ1 = 20.60116992) | Skewed |
CUST_NO has unique values | Unique |
GUARNT_EXEC_AMT has 13 (1.3%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-11 23:47:23.272990 |
---|---|
Analysis finished | 2023-12-11 23:47:25.944130 |
Duration | 2.67 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
BASIS_DY
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 7.9 KiB |
202006 |
---|
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 6 |
Min length | 6 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 202006 |
---|---|
2nd row | 202006 |
3rd row | 202006 |
4th row | 202006 |
5th row | 202006 |
Common Values
Value | Count | Frequency (%) |
202006 | 1000 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
202006 | 1000 |
HOUSE_LOC_CITY_DVCD
Categorical
Distinct | 3 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 7.9 KiB |
28 | |
---|---|
29 | |
30 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 30 |
---|---|
2nd row | 30 |
3rd row | 30 |
4th row | 30 |
5th row | 30 |
Common Values
Value | Count | Frequency (%) |
28 | 585 | |
29 | 247 | |
30 | 168 | 16.8% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
28 | 585 | |
29 | 247 | |
30 | 168 | 16.8% |
CUST_NO
Real number (ℝ)
UNIQUE
 
Distinct | 1000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.2132185 × 108 |
Minimum | 7986907 |
---|---|
Maximum | 1.42296 × 108 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 8.9 KiB |
Quantile statistics
Minimum | 7986907 |
---|---|
5-th percentile | 93169678 |
Q1 | 1.150409 × 108 |
median | 1.2333852 × 108 |
Q3 | 1.300265 × 108 |
95-th percentile | 1.408883 × 108 |
Maximum | 1.42296 × 108 |
Range | 1.3430909 × 108 |
Interquartile range (IQR) | 14985605 |
Descriptive statistics
Standard deviation | 16681368 |
---|---|
Coefficient of variation (CV) | 0.13749682 |
Kurtosis | 12.252093 |
Mean | 1.2132185 × 108 |
Median Absolute Deviation (MAD) | 7602420.5 |
Skewness | -2.6387022 |
Sum | 1.2132185 × 1011 |
Variance | 2.7826803 × 1014 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
142071404 | 1 | 0.1% |
125904996 | 1 | 0.1% |
126428248 | 1 | 0.1% |
126362896 | 1 | 0.1% |
126353643 | 1 | 0.1% |
126327763 | 1 | 0.1% |
126294443 | 1 | 0.1% |
126243122 | 1 | 0.1% |
126236870 | 1 | 0.1% |
126225124 | 1 | 0.1% |
Other values (990) | 990 |
Value | Count | Frequency (%) |
7986907 | 1 | |
10824027 | 1 | |
17564966 | 1 | |
17700940 | 1 | |
19911546 | 1 | |
23923038 | 1 | |
28611028 | 1 | |
29817818 | 1 | |
34981409 | 1 | |
45688678 | 1 |
Value | Count | Frequency (%) |
142296001 | 1 | |
142192299 | 1 | |
142188384 | 1 | |
142130196 | 1 | |
142106450 | 1 | |
142101594 | 1 | |
142074126 | 1 | |
142071404 | 1 | |
142039310 | 1 | |
142024590 | 1 |
AGE_SECTN
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 7.9 KiB |
65 |
---|
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 65 |
---|---|
2nd row | 65 |
3rd row | 65 |
4th row | 65 |
5th row | 65 |
Common Values
Value | Count | Frequency (%) |
65 | 1000 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
65 | 1000 |
JUDGE_AMT
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 535 |
---|---|
Distinct (%) | 53.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.0070178 × 108 |
Minimum | 20787000 |
---|---|
Maximum | 8.9561783 × 108 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 8.9 KiB |
Quantile statistics
Minimum | 20787000 |
---|---|
5-th percentile | 72000000 |
Q1 | 1.15 × 108 |
median | 1.615 × 108 |
Q3 | 2.4613986 × 108 |
95-th percentile | 4.6525 × 108 |
Maximum | 8.9561783 × 108 |
Range | 8.7483083 × 108 |
Interquartile range (IQR) | 1.3113986 × 108 |
Descriptive statistics
Standard deviation | 1.2986514 × 108 |
---|---|
Coefficient of variation (CV) | 0.64705526 |
Kurtosis | 5.3751776 |
Mean | 2.0070178 × 108 |
Median Absolute Deviation (MAD) | 58500000 |
Skewness | 2.0167282 |
Sum | 2.0070178 × 1011 |
Variance | 1.6864954 × 1016 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
125000000 | 17 | 1.7% |
150000000 | 16 | 1.6% |
110000000 | 15 | 1.5% |
120000000 | 15 | 1.5% |
135000000 | 15 | 1.5% |
140000000 | 13 | 1.3% |
100000000 | 13 | 1.3% |
160000000 | 13 | 1.3% |
130000000 | 11 | 1.1% |
115000000 | 11 | 1.1% |
Other values (525) | 861 |
Value | Count | Frequency (%) |
20787000 | 1 | |
37000000 | 1 | |
38376000 | 1 | |
40326900 | 1 | |
44000000 | 2 | |
46813200 | 1 | |
47500000 | 1 | |
49000000 | 1 | |
50000000 | 2 | |
50500000 | 1 |
Value | Count | Frequency (%) |
895617830 | 1 | |
894071630 | 1 | |
890983200 | 1 | |
831795720 | 1 | |
831070860 | 1 | |
805295000 | 1 | |
752500000 | 1 | |
731634000 | 1 | |
700000000 | 2 | |
699492000 | 1 |
PNSN_PAYFORM_CD
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 7 |
---|---|
Distinct (%) | 0.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.337 |
Minimum | 1 |
---|---|
Maximum | 8 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 8.9 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 1 |
Q3 | 7 |
95-th percentile | 8 |
Maximum | 8 |
Range | 7 |
Interquartile range (IQR) | 6 |
Descriptive statistics
Standard deviation | 2.8725742 |
---|---|
Coefficient of variation (CV) | 0.86082536 |
Kurtosis | -1.5156261 |
Mean | 3.337 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 0.6014012 |
Sum | 3337 |
Variance | 8.2516827 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 507 | |
7 | 231 | |
2 | 124 | 12.4% |
8 | 90 | 9.0% |
6 | 26 | 2.6% |
4 | 21 | 2.1% |
5 | 1 | 0.1% |
Value | Count | Frequency (%) |
1 | 507 | |
2 | 124 | 12.4% |
4 | 21 | 2.1% |
5 | 1 | 0.1% |
6 | 26 | 2.6% |
7 | 231 | |
8 | 90 | 9.0% |
Value | Count | Frequency (%) |
8 | 90 | 9.0% |
7 | 231 | |
6 | 26 | 2.6% |
5 | 1 | 0.1% |
4 | 21 | 2.1% |
2 | 124 | 12.4% |
1 | 507 |
PNSN_PROD_DVCD
Categorical
IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 7.9 KiB |
1 | |
---|---|
22 | |
21 | 5 |
Length
Max length | 2 |
---|---|
Median length | 1 |
Mean length | 1.218 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 782 | |
22 | 213 | 21.3% |
21 | 5 | 0.5% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1 | 782 | |
22 | 213 | 21.3% |
21 | 5 | 0.5% |
GUARNT_ISSUE_CNT
Categorical
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 7.9 KiB |
0 | |
---|---|
1 | 30 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 970 | |
1 | 30 | 3.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 970 | |
1 | 30 | 3.0% |
GUARNT_EXEC_AMT
Real number (ℝ)
HIGH CORRELATION
  SKEWED
  ZEROS
 
Distinct | 988 |
---|---|
Distinct (%) | 98.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1372617.4 |
Minimum | 0 |
---|---|
Maximum | 1.9732403 × 108 |
Zeros | 13 |
Zeros (%) | 1.3% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 8.9 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 243349.2 |
Q1 | 486196.25 |
median | 688554 |
Q3 | 1027116.5 |
95-th percentile | 2044521.4 |
Maximum | 1.9732403 × 108 |
Range | 1.9732403 × 108 |
Interquartile range (IQR) | 540920.25 |
Descriptive statistics
Standard deviation | 7638776.7 |
---|---|
Coefficient of variation (CV) | 5.5651174 |
Kurtosis | 481.59876 |
Mean | 1372617.4 |
Median Absolute Deviation (MAD) | 244927.5 |
Skewness | 20.60117 |
Sum | 1.3726174 × 109 |
Variance | 5.835091 × 1013 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 13 | 1.3% |
1229039 | 1 | 0.1% |
885049 | 1 | 0.1% |
580662 | 1 | 0.1% |
769342 | 1 | 0.1% |
774677 | 1 | 0.1% |
504835 | 1 | 0.1% |
1048581 | 1 | 0.1% |
695919 | 1 | 0.1% |
485627 | 1 | 0.1% |
Other values (978) | 978 |
Value | Count | Frequency (%) |
0 | 13 | |
27494 | 1 | 0.1% |
49508 | 1 | 0.1% |
95506 | 1 | 0.1% |
111438 | 1 | 0.1% |
113060 | 1 | 0.1% |
122550 | 1 | 0.1% |
129595 | 1 | 0.1% |
164158 | 1 | 0.1% |
168762 | 1 | 0.1% |
Value | Count | Frequency (%) |
197324028 | 1 | |
112929900 | 1 | |
43557540 | 1 | |
42973371 | 1 | |
39223485 | 1 | |
31951370 | 1 | |
18172410 | 1 | |
16509663 | 1 | |
15973763 | 1 | |
10517378 | 1 |
HOUSE_LOC_CITY_DVCD | CUST_NO | JUDGE_AMT | PNSN_PAYFORM_CD | PNSN_PROD_DVCD | GUARNT_ISSUE_CNT | GUARNT_EXEC_AMT | |
---|---|---|---|---|---|---|---|
HOUSE_LOC_CITY_DVCD | 1.000 | 0.396 | 0.453 | 0.316 | 0.351 | 0.017 | 0.092 |
CUST_NO | 0.396 | 1.000 | 0.000 | 0.416 | 0.303 | 0.335 | 0.185 |
JUDGE_AMT | 0.453 | 0.000 | 1.000 | 0.456 | 0.264 | 0.000 | 0.428 |
PNSN_PAYFORM_CD | 0.316 | 0.416 | 0.456 | 1.000 | 0.386 | 0.000 | 0.161 |
PNSN_PROD_DVCD | 0.351 | 0.303 | 0.264 | 0.386 | 1.000 | 0.000 | 0.000 |
GUARNT_ISSUE_CNT | 0.017 | 0.335 | 0.000 | 0.000 | 0.000 | 1.000 | 0.271 |
GUARNT_EXEC_AMT | 0.092 | 0.185 | 0.428 | 0.161 | 0.000 | 0.271 | 1.000 |
HOUSE_LOC_CITY_DVCD | PNSN_PROD_DVCD | GUARNT_ISSUE_CNT | |
---|---|---|---|
HOUSE_LOC_CITY_DVCD | 1.000 | 0.123 | 0.029 |
PNSN_PROD_DVCD | 0.123 | 1.000 | 0.000 |
GUARNT_ISSUE_CNT | 0.029 | 0.000 | 1.000 |
CUST_NO | JUDGE_AMT | PNSN_PAYFORM_CD | GUARNT_EXEC_AMT | HOUSE_LOC_CITY_DVCD | PNSN_PROD_DVCD | GUARNT_ISSUE_CNT | |
---|---|---|---|---|---|---|---|
CUST_NO | 1.000 | 0.017 | 0.042 | 0.030 | 0.262 | 0.189 | 0.256 |
JUDGE_AMT | 0.017 | 1.000 | -0.621 | 0.694 | 0.305 | 0.163 | 0.000 |
PNSN_PAYFORM_CD | 0.042 | -0.621 | 1.000 | -0.427 | 0.223 | 0.281 | 0.000 |
GUARNT_EXEC_AMT | 0.030 | 0.694 | -0.427 | 1.000 | 0.069 | 0.000 | 0.330 |
HOUSE_LOC_CITY_DVCD | 0.262 | 0.305 | 0.223 | 0.069 | 1.000 | 0.123 | 0.029 |
PNSN_PROD_DVCD | 0.189 | 0.163 | 0.281 | 0.000 | 0.123 | 1.000 | 0.000 |
GUARNT_ISSUE_CNT | 0.256 | 0.000 | 0.000 | 0.330 | 0.029 | 0.000 | 1.000 |
BASIS_DY | HOUSE_LOC_CITY_DVCD | CUST_NO | AGE_SECTN | JUDGE_AMT | PNSN_PAYFORM_CD | PNSN_PROD_DVCD | GUARNT_ISSUE_CNT | GUARNT_EXEC_AMT | |
---|---|---|---|---|---|---|---|---|---|
0 | 202006 | 30 | 142071404 | 65 | 50500000 | 7 | 1 | 1 | 1229039 |
1 | 202006 | 30 | 141930106 | 65 | 650000000 | 6 | 1 | 1 | 197324028 |
2 | 202006 | 30 | 141623332 | 65 | 115000000 | 8 | 1 | 1 | 18172410 |
3 | 202006 | 30 | 141213030 | 65 | 350000000 | 6 | 1 | 0 | 476668 |
4 | 202006 | 30 | 140235187 | 65 | 92500000 | 7 | 1 | 0 | 415549 |
5 | 202006 | 30 | 140164290 | 65 | 415000000 | 1 | 22 | 0 | 1402689 |
6 | 202006 | 30 | 140069052 | 65 | 275000000 | 1 | 22 | 0 | 772177 |
7 | 202006 | 30 | 138307645 | 65 | 700000000 | 1 | 1 | 0 | 1739284 |
8 | 202006 | 30 | 138039036 | 65 | 115000000 | 7 | 1 | 0 | 379262 |
9 | 202006 | 30 | 137778806 | 65 | 339922800 | 1 | 22 | 0 | 0 |
BASIS_DY | HOUSE_LOC_CITY_DVCD | CUST_NO | AGE_SECTN | JUDGE_AMT | PNSN_PAYFORM_CD | PNSN_PROD_DVCD | GUARNT_ISSUE_CNT | GUARNT_EXEC_AMT | |
---|---|---|---|---|---|---|---|---|---|
990 | 202006 | 28 | 111008385 | 65 | 94096900 | 8 | 1 | 0 | 454075 |
991 | 202006 | 28 | 111007865 | 65 | 71750000 | 8 | 1 | 0 | 453399 |
992 | 202006 | 28 | 110988370 | 65 | 153500000 | 1 | 1 | 0 | 991040 |
993 | 202006 | 28 | 110942772 | 65 | 107500000 | 7 | 1 | 0 | 405587 |
994 | 202006 | 28 | 110892958 | 65 | 277500000 | 4 | 1 | 0 | 1084847 |
995 | 202006 | 28 | 110892330 | 65 | 96000000 | 8 | 1 | 0 | 299727 |
996 | 202006 | 28 | 110891917 | 65 | 255000000 | 1 | 22 | 0 | 867374 |
997 | 202006 | 28 | 110748251 | 65 | 127000000 | 7 | 1 | 0 | 699968 |
998 | 202006 | 28 | 110729911 | 65 | 108134600 | 7 | 1 | 0 | 423328 |
999 | 202006 | 28 | 110729526 | 65 | 77000000 | 8 | 1 | 0 | 258434 |