Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 186 |
Missing cells | 373 |
Missing cells (%) | 22.3% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 13.9 KiB |
Average record size in memory | 76.7 B |
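The missing-cell percentage above can be checked by hand. The sketch below assumes the denominator is simply variables × observations; the function name is illustrative and not part of any library.

```python
def missing_cell_pct(missing_cells: int, n_variables: int, n_observations: int) -> float:
    """Share of empty cells in a table with n_variables * n_observations cells."""
    total_cells = n_variables * n_observations
    return 100 * missing_cells / total_cells


# Figures copied from the "Dataset statistics" table.
pct = missing_cell_pct(missing_cells=373, n_variables=9, n_observations=186)
print(f"{pct:.1f}%")  # 22.3%
```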
Variable types
Numeric | 2 |
---|---|
Categorical | 3 |
DateTime | 3 |
Boolean | 1 |
Dataset
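Description | Public data on the operations of the Housing Pension Department of the Korea Housing Finance Corporation (raw data cleared for release from the databases that support the department's work) |
---|---|
Author | Korea Housing Finance Corporation (한국주택금융공사) |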
URL | https://www.data.go.kr/data/15073084/fileData.do |
CNSL_DY is highly overall correlated with REQ_DY and 3 other fields | High correlation |
PRS_DVCD is highly overall correlated with REQ_DY and 1 other field | High correlation |
REQ_DY is highly overall correlated with PRS_DVCD and 1 other field | High correlation |
CNSL_HOPE_BNK_CD is highly overall correlated with CNSL_DY | High correlation |
CTRL_BRCD is highly overall correlated with CNSL_DY | High correlation |
PRS_DVCD is highly imbalanced (65.5%) | Imbalance |
CNSL_DY is highly imbalanced (78.0%) | Imbalance |
PRS_TS has 168 (90.3%) missing values | Missing |
FIN_CNSL_YN has 61 (32.8%) missing values | Missing |
CNSL_HOPE_BNK_CD has 144 (77.4%) missing values | Missing |
REQ_TS has unique values | Unique |
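The two imbalance scores in the alerts can be reproduced from the category counts listed in the PRS_DVCD and CNSL_DY sections. The sketch below assumes the score is one minus the Shannon entropy normalised by log2 of the number of categories. That assumption matches both reported figures, but it is not taken from the report itself, and the function name is illustrative.

```python
import math


def imbalance(counts: list[int]) -> float:
    """1 - H / log2(k): 0 for perfectly balanced classes, close to 1 when one class dominates."""
    if len(counts) < 2:
        return 0.0  # a single category has no balance to measure
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1 - entropy / math.log2(len(counts))


print(f"{imbalance([174, 12]):.1%}")       # PRS_DVCD: 65.5%
print(f"{imbalance([174, 6, 4, 2]):.1%}")  # CNSL_DY: 78.0%
```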
Reproduction
Analysis started | 2023-12-12 07:04:47.709831 |
---|---|
Analysis finished | 2023-12-12 07:04:49.076475 |
Duration | 1.37 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
REQ_DY
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 145 |
---|---|
Distinct (%) | 78.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20172228 |
Minimum | 20121021 |
---|---|
Maximum | 20201025 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.8 KiB |
Quantile statistics
Minimum | 20121021 |
---|---|
5-th percentile | 20141147 |
Q1 | 20160477 |
median | 20170166 |
Q3 | 20190918 |
95-th percentile | 20201021 |
Maximum | 20201025 |
Range | 80004 |
Interquartile range (IQR) | 30440.75 |
Descriptive statistics
Standard deviation | 19878.089 |
---|---|
Coefficient of variation (CV) | 0.00098541858 |
Kurtosis | -0.42484298 |
Mean | 20172228 |
Median Absolute Deviation (MAD) | 10054 |
Skewness | -0.16124294 |
Sum | 3.7520345 × 10^9 |
Variance | 3.951384 × 10^8 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20201021 | 7 | 3.8% |
20160425 | 5 | 2.7% |
20201022 | 5 | 2.7% |
20200106 | 3 | 1.6% |
20170126 | 3 | 1.6% |
20190531 | 3 | 1.6% |
20160321 | 2 | 1.1% |
20121206 | 2 | 1.1% |
20121122 | 2 | 1.1% |
20160421 | 2 | 1.1% |
Other values (135) | 152 | 81.7% |
Value | Count | Frequency (%) |
20121021 | 1 | 0.5% |
20121122 | 2 | 1.1% |
20121206 | 2 | 1.1% |
20130506 | 1 | 0.5% |
20130708 | 1 | 0.5% |
20141119 | 1 | 0.5% |
20141121 | 1 | 0.5% |
20141128 | 1 | 0.5% |
20141205 | 1 | 0.5% |
20141210 | 1 | 0.5% |
Value | Count | Frequency (%) |
20201025 | 1 | 0.5% |
20201023 | 2 | 1.1% |
20201022 | 5 | 2.7% |
20201021 | 7 | 3.8% |
20201004 | 1 | 0.5% |
20200928 | 1 | 0.5% |
20200925 | 1 | 0.5% |
20200426 | 1 | 0.5% |
20200410 | 1 | 0.5% |
20200318 | 1 | 0.5% |
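REQ_DY is profiled as a real number, but its values look like dates written as YYYYMMDD integers. If that reading is right, the numeric statistics above (mean, range, quantiles) are not calendar quantities. The sketch below uses only the standard library; the helper names are illustrative. It shows the gap between the reported numeric range and the actual number of days, and that interpolated quantiles such as 20141147 are not valid dates.

```python
from datetime import date, datetime


def parse_yyyymmdd(value: int) -> date:
    """Convert an integer such as 20121021 into a calendar date."""
    return datetime.strptime(str(value), "%Y%m%d").date()


def is_valid_yyyymmdd(value: int) -> bool:
    """True when the integer encodes a real calendar date."""
    try:
        parse_yyyymmdd(value)
        return True
    except ValueError:
        return False


req_min = parse_yyyymmdd(20121021)  # reported Minimum
req_max = parse_yyyymmdd(20201025)  # reported Maximum

numeric_range = 20201025 - 20121021      # what the report lists as "Range"
calendar_span = (req_max - req_min).days  # elapsed days between the two dates

print(numeric_range)                # 80004
print(calendar_span)                # 2926
print(is_valid_yyyymmdd(20141147))  # False: the 5-th percentile is not a date
```

Converting the column to a date type before profiling would give date-based minimum and maximum values in place of these integer statistics.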
CTRL_BRCD
Categorical
HIGH CORRELATION
 
Distinct | 22 |
---|---|
Distinct (%) | 11.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.6 KiB |
TLB | 46 |
---|---|
TBA | 43 |
TPA | 15 |
XXX | 10 |
ABN | 10 |
Other values (17) | 62 |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 3 |
---|---|
Unique (%) | 1.6% |
Sample
1st row | TRA |
---|---|
2nd row | QAD |
3rd row | TLB |
4th row | TRA |
5th row | TAB |
Common Values
Value | Count | Frequency (%) |
TLB | 46 | 24.7% |
TBA | 43 | 23.1% |
TPA | 15 | 8.1% |
XXX | 10 | 5.4% |
ABN | 10 | 5.4% |
TAB | 9 | 4.8% |
THA | 7 | 3.8% |
TRA | 5 | 2.7% |
TAC | 5 | 2.7% |
THB | 5 | 2.7% |
Other values (12) | 31 | 16.7% |
Words
Value | Count | Frequency (%) |
tlb | 46 | 24.7% |
tba | 43 | 23.1% |
tpa | 15 | 8.1% |
xxx | 10 | 5.4% |
abn | 10 | 5.4% |
tab | 9 | 4.8% |
tha | 7 | 3.8% |
tra | 5 | 2.7% |
tac | 5 | 2.7% |
thb | 5 | 2.7% |
Other values (12) | 31 | 16.7% |
PRS_DVCD
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 1.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.6 KiB |
1 | 174 |
---|---|
2 | 12 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 174 | 93.5% |
2 | 12 | 6.5% |
Words
Value | Count | Frequency (%) |
1 | 174 | 93.5% |
2 | 12 | 6.5% |
CNSL_DY
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | 2.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.6 KiB |
<NA> | 174 |
---|---|
20201022 | 6 |
20201023 | 4 |
20201026 | 2 |
Length
Max length | 8 |
---|---|
Median length | 4 |
Mean length | 4.2580645 |
Min length | 4 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 174 | 93.5% |
20201022 | 6 | 3.2% |
20201023 | 4 | 2.2% |
20201026 | 2 | 1.1% |
Words
Value | Count | Frequency (%) |
na | 174 | 93.5% |
20201022 | 6 | 3.2% |
20201023 | 4 | 2.2% |
20201026 | 2 | 1.1% |
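CNSL_DY reports 0 missing values, yet 174 of its 186 entries are the literal text `<NA>`. This suggests the placeholder was read as an ordinary category. The sketch below assumes `<NA>` is the only placeholder spelling in the source file, and the helper name is illustrative. It maps the placeholder to `None` and reproduces the reported mean length from the listed counts.

```python
from typing import Optional

NA_TOKEN = "<NA>"  # assumed placeholder spelling, taken from the sample rows


def clean_cnsl_dy(raw: str) -> Optional[str]:
    """Return None for the placeholder, otherwise the trimmed value."""
    token = raw.strip()
    return None if token == NA_TOKEN else token


# Category counts copied from the "Common Values" table.
counts = {"<NA>": 174, "20201022": 6, "20201023": 4, "20201026": 2}

n_rows = sum(counts.values())
mean_length = sum(len(value) * n for value, n in counts.items()) / n_rows
truly_missing = sum(n for value, n in counts.items() if clean_cnsl_dy(value) is None)

print(round(mean_length, 7))  # 4.2580645, the reported mean length
print(truly_missing)          # 174 rows that would count as missing after cleaning
```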
REQ_TS
Date
UNIQUE
 
Distinct | 186 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.6 KiB |
Minimum | 2012-10-21 16:02:49 |
---|---|
Maximum | 2020-10-25 12:44:27 |
REG_TS
Date
Distinct | 144 |
---|---|
Distinct (%) | 77.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.6 KiB |
Minimum | 2012-10-22 02:31:32 |
---|---|
Maximum | 2020-10-26 02:32:24 |
PRS_TS
Date
MISSING
 
Distinct | 18 |
---|---|
Distinct (%) | 100.0% |
Missing | 168 |
Missing (%) | 90.3% |
Memory size | 1.6 KiB |
Minimum | 2013-05-10 14:47:38 |
---|---|
Maximum | 2020-10-26 09:28:15 |
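PRS_TS shows 18 distinct values as 100.0% although the dataset has 186 rows. The figures are consistent with the percentage being taken over non-missing values only. The sketch below works under that assumption, with an illustrative helper name, and checks it against several variables in this report.

```python
def distinct_pct(distinct: int, n_observations: int, missing: int) -> float:
    """Distinct values as a percentage of the non-missing observations."""
    present = n_observations - missing
    return round(100 * distinct / present, 1)


print(distinct_pct(18, 186, 168))  # PRS_TS: 100.0
print(distinct_pct(22, 186, 144))  # CNSL_HOPE_BNK_CD: 52.4
print(distinct_pct(2, 186, 61))    # FIN_CNSL_YN: 1.6
print(distinct_pct(145, 186, 0))   # REQ_DY: 78.0
```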
FIN_CNSL_YN
Boolean
MISSING
 
Distinct | 2 |
---|---|
Distinct (%) | 1.6% |
Missing | 61 |
Missing (%) | 32.8% |
Memory size | 504.0 B |
True | 65 |
---|---|
False | 60 |
(Missing) | 61 |
Value | Count | Frequency (%) |
True | 65 | 34.9% |
False | 60 | 32.3% |
(Missing) | 61 | 32.8% |
CNSL_HOPE_BNK_CD
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 22 |
---|---|
Distinct (%) | 52.4% |
Missing | 144 |
Missing (%) | 77.4% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 176926.4 |
Minimum | 34571 |
---|---|
Maximum | 816760 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.8 KiB |
Quantile statistics
Minimum | 34571 |
---|---|
5-th percentile | 41713 |
Q1 | 103265 |
median | 114679 |
Q3 | 204602 |
95-th percentile | 320213.35 |
Maximum | 816760 |
Range | 782189 |
Interquartile range (IQR) | 101337 |
Descriptive statistics
Standard deviation | 169856.06 |
---|---|
Coefficient of variation (CV) | 0.96003796 |
Kurtosis | 8.3201124 |
Mean | 176926.4 |
Median Absolute Deviation (MAD) | 54362 |
Skewness | 2.7060798 |
Sum | 7430909 |
Variance | 2.8851082 × 10^10 |
Monotonicity | Not monotonic |
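The mean and coefficient of variation above follow from the reported sum, standard deviation, and number of non-missing rows. The sketch below is a consistency check only; the variable names are illustrative.

```python
n_present = 186 - 144  # observations minus missing values
total = 7430909        # reported Sum
std = 169856.06        # reported standard deviation

mean = total / n_present  # arithmetic mean over non-missing rows
cv = std / mean           # coefficient of variation

print(f"{mean:.1f}")  # 176926.4
print(f"{cv:.5f}")    # 0.96004
```

Because CNSL_HOPE_BNK_CD appears to be a bank code rather than a measurement, these statistics describe the code numbers, not a quantity.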
Value | Count | Frequency (%) |
114679 | 11 | 5.9% |
111436 | 3 | 1.6% |
310020 | 3 | 1.6% |
201647 | 2 | 1.1% |
100719 | 2 | 1.1% |
41713 | 2 | 1.1% |
320201 | 2 | 1.1% |
204602 | 2 | 1.1% |
816760 | 2 | 1.1% |
111135 | 1 | 0.5% |
Other values (12) | 12 | 6.5% |
(Missing) | 144 | 77.4% |
Value | Count | Frequency (%) |
34571 | 1 | 0.5% |
41030 | 1 | 0.5% |
41713 | 2 | 1.1% |
46310 | 1 | 0.5% |
48046 | 1 | 0.5% |
55165 | 1 | 0.5% |
65469 | 1 | 0.5% |
66691 | 1 | 0.5% |
100719 | 2 | 1.1% |
110903 | 1 | 0.5% |
Value | Count | Frequency (%) |
816760 | 2 | 1.1% |
320214 | 1 | 0.5% |
320201 | 2 | 1.1% |
310062 | 1 | 0.5% |
310020 | 3 | 1.6% |
208527 | 1 | 0.5% |
204602 | 2 | 1.1% |
201647 | 2 | 1.1% |
115665 | 1 | 0.5% |
114679 | 11 | 5.9% |
 | REQ_DY | CTRL_BRCD | PRS_DVCD | CNSL_DY | PRS_TS | FIN_CNSL_YN | CNSL_HOPE_BNK_CD |
---|---|---|---|---|---|---|---|
REQ_DY | 1.000 | 0.830 | 0.699 | NaN | 1.000 | 0.439 | 0.439 |
CTRL_BRCD | 0.830 | 1.000 | 0.568 | 1.000 | 1.000 | 0.231 | 0.588 |
PRS_DVCD | 0.699 | 0.568 | 1.000 | NaN | 1.000 | 0.000 | 0.000 |
CNSL_DY | NaN | 1.000 | NaN | 1.000 | 1.000 | 0.000 | NaN |
PRS_TS | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
FIN_CNSL_YN | 0.439 | 0.231 | 0.000 | 0.000 | 1.000 | 1.000 | 0.140 |
CNSL_HOPE_BNK_CD | 0.439 | 0.588 | 0.000 | NaN | 1.000 | 0.140 | 1.000 |
 | CTRL_BRCD | CNSL_DY | PRS_DVCD | FIN_CNSL_YN |
---|---|---|---|---|
CTRL_BRCD | 1.000 | 0.577 | 0.428 | 0.167 |
CNSL_DY | 0.577 | 1.000 | 1.000 | 0.000 |
PRS_DVCD | 0.428 | 1.000 | 1.000 | 0.000 |
FIN_CNSL_YN | 0.167 | 0.000 | 0.000 | 1.000 |
 | REQ_DY | CNSL_HOPE_BNK_CD | CTRL_BRCD | PRS_DVCD | CNSL_DY | FIN_CNSL_YN |
---|---|---|---|---|---|---|
REQ_DY | 1.000 | -0.141 | 0.479 | 0.534 | 1.000 | 0.311 |
CNSL_HOPE_BNK_CD | -0.141 | 1.000 | 0.401 | 0.000 | 1.000 | 0.168 |
CTRL_BRCD | 0.479 | 0.401 | 1.000 | 0.428 | 0.577 | 0.167 |
PRS_DVCD | 0.534 | 0.000 | 0.428 | 1.000 | 1.000 | 0.000 |
CNSL_DY | 1.000 | 1.000 | 0.577 | 1.000 | 1.000 | 0.000 |
FIN_CNSL_YN | 0.311 | 0.168 | 0.167 | 0.000 | 0.000 | 1.000 |
 | REQ_DY | CTRL_BRCD | PRS_DVCD | CNSL_DY | REQ_TS | REG_TS | PRS_TS | FIN_CNSL_YN | CNSL_HOPE_BNK_CD |
---|---|---|---|---|---|---|---|---|---|
0 | 20141128 | TRA | 1 | <NA> | 2014/11/28 19:28:28 | 2014/11/29 02:31:29 | <NA> | <NA> | <NA> |
1 | 20141210 | QAD | 1 | <NA> | 2014/12/10 14:48:46 | 2014/12/11 02:35:33 | <NA> | <NA> | <NA> |
2 | 20141121 | TLB | 1 | <NA> | 2014/11/21 11:23:37 | 2014/11/22 02:31:30 | <NA> | <NA> | <NA> |
3 | 20141119 | TRA | 1 | <NA> | 2014/11/19 09:24:49 | 2014/11/20 02:31:28 | <NA> | <NA> | <NA> |
4 | 20141205 | TAB | 1 | <NA> | 2014/12/05 09:48:07 | 2014/12/06 02:31:29 | <NA> | <NA> | <NA> |
5 | 20150223 | TAC | 1 | <NA> | 2015/02/23 17:36:33 | 2015/02/24 02:31:27 | <NA> | <NA> | <NA> |
6 | 20150206 | TAB | 1 | <NA> | 2015/02/06 12:58:00 | 2015/02/07 02:36:02 | <NA> | <NA> | <NA> |
7 | 20150109 | TRA | 1 | <NA> | 2015/01/09 12:42:09 | 2015/01/10 02:31:31 | <NA> | <NA> | <NA> |
8 | 20150313 | TBA | 1 | <NA> | 2015/03/13 10:16:12 | 2015/03/14 02:31:30 | <NA> | <NA> | <NA> |
9 | 20150516 | TPB | 1 | <NA> | 2015/05/16 09:58:30 | 2015/05/17 02:31:22 | <NA> | <NA> | <NA> |
 | REQ_DY | CTRL_BRCD | PRS_DVCD | CNSL_DY | REQ_TS | REG_TS | PRS_TS | FIN_CNSL_YN | CNSL_HOPE_BNK_CD |
---|---|---|---|---|---|---|---|---|---|
176 | 20190918 | TLB | 1 | <NA> | 2019/09/18 14:07:02 | 2019/09/19 02:32:27 | <NA> | Y | 114679 |
177 | 20190917 | TLB | 1 | <NA> | 2019/09/17 16:38:40 | 2019/09/18 02:32:28 | <NA> | Y | 114679 |
178 | 20190923 | TLB | 1 | <NA> | 2019/09/23 18:51:36 | 2019/09/24 02:32:42 | <NA> | N | <NA> |
179 | 20190920 | TLB | 1 | <NA> | 2019/09/20 14:27:38 | 2019/09/21 02:32:27 | <NA> | N | <NA> |
180 | 20191025 | TLB | 1 | <NA> | 2019/10/25 13:49:33 | 2019/10/26 02:32:19 | <NA> | N | <NA> |
181 | 20190927 | TLB | 1 | <NA> | 2019/09/27 09:47:38 | 2019/09/28 02:32:17 | <NA> | N | <NA> |
182 | 20191008 | TLB | 1 | <NA> | 2019/10/08 02:47:12 | 2019/10/08 02:48:51 | <NA> | N | <NA> |
183 | 20191009 | TLB | 1 | <NA> | 2019/10/09 13:42:08 | 2019/10/10 02:32:39 | <NA> | N | <NA> |
184 | 20191014 | TLB | 1 | <NA> | 2019/10/14 10:28:58 | 2019/10/15 02:32:29 | <NA> | Y | 114679 |
185 | 20191014 | TLB | 1 | <NA> | 2019/10/14 11:24:28 | 2019/10/15 02:32:29 | <NA> | N | <NA> |
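In the sample rows, REQ_DY matches the date part of REQ_TS, and REG_TS usually falls in the early hours of the following day. The sketch below checks both observations on the first three rows using the timestamp format shown in the table. The interpretation of REG_TS as a registration time is an assumption, and the helper name is illustrative.

```python
from datetime import datetime, timedelta

TS_FORMAT = "%Y/%m/%d %H:%M:%S"  # format used in the sample rows

# (REQ_DY, REQ_TS, REG_TS) copied from rows 0 to 2 of the sample.
rows = [
    (20141128, "2014/11/28 19:28:28", "2014/11/29 02:31:29"),
    (20141210, "2014/12/10 14:48:46", "2014/12/11 02:35:33"),
    (20141121, "2014/11/21 11:23:37", "2014/11/22 02:31:30"),
]


def registration_lag(req_ts: str, reg_ts: str) -> timedelta:
    """Time elapsed between REQ_TS and REG_TS."""
    return datetime.strptime(reg_ts, TS_FORMAT) - datetime.strptime(req_ts, TS_FORMAT)


lags = [registration_lag(req, reg) for _, req, reg in rows]
date_matches = [
    datetime.strptime(req, TS_FORMAT).strftime("%Y%m%d") == str(day)
    for day, req, _ in rows
]

for lag in lags:
    print(lag)            # first row prints 7:03:01
print(all(date_matches))  # True
```

If REQ_DY is always derivable from REQ_TS in this way, the high correlation alerts involving REQ_DY partly reflect redundant columns rather than independent signals.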