Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 1000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 162 |
Duplicate rows (%) | 16.2% |
Total size in memory | 40.2 KiB |
Average record size in memory | 41.1 B |
Variable types
Categorical | 3 |
---|---|
Numeric | 1 |
Boolean | 1 |
Dataset
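Description | Public open data related to the work of the Securitized Assets Department of the Korea Housing Finance Corporation (source data that can be disclosed from the databases related to that department's work) |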
---|---|
Author | Korea Housing Finance Corporation (한국주택금융공사) |
URL | https://www.data.go.kr/data/15073299/fileData.do |
ARTRGT_NOTICE_YN has constant value "False" | Constant |
Dataset has 162 (16.2%) duplicate rows | Duplicates |
TREAT_ORG_CD is highly overall correlated with HOLD_CD | High correlation |
HOLD_CD is highly overall correlated with LOAN_TREAT_DY and 2 other fields | High correlation |
LIQD_PLAN_CD is highly overall correlated with LOAN_TREAT_DY and 1 other field | High correlation |
LOAN_TREAT_DY is highly overall correlated with LIQD_PLAN_CD and 1 other field | High correlation |
LIQD_PLAN_CD is highly imbalanced (71.4%) | Imbalance |
Reproduction
Analysis started | 2023-12-12 11:03:14.730752 |
---|---|
Analysis finished | 2023-12-12 11:03:15.400136 |
Duration | 0.67 seconds |
Software version | ydata-profiling v4.5.1 |
LIQD_PLAN_CD
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 21 |
---|---|
Distinct (%) | 2.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 7.9 KiB |
KHFCMB2020S-34 | 708 |
---|---|
KHFCMB2020S-33 | 243 |
KHFCMB2019S-08 | 7 |
KHFCMB2019S-03 | 6 |
KHFCMB2018S-30 | 5 |
Other values (16) | 31 |
Length
Max length | 14 |
---|---|
Median length | 14 |
Mean length | 14 |
Min length | 14 |
Unique
Unique | 7 |
---|---|
Unique (%) | 0.7% |
Sample
1st row | KHFCMB2020S-34 |
---|---|
2nd row | KHFCMB2020S-34 |
3rd row | KHFCMB2020S-34 |
4th row | KHFCMB2020S-34 |
5th row | KHFCMB2020S-34 |
Common Values
Value | Count | Frequency (%) |
KHFCMB2020S-34 | 708 | 70.8% |
KHFCMB2020S-33 | 243 | 24.3% |
KHFCMB2019S-08 | 7 | 0.7% |
KHFCMB2019S-03 | 6 | 0.6% |
KHFCMB2018S-30 | 5 | 0.5% |
KHFCMB2019S-05 | 4 | 0.4% |
KHFCMB2019S-24 | 4 | 0.4% |
KHFCMB2019S-12 | 3 | 0.3% |
KHFCMB2019S-13 | 3 | 0.3% |
KHFCMB2019S-19 | 2 | 0.2% |
Other values (11) | 15 | 1.5% |
Length
Value | Count | Frequency (%) |
khfcmb2020s-34 | 708 | 70.8% |
khfcmb2020s-33 | 243 | 24.3% |
khfcmb2019s-08 | 7 | 0.7% |
khfcmb2019s-03 | 6 | 0.6% |
khfcmb2018s-30 | 5 | 0.5% |
khfcmb2019s-05 | 4 | 0.4% |
khfcmb2019s-24 | 4 | 0.4% |
khfcmb2019s-12 | 3 | 0.3% |
khfcmb2019s-13 | 3 | 0.3% |
khfcmb2019s-07 | 2 | 0.2% |
Other values (11) | 15 | 1.5% |
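The 71.4% imbalance reported for LIQD_PLAN_CD is consistent with a normalized-entropy score of the form 1 − H / log(k), where H is the entropy of the category frequencies and k is the number of distinct values. The sketch below is an assumption about how such a score can be computed, not a statement of how the profiling tool implements it. The ten largest counts are taken from the Common Values table; the split of the remaining 11 values into four values with count 2 and seven values with count 1 is inferred from "Unique | 7" and "Other values (11) | 15" and is not listed explicitly in the report.

```python
import math


def imbalance_score(counts):
    """Return 1 - H/log(k): 0 for a uniform distribution, close to 1 when one class dominates."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # a single class has no meaningful imbalance score
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log(k)


# Ten largest counts, copied from the Common Values table for LIQD_PLAN_CD.
top_ten = [708, 243, 7, 6, 5, 4, 4, 3, 3, 2]
# Inferred (assumption): 11 remaining values summing to 15, seven of them unique.
inferred_rest = [2] * 4 + [1] * 7

counts = top_ten + inferred_rest
score = imbalance_score(counts)
print(f"{len(counts)} distinct values, {sum(counts)} rows, imbalance {score:.1%}")
```

With these counts the score comes out at roughly 71.4%, which matches the alert. A perfectly uniform column would score 0.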
HOLD_CD
Categorical
HIGH CORRELATION
 
Distinct | 44 |
---|---|
Distinct (%) | 4.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 7.9 KiB |
B004-2020-0099 | 177 |
---|---|
B081-2020-0100 | 145 |
B081-2020-0101 | 115 |
B088-2020-0105 | 113 |
B10-2020-0081 | 91 |
Other values (39) | 359 |
Length
Max length | 14 |
---|---|
Median length | 14 |
Mean length | 13.909 |
Min length | 13 |
Unique
Unique | 17 |
---|---|
Unique (%) | 1.7% |
Sample
1st row | B081-2020-0101 |
---|---|
2nd row | B081-2020-0101 |
3rd row | B081-2020-0101 |
4th row | B081-2020-0101 |
5th row | B081-2020-0101 |
Common Values
Value | Count | Frequency (%) |
B004-2020-0099 | 177 | 17.7% |
B081-2020-0100 | 145 | 14.5% |
B081-2020-0101 | 115 | 11.5% |
B088-2020-0105 | 113 | 11.3% |
B10-2020-0081 | 91 | 9.1% |
B023-2020-0037 | 62 | 6.2% |
B004-2020-0098 | 60 | 6.0% |
B003-2020-0078 | 54 | 5.4% |
B020-2020-0101 | 28 | 2.8% |
B003-2020-0077 | 22 | 2.2% |
Other values (34) | 133 | 13.3% |
Length
Value | Count | Frequency (%) |
b004-2020-0099 | 177 | 17.7% |
b081-2020-0100 | 145 | 14.5% |
b081-2020-0101 | 115 | 11.5% |
b088-2020-0105 | 113 | 11.3% |
b10-2020-0081 | 91 | 9.1% |
b023-2020-0037 | 62 | 6.2% |
b004-2020-0098 | 60 | 6.0% |
b003-2020-0078 | 54 | 5.4% |
b020-2020-0101 | 28 | 2.8% |
b003-2020-0077 | 22 | 2.2% |
Other values (34) | 133 | 13.3% |
TREAT_ORG_CD
Categorical
HIGH CORRELATION
 
Distinct | 10 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 7.9 KiB |
B081 | 299 |
---|---|
B004 | 238 |
B088 | 123 |
B003 | 95 |
B010 | 91 |
Other values (5) | 154 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 1 |
---|---|
Unique (%) | 0.1% |
Sample
1st row | B081 |
---|---|
2nd row | B081 |
3rd row | B081 |
4th row | B081 |
5th row | B081 |
Common Values
Value | Count | Frequency (%) |
B081 | 299 | 29.9% |
B004 | 238 | 23.8% |
B088 | 123 | 12.3% |
B003 | 95 | 9.5% |
B010 | 91 | 9.1% |
B023 | 66 | 6.6% |
B020 | 56 | 5.6% |
B031 | 21 | 2.1% |
B039 | 10 | 1.0% |
B007 | 1 | 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
b081 | 299 | 29.9% |
b004 | 238 | 23.8% |
b088 | 123 | 12.3% |
b003 | 95 | 9.5% |
b010 | 91 | 9.1% |
b023 | 66 | 6.6% |
b020 | 56 | 5.6% |
b031 | 21 | 2.1% |
b039 | 10 | 1.0% |
b007 | 1 | 0.1% |
LOAN_TREAT_DY
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 182 |
---|---|
Distinct (%) | 18.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20199127 |
Minimum | 20150327 |
---|---|
Maximum | 20200907 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 8.9 KiB |
Quantile statistics
Minimum | 20150327 |
---|---|
5-th percentile | 20190621 |
Q1 | 20200331 |
median | 20200518 |
Q3 | 20200529 |
95-th percentile | 20200807 |
Maximum | 20200907 |
Range | 50580 |
Interquartile range (IQR) | 198 |
Descriptive statistics
Standard deviation | 6397.0291 |
---|---|
Coefficient of variation (CV) | 0.00031669829 |
Kurtosis | 39.37088 |
Mean | 20199127 |
Median Absolute Deviation (MAD) | 83 |
Skewness | -5.9652355 |
Sum | 2.0199127 × 10^10 |
Variance | 40921982 |
Monotonicity | Not monotonic |
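The LOAN_TREAT_DY values look like calendar dates written as YYYYMMDD integers (an assumption based on values such as 20200529; the report itself types the column as a real number). If that reading is right, statistics such as Range and IQR above are differences between integers, not spans of days. A minimal sketch of the distinction, using only the minimum, Q1, Q3 and maximum from the quantile table; `parse_yyyymmdd` is a hypothetical helper name:

```python
from datetime import date, datetime


def parse_yyyymmdd(value: int) -> date:
    """Interpret an integer such as 20200529 as the date 2020-05-29."""
    return datetime.strptime(str(value), "%Y%m%d").date()


# Values copied from the quantile statistics table.
minimum, q1, q3, maximum = 20150327, 20200331, 20200529, 20200907

# Differences as reported: plain integer subtraction.
numeric_range = maximum - minimum
numeric_iqr = q3 - q1

# The same differences measured in calendar days.
calendar_range = (parse_yyyymmdd(maximum) - parse_yyyymmdd(minimum)).days
calendar_iqr = (parse_yyyymmdd(q3) - parse_yyyymmdd(q1)).days

print(f"range: {numeric_range} as integers, {calendar_range} days as dates")
print(f"IQR:   {numeric_iqr} as integers, {calendar_iqr} days as dates")
```

Under this assumption the integer range of 50580 corresponds to 1991 days, and the integer IQR of 198 to 59 days. Mean, variance, skewness and the correlation values are affected in the same way, so converting the column to a date type before profiling may give more interpretable statistics.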
Value | Count | Frequency (%) |
20200529 | 95 | 9.5% |
20200515 | 51 | 5.1% |
20200520 | 46 | 4.6% |
20200508 | 35 | 3.5% |
20200522 | 35 | 3.5% |
20200511 | 26 | 2.6% |
20200306 | 25 | 2.5% |
20200518 | 25 | 2.5% |
20200331 | 24 | 2.4% |
20200528 | 23 | 2.3% |
Other values (172) | 615 | 61.5% |
Value | Count | Frequency (%) |
20150327 | 1 | 0.1% |
20150407 | 1 | 0.1% |
20150410 | 1 | 0.1% |
20150414 | 1 | 0.1% |
20150415 | 1 | 0.1% |
20150417 | 2 | 0.2% |
20150420 | 1 | 0.1% |
20150422 | 1 | 0.1% |
20150427 | 1 | 0.1% |
20150429 | 1 | 0.1% |
Value | Count | Frequency (%) |
20200907 | 1 | 0.1% |
20200904 | 3 | 0.3% |
20200903 | 1 | 0.1% |
20200902 | 1 | 0.1% |
20200901 | 3 | 0.3% |
20200831 | 5 | 0.5% |
20200828 | 4 | 0.4% |
20200827 | 5 | 0.5% |
20200826 | 2 | 0.2% |
20200825 | 2 | 0.2% |
ARTRGT_NOTICE_YN
Boolean
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.1 KiB |
False |
---|
Value | Count | Frequency (%) |
False | 1000 | 100.0% |
| | LIQD_PLAN_CD | HOLD_CD | TREAT_ORG_CD | LOAN_TREAT_DY |
---|---|---|---|---|
LIQD_PLAN_CD | 1.000 | 1.000 | 0.742 | 0.911 |
HOLD_CD | 1.000 | 1.000 | 1.000 | 0.982 |
TREAT_ORG_CD | 0.742 | 1.000 | 1.000 | 0.485 |
LOAN_TREAT_DY | 0.911 | 0.982 | 0.485 | 1.000 |
| | TREAT_ORG_CD | HOLD_CD | LIQD_PLAN_CD |
---|---|---|---|
TREAT_ORG_CD | 1.000 | 0.983 | 0.382 |
HOLD_CD | 0.983 | 1.000 | 0.988 |
LIQD_PLAN_CD | 0.382 | 0.988 | 1.000 |
| | LOAN_TREAT_DY | LIQD_PLAN_CD | HOLD_CD | TREAT_ORG_CD |
---|---|---|---|---|
LOAN_TREAT_DY | 1.000 | 0.741 | 0.885 | 0.219 |
LIQD_PLAN_CD | 0.741 | 1.000 | 0.988 | 0.382 |
HOLD_CD | 0.885 | 0.988 | 1.000 | 0.983 |
TREAT_ORG_CD | 0.219 | 0.382 | 0.983 | 1.000 |
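One plausible reason for the near-perfect association between HOLD_CD and TREAT_ORG_CD is that each HOLD_CD value starts with what looks like the organisation code (for example B081-2020-0100 and B081), so knowing HOLD_CD would fix TREAT_ORG_CD. This is an inference from the listed values, not something the report states. The sketch below checks that kind of functional dependency on a few (HOLD_CD, TREAT_ORG_CD) pairs that appear in the report's row listings; the function names are illustrative.

```python
from collections import defaultdict

# (HOLD_CD, TREAT_ORG_CD) pairs that appear in the report's row listings.
pairs = [
    ("B081-2020-0100", "B081"),
    ("B081-2020-0101", "B081"),
    ("B088-2020-0105", "B088"),
    ("B004-2020-0099", "B004"),
    ("B003-2020-0077", "B003"),
]


def determines(pairs):
    """True if every left-hand value maps to exactly one right-hand value."""
    seen = defaultdict(set)
    for left, right in pairs:
        seen[left].add(right)
    return all(len(rights) == 1 for rights in seen.values())


def prefix_matches(hold_cd, treat_org_cd):
    """True if the part of HOLD_CD before the first hyphen equals TREAT_ORG_CD."""
    return hold_cd.split("-", 1)[0] == treat_org_cd


hold_determines_org = determines(pairs)
all_prefixes_match = all(prefix_matches(h, o) for h, o in pairs)
print(hold_determines_org, all_prefixes_match)
```

A value such as B10-2020-0081 would not match a four-character code like B010 under this naive prefix rule. Whether those rows actually pair up is not shown in the report, so the prefix check should be treated as a heuristic.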
| | LIQD_PLAN_CD | HOLD_CD | TREAT_ORG_CD | LOAN_TREAT_DY | ARTRGT_NOTICE_YN |
---|---|---|---|---|---|
0 | KHFCMB2020S-34 | B081-2020-0101 | B081 | 20200512 | N |
1 | KHFCMB2020S-34 | B081-2020-0101 | B081 | 20200318 | N |
2 | KHFCMB2020S-34 | B081-2020-0101 | B081 | 20200306 | N |
3 | KHFCMB2020S-34 | B081-2020-0101 | B081 | 20200306 | N |
4 | KHFCMB2020S-34 | B081-2020-0101 | B081 | 20200520 | N |
5 | KHFCMB2020S-34 | B081-2020-0101 | B081 | 20200520 | N |
6 | KHFCMB2020S-34 | B081-2020-0101 | B081 | 20200521 | N |
7 | KHFCMB2020S-34 | B081-2020-0101 | B081 | 20200520 | N |
8 | KHFCMB2020S-34 | B081-2020-0101 | B081 | 20200515 | N |
9 | KHFCMB2020S-34 | B081-2020-0101 | B081 | 20200221 | N |
| | LIQD_PLAN_CD | HOLD_CD | TREAT_ORG_CD | LOAN_TREAT_DY | ARTRGT_NOTICE_YN |
---|---|---|---|---|---|
990 | KHFCMB2020S-33 | B088-2020-0105 | B088 | 20200529 | N |
991 | KHFCMB2020S-33 | B088-2020-0105 | B088 | 20200529 | N |
992 | KHFCMB2020S-33 | B088-2020-0105 | B088 | 20200304 | N |
993 | KHFCMB2020S-33 | B088-2020-0105 | B088 | 20200407 | N |
994 | KHFCMB2020S-33 | B088-2020-0105 | B088 | 20200525 | N |
995 | KHFCMB2020S-33 | B088-2020-0105 | B088 | 20200601 | N |
996 | KHFCMB2020S-33 | B088-2020-0105 | B088 | 20200529 | N |
997 | KHFCMB2020S-33 | B088-2020-0105 | B088 | 20200302 | N |
998 | KHFCMB2020S-33 | B088-2020-0105 | B088 | 20200601 | N |
999 | KHFCMB2020S-33 | B088-2020-0105 | B088 | 20200604 | N |
Most frequently occurring
| | LIQD_PLAN_CD | HOLD_CD | TREAT_ORG_CD | LOAN_TREAT_DY | ARTRGT_NOTICE_YN | # duplicates |
---|---|---|---|---|---|---|
117 | KHFCMB2020S-34 | B081-2020-0100 | B081 | 20200529 | N | 38 |
37 | KHFCMB2020S-33 | B088-2020-0105 | B088 | 20200529 | N | 29 |
82 | KHFCMB2020S-34 | B004-2020-0099 | B004 | 20200508 | N | 25 |
87 | KHFCMB2020S-34 | B004-2020-0099 | B004 | 20200515 | N | 21 |
83 | KHFCMB2020S-34 | B004-2020-0099 | B004 | 20200511 | N | 16 |
32 | KHFCMB2020S-33 | B088-2020-0105 | B088 | 20200522 | N | 14 |
79 | KHFCMB2020S-34 | B004-2020-0099 | B004 | 20200504 | N | 14 |
90 | KHFCMB2020S-34 | B004-2020-0099 | B004 | 20200520 | N | 14 |
7 | KHFCMB2020S-33 | B003-2020-0077 | B003 | 20200331 | N | 12 |
38 | KHFCMB2020S-33 | B088-2020-0105 | B088 | 20200601 | N | 12 |
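A duplicate count can mean different things: the number of distinct row patterns that occur more than once, the number of rows belonging to such patterns, or the number of surplus copies beyond the first. The report does not say which definition its figure of 162 uses. As a rough sketch, the three counts can be computed as follows on the first ten rows shown earlier (rows 0 to 9, which differ only in LOAN_TREAT_DY); `duplicate_summary` is a hypothetical helper name.

```python
from collections import Counter


def duplicate_summary(rows):
    """Summarise exact duplicate rows under three common definitions."""
    counts = Counter(rows)
    repeated = {row: n for row, n in counts.items() if n > 1}
    return {
        "distinct_repeated_rows": len(repeated),
        "rows_in_repeated_groups": sum(repeated.values()),
        "surplus_rows": sum(n - 1 for n in repeated.values()),
    }


# LOAN_TREAT_DY for rows 0-9; the other four columns are identical in these rows.
loan_treat_dy = [
    20200512, 20200318, 20200306, 20200306, 20200520,
    20200520, 20200521, 20200520, 20200515, 20200221,
]
rows = [
    ("KHFCMB2020S-34", "B081-2020-0101", "B081", day, "N")
    for day in loan_treat_dy
]

summary = duplicate_summary(rows)
print(summary)
```

On these ten rows there are two repeated patterns (20200306 twice, 20200520 three times), covering five rows and leaving three surplus copies. Applying the same function to the full dataset would show which definition lines up with the reported figure.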