Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 1000 |
Missing cells | 950 |
Missing cells (%) | 15.8% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 52.9 KiB |
Average record size in memory | 54.1 B |
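The missing-cell percentage above follows from the other figures in this table. A minimal sketch of the arithmetic, assuming the percentage is taken over all cells (variables × observations) and that every missing cell sits in PETITN_ACPT_DY, as the alerts in this report state; the variable names are illustrative and not part of the report:

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 6
n_observations = 1000
missing_cells = 950

# Share of missing cells over the whole table (assumed denominator: all cells).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Share of missing values within the single affected column.
column_missing_pct = 100 * missing_cells / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Missing within PETITN_ACPT_DY: {column_missing_pct:.1f}%")
```

The dataset-level figure looks moderate only because the one sparse column is averaged over six columns.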
Variable types
Numeric | 4 |
---|---|
Categorical | 2 |
Dataset
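Description | Open public data related to the work of the Claims Management Department of the Korea Housing Finance Corporation (publicly releasable source data from the databases related to that department's work) |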
---|---|
Author | Korea Housing Finance Corporation |
URL | https://www.data.go.kr/data/15072891/fileData.do |
Alerts
PETITN_ACPT_DY is highly overall correlated with CO_LAWST_POS_CD and 1 other field | High correlation |
CO_LAWST_POS_CD is highly overall correlated with PETITN_ACPT_DY | High correlation |
LAWST_CLSS_DVCD is highly overall correlated with PETITN_ACPT_DY | High correlation |
CO_LAWST_POS_CD is highly imbalanced (91.3%) | Imbalance |
LAWST_CLSS_DVCD is highly imbalanced (91.1%) | Imbalance |
PETITN_ACPT_DY has 950 (95.0%) missing values | Missing |
ACPT_PTNO has unique values | Unique |
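The two imbalance percentages above can be reproduced from the category counts listed later in this report. A minimal sketch, assuming the score is one minus the Shannon entropy of the class shares normalized by the maximum entropy for that number of classes; this formula reproduces the reported figures, but that the profiler computes it exactly this way is an assumption, and `imbalance_score` is a hypothetical helper name:

```python
from math import log2


def imbalance_score(counts):
    """Return 1 - H / log2(k): 0 for a uniform split, near 1 for a dominant class."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # a single class has no split to measure
    # Shannon entropy (base 2) of the class shares, skipping empty classes.
    entropy = -sum((c / total) * log2(c / total) for c in counts if c > 0)
    return 1 - entropy / log2(k)


# Category counts taken from the CO_LAWST_POS_CD and LAWST_CLSS_DVCD sections.
co_lawst_pos_cd = imbalance_score([989, 11])
lawst_clss_dvcd = imbalance_score([981, 18, 1])

print(f"CO_LAWST_POS_CD: {co_lawst_pos_cd:.1%}")
print(f"LAWST_CLSS_DVCD: {lawst_clss_dvcd:.1%}")
```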
Reproduction
Analysis started | 2023-12-12 19:42:26.411663 |
---|---|
Analysis finished | 2023-12-12 19:42:28.794334 |
Duration | 2.38 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
ACPT_PTNO
Real number (ℝ)
UNIQUE
Distinct | 1000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.0201304 × 10^10 |
Minimum | 2.0201301 × 10^10 |
---|---|
Maximum | 2.0201305 × 10^10 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 8.9 KiB |
Quantile statistics
Minimum | 2.0201301 × 10^10 |
---|---|
5-th percentile | 2.0201303 × 10^10 |
Q1 | 2.0201303 × 10^10 |
median | 2.0201304 × 10^10 |
Q3 | 2.0201304 × 10^10 |
95-th percentile | 2.0201304 × 10^10 |
Maximum | 2.0201305 × 10^10 |
Range | 3171 |
Interquartile range (IQR) | 786.5 |
Descriptive statistics
Standard deviation | 443.22577 |
---|---|
Coefficient of variation (CV) | 2.1940454 × 10^-8 |
Kurtosis | 0.0045814669 |
Mean | 2.0201304 × 10^10 |
Median Absolute Deviation (MAD) | 394.5 |
Skewness | -0.29593241 |
Sum | 2.0201304 × 10^13 |
Variance | 196449.08 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
20201304549 | 1 | 0.1% |
20201303574 | 1 | 0.1% |
20201303591 | 1 | 0.1% |
20201303589 | 1 | 0.1% |
20201303588 | 1 | 0.1% |
20201303603 | 1 | 0.1% |
20201303586 | 1 | 0.1% |
20201303587 | 1 | 0.1% |
20201303584 | 1 | 0.1% |
20201303581 | 1 | 0.1% |
Other values (990) | 990 | 99.0% |
Minimum 10 values
Value | Count | Frequency (%) |
20201301378 | 1 | 0.1% |
20201301723 | 1 | 0.1% |
20201302946 | 1 | 0.1% |
20201303073 | 1 | 0.1% |
20201303085 | 1 | 0.1% |
20201303121 | 1 | 0.1% |
20201303122 | 1 | 0.1% |
20201303123 | 1 | 0.1% |
20201303124 | 1 | 0.1% |
20201303125 | 1 | 0.1% |
Maximum 10 values
Value | Count | Frequency (%) |
20201304549 | 1 | 0.1% |
20201304548 | 1 | 0.1% |
20201304545 | 1 | 0.1% |
20201304544 | 1 | 0.1% |
20201304543 | 1 | 0.1% |
20201304542 | 1 | 0.1% |
20201304540 | 1 | 0.1% |
20201304539 | 1 | 0.1% |
20201304538 | 1 | 0.1% |
20201304537 | 1 | 0.1% |
MDBTR_CUST_NO
Real number (ℝ)
Distinct | 880 |
---|---|
Distinct (%) | 88.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 82278016 |
Minimum | 268952 |
---|---|
Maximum | 1.362029 × 10^8 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 8.9 KiB |
Quantile statistics
Minimum | 268952 |
---|---|
5-th percentile | 24283638 |
Q1 | 66891528 |
median | 88117561 |
Q3 | 1.01497 × 10^8 |
95-th percentile | 1.2122536 × 10^8 |
Maximum | 1.362029 × 10^8 |
Range | 1.3593395 × 10^8 |
Interquartile range (IQR) | 34605476 |
Descriptive statistics
Standard deviation | 29403751 |
---|---|
Coefficient of variation (CV) | 0.35737069 |
Kurtosis | -0.24947517 |
Mean | 82278016 |
Median Absolute Deviation (MAD) | 18052321 |
Skewness | -0.68862151 |
Sum | 8.2278016 × 10^10 |
Variance | 8.6458058 × 10^14 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
93908202 | 3 | 0.3% |
118880218 | 3 | 0.3% |
81675356 | 3 | 0.3% |
21564211 | 3 | 0.3% |
66870861 | 3 | 0.3% |
65681214 | 3 | 0.3% |
73584824 | 3 | 0.3% |
77911778 | 3 | 0.3% |
88825154 | 3 | 0.3% |
101462546 | 3 | 0.3% |
Other values (870) | 970 | 97.0% |
Minimum 10 values
Value | Count | Frequency (%) |
268952 | 1 | 0.1% |
782315 | 1 | 0.1% |
3001912 | 1 | 0.1% |
3792483 | 2 | 0.2% |
6105860 | 1 | 0.1% |
6867603 | 1 | 0.1% |
8763695 | 1 | 0.1% |
9088654 | 1 | 0.1% |
10023174 | 1 | 0.1% |
10295313 | 2 | 0.2% |
Maximum 10 values
Value | Count | Frequency (%) |
136202904 | 1 | 0.1% |
135204725 | 1 | 0.1% |
130928776 | 1 | 0.1% |
128943598 | 1 | 0.1% |
128881391 | 1 | 0.1% |
128236111 | 1 | 0.1% |
128050287 | 2 | 0.2% |
127947126 | 1 | 0.1% |
127129038 | 2 | 0.2% |
126786456 | 1 | 0.1% |
LAWST_TYP_CD
Real number (ℝ)
Distinct | 14 |
---|---|
Distinct (%) | 1.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 11.978 |
Minimum | 1 |
---|---|
Maximum | 99 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 8.9 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 15 |
Q3 | 15 |
95-th percentile | 16 |
Maximum | 99 |
Range | 98 |
Interquartile range (IQR) | 14 |
Descriptive statistics
Standard deviation | 11.288709 |
---|---|
Coefficient of variation (CV) | 0.94245358 |
Kurtosis | 36.387236 |
Mean | 11.978 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 4.8154287 |
Sum | 11978 |
Variance | 127.43495 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
15 | 643 | 64.3% |
1 | 287 | 28.7% |
21 | 26 | 2.6% |
99 | 11 | 1.1% |
20 | 11 | 1.1% |
16 | 4 | 0.4% |
2 | 4 | 0.4% |
6 | 3 | 0.3% |
11 | 3 | 0.3% |
7 | 2 | 0.2% |
Other values (4) | 6 | 0.6% |
Minimum 10 values
Value | Count | Frequency (%) |
1 | 287 | 28.7% |
2 | 4 | 0.4% |
4 | 1 | 0.1% |
6 | 3 | 0.3% |
7 | 2 | 0.2% |
8 | 2 | 0.2% |
10 | 1 | 0.1% |
11 | 3 | 0.3% |
12 | 2 | 0.2% |
15 | 643 | 64.3% |
Maximum 10 values
Value | Count | Frequency (%) |
99 | 11 | 1.1% |
21 | 26 | 2.6% |
20 | 11 | 1.1% |
16 | 4 | 0.4% |
15 | 643 | 64.3% |
12 | 2 | 0.2% |
11 | 3 | 0.3% |
10 | 1 | 0.1% |
8 | 2 | 0.2% |
7 | 2 | 0.2% |
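Because LAWST_TYP_CD takes only 14 distinct values, its frequency tables are enough to rebuild the whole column and check the summary statistics reported for it. A minimal sketch using only the Python standard library; the `counts` mapping is transcribed from the tables above (the four values grouped under "Other values" are 4, 8, 10 and 12, as listed among the minimum and maximum values), and the use of the sample variance with an n − 1 denominator is an assumption about how the report computes it:

```python
from statistics import mean, median, variance

# Value -> count, transcribed from the LAWST_TYP_CD frequency tables.
counts = {1: 287, 2: 4, 4: 1, 6: 3, 7: 2, 8: 2, 10: 1, 11: 3,
          12: 2, 15: 643, 16: 4, 20: 11, 21: 26, 99: 11}

# Expand the frequency table back into individual observations.
values = [v for v, c in counts.items() for _ in range(c)]

n = len(values)
avg = mean(values)
med = median(values)
# Median absolute deviation: median distance from the median.
mad = median(abs(v - med) for v in values)
# Sample variance (n - 1 denominator), assumed to match the report.
var = variance(values)

print(n, avg, med, mad, round(var, 5))
```

A MAD of 0 alongside a standard deviation above 11 reflects that more than half of the rows share the single code 15 while a few rows carry the code 99. Since these values are codes, the mean and variance describe the encoding rather than any quantity.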
CO_LAWST_POS_CD
Categorical
HIGH CORRELATION
IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 7.9 KiB |
1 | 989 |
---|---|
2 | 11 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 989 | 98.9% |
2 | 11 | 1.1% |
LAWST_CLSS_DVCD
Categorical
HIGH CORRELATION
IMBALANCE
Distinct | 3 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 7.9 KiB |
1 | 981 |
---|---|
2 | 18 |
3 | 1 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 1 |
---|---|
Unique (%) | 0.1% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 981 | 98.1% |
2 | 18 | 1.8% |
3 | 1 | 0.1% |
PETITN_ACPT_DY
Real number (ℝ)
HIGH CORRELATION
MISSING
Distinct | 45 |
---|---|
Distinct (%) | 90.0% |
Missing | 950 |
Missing (%) | 95.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20197687 |
Minimum | 20120917 |
---|---|
Maximum | 20201005 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 8.9 KiB |
Quantile statistics
Minimum | 20120917 |
---|---|
5-th percentile | 20190869 |
Q1 | 20200347 |
median | 20200610 |
Q3 | 20200722 |
95-th percentile | 20200913 |
Maximum | 20201005 |
Range | 80088 |
Interquartile range (IQR) | 374.5 |
Descriptive statistics
Standard deviation | 11578.97 |
---|---|
Coefficient of variation (CV) | 0.00057328197 |
Kurtosis | 41.403365 |
Mean | 20197687 |
Median Absolute Deviation (MAD) | 196.5 |
Skewness | -6.20946 |
Sum | 1.0098843 × 10^9 |
Variance | 1.3407254 × 10^8 |
Monotonicity | Not monotonic |
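The values of PETITN_ACPT_DY look like calendar dates written as YYYYMMDD integers, which would make the numeric range, mean and variance above hard to interpret. A minimal sketch of the difference, assuming that date encoding (the report itself types the column as a real number, so this is an interpretation) and using a hypothetical helper `parse_yyyymmdd`:

```python
from datetime import datetime


def parse_yyyymmdd(value):
    """Interpret an integer such as 20201005 as the date 2020-10-05."""
    return datetime.strptime(str(value), "%Y%m%d").date()


# Minimum and maximum copied from the quantile statistics above.
raw_min, raw_max = 20120917, 20201005

earliest = parse_yyyymmdd(raw_min)
latest = parse_yyyymmdd(raw_max)

# The reported "Range" is plain integer subtraction of the encoded values.
numeric_range = raw_max - raw_min
# The elapsed time between the two dates, in days.
calendar_span_days = (latest - earliest).days

print(earliest, latest, numeric_range, calendar_span_days)
```

Under that assumption, converting the column to a date type before profiling would give statistics in days rather than in units of the integer encoding.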
Common Values
Value | Count | Frequency (%) |
20200717 | 2 | 0.2% |
20200409 | 2 | 0.2% |
20200615 | 2 | 0.2% |
20200807 | 2 | 0.2% |
20200629 | 2 | 0.2% |
20191206 | 1 | 0.1% |
20200828 | 1 | 0.1% |
20190829 | 1 | 0.1% |
20200722 | 1 | 0.1% |
20200713 | 1 | 0.1% |
Other values (35) | 35 | 3.5% |
(Missing) | 950 | 95.0% |
Minimum 10 values
Value | Count | Frequency (%) |
20120917 | 1 | 0.1% |
20190617 | 1 | 0.1% |
20190829 | 1 | 0.1% |
20190918 | 1 | 0.1% |
20191122 | 1 | 0.1% |
20191206 | 1 | 0.1% |
20191213 | 1 | 0.1% |
20191218 | 1 | 0.1% |
20200302 | 1 | 0.1% |
20200311 | 1 | 0.1% |
Maximum 10 values
Value | Count | Frequency (%) |
20201005 | 1 | 0.1% |
20200923 | 1 | 0.1% |
20200922 | 1 | 0.1% |
20200902 | 1 | 0.1% |
20200831 | 1 | 0.1% |
20200828 | 1 | 0.1% |
20200825 | 1 | 0.1% |
20200813 | 1 | 0.1% |
20200807 | 2 | 0.2% |
20200724 | 1 | 0.1% |
Correlations
 | ACPT_PTNO | MDBTR_CUST_NO | LAWST_TYP_CD | CO_LAWST_POS_CD | LAWST_CLSS_DVCD | PETITN_ACPT_DY |
---|---|---|---|---|---|---|
ACPT_PTNO | 1.000 | 0.000 | 0.156 | 0.000 | 0.000 | 0.000 |
MDBTR_CUST_NO | 0.000 | 1.000 | 0.241 | 0.000 | 0.117 | 0.000 |
LAWST_TYP_CD | 0.156 | 0.241 | 1.000 | 0.541 | 0.265 | 0.000 |
CO_LAWST_POS_CD | 0.000 | 0.000 | 0.541 | 1.000 | 0.220 | NaN |
LAWST_CLSS_DVCD | 0.000 | 0.117 | 0.265 | 0.220 | 1.000 | NaN |
PETITN_ACPT_DY | 0.000 | 0.000 | 0.000 | NaN | NaN | 1.000 |
 | LAWST_CLSS_DVCD | CO_LAWST_POS_CD |
---|---|---|
LAWST_CLSS_DVCD | 1.000 | 0.360 |
CO_LAWST_POS_CD | 0.360 | 1.000 |
 | ACPT_PTNO | MDBTR_CUST_NO | LAWST_TYP_CD | PETITN_ACPT_DY | CO_LAWST_POS_CD | LAWST_CLSS_DVCD |
---|---|---|---|---|---|---|
ACPT_PTNO | 1.000 | -0.012 | 0.028 | 0.425 | 0.293 | 0.000 |
MDBTR_CUST_NO | -0.012 | 1.000 | 0.084 | 0.043 | 0.023 | 0.069 |
LAWST_TYP_CD | 0.028 | 0.084 | 1.000 | 0.363 | 0.369 | 0.253 |
PETITN_ACPT_DY | 0.425 | 0.043 | 0.363 | 1.000 | 1.000 | 1.000 |
CO_LAWST_POS_CD | 0.293 | 0.023 | 0.369 | 1.000 | 1.000 | 0.360 |
LAWST_CLSS_DVCD | 0.000 | 0.069 | 0.253 | 1.000 | 0.360 | 1.000 |
First rows
 | ACPT_PTNO | MDBTR_CUST_NO | LAWST_TYP_CD | CO_LAWST_POS_CD | LAWST_CLSS_DVCD | PETITN_ACPT_DY |
---|---|---|---|---|---|---|
0 | 20201304549 | 98434771 | 1 | 1 | 1 | <NA> |
1 | 20201304548 | 47876459 | 99 | 1 | 1 | <NA> |
2 | 20201304545 | 46504654 | 1 | 1 | 1 | <NA> |
3 | 20201304544 | 87114989 | 1 | 1 | 1 | <NA> |
4 | 20201304543 | 83066963 | 15 | 1 | 1 | <NA> |
5 | 20201304542 | 100124306 | 15 | 1 | 1 | <NA> |
6 | 20201304539 | 77261716 | 1 | 1 | 1 | 20190617 |
7 | 20201304538 | 109950050 | 1 | 1 | 1 | <NA> |
8 | 20201304540 | 87879059 | 1 | 1 | 1 | <NA> |
9 | 20201304536 | 124895404 | 15 | 1 | 1 | <NA> |
Last rows
 | ACPT_PTNO | MDBTR_CUST_NO | LAWST_TYP_CD | CO_LAWST_POS_CD | LAWST_CLSS_DVCD | PETITN_ACPT_DY |
---|---|---|---|---|---|---|
990 | 20201303142 | 97049413 | 15 | 1 | 1 | <NA> |
991 | 20201303133 | 113350206 | 20 | 1 | 1 | <NA> |
992 | 20201303132 | 118566749 | 15 | 1 | 1 | <NA> |
993 | 20201303131 | 20927396 | 11 | 1 | 1 | <NA> |
994 | 20201303135 | 25559141 | 1 | 1 | 1 | <NA> |
995 | 20201303125 | 125385036 | 15 | 1 | 1 | 20191218 |
996 | 20201303124 | 32122587 | 15 | 1 | 1 | <NA> |
997 | 20201303123 | 110531222 | 15 | 1 | 1 | 20200319 |
998 | 20201303073 | 3792483 | 2 | 1 | 1 | <NA> |
999 | 20201303121 | 126786456 | 15 | 1 | 1 | <NA> |