Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 1000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 76.3 KiB |
Average record size in memory | 78.1 B |
Variable types
Categorical | 5 |
---|---|
Numeric | 3 |
Text | 1 |
Dataset
Description | 한국주택금융공사 유동화자산부 업무 관련 공개 공공데이터 (해당 부서의 업무와 관련된 데이터베이스에서 공개 가능한 원천 데이터) |
---|---|
Author | 한국주택금융공사 |
URL | https://www.data.go.kr/data/15073309/fileData.do |
OBJECT_CODE has constant value "" | Constant |
OFFER_CNT has constant value "" | Constant |
DWRT_BASIS_DY is highly overall correlated with SEQ and 1 other fields | High correlation |
RCV_DY is highly overall correlated with SEQ and 1 other fields | High correlation |
SEQ is highly overall correlated with TRD_SEQ and 2 other fields | High correlation |
TRD_SEQ is highly overall correlated with SEQ | High correlation |
PAY_INT_AMT has 89 (8.9%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-12 10:17:28.249665 |
---|---|
Analysis finished | 2023-12-12 10:17:29.677002 |
Duration | 1.43 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
LOAN_ORG_CD
Categorical
Distinct | 12 |
---|---|
Distinct (%) | 1.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 7.9 KiB |
B088 | |
---|---|
B020 | |
B081 | |
B004 | |
B010 | |
Other values (7) |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 0.1% |
Sample
1st row | B003 |
---|---|
2nd row | B088 |
3rd row | B020 |
4th row | B081 |
5th row | B088 |
Common Values
Value | Count | Frequency (%) |
B088 | 229 | |
B020 | 223 | |
B081 | 206 | |
B004 | 162 | |
B010 | 79 | 7.9% |
B003 | 67 | 6.7% |
B039 | 12 | 1.2% |
B032 | 9 | 0.9% |
I001 | 6 | 0.6% |
B005 | 4 | 0.4% |
Other values (2) | 3 | 0.3% |
Length
Value | Count | Frequency (%) |
b088 | 229 | |
b020 | 223 | |
b081 | 206 | |
b004 | 162 | |
b010 | 79 | 7.9% |
b003 | 67 | 6.7% |
b039 | 12 | 1.2% |
b032 | 9 | 0.9% |
i001 | 6 | 0.6% |
b005 | 4 | 0.4% |
Other values (2) | 3 | 0.3% |
RCV_DY
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 7.9 KiB |
20201024 | |
---|---|
20201025 | |
20201023 |
Length
Max length | 8 |
---|---|
Median length | 8 |
Mean length | 8 |
Min length | 8 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 20201025 |
---|---|
2nd row | 20201025 |
3rd row | 20201025 |
4th row | 20201025 |
5th row | 20201025 |
Common Values
Value | Count | Frequency (%) |
20201024 | 526 | |
20201025 | 403 | |
20201023 | 71 | 7.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
20201024 | 526 | |
20201025 | 403 | |
20201023 | 71 | 7.1% |
DWRT_BASIS_DY
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 7.9 KiB |
20201024 | |
---|---|
20201025 | |
20201023 |
Length
Max length | 8 |
---|---|
Median length | 8 |
Mean length | 8 |
Min length | 8 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 20201025 |
---|---|
2nd row | 20201025 |
3rd row | 20201025 |
4th row | 20201025 |
5th row | 20201025 |
Common Values
Value | Count | Frequency (%) |
20201024 | 526 | |
20201025 | 403 | |
20201023 | 71 | 7.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
20201024 | 526 | |
20201025 | 403 | |
20201023 | 71 | 7.1% |
OBJECT_CODE
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 7.9 KiB |
US |
---|
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | US |
---|---|
2nd row | US |
3rd row | US |
4th row | US |
5th row | US |
Common Values
Value | Count | Frequency (%) |
US | 1000 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
us | 1000 |
OFFER_CNT
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 7.9 KiB |
1 |
---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 1000 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1 | 1000 |
SEQ
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 597 |
---|---|
Distinct (%) | 59.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2104.924 |
Minimum | 1 |
---|---|
Maximum | 27133 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 8.9 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 25.95 |
Q1 | 125.75 |
median | 250.5 |
Q3 | 375.25 |
95-th percentile | 26422.25 |
Maximum | 27133 |
Range | 27132 |
Interquartile range (IQR) | 249.5 |
Descriptive statistics
Standard deviation | 6763.3224 |
---|---|
Coefficient of variation (CV) | 3.2130958 |
Kurtosis | 9.2216976 |
Mean | 2104.924 |
Median Absolute Deviation (MAD) | 125 |
Skewness | 3.3454616 |
Sum | 2104924 |
Variance | 45742530 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
398 | 2 | 0.2% |
140 | 2 | 0.2% |
197 | 2 | 0.2% |
146 | 2 | 0.2% |
221 | 2 | 0.2% |
55 | 2 | 0.2% |
54 | 2 | 0.2% |
120 | 2 | 0.2% |
165 | 2 | 0.2% |
36 | 2 | 0.2% |
Other values (587) | 980 |
Value | Count | Frequency (%) |
1 | 2 | |
2 | 2 | |
3 | 2 | |
4 | 2 | |
5 | 2 | |
6 | 2 | |
7 | 2 | |
8 | 2 | |
9 | 2 | |
10 | 2 |
Value | Count | Frequency (%) |
27133 | 1 | |
27131 | 1 | |
27118 | 1 | |
27114 | 1 | |
27075 | 1 | |
27055 | 1 | |
27024 | 1 | |
27019 | 1 | |
27014 | 1 | |
26996 | 1 |
HOLD_CD
Text
Distinct | 569 |
---|---|
Distinct (%) | 56.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 7.9 KiB |
Length
Max length | 14 |
---|---|
Median length | 14 |
Mean length | 13.92 |
Min length | 13 |
Characters and Unicode
Total characters | 13920 |
---|---|
Distinct characters | 13 |
Distinct categories | 3 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 370 ? |
---|---|
Unique (%) | 37.0% |
Sample
1st row | B003-2020-0074 |
---|---|
2nd row | B088-2020-0105 |
3rd row | B020-2020-0093 |
4th row | B081-2020-0097 |
5th row | B088-2020-0102 |
Value | Count | Frequency (%) |
b088-2020-0105 | 18 | 1.8% |
b003-2020-0075 | 10 | 1.0% |
b020-2019-0110 | 10 | 1.0% |
b081-2020-0097 | 8 | 0.8% |
b081-2015-0061 | 8 | 0.8% |
b004-2020-0096 | 8 | 0.8% |
b003-2020-0074 | 7 | 0.7% |
b020-2020-0093 | 7 | 0.7% |
b088-2020-0102 | 7 | 0.7% |
b081-2016-0033 | 7 | 0.7% |
Other values (559) | 910 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 4814 | |
- | 2000 | |
2 | 1727 | 12.4% |
1 | 1469 | 10.6% |
B | 994 | 7.1% |
8 | 955 | 6.9% |
9 | 367 | 2.6% |
7 | 355 | 2.6% |
4 | 341 | 2.4% |
6 | 308 | 2.2% |
Other values (3) | 590 | 4.2% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 10920 | |
Dash Punctuation | 2000 | 14.4% |
Uppercase Letter | 1000 | 7.2% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 4814 | |
2 | 1727 | 15.8% |
1 | 1469 | 13.5% |
8 | 955 | 8.7% |
9 | 367 | 3.4% |
7 | 355 | 3.3% |
4 | 341 | 3.1% |
6 | 308 | 2.8% |
3 | 297 | 2.7% |
5 | 287 | 2.6% |
Uppercase Letter
Value | Count | Frequency (%) |
B | 994 | |
I | 6 | 0.6% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 2000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 12920 | |
Latin | 1000 | 7.2% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 4814 | |
- | 2000 | |
2 | 1727 | 13.4% |
1 | 1469 | 11.4% |
8 | 955 | 7.4% |
9 | 367 | 2.8% |
7 | 355 | 2.7% |
4 | 341 | 2.6% |
6 | 308 | 2.4% |
3 | 297 | 2.3% |
Latin
Value | Count | Frequency (%) |
B | 994 | |
I | 6 | 0.6% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 13920 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 4814 | |
- | 2000 | |
2 | 1727 | 12.4% |
1 | 1469 | 10.6% |
B | 994 | 7.1% |
8 | 955 | 6.9% |
9 | 367 | 2.6% |
7 | 355 | 2.6% |
4 | 341 | 2.4% |
6 | 308 | 2.2% |
Other values (3) | 590 | 4.2% |
TRD_SEQ
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 155 |
---|---|
Distinct (%) | 15.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 48.778 |
Minimum | 3 |
---|---|
Maximum | 540 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 8.9 KiB |
Quantile statistics
Minimum | 3 |
---|---|
5-th percentile | 7 |
Q1 | 16 |
median | 40 |
Q3 | 67 |
95-th percentile | 118.1 |
Maximum | 540 |
Range | 537 |
Interquartile range (IQR) | 51 |
Descriptive statistics
Standard deviation | 46.007972 |
---|---|
Coefficient of variation (CV) | 0.94321152 |
Kurtosis | 29.409999 |
Mean | 48.778 |
Median Absolute Deviation (MAD) | 25 |
Skewness | 3.7798119 |
Sum | 48778 |
Variance | 2116.7334 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
7 | 31 | 3.1% |
11 | 26 | 2.6% |
13 | 26 | 2.6% |
12 | 25 | 2.5% |
10 | 24 | 2.4% |
8 | 22 | 2.2% |
9 | 22 | 2.2% |
15 | 20 | 2.0% |
14 | 19 | 1.9% |
16 | 19 | 1.9% |
Other values (145) | 766 |
Value | Count | Frequency (%) |
3 | 2 | 0.2% |
4 | 8 | 0.8% |
5 | 6 | 0.6% |
6 | 17 | |
7 | 31 | |
8 | 22 | |
9 | 22 | |
10 | 24 | |
11 | 26 | |
12 | 25 |
Value | Count | Frequency (%) |
540 | 1 | |
539 | 1 | |
355 | 1 | |
279 | 1 | |
278 | 1 | |
277 | 1 | |
253 | 1 | |
252 | 1 | |
203 | 1 | |
202 | 1 |
PAY_INT_AMT
Real number (ℝ)
ZEROS
 
Distinct | 877 |
---|---|
Distinct (%) | 87.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 108671.12 |
Minimum | 0 |
---|---|
Maximum | 1065902 |
Zeros | 89 |
Zeros (%) | 8.9% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 8.9 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 763.5 |
median | 21458.5 |
Q3 | 183223 |
95-th percentile | 410189.1 |
Maximum | 1065902 |
Range | 1065902 |
Interquartile range (IQR) | 182459.5 |
Descriptive statistics
Standard deviation | 147985.73 |
---|---|
Coefficient of variation (CV) | 1.3617761 |
Kurtosis | 2.8828828 |
Mean | 108671.12 |
Median Absolute Deviation (MAD) | 21458.5 |
Skewness | 1.6004735 |
Sum | 1.0867112 × 108 |
Variance | 2.1899777 × 1010 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 89 | 8.9% |
65 | 3 | 0.3% |
84 | 3 | 0.3% |
1180 | 3 | 0.3% |
1462 | 2 | 0.2% |
3762 | 2 | 0.2% |
1807 | 2 | 0.2% |
4918 | 2 | 0.2% |
90 | 2 | 0.2% |
1724 | 2 | 0.2% |
Other values (867) | 890 |
Value | Count | Frequency (%) |
0 | 89 | |
1 | 2 | 0.2% |
3 | 1 | 0.1% |
4 | 1 | 0.1% |
8 | 1 | 0.1% |
9 | 1 | 0.1% |
10 | 1 | 0.1% |
11 | 2 | 0.2% |
12 | 1 | 0.1% |
15 | 1 | 0.1% |
Value | Count | Frequency (%) |
1065902 | 1 | |
768301 | 1 | |
713536 | 1 | |
658388 | 1 | |
622030 | 1 | |
621779 | 1 | |
616380 | 1 | |
593259 | 1 | |
593053 | 1 | |
585532 | 1 |
LOAN_ORG_CD | RCV_DY | DWRT_BASIS_DY | SEQ | TRD_SEQ | PAY_INT_AMT | |
---|---|---|---|---|---|---|
LOAN_ORG_CD | 1.000 | 0.297 | 0.297 | 0.248 | 0.336 | 0.124 |
RCV_DY | 0.297 | 1.000 | 1.000 | 1.000 | 0.210 | 0.300 |
DWRT_BASIS_DY | 0.297 | 1.000 | 1.000 | 1.000 | 0.210 | 0.300 |
SEQ | 0.248 | 1.000 | 1.000 | 1.000 | 0.238 | 0.203 |
TRD_SEQ | 0.336 | 0.210 | 0.210 | 0.238 | 1.000 | 0.126 |
PAY_INT_AMT | 0.124 | 0.300 | 0.300 | 0.203 | 0.126 | 1.000 |
DWRT_BASIS_DY | LOAN_ORG_CD | RCV_DY | |
---|---|---|---|
DWRT_BASIS_DY | 1.000 | 0.139 | 1.000 |
LOAN_ORG_CD | 0.139 | 1.000 | 0.139 |
RCV_DY | 1.000 | 0.139 | 1.000 |
SEQ | TRD_SEQ | PAY_INT_AMT | LOAN_ORG_CD | RCV_DY | DWRT_BASIS_DY | |
---|---|---|---|---|---|---|
SEQ | 1.000 | -0.713 | 0.176 | 0.192 | 0.999 | 0.999 |
TRD_SEQ | -0.713 | 1.000 | -0.389 | 0.149 | 0.135 | 0.135 |
PAY_INT_AMT | 0.176 | -0.389 | 1.000 | 0.053 | 0.138 | 0.138 |
LOAN_ORG_CD | 0.192 | 0.149 | 0.053 | 1.000 | 0.139 | 0.139 |
RCV_DY | 0.999 | 0.135 | 0.138 | 0.139 | 1.000 | 1.000 |
DWRT_BASIS_DY | 0.999 | 0.135 | 0.138 | 0.139 | 1.000 | 1.000 |
LOAN_ORG_CD | RCV_DY | DWRT_BASIS_DY | OBJECT_CODE | OFFER_CNT | SEQ | HOLD_CD | TRD_SEQ | PAY_INT_AMT | |
---|---|---|---|---|---|---|---|---|---|
0 | B003 | 20201025 | 20201025 | US | 1 | 398 | B003-2020-0074 | 7 | 494497 |
1 | B088 | 20201025 | 20201025 | US | 1 | 397 | B088-2020-0105 | 6 | 622030 |
2 | B020 | 20201025 | 20201025 | US | 1 | 396 | B020-2020-0093 | 14 | 99 |
3 | B081 | 20201025 | 20201025 | US | 1 | 402 | B081-2020-0097 | 8 | 242462 |
4 | B088 | 20201025 | 20201025 | US | 1 | 392 | B088-2020-0102 | 10 | 316471 |
5 | B088 | 20201025 | 20201025 | US | 1 | 264 | B088-2020-0102 | 31 | 1369 |
6 | B020 | 20201025 | 20201025 | US | 1 | 394 | B020-2020-0093 | 6 | 334086 |
7 | B020 | 20201025 | 20201025 | US | 1 | 388 | B020-2020-0088 | 21 | 593259 |
8 | B020 | 20201025 | 20201025 | US | 1 | 403 | B020-2020-0085 | 12 | 4467 |
9 | B020 | 20201025 | 20201025 | US | 1 | 395 | B020-2020-0085 | 9 | 2704 |
LOAN_ORG_CD | RCV_DY | DWRT_BASIS_DY | OBJECT_CODE | OFFER_CNT | SEQ | HOLD_CD | TRD_SEQ | PAY_INT_AMT | |
---|---|---|---|---|---|---|---|---|---|
990 | B081 | 20201023 | 20201023 | US | 1 | 26288 | B081-2020-0096 | 7 | 585532 |
991 | B081 | 20201023 | 20201023 | US | 1 | 26138 | B081-2020-0096 | 13 | 207 |
992 | B020 | 20201023 | 20201023 | US | 1 | 26497 | B020-2020-0093 | 6 | 130583 |
993 | B088 | 20201023 | 20201023 | US | 1 | 26793 | B088-2020-0102 | 15 | 2406 |
994 | B081 | 20201023 | 20201023 | US | 1 | 26100 | B081-2020-0097 | 7 | 86000 |
995 | B004 | 20201023 | 20201023 | US | 1 | 26565 | B004-2020-0096 | 7 | 302477 |
996 | B088 | 20201023 | 20201023 | US | 1 | 26689 | B088-2020-0102 | 7 | 138634 |
997 | B081 | 20201023 | 20201023 | US | 1 | 26257 | B081-2020-0097 | 7 | 70520 |
998 | B004 | 20201023 | 20201023 | US | 1 | 24937 | B004-2020-0097 | 8 | 234959 |
999 | B081 | 20201023 | 20201023 | US | 1 | 26072 | B081-2020-0096 | 7 | 390895 |