Overview

Dataset statistics

Number of variables8
Number of observations1000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory64.6 KiB
Average record size in memory66.1 B

Variable types

Text1
Categorical4
Numeric2
DateTime1

Dataset

Description한국주택금융공사 유동화자산부 업무 관련 공개 공공데이터 (해당 부서의 업무와 관련된 데이터베이스에서 공개 가능한 원천 데이터)
Author한국주택금융공사
URLhttps://www.data.go.kr/data/15073061/fileData.do

Alerts

PROD_GRP_NM is highly overall correlated with PROD_GRP_CDHigh correlation
LOAN_ORG_CD is highly overall correlated with LOAN_ORG_NMHigh correlation
LOAN_ORG_NM is highly overall correlated with LOAN_ORG_CDHigh correlation
PROD_GRP_CD is highly overall correlated with PROD_GRP_NMHigh correlation
LOAN_RAMT is highly overall correlated with LOAN_CNTHigh correlation
LOAN_CNT is highly overall correlated with LOAN_RAMTHigh correlation
LOAN_RAMT has unique valuesUnique

Reproduction

Analysis started2023-12-12 12:14:43.388763
Analysis finished2023-12-12 12:14:45.105277
Duration1.72 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct61
Distinct (%)6.1%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
2023-12-12T21:14:45.345390image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length14
Mean length14
Min length14

Characters and Unicode

Total characters14000
Distinct characters18
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5 ?
Unique (%)0.5%

Sample

1st rowKHFCMB2017S-23
2nd rowKHFCMB2017S-23
3rd rowKHFCMB2017S-23
4th rowKHFCMB2017S-23
5th rowKHFCMB2017S-23
ValueCountFrequency (%)
khfcmb2015s-17 45
 
4.5%
khfcmb2017s-23 37
 
3.7%
khfcmb2015s-05 36
 
3.6%
khfcmb2015s-03 35
 
3.5%
khfcmb2016s-03 34
 
3.4%
khfcmb2015s-20 34
 
3.4%
khfcmb2015s-07 33
 
3.3%
khfcmb2015s-22 33
 
3.3%
khfcmb2015s-23 31
 
3.1%
khfcmb2016s-05 29
 
2.9%
Other values (51) 653
65.3%
2023-12-12T21:14:45.838107image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 1852
13.2%
2 1400
10.0%
1 1012
 
7.2%
K 1000
 
7.1%
- 1000
 
7.1%
F 1000
 
7.1%
C 1000
 
7.1%
M 1000
 
7.1%
B 1000
 
7.1%
H 1000
 
7.1%
Other values (8) 2736
19.5%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 7000
50.0%
Decimal Number 6000
42.9%
Dash Punctuation 1000
 
7.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1852
30.9%
2 1400
23.3%
1 1012
16.9%
5 839
14.0%
6 225
 
3.8%
7 207
 
3.5%
4 192
 
3.2%
3 179
 
3.0%
8 66
 
1.1%
9 28
 
0.5%
Uppercase Letter
ValueCountFrequency (%)
K 1000
14.3%
F 1000
14.3%
C 1000
14.3%
M 1000
14.3%
B 1000
14.3%
H 1000
14.3%
S 1000
14.3%
Dash Punctuation
ValueCountFrequency (%)
- 1000
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 7000
50.0%
Latin 7000
50.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 1852
26.5%
2 1400
20.0%
1 1012
14.5%
- 1000
14.3%
5 839
12.0%
6 225
 
3.2%
7 207
 
3.0%
4 192
 
2.7%
3 179
 
2.6%
8 66
 
0.9%
Latin
ValueCountFrequency (%)
K 1000
14.3%
F 1000
14.3%
C 1000
14.3%
M 1000
14.3%
B 1000
14.3%
H 1000
14.3%
S 1000
14.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 14000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 1852
13.2%
2 1400
10.0%
1 1012
 
7.2%
K 1000
 
7.1%
- 1000
 
7.1%
F 1000
 
7.1%
C 1000
 
7.1%
M 1000
 
7.1%
B 1000
 
7.1%
H 1000
 
7.1%
Other values (8) 2736
19.5%

LOAN_ORG_CD
Categorical

HIGH CORRELATION 

Distinct28
Distinct (%)2.8%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
B081
86 
B020
86 
B004
83 
B010
81 
B088
79 
Other values (23)
585 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st rowI001
2nd rowI001
3rd rowB088
4th rowB088
5th rowB088

Common Values

ValueCountFrequency (%)
B081 86
 
8.6%
B020 86
 
8.6%
B004 83
 
8.3%
B010 81
 
8.1%
B088 79
 
7.9%
B003 75
 
7.5%
B039 69
 
6.9%
B023 62
 
6.2%
B005 59
 
5.9%
B032 48
 
4.8%
Other values (18) 272
27.2%

Length

2023-12-12T21:14:46.027001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
b081 86
 
8.6%
b020 86
 
8.6%
b004 83
 
8.3%
b010 81
 
8.1%
b088 79
 
7.9%
b003 75
 
7.5%
b039 69
 
6.9%
b023 62
 
6.2%
b005 59
 
5.9%
b032 48
 
4.8%
Other values (18) 272
27.2%

PROD_GRP_CD
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
C
277 
M
276 
T
241 
A
202 
P
 
4

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowT
2nd rowC
3rd rowT
4th rowC
5th rowA

Common Values

ValueCountFrequency (%)
C 277
27.7%
M 276
27.6%
T 241
24.1%
A 202
20.2%
P 4
 
0.4%

Length

2023-12-12T21:14:46.190154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:14:46.351232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
c 277
27.7%
m 276
27.6%
t 241
24.1%
a 202
20.2%
p 4
 
0.4%

LOAN_ORG_NM
Categorical

HIGH CORRELATION 

Distinct22
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
하나은행
86 
우리은행
86 
국민은행
83 
농협은행
81 
신한은행
79 
Other values (17)
585 

Length

Max length7
Median length4
Mean length3.969
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row삼성생명
2nd row삼성생명
3rd row신한은행
4th row신한은행
5th row신한은행

Common Values

ValueCountFrequency (%)
하나은행 86
 
8.6%
우리은행 86
 
8.6%
국민은행 83
 
8.3%
농협은행 81
 
8.1%
신한은행 79
 
7.9%
기업은행 75
 
7.5%
경남은행 69
 
6.9%
SC은행 62
 
6.2%
<NA> 61
 
6.1%
외환은행 59
 
5.9%
Other values (12) 259
25.9%

Length

2023-12-12T21:14:46.513644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
하나은행 86
 
8.6%
우리은행 86
 
8.6%
국민은행 83
 
8.3%
농협은행 81
 
8.1%
신한은행 79
 
7.9%
기업은행 75
 
7.5%
경남은행 69
 
6.9%
sc은행 62
 
6.2%
na 61
 
6.1%
외환은행 59
 
5.9%
Other values (12) 259
25.9%

PROD_GRP_NM
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
Conforming Loan
277 
(t+e)-보금자리론
276 
u-보금자리론
241 
내집마련 디딤돌
202 
MBS-SWAP
 
4

Length

Max length15
Median length11
Mean length10.526
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowu-보금자리론
2nd rowConforming Loan
3rd rowu-보금자리론
4th rowConforming Loan
5th row내집마련 디딤돌

Common Values

ValueCountFrequency (%)
Conforming Loan 277
27.7%
(t+e)-보금자리론 276
27.6%
u-보금자리론 241
24.1%
내집마련 디딤돌 202
20.2%
MBS-SWAP 4
 
0.4%

Length

2023-12-12T21:14:46.708221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:14:46.844577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
conforming 277
18.7%
loan 277
18.7%
t+e)-보금자리론 276
18.7%
u-보금자리론 241
16.3%
내집마련 202
13.7%
디딤돌 202
13.7%
mbs-swap 4
 
0.3%

LOAN_RAMT
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct1000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.2069243 × 1010
Minimum31625000
Maximum2.05185 × 1012
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.9 KiB
2023-12-12T21:14:47.019090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum31625000
5-th percentile3.3980523 × 108
Q15.3001221 × 109
median1.8488541 × 1010
Q35.8077585 × 1010
95-th percentile3.8024363 × 1011
Maximum2.05185 × 1012
Range2.0518184 × 1012
Interquartile range (IQR)5.2777463 × 1010

Descriptive statistics

Standard deviation1.6963431 × 1011
Coefficient of variation (CV)2.3537685
Kurtosis38.147065
Mean7.2069243 × 1010
Median Absolute Deviation (MAD)1.6711585 × 1010
Skewness5.2903431
Sum7.2069243 × 1013
Variance2.8775799 × 1022
MonotonicityNot monotonic
2023-12-12T21:14:47.229531image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1713034563 1
 
0.1%
44688677937 1
 
0.1%
247000000 1
 
0.1%
1312782044 1
 
0.1%
2408083940 1
 
0.1%
4231084311 1
 
0.1%
34806428138 1
 
0.1%
592343459 1
 
0.1%
10092078355 1
 
0.1%
733000000 1
 
0.1%
Other values (990) 990
99.0%
ValueCountFrequency (%)
31625000 1
0.1%
45000000 1
0.1%
50000000 1
0.1%
60264317 1
0.1%
65000000 1
0.1%
76700000 1
0.1%
80880811 1
0.1%
81166670 1
0.1%
91432602 1
0.1%
92000000 1
0.1%
ValueCountFrequency (%)
2051850024421 1
0.1%
1666665798016 1
0.1%
1191412690709 1
0.1%
1184978549620 1
0.1%
1184488528940 1
0.1%
1183419327546 1
0.1%
1064086296730 1
0.1%
1058321723229 1
0.1%
967219233086 1
0.1%
898575562991 1
0.1%

LOAN_CNT
Real number (ℝ)

HIGH CORRELATION 

Distinct620
Distinct (%)62.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean777.296
Minimum1
Maximum21845
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.9 KiB
2023-12-12T21:14:47.426253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4
Q159.75
median217.5
Q3676
95-th percentile3529.9
Maximum21845
Range21844
Interquartile range (IQR)616.25

Descriptive statistics

Standard deviation1838.2265
Coefficient of variation (CV)2.364899
Kurtosis44.594851
Mean777.296
Median Absolute Deviation (MAD)199.5
Skewness5.8194308
Sum777296
Variance3379076.7
MonotonicityNot monotonic
2023-12-12T21:14:47.631256image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 18
 
1.8%
4 17
 
1.7%
2 15
 
1.5%
3 15
 
1.5%
8 11
 
1.1%
6 11
 
1.1%
11 8
 
0.8%
7 8
 
0.8%
5 7
 
0.7%
12 7
 
0.7%
Other values (610) 883
88.3%
ValueCountFrequency (%)
1 18
1.8%
2 15
1.5%
3 15
1.5%
4 17
1.7%
5 7
 
0.7%
6 11
1.1%
7 8
0.8%
8 11
1.1%
9 6
 
0.6%
10 4
 
0.4%
ValueCountFrequency (%)
21845 1
0.1%
18605 1
0.1%
15948 1
0.1%
15434 1
0.1%
13886 1
0.1%
13672 1
0.1%
13554 1
0.1%
12297 1
0.1%
11106 1
0.1%
8419 1
0.1%

REG_DT
Date

Distinct61
Distinct (%)6.1%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
Minimum2013-12-05 13:12:34
Maximum2017-09-25 11:03:59
2023-12-12T21:14:47.836129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:14:48.050048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-12T21:14:44.465363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:14:44.160729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:14:44.623929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:14:44.314384image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:14:48.174007image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
LIQD_PLAN_CDLOAN_ORG_CDPROD_GRP_CDLOAN_ORG_NMPROD_GRP_NMLOAN_RAMTLOAN_CNTREG_DT
LIQD_PLAN_CD1.0000.3770.9230.4540.9230.5880.6161.000
LOAN_ORG_CD0.3771.0000.7401.0000.7400.2380.1400.377
PROD_GRP_CD0.9230.7401.0000.7031.0000.2530.2410.923
LOAN_ORG_NM0.4541.0000.7031.0000.7030.2770.2260.454
PROD_GRP_NM0.9230.7401.0000.7031.0000.2530.2410.923
LOAN_RAMT0.5880.2380.2530.2770.2531.0000.9440.588
LOAN_CNT0.6160.1400.2410.2260.2410.9441.0000.616
REG_DT1.0000.3770.9230.4540.9230.5880.6161.000
2023-12-12T21:14:48.333852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
PROD_GRP_NMLOAN_ORG_CDLOAN_ORG_NMPROD_GRP_CD
PROD_GRP_NM1.0000.4630.4331.000
LOAN_ORG_CD0.4631.0001.0000.463
LOAN_ORG_NM0.4331.0001.0000.433
PROD_GRP_CD1.0000.4630.4331.000
2023-12-12T21:14:48.457976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
LOAN_RAMTLOAN_CNTLOAN_ORG_CDPROD_GRP_CDLOAN_ORG_NMPROD_GRP_NM
LOAN_RAMT1.0000.9880.0930.1570.1170.157
LOAN_CNT0.9881.0000.0520.1410.0880.141
LOAN_ORG_CD0.0930.0521.0000.4631.0000.463
PROD_GRP_CD0.1570.1410.4631.0000.4331.000
LOAN_ORG_NM0.1170.0881.0000.4331.0000.433
PROD_GRP_NM0.1570.1410.4631.0000.4331.000

Missing values

2023-12-12T21:14:44.821703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:14:45.019614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

LIQD_PLAN_CDLOAN_ORG_CDPROD_GRP_CDLOAN_ORG_NMPROD_GRP_NMLOAN_RAMTLOAN_CNTREG_DT
0KHFCMB2017S-23I001T삼성생명u-보금자리론1713034563132017/09/25 11:03:59
1KHFCMB2017S-23I001C삼성생명Conforming Loan4537885718342017/09/25 11:03:59
2KHFCMB2017S-23B088T신한은행u-보금자리론12236378279962017/09/25 11:03:59
3KHFCMB2017S-23B088C신한은행Conforming Loan260012983792092017/09/25 11:03:59
4KHFCMB2017S-23B088A신한은행내집마련 디딤돌445012100213632017/09/25 11:03:59
5KHFCMB2017S-23B081C하나은행Conforming Loan9453052099582017/09/25 11:03:59
6KHFCMB2017S-23B081A하나은행내집마련 디딤돌316452062322592017/09/25 11:03:59
7KHFCMB2017S-23B039T경남은행u-보금자리론4922363393422017/09/25 11:03:59
8KHFCMB2017S-23B039C경남은행Conforming Loan216469617521692017/09/25 11:03:59
9KHFCMB2017S-23B039A경남은행내집마련 디딤돌1481640224152017/09/25 11:03:59
LIQD_PLAN_CDLOAN_ORG_CDPROD_GRP_CDLOAN_ORG_NMPROD_GRP_NMLOAN_RAMTLOAN_CNTREG_DT
990KHFCMB2007S-04B003M기업은행(t+e)-보금자리론4074321940562013/12/05 13:15:28
991KHFCMB2007S-03I002M<NA>(t+e)-보금자리론120784822361782013/12/05 13:15:18
992KHFCMB2007S-03I001M삼성생명(t+e)-보금자리론152063082061892013/12/05 13:15:18
993KHFCMB2007S-03B088M신한은행(t+e)-보금자리론4804722297702013/12/05 13:15:18
994KHFCMB2007S-03B081M하나은행(t+e)-보금자리론12968164543515182013/12/05 13:15:18
995KHFCMB2007S-03B039M경남은행(t+e)-보금자리론157507958402882013/12/05 13:15:18
996KHFCMB2007S-03B034M광주은행(t+e)-보금자리론4173233402982013/12/05 13:15:18
997KHFCMB2007S-03B027M씨티은행(t+e)-보금자리론5610047403752013/12/05 13:15:18
998KHFCMB2007S-03B023MSC은행(t+e)-보금자리론257514930513342013/12/05 13:15:18
999KHFCMB2007S-03B020M우리은행(t+e)-보금자리론301436881483982013/12/05 13:15:18