Overview

Dataset statistics

Number of variables6
Number of observations364
Missing cells46
Missing cells (%)2.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory18.6 KiB
Average record size in memory52.4 B

Variable types

Text1
DateTime1
Numeric4

Dataset

Description한국주택금융공사에서 발행한 주택저당채권담보부채권(MBB) 현황에 대한 데이터 입니다. 주택저당증권, 트랜치, 발행금액 등의 정보가 포함되어있으며 공공데이터 개방 정책에 따라 등록되었습니다.
Author한국주택금융공사
URLhttps://www.data.go.kr/data/15073663/fileData.do

Alerts

발행금액 is highly overall correlated with 연월말 모기지담보증권 잔액 and 1 other fieldsHigh correlation
연월말 모기지담보증권 잔액 is highly overall correlated with 발행금액 and 1 other fieldsHigh correlation
가중평균 발행금리 is highly overall correlated with 발행금액 and 1 other fieldsHigh correlation
가중평균 발행금리 has 46 (12.6%) missing valuesMissing
주택저당증권 has unique valuesUnique
연월말 모기지담보증권 잔액 has 38 (10.4%) zerosZeros

Reproduction

Analysis started2023-12-11 23:59:35.589638
Analysis finished2023-12-11 23:59:37.681338
Duration2.09 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

주택저당증권
Text

UNIQUE 

Distinct364
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
2023-12-12T08:59:37.955500image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length9.6950549
Min length9

Characters and Unicode

Total characters3529
Distinct characters14
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique364 ?
Unique (%)100.0%

Sample

1st rowMBS 20001
2nd rowMBS 20002
3rd rowMBS 20003
4th rowMBS 20011
5th rowMBS 20012
ValueCountFrequency (%)
mbs 364
50.0%
201625 1
 
0.1%
201623 1
 
0.1%
201622 1
 
0.1%
201621 1
 
0.1%
201620 1
 
0.1%
201619 1
 
0.1%
201618 1
 
0.1%
201617 1
 
0.1%
201616 1
 
0.1%
Other values (355) 355
48.8%
2023-12-12T08:59:38.384064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 546
15.5%
0 527
14.9%
1 470
13.3%
368
10.4%
M 364
10.3%
B 364
10.3%
S 364
10.3%
3 100
 
2.8%
5 74
 
2.1%
9 71
 
2.0%
Other values (4) 281
8.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2069
58.6%
Uppercase Letter 1092
30.9%
Space Separator 368
 
10.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 546
26.4%
0 527
25.5%
1 470
22.7%
3 100
 
4.8%
5 74
 
3.6%
9 71
 
3.4%
8 71
 
3.4%
6 70
 
3.4%
4 70
 
3.4%
7 70
 
3.4%
Uppercase Letter
ValueCountFrequency (%)
M 364
33.3%
B 364
33.3%
S 364
33.3%
Space Separator
ValueCountFrequency (%)
368
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2437
69.1%
Latin 1092
30.9%

Most frequent character per script

Common
ValueCountFrequency (%)
2 546
22.4%
0 527
21.6%
1 470
19.3%
368
15.1%
3 100
 
4.1%
5 74
 
3.0%
9 71
 
2.9%
8 71
 
2.9%
6 70
 
2.9%
4 70
 
2.9%
Latin
ValueCountFrequency (%)
M 364
33.3%
B 364
33.3%
S 364
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3529
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 546
15.5%
0 527
14.9%
1 470
13.3%
368
10.4%
M 364
10.3%
B 364
10.3%
S 364
10.3%
3 100
 
2.8%
5 74
 
2.1%
9 71
 
2.0%
Other values (4) 281
8.0%
Distinct357
Distinct (%)98.1%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
Minimum2000-04-07 00:00:00
Maximum2020-06-26 00:00:00
2023-12-12T08:59:38.556872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T08:59:38.698278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

트랜치
Real number (ℝ)

Distinct11
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.7142857
Minimum1
Maximum13
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.3 KiB
2023-12-12T08:59:38.839571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q19
median9
Q39
95-th percentile9
Maximum13
Range12
Interquartile range (IQR)0

Descriptive statistics

Standard deviation2.7485462
Coefficient of variation (CV)0.35629302
Kurtosis1.4521457
Mean7.7142857
Median Absolute Deviation (MAD)0
Skewness-1.6730485
Sum2808
Variance7.5545061
MonotonicityNot monotonic
2023-12-12T08:59:38.969205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
9 277
76.1%
1 41
 
11.3%
5 15
 
4.1%
4 12
 
3.3%
7 10
 
2.7%
13 3
 
0.8%
8 2
 
0.5%
11 1
 
0.3%
10 1
 
0.3%
3 1
 
0.3%
ValueCountFrequency (%)
1 41
 
11.3%
2 1
 
0.3%
3 1
 
0.3%
4 12
 
3.3%
5 15
 
4.1%
7 10
 
2.7%
8 2
 
0.5%
9 277
76.1%
10 1
 
0.3%
11 1
 
0.3%
ValueCountFrequency (%)
13 3
 
0.8%
11 1
 
0.3%
10 1
 
0.3%
9 277
76.1%
8 2
 
0.5%
7 10
 
2.7%
5 15
 
4.1%
4 12
 
3.3%
3 1
 
0.3%
2 1
 
0.3%

발행금액
Real number (ℝ)

HIGH CORRELATION 

Distinct357
Distinct (%)98.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8186.1071
Minimum166
Maximum50628
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.3 KiB
2023-12-12T08:59:39.101269image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum166
5-th percentile1536.45
Q13859.5
median6003
Q310775.75
95-th percentile17781.6
Maximum50628
Range50462
Interquartile range (IQR)6916.25

Descriptive statistics

Standard deviation7061.6363
Coefficient of variation (CV)0.86263668
Kurtosis9.7025693
Mean8186.1071
Median Absolute Deviation (MAD)2611
Skewness2.6660505
Sum2979743
Variance49866707
MonotonicityNot monotonic
2023-12-12T08:59:39.249273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
8614 3
 
0.8%
3671 2
 
0.5%
5102 2
 
0.5%
6714 2
 
0.5%
2203 2
 
0.5%
4020 2
 
0.5%
3976 1
 
0.3%
15226 1
 
0.3%
3051 1
 
0.3%
15290 1
 
0.3%
Other values (347) 347
95.3%
ValueCountFrequency (%)
166 1
0.3%
180 1
0.3%
399 1
0.3%
480 1
0.3%
490 1
0.3%
499 1
0.3%
699 1
0.3%
808 1
0.3%
959 1
0.3%
967 1
0.3%
ValueCountFrequency (%)
50628 1
0.3%
40457 1
0.3%
40416 1
0.3%
39866 1
0.3%
39548 1
0.3%
39318 1
0.3%
39150 1
0.3%
35782 1
0.3%
33911 1
0.3%
33591 1
0.3%

연월말 모기지담보증권 잔액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct297
Distinct (%)81.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3722.4643
Minimum0
Maximum33591
Zeros38
Zeros (%)10.4%
Negative0
Negative (%)0.0%
Memory size3.3 KiB
2023-12-12T08:59:39.740626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q198.75
median660.5
Q36332.25
95-th percentile14742.35
Maximum33591
Range33591
Interquartile range (IQR)6233.5

Descriptive statistics

Standard deviation5289.1178
Coefficient of variation (CV)1.4208646
Kurtosis5.0716422
Mean3722.4643
Median Absolute Deviation (MAD)660.5
Skewness1.9984928
Sum1354977
Variance27974768
MonotonicityNot monotonic
2023-12-12T08:59:39.926375image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 38
 
10.4%
4 11
 
3.0%
3 5
 
1.4%
72 3
 
0.8%
27 2
 
0.5%
5909 2
 
0.5%
7004 2
 
0.5%
193 2
 
0.5%
220 2
 
0.5%
96 2
 
0.5%
Other values (287) 295
81.0%
ValueCountFrequency (%)
0 38
10.4%
3 5
 
1.4%
4 11
 
3.0%
5 1
 
0.3%
7 1
 
0.3%
13 1
 
0.3%
14 1
 
0.3%
26 1
 
0.3%
27 2
 
0.5%
30 1
 
0.3%
ValueCountFrequency (%)
33591 1
0.3%
29917 1
0.3%
22664 1
0.3%
21966 1
0.3%
20362 1
0.3%
19269 1
0.3%
18425 1
0.3%
18396 1
0.3%
18295 1
0.3%
18294 1
0.3%

가중평균 발행금리
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct221
Distinct (%)69.5%
Missing46
Missing (%)12.6%
Infinite0
Infinite (%)0.0%
Mean3.2143711
Minimum1.36
Maximum9.66
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.3 KiB
2023-12-12T08:59:40.113754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.36
5-th percentile1.65
Q12.1925
median2.845
Q33.9875
95-th percentile5.7115
Maximum9.66
Range8.3
Interquartile range (IQR)1.795

Descriptive statistics

Standard deviation1.41625
Coefficient of variation (CV)0.44059942
Kurtosis1.8387673
Mean3.2143711
Median Absolute Deviation (MAD)0.78
Skewness1.2627941
Sum1022.17
Variance2.0057641
MonotonicityNot monotonic
2023-12-12T08:59:40.282263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3.11 5
 
1.4%
2.29 5
 
1.4%
2.22 5
 
1.4%
1.72 4
 
1.1%
3.12 4
 
1.1%
3.16 4
 
1.1%
2.31 4
 
1.1%
2.49 3
 
0.8%
3.08 3
 
0.8%
1.98 3
 
0.8%
Other values (211) 278
76.4%
(Missing) 46
 
12.6%
ValueCountFrequency (%)
1.36 1
0.3%
1.37 1
0.3%
1.47 1
0.3%
1.48 1
0.3%
1.51 1
0.3%
1.52 1
0.3%
1.53 1
0.3%
1.54 1
0.3%
1.59 1
0.3%
1.61 1
0.3%
ValueCountFrequency (%)
9.66 1
0.3%
8.68 1
0.3%
8.33 1
0.3%
7.78 1
0.3%
7.36 1
0.3%
7.1 1
0.3%
6.64 1
0.3%
6.62 1
0.3%
6.26 1
0.3%
5.88 1
0.3%

Interactions

2023-12-12T08:59:37.101126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T08:59:35.776310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T08:59:36.289527image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T08:59:36.696020image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T08:59:37.181440image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T08:59:35.897923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T08:59:36.391240image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T08:59:36.815278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T08:59:37.272006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T08:59:36.003142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T08:59:36.485433image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T08:59:36.918776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T08:59:37.383356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T08:59:36.175831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T08:59:36.601110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T08:59:37.012092image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T08:59:40.388585image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
트랜치발행금액연월말 모기지담보증권 잔액가중평균 발행금리
트랜치1.0000.2770.3240.788
발행금액0.2771.0000.9690.503
연월말 모기지담보증권 잔액0.3240.9691.0000.531
가중평균 발행금리0.7880.5030.5311.000
2023-12-12T08:59:40.501451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
트랜치발행금액연월말 모기지담보증권 잔액가중평균 발행금리
트랜치1.0000.4150.1730.068
발행금액0.4151.0000.792-0.737
연월말 모기지담보증권 잔액0.1730.7921.000-0.887
가중평균 발행금리0.068-0.737-0.8871.000

Missing values

2023-12-12T08:59:37.512125image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T08:59:37.628019image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

주택저당증권발행일트랜치발행금액연월말 모기지담보증권 잔액가중평균 발행금리
0MBS 200012000-04-0711397609.66
1MBS 200022000-09-0113500008.68
2MBS 200032000-12-0813381308.33
3MBS 200112001-05-1813237707.78
4MBS 200122001-09-208505006.62
5MBS 200212002-01-23418007.1
6MBS 200222002-02-219510207.36
7MBS 200312003-04-029310005.51
8MBS 200322003-08-04716605.66
9MBS 200412004-06-1510552005.0
주택저당증권발행일트랜치발행금액연월말 모기지담보증권 잔액가중평균 발행금리
354MBS 2020122020-04-289804080401.76
355MBS 2020132020-05-129812581251.65
356MBS 2020142020-05-155404440441.75
357MBS 2020152020-05-199861486141.66
358MBS 2020162020-05-225504050401.73
359MBS 2020172020-05-26912084120841.65
360MBS 2020182020-06-09913104131041.66
361MBS 2020192020-06-16914010140101.63
362MBS 2020202020-06-23914291142911.71
363MBS 2020212020-06-269861486141.68