Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 8 |
Duplicate rows (%) | 8.0% |
Total size in memory | 5.7 KiB |
Average record size in memory | 58.3 B |
Variable types
Text | 2 |
---|---|
Categorical | 4 |
Numeric | 1 |
Dataset
Description | 한국주택금융공사의 DM발송정보에 대한 데이터로, 발송일, 최고서종류 등 에 대한 정보를 포함하고 있습니다. 공공데이터 개방 정책에 따라 공개됩니다. |
---|---|
Author | 한국주택금융공사 |
URL | https://www.data.go.kr/data/15073300/fileData.do |
Dataset has 8 (8.0%) duplicate rows | Duplicates |
등록자사번 is highly overall correlated with 최고서종류 and 3 other fields | High correlation |
발송부점 is highly overall correlated with 최고서종류 and 2 other fields | High correlation |
최고서종류 is highly overall correlated with 최고일 and 2 other fields | High correlation |
발송일 is highly overall correlated with 최고일 and 2 other fields | High correlation |
최고일 is highly overall correlated with 최고서종류 and 2 other fields | High correlation |
발송일 is highly imbalanced (89.8%) | Imbalance |
발송부점 is highly imbalanced (63.0%) | Imbalance |
등록자사번 is highly imbalanced (51.5%) | Imbalance |
Reproduction
Analysis started | 2023-12-12 00:30:21.440052 |
---|---|
Analysis finished | 2023-12-12 00:30:22.215337 |
Duration | 0.78 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
유동화계획코드
Text
Distinct | 64 |
---|---|
Distinct (%) | 64.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 14 |
---|---|
Median length | 14 |
Mean length | 14 |
Min length | 14 |
Characters and Unicode
Total characters | 1400 |
---|---|
Distinct characters | 20 |
Distinct categories | 3 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 38 ? |
---|---|
Unique (%) | 38.0% |
Sample
1st row | KHFCMB2017S_14 |
---|---|
2nd row | KHFCMB2013S_22 |
3rd row | KHFCMB2017S_02 |
4th row | KHFCMB2017S_13 |
5th row | KHFCMB2017S_04 |
Value | Count | Frequency (%) |
khfcmb2014s_19 | 4 | 4.0% |
khfcmb2012s_31 | 4 | 4.0% |
khfcmb2017s_05 | 4 | 4.0% |
khfcmb2013s_27 | 3 | 3.0% |
khfcmb2017s_04 | 3 | 3.0% |
khfcmb2012s_17 | 3 | 3.0% |
khfcmb2015s_26 | 3 | 3.0% |
khfcmb2017s_27 | 2 | 2.0% |
khfcmb2016s_25 | 2 | 2.0% |
khfcmb2015s_17 | 2 | 2.0% |
Other values (54) | 70 |
Most occurring characters
Value | Count | Frequency (%) |
2 | 151 | |
1 | 145 | |
0 | 135 | |
B | 101 | 7.2% |
K | 100 | 7.1% |
H | 100 | 7.1% |
F | 100 | 7.1% |
C | 100 | 7.1% |
M | 100 | 7.1% |
_ | 100 | 7.1% |
Other values (10) | 268 |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 702 | |
Decimal Number | 598 | |
Connector Punctuation | 100 | 7.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
2 | 151 | |
1 | 145 | |
0 | 135 | |
7 | 38 | 6.4% |
3 | 32 | 5.4% |
5 | 29 | 4.8% |
6 | 24 | 4.0% |
8 | 22 | 3.7% |
4 | 15 | 2.5% |
9 | 7 | 1.2% |
Uppercase Letter
Value | Count | Frequency (%) |
B | 101 | |
K | 100 | |
H | 100 | |
F | 100 | |
C | 100 | |
M | 100 | |
S | 97 | |
L | 2 | 0.3% |
A | 2 | 0.3% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 100 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 702 | |
Common | 698 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
2 | 151 | |
1 | 145 | |
0 | 135 | |
_ | 100 | |
7 | 38 | 5.4% |
3 | 32 | 4.6% |
5 | 29 | 4.2% |
6 | 24 | 3.4% |
8 | 22 | 3.2% |
4 | 15 | 2.1% |
Latin
Value | Count | Frequency (%) |
B | 101 | |
K | 100 | |
H | 100 | |
F | 100 | |
C | 100 | |
M | 100 | |
S | 97 | |
L | 2 | 0.3% |
A | 2 | 0.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1400 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
2 | 151 | |
1 | 145 | |
0 | 135 | |
B | 101 | 7.2% |
K | 100 | 7.1% |
H | 100 | 7.1% |
F | 100 | 7.1% |
C | 100 | 7.1% |
M | 100 | 7.1% |
_ | 100 | 7.1% |
Other values (10) | 268 |
보유목적코드
Text
Distinct | 77 |
---|---|
Distinct (%) | 77.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 14 |
---|---|
Median length | 14 |
Mean length | 14 |
Min length | 14 |
Characters and Unicode
Total characters | 1400 |
---|---|
Distinct characters | 12 |
Distinct categories | 3 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 57 ? |
---|---|
Unique (%) | 57.0% |
Sample
1st row | B088_2017_0078 |
---|---|
2nd row | B088_2013_0019 |
3rd row | B020_2017_0010 |
4th row | B081_2017_0063 |
5th row | B003_2017_0024 |
Value | Count | Frequency (%) |
b081_2018_0026 | 3 | 3.0% |
b003_2012_0035 | 3 | 3.0% |
b003_2017_0030 | 3 | 3.0% |
b081_2015_0092 | 2 | 2.0% |
b081_2017_0137 | 2 | 2.0% |
b010_2017_0071 | 2 | 2.0% |
b081_2012_0043 | 2 | 2.0% |
b010_2017_0121 | 2 | 2.0% |
b004_2012_0027 | 2 | 2.0% |
b004_2015_0022 | 2 | 2.0% |
Other values (67) | 77 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 465 | |
_ | 200 | |
1 | 181 | 12.9% |
2 | 167 | 11.9% |
B | 100 | 7.1% |
8 | 74 | 5.3% |
3 | 61 | 4.4% |
4 | 42 | 3.0% |
7 | 41 | 2.9% |
6 | 30 | 2.1% |
Other values (2) | 39 | 2.8% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 1100 | |
Connector Punctuation | 200 | 14.3% |
Uppercase Letter | 100 | 7.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 465 | |
1 | 181 | 16.5% |
2 | 167 | 15.2% |
8 | 74 | 6.7% |
3 | 61 | 5.5% |
4 | 42 | 3.8% |
7 | 41 | 3.7% |
6 | 30 | 2.7% |
5 | 28 | 2.5% |
9 | 11 | 1.0% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 200 |
Uppercase Letter
Value | Count | Frequency (%) |
B | 100 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 1300 | |
Latin | 100 | 7.1% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 465 | |
_ | 200 | |
1 | 181 | 13.9% |
2 | 167 | 12.8% |
8 | 74 | 5.7% |
3 | 61 | 4.7% |
4 | 42 | 3.2% |
7 | 41 | 3.2% |
6 | 30 | 2.3% |
5 | 28 | 2.2% |
Latin
Value | Count | Frequency (%) |
B | 100 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1400 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 465 | |
_ | 200 | |
1 | 181 | 12.9% |
2 | 167 | 11.9% |
B | 100 | 7.1% |
8 | 74 | 5.3% |
3 | 61 | 4.4% |
4 | 42 | 3.0% |
7 | 41 | 2.9% |
6 | 30 | 2.1% |
Other values (2) | 39 | 2.8% |
발송일
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | 3.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
2020-01-09 | |
---|---|
2019-12-26 | 1 |
2019-12-30 | 1 |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 2 ? |
---|---|
Unique (%) | 2.0% |
Sample
1st row | 2020-01-09 |
---|---|
2nd row | 2020-01-09 |
3rd row | 2020-01-09 |
4th row | 2020-01-09 |
5th row | 2020-01-09 |
Common Values
Value | Count | Frequency (%) |
2020-01-09 | 98 | |
2019-12-26 | 1 | 1.0% |
2019-12-30 | 1 | 1.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2020-01-09 | 98 | |
2019-12-26 | 1 | 1.0% |
2019-12-30 | 1 | 1.0% |
최고서종류
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 9 |
---|---|
Distinct (%) | 9.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 21.18 |
Minimum | 12 |
---|---|
Maximum | 27 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 12 |
---|---|
5-th percentile | 13 |
Q1 | 22 |
median | 22 |
Q3 | 23 |
95-th percentile | 23.05 |
Maximum | 27 |
Range | 15 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 3.4855546 |
---|---|
Coefficient of variation (CV) | 0.16456821 |
Kurtosis | 1.4968898 |
Mean | 21.18 |
Median Absolute Deviation (MAD) | 1 |
Skewness | -1.6653702 |
Sum | 2118 |
Variance | 12.149091 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
22 | 41 | |
23 | 38 | |
13 | 9 | 9.0% |
14 | 4 | 4.0% |
24 | 3 | 3.0% |
27 | 2 | 2.0% |
15 | 1 | 1.0% |
12 | 1 | 1.0% |
16 | 1 | 1.0% |
Value | Count | Frequency (%) |
12 | 1 | 1.0% |
13 | 9 | 9.0% |
14 | 4 | 4.0% |
15 | 1 | 1.0% |
16 | 1 | 1.0% |
22 | 41 | |
23 | 38 | |
24 | 3 | 3.0% |
27 | 2 | 2.0% |
Value | Count | Frequency (%) |
27 | 2 | 2.0% |
24 | 3 | 3.0% |
23 | 38 | |
22 | 41 | |
16 | 1 | 1.0% |
15 | 1 | 1.0% |
14 | 4 | 4.0% |
13 | 9 | 9.0% |
12 | 1 | 1.0% |
최고일
Categorical
HIGH CORRELATION
 
Distinct | 9 |
---|---|
Distinct (%) | 9.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
2020-02-19 | |
---|---|
2020-01-29 | |
2020-01-23 | |
2020-01-09 | |
2020-02-10 | 3 |
Other values (4) |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 3 ? |
---|---|
Unique (%) | 3.0% |
Sample
1st row | 2020-01-23 |
---|---|
2nd row | 2020-01-23 |
3rd row | 2020-01-19 |
4th row | 2020-01-09 |
5th row | 2020-01-09 |
Common Values
Value | Count | Frequency (%) |
2020-02-19 | 37 | |
2020-01-29 | 33 | |
2020-01-23 | 12 | 12.0% |
2020-01-09 | 10 | 10.0% |
2020-02-10 | 3 | 3.0% |
2020-01-19 | 2 | 2.0% |
2020-01-16 | 1 | 1.0% |
2020-02-02 | 1 | 1.0% |
2020-01-30 | 1 | 1.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2020-02-19 | 37 | |
2020-01-29 | 33 | |
2020-01-23 | 12 | 12.0% |
2020-01-09 | 10 | 10.0% |
2020-02-10 | 3 | 3.0% |
2020-01-19 | 2 | 2.0% |
2020-01-16 | 1 | 1.0% |
2020-02-02 | 1 | 1.0% |
2020-01-30 | 1 | 1.0% |
발송부점
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 7 |
---|---|
Distinct (%) | 7.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
AAZ | |
---|---|
THB | 6 |
TLB | 4 |
TMB | 4 |
TPA | 1 |
Other values (2) | 2 |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 3 ? |
---|---|
Unique (%) | 3.0% |
Sample
1st row | TPA |
---|---|
2nd row | TLB |
3rd row | TRA |
4th row | AAZ |
5th row | AAZ |
Common Values
Value | Count | Frequency (%) |
AAZ | 83 | |
THB | 6 | 6.0% |
TLB | 4 | 4.0% |
TMB | 4 | 4.0% |
TPA | 1 | 1.0% |
TRA | 1 | 1.0% |
TQA | 1 | 1.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
aaz | 83 | |
thb | 6 | 6.0% |
tlb | 4 | 4.0% |
tmb | 4 | 4.0% |
tpa | 1 | 1.0% |
tra | 1 | 1.0% |
tqa | 1 | 1.0% |
등록자사번
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 9 |
---|---|
Distinct (%) | 9.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
aaz01 | |
---|---|
8889 | |
1601 | 6 |
1854 | 4 |
1598 | 3 |
Other values (4) | 5 |
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 4.73 |
Min length | 4 |
Unique
Unique | 3 ? |
---|---|
Unique (%) | 3.0% |
Sample
1st row | 1913 |
---|---|
2nd row | 1878 |
3rd row | 1604 |
4th row | 8889 |
5th row | 8889 |
Common Values
Value | Count | Frequency (%) |
aaz01 | 73 | |
8889 | 9 | 9.0% |
1601 | 6 | 6.0% |
1854 | 4 | 4.0% |
1598 | 3 | 3.0% |
1679 | 2 | 2.0% |
1913 | 1 | 1.0% |
1878 | 1 | 1.0% |
1604 | 1 | 1.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
aaz01 | 73 | |
8889 | 9 | 9.0% |
1601 | 6 | 6.0% |
1854 | 4 | 4.0% |
1598 | 3 | 3.0% |
1679 | 2 | 2.0% |
1913 | 1 | 1.0% |
1878 | 1 | 1.0% |
1604 | 1 | 1.0% |
유동화계획코드 | 보유목적코드 | 발송일 | 최고서종류 | 최고일 | 발송부점 | 등록자사번 | |
---|---|---|---|---|---|---|---|
유동화계획코드 | 1.000 | 1.000 | 1.000 | 0.884 | 0.937 | 0.996 | 0.979 |
보유목적코드 | 1.000 | 1.000 | 1.000 | 0.796 | 0.927 | 1.000 | 0.979 |
발송일 | 1.000 | 1.000 | 1.000 | 0.452 | 0.940 | 0.753 | 0.922 |
최고서종류 | 0.884 | 0.796 | 0.452 | 1.000 | 0.856 | 0.785 | 0.894 |
최고일 | 0.937 | 0.927 | 0.940 | 0.856 | 1.000 | 0.732 | 0.936 |
발송부점 | 0.996 | 1.000 | 0.753 | 0.785 | 0.732 | 1.000 | 0.976 |
등록자사번 | 0.979 | 0.979 | 0.922 | 0.894 | 0.936 | 0.976 | 1.000 |
등록자사번 | 발송일 | 발송부점 | 최고일 | |
---|---|---|---|---|
등록자사번 | 1.000 | 0.654 | 0.943 | 0.594 |
발송일 | 0.654 | 1.000 | 0.670 | 0.689 |
발송부점 | 0.943 | 0.670 | 1.000 | 0.498 |
최고일 | 0.594 | 0.689 | 0.498 | 1.000 |
최고서종류 | 발송일 | 최고일 | 발송부점 | 등록자사번 | |
---|---|---|---|---|---|
최고서종류 | 1.000 | 0.205 | 0.670 | 0.576 | 0.618 |
발송일 | 0.205 | 1.000 | 0.689 | 0.670 | 0.654 |
최고일 | 0.670 | 0.689 | 1.000 | 0.498 | 0.594 |
발송부점 | 0.576 | 0.670 | 0.498 | 1.000 | 0.943 |
등록자사번 | 0.618 | 0.654 | 0.594 | 0.943 | 1.000 |
유동화계획코드 | 보유목적코드 | 발송일 | 최고서종류 | 최고일 | 발송부점 | 등록자사번 | |
---|---|---|---|---|---|---|---|
0 | KHFCMB2017S_14 | B088_2017_0078 | 2020-01-09 | 13 | 2020-01-23 | TPA | 1913 |
1 | KHFCMB2013S_22 | B088_2013_0019 | 2020-01-09 | 15 | 2020-01-23 | TLB | 1878 |
2 | KHFCMB2017S_02 | B020_2017_0010 | 2020-01-09 | 13 | 2020-01-19 | TRA | 1604 |
3 | KHFCMB2017S_13 | B081_2017_0063 | 2020-01-09 | 22 | 2020-01-09 | AAZ | 8889 |
4 | KHFCMB2017S_04 | B003_2017_0024 | 2020-01-09 | 22 | 2020-01-09 | AAZ | 8889 |
5 | KHFCMB2016S_03 | B081_2016_0010 | 2019-12-26 | 13 | 2020-01-09 | TQA | 1679 |
6 | KHFCMB2017S_25 | B020_2017_0150 | 2019-12-30 | 12 | 2020-01-16 | AAZ | 1679 |
7 | KHFCMB2015S_23 | B081_2015_0092 | 2020-01-09 | 14 | 2020-01-23 | TMB | 1854 |
8 | KHFCMB2015S_23 | B081_2015_0092 | 2020-01-09 | 14 | 2020-01-23 | TMB | 1854 |
9 | KHFCMB2014S_19 | B020_2014_0049 | 2020-01-09 | 23 | 2020-01-09 | AAZ | 8889 |
유동화계획코드 | 보유목적코드 | 발송일 | 최고서종류 | 최고일 | 발송부점 | 등록자사번 | |
---|---|---|---|---|---|---|---|
90 | KHFCMB2015S_17 | B020_2015_0054 | 2020-01-09 | 22 | 2020-02-19 | AAZ | aaz01 |
91 | KHFCMB2013S_27 | B020_2013_0014 | 2020-01-09 | 22 | 2020-02-19 | AAZ | aaz01 |
92 | KHFCMB2019S_17 | B020_2019_0070 | 2020-01-09 | 22 | 2020-02-19 | AAZ | aaz01 |
93 | KHFCMB2018S_13 | B020_2018_0065 | 2020-01-09 | 22 | 2020-02-19 | AAZ | aaz01 |
94 | KHFCMB2014S_18 | B004_2014_0030 | 2020-01-09 | 22 | 2020-02-19 | AAZ | aaz01 |
95 | KHFCMB2015S_21 | B010_2015_0074 | 2020-01-09 | 22 | 2020-02-19 | AAZ | aaz01 |
96 | KHFCMB2015S_25 | B010_2015_0106 | 2020-01-09 | 22 | 2020-02-19 | AAZ | aaz01 |
97 | KHFCMB2012S_31 | B088_2012_0026 | 2020-01-09 | 22 | 2020-02-19 | AAZ | aaz01 |
98 | KHFCMB2012S_03 | B088_2012_0007 | 2020-01-09 | 22 | 2020-02-19 | AAZ | aaz01 |
99 | KHFCMB2011S_21 | B004_2011_0021 | 2020-01-09 | 22 | 2020-02-19 | AAZ | aaz01 |
Most frequently occurring
유동화계획코드 | 보유목적코드 | 발송일 | 최고서종류 | 최고일 | 발송부점 | 등록자사번 | # duplicates | |
---|---|---|---|---|---|---|---|---|
0 | KHFCMB2012S_17 | B003_2012_0035 | 2020-01-09 | 23 | 2020-01-29 | AAZ | aaz01 | 3 |
1 | KHFCMB2012S_20 | B004_2012_0027 | 2020-01-09 | 23 | 2020-01-29 | AAZ | aaz01 | 2 |
2 | KHFCMB2012S_31 | B081_2012_0043 | 2020-01-09 | 23 | 2020-01-29 | AAZ | aaz01 | 2 |
3 | KHFCMB2015S_22 | B081_2015_0088 | 2020-01-09 | 14 | 2020-01-23 | TMB | 1854 | 2 |
4 | KHFCMB2015S_23 | B081_2015_0092 | 2020-01-09 | 14 | 2020-01-23 | TMB | 1854 | 2 |
5 | KHFCMB2017S_05 | B003_2017_0030 | 2020-01-09 | 13 | 2020-01-23 | TLB | 1598 | 2 |
6 | KHFCMB2017S_17 | B010_2017_0071 | 2020-01-09 | 22 | 2020-02-19 | AAZ | aaz01 | 2 |
7 | KHFCMB2018S_12 | B088_2018_0037 | 2020-01-09 | 13 | 2020-01-23 | THB | 1601 | 2 |