Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 2 |
Duplicate rows (%) | 2.0% |
Total size in memory | 6.9 KiB |
Average record size in memory | 70.3 B |
Variable types
Categorical | 6 |
---|---|
Numeric | 1 |
Boolean | 1 |
Dataset
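Description | Provides data on the housing pension (주택연금) newlywed status of customers who use the Bogeumjari Loan (보금자리론), one of the policy mortgage products operated by the Korea Housing Finance Corporation. |
---|---|
Author | Korea Housing Finance Corporation (한국주택금융공사) |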
URL | https://www.data.go.kr/data/15090210/fileData.do |
Alerts
최초등록부점 has constant value "999" | Constant |
Dataset has 2 (2.0%) duplicate rows | Duplicates |
상품대분류 is highly overall correlated with 상품중분류 and 1 other fields | High correlation |
상품중분류 is highly overall correlated with 상품대분류 and 2 other fields | High correlation |
상품 is highly overall correlated with 상품대분류 and 1 other fields | High correlation |
부점 is highly overall correlated with 상품중분류 | High correlation |
신혼부부여부 is highly imbalanced (71.4%) | Imbalance |
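The 71.4% imbalance figure for 신혼부부여부 can be reproduced from the class counts listed in this report (False = 95, True = 5). The sketch below assumes the score is one minus the Shannon entropy of the class distribution, normalized by the maximum entropy for that number of classes. The report does not state its formula, so treat this as an assumption that happens to match the reported value; the function name `imbalance_score` is hypothetical.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (in bits)
    of the class counts and k is the number of classes.

    0.0 means perfectly balanced classes; values near 1.0 mean one class
    dominates. Assumed formula, not taken from the report itself.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2 or total == 0:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


# Class counts reported for 신혼부부여부: False = 95, True = 5.
score = imbalance_score([95, 5])
print(f"{score:.1%}")  # prints 71.4%
```

A 50/50 split would score 0.0 under the same formula, so the score measures how far the column is from balance rather than the minority share itself.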
Reproduction
Analysis started | 2023-12-12 09:16:55.350242 |
---|---|
Analysis finished | 2023-12-12 09:16:56.178905 |
Duration | 0.83 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
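As a quick consistency check, the reported duration can be recomputed from the two timestamps above. This is a minimal sketch using only the Python standard library; the format string is an assumption based on how the timestamps are printed in this report.

```python
from datetime import datetime

# Timestamp layout as printed in the Reproduction table (assumed format).
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-12-12 09:16:55.350242", FMT)
finished = datetime.strptime("2023-12-12 09:16:56.178905", FMT)

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")  # prints 0.83 seconds
```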
상품대분류
Categorical
HIGH CORRELATION
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2 |
---|---|
2nd row | 2 |
3rd row | 2 |
4th row | 2 |
5th row | 2 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
2 | 57 | 57.0% |
1 | 43 | 43.0% |
상품중분류
Categorical
HIGH CORRELATION
Distinct | 3 |
---|---|
Distinct (%) | 3.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 3 |
---|---|
2nd row | 3 |
3rd row | 3 |
4th row | 3 |
5th row | 3 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
3 | 53 | 53.0% |
1 | 43 | 43.0% |
2 | 4 | 4.0% |
상품
Categorical
HIGH CORRELATION
Distinct | 5 |
---|---|
Distinct (%) | 5.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 6 |
Min length | 6 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 120301 |
---|---|
2nd row | 120301 |
3rd row | 120301 |
4th row | 120301 |
5th row | 120301 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
120301 | 33 | 33.0% |
110108 | 24 | 24.0% |
120305 | 20 | 20.0% |
110101 | 19 | 19.0% |
120201 | 4 | 4.0% |
금액
Real number (ℝ)
Distinct | 66 |
---|---|
Distinct (%) | 66.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.62106 × 10⁸ |
Minimum | 20000000 |
---|---|
Maximum | 4.35 × 10⁸ |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 20000000 |
---|---|
5-th percentile | 35700000 |
Q1 | 80000000 |
median | 1.46 × 10⁸ |
Q3 | 2.38 × 10⁸ |
95-th percentile | 3.181 × 10⁸ |
Maximum | 4.35 × 10⁸ |
Range | 4.15 × 10⁸ |
Interquartile range (IQR) | 1.58 × 10⁸ |
Descriptive statistics
Standard deviation | 1.0073767 × 10⁸ |
---|---|
Coefficient of variation (CV) | 0.62143083 |
Kurtosis | -0.61748243 |
Mean | 1.62106 × 10⁸ |
Median Absolute Deviation (MAD) | 75000000 |
Skewness | 0.55546501 |
Sum | 1.62106 × 10¹⁰ |
Variance | 1.0148078 × 10¹⁶ |
Monotonicity | Not monotonic |
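Several of the 금액 statistics above follow from others by simple arithmetic: the sum is the mean times the number of observations, the range and IQR are differences of quantiles, the coefficient of variation is the standard deviation divided by the mean, and the variance is the standard deviation squared. The sketch below recomputes them from the reported figures as a sanity check. The dictionary name `derived` is only illustrative, and the inputs are the rounded values shown in this report, so the results agree with it only up to rounding.

```python
# Values as reported for 금액 (100 observations).
n = 100
mean = 1.62106e8
std = 1.0073767e8
minimum, maximum = 2.0e7, 4.35e8
q1, q3 = 8.0e7, 2.38e8

# Statistics that can be derived from the ones above.
derived = {
    "sum": mean * n,              # reported: 1.62106 × 10¹⁰
    "range": maximum - minimum,   # reported: 4.15 × 10⁸
    "iqr": q3 - q1,               # reported: 1.58 × 10⁸
    "cv": std / mean,             # reported: 0.62143083
    "variance": std ** 2,         # reported: 1.0148078 × 10¹⁶
}

for name, value in derived.items():
    print(f"{name}: {value:.8g}")
```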
Common Values
Value | Count | Frequency (%) |
---|---|---|
300000000 | 9 | 9.0% |
100000000 | 6 | 6.0% |
200000000 | 4 | 4.0% |
156000000 | 4 | 4.0% |
50000000 | 3 | 3.0% |
140000000 | 3 | 3.0% |
80000000 | 3 | 3.0% |
270000000 | 3 | 3.0% |
30000000 | 2 | 2.0% |
90000000 | 2 | 2.0% |
Other values (56) | 61 | 61.0% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
20000000 | 2 | 2.0% |
23000000 | 1 | 1.0% |
30000000 | 2 | 2.0% |
36000000 | 1 | 1.0% |
40000000 | 2 | 2.0% |
41400000 | 1 | 1.0% |
42000000 | 1 | 1.0% |
45700000 | 1 | 1.0% |
50000000 | 3 | 3.0% |
52000000 | 1 | 1.0% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
435000000 | 1 | 1.0% |
412000000 | 1 | 1.0% |
382000000 | 1 | 1.0% |
332000000 | 1 | 1.0% |
320000000 | 1 | 1.0% |
318000000 | 1 | 1.0% |
300000000 | 9 | 9.0% |
294000000 | 1 | 1.0% |
292000000 | 1 | 1.0% |
281000000 | 1 | 1.0% |
부점
Categorical
HIGH CORRELATION
Distinct | 22 |
---|---|
Distinct (%) | 22.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 2.98 |
Min length | 1 |
Unique
Unique | 5 |
---|---|
Unique (%) | 5.0% |
Sample
1st row | QAD |
---|---|
2nd row | TAA |
3rd row | TBA |
4th row | TBA |
5th row | QAD |
Common Values
Value | Count | Frequency (%) |
---|---|---|
THO | 13 | 13.0% |
THA | 12 | 12.0% |
TLA | 9 | 9.0% |
TPB | 8 | 8.0% |
QAD | 7 | 7.0% |
TAC | 7 | 7.0% |
THB | 7 | 7.0% |
TBA | 6 | 6.0% |
TAB | 5 | 5.0% |
TLB | 4 | 4.0% |
Other values (12) | 22 | 22.0% |
최초등록일시
Categorical
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2021/06/08 |
---|---|
2nd row | 2021/06/08 |
3rd row | 2021/06/08 |
4th row | 2021/06/08 |
5th row | 2021/06/08 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
2021/06/08 | 76 | 76.0% |
2021/06/07 | 24 | 24.0% |
최초등록부점
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 999 |
---|---|
2nd row | 999 |
3rd row | 999 |
4th row | 999 |
5th row | 999 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
999 | 100 | 100.0% |
신혼부부여부
Boolean
IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 232.0 B |
Common Values
Value | Count | Frequency (%) |
---|---|---|
False | 95 | 95.0% |
True | 5 | 5.0% |
Correlations
Variable | 상품대분류 | 상품중분류 | 상품 | 금액 | 부점 | 최초등록일시 | 신혼부부여부 |
---|---|---|---|---|---|---|---|
상품대분류 | 1.000 | 1.000 | 1.000 | 0.553 | 0.641 | 0.656 | 0.301 |
상품중분류 | 1.000 | 1.000 | 1.000 | 0.462 | 0.781 | 0.293 | 0.136 |
상품 | 1.000 | 1.000 | 1.000 | 0.653 | 0.691 | 0.376 | 0.146 |
금액 | 0.553 | 0.462 | 0.653 | 1.000 | 0.000 | 0.527 | 0.000 |
부점 | 0.641 | 0.781 | 0.691 | 0.000 | 1.000 | 0.166 | 0.000 |
최초등록일시 | 0.656 | 0.293 | 0.376 | 0.527 | 0.166 | 1.000 | 0.000 |
신혼부부여부 | 0.301 | 0.136 | 0.146 | 0.000 | 0.000 | 0.000 | 1.000 |
Variable | 신혼부부여부 | 상품대분류 | 최초등록일시 | 상품중분류 | 부점 | 상품 |
---|---|---|---|---|---|---|
신혼부부여부 | 1.000 | 0.194 | 0.000 | 0.224 | 0.000 | 0.175 |
상품대분류 | 0.194 | 1.000 | 0.456 | 0.995 | 0.457 | 0.985 |
최초등록일시 | 0.000 | 0.456 | 1.000 | 0.471 | 0.108 | 0.451 |
상품중분류 | 0.224 | 0.995 | 0.471 | 1.000 | 0.525 | 0.990 |
부점 | 0.000 | 0.457 | 0.108 | 0.525 | 1.000 | 0.387 |
상품 | 0.175 | 0.985 | 0.451 | 0.990 | 0.387 | 1.000 |
Variable | 금액 | 상품대분류 | 상품중분류 | 상품 | 부점 | 최초등록일시 | 신혼부부여부 |
---|---|---|---|---|---|---|---|
금액 | 1.000 | 0.420 | 0.311 | 0.311 | 0.000 | 0.386 | 0.000 |
상품대분류 | 0.420 | 1.000 | 0.995 | 0.985 | 0.457 | 0.456 | 0.194 |
상품중분류 | 0.311 | 0.995 | 1.000 | 0.990 | 0.525 | 0.471 | 0.224 |
상품 | 0.311 | 0.985 | 0.990 | 1.000 | 0.387 | 0.451 | 0.175 |
부점 | 0.000 | 0.457 | 0.525 | 0.387 | 1.000 | 0.108 | 0.000 |
최초등록일시 | 0.386 | 0.456 | 0.471 | 0.451 | 0.108 | 1.000 | 0.000 |
신혼부부여부 | 0.000 | 0.194 | 0.224 | 0.175 | 0.000 | 0.000 | 1.000 |
First rows
Row | 상품대분류 | 상품중분류 | 상품 | 금액 | 부점 | 최초등록일시 | 최초등록부점 | 신혼부부여부 |
---|---|---|---|---|---|---|---|---|
0 | 2 | 3 | 120301 | 318000000 | QAD | 2021/06/08 | 999 | N |
1 | 2 | 3 | 120301 | 272000000 | TAA | 2021/06/08 | 999 | N |
2 | 2 | 3 | 120301 | 40000000 | TBA | 2021/06/08 | 999 | N |
3 | 2 | 3 | 120301 | 80000000 | TBA | 2021/06/08 | 999 | N |
4 | 2 | 3 | 120301 | 98000000 | QAD | 2021/06/08 | 999 | N |
5 | 2 | 3 | 120301 | 150000000 | THA | 2021/06/08 | 999 | N |
6 | 2 | 3 | 120301 | 435000000 | THA | 2021/06/08 | 999 | N |
7 | 2 | 3 | 120301 | 50000000 | THB | 2021/06/08 | 999 | N |
8 | 2 | 3 | 120301 | 41400000 | THB | 2021/06/08 | 999 | N |
9 | 2 | 3 | 120301 | 42000000 | TJA | 2021/06/08 | 999 | N |
Last rows
Row | 상품대분류 | 상품중분류 | 상품 | 금액 | 부점 | 최초등록일시 | 최초등록부점 | 신혼부부여부 |
---|---|---|---|---|---|---|---|---|
90 | 2 | 3 | 120301 | 65000000 | TAA | 2021/06/07 | 999 | N |
91 | 2 | 3 | 120301 | 80500000 | TJA | 2021/06/07 | 999 | N |
92 | 2 | 3 | 120301 | 100000000 | THO | 2021/06/07 | 999 | N |
93 | 2 | 3 | 120301 | 300000000 | TLB | 2021/06/07 | 999 | N |
94 | 2 | 3 | 120301 | 120000000 | THB | 2021/06/07 | 999 | N |
95 | 2 | 3 | 120301 | 90000000 | TLA | 2021/06/07 | 999 | N |
96 | 2 | 3 | 120301 | 73500000 | TLA | 2021/06/07 | 999 | N |
97 | 2 | 3 | 120301 | 60000000 | THO | 2021/06/07 | 999 | N |
98 | 2 | 3 | 120301 | 45700000 | THO | 2021/06/07 | 999 | N |
99 | 2 | 3 | 120301 | 23000000 | TPB | 2021/06/07 | 999 | N |
Most frequently occurring
Row | 상품대분류 | 상품중분류 | 상품 | 금액 | 부점 | 최초등록일시 | 최초등록부점 | 신혼부부여부 | # duplicates |
---|---|---|---|---|---|---|---|---|---|
0 | 1 | 1 | 110108 | 300000000 | THB | 2021/06/08 | 999 | N | 2 |
1 | 2 | 3 | 120305 | 156000000 | QAD | 2021/06/08 | 999 | N | 2 |
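The duplicate-row table above can be reproduced by counting identical records. The following is a minimal sketch using only the standard library. The `rows` list is an illustrative subset built from rows shown in this report (rows 0 and 1 of the first-rows table, plus the two duplicated rows entered twice each), not the full 100-row dataset, and the names `COLUMNS`, `rows`, and `duplicates` are hypothetical.

```python
from collections import Counter

COLUMNS = ("상품대분류", "상품중분류", "상품", "금액", "부점",
           "최초등록일시", "최초등록부점", "신혼부부여부")

# Illustrative subset of the dataset (see the note above).
rows = [
    ("2", "3", "120301", 318000000, "QAD", "2021/06/08", "999", "N"),
    ("2", "3", "120301", 272000000, "TAA", "2021/06/08", "999", "N"),
    ("1", "1", "110108", 300000000, "THB", "2021/06/08", "999", "N"),
    ("1", "1", "110108", 300000000, "THB", "2021/06/08", "999", "N"),
    ("2", "3", "120305", 156000000, "QAD", "2021/06/08", "999", "N"),
    ("2", "3", "120305", 156000000, "QAD", "2021/06/08", "999", "N"),
]

# Count how often each full record occurs; keep those seen more than once.
counts = Counter(rows)
duplicates = {row: c for row, c in counts.items() if c > 1}

for row, c in duplicates.items():
    print(dict(zip(COLUMNS, row)), "# duplicates:", c)
```

Tuples are used for the records because they are hashable, which lets `Counter` treat a whole row as a single key.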