Overview

Dataset statistics

Number of variables: 8
Number of observations: 100
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 2
Duplicate rows (%): 2.0%
Total size in memory: 6.9 KiB
Average record size in memory: 70.3 B
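
The percentage and size figures above are related by simple arithmetic. The sketch below re-derives them from the reported counts; the variable names are illustrative and not part of the report.

```python
# Values copied from the "Dataset statistics" table.
n_observations = 100
duplicate_rows = 2
avg_record_bytes = 70.3

# Share of duplicate rows, in percent.
duplicate_pct = 100 * duplicate_rows / n_observations

# Total size is the average record size times the row count (1 KiB = 1024 B).
total_kib = avg_record_bytes * n_observations / 1024

print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
print(f"Total size in memory: {total_kib:.1f} KiB")
```

The total rounds to the 6.9 KiB shown in the table.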

Variable types

Categorical: 6
Numeric: 1
Boolean: 1

Dataset

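Description: Provides data on the status of the housing-pension resident-only Bogeumjari Loan flag (입주자전용보금자리론여부) for customers of the Bogeumjari Loan, one of the policy mortgage products operated by the Korea Housing Finance Corporation.
Author: Korea Housing Finance Corporation (한국주택금융공사)
URL: https://www.data.go.kr/data/15090201/fileData.do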

Alerts

최초등록부점 (first registration branch) has constant value "999" [Constant]
입주자전용보금자리론여부 (resident-only Bogeumjari Loan flag) has constant value "False" [Constant]
Dataset has 2 (2.0%) duplicate rows [Duplicates]
상품대분류 (product major category) is highly overall correlated with 상품중분류 and 1 other field [High correlation]
상품 (product) is highly overall correlated with 상품대분류 and 1 other field [High correlation]
상품중분류 (product middle category) is highly overall correlated with 상품대분류 and 2 other fields [High correlation]
부점 (branch) is highly overall correlated with 상품중분류 [High correlation]
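
A "Constant" alert means a column holds a single distinct value. As a minimal sketch of that check, the snippet below scans a handful of rows copied from this report's sample and duplicate-row tables; the helper name `constant_columns` is hypothetical, and this is not ydata-profiling's own implementation.

```python
COLUMNS = ["상품대분류", "상품중분류", "상품", "금액", "부점",
           "최초등록일시", "최초등록부점", "입주자전용보금자리론여부"]

# A few rows copied from the report's sample and duplicate-row tables.
ROWS = [
    (2, 3, 120301, 318000000, "QAD", "2021/06/08", 999, "N"),
    (2, 3, 120301, 65000000, "TAA", "2021/06/07", 999, "N"),
    (1, 1, 110108, 300000000, "THB", "2021/06/08", 999, "N"),
    (2, 3, 120305, 156000000, "QAD", "2021/06/08", 999, "N"),
]


def constant_columns(columns, rows):
    """Return the columns whose value is identical in every row."""
    constant = []
    for index, name in enumerate(columns):
        distinct = {row[index] for row in rows}
        if len(distinct) == 1:
            constant.append(name)
    return constant


print(constant_columns(COLUMNS, ROWS))
```

On these rows only 최초등록부점 and 입주자전용보금자리론여부 are flagged, matching the two Constant alerts.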

Reproduction

Analysis started: 2023-12-12 16:27:57.350227
Analysis finished: 2023-12-12 16:27:57.891280
Duration: 0.54 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
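
The reported duration is the difference between the two timestamps. A short sketch using only the standard library:

```python
from datetime import datetime

FORMAT = "%Y-%m-%d %H:%M:%S.%f"

# Timestamps copied from the "Reproduction" section.
started = datetime.strptime("2023-12-12 16:27:57.350227", FORMAT)
finished = datetime.strptime("2023-12-12 16:27:57.891280", FORMAT)

duration = (finished - started).total_seconds()
print(f"Duration: {duration:.2f} seconds")
```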

Variables

상품대분류 (product major category)
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2
2nd row: 2
3rd row: 2
4th row: 2
5th row: 2

Common Values

Value  Count  Frequency (%)
2      57     57.0%
1      43     43.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

상품중분류 (product middle category)
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 3.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 3
2nd row: 3
3rd row: 3
4th row: 3
5th row: 3

Common Values

Value  Count  Frequency (%)
3      53     53.0%
1      43     43.0%
2      4      4.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

상품 (product)
Categorical

HIGH CORRELATION 

Distinct: 5
Distinct (%): 5.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 6
Median length: 6
Mean length: 6
Min length: 6

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 120301
2nd row: 120301
3rd row: 120301
4th row: 120301
5th row: 120301

Common Values

Value   Count  Frequency (%)
120301  33     33.0%
110108  24     24.0%
120305  20     20.0%
110101  19     19.0%
120201  4      4.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]
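
The Distinct and Frequency (%) figures for 상품 follow directly from the value counts. A minimal sketch with `collections.Counter`, using the counts from the Common Values table (the product codes are treated as strings here, which is an assumption about how they are stored):

```python
from collections import Counter

# Counts copied from the "Common Values" table for 상품.
counts = Counter({"120301": 33, "110108": 24, "120305": 20,
                  "110101": 19, "120201": 4})

total = sum(counts.values())   # number of observations
distinct = len(counts)         # number of distinct values

print(f"Distinct: {distinct} ({100 * distinct / total:.1f}%)")
for value, count in counts.most_common():
    print(f"{value}  {count:>3}  {100 * count / total:.1f}%")
```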

금액 (amount)
Real number (ℝ)

Distinct: 66
Distinct (%): 66.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.62106 × 10^8
Minimum: 20000000
Maximum: 4.35 × 10^8
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 20000000
5-th percentile: 35700000
Q1: 80000000
median: 1.46 × 10^8
Q3: 2.38 × 10^8
95-th percentile: 3.181 × 10^8
Maximum: 4.35 × 10^8
Range: 4.15 × 10^8
Interquartile range (IQR): 1.58 × 10^8

Descriptive statistics

Standard deviation: 1.0073767 × 10^8
Coefficient of variation (CV): 0.62143083
Kurtosis: -0.61748243
Mean: 1.62106 × 10^8
Median Absolute Deviation (MAD): 75000000
Skewness: 0.55546501
Sum: 1.62106 × 10^10
Variance: 1.0148078 × 10^16
Monotonicity: Not monotonic
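
Several of the 금액 statistics are derived from one another, which makes a quick consistency check possible. The sketch below recomputes the CV, variance, sum, range and IQR from the reported mean, standard deviation and quantiles; it works from the rounded figures shown in the report, so agreement is only expected to the displayed precision.

```python
# Figures copied from the 금액 statistics.
n = 100
mean = 1.62106e8
std = 1.0073767e8
minimum, maximum = 20_000_000, 4.35e8
q1, q3 = 80_000_000, 2.38e8

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance is the squared standard deviation
total = mean * n                 # sum of all values
value_range = maximum - minimum  # range
iqr = q3 - q1                    # interquartile range

print(f"CV       = {cv:.8f}")
print(f"Variance = {variance:.7e}")
print(f"Sum      = {total:.5e}")
print(f"Range    = {value_range:.3e}")
print(f"IQR      = {iqr:.3e}")
```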
[Figure: Histogram with fixed size bins (bins=50)]

Common values

Value              Count  Frequency (%)
300000000          9      9.0%
100000000          6      6.0%
200000000          4      4.0%
156000000          4      4.0%
50000000           3      3.0%
140000000          3      3.0%
80000000           3      3.0%
270000000          3      3.0%
30000000           2      2.0%
90000000           2      2.0%
Other values (56)  61     61.0%

Minimum 10 values

Value     Count  Frequency (%)
20000000  2      2.0%
23000000  1      1.0%
30000000  2      2.0%
36000000  1      1.0%
40000000  2      2.0%
41400000  1      1.0%
42000000  1      1.0%
45700000  1      1.0%
50000000  3      3.0%
52000000  1      1.0%

Maximum 10 values

Value      Count  Frequency (%)
435000000  1      1.0%
412000000  1      1.0%
382000000  1      1.0%
332000000  1      1.0%
320000000  1      1.0%
318000000  1      1.0%
300000000  9      9.0%
294000000  1      1.0%
292000000  1      1.0%
281000000  1      1.0%

부점 (branch)
Categorical

HIGH CORRELATION 

Distinct: 22
Distinct (%): 22.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 3
Median length: 3
Mean length: 2.98
Min length: 1
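
The length statistics for 부점 pin down how many codes are shorter than three characters. As a sketch, assuming the mean length of 2.98 is exact for the 100 observations, the snippet below enumerates every combination of one-, two- and three-character codes that is consistent with the reported minimum, maximum and mean.

```python
n = 100
mean_length = 2.98
min_length, max_length = 1, 3

# Total number of characters across all 100 codes.
total_chars = round(mean_length * n)

# Enumerate (ones, twos): how many codes have length 1 and length 2.
# The remaining codes have length 3. A minimum length of 1 means ones >= 1.
solutions = [
    (ones, twos)
    for ones in range(1, n + 1)
    for twos in range(0, n - ones + 1)
    if ones * 1 + twos * 2 + (n - ones - twos) * 3 == total_chars
]

print(solutions)
```

The only consistent combination is a single one-character code, with the other 99 codes three characters long.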

Unique

Unique: 5
Unique (%): 5.0%

Sample

1st row: QAD
2nd row: TAA
3rd row: TBA
4th row: TBA
5th row: QAD

Common Values

Value              Count  Frequency (%)
THO                13     13.0%
THA                12     12.0%
TLA                9      9.0%
TPB                8      8.0%
QAD                7      7.0%
TAC                7      7.0%
THB                7      7.0%
TBA                6      6.0%
TAB                5      5.0%
TLB                4      4.0%
Other values (12)  22     22.0%

Length

[Figure: histogram of lengths of the category]

Value              Count  Frequency (%)
tho                13     13.0%
tha                12     12.0%
tla                9      9.0%
tpb                8      8.0%
qad                7      7.0%
tac                7      7.0%
thb                7      7.0%
tba                6      6.0%
tab                5      5.0%
tlb                4      4.0%
Other values (12)  22     22.0%

최초등록일시 (first registration date)
Categorical

Distinct: 2
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2021/06/08
2nd row: 2021/06/08
3rd row: 2021/06/08
4th row: 2021/06/08
5th row: 2021/06/08

Common Values

Value       Count  Frequency (%)
2021/06/08  76     76.0%
2021/06/07  24     24.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

최초등록부점 (first registration branch)
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 999
2nd row: 999
3rd row: 999
4th row: 999
5th row: 999

Common Values

Value  Count  Frequency (%)
999    100    100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

입주자전용보금자리론여부 (resident-only Bogeumjari Loan flag)
Boolean

CONSTANT

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 232.0 B

Value  Count  Frequency (%)
False  100    100.0%

Interactions

[Figure: interactions plot]

Correlations

[Figure: correlation heatmap]

              상품대분류  상품중분류  상품   금액   부점   최초등록일시
상품대분류    1.000  1.000  1.000  0.553  0.641  0.656
상품중분류    1.000  1.000  1.000  0.462  0.781  0.293
상품          1.000  1.000  1.000  0.653  0.691  0.376
금액          0.553  0.462  0.653  1.000  0.000  0.527
부점          0.641  0.781  0.691  0.000  1.000  0.166
최초등록일시  0.656  0.293  0.376  0.527  0.166  1.000

[Figure: correlation heatmap]

              상품대분류  최초등록일시  상품   상품중분류  부점
상품대분류    1.000  0.456  0.985  0.995  0.457
최초등록일시  0.456  1.000  0.451  0.471  0.108
상품          0.985  0.451  1.000  0.990  0.387
상품중분류    0.995  0.471  0.990  1.000  0.525
부점          0.457  0.108  0.387  0.525  1.000

[Figure: correlation heatmap]

              금액   상품대분류  상품중분류  상품   부점   최초등록일시
금액          1.000  0.420  0.311  0.311  0.000  0.386
상품대분류    0.420  1.000  0.995  0.985  0.457  0.456
상품중분류    0.311  0.995  1.000  0.990  0.525  0.471
상품          0.311  0.985  0.990  1.000  0.387  0.451
부점          0.000  0.457  0.525  0.387  1.000  0.108
최초등록일시  0.386  0.456  0.471  0.451  0.108  1.000
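
The extracted text does not name the association measure behind each matrix. As an illustration of how a value of 1.000 between two categorical columns can arise, the sketch below computes the uncorrected Cramér's V, which is an assumption about the measure rather than something the report states. The cross-tabulation is also hypothetical: it is inferred from the marginal counts (43/57 and 43/4/53) and the product code prefixes, not given in the report.

```python
import math


def cramers_v(table):
    """Uncorrected Cramér's V for a contingency table given as a list of rows."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]

    # Pearson chi-squared statistic against the independence expectation.
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k))


# Hypothetical cross-tabulation: rows are 상품대분류 (1, 2),
# columns are 상품중분류 (1, 2, 3). Each middle category is assumed
# to belong to exactly one major category.
table = [
    [43, 0, 0],
    [0, 4, 53],
]

v = cramers_v(table)
print(round(v, 3))
```

When one column fully determines the other, as assumed here, the uncorrected statistic reaches its maximum of 1. Bias-corrected variants give slightly lower values on small samples.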

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
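
The figures summarize per-column missing counts, which are all zero for this dataset. A minimal sketch of that count on two rows copied from the report's sample; treating `None` and the empty string as missing is an assumption, and the helper name is hypothetical.

```python
COLUMNS = ["상품대분류", "상품중분류", "상품", "금액", "부점",
           "최초등록일시", "최초등록부점", "입주자전용보금자리론여부"]

# Two rows copied from the report's sample table.
ROWS = [
    (2, 3, 120301, 318000000, "QAD", "2021/06/08", 999, "N"),
    (2, 3, 120301, 23000000, "TPB", "2021/06/07", 999, "N"),
]


def missing_per_column(columns, rows, missing_markers=(None, "")):
    """Count, for each column, the cells whose value is a missing marker."""
    return {
        name: sum(1 for row in rows if row[index] in missing_markers)
        for index, name in enumerate(columns)
    }


missing = missing_per_column(COLUMNS, ROWS)
print(sum(missing.values()))
```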

Sample

First rows

Index  상품대분류  상품중분류  상품  금액  부점  최초등록일시  최초등록부점  입주자전용보금자리론여부
0   2  3  120301  318000000  QAD  2021/06/08  999  N
1   2  3  120301  272000000  TAA  2021/06/08  999  N
2   2  3  120301  40000000   TBA  2021/06/08  999  N
3   2  3  120301  80000000   TBA  2021/06/08  999  N
4   2  3  120301  98000000   QAD  2021/06/08  999  N
5   2  3  120301  150000000  THA  2021/06/08  999  N
6   2  3  120301  435000000  THA  2021/06/08  999  N
7   2  3  120301  50000000   THB  2021/06/08  999  N
8   2  3  120301  41400000   THB  2021/06/08  999  N
9   2  3  120301  42000000   TJA  2021/06/08  999  N

Last rows

Index  상품대분류  상품중분류  상품  금액  부점  최초등록일시  최초등록부점  입주자전용보금자리론여부
90  2  3  120301  65000000   TAA  2021/06/07  999  N
91  2  3  120301  80500000   TJA  2021/06/07  999  N
92  2  3  120301  100000000  THO  2021/06/07  999  N
93  2  3  120301  300000000  TLB  2021/06/07  999  N
94  2  3  120301  120000000  THB  2021/06/07  999  N
95  2  3  120301  90000000   TLA  2021/06/07  999  N
96  2  3  120301  73500000   TLA  2021/06/07  999  N
97  2  3  120301  60000000   THO  2021/06/07  999  N
98  2  3  120301  45700000   THO  2021/06/07  999  N
99  2  3  120301  23000000   TPB  2021/06/07  999  N
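
The sample shows the raw value N for 입주자전용보금자리론여부, while the variable summary reports it as the Boolean False. A sketch of one way to type a raw row is shown below; the helper `parse_row` is hypothetical, and treating "Y" as the true marker is an assumption, since only "N" appears in this data.

```python
from datetime import date, datetime


def parse_row(fields):
    """Convert one raw row (a sequence of strings) into typed values."""
    (major, middle, product, amount, branch,
     registered, first_branch, resident_only) = fields
    return {
        "상품대분류": major,
        "상품중분류": middle,
        "상품": product,
        "금액": int(amount),
        "부점": branch,
        "최초등록일시": datetime.strptime(registered, "%Y/%m/%d").date(),
        "최초등록부점": first_branch,
        # Assumption: "Y" marks True; anything else (here "N") is False.
        "입주자전용보금자리론여부": resident_only == "Y",
    }


# First sample row, with every field as a string.
raw = ["2", "3", "120301", "318000000", "QAD", "2021/06/08", "999", "N"]
record = parse_row(raw)
print(record["금액"], record["최초등록일시"], record["입주자전용보금자리론여부"])
```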

Duplicate rows

Most frequently occurring

Index  상품대분류  상품중분류  상품  금액  부점  최초등록일시  최초등록부점  입주자전용보금자리론여부  # duplicates
0  1  1  110108  300000000  THB  2021/06/08  999  N  2
1  2  3  120305  156000000  QAD  2021/06/08  999  N  2
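
Duplicate rows can be found by counting identical row tuples. The sketch below does this with `collections.Counter` on a small list that repeats the two rows from the table above alongside one non-duplicated sample row; the helper name `duplicate_groups` is hypothetical.

```python
from collections import Counter

# The two duplicated rows from the table above, each listed twice,
# plus one sample row that occurs once.
ROWS = [
    (1, 1, 110108, 300000000, "THB", "2021/06/08", 999, "N"),
    (2, 3, 120305, 156000000, "QAD", "2021/06/08", 999, "N"),
    (1, 1, 110108, 300000000, "THB", "2021/06/08", 999, "N"),
    (2, 3, 120301, 318000000, "QAD", "2021/06/08", 999, "N"),
    (2, 3, 120305, 156000000, "QAD", "2021/06/08", 999, "N"),
]


def duplicate_groups(rows):
    """Map each row that occurs more than once to its number of occurrences."""
    counts = Counter(rows)
    return {row: count for row, count in counts.items() if count > 1}


groups = duplicate_groups(ROWS)
for row, count in groups.items():
    print(row, count)
```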