Overview

Dataset statistics

Number of variables5
Number of observations796
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)0.1%
Total size in memory32.8 KiB
Average record size in memory42.2 B

Variable types

Numeric2
Categorical3

Dataset

Description보증 공급실적의 보증 상품별 보증료 구성비중에 대한 데이터로 보증 상품별, 연도별, 보증료에 대한 데이터를 제공합니다.
URLhttps://www.data.go.kr/data/15012108/fileData.do

Alerts

Dataset has 1 (0.1%) duplicate rowsDuplicates
보증상품 is highly overall correlated with 보증분류High correlation
보증분류 is highly overall correlated with 보증상품High correlation
has 358 (45.0%) zerosZeros

Reproduction

Analysis started2023-12-12 03:31:00.204727
Analysis finished2023-12-12 03:31:01.388479
Duration1.18 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연도
Real number (ℝ)

Distinct15
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2016.25
Minimum2009
Maximum2023
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.1 KiB
2023-12-12T12:31:01.471948image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2009
5-th percentile2009
Q12012.75
median2016
Q32020
95-th percentile2023
Maximum2023
Range14
Interquartile range (IQR)7.25

Descriptive statistics

Standard deviation4.342745
Coefficient of variation (CV)0.0021538723
Kurtosis-1.2167593
Mean2016.25
Median Absolute Deviation (MAD)4
Skewness-0.079546007
Sum1604935
Variance18.859434
MonotonicityIncreasing
2023-12-12T12:31:01.674294image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
2021 58
 
7.3%
2022 58
 
7.3%
2023 58
 
7.3%
2019 56
 
7.0%
2020 56
 
7.0%
2017 54
 
6.8%
2018 54
 
6.8%
2016 52
 
6.5%
2013 51
 
6.4%
2009 50
 
6.3%
Other values (5) 249
31.3%
ValueCountFrequency (%)
2009 50
6.3%
2010 50
6.3%
2011 50
6.3%
2012 49
6.2%
2013 51
6.4%
2014 50
6.3%
2015 50
6.3%
2016 52
6.5%
2017 54
6.8%
2018 54
6.8%
ValueCountFrequency (%)
2023 58
7.3%
2022 58
7.3%
2021 58
7.3%
2020 56
7.0%
2019 56
7.0%
2018 54
6.8%
2017 54
6.8%
2016 52
6.5%
2015 50
6.3%
2014 50
6.3%

보증상품
Categorical

HIGH CORRELATION 

Distinct29
Distinct (%)3.6%
Missing0
Missing (%)0.0%
Memory size6.3 KiB
주택분양보증
 
30
임대관리
 
30
오피스텔분양
 
30
주택임대
 
30
임대보증금
 
30
Other values (24)
646 

Length

Max length10
Median length8
Mean length6.0301508
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row주택분양보증
2nd row주상복합주택분양
3rd row오피스텔분양
4th row주택임대
5th row임대보증금

Common Values

ValueCountFrequency (%)
주택분양보증 30
 
3.8%
임대관리 30
 
3.8%
오피스텔분양 30
 
3.8%
주택임대 30
 
3.8%
임대보증금 30
 
3.8%
조합주택시공 30
 
3.8%
하자보수 30
 
3.8%
감리비예치 30
 
3.8%
인허가 30
 
3.8%
하도급대금지급 30
 
3.8%
Other values (19) 496
62.3%

Length

2023-12-12T12:31:01.916240image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
주택분양보증 30
 
3.8%
모기지 30
 
3.8%
임대주택매입자금 30
 
3.8%
임차료지급 30
 
3.8%
전세대출특약 30
 
3.8%
전세보증금반환 30
 
3.8%
기금전세자금대출 30
 
3.8%
전세임대주택반환 30
 
3.8%
리모델링자금 30
 
3.8%
정비사업자금대출 30
 
3.8%
Other values (19) 496
62.3%

보증분류
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size6.3 KiB
기업보증
496 
개인보증
300 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row기업보증
2nd row기업보증
3rd row기업보증
4th row기업보증
5th row기업보증

Common Values

ValueCountFrequency (%)
기업보증 496
62.3%
개인보증 300
37.7%

Length

2023-12-12T12:31:02.087084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:31:02.221577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
기업보증 496
62.3%
개인보증 300
37.7%
Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size6.3 KiB
보증료
398 
구성비
398 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row보증료
2nd row보증료
3rd row보증료
4th row보증료
5th row보증료

Common Values

ValueCountFrequency (%)
보증료 398
50.0%
구성비 398
50.0%

Length

2023-12-12T12:31:02.356249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:31:02.459343image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
보증료 398
50.0%
구성비 398
50.0%


Real number (ℝ)

ZEROS 

Distinct168
Distinct (%)21.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean89.172111
Minimum-38
Maximum4087
Zeros358
Zeros (%)45.0%
Negative8
Negative (%)1.0%
Memory size7.1 KiB
2023-12-12T12:31:02.585786image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-38
5-th percentile0
Q10
median1
Q321.25
95-th percentile446.5
Maximum4087
Range4125
Interquartile range (IQR)21.25

Descriptive statistics

Standard deviation342.36464
Coefficient of variation (CV)3.839369
Kurtosis52.135284
Mean89.172111
Median Absolute Deviation (MAD)1
Skewness6.4983953
Sum70981
Variance117213.55
MonotonicityNot monotonic
2023-12-12T12:31:02.747465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 358
45.0%
1 72
 
9.0%
2 35
 
4.4%
6 17
 
2.1%
3 16
 
2.0%
5 16
 
2.0%
4 15
 
1.9%
7 11
 
1.4%
9 7
 
0.9%
-1 6
 
0.8%
Other values (158) 243
30.5%
ValueCountFrequency (%)
-38 1
 
0.1%
-7 1
 
0.1%
-1 6
 
0.8%
0 358
45.0%
1 72
 
9.0%
2 35
 
4.4%
3 16
 
2.0%
4 15
 
1.9%
5 16
 
2.0%
6 17
 
2.1%
ValueCountFrequency (%)
4087 1
0.1%
3660 1
0.1%
2437 1
0.1%
2351 1
0.1%
2318 1
0.1%
2100 1
0.1%
2029 1
0.1%
1998 1
0.1%
1744 1
0.1%
1721 1
0.1%

Interactions

2023-12-12T12:31:00.815065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:31:00.510815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:31:00.976709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:31:00.671851image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T12:31:02.857828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도보증상품보증분류보증료_및_구성비중
연도1.0000.0000.0000.0000.000
보증상품0.0001.0001.0000.0000.517
보증분류0.0001.0001.0000.0000.063
보증료_및_구성비중0.0000.0000.0001.0000.307
0.0000.5170.0630.3071.000
2023-12-12T12:31:03.016157image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
보증상품보증료_및_구성비중보증분류
보증상품1.0000.0000.983
보증료_및_구성비중0.0001.0000.000
보증분류0.9830.0001.000
2023-12-12T12:31:03.140656image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도보증상품보증분류보증료_및_구성비중
연도1.0000.1620.0000.0000.000
0.1621.0000.2300.0470.229
보증상품0.0000.2301.0000.9830.000
보증분류0.0000.0470.9831.0000.000
보증료_및_구성비중0.0000.2290.0000.0001.000

Missing values

2023-12-12T12:31:01.158746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T12:31:01.323842image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연도보증상품보증분류보증료_및_구성비중
02009주택분양보증기업보증보증료1280
12009주상복합주택분양기업보증보증료159
22009오피스텔분양기업보증보증료0
32009주택임대기업보증보증료37
42009임대보증금기업보증보증료80
52009조합주택시공기업보증보증료24
62009하자보수기업보증보증료74
72009감리비예치기업보증보증료0
82009인허가기업보증보증료0
92009하도급대금지급기업보증보증료4
연도보증상품보증분류보증료_및_구성비중
7862023주택구입자금개인보증구성비5
7872023주택임차자금개인보증구성비0
7882023정비사업자금대출개인보증구성비26
7892023리모델링자금개인보증구성비2
7902023전세임대주택반환개인보증구성비0
7912023기금전세자금대출개인보증구성비0
7922023전세보증금반환개인보증구성비19
7932023전세대출특약개인보증구성비3
7942023임차료지급개인보증구성비1
7952023임대주택매입자금개인보증구성비-1

Duplicate rows

Most frequently occurring

연도보증상품보증분류보증료_및_구성비중# duplicates
02013임대주택매입자금개인보증구성비02