Overview

Dataset statistics

Number of variables6
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows1616
Duplicate rows (%)16.2%
Total size in memory566.4 KiB
Average record size in memory58.0 B

Variable types

Categorical4
DateTime1
Numeric1

Dataset

Description대출기관 별 금리정보
Author경기신용보증재단
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=V453D5A3MO9LMZET8MYF29626206&infSeq=1

Alerts

Dataset has 1616 (16.2%) duplicate rowsDuplicates
기업구분 is highly imbalanced (86.5%)Imbalance
개인/법인구분 is highly imbalanced (61.1%)Imbalance

Reproduction

Analysis started2023-12-10 21:38:27.338369
Analysis finished2023-12-10 21:38:28.051995
Duration0.71 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기업구분
Categorical

IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
소상공인
9702 
소기업
 
246
중기업
 
52

Length

Max length4
Median length4
Mean length3.9702
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row소상공인
2nd row소상공인
3rd row소상공인
4th row소상공인
5th row소상공인

Common Values

ValueCountFrequency (%)
소상공인 9702
97.0%
소기업 246
 
2.5%
중기업 52
 
0.5%

Length

2023-12-11T06:38:28.114391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:38:28.222857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
소상공인 9702
97.0%
소기업 246
 
2.5%
중기업 52
 
0.5%

개인/법인구분
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
개인
9238 
법인
 
762

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row개인
2nd row개인
3rd row개인
4th row개인
5th row개인

Common Values

ValueCountFrequency (%)
개인 9238
92.4%
법인 762
 
7.6%

Length

2023-12-11T06:38:28.344126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:38:28.430248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
개인 9238
92.4%
법인 762
 
7.6%

보증년도
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2015
6174 
2017
3056 
2016
770 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2015
2nd row2015
3rd row2015
4th row2015
5th row2015

Common Values

ValueCountFrequency (%)
2015 6174
61.7%
2017 3056
30.6%
2016 770
 
7.7%

Length

2023-12-11T06:38:28.554901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:38:28.652155image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2015 6174
61.7%
2017 3056
30.6%
2016 770
 
7.7%
Distinct288
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2015-01-02 00:00:00
Maximum2017-06-09 00:00:00
2023-12-11T06:38:28.798173image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:38:28.975704image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct17
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
농협은행
2289 
하나은행
1871 
신한은행
1263 
국민은행
1228 
중소기업은행
949 
Other values (12)
2400 

Length

Max length6
Median length4
Mean length4.3943
Min length2

Unique

Unique3 ?
Unique (%)< 0.1%

Sample

1st row농협은행
2nd rowSC제일은행
3rd row하나은행
4th row상호저축은행
5th row신한은행

Common Values

ValueCountFrequency (%)
농협은행 2289
22.9%
하나은행 1871
18.7%
신한은행 1263
12.6%
국민은행 1228
12.3%
중소기업은행 949
9.5%
우리은행 851
 
8.5%
상호저축은행 589
 
5.9%
SC제일은행 443
 
4.4%
새마을금고 227
 
2.3%
신협 169
 
1.7%
Other values (7) 121
 
1.2%

Length

2023-12-11T06:38:29.185322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
농협은행 2289
22.9%
하나은행 1871
18.7%
신한은행 1263
12.6%
국민은행 1228
12.3%
중소기업은행 949
9.5%
우리은행 851
 
8.5%
상호저축은행 589
 
5.9%
sc제일은행 443
 
4.4%
새마을금고 227
 
2.3%
신협 169
 
1.7%
Other values (7) 121
 
1.2%
Distinct86
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.63879
Minimum1
Maximum9.5
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T06:38:29.347155image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q12.8
median3.3
Q34
95-th percentile7.6
Maximum9.5
Range8.5
Interquartile range (IQR)1.2

Descriptive statistics

Standard deviation1.5103842
Coefficient of variation (CV)0.4150787
Kurtosis3.3869508
Mean3.63879
Median Absolute Deviation (MAD)0.6
Skewness1.8174174
Sum36387.9
Variance2.2812605
MonotonicityNot monotonic
2023-12-11T06:38:29.503100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3.3 1088
 
10.9%
2.9 895
 
8.9%
2.8 536
 
5.4%
3.9 532
 
5.3%
2.3 518
 
5.2%
4.0 372
 
3.7%
2.1 362
 
3.6%
3.8 350
 
3.5%
3.0 341
 
3.4%
3.2 327
 
3.3%
Other values (76) 4679
46.8%
ValueCountFrequency (%)
1.0 22
 
0.2%
1.1 2
 
< 0.1%
1.2 4
 
< 0.1%
1.3 6
 
0.1%
1.4 21
 
0.2%
1.5 21
 
0.2%
1.6 13
 
0.1%
1.7 30
 
0.3%
1.8 130
1.3%
1.9 61
0.6%
ValueCountFrequency (%)
9.5 3
 
< 0.1%
9.4 4
 
< 0.1%
9.3 5
 
0.1%
9.2 21
0.2%
9.1 34
0.3%
9.0 50
0.5%
8.9 30
0.3%
8.8 21
0.2%
8.7 5
 
0.1%
8.6 9
 
0.1%

Interactions

2023-12-11T06:38:27.715987image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T06:38:29.602081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기업구분개인/법인구분보증년도대출기관(은행명)대출금리(최초대출시)
기업구분1.0000.2740.0540.2270.111
개인/법인구분0.2741.0000.0160.2280.122
보증년도0.0540.0161.0000.2510.503
대출기관(은행명)0.2270.2280.2511.0000.735
대출금리(최초대출시)0.1110.1220.5030.7351.000
2023-12-11T06:38:29.733323image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
개인/법인구분보증년도기업구분대출기관(은행명)
개인/법인구분1.0000.0260.4440.205
보증년도0.0261.0000.0160.139
기업구분0.4440.0161.0000.124
대출기관(은행명)0.2050.1390.1241.000
2023-12-11T06:38:29.822430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
대출금리(최초대출시)기업구분개인/법인구분보증년도대출기관(은행명)
대출금리(최초대출시)1.0000.0610.0920.3500.393
기업구분0.0611.0000.4440.0160.124
개인/법인구분0.0920.4441.0000.0260.205
보증년도0.3500.0160.0261.0000.139
대출기관(은행명)0.3930.1240.2050.1391.000

Missing values

2023-12-11T06:38:27.849802image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T06:38:27.994100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기업구분개인/법인구분보증년도보증일자대출기관(은행명)대출금리(최초대출시)
38214소상공인개인20152015-05-19농협은행2.9
21635소상공인개인20152015-02-25SC제일은행3.8
43310소상공인개인20152015-05-27하나은행2.9
54267소상공인개인20152015-07-15상호저축은행8.5
39528소상공인개인20152015-05-27신한은행2.9
34284소상공인개인20152015-05-20신한은행2.9
55023소상공인개인20152015-11-04우리은행2.8
57863소상공인개인20152015-09-22신협6.8
27275소상공인개인20152015-02-24국민은행4.3
21570소기업개인20152015-02-04신한은행5.0
기업구분개인/법인구분보증년도보증일자대출기관(은행명)대출금리(최초대출시)
22421소상공인개인20152015-04-13신한은행3.4
35564소상공인개인20152015-07-16하나은행2.1
51627소상공인개인20152015-07-16하나은행3.5
54151소상공인개인20152015-07-13새마을금고2.8
28072소상공인개인20162016-12-07상호저축은행7.6
52722소상공인개인20152015-08-10신협6.8
10434소상공인개인20172017-04-04농협은행3.7
23020소상공인개인20152015-02-05우리은행4.3
37894소상공인개인20152015-05-18농협은행2.9
26709소상공인개인20152015-01-07상호저축은행9.1

Duplicate rows

Most frequently occurring

기업구분개인/법인구분보증년도보증일자대출기관(은행명)대출금리(최초대출시)# duplicates
728소상공인개인20152015-08-05하나은행2.524
1354소상공인개인20172017-03-28신한은행2.320
178소상공인개인20152015-04-15하나은행2.919
719소상공인개인20152015-08-04하나은행2.517
689소상공인개인20152015-07-30하나은행2.516
1536소상공인개인20172017-05-31하나은행3.915
224소상공인개인20152015-04-29국민은행2.914
679소상공인개인20152015-07-29농협은행3.314
1129소상공인개인20172017-03-03농협은행2.114
1457소상공인개인20172017-04-06신한은행2.114