Overview

Dataset statistics

Number of variables5
Number of observations1000
Missing cells0
Missing cells (%)0.0%
Duplicate rows27
Duplicate rows (%)2.7%
Total size in memory43.1 KiB
Average record size in memory44.1 B

Variable types

Categorical1
Numeric4

Dataset

Description신용보증기금의 보증 이용 기업 중 유망창업기업에 해당하는 기업들의 보증 이용 현황 명세 관련된 데이터이니 참고바랍니다.
Author신용보증기금
URLhttps://www.data.go.kr/data/15089408/fileData.do

Alerts

Dataset has 27 (2.7%) duplicate rowsDuplicates
상담누계건수 is highly overall correlated with 승인건수 and 2 other fieldsHigh correlation
승인건수 is highly overall correlated with 상담누계건수 and 2 other fieldsHigh correlation
취급금액 is highly overall correlated with 상담누계건수 and 2 other fieldsHigh correlation
보증잔액건수 is highly overall correlated with 상담누계건수 and 2 other fieldsHigh correlation
상담누계건수 has 297 (29.7%) zerosZeros
승인건수 has 269 (26.9%) zerosZeros
취급금액 has 60 (6.0%) zerosZeros
보증잔액건수 has 135 (13.5%) zerosZeros

Reproduction

Analysis started2023-12-12 09:09:45.977852
Analysis finished2023-12-12 09:09:48.363097
Duration2.39 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct8
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
퍼스트펭귄보증
172 
전체
130 
유망창업보증소계
125 
창업성장보증I
125 
창업초기보증
122 
Other values (3)
326 

Length

Max length8
Median length7
Mean length6.336
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row유망창업보증소계
2nd row신생기업보증
3rd row유망창업보증소계
4th row창업초기보증
5th row신생기업보증

Common Values

ValueCountFrequency (%)
퍼스트펭귄보증 172
17.2%
전체 130
13.0%
유망창업보증소계 125
12.5%
창업성장보증I 125
12.5%
창업초기보증 122
12.2%
예비창업자보증 115
11.5%
신생기업보증 114
11.4%
창업성장보증II 97
9.7%

Length

2023-12-12T18:09:48.461660image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:09:48.624688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
퍼스트펭귄보증 172
17.2%
전체 130
13.0%
유망창업보증소계 125
12.5%
창업성장보증i 125
12.5%
창업초기보증 122
12.2%
예비창업자보증 115
11.5%
신생기업보증 114
11.4%
창업성장보증ii 97
9.7%

상담누계건수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct98
Distinct (%)9.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean17.711
Minimum0
Maximum316
Zeros297
Zeros (%)29.7%
Negative0
Negative (%)0.0%
Memory size8.9 KiB
2023-12-12T18:09:48.801389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median6
Q319.25
95-th percentile80
Maximum316
Range316
Interquartile range (IQR)19.25

Descriptive statistics

Standard deviation32.0221
Coefficient of variation (CV)1.8080346
Kurtosis23.02092
Mean17.711
Median Absolute Deviation (MAD)6
Skewness3.9728732
Sum17711
Variance1025.4149
MonotonicityNot monotonic
2023-12-12T18:09:48.996957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 297
29.7%
1 66
 
6.6%
2 53
 
5.3%
6 29
 
2.9%
10 27
 
2.7%
4 26
 
2.6%
7 25
 
2.5%
14 24
 
2.4%
3 23
 
2.3%
11 22
 
2.2%
Other values (88) 408
40.8%
ValueCountFrequency (%)
0 297
29.7%
1 66
 
6.6%
2 53
 
5.3%
3 23
 
2.3%
4 26
 
2.6%
5 21
 
2.1%
6 29
 
2.9%
7 25
 
2.5%
8 16
 
1.6%
9 19
 
1.9%
ValueCountFrequency (%)
316 1
0.1%
289 1
0.1%
259 1
0.1%
242 1
0.1%
210 1
0.1%
203 1
0.1%
201 2
0.2%
165 1
0.1%
141 1
0.1%
139 1
0.1%

승인건수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct139
Distinct (%)13.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean32.183
Minimum0
Maximum348
Zeros269
Zeros (%)26.9%
Negative0
Negative (%)0.0%
Memory size8.9 KiB
2023-12-12T18:09:49.184549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median12
Q339.25
95-th percentile139
Maximum348
Range348
Interquartile range (IQR)39.25

Descriptive statistics

Standard deviation49.480358
Coefficient of variation (CV)1.5374688
Kurtosis6.694944
Mean32.183
Median Absolute Deviation (MAD)12
Skewness2.3840467
Sum32183
Variance2448.3058
MonotonicityNot monotonic
2023-12-12T18:09:49.384337image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 269
26.9%
1 55
 
5.5%
2 45
 
4.5%
3 20
 
2.0%
9 18
 
1.8%
5 17
 
1.7%
21 16
 
1.6%
19 16
 
1.6%
17 15
 
1.5%
36 14
 
1.4%
Other values (129) 515
51.5%
ValueCountFrequency (%)
0 269
26.9%
1 55
 
5.5%
2 45
 
4.5%
3 20
 
2.0%
4 13
 
1.3%
5 17
 
1.7%
6 10
 
1.0%
7 13
 
1.3%
8 12
 
1.2%
9 18
 
1.8%
ValueCountFrequency (%)
348 1
 
0.1%
320 1
 
0.1%
275 2
0.2%
268 1
 
0.1%
257 1
 
0.1%
241 2
0.2%
228 1
 
0.1%
225 3
0.3%
223 2
0.2%
203 2
0.2%

취급금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct812
Distinct (%)81.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.6203159 × 1010
Minimum0
Maximum2.2299547 × 1011
Zeros60
Zeros (%)6.0%
Negative0
Negative (%)0.0%
Memory size8.9 KiB
2023-12-12T18:09:49.542713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11.32763 × 109
median9.8351975 × 109
Q33.403519 × 1010
95-th percentile1.1265608 × 1011
Maximum2.2299547 × 1011
Range2.2299547 × 1011
Interquartile range (IQR)3.270756 × 1010

Descriptive statistics

Standard deviation3.8671812 × 1010
Coefficient of variation (CV)1.4758454
Kurtosis5.3384318
Mean2.6203159 × 1010
Median Absolute Deviation (MAD)9.5983225 × 109
Skewness2.2385714
Sum2.6203159 × 1013
Variance1.495509 × 1021
MonotonicityNot monotonic
2023-12-12T18:09:49.695635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 60
 
6.0%
300000000 6
 
0.6%
100000000 6
 
0.6%
500000000 5
 
0.5%
237500000 5
 
0.5%
200000000 4
 
0.4%
150000000 4
 
0.4%
50000000 4
 
0.4%
190000000 4
 
0.4%
30600000 3
 
0.3%
Other values (802) 899
89.9%
ValueCountFrequency (%)
0 60
6.0%
30600000 3
 
0.3%
45000000 1
 
0.1%
48562500 1
 
0.1%
50000000 4
 
0.4%
54000000 1
 
0.1%
54150000 1
 
0.1%
56000000 1
 
0.1%
57000000 1
 
0.1%
64000000 1
 
0.1%
ValueCountFrequency (%)
222995472649 1
0.1%
221138972649 1
0.1%
219221678750 1
0.1%
203187968700 1
0.1%
201083968700 1
0.1%
194183229400 2
0.2%
190834869000 1
0.1%
187674869000 1
0.1%
182462560000 1
0.1%
181172560000 1
0.1%

보증잔액건수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct319
Distinct (%)31.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean144.08
Minimum0
Maximum1240
Zeros135
Zeros (%)13.5%
Negative0
Negative (%)0.0%
Memory size8.9 KiB
2023-12-12T18:09:49.837671image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q12
median40
Q3195
95-th percentile630.9
Maximum1240
Range1240
Interquartile range (IQR)193

Descriptive statistics

Standard deviation217.4055
Coefficient of variation (CV)1.5089221
Kurtosis4.6703442
Mean144.08
Median Absolute Deviation (MAD)40
Skewness2.1196156
Sum144080
Variance47265.153
MonotonicityNot monotonic
2023-12-12T18:09:50.019533image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 135
 
13.5%
1 87
 
8.7%
2 36
 
3.6%
3 25
 
2.5%
9 15
 
1.5%
6 15
 
1.5%
4 13
 
1.3%
11 12
 
1.2%
15 11
 
1.1%
7 10
 
1.0%
Other values (309) 641
64.1%
ValueCountFrequency (%)
0 135
13.5%
1 87
8.7%
2 36
 
3.6%
3 25
 
2.5%
4 13
 
1.3%
5 10
 
1.0%
6 15
 
1.5%
7 10
 
1.0%
8 8
 
0.8%
9 15
 
1.5%
ValueCountFrequency (%)
1240 1
0.1%
1237 1
0.1%
1167 1
0.1%
1162 1
0.1%
1096 1
0.1%
1093 1
0.1%
979 2
0.2%
974 1
0.1%
962 1
0.1%
932 1
0.1%

Interactions

2023-12-12T18:09:47.632029image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:09:46.176536image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:09:46.658914image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:09:47.173263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:09:47.753544image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:09:46.273608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:09:46.794474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:09:47.280594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:09:47.877027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:09:46.404891image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:09:46.950716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:09:47.391535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:09:47.990894image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:09:46.532069image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:09:47.056737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:09:47.501594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T18:09:50.121141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
유망창업기업보증구분상담누계건수승인건수취급금액보증잔액건수
유망창업기업보증구분1.0000.4340.5070.5330.568
상담누계건수0.4341.0000.8590.8480.815
승인건수0.5070.8591.0000.7970.833
취급금액0.5330.8480.7971.0000.934
보증잔액건수0.5680.8150.8330.9341.000
2023-12-12T18:09:50.235969image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
상담누계건수승인건수취급금액보증잔액건수유망창업기업보증구분
상담누계건수1.0000.9590.8970.8940.224
승인건수0.9591.0000.9230.9420.277
취급금액0.8970.9231.0000.8940.289
보증잔액건수0.8940.9420.8941.0000.314
유망창업기업보증구분0.2240.2770.2890.3141.000

Missing values

2023-12-12T18:09:48.182663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:09:48.316358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

유망창업기업보증구분상담누계건수승인건수취급금액보증잔액건수
0유망창업보증소계173339501044230225
1신생기업보증214539869539156198
2유망창업보증소계56119101048099156521
3창업초기보증21618261670000128
4신생기업보증85337167550000296
5유망창업보증소계487853904072500416
6창업초기보증203335272150000188
7창업초기보증112926784430000167
8유망창업보증소계36107105180305000690
9창업성장보증I8202376882500043
유망창업기업보증구분상담누계건수승인건수취급금액보증잔액건수
990창업성장보증I14221436485000131
991예비창업자보증710755910000057
992퍼스트펭귄보증0027075000003
993창업성장보증I18221544681300018
994신생기업보증62325467900000168
995창업성장보증I1110710000003
996전체85195132166327503828
997신생기업보증206628115200000199
998예비창업자보증13171600000020
999창업성장보증I2218150000006

Duplicate rows

Most frequently occurring

유망창업기업보증구분상담누계건수승인건수취급금액보증잔액건수# duplicates
8퍼스트펭귄보증000112
1예비창업자보증00016
26퍼스트펭귄보증01004
2예비창업자보증00023
6창업초기보증00013
9퍼스트펭귄보증00023
22퍼스트펭귄보증0050000000003
23퍼스트펭귄보증0060000000003
0신생기업보증00032
3예비창업자보증00032