Overview

Dataset statistics

Number of variables9
Number of observations500
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory36.8 KiB
Average record size in memory75.3 B

Variable types

Categorical2
Text1
Numeric3
Boolean3

Dataset

Description해당 파일 데이터는 신용보증기금의 보증고객 창업기업 성과공유금 정보에 대해 확인하실 수 있는 자료이니 데이터 활용에 참고하여 주시기 바랍니다.
Author신용보증기금
URLhttps://www.data.go.kr/data/15093270/fileData.do

Alerts

업무구분 has constant value ""Constant
유가증권시장상장여부 has constant value ""Constant
평가년도매출금액 is highly overall correlated with 2차년도실제매출금액 and 1 other fieldsHigh correlation
2차년도실제매출금액 is highly overall correlated with 평가년도매출금액 and 1 other fieldsHigh correlation
2차년도예상매출금액 is highly overall correlated with 평가년도매출금액 and 1 other fieldsHigh correlation
평가년도매출금액 has 16 (3.2%) zerosZeros
2차년도실제매출금액 has 13 (2.6%) zerosZeros

Reproduction

Analysis started2023-12-12 09:37:43.251594
Analysis finished2023-12-12 09:37:45.125182
Duration1.87 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업무구분
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
신용보증
500 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row신용보증
2nd row신용보증
3rd row신용보증
4th row신용보증
5th row신용보증

Common Values

ValueCountFrequency (%)
신용보증 500
100.0%

Length

2023-12-12T18:37:45.211068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:37:45.349759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
신용보증 500
100.0%
Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
평가년도4기
285 
평가년도5기
215 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row평가년도4기
2nd row평가년도4기
3rd row평가년도5기
4th row평가년도4기
5th row평가년도4기

Common Values

ValueCountFrequency (%)
평가년도4기 285
57.0%
평가년도5기 215
43.0%

Length

2023-12-12T18:37:45.478216image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:37:45.591332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
평가년도4기 285
57.0%
평가년도5기 215
43.0%
Distinct427
Distinct (%)85.4%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
2023-12-12T18:37:45.829786image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length8
Mean length7.168
Min length1

Characters and Unicode

Total characters3584
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique420 ?
Unique (%)84.0%

Sample

1st row112309718
2nd row36948436
3rd row12177053
4th row21993049
5th row65596915
ValueCountFrequency (%)
0 68
 
13.6%
22848443 2
 
0.4%
61000000 2
 
0.4%
48088997 2
 
0.4%
25000000 2
 
0.4%
18542616 2
 
0.4%
15461540 2
 
0.4%
18140348 1
 
0.2%
20957777 1
 
0.2%
98609196 1
 
0.2%
Other values (417) 417
83.4%
2023-12-12T18:37:46.281653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 506
14.1%
1 423
11.8%
2 354
9.9%
3 352
9.8%
5 344
9.6%
6 343
9.6%
4 329
9.2%
9 299
8.3%
8 295
8.2%
7 289
8.1%
Other values (2) 50
 
1.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3534
98.6%
Open Punctuation 25
 
0.7%
Close Punctuation 25
 
0.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 506
14.3%
1 423
12.0%
2 354
10.0%
3 352
10.0%
5 344
9.7%
6 343
9.7%
4 329
9.3%
9 299
8.5%
8 295
8.3%
7 289
8.2%
Open Punctuation
ValueCountFrequency (%)
( 25
100.0%
Close Punctuation
ValueCountFrequency (%)
) 25
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3584
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 506
14.1%
1 423
11.8%
2 354
9.9%
3 352
9.8%
5 344
9.6%
6 343
9.6%
4 329
9.2%
9 299
8.3%
8 295
8.2%
7 289
8.1%
Other values (2) 50
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3584
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 506
14.1%
1 423
11.8%
2 354
9.9%
3 352
9.8%
5 344
9.6%
6 343
9.6%
4 329
9.2%
9 299
8.3%
8 295
8.2%
7 289
8.1%
Other values (2) 50
 
1.4%

평가년도매출금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct481
Distinct (%)96.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.2031737 × 109
Minimum0
Maximum3.003952 × 1010
Zeros16
Zeros (%)3.2%
Negative0
Negative (%)0.0%
Memory size4.5 KiB
2023-12-12T18:37:46.452903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile26949000
Q12.3376142 × 108
median6.5567955 × 108
Q31.396417 × 109
95-th percentile3.5345392 × 109
Maximum3.003952 × 1010
Range3.003952 × 1010
Interquartile range (IQR)1.1626555 × 109

Descriptive statistics

Standard deviation2.1074664 × 109
Coefficient of variation (CV)1.7515895
Kurtosis77.151163
Mean1.2031737 × 109
Median Absolute Deviation (MAD)4.8245947 × 108
Skewness7.0530684
Sum6.0158686 × 1011
Variance4.4414148 × 1018
MonotonicityNot monotonic
2023-12-12T18:37:46.671298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 16
 
3.2%
461587320 2
 
0.4%
348123574 2
 
0.4%
764973048 2
 
0.4%
426400000 2
 
0.4%
223754720 1
 
0.2%
3461505851 1
 
0.2%
536626600 1
 
0.2%
2204300940 1
 
0.2%
2023665969 1
 
0.2%
Other values (471) 471
94.2%
ValueCountFrequency (%)
0 16
3.2%
48 1
 
0.2%
564185 1
 
0.2%
1177286 1
 
0.2%
1818182 1
 
0.2%
2734841 1
 
0.2%
6818180 1
 
0.2%
9000000 1
 
0.2%
9545454 1
 
0.2%
18000000 1
 
0.2%
ValueCountFrequency (%)
30039519805 1
0.2%
13817249117 1
0.2%
13804479547 1
0.2%
11102090850 1
0.2%
9973340202 1
0.2%
9408397737 1
0.2%
9268967000 1
0.2%
8975726738 1
0.2%
8634341015 1
0.2%
7648287007 1
0.2%

2차년도실제매출금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct385
Distinct (%)77.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8.762516 × 108
Minimum0
Maximum1.5837381 × 1010
Zeros13
Zeros (%)2.6%
Negative0
Negative (%)0.0%
Memory size4.5 KiB
2023-12-12T18:37:46.855528image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile42119092
Q11.9219507 × 108
median4.700585 × 108
Q39.5964617 × 108
95-th percentile3.3907205 × 109
Maximum1.5837381 × 1010
Range1.5837381 × 1010
Interquartile range (IQR)7.6745111 × 108

Descriptive statistics

Standard deviation1.3623389 × 109
Coefficient of variation (CV)1.5547348
Kurtosis51.176615
Mean8.762516 × 108
Median Absolute Deviation (MAD)3.2614528 × 108
Skewness5.7486278
Sum4.381258 × 1011
Variance1.8559672 × 1018
MonotonicityNot monotonic
2023-12-12T18:37:47.072684image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 13
 
2.6%
300356625 2
 
0.4%
410652008 2
 
0.4%
427042650 2
 
0.4%
90989542 2
 
0.4%
1349880596 2
 
0.4%
3827187398 2
 
0.4%
42119092 2
 
0.4%
300794682 2
 
0.4%
203270000 2
 
0.4%
Other values (375) 469
93.8%
ValueCountFrequency (%)
0 13
2.6%
50 1
 
0.2%
909090 1
 
0.2%
2475300 1
 
0.2%
14409263 2
 
0.4%
22754549 2
 
0.4%
23125000 1
 
0.2%
23241700 1
 
0.2%
34820000 1
 
0.2%
38769000 1
 
0.2%
ValueCountFrequency (%)
15837381457 1
0.2%
14700702461 1
0.2%
7193472543 1
0.2%
6181495815 1
0.2%
5662976299 1
0.2%
4757968984 1
0.2%
4706148515 1
0.2%
4620192981 1
0.2%
4543037060 2
0.4%
4186064121 1
0.2%

2차년도예상매출금액
Real number (ℝ)

HIGH CORRELATION 

Distinct95
Distinct (%)19.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.2115074 × 1011
Minimum70000000
Maximum8 × 1013
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.5 KiB
2023-12-12T18:37:47.264991image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum70000000
5-th percentile1.895 × 108
Q14 × 108
median7 × 108
Q31.306 × 109
95-th percentile3.81 × 109
Maximum8 × 1013
Range7.999993 × 1013
Interquartile range (IQR)9.06 × 108

Descriptive statistics

Standard deviation5.0544992 × 1012
Coefficient of variation (CV)15.738713
Kurtosis247.48387
Mean3.2115074 × 1011
Median Absolute Deviation (MAD)4 × 108
Skewness15.76369
Sum1.6057537 × 1014
Variance2.5547962 × 1025
MonotonicityNot monotonic
2023-12-12T18:37:47.478651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1000000000 32
 
6.4%
500000000 30
 
6.0%
1500000000 28
 
5.6%
600000000 26
 
5.2%
400000000 24
 
4.8%
300000000 23
 
4.6%
800000000 23
 
4.6%
200000000 22
 
4.4%
450000000 14
 
2.8%
700000000 14
 
2.8%
Other values (85) 264
52.8%
ValueCountFrequency (%)
70000000 1
 
0.2%
80000000 1
 
0.2%
90000000 1
 
0.2%
100000000 5
1.0%
108000000 1
 
0.2%
120000000 1
 
0.2%
130000000 1
 
0.2%
140000000 1
 
0.2%
150000000 6
1.2%
154000000 2
 
0.4%
ValueCountFrequency (%)
80000000000000 2
0.4%
13000000000 1
 
0.2%
12672000000 1
 
0.2%
12000000000 1
 
0.2%
10000000000 1
 
0.2%
9500000000 1
 
0.2%
9000000000 1
 
0.2%
6500000000 1
 
0.2%
6000000000 4
0.8%
5000000000 3
0.6%
Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size632.0 B
False
390 
True
110 
ValueCountFrequency (%)
False 390
78.0%
True 110
 
22.0%
2023-12-12T18:37:47.641786image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size632.0 B
False
434 
True
66 
ValueCountFrequency (%)
False 434
86.8%
True 66
 
13.2%
2023-12-12T18:37:48.121753image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size632.0 B
False
500 
ValueCountFrequency (%)
False 500
100.0%
2023-12-12T18:37:48.218171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Interactions

2023-12-12T18:37:44.233345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:37:43.563373image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:37:43.912647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:37:44.363001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:37:43.649903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:37:44.007450image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:37:44.575695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:37:43.771670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:37:44.109295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T18:37:48.287472image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
성과공유금평가년도구분평가년도매출금액2차년도실제매출금액2차년도예상매출금액실제매출10퍼센트이상달성여부예상매출2배이상달성여부
성과공유금평가년도구분1.0000.1330.1220.0000.0000.171
평가년도매출금액0.1331.0000.9240.0000.0000.406
2차년도실제매출금액0.1220.9241.0000.0000.2390.321
2차년도예상매출금액0.0000.0000.0001.0000.0000.000
실제매출10퍼센트이상달성여부0.0000.0000.2390.0001.0000.087
예상매출2배이상달성여부0.1710.4060.3210.0000.0871.000
2023-12-12T18:37:48.410117image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
예상매출2배이상달성여부실제매출10퍼센트이상달성여부성과공유금평가년도구분
예상매출2배이상달성여부1.0000.0550.109
실제매출10퍼센트이상달성여부0.0551.0000.000
성과공유금평가년도구분0.1090.0001.000
2023-12-12T18:37:48.527317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
평가년도매출금액2차년도실제매출금액2차년도예상매출금액성과공유금평가년도구분실제매출10퍼센트이상달성여부예상매출2배이상달성여부
평가년도매출금액1.0000.7880.6210.0950.0000.291
2차년도실제매출금액0.7881.0000.6540.0870.1710.230
2차년도예상매출금액0.6210.6541.0000.0000.0000.000
성과공유금평가년도구분0.0950.0870.0001.0000.0000.109
실제매출10퍼센트이상달성여부0.0000.1710.0000.0001.0000.055
예상매출2배이상달성여부0.2910.2300.0000.1090.0551.000

Missing values

2023-12-12T18:37:44.834307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:37:45.051420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업무구분성과공유금평가년도구분평가년도당기순이익금액평가년도매출금액2차년도실제매출금액2차년도예상매출금액실제매출10퍼센트이상달성여부예상매출2배이상달성여부유가증권시장상장여부
0신용보증평가년도4기1123097181532797011313598350400000000YYN
1신용보증평가년도4기36948436293047290329304729032500000000NNN
2신용보증평가년도5기12177053167079649142117246600000000NNN
3신용보증평가년도4기21993049273484140951112133000000000NNN
4신용보증평가년도4기65596915191560879011002822001500000000NNN
5신용보증평가년도5기56754384181992231415854944331500000000NNN
6신용보증평가년도4기46491731162155799215854944331500000000NNN
7신용보증평가년도5기8539162140596040198645109300000000NNN
8신용보증평가년도4기0107083836198645109300000000NNN
9신용보증평가년도5기19693511250288455188530750240000000YNN
업무구분성과공유금평가년도구분평가년도당기순이익금액평가년도매출금액2차년도실제매출금액2차년도예상매출금액실제매출10퍼센트이상달성여부예상매출2배이상달성여부유가증권시장상장여부
490신용보증평가년도5기28588000654653000425648273600000000NNN
491신용보증평가년도4기04552100647349266901500000000NNN
492신용보증평가년도5기20240385112150872911733100661800000000NNN
493신용보증평가년도5기28586425238847035278383282400000000YNN
494신용보증평가년도4기413589206536800000037917235982500000000YYN
495신용보증평가년도4기0236869430238763223300000000NNN
496신용보증평가년도4기80928176334114964841860641212500000000NNN
497신용보증평가년도4기0557952444686343380000000000000NNN
498신용보증평가년도4기(89151670)242260562280063829500000000NNN
499신용보증평가년도4기63882308372822605317853222400000000YNN