Overview

Dataset statistics

Number of variables4
Number of observations120
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.2 KiB
Average record size in memory36.1 B

Variable types

Categorical2
Numeric2

Dataset

Description경기신용보증재단 창업기업 보증 지원 현황
Author경기신용보증재단
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=V57DV2H792SYUSAJBYZM31418616&infSeq=1

Alerts

보증건수 is highly overall correlated with 보증금액High correlation
보증금액 is highly overall correlated with 보증건수High correlation
보증건수 has 5 (4.2%) zerosZeros
보증금액 has 5 (4.2%) zerosZeros

Reproduction

Analysis started2024-04-20 18:32:56.168633
Analysis finished2024-04-20 18:32:57.743790
Duration1.58 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군명
Categorical

Distinct30
Distinct (%)25.0%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
가평군
 
4
고양시
 
4
과천시
 
4
광명시
 
4
광주시
 
4
Other values (25)
100 

Length

Max length4
Median length3
Mean length3.1
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row가평군
2nd row가평군
3rd row가평군
4th row가평군
5th row고양시

Common Values

ValueCountFrequency (%)
가평군 4
 
3.3%
고양시 4
 
3.3%
과천시 4
 
3.3%
광명시 4
 
3.3%
광주시 4
 
3.3%
구리시 4
 
3.3%
군포시 4
 
3.3%
김포시 4
 
3.3%
남양주시 4
 
3.3%
동두천시 4
 
3.3%
Other values (20) 80
66.7%

Length

2024-04-21T03:32:57.812765image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
가평군 4
 
3.3%
고양시 4
 
3.3%
하남시 4
 
3.3%
포천시 4
 
3.3%
평택시 4
 
3.3%
파주시 4
 
3.3%
이천시 4
 
3.3%
의정부시 4
 
3.3%
의왕시 4
 
3.3%
용인시 4
 
3.3%
Other values (20) 80
66.7%

기준년도
Categorical

Distinct4
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2020
30 
2021
30 
2022
30 
2023
30 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020
2nd row2021
3rd row2022
4th row2023
5th row2020

Common Values

ValueCountFrequency (%)
2020 30
25.0%
2021 30
25.0%
2022 30
25.0%
2023 30
25.0%

Length

2024-04-21T03:32:57.907411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T03:32:57.988855image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020 30
25.0%
2021 30
25.0%
2022 30
25.0%
2023 30
25.0%

보증건수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct45
Distinct (%)37.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean17.425
Minimum0
Maximum86
Zeros5
Zeros (%)4.2%
Negative0
Negative (%)0.0%
Memory size1.2 KiB
2024-04-21T03:32:58.082957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q14.75
median11.5
Q325.5
95-th percentile54.05
Maximum86
Range86
Interquartile range (IQR)20.75

Descriptive statistics

Standard deviation17.515257
Coefficient of variation (CV)1.0051798
Kurtosis2.9462433
Mean17.425
Median Absolute Deviation (MAD)8.5
Skewness1.6568726
Sum2091
Variance306.78424
MonotonicityNot monotonic
2024-04-21T03:32:58.189101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=45)
ValueCountFrequency (%)
2 10
 
8.3%
10 7
 
5.8%
7 7
 
5.8%
13 7
 
5.8%
3 6
 
5.0%
4 5
 
4.2%
0 5
 
4.2%
11 5
 
4.2%
5 4
 
3.3%
1 4
 
3.3%
Other values (35) 60
50.0%
ValueCountFrequency (%)
0 5
4.2%
1 4
 
3.3%
2 10
8.3%
3 6
5.0%
4 5
4.2%
5 4
 
3.3%
6 3
 
2.5%
7 7
5.8%
8 3
 
2.5%
9 1
 
0.8%
ValueCountFrequency (%)
86 1
0.8%
80 1
0.8%
74 1
0.8%
60 1
0.8%
57 1
0.8%
55 1
0.8%
54 1
0.8%
52 1
0.8%
48 1
0.8%
47 1
0.8%

보증금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct111
Distinct (%)92.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.1786567 × 109
Minimum0
Maximum1.11928 × 1010
Zeros5
Zeros (%)4.2%
Negative0
Negative (%)0.0%
Memory size1.2 KiB
2024-04-21T03:32:58.302080image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile31300000
Q13.047 × 108
median8.525 × 108
Q31.448 × 109
95-th percentile3.20956 × 109
Maximum1.11928 × 1010
Range1.11928 × 1010
Interquartile range (IQR)1.1433 × 109

Descriptive statistics

Standard deviation1.4546146 × 109
Coefficient of variation (CV)1.2341292
Kurtosis20.806275
Mean1.1786567 × 109
Median Absolute Deviation (MAD)5.8145 × 108
Skewness3.7604128
Sum1.414388 × 1011
Variance2.1159037 × 1018
MonotonicityNot monotonic
2024-04-21T03:32:58.415739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 5
 
4.2%
90000000 3
 
2.5%
225000000 2
 
1.7%
378000000 2
 
1.7%
99000000 2
 
1.7%
645500000 1
 
0.8%
326000000 1
 
0.8%
2704500000 1
 
0.8%
252000000 1
 
0.8%
2755700000 1
 
0.8%
Other values (101) 101
84.2%
ValueCountFrequency (%)
0 5
4.2%
18000000 1
 
0.8%
32000000 1
 
0.8%
38000000 1
 
0.8%
45000000 1
 
0.8%
51500000 1
 
0.8%
59600000 1
 
0.8%
65000000 1
 
0.8%
72000000 1
 
0.8%
74000000 1
 
0.8%
ValueCountFrequency (%)
11192800000 1
0.8%
7515900000 1
0.8%
4557000000 1
0.8%
4299900000 1
0.8%
4166500000 1
0.8%
3243000000 1
0.8%
3207800000 1
0.8%
2926000000 1
0.8%
2755700000 1
0.8%
2704500000 1
0.8%

Interactions

2024-04-21T03:32:57.456310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:32:57.176788image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:32:57.520532image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:32:57.284310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T03:32:58.488467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군명기준년도보증건수보증금액
시군명1.0000.0000.6190.443
기준년도0.0001.0000.4880.362
보증건수0.6190.4881.0000.781
보증금액0.4430.3620.7811.000
2024-04-21T03:32:58.562748image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군명기준년도
시군명1.0000.000
기준년도0.0001.000
2024-04-21T03:32:58.626726image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
보증건수보증금액시군명기준년도
보증건수1.0000.8970.2510.326
보증금액0.8971.0000.1750.252
시군명0.2510.1751.0000.000
기준년도0.3260.2520.0001.000

Missing values

2024-04-21T03:32:57.618691image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T03:32:57.704686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시군명기준년도보증건수보증금액
0가평군2020118000000
1가평군20215136000000
2가평군202200
3가평군2023259600000
4고양시202091678500000
5고양시2021292538000000
6고양시2022265000000
7고양시2023272135800000
8과천시2020272000000
9과천시202100
시군명기준년도보증건수보증금액
110포천시20223225000000
111포천시202311919100000
112하남시20207297000000
113하남시202119797500000
114하남시2022290000000
115하남시2023131429900000
116화성시2020472661500000
117화성시2021804166500000
118화성시202217864000000
119화성시20238611192800000