Overview

Dataset statistics

Number of variables4
Number of observations85
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.0 KiB
Average record size in memory36.6 B

Variable types

Categorical2
Numeric2

Dataset

Description청년전용창업자금 관련 데이터로 최근 5년(2018년부터 2022년까지)의 연도별 지원금액과 지원지역 및 지원 건수를 확인할 수있습니다
Author중소벤처기업진흥공단
URLhttps://www.data.go.kr/data/15107136/fileData.do

Alerts

지원금액(백만원) is highly overall correlated with 지원건수High correlation
지원건수 is highly overall correlated with 지원금액(백만원)High correlation

Reproduction

Analysis started2023-12-12 08:24:48.110669
Analysis finished2023-12-12 08:24:48.812804
Duration0.7 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

지원연도
Categorical

Distinct5
Distinct (%)5.9%
Missing0
Missing (%)0.0%
Memory size812.0 B
2018
17 
2019
17 
2020
17 
2021
17 
2022
17 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2018
2nd row2018
3rd row2018
4th row2018
5th row2018

Common Values

ValueCountFrequency (%)
2018 17
20.0%
2019 17
20.0%
2020 17
20.0%
2021 17
20.0%
2022 17
20.0%

Length

2023-12-12T17:24:48.884571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:24:49.001721image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2018 17
20.0%
2019 17
20.0%
2020 17
20.0%
2021 17
20.0%
2022 17
20.0%

지역
Categorical

Distinct17
Distinct (%)20.0%
Missing0
Missing (%)0.0%
Memory size812.0 B
강원도
 
5
경기도
 
5
경상남도
 
5
경상북도
 
5
광주광역시
 
5
Other values (12)
60 

Length

Max length7
Median length5
Mean length4.6470588
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row강원도
2nd row경기도
3rd row경상남도
4th row경상북도
5th row광주광역시

Common Values

ValueCountFrequency (%)
강원도 5
 
5.9%
경기도 5
 
5.9%
경상남도 5
 
5.9%
경상북도 5
 
5.9%
광주광역시 5
 
5.9%
대구광역시 5
 
5.9%
대전광역시 5
 
5.9%
부산광역시 5
 
5.9%
서울특별시 5
 
5.9%
세종특별자치시 5
 
5.9%
Other values (7) 35
41.2%

Length

2023-12-12T17:24:49.161737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
강원도 5
 
5.9%
세종특별자치시 5
 
5.9%
충청남도 5
 
5.9%
제주특별자치도 5
 
5.9%
전라북도 5
 
5.9%
전라남도 5
 
5.9%
인천광역시 5
 
5.9%
울산광역시 5
 
5.9%
서울특별시 5
 
5.9%
경기도 5
 
5.9%
Other values (7) 35
41.2%

지원금액(백만원)
Real number (ℝ)

HIGH CORRELATION 

Distinct81
Distinct (%)95.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10117.647
Minimum580
Maximum49750
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size897.0 B
2023-12-12T17:24:49.341525image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum580
5-th percentile1534
Q14000
median6530
Q39450
95-th percentile37808
Maximum49750
Range49170
Interquartile range (IQR)5450

Descriptive statistics

Standard deviation11513.653
Coefficient of variation (CV)1.1379774
Kurtosis4.569568
Mean10117.647
Median Absolute Deviation (MAD)2730
Skewness2.3172657
Sum860000
Variance1.3256422 × 108
MonotonicityNot monotonic
2023-12-12T17:24:49.526524image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2900 2
 
2.4%
4000 2
 
2.4%
8700 2
 
2.4%
10330 2
 
2.4%
6530 1
 
1.2%
9350 1
 
1.2%
1530 1
 
1.2%
49750 1
 
1.2%
13200 1
 
1.2%
5090 1
 
1.2%
Other values (71) 71
83.5%
ValueCountFrequency (%)
580 1
1.2%
790 1
1.2%
1070 1
1.2%
1500 1
1.2%
1530 1
1.2%
1550 1
1.2%
1600 1
1.2%
1940 1
1.2%
2200 1
1.2%
2420 1
1.2%
ValueCountFrequency (%)
49750 1
1.2%
48900 1
1.2%
48080 1
1.2%
45600 1
1.2%
38180 1
1.2%
36320 1
1.2%
35440 1
1.2%
32410 1
1.2%
30590 1
1.2%
29450 1
1.2%

지원건수
Real number (ℝ)

HIGH CORRELATION 

Distinct68
Distinct (%)80.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean109.94118
Minimum6
Maximum543
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size897.0 B
2023-12-12T17:24:49.696280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum6
5-th percentile18.2
Q144
median74
Q399
95-th percentile461.8
Maximum543
Range537
Interquartile range (IQR)55

Descriptive statistics

Standard deviation129.9808
Coefficient of variation (CV)1.1822759
Kurtosis4.019428
Mean109.94118
Median Absolute Deviation (MAD)29
Skewness2.2768455
Sum9345
Variance16895.008
MonotonicityNot monotonic
2023-12-12T17:24:49.865732image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
53 3
 
3.5%
27 2
 
2.4%
55 2
 
2.4%
29 2
 
2.4%
115 2
 
2.4%
83 2
 
2.4%
52 2
 
2.4%
75 2
 
2.4%
26 2
 
2.4%
45 2
 
2.4%
Other values (58) 64
75.3%
ValueCountFrequency (%)
6 1
1.2%
10 2
2.4%
14 1
1.2%
18 1
1.2%
19 1
1.2%
21 2
2.4%
22 1
1.2%
25 1
1.2%
26 2
2.4%
27 2
2.4%
ValueCountFrequency (%)
543 1
1.2%
517 1
1.2%
507 1
1.2%
481 1
1.2%
471 1
1.2%
425 1
1.2%
414 1
1.2%
394 1
1.2%
390 1
1.2%
349 1
1.2%

Interactions

2023-12-12T17:24:48.456426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:24:48.284427image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:24:48.548669image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:24:48.368284image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T17:24:49.979522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지원연도지역지원금액(백만원)지원건수
지원연도1.0000.0000.2170.194
지역0.0001.0000.7410.781
지원금액(백만원)0.2170.7411.0000.986
지원건수0.1940.7810.9861.000
2023-12-12T17:24:50.094836image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지원연도지역
지원연도1.0000.000
지역0.0001.000
2023-12-12T17:24:50.190828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지원금액(백만원)지원건수지원연도지역
지원금액(백만원)1.0000.9760.1440.423
지원건수0.9761.0000.1200.464
지원연도0.1440.1201.0000.000
지역0.4230.4640.0001.000

Missing values

2023-12-12T17:24:48.671093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:24:48.777841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

지원연도지역지원금액(백만원)지원건수
02018강원도242027
12018경기도32410425
22018경상남도10330105
32018경상북도776087
42018광주광역시573070
52018대구광역시9150103
62018대전광역시492059
72018부산광역시10330106
82018서울특별시36320471
92018세종특별자치시79010
지원연도지역지원금액(백만원)지원건수
752022부산광역시12400103
762022서울특별시48900481
772022세종특별자치시310029
782022울산광역시360032
792022인천광역시812094
802022전라남도584052
812022전라북도870086
822022제주특별자치도290027
832022충청남도804077
842022충청북도810068