Overview

Dataset statistics

Number of variables5
Number of observations80
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)1.2%
Total size in memory3.5 KiB
Average record size in memory44.6 B

Variable types

Numeric3
Categorical2

Dataset

Description한국지역난방공사의 연도별 중소기업 지원 실적입니다. (연도, 사업명, 지원금액(백만원), 기업수(개)에 대한 데이터 입니다.)
Author한국지역난방공사
URLhttps://www.data.go.kr/data/15071147/fileData.do

Alerts

지원금액 단위 has constant value ""Constant
Dataset has 1 (1.2%) duplicate rowsDuplicates
지원금액 has 1 (1.2%) zerosZeros

Reproduction

Analysis started2024-04-06 08:19:57.519906
Analysis finished2024-04-06 08:20:00.699559
Duration3.18 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연도
Real number (ℝ)

Distinct9
Distinct (%)11.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2019.95
Minimum2015
Maximum2023
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size852.0 B
2024-04-06T17:20:00.902732image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2015
5-th percentile2016
Q12019
median2020
Q32021.25
95-th percentile2023
Maximum2023
Range8
Interquartile range (IQR)2.25

Descriptive statistics

Standard deviation2.0369995
Coefficient of variation (CV)0.0010084406
Kurtosis-0.21724148
Mean2019.95
Median Absolute Deviation (MAD)1
Skewness-0.57572921
Sum161596
Variance4.1493671
MonotonicityIncreasing
2024-04-06T17:20:01.203674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
2020 17
21.2%
2021 14
17.5%
2019 13
16.2%
2022 13
16.2%
2023 7
8.8%
2016 5
 
6.2%
2018 5
 
6.2%
2017 4
 
5.0%
2015 2
 
2.5%
ValueCountFrequency (%)
2015 2
 
2.5%
2016 5
 
6.2%
2017 4
 
5.0%
2018 5
 
6.2%
2019 13
16.2%
2020 17
21.2%
2021 14
17.5%
2022 13
16.2%
2023 7
8.8%
ValueCountFrequency (%)
2023 7
8.8%
2022 13
16.2%
2021 14
17.5%
2020 17
21.2%
2019 13
16.2%
2018 5
 
6.2%
2017 4
 
5.0%
2016 5
 
6.2%
2015 2
 
2.5%

사업명
Categorical

Distinct34
Distinct (%)42.5%
Missing0
Missing (%)0.0%
Memory size772.0 B
해외마케팅지원사업
기술개발촉진사업
산업혁신운동
 
5
직간적자금지원
 
4
혁신파트너십
 
4
Other values (29)
52 

Length

Max length20
Median length14
Mean length9.6
Min length4

Unique

Unique16 ?
Unique (%)20.0%

Sample

1st row해외마케팅지원사업
2nd row산업혁신운동
3rd row기술개발촉진사업
4th row해외마케팅지원사업
5th row해외동반진출로드쇼

Common Values

ValueCountFrequency (%)
해외마케팅지원사업 8
 
10.0%
기술개발촉진사업 7
 
8.8%
산업혁신운동 5
 
6.2%
직간적자금지원 4
 
5.0%
혁신파트너십 4
 
5.0%
창업벤처기업 지원 4
 
5.0%
구매조건부신제품개발 4
 
5.0%
협력업체 장기재직지원 4
 
5.0%
스마트공장구축 지원 4
 
5.0%
임치지원 3
 
3.8%
Other values (24) 33
41.2%

Length

2024-04-06T17:20:01.501054image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
지원 15
 
11.2%
해외마케팅지원사업 8
 
6.0%
기술개발촉진사업 7
 
5.2%
창업벤처기업 6
 
4.5%
협력업체 6
 
4.5%
스마트공장구축 5
 
3.7%
산업혁신운동 5
 
3.7%
혁신파트너십 4
 
3.0%
구매조건부신제품개발 4
 
3.0%
직간적자금지원 4
 
3.0%
Other values (40) 70
52.2%

지원금액
Real number (ℝ)

ZEROS 

Distinct55
Distinct (%)68.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean281.86625
Minimum0
Maximum6255
Zeros1
Zeros (%)1.2%
Negative0
Negative (%)0.0%
Memory size852.0 B
2024-04-06T17:20:01.786845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0.99
Q111.5
median32
Q3127
95-th percentile748.4
Maximum6255
Range6255
Interquartile range (IQR)115.5

Descriptive statistics

Standard deviation943.94965
Coefficient of variation (CV)3.3489275
Kurtosis26.2754
Mean281.86625
Median Absolute Deviation (MAD)31
Skewness4.9994772
Sum22549.3
Variance891040.93
MonotonicityNot monotonic
2024-04-06T17:20:02.071036image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
200.0 5
 
6.2%
1.0 4
 
5.0%
10.0 4
 
5.0%
20.0 3
 
3.8%
18.0 3
 
3.8%
120.0 2
 
2.5%
23.0 2
 
2.5%
80.0 2
 
2.5%
108.0 2
 
2.5%
6.0 2
 
2.5%
Other values (45) 51
63.7%
ValueCountFrequency (%)
0.0 1
 
1.2%
0.2 1
 
1.2%
0.55 1
 
1.2%
0.8 1
 
1.2%
1.0 4
5.0%
1.2 1
 
1.2%
2.0 1
 
1.2%
3.5 1
 
1.2%
3.8 1
 
1.2%
6.0 2
2.5%
ValueCountFrequency (%)
6255.0 1
1.2%
4669.0 1
1.2%
2702.0 1
1.2%
2637.0 1
1.2%
649.0 1
1.2%
518.0 1
1.2%
481.0 1
1.2%
250.0 1
1.2%
212.0 1
1.2%
210.0 1
1.2%

지원금액 단위
Categorical

CONSTANT 

Distinct1
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size772.0 B
백만원
80 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row백만원
2nd row백만원
3rd row백만원
4th row백만원
5th row백만원

Common Values

ValueCountFrequency (%)
백만원 80
100.0%

Length

2024-04-06T17:20:02.330800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:20:02.534062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
백만원 80
100.0%

기업수
Real number (ℝ)

Distinct27
Distinct (%)33.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean16.3625
Minimum1
Maximum280
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size852.0 B
2024-04-06T17:20:02.755568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q14
median6
Q313
95-th percentile55.5
Maximum280
Range279
Interquartile range (IQR)9

Descriptive statistics

Standard deviation36.303309
Coefficient of variation (CV)2.2186896
Kurtosis36.631635
Mean16.3625
Median Absolute Deviation (MAD)3
Skewness5.5774421
Sum1309
Variance1317.9302
MonotonicityNot monotonic
2024-04-06T17:20:02.986404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=27)
ValueCountFrequency (%)
5 12
15.0%
4 9
11.2%
7 7
 
8.8%
3 7
 
8.8%
2 5
 
6.2%
1 5
 
6.2%
6 4
 
5.0%
10 3
 
3.8%
20 3
 
3.8%
12 3
 
3.8%
Other values (17) 22
27.5%
ValueCountFrequency (%)
1 5
6.2%
2 5
6.2%
3 7
8.8%
4 9
11.2%
5 12
15.0%
6 4
 
5.0%
7 7
8.8%
8 3
 
3.8%
9 1
 
1.2%
10 3
 
3.8%
ValueCountFrequency (%)
280 1
1.2%
120 1
1.2%
116 1
1.2%
65 1
1.2%
55 1
1.2%
45 2
2.5%
30 1
1.2%
29 1
1.2%
26 1
1.2%
24 1
1.2%

Interactions

2024-04-06T17:19:59.133068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:19:57.797795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:19:58.365799image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:19:59.437867image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:19:58.005594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:19:58.633206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:19:59.659952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:19:58.183761image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:19:58.891505image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-06T17:20:03.147367image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도사업명지원금액기업수
연도1.0000.0000.0000.149
사업명0.0001.0000.0000.788
지원금액0.0000.0001.0000.371
기업수0.1490.7880.3711.000
2024-04-06T17:20:03.348947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도지원금액기업수사업명
연도1.000-0.0380.3510.000
지원금액-0.0381.0000.2900.000
기업수0.3510.2901.0000.395
사업명0.0000.0000.3951.000

Missing values

2024-04-06T17:20:00.337723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-06T17:20:00.624289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연도사업명지원금액지원금액 단위기업수
02015해외마케팅지원사업18.0백만원4
12015산업혁신운동136.0백만원8
22016기술개발촉진사업32.0백만원6
32016해외마케팅지원사업93.0백만원13
42016해외동반진출로드쇼18.0백만원4
52016산업혁신운동108.0백만원6
62016상생서포터즈 청년창업 프로그램28.0백만원6
72017기술개발촉진사업22.0백만원4
82017해외마케팅지원사업60.0백만원5
92017산업혁신운동86.0백만원5
연도사업명지원금액지원금액 단위기업수
702022구매조건부신제품개발649.0백만원3
712022창업벤처기업 지원200.0백만원20
722022정선 마을기업 축제 지원10.0백만원1
732023협력 중소기업 휴가비 지원15.0백만원29
742023협력 중소기업 장기재직 지원6.0백만원13
752023혁신파트너십160.0백만원8
762023스마트공장구축 지원120.0백만원12
772023협력 중소기업 기술보호 지원0.0백만원7
782023협력 중소기업 추석 명절선물9.75백만원65
792023창업벤처기업 지원200.0백만원20

Duplicate rows

Most frequently occurring

연도사업명지원금액지원금액 단위기업수# duplicates
02020임치지원1.0백만원42