Overview

Dataset statistics

Number of variables: 3
Number of observations: 918
Missing cells: 517
Missing cells (%): 18.8%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 23.4 KiB
Average record size in memory: 26.1 B
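
The two missing-value percentages in this report use different denominators: the dataset-level figure divides by all cells (rows × variables), while the per-variable figure divides by rows only. A minimal sketch of that arithmetic, using only the counts quoted in this report (the helper name `missing_share` is illustrative and not part of ydata-profiling):

```python
def missing_share(missing: int, total: int) -> float:
    """Return the share of missing entries as a percentage."""
    return 100.0 * missing / total


n_rows, n_cols, n_missing = 918, 3, 517

# Dataset level: all 517 missing cells relative to 918 * 3 = 2754 cells.
cells_pct = missing_share(n_missing, n_rows * n_cols)

# Variable level: the same 517 cells all sit in one column of 918 rows.
column_pct = missing_share(n_missing, n_rows)

print(f"{cells_pct:.1f}% of all cells, {column_pct:.1f}% of one column")
```

Both figures describe the same 517 cells; only the denominator changes.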

Variable types

Numeric: 2
Categorical: 1

Dataset

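Description: The Korea Mine Rehabilitation and Mineral Resources Corporation (한국광해광업공단) runs support programs for the coal and coal-briquette industries in order to maintain the coal industry's production base and keep the supply of briquettes stable. Through these programs it contributes to resource security, the protection of low-income households' livelihoods, and job creation in closed-mine areas. This dataset provides the corresponding record of coal industry support.
URL: https://www.data.go.kr/data/15067742/fileData.do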

Alerts

지원금액(백만원) has 517 (56.3%) missing values (alert type: Missing)

Reproduction

Analysis started: 2023-12-12 12:15:58.369184
Analysis finished: 2023-12-12 12:15:59.008358
Duration: 0.64 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연도 (Year)
Real number (ℝ)

Distinct: 34
Distinct (%): 3.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2005.5
Minimum: 1989
Maximum: 2022
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 8.2 KiB

Quantile statistics

Minimum: 1989
5-th percentile: 1990
Q1: 1997
Median: 2005.5
Q3: 2014
95-th percentile: 2021
Maximum: 2022
Range: 33
Interquartile range (IQR): 17

Descriptive statistics

Standard deviation: 9.8160563
Coefficient of variation (CV): 0.0048945681
Kurtosis: -1.2020864
Mean: 2005.5
Median Absolute Deviation (MAD): 8.5
Skewness: 0
Sum: 1841049
Variance: 96.354962
Monotonicity: Increasing
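
The figures above are consistent with every year from 1989 to 2022 appearing exactly 27 times (34 × 27 = 918 rows). A minimal sketch, assuming that layout (it is inferred from the frequency counts in this report, not read from the original file), recomputes several of the statistics with Python's standard library. The variance is taken to be the sample variance (divisor n - 1), which is the convention that matches the reported value.

```python
import statistics

# Assumed layout: each year 1989..2022 repeated 27 times, in increasing order.
years = [year for year in range(1989, 2023) for _ in range(27)]

mean = statistics.mean(years)          # arithmetic mean
variance = statistics.variance(years)  # sample variance (divides by n - 1)
std = statistics.stdev(years)          # sample standard deviation
cv = std / mean                        # coefficient of variation

# Median absolute deviation around the median.
median = statistics.median(years)
mad = statistics.median([abs(year - median) for year in years])

print(len(years), sum(years), mean, median, mad)
print(round(variance, 6), round(std, 7))
```

Because the mean is large relative to the spread, the CV is tiny; for a calendar-year column it carries little information.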
Histogram with fixed size bins (bins=34)

Common values

Value              Count  Frequency (%)
1989               27     2.9%
2015               27     2.9%
2009               27     2.9%
2010               27     2.9%
2011               27     2.9%
2012               27     2.9%
2013               27     2.9%
2014               27     2.9%
2016               27     2.9%
2007               27     2.9%
Other values (24)  648    70.6%

Minimum 10 values

Value  Count  Frequency (%)
1989   27     2.9%
1990   27     2.9%
1991   27     2.9%
1992   27     2.9%
1993   27     2.9%
1994   27     2.9%
1995   27     2.9%
1996   27     2.9%
1997   27     2.9%
1998   27     2.9%

Maximum 10 values

Value  Count  Frequency (%)
2022   27     2.9%
2021   27     2.9%
2020   27     2.9%
2019   27     2.9%
2018   27     2.9%
2017   27     2.9%
2016   27     2.9%
2015   27     2.9%
2014   27     2.9%
2013   27     2.9%

지원구분 (Support category)
Categorical

Distinct: 27
Distinct (%): 2.9%
Missing: 0
Missing (%): 0.0%
Memory size: 7.3 KiB

Value              Count
보조지원(전체)       34
생산장려금           34
탐사비              34
대단위화             34
갱도굴진             34
Other values (22)  748

Length

Max length: 8
Median length: 6
Mean length: 5.0740741
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 보조지원(전체)
2nd row: 생산장려금
3rd row: 탐사비
4th row: 대단위화
5th row: 갱도굴진

Common Values

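Value              English gloss                     Count  Frequency (%)
보조지원(전체)       Subsidy support (total)           34     3.7%
생산장려금           Production incentive              34     3.7%
탐사비              Exploration costs                 34     3.7%
대단위화             Large-scale consolidation         34     3.7%
갱도굴진             Tunnel excavation                 34     3.7%
기계화              Mechanization                     34     3.7%
광산지역공해방지      Mining-area pollution prevention  34     3.7%
안전시설             Safety facilities                 34     3.7%
출자금              Capital contribution              34     3.7%
출연금              Contributions                     34     3.7%
Other values (17)                                    578    63.0%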
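
The frequency columns in this report follow from the shape of the data: 27 support categories, each recorded once for each of the 34 years. A short sketch of that arithmetic, assuming this balanced layout (it is implied by the counts shown here rather than stated in the source; the helper `pct` is illustrative):

```python
n_years = 2022 - 1989 + 1          # 34 distinct years
n_categories = 27                  # distinct values of 지원구분
n_rows = n_years * n_categories    # 918 observations


def pct(count: int, total: int = n_rows) -> float:
    """Percentage of rows, rounded to one decimal as in the report."""
    return round(100 * count / total, 1)


# Each category occurs once per year; each year occurs once per category.
per_category = pct(n_years)
per_year = pct(n_categories)

# "Other values" rows: everything outside the ten values listed in a table.
other_categories = pct((n_categories - 10) * n_years)
other_years = pct((n_years - 10) * n_categories)

print(n_rows, per_category, other_categories, per_year, other_years)
```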

Length

Histogram of lengths of the category

Value              Count  Frequency (%)
보조지원(전체        34     3.7%
재해위로금           34     3.7%
석탄광운영자금        34     3.7%
석탄광시설자금        34     3.7%
융자지원(전체        34     3.7%
기타               34     3.7%
생산안정지원금        34     3.7%
수송비              34     3.7%
산재보험료           34     3.7%
진폐기금             34     3.7%
Other values (17)  578    63.0%

지원금액(백만원) (Support amount, in million KRW)
Real number (ℝ)

Alerts: Missing

Distinct: 388
Distinct (%): 96.8%
Missing: 517
Missing (%): 56.3%
Infinite: 0
Infinite (%): 0.0%
Mean: 39815.414
Minimum: 0
Maximum: 462857
Zeros: 4
Zeros (%): 0.4%
Negative: 0
Negative (%): 0.0%
Memory size: 8.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 500
Q1: 4321
Median: 11453
Q3: 39455
95-th percentile: 196739
Maximum: 462857
Range: 462857
Interquartile range (IQR): 35134

Descriptive statistics

Standard deviation: 74886.065
Coefficient of variation (CV): 1.880831
Kurtosis: 14.497516
Mean: 39815.414
Median Absolute Deviation (MAD): 9259
Skewness: 3.5826264
Sum: 15965981
Variance: 5.6079228 × 10^9
Monotonicity: Not monotonic
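
Several of the figures above are derived from one another, and all of them are computed over the 401 non-missing values (918 rows minus 517 missing) rather than over all rows. A minimal sketch of those relationships, using only numbers quoted in this report (variable names are illustrative):

```python
n_rows, n_missing = 918, 517
n_valid = n_rows - n_missing        # 401 rows with a recorded amount

total = 15_965_981                  # reported Sum, in million KRW
std = 74886.065                     # reported standard deviation
q1, q3 = 4321, 39455                # reported quartiles
minimum, maximum = 0, 462857

mean = total / n_valid              # mean over non-missing values only
cv = std / mean                     # coefficient of variation
variance = std ** 2                 # variance is the squared standard deviation
iqr = q3 - q1                       # interquartile range
value_range = maximum - minimum
distinct_pct = 100 * 388 / n_valid  # Distinct (%) also uses non-missing rows

print(n_valid, round(mean, 3), round(cv, 6), iqr, value_range)
```

A CV well above 1, together with a mean more than three times the median, reflects the strong right skew of the amounts.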
Histogram with fixed size bins (bins=50)

Common values

Value               Count  Frequency (%)
0                   4      0.4%
7678                3      0.3%
50000               2      0.2%
2860                2      0.2%
14000               2      0.2%
9000                2      0.2%
6000                2      0.2%
2182                2      0.2%
2957                2      0.2%
10000               2      0.2%
Other values (378)  378    41.2%
(Missing)           517    56.3%

Minimum 10 values

Value  Count  Frequency (%)
0      4      0.4%
12     1      0.1%
39     1      0.1%
69     1      0.1%
75     1      0.1%
133    1      0.1%
140    1      0.1%
162    1      0.1%
194    1      0.1%
228    1      0.1%

Maximum 10 values

Value   Count  Frequency (%)
462857  1      0.1%
455983  1      0.1%
450108  1      0.1%
439024  1      0.1%
435783  1      0.1%
432913  1      0.1%
419967  1      0.1%
357201  1      0.1%
279787  1      0.1%
258429  1      0.1%
0.1%

Interactions


Correlations

                  연도     지원구분   지원금액(백만원)
연도               1.000  0.000    0.160
지원구분            0.000  1.000    0.628
지원금액(백만원)     0.160  0.628    1.000

                  연도     지원금액(백만원)  지원구분
연도               1.000  0.109          0.000
지원금액(백만원)     0.109  1.000          0.255
지원구분            0.000  0.255          1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

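First rows

Index  연도   지원구분          지원금액(백만원)  English gloss of 지원구분
0      1989  보조지원(전체)     221869         Subsidy support (total)
1      1989  생산장려금        2411           Production incentive
2      1989  탐사비           4535           Exploration costs
3      1989  대단위화          758            Large-scale consolidation
4      1989  갱도굴진          14308          Tunnel excavation
5      1989  기계화           9424           Mechanization
6      1989  광산지역공해방지    5633           Mining-area pollution prevention
7      1989  안전시설          4815           Safety facilities
8      1989  출자금           9000           Capital contribution
9      1989  출연금           1570           Contributions

Last rows

Index  연도   지원구분          지원금액(백만원)  English gloss of 지원구분
908    2022  학자금           1887           School expenses
909    2022  진폐기금          <NA>           Pneumoconiosis fund
910    2022  산재보험료        14613          Industrial accident insurance premiums
911    2022  수송비           <NA>           Transport costs
912    2022  생산안정지원금     15711          Production stabilization subsidy
913    2022  기타             <NA>           Other
914    2022  융자지원(전체)     <NA>           Loan support (total)
915    2022  석탄광시설자금     <NA>           Coal mine facility funds
916    2022  석탄광운영자금     <NA>           Coal mine operating funds
917    2022  대체산업창업지원    <NA>           Alternative-industry start-up support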
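
The last rows show how the missing amounts appear in practice: several 2022 categories carry `<NA>` instead of a number. A minimal sketch of reading such rows while keeping missing values distinct from zero follows. It assumes a comma-separated layout with `<NA>` as the missing marker, which is an illustration only; the actual file format on data.go.kr is not described in this report, and `parse_amount` is a hypothetical helper.

```python
import csv
import io

# The ten last rows from the Sample section, written out as CSV text
# (the CSV layout itself is an assumption made for this example).
SAMPLE = """연도,지원구분,지원금액(백만원)
2022,학자금,1887
2022,진폐기금,<NA>
2022,산재보험료,14613
2022,수송비,<NA>
2022,생산안정지원금,15711
2022,기타,<NA>
2022,융자지원(전체),<NA>
2022,석탄광시설자금,<NA>
2022,석탄광운영자금,<NA>
2022,대체산업창업지원,<NA>
"""


def parse_amount(text: str):
    """Return the amount as int, or None when the cell is missing."""
    return None if text in ("", "<NA>") else int(text)


rows = [
    {
        "연도": int(record["연도"]),
        "지원구분": record["지원구분"],
        "지원금액(백만원)": parse_amount(record["지원금액(백만원)"]),
    }
    for record in csv.DictReader(io.StringIO(SAMPLE))
]

missing = [row["지원구분"] for row in rows if row["지원금액(백만원)"] is None]
present_total = sum(
    row["지원금액(백만원)"] for row in rows if row["지원금액(백만원)"] is not None
)

print(len(missing), "of", len(rows), "amounts missing")
```

Keeping `None` separate from `0` matters here, because the column also contains four genuine zeros.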