Overview

Dataset statistics

Number of variables3
Number of observations204
Missing cells52
Missing cells (%)8.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.3 KiB
Average record size in memory26.6 B

Variable types

Numeric2
Categorical1

Dataset

Description한국광해광업공단은 석탄산업의 생산 기반 유지와 연탄의 안정적인 공급을 위해 석·연탄산업 지원 사업을 실시하고 있으며 이를 통해 자원안보와 서민생활보호 및 폐광지역 고용창출 등에 이바지하고 있습니다. 석탄광, 연탄업계 지원 내역 등을 공개합니다.
URLhttps://www.data.go.kr/data/15067741/fileData.do

Alerts

지원금액(백만원) has 52 (25.5%) missing valuesMissing

Reproduction

Analysis started2023-12-12 07:45:21.024170
Analysis finished2023-12-12 07:45:21.707954
Duration0.68 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연도
Real number (ℝ)

Distinct34
Distinct (%)16.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2005.5
Minimum1989
Maximum2022
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-12T16:45:21.774852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1989
5-th percentile1990
Q11997
median2005.5
Q32014
95-th percentile2021
Maximum2022
Range33
Interquartile range (IQR)17

Descriptive statistics

Standard deviation9.8348431
Coefficient of variation (CV)0.0049039357
Kurtosis-1.2020707
Mean2005.5
Median Absolute Deviation (MAD)8.5
Skewness0
Sum409122
Variance96.724138
MonotonicityIncreasing
2023-12-12T16:45:22.240358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=34)
ValueCountFrequency (%)
1989 6
 
2.9%
2015 6
 
2.9%
2009 6
 
2.9%
2010 6
 
2.9%
2011 6
 
2.9%
2012 6
 
2.9%
2013 6
 
2.9%
2014 6
 
2.9%
2016 6
 
2.9%
2007 6
 
2.9%
Other values (24) 144
70.6%
ValueCountFrequency (%)
1989 6
2.9%
1990 6
2.9%
1991 6
2.9%
1992 6
2.9%
1993 6
2.9%
1994 6
2.9%
1995 6
2.9%
1996 6
2.9%
1997 6
2.9%
1998 6
2.9%
ValueCountFrequency (%)
2022 6
2.9%
2021 6
2.9%
2020 6
2.9%
2019 6
2.9%
2018 6
2.9%
2017 6
2.9%
2016 6
2.9%
2015 6
2.9%
2014 6
2.9%
2013 6
2.9%

구분
Categorical

Distinct6
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
석탄광(생산탄광) 보조
34 
석탄광(생산탄광) 가격지원
34 
석탄광(생산탄광) 융자
34 
석탄광(폐광탄광) 보조
34 
연탄공장 보조
34 

Length

Max length14
Median length13
Mean length10.666667
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row석탄광(생산탄광) 보조
2nd row석탄광(생산탄광) 가격지원
3rd row석탄광(생산탄광) 융자
4th row석탄광(폐광탄광) 보조
5th row연탄공장 보조

Common Values

ValueCountFrequency (%)
석탄광(생산탄광) 보조 34
16.7%
석탄광(생산탄광) 가격지원 34
16.7%
석탄광(생산탄광) 융자 34
16.7%
석탄광(폐광탄광) 보조 34
16.7%
연탄공장 보조 34
16.7%
연탄공장 융자 34
16.7%

Length

2023-12-12T16:45:22.435698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:45:22.566780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
석탄광(생산탄광 102
25.0%
보조 102
25.0%
융자 68
16.7%
연탄공장 68
16.7%
가격지원 34
 
8.3%
석탄광(폐광탄광 34
 
8.3%

지원금액(백만원)
Real number (ℝ)

MISSING 

Distinct151
Distinct (%)99.3%
Missing52
Missing (%)25.5%
Infinite0
Infinite (%)0.0%
Mean99585.224
Minimum163
Maximum713682
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-12T16:45:22.705361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum163
5-th percentile2460.3
Q133644
median61683
Q3136196
95-th percentile317207.35
Maximum713682
Range713519
Interquartile range (IQR)102552

Descriptive statistics

Standard deviation107196.15
Coefficient of variation (CV)1.0764263
Kurtosis7.8504228
Mean99585.224
Median Absolute Deviation (MAD)36943.5
Skewness2.3865368
Sum15136954
Variance1.1491015 × 1010
MonotonicityNot monotonic
2023-12-12T16:45:22.840082image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
62229 2
 
1.0%
22362 1
 
0.5%
167241 1
 
0.5%
64921 1
 
0.5%
102393 1
 
0.5%
3593 1
 
0.5%
102368 1
 
0.5%
25087 1
 
0.5%
932 1
 
0.5%
47083 1
 
0.5%
Other values (141) 141
69.1%
(Missing) 52
 
25.5%
ValueCountFrequency (%)
163 1
0.5%
237 1
0.5%
523 1
0.5%
674 1
0.5%
932 1
0.5%
1247 1
0.5%
1344 1
0.5%
1445 1
0.5%
3291 1
0.5%
3593 1
0.5%
ValueCountFrequency (%)
713682 1
0.5%
456806 1
0.5%
431666 1
0.5%
406471 1
0.5%
376134 1
0.5%
363116 1
0.5%
349052 1
0.5%
346040 1
0.5%
293617 1
0.5%
292121 1
0.5%

Interactions

2023-12-12T16:45:21.332662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:45:21.125792image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:45:21.438766image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:45:21.218769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T16:45:22.944453image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도구분지원금액(백만원)
연도1.0000.0000.336
구분0.0001.0000.447
지원금액(백만원)0.3360.4471.000
2023-12-12T16:45:23.041050image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도지원금액(백만원)구분
연도1.000-0.1950.000
지원금액(백만원)-0.1951.0000.266
구분0.0000.2661.000

Missing values

2023-12-12T16:45:21.584216image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T16:45:21.675383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연도구분지원금액(백만원)
01989석탄광(생산탄광) 보조108507
11989석탄광(생산탄광) 가격지원47083
21989석탄광(생산탄광) 융자43063
31989석탄광(폐광탄광) 보조113362
41989연탄공장 보조26555
51989연탄공장 융자220438
61990석탄광(생산탄광) 보조152053
71990석탄광(생산탄광) 가격지원84024
81990석탄광(생산탄광) 융자38273
91990석탄광(폐광탄광) 보조37504
연도구분지원금액(백만원)
1942021석탄광(생산탄광) 융자<NA>
1952021석탄광(폐광탄광) 보조63304
1962021연탄공장 보조25433
1972021연탄공장 융자<NA>
1982022석탄광(생산탄광) 보조41112
1992022석탄광(생산탄광) 가격지원33264
2002022석탄광(생산탄광) 융자<NA>
2012022석탄광(폐광탄광) 보조44848
2022022연탄공장 보조20155
2032022연탄공장 융자<NA>