Overview

Dataset statistics

Number of variables3
Number of observations39
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.1 KiB
Average record size in memory29.4 B

Variable types

Numeric2
Categorical1

Dataset

Description경기도 교육재정 업무추진비(결산) 현황
Author교육부
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=FH6GVPOYCDFVNTF8T46P23777107&infSeq=2

Alerts

금액(원) is highly overall correlated with 항목구분명High correlation
항목구분명 is highly overall correlated with 금액(원)High correlation
금액(원) has 13 (33.3%) zerosZeros

Reproduction

Analysis started2023-12-10 21:23:05.844222
Analysis finished2023-12-10 21:23:06.432099
Duration0.59 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

회계연도
Real number (ℝ)

Distinct13
Distinct (%)33.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2016
Minimum2010
Maximum2022
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size483.0 B
2023-12-11T06:23:06.486542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2010
5-th percentile2010
Q12013
median2016
Q32019
95-th percentile2022
Maximum2022
Range12
Interquartile range (IQR)6

Descriptive statistics

Standard deviation3.79057
Coefficient of variation (CV)0.0018802431
Kurtosis-1.2145002
Mean2016
Median Absolute Deviation (MAD)3
Skewness0
Sum78624
Variance14.368421
MonotonicityDecreasing
2023-12-11T06:23:06.612662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%)
2022 3
 
7.7%
2021 3
 
7.7%
2020 3
 
7.7%
2019 3
 
7.7%
2018 3
 
7.7%
2017 3
 
7.7%
2016 3
 
7.7%
2015 3
 
7.7%
2014 3
 
7.7%
2013 3
 
7.7%
Other values (3) 9
23.1%
ValueCountFrequency (%)
2010 3
7.7%
2011 3
7.7%
2012 3
7.7%
2013 3
7.7%
2014 3
7.7%
2015 3
7.7%
2016 3
7.7%
2017 3
7.7%
2018 3
7.7%
2019 3
7.7%
ValueCountFrequency (%)
2022 3
7.7%
2021 3
7.7%
2020 3
7.7%
2019 3
7.7%
2018 3
7.7%
2017 3
7.7%
2016 3
7.7%
2015 3
7.7%
2014 3
7.7%
2013 3
7.7%

항목구분명
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)7.7%
Missing0
Missing (%)0.0%
Memory size444.0 B
비율
13 
세출결산액
13 
업무추진비
13 

Length

Max length5
Median length5
Mean length4
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row비율
2nd row세출결산액
3rd row업무추진비
4th row비율
5th row세출결산액

Common Values

ValueCountFrequency (%)
비율 13
33.3%
세출결산액 13
33.3%
업무추진비 13
33.3%

Length

2023-12-11T06:23:06.755737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:23:06.884830image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
비율 13
33.3%
세출결산액 13
33.3%
업무추진비 13
33.3%

금액(원)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct27
Distinct (%)69.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.7029908 × 1012
Minimum0
Maximum2.234023 × 1013
Zeros13
Zeros (%)33.3%
Negative0
Negative (%)0.0%
Memory size483.0 B
2023-12-11T06:23:06.984062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median4.2022552 × 109
Q31.0733763 × 1013
95-th percentile1.7618198 × 1013
Maximum2.234023 × 1013
Range2.234023 × 1013
Interquartile range (IQR)1.0733763 × 1013

Descriptive statistics

Standard deviation7.1023718 × 1012
Coefficient of variation (CV)1.510182
Kurtosis-0.37502048
Mean4.7029908 × 1012
Median Absolute Deviation (MAD)4.2022552 × 109
Skewness1.0779845
Sum1.8341664 × 1014
Variance5.0443686 × 1025
MonotonicityNot monotonic
2023-12-11T06:23:07.089228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=27)
ValueCountFrequency (%)
0 13
33.3%
22340229541920 1
 
2.6%
4639388650 1
 
2.6%
8501558999800 1
 
2.6%
4063951050 1
 
2.6%
9675254653090 1
 
2.6%
3503532760 1
 
2.6%
10394583989270 1
 
2.6%
4202255160 1
 
2.6%
11072941197190 1
 
2.6%
Other values (17) 17
43.6%
ValueCountFrequency (%)
0 13
33.3%
3483700700 1
 
2.6%
3503532760 1
 
2.6%
3726241070 1
 
2.6%
3762388720 1
 
2.6%
3950108250 1
 
2.6%
4063951050 1
 
2.6%
4202255160 1
 
2.6%
4240729930 1
 
2.6%
4639388650 1
 
2.6%
ValueCountFrequency (%)
22340229541920 1
2.6%
18513543282370 1
2.6%
17518715068140 1
2.6%
17324705111050 1
2.6%
15846948377420 1
2.6%
14178389528660 1
2.6%
13363529014800 1
2.6%
12486574088260 1
2.6%
12140148462570 1
2.6%
11072941197190 1
2.6%

Interactions

2023-12-11T06:23:06.143362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:23:05.933819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:23:06.224727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:23:06.060246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T06:23:07.164202image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
회계연도항목구분명금액(원)
회계연도1.0000.0000.000
항목구분명0.0001.0000.730
금액(원)0.0000.7301.000
2023-12-11T06:23:07.239976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
회계연도금액(원)항목구분명
회계연도1.0000.1770.000
금액(원)0.1771.0000.577
항목구분명0.0000.5771.000

Missing values

2023-12-11T06:23:06.325663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T06:23:06.404310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

회계연도항목구분명금액(원)
02022비율0
12022세출결산액22340229541920
22022업무추진비7748173560
32021비율0
42021세출결산액18513543282370
52021업무추진비5870802450
62020비율0
72020세출결산액17324705111050
82020업무추진비5080667760
92019비율0
회계연도항목구분명금액(원)
292013업무추진비4202255160
302012비율0
312012세출결산액10394583989270
322012업무추진비3503532760
332011비율0
342011세출결산액9675254653090
352011업무추진비4063951050
362010비율0
372010세출결산액8501558999800
382010업무추진비4639388650