Overview

Dataset statistics

Number of variables: 3
Number of observations: 213
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 5
Duplicate rows (%): 2.3%
Total size in memory: 5.5 KiB
Average record size in memory: 26.6 B

Variable types

Numeric: 2
Categorical: 1

Dataset

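Description: Gyeonggi-do education finance: status of expenditures by function and by policy program, settled accounts (경기도 교육재정 사업별 세출 기능별·정책사업별(결산) 현황)
Author: Ministry of Education (교육부)
URL: https://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=YMRHT271U9W11J80TC6K23645757&infSeq=2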

Alerts

Duplicates: Dataset has 5 (2.3%) duplicate rows
Zeros: 금액(원) has 19 (8.9%) zeros
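The zeros alert boils down to counting zero entries in a column and dividing by the number of rows. The sketch below illustrates that arithmetic with the standard library only; `zero_share` is a hypothetical helper, not part of ydata-profiling, and `sample_amounts` holds only the 20 금액(원) values shown in the report's Sample section, not the full 213-row dataset.

```python
def zero_share(values):
    """Return (number of zeros, zeros as a percentage of all values)."""
    zeros = sum(1 for v in values if v == 0)
    return zeros, 100 * zeros / len(values)


# 금액(원) values of the 20 rows listed in the Sample section (first 10 and last 10 rows).
sample_amounts = [
    1267731590700, 1471104993350, 3641967407650, 181574884180, 132049127100,
    1015719372720, 22340229541920, 0, 0, 0,
    8501558999800, 0, 8102680615890, 5028222787510, 249161200620,
    86907500, 11101916130, 11188823630, 938173280630, 1148644383420,
]

count, percent = zero_share(sample_amounts)
print(count, percent)  # 4 20.0

# The figure in the alert follows from the full-dataset counts: 19 zeros in 213 rows.
reported_percent = round(100 * 19 / 213, 1)
print(reported_percent)  # 8.9
```

The sample overstates the share of zeros (20% versus 8.9%) because it covers only the head and tail of the table.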

Reproduction

Analysis started: 2023-12-10 21:40:19.721933
Analysis finished: 2023-12-10 21:40:20.399509
Duration: 0.68 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
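A report like this one can usually be regenerated with a few lines of Python. The following is a minimal sketch, not the exact script used here: `build_report`, `csv_path`, `output_path`, the report title, and the `encoding` default are all assumptions, and the original `config.json` settings are not reproduced. The third-party imports are placed inside the function so that the module loads even where pandas or ydata-profiling is not installed.

```python
def build_report(csv_path, output_path="report.html", encoding="utf-8"):
    """Profile a CSV file and write an HTML report (hypothetical helper)."""
    # Imported lazily: both packages are third-party dependencies.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # The encoding is an assumption; adjust it if the downloaded file differs.
    df = pd.read_csv(csv_path, encoding=encoding)

    report = ProfileReport(df, title="Gyeonggi-do education finance expenditures")
    report.to_file(output_path)
    return report
```

Calling `build_report("expenditures.csv")` with a local copy of the dataset (the file name is a placeholder) would write `report.html` to the working directory.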

Variables

회계연도 (fiscal year)
Real number (ℝ)

Distinct: 13
Distinct (%): 6.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2016.1362
Minimum: 2010
Maximum: 2022
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.0 KiB

Quantile statistics

Minimum: 2010
5th percentile: 2010
Q1: 2013
Median: 2016
Q3: 2019
95th percentile: 2022
Maximum: 2022
Range: 12
Interquartile range (IQR): 6

Descriptive statistics

Standard deviation: 3.8097762
Coefficient of variation (CV): 0.0018896423
Kurtosis: -1.236811
Mean: 2016.1362
Median Absolute Deviation (MAD): 3
Skewness: -0.023109103
Sum: 429437
Variance: 14.514395
Monotonicity: Decreasing
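Several of the descriptive statistics for 회계연도 are derived from one another, so they can be cross-checked with simple arithmetic. The sketch below uses only figures copied from this report (213 observations, the sum, the standard deviation, the quartiles, and the extremes); the variable names are illustrative.

```python
# Figures copied from the report for 회계연도.
n = 213
total = 429437
std = 3.8097762
q1, q3 = 2013, 2019
minimum, maximum = 2010, 2022

mean = total / n             # mean = sum / number of observations
cv = std / mean              # coefficient of variation = std / mean
variance = std ** 2          # variance = std squared
iqr = q3 - q1                # interquartile range = Q3 - Q1
value_range = maximum - minimum

print(f"mean={mean:.4f}")
print(f"cv={cv:.10f}")
print(f"variance={variance:.6f}")
print(f"iqr={iqr} range={value_range}")
```

The results agree with the reported mean (2016.1362), CV (0.0018896423), variance (14.514395), IQR (6), and range (12) up to rounding.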
Histogram with fixed size bins (bins=13)
Common values
Value | Count | Frequency (%)
2022 | 20 | 9.4%
2021 | 17 | 8.0%
2020 | 16 | 7.5%
2019 | 16 | 7.5%
2018 | 16 | 7.5%
2017 | 16 | 7.5%
2016 | 16 | 7.5%
2015 | 16 | 7.5%
2014 | 16 | 7.5%
2013 | 16 | 7.5%
Other values (3) | 48 | 22.5%
Minimum 10 values
Value | Count | Frequency (%)
2010 | 16 | 7.5%
2011 | 16 | 7.5%
2012 | 16 | 7.5%
2013 | 16 | 7.5%
2014 | 16 | 7.5%
2015 | 16 | 7.5%
2016 | 16 | 7.5%
2017 | 16 | 7.5%
2018 | 16 | 7.5%
2019 | 16 | 7.5%
Maximum 10 values
Value | Count | Frequency (%)
2022 | 20 | 9.4%
2021 | 17 | 8.0%
2020 | 16 | 7.5%
2019 | 16 | 7.5%
2018 | 16 | 7.5%
2017 | 16 | 7.5%
2016 | 16 | 7.5%
2015 | 16 | 7.5%
2014 | 16 | 7.5%
2013 | 16 | 7.5%

항목구분명 (item category name)
Categorical

Distinct: 17
Distinct (%): 8.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.8 KiB
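Value | Count
평생교육 (Lifelong education) | 26
예비비 및 기타 (Reserve funds and other) | 14
학교재정지원관리 (School finance support management) | 13
교육일반 (General education) | 13
교육행정일반 (General education administration) | 13
Other values (12) | 134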

Length

Max length: 10
Median length: 9
Mean length: 6.3192488
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

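1st row: 교수-학습활동지원 (Teaching-learning activity support)
2nd row: 교육복지지원 (Education welfare support)
3rd row: 교육일반 (General education)
4th row: 교육행정일반 (General education administration)
5th row: 기관운영 (Institutional operations)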

Common Values

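Value | Count | Frequency (%)
평생교육 (Lifelong education) | 26 | 12.2%
예비비 및 기타 (Reserve funds and other) | 14 | 6.6%
학교재정지원관리 (School finance support management) | 13 | 6.1%
교육일반 (General education) | 13 | 6.1%
교육행정일반 (General education administration) | 13 | 6.1%
기관운영 (Institutional operations) | 13 | 6.1%
보건/급식/체육활동 (Health/school meals/physical education activities) | 13 | 6.1%
세출결산액 (Total settled expenditure) | 13 | 6.1%
교수-학습활동지원 (Teaching-learning activity support) | 13 | 6.1%
유아 및 초중등교육 (Early childhood and primary/secondary education) | 13 | 6.1%
Other values (7) | 69 | 32.4%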

Length

Histogram of lengths of the category
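Value | Count | Frequency (%)
(blank) | 27 | 10.1%
평생교육 (Lifelong education) | 26 | 9.7%
예비비 (Reserve funds) | 16 | 6.0%
기타 (Other) | 14 | 5.2%
유아 (Early childhood) | 13 | 4.9%
교육복지지원 (Education welfare support) | 13 | 4.9%
학교교육여건개선시설 (School educational environment improvement facilities) | 13 | 4.9%
직업교육 (Vocational education) | 13 | 4.9%
재무활동 (Financial activities) | 13 | 4.9%
인적자원운용 (Human resources management) | 13 | 4.9%
Other values (9) | 106 | 39.7%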

금액(원) (amount in KRW)
Real number (ℝ)

ZEROS 

Distinct: 191
Distinct (%): 89.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.5824947 × 10^12
Minimum: 0
Maximum: 2.234023 × 10^13
Zeros: 19
Zeros (%): 8.9%
Negative: 0
Negative (%): 0.0%
Memory size: 2.0 KiB

Quantile statistics

Minimum: 0
5th percentile: 0
Q1: 2.4473776 × 10^10
Median: 5.7218546 × 10^11
Q3: 1.7600288 × 10^12
95th percentile: 1.3217611 × 10^13
Maximum: 2.234023 × 10^13
Range: 2.234023 × 10^13
Interquartile range (IQR): 1.7355551 × 10^12

Descriptive statistics

Standard deviation: 4.552279 × 10^12
Coefficient of variation (CV): 1.7627448
Kurtosis: 3.8000952
Mean: 2.5824947 × 10^12
Median Absolute Deviation (MAD): 5.6357584 × 10^11
Skewness: 2.1394551
Sum: 5.5007136 × 10^14
Variance: 2.0723244 × 10^25
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
0 | 19 | 8.9%
12880310950 | 2 | 0.9%
10473809939460 | 2 | 0.9%
14281067060 | 2 | 0.9%
24473776210 | 2 | 0.9%
1267731590700 | 1 | 0.5%
11150471970 | 1 | 0.5%
61709196345 | 1 | 0.5%
563805923620 | 1 | 0.5%
12140148462570 | 1 | 0.5%
Other values (181) | 181 | 85.0%
Minimum 10 values
Value | Count | Frequency (%)
0 | 19 | 8.9%
86907500 | 1 | 0.5%
1082087500 | 1 | 0.5%
1090000000 | 1 | 0.5%
1200000000 | 1 | 0.5%
1683785450 | 1 | 0.5%
1717819970 | 1 | 0.5%
1775390000 | 1 | 0.5%
2012410510 | 1 | 0.5%
2696651320 | 1 | 0.5%
Maximum 10 values
Value | Count | Frequency (%)
22340229541920 | 1 | 0.5%
18513543282370 | 1 | 0.5%
17518715068140 | 1 | 0.5%
17406604853030 | 1 | 0.5%
17324705111050 | 1 | 0.5%
16545281341560 | 1 | 0.5%
15846948377420 | 1 | 0.5%
15212323190230 | 1 | 0.5%
14491561889850 | 1 | 0.5%
14178389528660 | 1 | 0.5%

Interactions

[Figures: four interaction plots]

Correlations

Variable | 회계연도 | 항목구분명 | 금액(원)
회계연도 | 1.000 | 0.000 | 0.118
항목구분명 | 0.000 | 1.000 | 0.736
금액(원) | 0.118 | 0.736 | 1.000
Variable | 회계연도 | 금액(원) | 항목구분명
회계연도 | 1.000 | 0.118 | 0.000
금액(원) | 0.118 | 1.000 | 0.390
항목구분명 | 0.000 | 0.390 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

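First rows
Index | 회계연도 | 항목구분명 | 금액(원)
0 | 2022 | 교수-학습활동지원 (Teaching-learning activity support) | 1267731590700
1 | 2022 | 교육복지지원 (Education welfare support) | 1471104993350
2 | 2022 | 교육일반 (General education) | 3641967407650
3 | 2022 | 교육행정일반 (General education administration) | 181574884180
4 | 2022 | 기관운영 (Institutional operations) | 132049127100
5 | 2022 | 보건/급식/체육활동 (Health/school meals/physical education activities) | 1015719372720
6 | 2022 | 세출결산액 (Total settled expenditure) | 22340229541920
7 | 2022 | 예비비 (Reserve funds) | 0
8 | 2022 | 예비비 및 기타 (Reserve funds and other) | 0
9 | 2022 | 예비비 및 기타 (Reserve funds and other) | 0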
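Last rows
Index | 회계연도 | 항목구분명 | 금액(원)
203 | 2010 | 세출결산액 (Total settled expenditure) | 8501558999800
204 | 2010 | 예비비 및 기타 (Reserve funds and other) | 0
205 | 2010 | 유아 및 초중등교육 (Early childhood and primary/secondary education) | 8102680615890
206 | 2010 | 인적자원운용 (Human resources management) | 5028222787510
207 | 2010 | 재무활동 (Financial activities) | 249161200620
208 | 2010 | 직업교육 (Vocational education) | 86907500
209 | 2010 | 평생교육 (Lifelong education) | 11101916130
210 | 2010 | 평생교육 (Lifelong education) | 11188823630
211 | 2010 | 학교교육여건개선시설 (School educational environment improvement facilities) | 938173280630
212 | 2010 | 학교재정지원관리 (School finance support management) | 1148644383420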

Duplicate rows

Most frequently occurring

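Index | 회계연도 | 항목구분명 | 금액(원) | # duplicates
0 | 2011 | 평생교육 (Lifelong education) | 12880310950 | 2
1 | 2012 | 평생교육 (Lifelong education) | 14281067060 | 2
2 | 2022 | 예비비 및 기타 (Reserve funds and other) | 0 | 2
3 | 2022 | 인건비 (Personnel expenses) | 10473809939460 | 2
4 | 2022 | 평생교육 (Lifelong education) | 24473776210 | 2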
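A duplicate-row table of this kind can be rebuilt by counting identical rows. The sketch below is a standard-library illustration, not ydata-profiling's own implementation: `rows` is an assumed stand-in for the dataset that contains the five duplicated rows from the table above (each twice) plus three unique rows taken from the sample, and the variable names are illustrative.

```python
from collections import Counter

# (회계연도, 항목구분명, 금액(원)) tuples taken from this report.
duplicated_in_report = [
    (2011, "평생교육", 12880310950),
    (2012, "평생교육", 14281067060),
    (2022, "예비비 및 기타", 0),
    (2022, "인건비", 10473809939460),
    (2022, "평생교육", 24473776210),
]
unique_examples = [
    (2022, "교수-학습활동지원", 1267731590700),
    (2022, "교육복지지원", 1471104993350),
    (2022, "교육일반", 3641967407650),
]
rows = duplicated_in_report * 2 + unique_examples

# Count identical rows and keep those that occur more than once.
counts = Counter(rows)
duplicated = {row: c for row, c in counts.items() if c > 1}

n_distinct_duplicated = len(duplicated)                   # distinct rows that repeat
n_extra_copies = sum(c - 1 for c in duplicated.values())  # copies beyond the first

for row, c in sorted(duplicated.items()):
    print(row, c)

# Share of duplicate rows relative to the 213 observations in the full dataset.
duplicate_percent = round(100 * n_distinct_duplicated / 213, 1)
print(n_distinct_duplicated, n_extra_copies, duplicate_percent)  # 5 5 2.3
```

Because every duplicated row here appears exactly twice, counting distinct repeated rows and counting surplus copies both give 5, which matches the 2.3% reported in the overview.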