Overview

Dataset statistics

Number of variables: 6
Number of observations: 5026
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 245.5 KiB
Average record size in memory: 50.0 B
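
The average record size can be cross-checked from the two figures above it: total memory divided by the number of observations. A minimal sketch, assuming 1 KiB = 1024 bytes (the variable names are illustrative and not part of the report):

```python
# Figures reported under "Dataset statistics".
total_size_kib = 245.5
n_observations = 5026

# Assumption: 1 KiB = 1024 bytes.
total_size_bytes = total_size_kib * 1024
avg_record_bytes = total_size_bytes / n_observations

print(f"{avg_record_bytes:.1f} B")  # prints "50.0 B"
```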

Variable types

Categorical: 4
Text: 1
Numeric: 1

Dataset

Description: Status of education-related support budgets
Author: 행정안전부 (Ministry of the Interior and Safety)
URL: https://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=CVLUL9RU4CJQ6T0XD1Z522289696&infSeq=1

Alerts

High correlation: 시군명 is highly overall correlated with 회계연도
High correlation: 회계연도 is highly overall correlated with 시군명
High correlation: 구분명 is highly overall correlated with 상세코드명
High correlation: 상세코드명 is highly overall correlated with 구분명
Imbalance: 시군명 is highly imbalanced (76.5%)
Skewed: 예산액(원) is highly skewed (γ1 = 30.67275518)
Zeros: 예산액(원) has 3596 (71.5%) zeros
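
The 76.5% imbalance figure is consistent with an entropy-based score: 1 minus the Shannon entropy of the class distribution, normalised by log2 of the number of classes. The sketch below assumes that definition (the report itself does not spell it out) and uses the 시군명 counts listed later in the report: one `<NA>` category with 4386 rows and 32 named regions with 20 rows each. The zeros percentage is a plain ratio.

```python
import math

n_rows = 5026

# 시군명 value counts as reported: "<NA>" 4386 times, 32 regions x 20 rows.
counts = [4386] + [20] * 32

def imbalance_score(counts):
    """1 - normalised Shannon entropy (assumed definition).

    0 means perfectly uniform classes, values near 1 mean one class dominates.
    Requires at least two classes.
    """
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1 - entropy / math.log2(len(counts))

imbalance_pct = 100 * imbalance_score(counts)
zeros_pct = 100 * 3596 / n_rows

print(f"imbalance: {imbalance_pct:.1f}%")  # imbalance: 76.5%
print(f"zeros: {zeros_pct:.1f}%")          # zeros: 71.5%
```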

Reproduction

Analysis started: 2023-12-10 22:27:20.588096
Analysis finished: 2023-12-10 22:27:21.189125
Duration: 0.6 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
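
The reported duration is simply the difference between the two timestamps. A small check using only the Python standard library:

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" block.
started = datetime.fromisoformat("2023-12-10 22:27:20.588096")
finished = datetime.fromisoformat("2023-12-10 22:27:21.189125")

duration_s = (finished - started).total_seconds()
print(f"{duration_s:.1f} seconds")  # prints "0.6 seconds"
```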

Variables

회계연도
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 39.4 KiB

Value | Count
2022 | 4386
2023 | 640

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023
2nd row: 2023
3rd row: 2023
4th row: 2023
5th row: 2023

Common Values

Value | Count | Frequency (%)
2022 | 4386 | 87.3%
2023 | 640 | 12.7%
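
The frequency column is each count divided by the total number of observations. A short sketch that reproduces the two percentages for 회계연도:

```python
# Counts reported for 회계연도.
counts = {"2022": 4386, "2023": 640}
total = sum(counts.values())  # 5026 observations

shares = {value: round(100 * n / total, 1) for value, n in counts.items()}
print(shares)  # {'2022': 87.3, '2023': 12.7}
```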

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2022 | 4386 | 87.3%
2023 | 640 | 12.7%

시군명
Categorical

HIGH CORRELATION, IMBALANCE

Distinct: 33
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Memory size: 39.4 KiB

Value | Count
<NA> | 4386
경기도 | 20
고양시 | 20
과천시 | 20
광명시 | 20
Other values (28) | 560

Length

Max length: 4
Median length: 4
Mean length: 3.8846001
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 가평군
2nd row: 가평군
3rd row: 가평군
4th row: 가평군
5th row: 가평군

Common Values

Value | Count | Frequency (%)
<NA> | 4386 | 87.3%
경기도 | 20 | 0.4%
고양시 | 20 | 0.4%
과천시 | 20 | 0.4%
광명시 | 20 | 0.4%
광주시 | 20 | 0.4%
구리시 | 20 | 0.4%
군포시 | 20 | 0.4%
김포시 | 20 | 0.4%
남양주시 | 20 | 0.4%
Other values (23) | 460 | 9.2%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
na | 4386 | 87.3%
안양시 | 20 | 0.4%
화성시 | 20 | 0.4%
하남시 | 20 | 0.4%
포천시 | 20 | 0.4%
평택시 | 20 | 0.4%
파주시 | 20 | 0.4%
이천시 | 20 | 0.4%
의정부시 | 20 | 0.4%
의왕시 | 20 | 0.4%
Other values (23) | 460 | 9.2%

자치단체명
Text

Distinct: 243
Distinct (%): 4.8%
Missing: 0
Missing (%): 0.0%
Memory size: 39.4 KiB

Length

Max length: 6
Median length: 5
Mean length: 4.8873856
Min length: 4

Characters and Unicode

Total characters: 24564
Distinct characters: 133
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1

The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 경기가평군
2nd row: 경기가평군
3rd row: 경기가평군
4th row: 경기가평군
5th row: 경기가평군

Value | Count | Frequency (%)
경기본청 | 39 | 0.8%
경기여주시 | 38 | 0.8%
경기안성시 | 38 | 0.8%
경기양주시 | 38 | 0.8%
경기양평군 | 38 | 0.8%
경기연천군 | 38 | 0.8%
경기오산시 | 38 | 0.8%
경기용인시 | 38 | 0.8%
경기의왕시 | 38 | 0.8%
경기파주시 | 38 | 0.8%
Other values (233) | 4645 | 92.4%

Most occurring characters

Value | Count | Frequency (%)
 | 2047 | 8.3%
 | 1948 | 7.9%
 | 1610 | 6.6%
 | 1515 | 6.2%
 | 1322 | 5.4%
 | 1235 | 5.0%
 | 1028 | 4.2%
 | 813 | 3.3%
 | 743 | 3.0%
 | 715 | 2.9%
Other values (123) | 11588 | 47.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 24564 | 100.0%

Most frequent character per category

Other Letter: identical to the table under "Most occurring characters", because all 24564 characters fall in this category.

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 24564 | 100.0%

Most frequent character per script

Hangul: identical to the table under "Most occurring characters", because all 24564 characters belong to this script.

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 24564 | 100.0%

Most frequent character per block

Hangul: identical to the table under "Most occurring characters", because all 24564 characters belong to this block.

구분명
Categorical

HIGH CORRELATION 

Distinct: 10
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 39.4 KiB

Value | Count
교육경비보조금 | 1925
학교용지매입비 | 825
법정부담금 | 825
학교급식보조 | 550
0 | 498
Other values (5) | 403

Length

Max length: 15
Median length: 7
Mean length: 6.0340231
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 교육경비보조금
2nd row: 교육경비보조금
3rd row: 학교급식보조
4th row: 학교용지매입비
5th row: 학교용지매입비

Common Values

Value | Count | Frequency (%)
교육경비보조금 | 1925 | 38.3%
학교용지매입비 | 825 | 16.4%
법정부담금 | 825 | 16.4%
학교급식보조 | 550 | 10.9%
0 | 498 | 9.9%
비법정부담금 | 275 | 5.5%
교육재정교부금법 | 32 | 0.6%
시군구교육비특별회계법정전출금 | 32 | 0.6%
지방세법에따른시도법정전출금 | 32 | 0.6%
지방세법(이양사업) | 32 | 0.6%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
교육경비보조금 | 1925 | 38.3%
학교용지매입비 | 825 | 16.4%
법정부담금 | 825 | 16.4%
학교급식보조 | 550 | 10.9%
0 | 498 | 9.9%
비법정부담금 | 275 | 5.5%
교육재정교부금법 | 32 | 0.6%
시군구교육비특별회계법정전출금 | 32 | 0.6%
지방세법에따른시도법정전출금 | 32 | 0.6%
지방세법(이양사업 | 32 | 0.6%

상세코드명
Categorical

HIGH CORRELATION 

Distinct: 20
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 39.4 KiB

Value | Count
기타 | 550
교육정보화 | 275
급식비 | 275
취등록세 | 275
학교용지부담금 | 275
Other values (15) | 3376

Length

Max length: 16
Median length: 11
Mean length: 5.9972145
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 교육시설개선
2nd row: 교육과정운영
3rd row: 급식비
4th row: 기타
5th row: 취등록세

Common Values

Value | Count | Frequency (%)
기타 | 550 | 10.9%
교육정보화 | 275 | 5.5%
급식비 | 275 | 5.5%
취등록세 | 275 | 5.5%
학교용지부담금 | 275 | 5.5%
비법정부담금 | 275 | 5.5%
지방교육세 | 275 | 5.5%
담배소비세 | 275 | 5.5%
교육시설개선 | 275 | 5.5%
시도세 | 275 | 5.5%
Other values (10) | 2001 | 39.8%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
기타 | 550 | 10.9%
시도세 | 275 | 5.4%
교육과정운영 | 275 | 5.4%
급식시설 | 275 | 5.4%
시군구교육비특별회계법정전출금 | 275 | 5.4%
고교무상교육 | 275 | 5.4%
공공도서관운영지원 | 275 | 5.4%
교육정보화 | 275 | 5.4%
지역주민교과과정 | 275 | 5.4%
지역체육문화공간시설 | 275 | 5.4%
Other values (12) | 2025 | 40.1%

예산액(원)
Real number (ℝ)

SKEWED, ZEROS

Distinct: 1284
Distinct (%): 25.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3.8925319 × 10^9
Minimum: 0
Maximum: 2.3045 × 10^12
Zeros: 3596
Zeros (%): 71.5%
Negative: 0
Negative (%): 0.0%
Memory size: 44.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 65821250
95-th percentile: 4.6498932 × 10^9
Maximum: 2.3045 × 10^12
Range: 2.3045 × 10^12
Interquartile range (IQR): 65821250
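
The zero quantiles follow directly from the zero count. With 3596 of 5026 values equal to zero and no negative values, the sorted data starts with 3596 zeros, so every quantile that falls inside that run is 0. The sketch below assumes the common linear-interpolation convention, where quantile q sits at 0-based position q × (n − 1); the function name is illustrative.

```python
import math

n = 5026         # observations
n_zeros = 3596   # reported zeros; with no negatives they fill sorted positions 0..3595

def is_zero_quantile(q):
    """True if quantile q is 0 under linear interpolation between order statistics."""
    pos = q * (n - 1)                     # 0-based fractional position
    return math.ceil(pos) <= n_zeros - 1  # both neighbouring values must be zeros

for q in (0.05, 0.25, 0.50, 0.75):
    print(q, is_zero_quantile(q))
# 0.05 True
# 0.25 True
# 0.5 True
# 0.75 False

q1, q3 = 0, 65821250   # reported Q1 and Q3
iqr = q3 - q1
print(iqr)  # 65821250
```

The same reasoning explains why the interquartile range equals Q3: Q1 lies inside the run of zeros, so the subtraction leaves Q3 unchanged.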

Descriptive statistics

Standard deviation: 6.1451742 × 10^10
Coefficient of variation (CV): 15.787088
Kurtosis: 1032.4518
Mean: 3.8925319 × 10^9
Median Absolute Deviation (MAD): 0
Skewness: 30.672755
Sum: 1.9563865 × 10^13
Variance: 3.7763166 × 10^21
Monotonicity: Not monotonic
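
Several of these statistics are tied together by simple identities: mean = sum / n, CV = standard deviation / mean, and variance = standard deviation squared. A quick consistency check on the reported (rounded) figures, with a tolerance chosen to absorb the rounding:

```python
import math

n = 5026                 # observations
total = 1.9563865e13     # Sum
mean = 3.8925319e9       # Mean
std = 6.1451742e10       # Standard deviation
variance = 3.7763166e21  # Variance
cv = 15.787088           # Coefficient of variation

# The reported values are rounded to 8 significant digits, so compare loosely.
mean_ok = math.isclose(total / n, mean, rel_tol=1e-6)
cv_ok = math.isclose(std / mean, cv, rel_tol=1e-6)
var_ok = math.isclose(std ** 2, variance, rel_tol=1e-6)

print(mean_ok, cv_ok, var_ok)  # True True True
```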
Histogram with fixed size bins (bins=50)

Value | Count | Frequency (%)
0 | 3596 | 71.5%
300674000 | 24 | 0.5%
20000000 | 14 | 0.3%
200000000 | 12 | 0.2%
100000000 | 11 | 0.2%
50000000 | 9 | 0.2%
10000000 | 9 | 0.2%
30000000 | 7 | 0.1%
40000000 | 7 | 0.1%
70000000 | 7 | 0.1%
Other values (1274) | 1330 | 26.5%

Value | Count | Frequency (%)
0 | 3596 | 71.5%
1068000 | 1 | < 0.1%
1079000 | 1 | < 0.1%
2000000 | 1 | < 0.1%
2670000 | 1 | < 0.1%
3000000 | 1 | < 0.1%
3516000 | 1 | < 0.1%
4263000 | 1 | < 0.1%
5000000 | 3 | 0.1%
6000000 | 2 | < 0.1%

Value | Count | Frequency (%)
2304500000000 | 1 | < 0.1%
2212600000000 | 1 | < 0.1%
1966277000000 | 1 | < 0.1%
1626814100000 | 1 | < 0.1%
715634950000 | 1 | < 0.1%
644185000000 | 1 | < 0.1%
425085000000 | 1 | < 0.1%
385314000000 | 1 | < 0.1%
371098263000 | 1 | < 0.1%
287500000000 | 1 | < 0.1%

Interactions


Correlations

Variable | 회계연도 | 시군명 | 구분명 | 상세코드명 | 예산액(원)
회계연도 | 1.000 | NaN | 0.562 | 0.369 | 0.030
시군명 | NaN | 1.000 | 0.000 | 0.000 | 0.026
구분명 | 0.562 | 0.000 | 1.000 | 0.995 | 0.040
상세코드명 | 0.369 | 0.000 | 0.995 | 1.000 | 0.123
예산액(원) | 0.030 | 0.026 | 0.040 | 0.123 | 1.000

Variable | 구분명 | 시군명 | 회계연도 | 상세코드명
구분명 | 1.000 | 0.000 | 0.434 | 0.873
시군명 | 0.000 | 1.000 | 1.000 | 0.000
회계연도 | 0.434 | 1.000 | 1.000 | 0.291
상세코드명 | 0.873 | 0.000 | 0.291 | 1.000

Variable | 예산액(원) | 회계연도 | 시군명 | 구분명 | 상세코드명
예산액(원) | 1.000 | 0.032 | 0.009 | 0.020 | 0.053
회계연도 | 0.032 | 1.000 | 1.000 | 0.434 | 0.291
시군명 | 0.009 | 1.000 | 1.000 | 0.000 | 0.000
구분명 | 0.020 | 0.434 | 0.000 | 1.000 | 0.873
상세코드명 | 0.053 | 0.291 | 0.000 | 0.873 | 1.000
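
The extracted text does not say which association measure each matrix uses. One common choice for pairs of categorical variables is Cramér's V, and the sketch below shows how a value of 1.000 between 회계연도 and 시군명 can arise. It rests on assumptions: it uses the uncorrected form of Cramér's V (profiling tools may apply bias corrections), it assumes every 2022 row has 시군명 `<NA>` and the 640 rows for 2023 are split over 32 regions with 20 rows each (consistent with the reported counts), and the region names are placeholders.

```python
import math
from collections import Counter

def cramers_v(pairs):
    """Uncorrected Cramér's V for a list of (a, b) category pairs."""
    n = len(pairs)
    joint = Counter(pairs)
    row_tot = Counter(a for a, _ in pairs)
    col_tot = Counter(b for _, b in pairs)
    chi2 = 0.0
    for a in row_tot:
        for b in col_tot:
            expected = row_tot[a] * col_tot[b] / n
            observed = joint.get((a, b), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_tot), len(col_tot)) - 1
    return math.sqrt(chi2 / (n * k))

# Assumed layout: 4386 rows (2022, "<NA>") plus 32 regions x 20 rows for 2023.
pairs = [("2022", "<NA>")] * 4386
for i in range(32):
    pairs += [("2023", f"region_{i}")] * 20  # placeholder region names

v_dependent = cramers_v(pairs)

# Contrast: two fully independent binary variables.
independent = [(a, b) for a in "xy" for b in "pq"] * 10
v_independent = cramers_v(independent)

print(round(v_dependent, 3), round(v_independent, 3))  # 1.0 0.0
```

Under these assumptions the year is fully determined by the region value, which is why the association reaches its maximum.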

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

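Index | 회계연도 | 시군명 | 자치단체명 | 구분명 | 상세코드명 | 예산액(원)
0 | 2023 | 가평군 | 경기가평군 | 교육경비보조금 | 교육시설개선 | 80789000
1 | 2023 | 가평군 | 경기가평군 | 교육경비보조금 | 교육과정운영 | 3359855000
2 | 2023 | 가평군 | 경기가평군 | 학교급식보조 | 급식비 | 680000000
3 | 2023 | 가평군 | 경기가평군 | 학교용지매입비 | 기타 | 0
4 | 2023 | 가평군 | 경기가평군 | 학교용지매입비 | 취등록세 | 0
5 | 2023 | 가평군 | 경기가평군 | 학교용지매입비 | 학교용지부담금 | 0
6 | 2023 | 가평군 | 경기가평군 | 비법정부담금 | 비법정부담금 | 0
7 | 2023 | 가평군 | 경기가평군 | 법정부담금 | 지방교육세 | 0
8 | 2023 | 가평군 | 경기가평군 | 법정부담금 | 담배소비세 | 0
9 | 2023 | 가평군 | 경기가평군 | 법정부담금 | 시도세 | 0

Index | 회계연도 | 시군명 | 자치단체명 | 구분명 | 상세코드명 | 예산액(원)
5016 | 2022 | <NA> | 울산북구 | 학교급식보조 | 급식비 | 3512363000
5017 | 2022 | <NA> | 울산북구 | 학교급식보조 | 급식시설 | 0
5018 | 2022 | <NA> | 울산북구 | 교육경비보조금 | 교육시설개선 | 0
5019 | 2022 | <NA> | 울산북구 | 교육경비보조금 | 교육과정운영 | 400000000
5020 | 2022 | <NA> | 울산북구 | 교육경비보조금 | 교육정보화 | 0
5021 | 2022 | <NA> | 울산북구 | 교육경비보조금 | 지역주민교과과정 | 0
5022 | 2022 | <NA> | 울산북구 | 교육경비보조금 | 지역체육문화공간시설 | 0
5023 | 2022 | <NA> | 울산북구 | 교육경비보조금 | 공공도서관운영지원 | 0
5024 | 2022 | <NA> | 울산북구 | 교육경비보조금 | 기타 | 262167000
5025 | 2022 | <NA> | 울산북구 | 0 | 고교무상교육 | 0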