Overview

Dataset statistics

Number of variables6
Number of observations1738
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory86.7 KiB
Average record size in memory51.1 B

Variable types

Categorical3
Text1
Numeric2

Dataset

Description재원별 단체별 세입예산 현황
Author행정안전부
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=OQ6U8J0OQ69PD8KCTN4422696618&infSeq=1

Alerts

시군명 is highly overall correlated with 회계연도High correlation
회계연도 is highly overall correlated with 시군명High correlation
시군명 is highly imbalanced (78.8%)Imbalance
예산순계액(원) has 585 (33.7%) zerosZeros
예산총계액(원) has 71 (4.1%) zerosZeros

Reproduction

Analysis started2023-12-10 22:35:55.637519
Analysis finished2023-12-10 22:35:56.345867
Duration0.71 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

회계연도
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.7 KiB
2022
1542 
2023
196 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023
2nd row2023
3rd row2023
4th row2023
5th row2023

Common Values

ValueCountFrequency (%)
2022 1542
88.7%
2023 196
 
11.3%

Length

2023-12-11T07:35:56.397044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:35:56.475754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022 1542
88.7%
2023 196
 
11.3%

시군명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct33
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Memory size13.7 KiB
<NA>
1542 
고양시
 
7
평택시
 
7
군포시
 
7
양주시
 
7
Other values (28)
168 

Length

Max length4
Median length4
Mean length3.8975834
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row가평군
2nd row가평군
3rd row가평군
4th row가평군
5th row가평군

Common Values

ValueCountFrequency (%)
<NA> 1542
88.7%
고양시 7
 
0.4%
평택시 7
 
0.4%
군포시 7
 
0.4%
양주시 7
 
0.4%
부천시 7
 
0.4%
남양주시 6
 
0.3%
시흥시 6
 
0.3%
수원시 6
 
0.3%
성남시 6
 
0.3%
Other values (23) 137
 
7.9%

Length

2023-12-11T07:35:56.557348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 1542
88.7%
평택시 7
 
0.4%
군포시 7
 
0.4%
양주시 7
 
0.4%
부천시 7
 
0.4%
고양시 7
 
0.4%
의정부시 6
 
0.3%
양평군 6
 
0.3%
하남시 6
 
0.3%
포천시 6
 
0.3%
Other values (23) 137
 
7.9%
Distinct243
Distinct (%)14.0%
Missing0
Missing (%)0.0%
Memory size13.7 KiB
2023-12-11T07:35:56.810885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length5
Mean length4.8924051
Min length4

Characters and Unicode

Total characters8503
Distinct characters133
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경기가평군
2nd row경기가평군
3rd row경기가평군
4th row경기가평군
5th row경기가평군
ValueCountFrequency (%)
경기고양시 14
 
0.8%
경기군포시 14
 
0.8%
경기부천시 14
 
0.8%
경기양주시 14
 
0.8%
경기평택시 14
 
0.8%
경기의정부시 13
 
0.7%
경기양평군 13
 
0.7%
경기안성시 13
 
0.7%
경기이천시 13
 
0.7%
경기포천시 13
 
0.7%
Other values (233) 1603
92.2%
2023-12-11T07:35:57.224729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
710
 
8.3%
687
 
8.1%
560
 
6.6%
519
 
6.1%
459
 
5.4%
422
 
5.0%
363
 
4.3%
284
 
3.3%
257
 
3.0%
252
 
3.0%
Other values (123) 3990
46.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8503
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
710
 
8.3%
687
 
8.1%
560
 
6.6%
519
 
6.1%
459
 
5.4%
422
 
5.0%
363
 
4.3%
284
 
3.3%
257
 
3.0%
252
 
3.0%
Other values (123) 3990
46.9%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8503
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
710
 
8.3%
687
 
8.1%
560
 
6.6%
519
 
6.1%
459
 
5.4%
422
 
5.0%
363
 
4.3%
284
 
3.3%
257
 
3.0%
252
 
3.0%
Other values (123) 3990
46.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8503
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
710
 
8.3%
687
 
8.1%
560
 
6.6%
519
 
6.1%
459
 
5.4%
422
 
5.0%
363
 
4.3%
284
 
3.3%
257
 
3.0%
252
 
3.0%
Other values (123) 3990
46.9%

세목명
Categorical

Distinct7
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size13.7 KiB
보전수입등및내부거래
275 
보조금
275 
지방교부세
275 
세외수입
275 
지방세수입
275 
Other values (2)
363 

Length

Max length10
Median length6
Mean length5.3423475
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row보전수입등및내부거래
2nd row보조금
3rd row조정교부금등
4th row지방교부세
5th row세외수입

Common Values

ValueCountFrequency (%)
보전수입등및내부거래 275
15.8%
보조금 275
15.8%
지방교부세 275
15.8%
세외수입 275
15.8%
지방세수입 275
15.8%
조정교부금등 257
14.8%
지방채 106
 
6.1%

Length

2023-12-11T07:35:57.345392image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:35:57.450387image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
보전수입등및내부거래 275
15.8%
보조금 275
15.8%
지방교부세 275
15.8%
세외수입 275
15.8%
지방세수입 275
15.8%
조정교부금등 257
14.8%
지방채 106
 
6.1%

예산순계액(원)
Real number (ℝ)

ZEROS 

Distinct1130
Distinct (%)65.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.9937045 × 1011
Minimum0
Maximum2.1276714 × 1013
Zeros585
Zeros (%)33.7%
Negative0
Negative (%)0.0%
Memory size15.4 KiB
2023-12-11T07:35:57.582294image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median2.6608216 × 1010
Q31.1758332 × 1011
95-th percentile4.860596 × 1011
Maximum2.1276714 × 1013
Range2.1276714 × 1013
Interquartile range (IQR)1.1758332 × 1011

Descriptive statistics

Standard deviation1.0189445 × 1012
Coefficient of variation (CV)5.1108097
Kurtosis221.31304
Mean1.9937045 × 1011
Median Absolute Deviation (MAD)2.6608216 × 1010
Skewness13.571105
Sum3.4650585 × 1014
Variance1.0382478 × 1024
MonotonicityNot monotonic
2023-12-11T07:35:57.937631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 585
33.7%
1000000000 5
 
0.3%
20000000000 4
 
0.2%
7000000000 3
 
0.2%
60000000000 3
 
0.2%
144751000000 2
 
0.1%
10000000000 2
 
0.1%
440000000000 2
 
0.1%
26400000000 2
 
0.1%
176942000000 2
 
0.1%
Other values (1120) 1128
64.9%
ValueCountFrequency (%)
0 585
33.7%
15488000 1
 
0.1%
235649000 1
 
0.1%
327000000 1
 
0.1%
1000000000 5
 
0.3%
1150000000 2
 
0.1%
1500000000 1
 
0.1%
2613643000 1
 
0.1%
2802171000 1
 
0.1%
2829060000 1
 
0.1%
ValueCountFrequency (%)
21276714000000 1
0.1%
17144600000000 1
0.1%
16024600000000 1
0.1%
14232411299000 1
0.1%
13100842060000 1
0.1%
7978921592000 1
0.1%
5902348500000 1
0.1%
5770693468000 1
0.1%
5282052772000 1
0.1%
5149522011000 1
0.1%

예산총계액(원)
Real number (ℝ)

ZEROS 

Distinct1615
Distinct (%)92.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.7695337 × 1011
Minimum0
Maximum2.3095574 × 1013
Zeros71
Zeros (%)4.1%
Negative0
Negative (%)0.0%
Memory size15.4 KiB
2023-12-11T07:35:58.054454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile4.4481342 × 109
Q12.7713777 × 1010
median8.7184551 × 1010
Q32.3398142 × 1011
95-th percentile7.911227 × 1011
Maximum2.3095574 × 1013
Range2.3095574 × 1013
Interquartile range (IQR)2.0626764 × 1011

Descriptive statistics

Standard deviation1.0555899 × 1012
Coefficient of variation (CV)3.8114355
Kurtosis224.72945
Mean2.7695337 × 1011
Median Absolute Deviation (MAD)7.045319 × 1010
Skewness13.443571
Sum4.8134495 × 1014
Variance1.11427 × 1024
MonotonicityNot monotonic
2023-12-11T07:35:58.172752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 71
 
4.1%
7000000000 7
 
0.4%
10000000000 6
 
0.3%
1000000000 5
 
0.3%
12000000000 5
 
0.3%
14000000000 5
 
0.3%
20000000000 5
 
0.3%
6000000000 3
 
0.2%
9000000000 3
 
0.2%
8000000000 3
 
0.2%
Other values (1605) 1625
93.5%
ValueCountFrequency (%)
0 71
4.1%
925901000 1
 
0.1%
1000000000 5
 
0.3%
1150000000 2
 
0.1%
1500000000 1
 
0.1%
2267319000 1
 
0.1%
2646000000 1
 
0.1%
2829060000 1
 
0.1%
3500000000 1
 
0.1%
3641000000 1
 
0.1%
ValueCountFrequency (%)
23095574000000 1
0.1%
17144600000000 1
0.1%
16024600000000 1
0.1%
14232411299000 1
0.1%
13100842060000 1
0.1%
7978921592000 1
0.1%
6790471843000 1
0.1%
5902348500000 1
0.1%
5770693468000 1
0.1%
5282052772000 1
0.1%

Interactions

2023-12-11T07:35:56.072037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:35:55.911474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:35:56.141339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:35:55.991188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T07:35:58.253712image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
회계연도시군명세목명예산순계액(원)예산총계액(원)
회계연도1.000NaN0.0000.0830.110
시군명NaN1.0000.0000.3910.655
세목명0.0000.0001.0000.1050.086
예산순계액(원)0.0830.3910.1051.0000.996
예산총계액(원)0.1100.6550.0860.9961.000
2023-12-11T07:35:58.337512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
세목명시군명회계연도
세목명1.0000.0000.000
시군명0.0001.0001.000
회계연도0.0001.0001.000
2023-12-11T07:35:58.410451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
예산순계액(원)예산총계액(원)회계연도시군명세목명
예산순계액(원)1.0000.4020.0630.1960.056
예산총계액(원)0.4021.0000.0820.4860.046
회계연도0.0630.0821.0001.0000.000
시군명0.1960.4861.0001.0000.000
세목명0.0560.0460.0000.0001.000

Missing values

2023-12-11T07:35:56.229772image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T07:35:56.310093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

회계연도시군명자치단체명세목명예산순계액(원)예산총계액(원)
02023가평군경기가평군보전수입등및내부거래837405200029897721000
12023가평군경기가평군보조금0182305990000
22023가평군경기가평군조정교부금등093301000000
32023가평군경기가평군지방교부세122824583000122824583000
42023가평군경기가평군세외수입3167458200034120945000
52023가평군경기가평군지방세수입7745600000077456000000
62023경기도경기본청보조금1423241129900014232411299000
72023경기도경기본청보전수입등및내부거래7672115590002192187932000
82023경기도경기본청지방교부세316213182000316213182000
92023경기도경기본청세외수입5321028350001045042584000
회계연도시군명자치단체명세목명예산순계액(원)예산총계액(원)
17282022<NA>강원홍천군지방세수입5693825500056938255000
17292022<NA>강원홍천군세외수입3145426300033378688000
17302022<NA>강원홍천군지방교부세363900000000363900000000
17312022<NA>강원홍천군조정교부금등08000000000
17322022<NA>강원홍천군보조금0239648981000
17332022<NA>강원홍천군보전수입등및내부거래1706209100045003789000
17342022<NA>강원횡성군지방세수입3455832500034558325000
17352022<NA>강원횡성군세외수입2894247500030462691000
17362022<NA>강원횡성군지방교부세258040000000258040000000
17372022<NA>강원횡성군조정교부금등012500000000