Overview

Dataset statistics

Number of variables5
Number of observations3925
Missing cells0
Missing cells (%)0.0%
Duplicate rows229
Duplicate rows (%)5.8%
Total size in memory164.9 KiB
Average record size in memory43.0 B

Variable types

Categorical2
Text1
Numeric2

Dataset

Description부산광역시 사하구 재정정보공개 시스템 내 세입현황에 대한 데이터입니다. 세입과목, 세입전일누계, 금일수납, 합계 등을 제공합니다.
Author부산광역시 사하구
URLhttps://www.data.go.kr/data/15091655/fileData.do

Alerts

세입자료일련번호 has constant value ""Constant
Dataset has 229 (5.8%) duplicate rowsDuplicates
세입금일수납 has 3081 (78.5%) zerosZeros

Reproduction

Analysis started2023-12-12 12:57:38.139731
Analysis finished2023-12-12 12:57:39.008472
Duration0.87 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

세입자료일련번호
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size30.8 KiB
202000000000
3925 

Length

Max length12
Median length12
Mean length12
Min length12

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row202000000000
2nd row202000000000
3rd row202000000000
4th row202000000000
5th row202000000000

Common Values

ValueCountFrequency (%)
202000000000 3925
100.0%

Length

2023-12-12T21:57:39.072359image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:57:39.196310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
202000000000 3925
100.0%

세입과목
Categorical

Distinct7
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size30.8 KiB
일반회계(세외수입)
577 
주차장 특별회계(회계별총계)
577 
일반회계(보전수입등및내부거래)
577 
일반회계(지방세수입)
575 
일반회계(보조금)
575 
Other values (2)
1044 

Length

Max length16
Median length12
Mean length12.029045
Min length9

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반회계(지방세수입)
2nd row일반회계(세외수입)
3rd row일반회계(지방교부세)
4th row일반회계(보조금)
5th row주차장 특별회계(회계별총계)

Common Values

ValueCountFrequency (%)
일반회계(세외수입) 577
14.7%
주차장 특별회계(회계별총계) 577
14.7%
일반회계(보전수입등및내부거래) 577
14.7%
일반회계(지방세수입) 575
14.6%
일반회계(보조금) 575
14.6%
일반회계(조정교부금등) 573
14.6%
일반회계(지방교부세) 471
12.0%

Length

2023-12-12T21:57:39.352636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:57:39.504516image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반회계(세외수입 577
12.8%
주차장 577
12.8%
특별회계(회계별총계 577
12.8%
일반회계(보전수입등및내부거래 577
12.8%
일반회계(지방세수입 575
12.8%
일반회계(보조금 575
12.8%
일반회계(조정교부금등 573
12.7%
일반회계(지방교부세 471
10.5%
Distinct853
Distinct (%)21.7%
Missing0
Missing (%)0.0%
Memory size30.8 KiB
2023-12-12T21:57:39.761261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.889172
Min length2

Characters and Unicode

Total characters46665
Distinct characters13
Distinct categories4 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique608 ?
Unique (%)15.5%

Sample

1st row62953691130
2nd row21571248910
3rd row7832609160
4th row477358000000
5th row11476288200
ValueCountFrequency (%)
71863631000 290
 
7.4%
81643062290 290
 
7.4%
555331000000 290
 
7.4%
19315930000 290
 
7.4%
68104011660 277
 
7.1%
27468469820 277
 
7.1%
14782295330 276
 
7.0%
7832609160 81
 
2.1%
1629000000 44
 
1.1%
70454254000 40
 
1.0%
Other values (843) 1770
45.1%
2023-12-12T21:57:40.134602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 11247
24.1%
1 5134
11.0%
6 4718
10.1%
3 3980
 
8.5%
3924
 
8.4%
2 3391
 
7.3%
4 3217
 
6.9%
5 2977
 
6.4%
8 2939
 
6.3%
9 2892
 
6.2%
Other values (3) 2246
 
4.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 42739
91.6%
Space Separator 3924
 
8.4%
Open Punctuation 1
 
< 0.1%
Close Punctuation 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 11247
26.3%
1 5134
12.0%
6 4718
11.0%
3 3980
 
9.3%
2 3391
 
7.9%
4 3217
 
7.5%
5 2977
 
7.0%
8 2939
 
6.9%
9 2892
 
6.8%
7 2244
 
5.3%
Space Separator
ValueCountFrequency (%)
3924
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 46665
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 11247
24.1%
1 5134
11.0%
6 4718
10.1%
3 3980
 
8.5%
3924
 
8.4%
2 3391
 
7.3%
4 3217
 
6.9%
5 2977
 
6.4%
8 2939
 
6.3%
9 2892
 
6.2%
Other values (3) 2246
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 46665
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 11247
24.1%
1 5134
11.0%
6 4718
10.1%
3 3980
 
8.5%
3924
 
8.4%
2 3391
 
7.3%
4 3217
 
6.9%
5 2977
 
6.4%
8 2939
 
6.3%
9 2892
 
6.2%
Other values (3) 2246
 
4.8%

세입금일수납
Real number (ℝ)

ZEROS 

Distinct843
Distinct (%)21.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.8548267 × 108
Minimum0
Maximum3.7 × 1010
Zeros3081
Zeros (%)78.5%
Negative0
Negative (%)0.0%
Memory size34.6 KiB
2023-12-12T21:57:40.298732image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile3.459812 × 108
Maximum3.7 × 1010
Range3.7 × 1010
Interquartile range (IQR)0

Descriptive statistics

Standard deviation1.4991183 × 109
Coefficient of variation (CV)8.0822557
Kurtosis251.83334
Mean1.8548267 × 108
Median Absolute Deviation (MAD)0
Skewness14.333416
Sum7.2801947 × 1011
Variance2.2473558 × 1018
MonotonicityNot monotonic
2023-12-12T21:57:40.440034image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 3081
78.5%
200170 2
 
0.1%
19317000000 2
 
0.1%
39200880 1
 
< 0.1%
35780840 1
 
< 0.1%
51934710 1
 
< 0.1%
19165000 1
 
< 0.1%
10168110 1
 
< 0.1%
712052000 1
 
< 0.1%
39927910 1
 
< 0.1%
Other values (833) 833
 
21.2%
ValueCountFrequency (%)
0 3081
78.5%
80 1
 
< 0.1%
2710 1
 
< 0.1%
5220 1
 
< 0.1%
7170 1
 
< 0.1%
67040 1
 
< 0.1%
161090 1
 
< 0.1%
200170 2
 
0.1%
324400 1
 
< 0.1%
422990 1
 
< 0.1%
ValueCountFrequency (%)
37000000000 1
< 0.1%
32916274000 1
< 0.1%
26452500000 1
< 0.1%
25825223000 1
< 0.1%
22074211000 1
< 0.1%
20680244000 1
< 0.1%
19317000000 2
0.1%
18478620760 1
< 0.1%
16247777000 1
< 0.1%
15584000000 1
< 0.1%

세입합계
Real number (ℝ)

Distinct885
Distinct (%)22.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8.9695715 × 1010
Minimum141740
Maximum5.55331 × 1011
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size34.6 KiB
2023-12-12T21:57:40.910063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum141740
5-th percentile1.629 × 109
Q11.256157 × 1010
median2.746847 × 1010
Q37.1863631 × 1010
95-th percentile5.55331 × 1011
Maximum5.55331 × 1011
Range5.5533086 × 1011
Interquartile range (IQR)5.9302062 × 1010

Descriptive statistics

Standard deviation1.5006239 × 1011
Coefficient of variation (CV)1.6730162
Kurtosis4.5248365
Mean8.9695715 × 1010
Median Absolute Deviation (MAD)2.6004364 × 1010
Skewness2.4297171
Sum3.5205568 × 1014
Variance2.251872 × 1022
MonotonicityNot monotonic
2023-12-12T21:57:41.069608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
81643062290 290
 
7.4%
71863631000 290
 
7.4%
555331000000 290
 
7.4%
19315930000 290
 
7.4%
68104011660 278
 
7.1%
14782295330 278
 
7.1%
27468469820 278
 
7.1%
7832609160 82
 
2.1%
1629000000 44
 
1.1%
70454254000 40
 
1.0%
Other values (875) 1765
45.0%
ValueCountFrequency (%)
141740 1
 
< 0.1%
146960 4
 
0.1%
161090 1
 
< 0.1%
3209040 1
 
< 0.1%
15531590 1
 
< 0.1%
18280860 1
 
< 0.1%
30827970 3
 
0.1%
57792350 1
 
< 0.1%
129000000 17
0.4%
152789920 4
 
0.1%
ValueCountFrequency (%)
555331000000 290
7.4%
483253000000 1
 
< 0.1%
477358000000 3
 
0.1%
477238000000 1
 
< 0.1%
476286000000 1
 
< 0.1%
475196000000 1
 
< 0.1%
474280000000 4
 
0.1%
473528000000 1
 
< 0.1%
473027000000 1
 
< 0.1%
473021000000 1
 
< 0.1%

Interactions

2023-12-12T21:57:38.557440image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:57:38.337131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:57:38.686813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:57:38.456289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:57:41.180430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
세입과목세입금일수납세입합계
세입과목1.0000.1410.743
세입금일수납0.1411.0000.469
세입합계0.7430.4691.000
2023-12-12T21:57:41.277223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
세입금일수납세입합계세입과목
세입금일수납1.000-0.2050.072
세입합계-0.2051.0000.500
세입과목0.0720.5001.000

Missing values

2023-12-12T21:57:38.847106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:57:38.957445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

세입자료일련번호세입과목세입전일누계세입금일수납세입합계
0202000000000일반회계(지방세수입)629536911303920088062992892010
1202000000000일반회계(세외수입)21571248910747951021578728420
2202000000000일반회계(지방교부세)783260916007832609160
3202000000000일반회계(보조금)4773580000005894361000483253000000
4202000000000주차장 특별회계(회계별총계)114762882004655692011522845120
5202000000000일반회계(보전수입등및내부거래)65299139280333000065302469280
6202000000000일반회계(조정교부금등)71081638000071081638000
7202000000000일반회계(지방세수입)62953691130062953691130
8202000000000일반회계(세외수입)21571248910021571248910
9202000000000주차장 특별회계(회계별총계)11476288200011476288200
세입자료일련번호세입과목세입전일누계세입금일수납세입합계
3915202000000000일반회계(조정교부금등)71863631000071863631000
3916202000000000일반회계(지방교부세)19315930000019315930000
3917202000000000일반회계(세외수입)27015428580027015428580
3918202000000000일반회계(지방세수입)66829639270066829639270
3919202000000000주차장 특별회계(회계별총계)14559879640014559879640
3920202000000000일반회계(보전수입등및내부거래)81643062290081643062290
3921202000000000일반회계(보조금)5553310000000555331000000
3922202000000000일반회계(조정교부금등)71863631000071863631000
3923202000000000일반회계(지방교부세)19315930000019315930000
3924202000000000일반회계(세외수입)27015428580027015428580

Duplicate rows

Most frequently occurring

세입자료일련번호세입과목세입전일누계세입금일수납세입합계# duplicates
31202000000000일반회계(보전수입등및내부거래)81643062290081643062290290
68202000000000일반회계(보조금)5553310000000555331000000290
132202000000000일반회계(조정교부금등)71863631000071863631000290
137202000000000일반회계(지방교부세)19315930000019315930000290
101202000000000일반회계(세외수입)27468469820027468469820277
173202000000000일반회계(지방세수입)68104011660068104011660277
198202000000000주차장 특별회계(회계별총계)14782295330014782295330276
139202000000000일반회계(지방교부세)78326091600783260916081
123202000000000일반회계(조정교부금등)16290000000162900000043
130202000000000일반회계(조정교부금등)7045425400007045425400039