Overview

Dataset statistics

Number of variables8
Number of observations234
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory15.7 KiB
Average record size in memory68.6 B

Variable types

Categorical6
Numeric2

Dataset

Description지방세 과세를 위해 세원이 되는 과세 대상 유형별 부과된 현황에 대한 데이터로 시도명, 시군구명, 자치단체코드, 과세년도, 세목명, 세원 유형명, 부과건수, 부과금액의 항목을 제공합니다.
Author경상남도 김해시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15079601

Alerts

시도명 has constant value ""Constant
시군구명 has constant value ""Constant
자치단체코드 has constant value ""Constant
세목명 is highly overall correlated with 부과건수 and 1 other fieldsHigh correlation
세원 유형명 is highly overall correlated with 부과건수 and 2 other fieldsHigh correlation
부과건수 is highly overall correlated with 부과금액 and 2 other fieldsHigh correlation
부과금액 is highly overall correlated with 부과건수 and 1 other fieldsHigh correlation
부과건수 has 44 (18.8%) zerosZeros
부과금액 has 44 (18.8%) zerosZeros

Reproduction

Analysis started2023-12-10 23:59:12.133950
Analysis finished2023-12-10 23:59:13.193447
Duration1.06 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
경상남도
234 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경상남도
2nd row경상남도
3rd row경상남도
4th row경상남도
5th row경상남도

Common Values

ValueCountFrequency (%)
경상남도 234
100.0%

Length

2023-12-11T08:59:13.288709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:59:13.401845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경상남도 234
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
김해시
234 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row김해시
2nd row김해시
3rd row김해시
4th row김해시
5th row김해시

Common Values

ValueCountFrequency (%)
김해시 234
100.0%

Length

2023-12-11T08:59:13.484913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:59:13.569700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
김해시 234
100.0%

자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
48250
234 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row48250
2nd row48250
3rd row48250
4th row48250
5th row48250

Common Values

ValueCountFrequency (%)
48250 234
100.0%

Length

2023-12-11T08:59:13.654877image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:59:13.737078image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
48250 234
100.0%

과세년도
Categorical

Distinct5
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2017
47 
2018
47 
2019
47 
2020
47 
2021
46 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2017
2nd row2017
3rd row2017
4th row2017
5th row2017

Common Values

ValueCountFrequency (%)
2017 47
20.1%
2018 47
20.1%
2019 47
20.1%
2020 47
20.1%
2021 46
19.7%

Length

2023-12-11T08:59:13.814853image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:59:13.906155image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2017 47
20.1%
2018 47
20.1%
2019 47
20.1%
2020 47
20.1%
2021 46
19.7%

세목명
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)5.6%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
취득세
45 
주민세
43 
자동차세
35 
재산세
25 
레저세
20 
Other values (8)
66 

Length

Max length7
Median length3
Mean length3.7008547
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row교육세
2nd row담배소비세
3rd row도시계획세
4th row등록면허세
5th row등록면허세

Common Values

ValueCountFrequency (%)
취득세 45
19.2%
주민세 43
18.4%
자동차세 35
15.0%
재산세 25
10.7%
레저세 20
8.5%
지방소득세 20
8.5%
지역자원시설세 11
 
4.7%
등록면허세 10
 
4.3%
교육세 5
 
2.1%
담배소비세 5
 
2.1%
Other values (3) 15
 
6.4%

Length

2023-12-11T08:59:14.017529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
취득세 45
19.2%
주민세 43
18.4%
자동차세 35
15.0%
재산세 25
10.7%
레저세 20
8.5%
지방소득세 20
8.5%
지역자원시설세 11
 
4.7%
등록면허세 10
 
4.3%
교육세 5
 
2.1%
담배소비세 5
 
2.1%
Other values (3) 15
 
6.4%

세원 유형명
Categorical

HIGH CORRELATION 

Distinct50
Distinct (%)21.4%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
교육세
 
5
주민세(양도소득)
 
5
승합
 
5
주민세(특별징수)
 
5
등록면허세(면허)
 
5
Other values (45)
209 

Length

Max length11
Median length8
Mean length6.0384615
Min length2

Unique

Unique3 ?
Unique (%)1.3%

Sample

1st row교육세
2nd row담배소비세
3rd row도시계획세
4th row등록면허세(면허)
5th row등록면허세(등록)

Common Values

ValueCountFrequency (%)
교육세 5
 
2.1%
주민세(양도소득) 5
 
2.1%
승합 5
 
2.1%
주민세(특별징수) 5
 
2.1%
등록면허세(면허) 5
 
2.1%
등록면허세(등록) 5
 
2.1%
소싸움 5
 
2.1%
경정 5
 
2.1%
경륜 5
 
2.1%
경마 5
 
2.1%
Other values (40) 184
78.6%

Length

2023-12-11T08:59:14.134301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
교육세 5
 
2.1%
도시계획세 5
 
2.1%
지역자원시설세(특자 5
 
2.1%
주민세(양도소득 5
 
2.1%
주민세(종합소득 5
 
2.1%
지방소득세(특별징수 5
 
2.1%
지방소득세(법인소득 5
 
2.1%
지방소득세(양도소득 5
 
2.1%
지방소득세(종합소득 5
 
2.1%
토지 5
 
2.1%
Other values (40) 184
78.6%

부과건수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct179
Distinct (%)76.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean71735.756
Minimum0
Maximum1117240
Zeros44
Zeros (%)18.8%
Negative0
Negative (%)0.0%
Memory size2.2 KiB
2023-12-11T08:59:14.272873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q124
median5140.5
Q350430.25
95-th percentile369870.9
Maximum1117240
Range1117240
Interquartile range (IQR)50406.25

Descriptive statistics

Standard deviation182720.08
Coefficient of variation (CV)2.547127
Kurtosis19.322653
Mean71735.756
Median Absolute Deviation (MAD)5140.5
Skewness4.1680257
Sum16786167
Variance3.3386629 × 1010
MonotonicityNot monotonic
2023-12-11T08:59:14.427441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 44
 
18.8%
12 8
 
3.4%
82 2
 
0.9%
406 2
 
0.9%
35 2
 
0.9%
28 2
 
0.9%
24 2
 
0.9%
51222 1
 
0.4%
7565 1
 
0.4%
3 1
 
0.4%
Other values (169) 169
72.2%
ValueCountFrequency (%)
0 44
18.8%
3 1
 
0.4%
6 1
 
0.4%
7 1
 
0.4%
8 1
 
0.4%
12 8
 
3.4%
13 1
 
0.4%
14 1
 
0.4%
24 2
 
0.9%
26 1
 
0.4%
ValueCountFrequency (%)
1117240 1
0.4%
1116419 1
0.4%
1100077 1
0.4%
1083627 1
0.4%
1035261 1
0.4%
528839 1
0.4%
524098 1
0.4%
514084 1
0.4%
502753 1
0.4%
500724 1
0.4%

부과금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct191
Distinct (%)81.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.7446268 × 1010
Minimum0
Maximum1.18305 × 1011
Zeros44
Zeros (%)18.8%
Negative0
Negative (%)0.0%
Memory size2.2 KiB
2023-12-11T08:59:14.559602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q125758000
median2.253034 × 109
Q33.3049033 × 1010
95-th percentile6.9405646 × 1010
Maximum1.18305 × 1011
Range1.18305 × 1011
Interquartile range (IQR)3.3023275 × 1010

Descriptive statistics

Standard deviation2.4512348 × 1010
Coefficient of variation (CV)1.4050196
Kurtosis2.4851091
Mean1.7446268 × 1010
Median Absolute Deviation (MAD)2.253034 × 109
Skewness1.6471071
Sum4.0824267 × 1012
Variance6.0085519 × 1020
MonotonicityNot monotonic
2023-12-11T08:59:14.954514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 44
 
18.8%
95188362000 1
 
0.4%
25032451000 1
 
0.4%
28868000 1
 
0.4%
69248666000 1
 
0.4%
80492000 1
 
0.4%
327085000 1
 
0.4%
9419687000 1
 
0.4%
33274673000 1
 
0.4%
67516384000 1
 
0.4%
Other values (181) 181
77.4%
ValueCountFrequency (%)
0 44
18.8%
1884000 1
 
0.4%
1985000 1
 
0.4%
2235000 1
 
0.4%
2405000 1
 
0.4%
3032000 1
 
0.4%
8421000 1
 
0.4%
10904000 1
 
0.4%
10926000 1
 
0.4%
12361000 1
 
0.4%
ValueCountFrequency (%)
118305000000 1
0.4%
108012000000 1
0.4%
101297000000 1
0.4%
96279146000 1
0.4%
95188362000 1
0.4%
90920016000 1
0.4%
79015140000 1
0.4%
78078256000 1
0.4%
75048659000 1
0.4%
73593779000 1
0.4%

Interactions

2023-12-11T08:59:12.692581image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:59:12.446351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:59:12.801750image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:59:12.548528image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T08:59:15.059263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과세년도세목명세원 유형명부과건수부과금액
과세년도1.0000.0000.0000.0000.000
세목명0.0001.0001.0000.8980.678
세원 유형명0.0001.0001.0000.9990.928
부과건수0.0000.8980.9991.0000.764
부과금액0.0000.6780.9280.7641.000
2023-12-11T08:59:15.169094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과세년도세목명세원 유형명
과세년도1.0000.0000.000
세목명0.0001.0000.912
세원 유형명0.0000.9121.000
2023-12-11T08:59:15.254056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
부과건수부과금액과세년도세목명세원 유형명
부과건수1.0000.7020.0000.7140.890
부과금액0.7021.0000.0000.3590.553
과세년도0.0000.0001.0000.0000.000
세목명0.7140.3590.0001.0000.912
세원 유형명0.8900.5530.0000.9121.000

Missing values

2023-12-11T08:59:12.952629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T08:59:13.130672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명자치단체코드과세년도세목명세원 유형명부과건수부과금액
0경상남도김해시482502017교육세교육세108362795188362000
1경상남도김해시482502017담배소비세담배소비세10839126063000
2경상남도김해시482502017도시계획세도시계획세00
3경상남도김해시482502017등록면허세등록면허세(면허)539471082403000
4경상남도김해시482502017등록면허세등록면허세(등록)12422717441352000
5경상남도김해시482502017레저세소싸움00
6경상남도김해시482502017레저세경정12723601000
7경상남도김해시482502017레저세경륜351795897000
8경상남도김해시482502017레저세경마2871723710000
9경상남도김해시482502017자동차세자동차세(주행)1235254897000
시도명시군구명자치단체코드과세년도세목명세원 유형명부과건수부과금액
224경상남도김해시482502021등록면허세등록면허세(면허)641571307256000
225경상남도김해시482502021등록면허세등록면허세(등록)12969616964756000
226경상남도김해시482502021지역자원시설세지역자원시설세(소방)27848316926263000
227경상남도김해시482502021지역자원시설세지역자원시설세(시설)00
228경상남도김해시482502021지역자원시설세지역자원시설세(특자)5082153966000
229경상남도김해시482502021지방소득세지방소득세(특별징수)12191539873276000
230경상남도김해시482502021지방소득세지방소득세(법인소득)867535951349000
231경상남도김해시482502021지방소득세지방소득세(양도소득)1031116822664000
232경상남도김해시482502021지방소득세지방소득세(종합소득)8483315272296000
233경상남도김해시482502021체납체납50072447263910000