Overview

Dataset statistics

Number of variables: 8
Number of observations: 228
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 15.3 KiB
Average record size in memory: 68.6 B

Variable types

Categorical: 6
Numeric: 2

Dataset

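Description: Local tax assessment data by tax source type for Gunwi-gun, Gyeongsangbuk-do (경상북도 군위군). For each tax year it provides the tax item name (세목명), tax source type name (세원 유형명), number of assessments (부과건수), and assessed amount (부과금액).
Author: 경상북도 군위군 (Gunwi-gun, Gyeongsangbuk-do)
URL: https://www.data.go.kr/data/15079109/fileData.do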

Alerts

시도명 has constant value "경상북도" (Constant)
시군구명 has constant value "군위군" (Constant)
자치단체코드 has constant value "47720" (Constant)
세원 유형명 is highly overall correlated with 부과건수 and 1 other field (High correlation)
세목명 is highly overall correlated with 부과건수 and 1 other field (High correlation)
부과건수 is highly overall correlated with 부과금액 and 2 other fields (High correlation)
부과금액 is highly overall correlated with 부과건수 (High correlation)
부과건수 has 58 (25.4%) zeros (Zeros)
부과금액 has 58 (25.4%) zeros (Zeros)
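
The Constant and Zeros alerts above come down to simple column checks. The sketch below is a minimal illustration, not ydata-profiling's own code: `constant_columns`, `zero_share` and `make_row` are hypothetical helper names, and the four rows are rows 0, 1, 6 and 9 of the Sample section, with the two numeric fields of row 0 split on the assumption that they are 75922 and 2430343000.

```python
def constant_columns(rows):
    """Return the columns whose value is identical in every row."""
    return [col for col in rows[0] if len({row[col] for row in rows}) == 1]


def zero_share(values):
    """Return the fraction of entries equal to zero."""
    return sum(1 for v in values if v == 0) / len(values)


def make_row(tax_item, source_type, count, amount):
    # Hypothetical helper: fills in the columns that do not vary in these rows.
    return {
        "시도명": "경상북도", "시군구명": "군위군", "자치단체코드": "47720",
        "과세년도": "2017", "세목명": tax_item, "세원 유형명": source_type,
        "부과건수": count, "부과금액": amount,
    }


rows = [
    make_row("교육세", "교육세", 75922, 2430343000),  # numeric split is an assumption
    make_row("도시계획세", "도시계획세", 0, 0),
    make_row("취득세", "항공기", 0, 0),
    make_row("취득세", "선박", 0, 0),
]

constant = constant_columns(rows)
zeros_in_subset = zero_share([row["부과건수"] for row in rows])

# The share reported in the alert: 58 zeros out of 228 observations.
reported_zero_pct = round(58 / 228 * 100, 1)
print(constant, zeros_in_subset, reported_zero_pct)
```

In this four-row subset 과세년도 also comes out constant, because all four rows are from 2017; over the full 228 rows only 시도명, 시군구명 and 자치단체코드 are flagged.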

Reproduction

Analysis started: 2023-12-12 16:28:51.504929
Analysis finished: 2023-12-12 16:28:52.831462
Duration: 1.33 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
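
A report of this kind can be regenerated with ydata-profiling. The following is a minimal sketch under assumptions: `build_report` is a hypothetical helper name, the `cp949` encoding is a guess (the report does not state the file encoding), and the installed library version may differ from v4.5.1, so the output need not match this report exactly.

```python
def build_report(csv_path, out_path="report.html", encoding="cp949"):
    """Profile a CSV file and write the HTML report (sketch only)."""
    # Imported lazily so the helper can be defined without the libraries installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Read the raw file; the encoding default is an assumption, not a documented fact.
    df = pd.read_csv(csv_path, encoding=encoding)

    # Build the profile and save it as a standalone HTML page.
    report = ProfileReport(df, title="Gunwi-gun local tax assessment by source type")
    report.to_file(out_path)
    return report
```

It would be called as `build_report("local_tax.csv")`, where the file name is a placeholder for the CSV downloaded from the URL in the Dataset section.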

Variables

시도명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 경상북도
2nd row: 경상북도
3rd row: 경상북도
4th row: 경상북도
5th row: 경상북도

Common Values

Value | Count | Frequency (%)
경상북도 | 228 | 100.0%

Length

[Figure: histogram of lengths of the category]

시군구명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 군위군
2nd row: 군위군
3rd row: 군위군
4th row: 군위군
5th row: 군위군

Common Values

Value | Count | Frequency (%)
군위군 | 228 | 100.0%

Length

[Figure: histogram of lengths of the category]

자치단체코드
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 47720
2nd row: 47720
3rd row: 47720
4th row: 47720
5th row: 47720

Common Values

Value | Count | Frequency (%)
47720 | 228 | 100.0%

Length

[Figure: histogram of lengths of the category]

과세년도
Categorical

Distinct: 5
Distinct (%): 2.2%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2017
2nd row: 2017
3rd row: 2017
4th row: 2017
5th row: 2017

Common Values

Value | Count | Frequency (%)
2017 | 47 | 20.6%
2018 | 47 | 20.6%
2020 | 47 | 20.6%
2021 | 46 | 20.2%
2019 | 41 | 18.0%

Length

[Figure: histogram of lengths of the category]
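
The Frequency (%) figures for 과세년도 are each count divided by the 228 observations, and Distinct (%) is the number of distinct values divided by the same total. A minimal sketch of that arithmetic, using only the counts reported above (the variable names are illustrative):

```python
# Row counts per 과세년도 as reported in the Common Values table.
counts = {"2017": 47, "2018": 47, "2020": 47, "2021": 46, "2019": 41}

# The counts should add up to the number of observations.
total = sum(counts.values())

# Share of rows per year, rounded to one decimal as in the report.
frequency_pct = {year: round(n / total * 100, 1) for year, n in counts.items()}

# Distinct (%) is the number of distinct years over the number of rows.
distinct_pct = round(len(counts) / total * 100, 1)

for year, pct in frequency_pct.items():
    print(f"{year}: {counts[year]} rows, {pct}%")
print(f"Distinct (%): {distinct_pct}%")
```

The counts also show that 2019 has six fewer rows than 2017, 2018 and 2020, which may be worth checking against the source file before comparing years.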

세목명
Categorical

HIGH CORRELATION 

Distinct: 13
Distinct (%): 5.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB

Length

Max length: 7
Median length: 3
Mean length: 3.7017544
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 교육세
2nd row: 도시계획세
3rd row: 취득세
4th row: 취득세
5th row: 취득세

Common Values

Value | Count | Frequency (%)
취득세 | 45 | 19.7%
주민세 | 43 | 18.9%
자동차세 | 35 | 15.4%
재산세 | 25 | 11.0%
지방소득세 | 20 | 8.8%
레저세 | 16 | 7.0%
지역자원시설세 | 11 | 4.8%
등록면허세 | 10 | 4.4%
교육세 | 5 | 2.2%
담배소비세 | 5 | 2.2%
Other values (3) | 13 | 5.7%

Length

[Figure: histogram of lengths of the category]

세원 유형명
Categorical

HIGH CORRELATION 

Distinct: 50
Distinct (%): 21.9%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB

Length

Max length: 11
Median length: 8
Mean length: 6.1140351
Min length: 2

Unique

Unique: 3
Unique (%): 1.3%

Sample

1st row: 교육세
2nd row: 도시계획세
3rd row: 건축물
4th row: 주택(개별)
5th row: 주택(단독)

Common Values

Value | Count | Frequency (%)
교육세 | 5 | 2.2%
재산세(토지) | 5 | 2.2%
기타승용 | 5 | 2.2%
자동차세(주행) | 5 | 2.2%
선박 | 5 | 2.2%
재산세(선박) | 5 | 2.2%
지방소득세(특별징수) | 5 | 2.2%
재산세(항공기) | 5 | 2.2%
주택(개별) | 5 | 2.2%
재산세(주택) | 5 | 2.2%
Other values (40) | 178 | 78.1%

Length

[Figure: histogram of lengths of the category]

Value | Count | Frequency (%)
교육세 | 5 | 2.2%
주민세(양도소득 | 5 | 2.2%
주민세(종합소득 | 5 | 2.2%
화물 | 5 | 2.2%
지역자원시설세(특자 | 5 | 2.2%
승용 | 5 | 2.2%
재산세(토지 | 5 | 2.2%
주민세(특별징수 | 5 | 2.2%
주민세(법인세분 | 5 | 2.2%
승합 | 5 | 2.2%
Other values (40) | 178 | 78.1%

부과건수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 150
Distinct (%): 65.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4930.7368
Minimum: 0
Maximum: 82404
Zeros: 58
Zeros (%): 25.4%
Negative: 0
Negative (%): 0.0%
Memory size: 2.1 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 283.5
Q3: 3325.25
95-th percentile: 27331.6
Maximum: 82404
Range: 82404
Interquartile range (IQR): 3325.25
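
Q1 and the 5-th percentile are both 0 because 58 of the 228 values (25.4%) are zero, so the lowest quarter of the sorted data consists entirely of zeros. The sketch below illustrates this with linear-interpolation quantiles; that interpolation rule is an assumption about how the report was computed, `quantile` is a hypothetical helper, and the 170 nonzero values are placeholders rather than the real data.

```python
def quantile(sorted_values, q):
    """Quantile of pre-sorted data, interpolating linearly between ranks."""
    pos = q * (len(sorted_values) - 1)
    lower = int(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] * (1 - frac) + sorted_values[upper] * frac


# 58 zeros as reported; the remaining 170 values are made-up placeholders.
values = [0] * 58 + list(range(1, 171))

p5 = quantile(values, 0.05)      # rank 11.35 falls among the zeros
q1 = quantile(values, 0.25)      # rank 56.75 still falls among the zeros
median = quantile(values, 0.50)  # rank 113.5 lies past the zeros

print(p5, q1, median > 0)
```

Since Q1 is 0, the interquartile range equals Q3 (3325.25), as the table shows.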

Descriptive statistics

Standard deviation: 13446.364
Coefficient of variation (CV): 2.7270496
Kurtosis: 19.574273
Mean: 4930.7368
Median Absolute Deviation (MAD): 283.5
Skewness: 4.2906859
Sum: 1124208
Variance: 1.808047 × 10^8
Monotonicity: Not monotonic
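
Several of the descriptive statistics for 부과건수 are linked by simple identities: the mean is the sum divided by the number of observations, the CV is the standard deviation divided by the mean, and the variance is the standard deviation squared. The following is a quick consistency check on the reported figures; because the inputs are rounded, agreement is only approximate and the tolerance is an arbitrary choice.

```python
from math import isclose

n = 228                # Number of observations
total = 1124208        # Sum
mean = 4930.7368       # Mean
std = 13446.364        # Standard deviation
cv = 2.7270496         # Coefficient of variation (CV)
variance = 1.808047e8  # Variance
q1, q3 = 0, 3325.25    # Q1 and Q3

# Each check compares a derived value with the figure printed in the report.
mean_ok = isclose(total / n, mean, rel_tol=1e-6)
cv_ok = isclose(std / mean, cv, rel_tol=1e-6)
variance_ok = isclose(std ** 2, variance, rel_tol=1e-6)
iqr = q3 - q1

print(mean_ok, cv_ok, variance_ok, iqr)
```

A CV well above 1 together with a median (283.5) far below the mean (4930.7368) is consistent with the strong right skew reported for this column.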

[Figure: histogram with fixed size bins (bins=50)]

Value | Count | Frequency (%)
0 | 58 | 25.4%
1 | 6 | 2.6%
12 | 5 | 2.2%
73 | 3 | 1.3%
74 | 3 | 1.3%
3 | 3 | 1.3%
6 | 2 | 0.9%
692 | 2 | 0.9%
507 | 2 | 0.9%
12566 | 2 | 0.9%
Other values (140) | 142 | 62.3%

Value | Count | Frequency (%)
0 | 58 | 25.4%
1 | 6 | 2.6%
3 | 3 | 1.3%
4 | 1 | 0.4%
6 | 2 | 0.9%
7 | 1 | 0.4%
8 | 1 | 0.4%
12 | 5 | 2.2%
16 | 1 | 0.4%
33 | 1 | 0.4%

Value | Count | Frequency (%)
82404 | 1 | 0.4%
80770 | 1 | 0.4%
78403 | 1 | 0.4%
78055 | 1 | 0.4%
75922 | 1 | 0.4%
44789 | 1 | 0.4%
43377 | 1 | 0.4%
42530 | 1 | 0.4%
41548 | 1 | 0.4%
40457 | 1 | 0.4%

부과금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 170
Distinct (%): 74.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6.1741937 × 10^8
Minimum: 0
Maximum: 5.887044 × 10^9
Zeros: 58
Zeros (%): 25.4%
Negative: 0
Negative (%): 0.0%
Memory size: 2.1 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 1.27656 × 10^8
Q3: 7.6942875 × 10^8
95-th percentile: 2.4370094 × 10^9
Maximum: 5.887044 × 10^9
Range: 5.887044 × 10^9
Interquartile range (IQR): 7.6942875 × 10^8

Descriptive statistics

Standard deviation: 1.0576264 × 10^9
Coefficient of variation (CV): 1.7129789
Kurtosis: 8.8064943
Mean: 6.1741937 × 10^8
Median Absolute Deviation (MAD): 1.27656 × 10^8
Skewness: 2.7293742
Sum: 1.4077162 × 10^11
Variance: 1.1185735 × 10^18
Monotonicity: Not monotonic

[Figure: histogram with fixed size bins (bins=50)]

Value | Count | Frequency (%)
0 | 58 | 25.4%
22000 | 2 | 0.9%
1039371000 | 1 | 0.4%
1434617000 | 1 | 0.4%
5472273000 | 1 | 0.4%
330455000 | 1 | 0.4%
2116619000 | 1 | 0.4%
19000 | 1 | 0.4%
502238000 | 1 | 0.4%
1659595000 | 1 | 0.4%
Other values (160) | 160 | 70.2%

Value | Count | Frequency (%)
0 | 58 | 25.4%
16000 | 1 | 0.4%
19000 | 1 | 0.4%
22000 | 2 | 0.9%
25000 | 1 | 0.4%
61000 | 1 | 0.4%
201000 | 1 | 0.4%
857000 | 1 | 0.4%
895000 | 1 | 0.4%
974000 | 1 | 0.4%

Value | Count | Frequency (%)
5887044000 | 1 | 0.4%
5507900000 | 1 | 0.4%
5497770000 | 1 | 0.4%
5472273000 | 1 | 0.4%
5162812000 | 1 | 0.4%
4087956000 | 1 | 0.4%
3328336000 | 1 | 0.4%
2779497000 | 1 | 0.4%
2718154000 | 1 | 0.4%
2628567000 | 1 | 0.4%

Interactions

[Figure: interaction plots]

Correlations

Variable | 과세년도 | 세목명 | 세원 유형명 | 부과건수 | 부과금액
과세년도 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000
세목명 | 0.000 | 1.000 | 1.000 | 0.835 | 0.684
세원 유형명 | 0.000 | 1.000 | 1.000 | 0.951 | 0.867
부과건수 | 0.000 | 0.835 | 0.951 | 1.000 | 0.588
부과금액 | 0.000 | 0.684 | 0.867 | 0.588 | 1.000

Variable | 세원 유형명 | 세목명 | 과세년도
세원 유형명 | 1.000 | 0.910 | 0.000
세목명 | 0.910 | 1.000 | 0.000
과세년도 | 0.000 | 0.000 | 1.000

Variable | 부과건수 | 부과금액 | 과세년도 | 세목명 | 세원 유형명
부과건수 | 1.000 | 0.780 | 0.000 | 0.579 | 0.680
부과금액 | 0.780 | 1.000 | 0.000 | 0.375 | 0.482
과세년도 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000
세목명 | 0.579 | 0.375 | 0.000 | 1.000 | 0.910
세원 유형명 | 0.680 | 0.482 | 0.000 | 0.910 | 1.000
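
For pairs of categorical variables such as 세목명 and 세원 유형명, association is often measured with Cramér's V, which rescales the chi-squared statistic of the contingency table to the range 0 to 1. Whether any of the matrices above was computed this way is an assumption (profiling tools may also apply a bias correction). The sketch below implements the plain, uncorrected formula on hypothetical toy data; `cramers_v` is an illustrative name.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)

    # Chi-squared statistic: compare observed cell counts with the counts
    # expected if the two variables were independent.
    chi2 = 0.0
    for r, r_count in row_totals.items():
        for c, c_count in col_totals.items():
            expected = r_count * c_count / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(row_totals), len(col_totals)) - 1
    if k == 0:
        return 0.0  # a constant variable carries no association
    return sqrt(chi2 / (n * k))


# Toy data: the second variable is fully determined by the first ...
perfect = cramers_v(["a", "a", "b", "b"], ["x", "x", "y", "y"])
# ... versus a pair in which every combination is equally frequent.
independent = cramers_v(["a", "a", "b", "b"], ["x", "y", "x", "y"])
print(perfect, independent)
```

A value near 1 between 세목명 and 세원 유형명 is what one would expect if each tax source type belongs to essentially one tax item, since knowing the source type then nearly determines the tax item.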

Missing values

[Figure: a simple visualization of nullity by column]
[Figure: nullity matrix, a data-dense display which lets you quickly visually pick out patterns in data completion]

Sample

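Row | 시도명 | 시군구명 | 자치단체코드 | 과세년도 | 세목명 | 세원 유형명 | 부과건수부과금액 (digits not separated)
0 | 경상북도 | 군위군 | 47720 | 2017 | 교육세 | 교육세 | 759222430343000
1 | 경상북도 | 군위군 | 47720 | 2017 | 도시계획세 | 도시계획세 | 00
2 | 경상북도 | 군위군 | 47720 | 2017 | 취득세 | 건축물 | 507923134000
3 | 경상북도 | 군위군 | 47720 | 2017 | 취득세 | 주택(개별) | 722584015000
4 | 경상북도 | 군위군 | 47720 | 2017 | 취득세 | 주택(단독) | 73197739000
5 | 경상북도 | 군위군 | 47720 | 2017 | 취득세 | 기타 | 67328000
6 | 경상북도 | 군위군 | 47720 | 2017 | 취득세 | 항공기 | 00
7 | 경상북도 | 군위군 | 47720 | 2017 | 취득세 | 기계장비 | 285291806000
8 | 경상북도 | 군위군 | 47720 | 2017 | 취득세 | 차량 | 20731611255000
9 | 경상북도 | 군위군 | 47720 | 2017 | 취득세 | 선박 | 00

Row | 시도명 | 시군구명 | 자치단체코드 | 과세년도 | 세목명 | 세원 유형명 | 부과건수부과금액 (digits not separated)
218 | 경상북도 | 군위군 | 47720 | 2021 | 지방소득세 | 지방소득세(양도소득) | 507724138000
219 | 경상북도 | 군위군 | 47720 | 2021 | 지방소득세 | 지방소득세(종합소득) | 2327492566000
220 | 경상북도 | 군위군 | 47720 | 2021 | 등록면허세 | 등록면허세(면허) | 6452105565000
221 | 경상북도 | 군위군 | 47720 | 2021 | 등록면허세 | 등록면허세(등록) | 7706750080000
222 | 경상북도 | 군위군 | 47720 | 2021 | 지역자원시설세 | 지역자원시설세(소방) | 5527334006000
223 | 경상북도 | 군위군 | 47720 | 2021 | 지역자원시설세 | 지역자원시설세(시설) | 00
224 | 경상북도 | 군위군 | 47720 | 2021 | 지역자원시설세 | 지역자원시설세(특자) | 7322875000
225 | 경상북도 | 군위군 | 47720 | 2021 | 지방소비세 | 지방소비세 | 75497770000
226 | 경상북도 | 군위군 | 47720 | 2021 | 담배소비세 | 담배소비세 | 4741786165000
227 | 경상북도 | 군위군 | 47720 | 2021 | 체납 | 체납 | 222381055879000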