Overview

Dataset statistics

Number of variables8
Number of observations234
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory15.7 KiB
Average record size in memory68.6 B

Variable types

Categorical6
Numeric2

Dataset

Description지방세 과세를 위해 세원이 되는 과세 대상 유형별 부과된 현황 데이터를 제공하여 물건 유형에 세부담 수준의 형평성 검토 및 부동산 등 관련분야 규제정책 대상 확인 시 기초자료 활용
URLhttps://www.data.go.kr/data/15078680/fileData.do

Alerts

시도명 has constant value ""Constant
시군구명 has constant value ""Constant
자치단체코드 has constant value ""Constant
세목명 is highly overall correlated with 부과건수 and 1 other fieldsHigh correlation
세원 유형명 is highly overall correlated with 부과건수 and 1 other fieldsHigh correlation
부과건수 is highly overall correlated with 부과금액 and 2 other fieldsHigh correlation
부과금액 is highly overall correlated with 부과건수High correlation
부과건수 has 52 (22.2%) zerosZeros
부과금액 has 53 (22.6%) zerosZeros

Reproduction

Analysis started2023-12-12 03:43:09.623042
Analysis finished2023-12-12 03:43:10.823983
Duration1.2 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
전라남도
234 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row전라남도
2nd row전라남도
3rd row전라남도
4th row전라남도
5th row전라남도

Common Values

ValueCountFrequency (%)
전라남도 234
100.0%

Length

2023-12-12T12:43:10.924230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:43:11.054690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전라남도 234
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
무안군
234 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row무안군
2nd row무안군
3rd row무안군
4th row무안군
5th row무안군

Common Values

ValueCountFrequency (%)
무안군 234
100.0%

Length

2023-12-12T12:43:11.213158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:43:11.374921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
무안군 234
100.0%

자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
46840
234 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row46840
2nd row46840
3rd row46840
4th row46840
5th row46840

Common Values

ValueCountFrequency (%)
46840 234
100.0%

Length

2023-12-12T12:43:11.519330image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:43:11.630461image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
46840 234
100.0%

과세년도
Categorical

Distinct5
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2017
47 
2018
47 
2019
47 
2020
47 
2021
46 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2017
2nd row2017
3rd row2017
4th row2017
5th row2017

Common Values

ValueCountFrequency (%)
2017 47
20.1%
2018 47
20.1%
2019 47
20.1%
2020 47
20.1%
2021 46
19.7%

Length

2023-12-12T12:43:11.739724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:43:11.886056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2017 47
20.1%
2018 47
20.1%
2019 47
20.1%
2020 47
20.1%
2021 46
19.7%

세목명
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)5.6%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
취득세
45 
주민세
43 
자동차세
35 
재산세
25 
레저세
20 
Other values (8)
66 

Length

Max length7
Median length3
Mean length3.7008547
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row담배소비세
2nd row교육세
3rd row도시계획세
4th row취득세
5th row취득세

Common Values

ValueCountFrequency (%)
취득세 45
19.2%
주민세 43
18.4%
자동차세 35
15.0%
재산세 25
10.7%
레저세 20
8.5%
지방소득세 20
8.5%
지역자원시설세 11
 
4.7%
등록면허세 10
 
4.3%
담배소비세 5
 
2.1%
교육세 5
 
2.1%
Other values (3) 15
 
6.4%

Length

2023-12-12T12:43:12.068514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
취득세 45
19.2%
주민세 43
18.4%
자동차세 35
15.0%
재산세 25
10.7%
레저세 20
8.5%
지방소득세 20
8.5%
지역자원시설세 11
 
4.7%
등록면허세 10
 
4.3%
담배소비세 5
 
2.1%
교육세 5
 
2.1%
Other values (3) 15
 
6.4%

세원 유형명
Categorical

HIGH CORRELATION 

Distinct50
Distinct (%)21.4%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
담배소비세
 
5
승합
 
5
주택(개별)
 
5
3륜이하
 
5
기계장비
 
5
Other values (45)
209 

Length

Max length11
Median length8
Mean length6.0384615
Min length2

Unique

Unique3 ?
Unique (%)1.3%

Sample

1st row담배소비세
2nd row교육세
3rd row도시계획세
4th row건축물
5th row주택(단독)

Common Values

ValueCountFrequency (%)
담배소비세 5
 
2.1%
승합 5
 
2.1%
주택(개별) 5
 
2.1%
3륜이하 5
 
2.1%
기계장비 5
 
2.1%
차량 5
 
2.1%
선박 5
 
2.1%
토지 5
 
2.1%
재산세(건축물) 5
 
2.1%
주택(단독) 5
 
2.1%
Other values (40) 184
78.6%

Length

2023-12-12T12:43:12.245778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
담배소비세 5
 
2.1%
항공기 5
 
2.1%
주민세(종합소득 5
 
2.1%
승합 5
 
2.1%
교육세 5
 
2.1%
기타승용 5
 
2.1%
승용 5
 
2.1%
주민세(종업원분 5
 
2.1%
주민세(특별징수 5
 
2.1%
체납 5
 
2.1%
Other values (40) 184
78.6%

부과건수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct167
Distinct (%)71.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean12779.483
Minimum0
Maximum236215
Zeros52
Zeros (%)22.2%
Negative0
Negative (%)0.0%
Memory size2.2 KiB
2023-12-12T12:43:12.423173image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q17
median551.5
Q37789.75
95-th percentile55928.6
Maximum236215
Range236215
Interquartile range (IQR)7782.75

Descriptive statistics

Standard deviation34382.547
Coefficient of variation (CV)2.690449
Kurtosis24.627944
Mean12779.483
Median Absolute Deviation (MAD)551.5
Skewness4.7061003
Sum2990399
Variance1.1821595 × 109
MonotonicityNot monotonic
2023-12-12T12:43:12.664883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 52
 
22.2%
12 5
 
2.1%
36 4
 
1.7%
8 3
 
1.3%
1 3
 
1.3%
7 3
 
1.3%
276 2
 
0.9%
1184 2
 
0.9%
44 2
 
0.9%
312 1
 
0.4%
Other values (157) 157
67.1%
ValueCountFrequency (%)
0 52
22.2%
1 3
 
1.3%
2 1
 
0.4%
3 1
 
0.4%
6 1
 
0.4%
7 3
 
1.3%
8 3
 
1.3%
9 1
 
0.4%
11 1
 
0.4%
12 5
 
2.1%
ValueCountFrequency (%)
236215 1
0.4%
215219 1
0.4%
211489 1
0.4%
210609 1
0.4%
205371 1
0.4%
84356 1
0.4%
82725 1
0.4%
81375 1
0.4%
80294 1
0.4%
79524 1
0.4%

부과금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct182
Distinct (%)77.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.2944768 × 109
Minimum0
Maximum2.6084598 × 1010
Zeros53
Zeros (%)22.6%
Negative0
Negative (%)0.0%
Memory size2.2 KiB
2023-12-12T12:43:13.232944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11030000
median2.83004 × 108
Q33.4118022 × 109
95-th percentile9.2593036 × 109
Maximum2.6084598 × 1010
Range2.6084598 × 1010
Interquartile range (IQR)3.4107722 × 109

Descriptive statistics

Standard deviation3.77755 × 109
Coefficient of variation (CV)1.6463666
Kurtosis8.8455185
Mean2.2944768 × 109
Median Absolute Deviation (MAD)2.83004 × 108
Skewness2.5545405
Sum5.3690756 × 1011
Variance1.4269884 × 1019
MonotonicityNot monotonic
2023-12-12T12:43:13.451458image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 53
 
22.6%
5110073000 1
 
0.4%
5993635000 1
 
0.4%
4918000 1
 
0.4%
282245000 1
 
0.4%
7350641000 1
 
0.4%
5822000 1
 
0.4%
15739142000 1
 
0.4%
2985554000 1
 
0.4%
6395592000 1
 
0.4%
Other values (172) 172
73.5%
ValueCountFrequency (%)
0 53
22.6%
465000 1
 
0.4%
466000 1
 
0.4%
613000 1
 
0.4%
902000 1
 
0.4%
970000 1
 
0.4%
997000 1
 
0.4%
1129000 1
 
0.4%
1235000 1
 
0.4%
1526000 1
 
0.4%
ValueCountFrequency (%)
26084598000 1
0.4%
18008990000 1
0.4%
15739142000 1
0.4%
15292269000 1
0.4%
15016861000 1
0.4%
14518486000 1
0.4%
12482957000 1
0.4%
12166076000 1
0.4%
11443686000 1
0.4%
10763932000 1
0.4%

Interactions

2023-12-12T12:43:10.243464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:43:09.950099image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:43:10.368783image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:43:10.062443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T12:43:13.581887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과세년도세목명세원 유형명부과건수부과금액
과세년도1.0000.0000.0000.0000.000
세목명0.0001.0001.0000.8080.600
세원 유형명0.0001.0001.0000.9620.852
부과건수0.0000.8080.9621.0000.569
부과금액0.0000.6000.8520.5691.000
2023-12-12T12:43:13.717168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과세년도세목명세원 유형명
과세년도1.0000.0000.000
세목명0.0001.0000.912
세원 유형명0.0000.9121.000
2023-12-12T12:43:13.831826image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
부과건수부과금액과세년도세목명세원 유형명
부과건수1.0000.7850.0000.5620.727
부과금액0.7851.0000.0000.3200.476
과세년도0.0000.0001.0000.0000.000
세목명0.5620.3200.0001.0000.912
세원 유형명0.7270.4760.0000.9121.000

Missing values

2023-12-12T12:43:10.565276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T12:43:10.754762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명자치단체코드과세년도세목명세원 유형명부과건수부과금액
0전라남도무안군468402017담배소비세담배소비세1085110073000
1전라남도무안군468402017교육세교육세2106098462253000
2전라남도무안군468402017도시계획세도시계획세00
3전라남도무안군468402017취득세건축물11774800979000
4전라남도무안군468402017취득세주택(단독)11224913764000
5전라남도무안군468402017취득세주택(개별)14443922174000
6전라남도무안군468402017취득세기타2594402000
7전라남도무안군468402017취득세항공기1970000
8전라남도무안군468402017취득세기계장비242165569000
9전라남도무안군468402017취득세차량67755896014000
시도명시군구명자치단체코드과세년도세목명세원 유형명부과건수부과금액
224전라남도무안군468402021지방소득세지방소득세(특별징수)129276111531000
225전라남도무안군468402021지방소득세지방소득세(법인소득)17714641938000
226전라남도무안군468402021지방소득세지방소득세(양도소득)17921886314000
227전라남도무안군468402021지방소득세지방소득세(종합소득)102711676901000
228전라남도무안군468402021등록면허세등록면허세(면허)21550301324000
229전라남도무안군468402021등록면허세등록면허세(등록)258563090861000
230전라남도무안군468402021지역자원시설세지역자원시설세(소방)356462497466000
231전라남도무안군468402021지역자원시설세지역자원시설세(시설)00
232전라남도무안군468402021지역자원시설세지역자원시설세(특자)362173000
233전라남도무안군468402021체납체납392943063732000