Overview

Dataset statistics

Number of variables: 8
Number of observations: 234
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 15.7 KiB
Average record size in memory: 68.6 B
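
The average record size follows from the total size divided by the number of observations. A minimal check, assuming 1 KiB = 1024 bytes; because the displayed total is rounded to one decimal, the result only approximates the reported 68.6 B:

```python
# Figures as displayed in the report (the total is already rounded).
total_kib = 15.7
n_rows = 234

# Convert KiB to bytes, then spread the total over all rows.
avg_bytes = total_kib * 1024 / n_rows
print(f"{avg_bytes:.1f} B per record")
```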

Variable types

Categorical: 6
Numeric: 2

Dataset

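Description: Provides the status of local tax assessments, broken down by the type of taxable object that serves as the tax source. Columns: 시도명 (province), 시군구명 (city/county/district), 자치단체코드 (local government code), 과세년도 (tax year), 세목명 (tax item), 세원 유형명 (tax source type), 부과건수 (number of assessments), 부과금액 (assessed amount).
URL: https://www.data.go.kr/data/15079247/fileData.do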

Alerts

시도명 has constant value "경기도" (Constant)
시군구명 has constant value "양주시" (Constant)
자치단체코드 has constant value "41630" (Constant)
세목명 is highly overall correlated with 부과건수 and 1 other field (High correlation)
세원 유형명 is highly overall correlated with 부과건수 and 1 other field (High correlation)
부과건수 is highly overall correlated with 부과금액 and 2 other fields (High correlation)
부과금액 is highly overall correlated with 부과건수 (High correlation)
부과건수 has 57 (24.4%) zeros (Zeros)
부과금액 has 58 (24.8%) zeros (Zeros)
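
The zero percentages in the alerts are the zero counts divided by the 234 observations. A small sketch of that arithmetic, using only the figures shown in this report:

```python
n_rows = 234
zeros = {"부과건수": 57, "부과금액": 58}  # zero counts taken from the alerts

# Share of rows that are zero, as a percentage rounded to one decimal.
zero_share = {col: round(100 * n / n_rows, 1) for col, n in zeros.items()}
print(zero_share)
```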

Reproduction

Analysis started: 2023-12-12 07:59:29.373908
Analysis finished: 2023-12-12 07:59:30.594509
Duration: 1.22 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
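
A report of this kind can be regenerated along the following lines. This is a sketch under stated assumptions, not the configuration actually used: the CSV path, the `cp949` encoding, the report title, and the choice to read the code and year columns as strings are all hypothetical.

```python
def build_report(csv_path: str, html_path: str = "report.html") -> None:
    """Profile a CSV export of the dataset and write an HTML report."""
    # Imported lazily so that defining the function does not require the libraries.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the file is CP949-encoded; try "utf-8" if decoding fails.
    # Assumption: code and year columns are read as strings so they are
    # profiled as categories rather than as numbers.
    df = pd.read_csv(
        csv_path,
        encoding="cp949",
        dtype={"자치단체코드": str, "과세년도": str},
    )
    report = ProfileReport(df, title="Local tax assessments by tax source type")
    report.to_file(html_path)
```

Calling `build_report("local_tax.csv")` (a hypothetical file name) would write `report.html` to the working directory.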

Variables

시도명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.0 KiB
경기도: 234

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 경기도
2nd row: 경기도
3rd row: 경기도
4th row: 경기도
5th row: 경기도

Common Values

Value | Count | Frequency (%)
경기도 | 234 | 100.0%

Length

Figure: Histogram of lengths of the category

Common Values (Plot)

Figure: plot of the common values

시군구명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.0 KiB
양주시: 234

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 양주시
2nd row: 양주시
3rd row: 양주시
4th row: 양주시
5th row: 양주시

Common Values

Value | Count | Frequency (%)
양주시 | 234 | 100.0%

Length

Figure: Histogram of lengths of the category

Common Values (Plot)

Figure: plot of the common values

자치단체코드
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.0 KiB
41630: 234

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 41630
2nd row: 41630
3rd row: 41630
4th row: 41630
5th row: 41630

Common Values

Value | Count | Frequency (%)
41630 | 234 | 100.0%

Length

Figure: Histogram of lengths of the category

Common Values (Plot)

Figure: plot of the common values

과세년도
Categorical

Distinct: 5
Distinct (%): 2.1%
Missing: 0
Missing (%): 0.0%
Memory size: 2.0 KiB
2017: 47
2018: 47
2019: 47
2020: 47
2021: 46

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2017
2nd row: 2017
3rd row: 2017
4th row: 2017
5th row: 2017

Common Values

Value | Count | Frequency (%)
2017 | 47 | 20.1%
2018 | 47 | 20.1%
2019 | 47 | 20.1%
2020 | 47 | 20.1%
2021 | 46 | 19.7%

Length

Figure: Histogram of lengths of the category

Common Values (Plot)

Figure: plot of the common values

세목명
Categorical

HIGH CORRELATION 

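Distinct: 13
Distinct (%): 5.6%
Missing: 0
Missing (%): 0.0%
Memory size: 2.0 KiB
취득세: 45
주민세: 43
자동차세: 35
재산세: 25
지방소득세: 20
Other values (8): 66

Length

Max length: 7
Median length: 3
Mean length: 3.7008547
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 지방소득세
2nd row: 지방소득세
3rd row: 지방소득세
4th row: 지방소득세
5th row: 교육세

Common Values

Value | Count | Frequency (%)
취득세 (acquisition tax) | 45 | 19.2%
주민세 (resident tax) | 43 | 18.4%
자동차세 (automobile tax) | 35 | 15.0%
재산세 (property tax) | 25 | 10.7%
지방소득세 (local income tax) | 20 | 8.5%
레저세 (leisure tax) | 20 | 8.5%
지역자원시설세 (local resource and facility tax) | 11 | 4.7%
등록면허세 (registration and license tax) | 10 | 4.3%
교육세 (education tax) | 5 | 2.1%
도시계획세 (urban planning tax) | 5 | 2.1%
Other values (3) | 15 | 6.4%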

Length

Figure: Histogram of lengths of the category

세원 유형명
Categorical

HIGH CORRELATION 

Distinct: 50
Distinct (%): 21.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.0 KiB
지방소득세(특별징수): 5
기타승용: 5
항공기: 5
특수: 5
지방소득세(종합소득): 5
Other values (45): 209

Length

Max length: 11
Median length: 8
Mean length: 6.0384615
Min length: 2

Unique

Unique: 3
Unique (%): 1.3%

Sample

1st row: 지방소득세(특별징수)
2nd row: 지방소득세(법인소득)
3rd row: 지방소득세(양도소득)
4th row: 지방소득세(종합소득)
5th row: 교육세

Common Values

Value | Count | Frequency (%)
지방소득세(특별징수) | 5 | 2.1%
기타승용 | 5 | 2.1%
항공기 | 5 | 2.1%
특수 | 5 | 2.1%
지방소득세(종합소득) | 5 | 2.1%
교육세 | 5 | 2.1%
도시계획세 | 5 | 2.1%
건축물 | 5 | 2.1%
주택(개별) | 5 | 2.1%
자동차세(주행) | 5 | 2.1%
Other values (40) | 184 | 78.6%

Length

Figure: Histogram of lengths of the category

Value | Count | Frequency (%)
지방소득세(특별징수 | 5 | 2.1%
지방소득세(양도소득 | 5 | 2.1%
지역자원시설세(특자 | 5 | 2.1%
기타승용 | 5 | 2.1%
지방소득세(법인소득 | 5 | 2.1%
승용 | 5 | 2.1%
지방소비세 | 5 | 2.1%
담배소비세 | 5 | 2.1%
등록면허세(면허 | 5 | 2.1%
체납 | 5 | 2.1%
Other values (40) | 184 | 78.6%

부과건수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 171
Distinct (%): 73.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 29937.59
Minimum: 0
Maximum: 492767
Zeros: 57
Zeros (%): 24.4%
Negative: 0
Negative (%): 0.0%
Memory size: 2.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 3
Median: 1937.5
Q3: 20331.75
95-th percentile: 130878.8
Maximum: 492767
Range: 492767
Interquartile range (IQR): 20328.75

Descriptive statistics

Standard deviation: 75674.639
Coefficient of variation (CV): 2.5277465
Kurtosis: 19.901144
Mean: 29937.59
Median Absolute Deviation (MAD): 1937.5
Skewness: 4.2270225
Sum: 7005396
Variance: 5.726651 × 10^9
Monotonicity: Not monotonic
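
Several of the 부과건수 statistics are derived from one another: the IQR is Q3 minus Q1, the coefficient of variation is the standard deviation divided by the mean, the variance is the squared standard deviation, and the mean is the sum divided by the 234 observations. A short sketch that recomputes them from the displayed (rounded) figures:

```python
# Figures as displayed in the report for 부과건수.
mean = 29937.59
std = 75674.639
q1, q3 = 3, 20331.75
total, n_rows = 7005396, 234

iqr = q3 - q1                    # interquartile range
cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance from the standard deviation
mean_from_sum = total / n_rows   # mean recomputed from the sum

print(iqr, round(cv, 4), f"{variance:.4e}", round(mean_from_sum, 2))
```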
Figure: Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
0 | 57 | 24.4%
12 | 5 | 2.1%
16 | 2 | 0.9%
3 | 2 | 0.9%
11 | 2 | 0.9%
275 | 1 | 0.4%
6 | 1 | 0.4%
574 | 1 | 0.4%
19562 | 1 | 0.4%
17 | 1 | 0.4%
Other values (161) | 161 | 68.8%

Minimum 10 values

Value | Count | Frequency (%)
0 | 57 | 24.4%
1 | 1 | 0.4%
3 | 2 | 0.9%
6 | 1 | 0.4%
7 | 1 | 0.4%
9 | 1 | 0.4%
11 | 2 | 0.9%
12 | 5 | 2.1%
15 | 1 | 0.4%
16 | 2 | 0.9%

Maximum 10 values

Value | Count | Frequency (%)
492767 | 1 | 0.4%
474126 | 1 | 0.4%
452505 | 1 | 0.4%
424421 | 1 | 0.4%
409376 | 1 | 0.4%
263046 | 1 | 0.4%
252483 | 1 | 0.4%
237769 | 1 | 0.4%
177927 | 1 | 0.4%
174034 | 1 | 0.4%

부과금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 176
Distinct (%): 75.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 8.015533 × 10^9
Minimum: -3376000
Maximum: 9.710272 × 10^10
Zeros: 58
Zeros (%): 24.8%
Negative: 1
Negative (%): 0.4%
Memory size: 2.2 KiB

Quantile statistics

Minimum: -3376000
5-th percentile: 0
Q1: 3750
Median: 8.117985 × 10^8
Q3: 1.1879837 × 10^10
95-th percentile: 4.0465678 × 10^10
Maximum: 9.710272 × 10^10
Range: 9.7106096 × 10^10
Interquartile range (IQR): 1.1879833 × 10^10

Descriptive statistics

Standard deviation: 1.3833132 × 10^10
Coefficient of variation (CV): 1.7257907
Kurtosis: 10.228162
Mean: 8.015533 × 10^9
Median Absolute Deviation (MAD): 8.117985 × 10^8
Skewness: 2.7927033
Sum: 1.8756347 × 10^12
Variance: 1.9135554 × 10^20
Monotonicity: Not monotonic
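
Because 부과금액 contains one negative value, its range is larger than its maximum. The sketch below recomputes the range from the exact extreme values listed in this report, and the mean from the displayed (rounded) sum, so the mean matches only to the precision shown:

```python
# Extreme values of 부과금액 as listed in the report.
minimum = -3_376_000
maximum = 97_102_720_000
total = 1.8756347e12   # sum as displayed (rounded to 8 significant digits)
n_rows = 234

value_range = maximum - minimum   # subtracting a negative minimum widens the range
mean_from_sum = total / n_rows

print(value_range, f"{mean_from_sum:.6e}")
```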
Figure: Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
0 | 58 | 24.8%
1893000 | 2 | 0.9%
9791390000 | 1 | 0.4%
569000 | 1 | 0.4%
33402455000 | 1 | 0.4%
1031532000 | 1 | 0.4%
714803000 | 1 | 0.4%
20699082000 | 1 | 0.4%
2811000 | 1 | 0.4%
72849694000 | 1 | 0.4%
Other values (166) | 166 | 70.9%

Minimum 10 values

Value | Count | Frequency (%)
-3376000 | 1 | 0.4%
0 | 58 | 24.8%
15000 | 1 | 0.4%
18000 | 1 | 0.4%
222000 | 1 | 0.4%
450000 | 1 | 0.4%
569000 | 1 | 0.4%
1229000 | 1 | 0.4%
1295000 | 1 | 0.4%
1893000 | 2 | 0.9%

Maximum 10 values

Value | Count | Frequency (%)
97102720000 | 1 | 0.4%
72849694000 | 1 | 0.4%
58197216000 | 1 | 0.4%
53556930000 | 1 | 0.4%
48100332000 | 1 | 0.4%
46945862000 | 1 | 0.4%
44806651000 | 1 | 0.4%
44018502000 | 1 | 0.4%
42938397000 | 1 | 0.4%
42151946000 | 1 | 0.4%

Interactions

Figures: four interaction plots

Correlations

Figure: correlation heatmap

 | 과세년도 | 세목명 | 세원 유형명 | 부과건수 | 부과금액
과세년도 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000
세목명 | 0.000 | 1.000 | 1.000 | 0.824 | 0.626
세원 유형명 | 0.000 | 1.000 | 1.000 | 0.919 | 0.852
부과건수 | 0.000 | 0.824 | 0.919 | 1.000 | 0.745
부과금액 | 0.000 | 0.626 | 0.852 | 0.745 | 1.000

Figure: correlation heatmap

 | 세목명 | 세원 유형명 | 과세년도
세목명 | 1.000 | 0.912 | 0.000
세원 유형명 | 0.912 | 1.000 | 0.000
과세년도 | 0.000 | 0.000 | 1.000

Figure: correlation heatmap

 | 부과건수 | 부과금액 | 과세년도 | 세목명 | 세원 유형명
부과건수 | 1.000 | 0.775 | 0.000 | 0.558 | 0.595
부과금액 | 0.775 | 1.000 | 0.000 | 0.341 | 0.477
과세년도 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000
세목명 | 0.558 | 0.341 | 0.000 | 1.000 | 0.912
세원 유형명 | 0.595 | 0.477 | 0.000 | 0.912 | 1.000

Missing values

Figure: A simple visualization of nullity by column.
Figure: Nullity matrix, a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

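Row | 시도명 | 시군구명 | 자치단체코드 | 과세년도 | 세목명 | 세원 유형명 | 부과건수부과금액
0 | 경기도 | 양주시 | 41630 | 2017 | 지방소득세 | 지방소득세(특별징수) | 377049791390000
1 | 경기도 | 양주시 | 41630 | 2017 | 지방소득세 | 지방소득세(법인소득) | 309012838360000
2 | 경기도 | 양주시 | 41630 | 2017 | 지방소득세 | 지방소득세(양도소득) | 29926843739000
3 | 경기도 | 양주시 | 41630 | 2017 | 지방소득세 | 지방소득세(종합소득) | 204775195568000
4 | 경기도 | 양주시 | 41630 | 2017 | 교육세 | 교육세 | 40937631089839000
5 | 경기도 | 양주시 | 41630 | 2017 | 도시계획세 | 도시계획세 | 00
6 | 경기도 | 양주시 | 41630 | 2017 | 취득세 | 건축물 | 271053556930000
7 | 경기도 | 양주시 | 41630 | 2017 | 취득세 | 주택(개별) | 10743589744000
8 | 경기도 | 양주시 | 41630 | 2017 | 취득세 | 주택(단독) | 627312761062000
9 | 경기도 | 양주시 | 41630 | 2017 | 취득세 | 기타 | 332882398000

Row | 시도명 | 시군구명 | 자치단체코드 | 과세년도 | 세목명 | 세원 유형명 | 부과건수부과금액
224 | 경기도 | 양주시 | 41630 | 2021 | 레저세 | 경륜 | 00
225 | 경기도 | 양주시 | 41630 | 2021 | 레저세 | 경마 | 00
226 | 경기도 | 양주시 | 41630 | 2021 | 주민세 | 주민세(사업소분) | 174722186131000
227 | 경기도 | 양주시 | 41630 | 2021 | 주민세 | 주민세(개인분) | 89162898331000
228 | 경기도 | 양주시 | 41630 | 2021 | 주민세 | 주민세(종업원분) | 16102563953000
229 | 경기도 | 양주시 | 41630 | 2021 | 주민세 | 주민세(특별징수) | 00
230 | 경기도 | 양주시 | 41630 | 2021 | 주민세 | 주민세(법인세분) | 00
231 | 경기도 | 양주시 | 41630 | 2021 | 주민세 | 주민세(양도소득) | 00
232 | 경기도 | 양주시 | 41630 | 2021 | 주민세 | 주민세(종합소득) | 00
233 | 경기도 | 양주시 | 41630 | 2021 | 체납 | 체납 | 17792717270627000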