Overview

Dataset statistics

Number of variables8
Number of observations230
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory15.4 KiB
Average record size in memory68.6 B

Variable types

Categorical6
Numeric2

Dataset

Description지방세 과세를 위해 세원이 되는 과세 대상 유형별 부과된 현황을 제공 (시도명, 시군구명, 자치단체코드, 과세년도, 세목명, 세원 유형명, 부과건수, 부과금액)
URLhttps://www.data.go.kr/data/15078681/fileData.do

Alerts

시도명 has constant value ""Constant
과세년도 has constant value ""Constant
자치단체코드 is highly overall correlated with 시군구명High correlation
세원 유형명 is highly overall correlated with 부과건수 and 1 other fieldsHigh correlation
세목명 is highly overall correlated with 세원 유형명High correlation
시군구명 is highly overall correlated with 자치단체코드High correlation
부과건수 is highly overall correlated with 부과금액 and 1 other fieldsHigh correlation
부과금액 is highly overall correlated with 부과건수High correlation
부과건수 has 61 (26.5%) zerosZeros
부과금액 has 61 (26.5%) zerosZeros

Reproduction

Analysis started2023-12-12 08:12:56.031877
Analysis finished2023-12-12 08:12:57.173855
Duration1.14 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
창원시
230 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row창원시
2nd row창원시
3rd row창원시
4th row창원시
5th row창원시

Common Values

ValueCountFrequency (%)
창원시 230
100.0%

Length

2023-12-12T17:12:57.261693image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:12:57.356688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
창원시 230
100.0%

시군구명
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
진해구
46 
마산회원구
46 
마산합포구
46 
성산구
46 
의창구
46 

Length

Max length5
Median length3
Mean length3.8
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row진해구
2nd row마산회원구
3rd row마산합포구
4th row성산구
5th row의창구

Common Values

ValueCountFrequency (%)
진해구 46
20.0%
마산회원구 46
20.0%
마산합포구 46
20.0%
성산구 46
20.0%
의창구 46
20.0%

Length

2023-12-12T17:12:57.463850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:12:57.585831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
진해구 46
20.0%
마산회원구 46
20.0%
마산합포구 46
20.0%
성산구 46
20.0%
의창구 46
20.0%

자치단체코드
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
48129
46 
48127
46 
48125
46 
48123
46 
48121
46 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row48129
2nd row48127
3rd row48125
4th row48123
5th row48121

Common Values

ValueCountFrequency (%)
48129 46
20.0%
48127 46
20.0%
48125 46
20.0%
48123 46
20.0%
48121 46
20.0%

Length

2023-12-12T17:12:57.719456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:12:57.825989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
48129 46
20.0%
48127 46
20.0%
48125 46
20.0%
48123 46
20.0%
48121 46
20.0%

과세년도
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2021
230 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021
2nd row2021
3rd row2021
4th row2021
5th row2021

Common Values

ValueCountFrequency (%)
2021 230
100.0%

Length

2023-12-12T17:12:57.965306image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:12:58.072639image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021 230
100.0%

세목명
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)5.7%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
취득세
45 
자동차세
35 
주민세
35 
재산세
25 
레저세
20 
Other values (8)
70 

Length

Max length7
Median length3
Mean length3.7826087
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row담배소비세
2nd row담배소비세
3rd row담배소비세
4th row담배소비세
5th row담배소비세

Common Values

ValueCountFrequency (%)
취득세 45
19.6%
자동차세 35
15.2%
주민세 35
15.2%
재산세 25
10.9%
레저세 20
8.7%
지방소득세 20
8.7%
지역자원시설세 15
 
6.5%
등록면허세 10
 
4.3%
담배소비세 5
 
2.2%
교육세 5
 
2.2%
Other values (3) 15
 
6.5%

Length

2023-12-12T17:12:58.201899image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
취득세 45
19.6%
자동차세 35
15.2%
주민세 35
15.2%
재산세 25
10.9%
레저세 20
8.7%
지방소득세 20
8.7%
지역자원시설세 15
 
6.5%
등록면허세 10
 
4.3%
담배소비세 5
 
2.2%
교육세 5
 
2.2%
Other values (3) 15
 
6.5%

세원 유형명
Categorical

HIGH CORRELATION 

Distinct46
Distinct (%)20.0%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
담배소비세
 
5
건축물
 
5
자동차세(주행)
 
5
도시계획세
 
5
주택(개별)
 
5
Other values (41)
205 

Length

Max length11
Median length8
Mean length6.0217391
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row담배소비세
2nd row담배소비세
3rd row담배소비세
4th row담배소비세
5th row담배소비세

Common Values

ValueCountFrequency (%)
담배소비세 5
 
2.2%
건축물 5
 
2.2%
자동차세(주행) 5
 
2.2%
도시계획세 5
 
2.2%
주택(개별) 5
 
2.2%
주택(단독) 5
 
2.2%
기타 5
 
2.2%
항공기 5
 
2.2%
기계장비 5
 
2.2%
차량 5
 
2.2%
Other values (36) 180
78.3%

Length

2023-12-12T17:12:58.373532image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
담배소비세 5
 
2.2%
등록면허세(면허 5
 
2.2%
주민세(종합소득 5
 
2.2%
승합 5
 
2.2%
기타승용 5
 
2.2%
승용 5
 
2.2%
지방소득세(특별징수 5
 
2.2%
지방소득세(법인소득 5
 
2.2%
지방소득세(양도소득 5
 
2.2%
지방소득세(종합소득 5
 
2.2%
Other values (36) 180
78.3%

부과건수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct159
Distinct (%)69.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean27342.748
Minimum0
Maximum513753
Zeros61
Zeros (%)26.5%
Negative0
Negative (%)0.0%
Memory size2.2 KiB
2023-12-12T17:12:58.549780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median1072.5
Q319758
95-th percentile138661.75
Maximum513753
Range513753
Interquartile range (IQR)19758

Descriptive statistics

Standard deviation71500.512
Coefficient of variation (CV)2.6149717
Kurtosis22.466214
Mean27342.748
Median Absolute Deviation (MAD)1072.5
Skewness4.4224512
Sum6288832
Variance5.1123232 × 109
MonotonicityNot monotonic
2023-12-12T17:12:58.755901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 61
26.5%
12 5
 
2.2%
2 4
 
1.7%
46 2
 
0.9%
31 2
 
0.9%
139 2
 
0.9%
1105 2
 
0.9%
51196 1
 
0.4%
23511 1
 
0.4%
1801 1
 
0.4%
Other values (149) 149
64.8%
ValueCountFrequency (%)
0 61
26.5%
2 4
 
1.7%
7 1
 
0.4%
9 1
 
0.4%
10 1
 
0.4%
12 5
 
2.2%
31 2
 
0.9%
46 2
 
0.9%
48 1
 
0.4%
59 1
 
0.4%
ValueCountFrequency (%)
513753 1
0.4%
480578 1
0.4%
405779 1
0.4%
362616 1
0.4%
360838 1
0.4%
228962 1
0.4%
209016 1
0.4%
189733 1
0.4%
157555 1
0.4%
149728 1
0.4%

부과금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct170
Distinct (%)73.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.5734273 × 109
Minimum0
Maximum7.1886729 × 1010
Zeros61
Zeros (%)26.5%
Negative0
Negative (%)0.0%
Memory size2.2 KiB
2023-12-12T17:12:58.917765image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median7.53599 × 108
Q39.265134 × 109
95-th percentile2.5770978 × 1010
Maximum7.1886729 × 1010
Range7.1886729 × 1010
Interquartile range (IQR)9.265134 × 109

Descriptive statistics

Standard deviation1.1518004 × 1010
Coefficient of variation (CV)1.7522068
Kurtosis11.136519
Mean6.5734273 × 109
Median Absolute Deviation (MAD)7.53599 × 108
Skewness2.9673726
Sum1.5118883 × 1012
Variance1.3266441 × 1020
MonotonicityNot monotonic
2023-12-12T17:12:59.122597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 61
 
26.5%
61090454000 1
 
0.4%
12758609000 1
 
0.4%
9125190000 1
 
0.4%
5512258000 1
 
0.4%
5486671000 1
 
0.4%
8651685000 1
 
0.4%
4291724000 1
 
0.4%
3939953000 1
 
0.4%
4852913000 1
 
0.4%
Other values (160) 160
69.6%
ValueCountFrequency (%)
0 61
26.5%
1202000 1
 
0.4%
2263000 1
 
0.4%
5669000 1
 
0.4%
6040000 1
 
0.4%
7211000 1
 
0.4%
7404000 1
 
0.4%
8017000 1
 
0.4%
8327000 1
 
0.4%
8607000 1
 
0.4%
ValueCountFrequency (%)
71886729000 1
0.4%
70106659000 1
0.4%
61090454000 1
0.4%
52014212000 1
0.4%
49133033000 1
0.4%
41225343000 1
0.4%
36123800000 1
0.4%
32236842000 1
0.4%
30853398000 1
0.4%
27838292000 1
0.4%

Interactions

2023-12-12T17:12:56.664563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:12:56.398570image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:12:56.765392image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:12:56.531541image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T17:12:59.245539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군구명자치단체코드세목명세원 유형명부과건수부과금액
시군구명1.0001.0000.0000.0000.0000.133
자치단체코드1.0001.0000.0000.0000.0000.133
세목명0.0000.0001.0001.0000.7560.448
세원 유형명0.0000.0001.0001.0000.8650.609
부과건수0.0000.0000.7560.8651.0000.623
부과금액0.1330.1330.4480.6090.6231.000
2023-12-12T17:12:59.410861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
자치단체코드세원 유형명세목명시군구명
자치단체코드1.0000.0000.0001.000
세원 유형명0.0001.0000.9210.000
세목명0.0000.9211.0000.000
시군구명1.0000.0000.0001.000
2023-12-12T17:12:59.543458image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
부과건수부과금액시군구명자치단체코드세목명세원 유형명
부과건수1.0000.8010.0000.0000.4730.519
부과금액0.8011.0000.0530.0530.2010.233
시군구명0.0000.0531.0001.0000.0000.000
자치단체코드0.0000.0531.0001.0000.0000.000
세목명0.4730.2010.0000.0001.0000.921
세원 유형명0.5190.2330.0000.0000.9211.000

Missing values

2023-12-12T17:12:56.903574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:12:57.094149image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명자치단체코드과세년도세목명세원 유형명부과건수부과금액
0창원시진해구481292021담배소비세담배소비세00
1창원시마산회원구481272021담배소비세담배소비세00
2창원시마산합포구481252021담배소비세담배소비세00
3창원시성산구481232021담배소비세담배소비세00
4창원시의창구481212021담배소비세담배소비세47870106659000
5창원시진해구481292021교육세교육세36261616189171000
6창원시마산회원구481272021교육세교육세36083811773295000
7창원시마산합포구481252021교육세교육세40577914810967000
8창원시성산구481232021교육세교육세48057826191967000
9창원시의창구481212021교육세교육세51375352014212000
시도명시군구명자치단체코드과세년도세목명세원 유형명부과건수부과금액
220창원시의창구481212021주민세주민세(종업원분)23624784854000
221창원시의창구481212021주민세주민세(특별징수)00
222창원시의창구481212021주민세주민세(법인세분)00
223창원시의창구481212021주민세주민세(양도소득)00
224창원시의창구481212021주민세주민세(종합소득)00
225창원시진해구481292021체납체납14972811577810000
226창원시마산회원구481272021체납체납13732810617056000
227창원시마산합포구481252021체납체납15755510727656000
228창원시성산구481232021체납체납12546214028150000
229창원시의창구481212021체납체납20901620346514000