Overview

Dataset statistics

Number of variables8
Number of observations231
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory15.5 KiB
Average record size in memory68.6 B

Variable types

Categorical6
Numeric2

Dataset

Description대구광역시 서구_세원유형별과세현황_20221231
Author대구광역시 서구
URLhttp://data.daegu.go.kr/open/data/dataView.do?dataSetId=15078529&dataSetDetailId=150785291f1c02e11e433&provdMethod=FILE

Alerts

시도명 has constant value ""Constant
시군구명 has constant value ""Constant
자치단체코드 has constant value ""Constant
세목명 is highly overall correlated with 부과건수 and 1 other fieldsHigh correlation
세원 유형명 is highly overall correlated with 부과건수 and 2 other fieldsHigh correlation
부과건수 is highly overall correlated with 부과금액 and 2 other fieldsHigh correlation
부과금액 is highly overall correlated with 부과건수 and 1 other fieldsHigh correlation
부과건수 has 63 (27.3%) zerosZeros
부과금액 has 63 (27.3%) zerosZeros

Reproduction

Analysis started2023-12-10 20:08:08.696524
Analysis finished2023-12-10 20:08:09.888361
Duration1.19 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
대구광역시
231 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대구광역시
2nd row대구광역시
3rd row대구광역시
4th row대구광역시
5th row대구광역시

Common Values

ValueCountFrequency (%)
대구광역시 231
100.0%

Length

2023-12-11T05:08:09.967509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T05:08:10.089502image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대구광역시 231
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
서구
231 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서구
2nd row서구
3rd row서구
4th row서구
5th row서구

Common Values

ValueCountFrequency (%)
서구 231
100.0%

Length

2023-12-11T05:08:10.215692image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T05:08:10.344116image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서구 231
100.0%

자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
27170
231 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row27170
2nd row27170
3rd row27170
4th row27170
5th row27170

Common Values

ValueCountFrequency (%)
27170 231
100.0%

Length

2023-12-11T05:08:10.478939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T05:08:10.615819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
27170 231
100.0%

과세년도
Categorical

Distinct5
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2018
47 
2019
47 
2021
46 
2022
46 
2020
45 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2018
2nd row2018
3rd row2018
4th row2018
5th row2018

Common Values

ValueCountFrequency (%)
2018 47
20.3%
2019 47
20.3%
2021 46
19.9%
2022 46
19.9%
2020 45
19.5%

Length

2023-12-11T05:08:10.780473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T05:08:10.929835image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2018 47
20.3%
2019 47
20.3%
2021 46
19.9%
2022 46
19.9%
2020 45
19.5%

세목명
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)5.6%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
취득세
45 
주민세
41 
자동차세
35 
재산세
25 
레저세
20 
Other values (8)
65 

Length

Max length7
Median length3
Mean length3.7099567
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row담배소비세
2nd row교육세
3rd row도시계획세
4th row취득세
5th row취득세

Common Values

ValueCountFrequency (%)
취득세 45
19.5%
주민세 41
17.7%
자동차세 35
15.2%
재산세 25
10.8%
레저세 20
8.7%
지방소득세 20
8.7%
지역자원시설세 12
 
5.2%
등록면허세 10
 
4.3%
교육세 5
 
2.2%
지방소비세 5
 
2.2%
Other values (3) 13
 
5.6%

Length

2023-12-11T05:08:11.119048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
취득세 45
19.5%
주민세 41
17.7%
자동차세 35
15.2%
재산세 25
10.8%
레저세 20
8.7%
지방소득세 20
8.7%
지역자원시설세 12
 
5.2%
등록면허세 10
 
4.3%
교육세 5
 
2.2%
지방소비세 5
 
2.2%
Other values (3) 13
 
5.6%

세원 유형명
Categorical

HIGH CORRELATION 

Distinct50
Distinct (%)21.6%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
승합
 
5
선박
 
5
체납
 
5
건축물
 
5
자동차세(주행)
 
5
Other values (45)
206 

Length

Max length11
Median length8
Mean length6.04329
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row담배소비세
2nd row교육세
3rd row도시계획세
4th row건축물
5th row주택(개별)

Common Values

ValueCountFrequency (%)
승합 5
 
2.2%
선박 5
 
2.2%
체납 5
 
2.2%
건축물 5
 
2.2%
자동차세(주행) 5
 
2.2%
주택(단독) 5
 
2.2%
기타 5
 
2.2%
항공기 5
 
2.2%
기계장비 5
 
2.2%
차량 5
 
2.2%
Other values (40) 181
78.4%

Length

2023-12-11T05:08:11.339654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
승합 5
 
2.2%
주민세(특별징수 5
 
2.2%
주민세(법인세분 5
 
2.2%
특수 5
 
2.2%
선박 5
 
2.2%
교육세 5
 
2.2%
기타승용 5
 
2.2%
3륜이하 5
 
2.2%
지방소득세(종합소득 5
 
2.2%
주민세(종업원분 5
 
2.2%
Other values (40) 181
78.4%

부과건수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct164
Distinct (%)71.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20870.121
Minimum0
Maximum358307
Zeros63
Zeros (%)27.3%
Negative0
Negative (%)0.0%
Memory size2.2 KiB
2023-12-11T05:08:11.532011image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median968
Q316689.5
95-th percentile93579
Maximum358307
Range358307
Interquartile range (IQR)16689.5

Descriptive statistics

Standard deviation54767.542
Coefficient of variation (CV)2.6242082
Kurtosis22.965306
Mean20870.121
Median Absolute Deviation (MAD)968
Skewness4.5253743
Sum4820998
Variance2.9994837 × 109
MonotonicityNot monotonic
2023-12-11T05:08:11.758360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 63
27.3%
9 3
 
1.3%
34 2
 
0.9%
7 2
 
0.9%
29 2
 
0.9%
1003 1
 
0.4%
15643 1
 
0.4%
1142 1
 
0.4%
55548 1
 
0.4%
18996 1
 
0.4%
Other values (154) 154
66.7%
ValueCountFrequency (%)
0 63
27.3%
2 1
 
0.4%
5 1
 
0.4%
7 2
 
0.9%
9 3
 
1.3%
10 1
 
0.4%
11 1
 
0.4%
14 1
 
0.4%
15 1
 
0.4%
16 1
 
0.4%
ValueCountFrequency (%)
358307 1
0.4%
350605 1
0.4%
332334 1
0.4%
328007 1
0.4%
324042 1
0.4%
134515 1
0.4%
129912 1
0.4%
129716 1
0.4%
122634 1
0.4%
117156 1
0.4%

부과금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct169
Distinct (%)73.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.1747362 × 109
Minimum0
Maximum2.1412224 × 1010
Zeros63
Zeros (%)27.3%
Negative0
Negative (%)0.0%
Memory size2.2 KiB
2023-12-11T05:08:11.987361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median3.33712 × 108
Q34.8026925 × 109
95-th percentile1.234167 × 1010
Maximum2.1412224 × 1010
Range2.1412224 × 1010
Interquartile range (IQR)4.8026925 × 109

Descriptive statistics

Standard deviation4.6664933 × 109
Coefficient of variation (CV)1.4698838
Kurtosis1.7561199
Mean3.1747362 × 109
Median Absolute Deviation (MAD)3.33712 × 108
Skewness1.5584709
Sum7.3336407 × 1011
Variance2.1776159 × 1019
MonotonicityNot monotonic
2023-12-11T05:08:12.262552image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 63
 
27.3%
333712000 1
 
0.4%
1152000 1
 
0.4%
19816336000 1
 
0.4%
7576286000 1
 
0.4%
19090183000 1
 
0.4%
817000 1
 
0.4%
4971030000 1
 
0.4%
10849000 1
 
0.4%
11756000 1
 
0.4%
Other values (159) 159
68.8%
ValueCountFrequency (%)
0 63
27.3%
100000 1
 
0.4%
359000 1
 
0.4%
576000 1
 
0.4%
661000 1
 
0.4%
672000 1
 
0.4%
720000 1
 
0.4%
747000 1
 
0.4%
817000 1
 
0.4%
1152000 1
 
0.4%
ValueCountFrequency (%)
21412224000 1
0.4%
19816336000 1
0.4%
19090183000 1
0.4%
16998552000 1
0.4%
16587560000 1
0.4%
15046031000 1
0.4%
13854936000 1
0.4%
13370468000 1
0.4%
12819485000 1
0.4%
12568994000 1
0.4%

Interactions

2023-12-11T05:08:09.372751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T05:08:09.092264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T05:08:09.499578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T05:08:09.244209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T05:08:12.411095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과세년도세목명세원 유형명부과건수부과금액
과세년도1.0000.0000.0000.0000.000
세목명0.0001.0001.0000.8890.580
세원 유형명0.0001.0001.0000.9980.920
부과건수0.0000.8890.9981.0000.726
부과금액0.0000.5800.9200.7261.000
2023-12-11T05:08:12.562601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
세목명세원 유형명과세년도
세목명1.0000.9110.000
세원 유형명0.9111.0000.000
과세년도0.0000.0001.000
2023-12-11T05:08:12.701402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
부과건수부과금액과세년도세목명세원 유형명
부과건수1.0000.8330.0000.7320.845
부과금액0.8331.0000.0000.2830.534
과세년도0.0000.0001.0000.0000.000
세목명0.7320.2830.0001.0000.911
세원 유형명0.8450.5340.0000.9111.000

Missing values

2023-12-11T05:08:09.651115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T05:08:09.827084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명자치단체코드과세년도세목명세원 유형명부과건수부과금액
0대구광역시서구271702018담배소비세담배소비세00
1대구광역시서구271702018교육세교육세35830710305226000
2대구광역시서구271702018도시계획세도시계획세00
3대구광역시서구271702018취득세건축물6064065771000
4대구광역시서구271702018취득세주택(개별)18868968081000
5대구광역시서구271702018취득세주택(단독)24395136037000
6대구광역시서구271702018취득세기타4078292000
7대구광역시서구271702018취득세항공기00
8대구광역시서구271702018취득세기계장비275254648000
9대구광역시서구271702018취득세차량1624911743013000
시도명시군구명자치단체코드과세년도세목명세원 유형명부과건수부과금액
221대구광역시서구271702022등록면허세등록면허세(면허)22717793592000
222대구광역시서구271702022등록면허세등록면허세(등록)449372943035000
223대구광역시서구271702022지역자원시설세지역자원시설세(소방)642673217684000
224대구광역시서구271702022지역자원시설세지역자원시설세(시설)10720000
225대구광역시서구271702022지역자원시설세지역자원시설세(특자)2098778000
226대구광역시서구271702022지방소득세지방소득세(특별징수)3086611518954000
227대구광역시서구271702022지방소득세지방소득세(법인소득)215211505248000
228대구광역시서구271702022지방소득세지방소득세(양도소득)13232595258000
229대구광역시서구271702022지방소득세지방소득세(종합소득)297002883040000
230대구광역시서구271702022체납체납1171565006848000