Overview

Dataset statistics

Number of variables: 8
Number of observations: 137
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 9.2 KiB
Average record size in memory: 69.0 B

Variable types

Categorical: 6
Numeric: 2

Dataset

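Description: Provides the status of local tax levies by type of taxable object (the tax source for local taxation), for use as baseline data when reviewing the equity of the tax burden across property types and when identifying the targets of regulatory policy in related fields such as real estate.
Author: Suseong-gu, Daegu Metropolitan City (대구광역시 수성구)
URL: https://www.data.go.kr/data/15079187/fileData.do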

Alerts

시도명 has constant value "대구광역시" (Constant)
시군구명 has constant value "수성구" (Constant)
자치단체코드 has constant value "27260" (Constant)
부과건수 is highly overall correlated with 부과금액 and 2 other fields (High correlation)
부과금액 is highly overall correlated with 부과건수 (High correlation)
세목명 is highly overall correlated with 부과건수 and 1 other fields (High correlation)
세원 유형명 is highly overall correlated with 부과건수 and 1 other fields (High correlation)
부과건수 has 37 (27.0%) zeros (Zeros)
부과금액 has 37 (27.0%) zeros (Zeros)
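
The Zeros alerts reduce to a simple share: 37 zero entries out of 137 observations. The following is a minimal sketch of that arithmetic only. The helper name `zero_share` is hypothetical and is not part of ydata-profiling.

```python
def zero_share(zero_count: int, n_rows: int) -> float:
    """Return the percentage of rows whose value is exactly zero."""
    if n_rows <= 0:
        raise ValueError("n_rows must be positive")
    return 100.0 * zero_count / n_rows


# Figures taken from the alerts above: 37 zeros in 137 observations.
share = zero_share(37, 137)
print(f"{share:.1f}%")  # prints 27.0%
```

The same share applies to both 부과건수 and 부과금액, since each reports 37 zeros.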

Reproduction

Analysis started: 2023-12-12 01:30:24.638500
Analysis finished: 2023-12-12 01:30:25.559726
Duration: 0.92 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
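
The reported duration can be checked against the two timestamps. This is a small sketch using only the Python standard library. The format string `FMT` is an assumption about how the timestamps are laid out, inferred from the values shown above.

```python
from datetime import datetime

# Assumed timestamp layout, inferred from the values in this section.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-12-12 01:30:24.638500", FMT)
finished = datetime.strptime("2023-12-12 01:30:25.559726", FMT)

# Elapsed wall-clock time between the two timestamps, in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")  # prints 0.92 seconds
```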

Variables

시도명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB

대구광역시  137

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 대구광역시
2nd row: 대구광역시
3rd row: 대구광역시
4th row: 대구광역시
5th row: 대구광역시

Common Values

Value  Count  Frequency (%)
대구광역시  137  100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values; 대구광역시 accounts for all 137 rows (100.0%)]

시군구명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB

수성구  137

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 수성구
2nd row: 수성구
3rd row: 수성구
4th row: 수성구
5th row: 수성구

Common Values

Value  Count  Frequency (%)
수성구  137  100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values; 수성구 accounts for all 137 rows (100.0%)]

자치단체코드
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB

27260  137

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 27260
2nd row: 27260
3rd row: 27260
4th row: 27260
5th row: 27260

Common Values

Value  Count  Frequency (%)
27260  137  100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values; 27260 accounts for all 137 rows (100.0%)]

과세년도
Categorical

Distinct: 3
Distinct (%): 2.2%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB

2021  46
2022  46
2020  45

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2020
2nd row: 2020
3rd row: 2020
4th row: 2020
5th row: 2020

Common Values

Value  Count  Frequency (%)
2021  46  33.6%
2022  46  33.6%
2020  45  32.8%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values; 2021: 46 rows (33.6%), 2022: 46 rows (33.6%), 2020: 45 rows (32.8%)]

세목명
Categorical

HIGH CORRELATION 

Distinct: 13
Distinct (%): 9.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB

취득세  27
주민세  23
자동차세  21
재산세  15
레저세  12
Other values (8)  39

Length

Max length: 7
Median length: 3
Mean length: 3.729927
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 교육세
2nd row: 취득세
3rd row: 취득세
4th row: 취득세
5th row: 취득세

Common Values

Value  Count  Frequency (%)
취득세  27  19.7%
주민세  23  16.8%
자동차세  21  15.3%
재산세  15  10.9%
레저세  12  8.8%
지방소득세  12  8.8%
지역자원시설세  8  5.8%
등록면허세  6  4.4%
교육세  3  2.2%
지방소비세  3  2.2%
Other values (3)  7  5.1%

Length

[Figure: histogram of lengths of the category]

세원 유형명
Categorical

HIGH CORRELATION 

Distinct: 50
Distinct (%): 36.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB

교육세  3
토지  3
경정  3
주택(개별)  3
지방소득세(특별징수)  3
Other values (45)  122

Length

Max length: 11
Median length: 8
Mean length: 6.0437956
Min length: 2

Unique

Unique: 4
Unique (%): 2.9%

Sample

1st row: 교육세
2nd row: 건축물
3rd row: 주택(개별)
4th row: 주택(단독)
5th row: 기타

Common Values

Value  Count  Frequency (%)
교육세  3  2.2%
토지  3  2.2%
경정  3  2.2%
주택(개별)  3  2.2%
지방소득세(특별징수)  3  2.2%
기타  3  2.2%
항공기  3  2.2%
기계장비  3  2.2%
차량  3  2.2%
선박  3  2.2%
Other values (40)  107  78.1%

Length

[Figure: histogram of lengths of the category]

Value  Count  Frequency (%)
교육세  3  2.2%
주민세(종합소득  3  2.2%
지방소득세(법인소득  3  2.2%
승합  3  2.2%
토지  3  2.2%
건축물  3  2.2%
주민세(종업원분  3  2.2%
체납  3  2.2%
주민세(법인세분  3  2.2%
주민세(양도소득  3  2.2%
Other values (40)  107  78.1%

부과건수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 101
Distinct (%): 73.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 52244.438
Minimum: 0
Maximum: 910033
Zeros: 37
Zeros (%): 27.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 1955
Q3: 31482
95-th percentile: 295291.8
Maximum: 910033
Range: 910033
Interquartile range (IQR): 31482

Descriptive statistics

Standard deviation: 146182.61
Coefficient of variation (CV): 2.7980511
Kurtosis: 23.994009
Mean: 52244.438
Median Absolute Deviation (MAD): 1955
Skewness: 4.6582398
Sum: 7157488
Variance: 2.1369355 × 10^10
Monotonicity: Not monotonic
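
Several of the figures reported for 부과건수 are tied together by simple identities: the mean is the sum divided by the number of observations, the coefficient of variation is the standard deviation divided by the mean, the variance is the squared standard deviation, and the IQR and range are differences of quantiles. The sketch below rechecks those relationships from the reported numbers alone. It does not recompute anything from the raw data, and the variable names are illustrative.

```python
# Values copied from the statistics reported for 부과건수 in this report.
n = 137                      # number of observations
total = 7157488              # Sum
std = 146182.61              # Standard deviation
q1, q3 = 0, 31482            # Q1 and Q3
minimum, maximum = 0, 910033

mean = total / n             # arithmetic mean = sum / count
cv = std / mean              # coefficient of variation = std / mean
variance = std ** 2          # variance = squared standard deviation
iqr = q3 - q1                # interquartile range
value_range = maximum - minimum

print(f"mean={mean:.3f} cv={cv:.4f} variance={variance:.4e}")
print(f"iqr={iqr} range={value_range}")
```

A coefficient of variation near 2.8 together with a skewness of about 4.66 indicates a strongly right-skewed distribution, which fits a median of 1955 against a maximum of 910033.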
Histogram with fixed size bins (bins=50)
Common values

Value  Count  Frequency (%)
0  37  27.0%
896866  1  0.7%
356  1  0.7%
3031  1  0.7%
41  1  0.7%
29817  1  0.7%
109  1  0.7%
290  1  0.7%
4913  1  0.7%
973  1  0.7%
Other values (91)  91  66.4%

Minimum 10 values

Value  Count  Frequency (%)
0  37  27.0%
6  1  0.7%
7  1  0.7%
9  1  0.7%
11  1  0.7%
24  1  0.7%
34  1  0.7%
41  1  0.7%
46  1  0.7%
52  1  0.7%

Maximum 10 values

Value  Count  Frequency (%)
910033  1  0.7%
906043  1  0.7%
896866  1  0.7%
323816  1  0.7%
317617  1  0.7%
307168  1  0.7%
299391  1  0.7%
294267  1  0.7%
286870  1  0.7%
160866  1  0.7%

부과금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 101
Distinct (%): 73.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.5738321 × 10^10
Minimum: 0
Maximum: 9.9996465 × 10^10
Zeros: 37
Zeros (%): 27.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 4.83216 × 10^8
Q3: 2.9923377 × 10^10
95-th percentile: 7.0595045 × 10^10
Maximum: 9.9996465 × 10^10
Range: 9.9996465 × 10^10
Interquartile range (IQR): 2.9923377 × 10^10

Descriptive statistics

Standard deviation: 2.4677219 × 10^10
Coefficient of variation (CV): 1.5679702
Kurtosis: 1.2122471
Mean: 1.5738321 × 10^10
Median Absolute Deviation (MAD): 4.83216 × 10^8
Skewness: 1.5170354
Sum: 2.15615 × 10^12
Variance: 6.0896513 × 10^20
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value  Count  Frequency (%)
0  37  27.0%
51331874000  1  0.7%
47709000  1  0.7%
47883528000  1  0.7%
11953000  1  0.7%
72232565000  1  0.7%
594758000  1  0.7%
333647000  1  0.7%
54835568000  1  0.7%
14775856000  1  0.7%
Other values (91)  91  66.4%

Minimum 10 values

Value  Count  Frequency (%)
0  37  27.0%
1218000  1  0.7%
3523000  1  0.7%
3802000  1  0.7%
4118000  1  0.7%
11953000  1  0.7%
12801000  1  0.7%
13854000  1  0.7%
14217000  1  0.7%
19425000  1  0.7%

Maximum 10 values

Value  Count  Frequency (%)
99996465000  1  0.7%
93264065000  1  0.7%
82799301000  1  0.7%
77039778000  1  0.7%
72531300000  1  0.7%
72232565000  1  0.7%
71439053000  1  0.7%
70384043000  1  0.7%
67981806000  1  0.7%
60997968000  1  0.7%

Interactions

[Figures: four pairwise interaction plots of the numeric variables 부과건수 and 부과금액]

Correlations

[Figure: correlation heatmap]

             과세년도  세목명  세원 유형명  부과건수  부과금액
과세년도     1.000  0.000  0.000  0.000  0.000
세목명       0.000  1.000  1.000  0.852  0.598
세원 유형명  0.000  1.000  1.000  0.998  0.873
부과건수     0.000  0.852  0.998  1.000  0.643
부과금액     0.000  0.598  0.873  0.643  1.000

[Figure: correlation heatmap]

             과세년도  세원 유형명  세목명
과세년도     1.000  0.000  0.000
세원 유형명  0.000  1.000  0.838
세목명       0.000  0.838  1.000

[Figure: correlation heatmap]

             부과건수  부과금액  과세년도  세목명  세원 유형명
부과건수     1.000  0.844  0.000  0.669  0.793
부과금액     0.844  1.000  0.000  0.293  0.408
과세년도     0.000  0.000  1.000  0.000  0.000
세목명       0.669  0.293  0.000  1.000  0.838
세원 유형명  0.793  0.408  0.000  0.838  1.000
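
The High correlation alerts near the top of the report appear to line up with the last matrix in this section. As a sketch under a stated assumption, the code below applies a cut-off of 0.5 to that matrix and lists, for each variable, the other variables at or above it. The threshold value is an assumption, since the report does not state it, and the helper `highly_correlated` is hypothetical rather than a ydata-profiling function.

```python
# Last correlation matrix of this section; rows and columns share one order.
names = ["부과건수", "부과금액", "과세년도", "세목명", "세원 유형명"]
matrix = [
    [1.000, 0.844, 0.000, 0.669, 0.793],
    [0.844, 1.000, 0.000, 0.293, 0.408],
    [0.000, 0.000, 1.000, 0.000, 0.000],
    [0.669, 0.293, 0.000, 1.000, 0.838],
    [0.793, 0.408, 0.000, 0.838, 1.000],
]
THRESHOLD = 0.5  # assumed cut-off; not stated anywhere in the report


def highly_correlated(names, matrix, threshold):
    """Map each variable to the other variables at or above the threshold."""
    result = {}
    for i, name in enumerate(names):
        partners = [
            names[j]
            for j, value in enumerate(matrix[i])
            if j != i and value >= threshold
        ]
        if partners:
            result[name] = partners
    return result


alerts = highly_correlated(names, matrix, THRESHOLD)
for name, partners in alerts.items():
    print(name, "->", ", ".join(partners))
```

With this cut-off, 부과건수 is paired with three fields, 부과금액 with 부과건수 only, and 세목명 and 세원 유형명 with two fields each, which matches the counts in the alerts. 과세년도 has no partner, so it raises no alert.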

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index  시도명  시군구명  자치단체코드  과세년도  세목명  세원 유형명  부과건수 / 부과금액
0  대구광역시  수성구  27260  2020  교육세  교육세  896866 / 51331874000
1  대구광역시  수성구  27260  2020  취득세  건축물  1955 / 30499378000
2  대구광역시  수성구  27260  2020  취득세  주택(개별)  2089 / 29923377000
3  대구광역시  수성구  27260  2020  취득세  주택(단독)  12266 / 99996465000
4  대구광역시  수성구  27260  2020  취득세  기타  408426116000
5  대구광역시  수성구  27260  2020  취득세  항공기  0 / 0
6  대구광역시  수성구  27260  2020  취득세  기계장비  99326065000
7  대구광역시  수성구  27260  2020  취득세  차량  35187 / 70384043000
8  대구광역시  수성구  27260  2020  취득세  선박  52 / 14217000
9  대구광역시  수성구  27260  2020  취득세  토지  299645879679000

Last rows

Index  시도명  시군구명  자치단체코드  과세년도  세목명  세원 유형명  부과건수 / 부과금액
127  대구광역시  수성구  27260  2022  지방소득세  지방소득세(양도소득)  941144257353000
128  대구광역시  수성구  27260  2022  지방소득세  지방소득세(종합소득)  95064 / 60997968000
129  대구광역시  수성구  27260  2022  지방소비세  지방소비세  912339173000
130  대구광역시  수성구  27260  2022  담배소비세  담배소비세  0 / 0
131  대구광역시  수성구  27260  2022  등록면허세  등록면허세(면허)  508251746842000
132  대구광역시  수성구  27260  2022  등록면허세  등록면허세(등록)  62453 / 10138488000
133  대구광역시  수성구  27260  2022  지역자원시설세  지역자원시설세(소방)  299391 / 9472961000
134  대구광역시  수성구  27260  2022  지역자원시설세  지역자원시설세(시설)  0 / 0
135  대구광역시  수성구  27260  2022  지역자원시설세  지역자원시설세(특자)  306 / 43943000
136  대구광역시  수성구  27260  2022  체납  체납  137775 / 11808178000