Overview

Dataset statistics

Number of variables: 8
Number of observations: 234
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 15.7 KiB
Average record size in memory: 68.6 B

Variable types

Categorical: 6
Numeric: 2

Dataset

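Description: Breaks each tax item down into detailed tax-source types and gives, for each type, the number of assessments (부과건수) and the assessed amount (부과금액) of local taxes in Jung-gu, Daejeon Metropolitan City.
URL: https://www.data.go.kr/data/15078537/fileData.do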

Alerts

시도명 has constant value "대전광역시" (Constant)
시군구명 has constant value "중구" (Constant)
자치단체코드 has constant value "30140" (Constant)
세원 유형명 is highly overall correlated with 부과건수 and 2 other fields (High correlation)
세목명 is highly overall correlated with 부과건수 and 1 other field (High correlation)
부과건수 is highly overall correlated with 부과금액 and 2 other fields (High correlation)
부과금액 is highly overall correlated with 부과건수 and 1 other field (High correlation)
부과건수 has 68 (29.1%) zeros (Zeros)
부과금액 has 70 (29.9%) zeros (Zeros)
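
The zero percentages in the alerts follow directly from the zero counts and the 234 observations. A minimal sketch of that arithmetic, using only figures quoted in this report (the names `N_ROWS`, `ZERO_COUNTS`, and `zero_share` are illustrative, not part of the report or of ydata-profiling):

```python
# Counts copied from the alerts above; 234 is the number of observations.
N_ROWS = 234
ZERO_COUNTS = {"부과건수": 68, "부과금액": 70}


def zero_share(zeros: int, total: int) -> float:
    """Return the percentage of zero-valued rows, rounded to one decimal."""
    return round(100 * zeros / total, 1)


shares = {name: zero_share(count, N_ROWS) for name, count in ZERO_COUNTS.items()}
for name, pct in shares.items():
    print(f"{name}: {pct}% zeros")
```

Both results round to the percentages shown in the alerts.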

Reproduction

Analysis started: 2023-12-11 23:20:51.543536
Analysis finished: 2023-12-11 23:20:52.195242
Duration: 0.65 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

시도명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.0 KiB
대전광역시: 234

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 대전광역시
2nd row: 대전광역시
3rd row: 대전광역시
4th row: 대전광역시
5th row: 대전광역시

Common Values

Value | Count | Frequency (%)
대전광역시 | 234 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values; counts as in the Common Values table]

시군구명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.0 KiB
중구: 234

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 중구
2nd row: 중구
3rd row: 중구
4th row: 중구
5th row: 중구

Common Values

Value | Count | Frequency (%)
중구 | 234 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values; counts as in the Common Values table]

자치단체코드
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.0 KiB
30140: 234

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 30140
2nd row: 30140
3rd row: 30140
4th row: 30140
5th row: 30140

Common Values

Value | Count | Frequency (%)
30140 | 234 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values; counts as in the Common Values table]

과세연도
Categorical

Distinct: 5
Distinct (%): 2.1%
Missing: 0
Missing (%): 0.0%
Memory size: 2.0 KiB
2017: 47
2018: 47
2019: 47
2020: 47
2021: 46

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2017
2nd row: 2017
3rd row: 2017
4th row: 2017
5th row: 2017

Common Values

Value | Count | Frequency (%)
2017 | 47 | 20.1%
2018 | 47 | 20.1%
2019 | 47 | 20.1%
2020 | 47 | 20.1%
2021 | 46 | 19.7%
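
The frequency column is each count divided by the total number of rows. A small sketch, assuming only the 과세연도 counts listed above (`YEAR_COUNTS` and `freq` are illustrative names), confirms that the counts sum to the 234 observations and reproduces the percentages:

```python
# Row counts per 과세연도 value, copied from the Common Values table.
YEAR_COUNTS = {"2017": 47, "2018": 47, "2019": 47, "2020": 47, "2021": 46}

# The counts should add up to the number of observations in the dataset.
total = sum(YEAR_COUNTS.values())

# Frequency (%) is count / total, rounded to one decimal as in the report.
freq = {year: round(100 * n / total, 1) for year, n in YEAR_COUNTS.items()}

print(total)
for year, pct in freq.items():
    print(f"{year}: {pct}%")
```

The single missing row in 2021 is why that year sits at 19.7% rather than 20.1%.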

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values; counts as in the Common Values table]

세목명
Categorical

HIGH CORRELATION 

Distinct: 13
Distinct (%): 5.6%
Missing: 0
Missing (%): 0.0%
Memory size: 2.0 KiB
취득세: 45
주민세: 43
자동차세: 35
재산세: 25
지방소득세: 20
Other values (8): 66

Length

Max length: 7
Median length: 3
Mean length: 3.7008547
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 지방소득세
2nd row: 지방소득세
3rd row: 지방소득세
4th row: 지방소득세
5th row: 담배소비세

Common Values

Value | Count | Frequency (%)
취득세 | 45 | 19.2%
주민세 | 43 | 18.4%
자동차세 | 35 | 15.0%
재산세 | 25 | 10.7%
지방소득세 | 20 | 8.5%
레저세 | 20 | 8.5%
지역자원시설세 | 11 | 4.7%
등록면허세 | 10 | 4.3%
담배소비세 | 5 | 2.1%
교육세 | 5 | 2.1%
Other values (3) | 15 | 6.4%

Length

[Figure: Histogram of lengths of the category]

세원 유형명
Categorical

HIGH CORRELATION 

Distinct: 50
Distinct (%): 21.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.0 KiB
지방소득세(특별징수): 5
기타승용: 5
자동차세(주행): 5
특수: 5
지방소득세(종합소득): 5
Other values (45): 209

Length

Max length: 11
Median length: 8
Mean length: 6.0384615
Min length: 2

Unique

Unique: 3
Unique (%): 1.3%

Sample

1st row: 지방소득세(특별징수)
2nd row: 지방소득세(법인소득)
3rd row: 지방소득세(양도소득)
4th row: 지방소득세(종합소득)
5th row: 담배소비세

Common Values

Value | Count | Frequency (%)
지방소득세(특별징수) | 5 | 2.1%
기타승용 | 5 | 2.1%
자동차세(주행) | 5 | 2.1%
특수 | 5 | 2.1%
지방소득세(종합소득) | 5 | 2.1%
담배소비세 | 5 | 2.1%
교육세 | 5 | 2.1%
도시계획세 | 5 | 2.1%
건축물 | 5 | 2.1%
주택(단독) | 5 | 2.1%
Other values (40) | 184 | 78.6%

Length

[Figure: Histogram of lengths of the category]

Value | Count | Frequency (%)
지방소득세(특별징수 | 5 | 2.1%
지방소득세(양도소득 | 5 | 2.1%
등록면허세(면허 | 5 | 2.1%
기타승용 | 5 | 2.1%
지방소득세(법인소득 | 5 | 2.1%
승용 | 5 | 2.1%
주민세(종업원분 | 5 | 2.1%
주민세(특별징수 | 5 | 2.1%
주민세(법인세분 | 5 | 2.1%
체납 | 5 | 2.1%
Other values (40) | 184 | 78.6%

부과건수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 163
Distinct (%): 69.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 26752.197
Minimum: 0
Maximum: 452896
Zeros: 68
Zeros (%): 29.1%
Negative: 0
Negative (%): 0.0%
Memory size: 2.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 1189.5
Q3: 17521.75
95-th percentile: 134855.95
Maximum: 452896
Range: 452896
Interquartile range (IQR): 17521.75

Descriptive statistics

Standard deviation: 71982.802
Coefficient of variation (CV): 2.6907249
Kurtosis: 22.426756
Mean: 26752.197
Median Absolute Deviation (MAD): 1189.5
Skewness: 4.4665066
Sum: 6260014
Variance: 5.1815238 × 10^9
Monotonicity: Not monotonic
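
Several of the 부과건수 figures are linked by simple identities: the mean is the sum over the number of observations, the coefficient of variation is the standard deviation over the mean, the variance is the squared standard deviation, and the IQR is Q3 minus Q1. The sketch below recomputes them from the rounded values printed in this report (the variable names are illustrative), so small differences in the last digits are expected:

```python
# Figures copied from the 부과건수 statistics (rounded as reported).
N_ROWS = 234        # number of observations
TOTAL = 6260014     # Sum
STD = 71982.802     # Standard deviation
Q1, Q3 = 0.0, 17521.75
MINIMUM, MAXIMUM = 0, 452896

mean = TOTAL / N_ROWS        # arithmetic mean
cv = STD / mean              # coefficient of variation
variance = STD ** 2          # variance is the squared standard deviation
iqr = Q3 - Q1                # interquartile range
value_range = MAXIMUM - MINIMUM

print(f"mean     = {mean:.3f}")
print(f"CV       = {cv:.7f}")
print(f"variance = {variance:.7e}")
print(f"IQR      = {iqr}")
print(f"range    = {value_range}")
```

Because Q1 and the minimum are both 0, the IQR equals Q3 and the range equals the maximum, which matches the reported values.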
[Figure: Histogram with fixed size bins (bins=50)]

Common values

Value | Count | Frequency (%)
0 | 68 | 29.1%
4 | 3 | 1.3%
16 | 2 | 0.9%
7 | 2 | 0.9%
27639 | 1 | 0.4%
3985 | 1 | 0.4%
26 | 1 | 0.4%
1521 | 1 | 0.4%
17 | 1 | 0.4%
1456 | 1 | 0.4%
Other values (153) | 153 | 65.4%

Minimum 10 values

Value | Count | Frequency (%)
0 | 68 | 29.1%
1 | 1 | 0.4%
3 | 1 | 0.4%
4 | 3 | 1.3%
6 | 1 | 0.4%
7 | 2 | 0.9%
10 | 1 | 0.4%
16 | 2 | 0.9%
17 | 1 | 0.4%
19 | 1 | 0.4%

Maximum 10 values

Value | Count | Frequency (%)
452896 | 1 | 0.4%
445640 | 1 | 0.4%
442588 | 1 | 0.4%
442256 | 1 | 0.4%
440598 | 1 | 0.4%
155495 | 1 | 0.4%
149427 | 1 | 0.4%
147316 | 1 | 0.4%
145527 | 1 | 0.4%
143263 | 1 | 0.4%

부과금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 165
Distinct (%): 70.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3.8135075 × 10^9
Minimum: 0
Maximum: 2.961913 × 10^10
Zeros: 70
Zeros (%): 29.9%
Negative: 0
Negative (%): 0.0%
Memory size: 2.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 2.339045 × 10^8
Q3: 6.9773718 × 10^9
95-th percentile: 1.5602907 × 10^10
Maximum: 2.961913 × 10^10
Range: 2.961913 × 10^10
Interquartile range (IQR): 6.9773718 × 10^9

Descriptive statistics

Standard deviation: 5.7131514 × 10^9
Coefficient of variation (CV): 1.4981356
Kurtosis: 1.593404
Mean: 3.8135075 × 10^9
Median Absolute Deviation (MAD): 2.339045 × 10^8
Skewness: 1.49353
Sum: 8.9236074 × 10^11
Variance: 3.2640099 × 10^19
Monotonicity: Not monotonic
[Figure: Histogram with fixed size bins (bins=50)]

Common values

Value | Count | Frequency (%)
0 | 70 | 29.9%
13006054000 | 1 | 0.4%
465945000 | 1 | 0.4%
14454948000 | 1 | 0.4%
109066000 | 1 | 0.4%
2876000 | 1 | 0.4%
219065000 | 1 | 0.4%
2277000 | 1 | 0.4%
18966444000 | 1 | 0.4%
9839000 | 1 | 0.4%
Other values (155) | 155 | 66.2%

Minimum 10 values

Value | Count | Frequency (%)
0 | 70 | 29.9%
311000 | 1 | 0.4%
379000 | 1 | 0.4%
467000 | 1 | 0.4%
815000 | 1 | 0.4%
1017000 | 1 | 0.4%
1105000 | 1 | 0.4%
1202000 | 1 | 0.4%
1349000 | 1 | 0.4%
1436000 | 1 | 0.4%

Maximum 10 values

Value | Count | Frequency (%)
29619130000 | 1 | 0.4%
19198778000 | 1 | 0.4%
18966444000 | 1 | 0.4%
18030884000 | 1 | 0.4%
17965090000 | 1 | 0.4%
17853285000 | 1 | 0.4%
17816872000 | 1 | 0.4%
17695402000 | 1 | 0.4%
17546509000 | 1 | 0.4%
16895353000 | 1 | 0.4%

Interactions

[Figures: four interaction plots]

Correlations

 | 과세연도 | 세목명 | 세원 유형명 | 부과건수 | 부과금액
과세연도 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000
세목명 | 0.000 | 1.000 | 1.000 | 0.875 | 0.600
세원 유형명 | 0.000 | 1.000 | 1.000 | 0.998 | 0.905
부과건수 | 0.000 | 0.875 | 0.998 | 1.000 | 0.567
부과금액 | 0.000 | 0.600 | 0.905 | 0.567 | 1.000

 | 세원 유형명 | 세목명 | 과세연도
세원 유형명 | 1.000 | 0.912 | 0.000
세목명 | 0.912 | 1.000 | 0.000
과세연도 | 0.000 | 0.000 | 1.000

 | 부과건수 | 부과금액 | 과세연도 | 세목명 | 세원 유형명
부과건수 | 1.000 | 0.874 | 0.000 | 0.707 | 0.845
부과금액 | 0.874 | 1.000 | 0.000 | 0.320 | 0.566
과세연도 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000
세목명 | 0.707 | 0.320 | 0.000 | 1.000 | 0.912
세원 유형명 | 0.845 | 0.566 | 0.000 | 0.912 | 1.000
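
The High correlation alerts can be related to the last matrix above by listing, for each field, the other fields whose coefficient reaches a cut-off. The sketch below is illustrative only: the cut-off of 0.5 is an assumption (the report does not state it), chosen because it happens to reproduce the number of correlated fields named in each alert, and `high_partners` is a hypothetical helper rather than a ydata-profiling function.

```python
# Last correlation matrix of the report, in the row/column order shown there.
FIELDS = ["부과건수", "부과금액", "과세연도", "세목명", "세원 유형명"]
MATRIX = [
    [1.000, 0.874, 0.000, 0.707, 0.845],
    [0.874, 1.000, 0.000, 0.320, 0.566],
    [0.000, 0.000, 1.000, 0.000, 0.000],
    [0.707, 0.320, 0.000, 1.000, 0.912],
    [0.845, 0.566, 0.000, 0.912, 1.000],
]
THRESHOLD = 0.5  # assumed cut-off; not stated anywhere in the report


def high_partners(fields, matrix, threshold):
    """Map each field to the other fields correlated at or above threshold."""
    result = {}
    for i, name in enumerate(fields):
        partners = [
            fields[j]
            for j, value in enumerate(matrix[i])
            if j != i and value >= threshold
        ]
        if partners:
            result[name] = partners
    return result


# A correlation matrix should be symmetric; verify before using it.
size = len(FIELDS)
is_symmetric = all(
    MATRIX[i][j] == MATRIX[j][i] for i in range(size) for j in range(size)
)

partners = high_partners(FIELDS, MATRIX, THRESHOLD)
for name, others in partners.items():
    print(f"{name}: {len(others)} correlated fields -> {', '.join(others)}")
```

Under this assumption 부과건수 and 세원 유형명 each have three correlated fields, 부과금액 and 세목명 each have two, and 과세연도 has none, which is consistent with the alerts.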

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.]

Sample

 | 시도명 | 시군구명 | 자치단체코드 | 과세연도 | 세목명 | 세원 유형명 | 부과건수부과금액
0 | 대전광역시 | 중구 | 30140 | 2017 | 지방소득세 | 지방소득세(특별징수) | 2763913006054000
1 | 대전광역시 | 중구 | 30140 | 2017 | 지방소득세 | 지방소득세(법인소득) | 208511477943000
2 | 대전광역시 | 중구 | 30140 | 2017 | 지방소득세 | 지방소득세(양도소득) | 33153467021000
3 | 대전광역시 | 중구 | 30140 | 2017 | 지방소득세 | 지방소득세(종합소득) | 226505499364000
4 | 대전광역시 | 중구 | 30140 | 2017 | 담배소비세 | 담배소비세 | 00
5 | 대전광역시 | 중구 | 30140 | 2017 | 교육세 | 교육세 | 44225613199173000
6 | 대전광역시 | 중구 | 30140 | 2017 | 도시계획세 | 도시계획세 | 00
7 | 대전광역시 | 중구 | 30140 | 2017 | 취득세 | 건축물 | 116010523797000
8 | 대전광역시 | 중구 | 30140 | 2017 | 취득세 | 주택(단독) | 41668004556000
9 | 대전광역시 | 중구 | 30140 | 2017 | 취득세 | 주택(개별) | 16338285324000

 | 시도명 | 시군구명 | 자치단체코드 | 과세연도 | 세목명 | 세원 유형명 | 부과건수부과금액
224 | 대전광역시 | 중구 | 30140 | 2021 | 지역자원시설세 | 지역자원시설세(소방) | 1234584173690000
225 | 대전광역시 | 중구 | 30140 | 2021 | 지역자원시설세 | 지역자원시설세(시설) | 00
226 | 대전광역시 | 중구 | 30140 | 2021 | 지역자원시설세 | 지역자원시설세(특자) | 51314394000
227 | 대전광역시 | 중구 | 30140 | 2021 | 재산세 | 재산세(주택) | 8678114870269000
228 | 대전광역시 | 중구 | 30140 | 2021 | 재산세 | 재산세(토지) | 2165013066717000
229 | 대전광역시 | 중구 | 30140 | 2021 | 재산세 | 재산세(항공기) | 00
230 | 대전광역시 | 중구 | 30140 | 2021 | 재산세 | 재산세(선박) | 981105000
231 | 대전광역시 | 중구 | 30140 | 2021 | 재산세 | 재산세(건축물) | 174255268341000
232 | 대전광역시 | 중구 | 30140 | 2021 | 지방소비세 | 지방소비세 | 77188608000
233 | 대전광역시 | 중구 | 30140 | 2021 | 체납 | 체납 | 1494279442359000