Overview

Dataset statistics

Number of variables: 8
Number of observations: 234
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 15.7 KiB
Average record size in memory: 68.6 B

Variable types

Categorical: 6
Numeric: 2

Dataset

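Description: Contains the number of tax assessments and the assessed amounts by tax item type for Seo-gu, Incheon Metropolitan City, from 2017 through 2021. (Main tax items: acquisition tax (취득세), property tax (재산세), resident tax (주민세), automobile tax (자동차세), local income tax (지방소득세), etc.)
Author: Seo-gu, Incheon Metropolitan City (인천광역시 서구)
URL: https://data.incheon.go.kr/findData/publicDataDetail?dataId=15078566&srcSe=7661IVAWM27C61E190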

Alerts

시도명 has constant value "인천광역시" (Constant)
시군구명 has constant value "서구" (Constant)
자치단체코드 has constant value "28260" (Constant)
세원 유형명 is highly overall correlated with 부과건수 and 1 other field (High correlation)
세목명 is highly overall correlated with 부과건수 and 1 other field (High correlation)
부과건수 is highly overall correlated with 부과금액 and 2 other fields (High correlation)
부과금액 is highly overall correlated with 부과건수 (High correlation)
부과건수 has 68 (29.1%) zeros (Zeros)
부과금액 has 69 (29.5%) zeros (Zeros)
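
The percentages in the Zeros alerts follow from the zero counts and the 234 observations reported in the overview. A minimal sketch of that arithmetic, assuming a hypothetical helper named `share_percent` (it is not part of ydata-profiling):

```python
def share_percent(count: int, total: int) -> float:
    """Return count as a percentage of total, rounded to one decimal place."""
    return round(100 * count / total, 1)


N_OBSERVATIONS = 234  # number of observations reported in the overview

zeros_in_count = share_percent(68, N_OBSERVATIONS)   # zeros in 부과건수
zeros_in_amount = share_percent(69, N_OBSERVATIONS)  # zeros in 부과금액
print(zeros_in_count, zeros_in_amount)
```

The same helper reproduces the Distinct (%) figure for a constant column, since one distinct value out of 234 rows rounds to 0.4%.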

Reproduction

Analysis started: 2024-01-28 12:17:07.699077
Analysis finished: 2024-01-28 12:17:08.481069
Duration: 0.78 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
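
The reported duration can be checked against the two timestamps. A small sketch using only the Python standard library, with the timestamps copied from the lines above:

```python
from datetime import datetime

# Timestamps as printed in the Reproduction section.
started = datetime.fromisoformat("2024-01-28 12:17:07.699077")
finished = datetime.fromisoformat("2024-01-28 12:17:08.481069")

duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```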

Variables

시도명
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.0 KiB

인천광역시: 234

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 인천광역시
2nd row: 인천광역시
3rd row: 인천광역시
4th row: 인천광역시
5th row: 인천광역시

Common Values

Value | Count | Frequency (%)
인천광역시 | 234 | 100.0%

Length

[Figure: Histogram of lengths of the category]

[Figure: Common Values (Plot)]

시군구명
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.0 KiB

서구: 234

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 서구
2nd row: 서구
3rd row: 서구
4th row: 서구
5th row: 서구

Common Values

Value | Count | Frequency (%)
서구 | 234 | 100.0%

Length

[Figure: Histogram of lengths of the category]

[Figure: Common Values (Plot)]

자치단체코드
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.0 KiB

28260: 234

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 28260
2nd row: 28260
3rd row: 28260
4th row: 28260
5th row: 28260

Common Values

Value | Count | Frequency (%)
28260 | 234 | 100.0%

Length

[Figure: Histogram of lengths of the category]

[Figure: Common Values (Plot)]

과세년도
Categorical

Distinct: 5
Distinct (%): 2.1%
Missing: 0
Missing (%): 0.0%
Memory size: 2.0 KiB

2017: 47
2018: 47
2019: 47
2020: 47
2021: 46

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2017
2nd row: 2017
3rd row: 2017
4th row: 2017
5th row: 2017

Common Values

Value | Count | Frequency (%)
2017 | 47 | 20.1%
2018 | 47 | 20.1%
2019 | 47 | 20.1%
2020 | 47 | 20.1%
2021 | 46 | 19.7%
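
A frequency table of this shape can be rebuilt from the per-year counts. The sketch below reconstructs the column from the counts shown above (the real column is not available here, so the list is an assumption) and uses `collections.Counter` from the standard library:

```python
from collections import Counter

# Stand-in for the 과세년도 column, rebuilt from the reported counts.
years = ["2017"] * 47 + ["2018"] * 47 + ["2019"] * 47 + ["2020"] * 47 + ["2021"] * 46

counts = Counter(years)
total = sum(counts.values())

# Frequency (%) per value, rounded to one decimal place as in the report.
frequency = {year: round(100 * n / total, 1) for year, n in counts.items()}

for year in sorted(counts):
    print(year, counts[year], f"{frequency[year]}%")
```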

Length

[Figure: Histogram of lengths of the category]

[Figure: Common Values (Plot)]

세목명
Categorical

HIGH CORRELATION

Distinct: 13
Distinct (%): 5.6%
Missing: 0
Missing (%): 0.0%
Memory size: 2.0 KiB

취득세: 45
주민세: 43
자동차세: 35
재산세: 25
지방소득세: 20
Other values (8): 66

Length

Max length: 7
Median length: 3
Mean length: 3.7008547
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 지방소득세
2nd row: 지방소득세
3rd row: 지방소득세
4th row: 지방소득세
5th row: 담배소비세

Common Values

Value | Count | Frequency (%)
취득세 | 45 | 19.2%
주민세 | 43 | 18.4%
자동차세 | 35 | 15.0%
재산세 | 25 | 10.7%
지방소득세 | 20 | 8.5%
레저세 | 20 | 8.5%
지역자원시설세 | 11 | 4.7%
등록면허세 | 10 | 4.3%
담배소비세 | 5 | 2.1%
교육세 | 5 | 2.1%
Other values (3) | 15 | 6.4%

Length

[Figure: Histogram of lengths of the category]

세원 유형명
Categorical

HIGH CORRELATION

Distinct: 50
Distinct (%): 21.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.0 KiB

지방소득세(특별징수): 5
주민세(특별징수): 5
교육세: 5
승용: 5
경정: 5
Other values (45): 209

Length

Max length: 11
Median length: 8
Mean length: 6.0384615
Min length: 2

Unique

Unique: 3
Unique (%): 1.3%

Sample

1st row: 지방소득세(특별징수)
2nd row: 지방소득세(법인소득)
3rd row: 지방소득세(양도소득)
4th row: 지방소득세(종합소득)
5th row: 담배소비세

Common Values

Value | Count | Frequency (%)
지방소득세(특별징수) | 5 | 2.1%
주민세(특별징수) | 5 | 2.1%
교육세 | 5 | 2.1%
승용 | 5 | 2.1%
경정 | 5 | 2.1%
경륜 | 5 | 2.1%
경마 | 5 | 2.1%
재산세(주택) | 5 | 2.1%
기타승용 | 5 | 2.1%
재산세(항공기) | 5 | 2.1%
Other values (40) | 184 | 78.6%

Length

[Figure: Histogram of lengths of the category]

Value | Count | Frequency (%)
지방소득세(특별징수 | 5 | 2.1%
소싸움 | 5 | 2.1%
기타 | 5 | 2.1%
주민세(특별징수 | 5 | 2.1%
주민세(법인세분 | 5 | 2.1%
주민세(양도소득 | 5 | 2.1%
주민세(종합소득 | 5 | 2.1%
지방소비세 | 5 | 2.1%
건축물 | 5 | 2.1%
체납 | 5 | 2.1%
Other values (40) | 184 | 78.6%

부과건수
Real number (ℝ)

HIGH CORRELATION, ZEROS

Distinct: 166
Distinct (%): 70.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 66679.829
Minimum: 0
Maximum: 1114310
Zeros: 68
Zeros (%): 29.1%
Negative: 0
Negative (%): 0.0%
Memory size: 2.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 3435
Q3: 40366.25
95-th percentile: 327315.85
Maximum: 1114310
Range: 1114310
Interquartile range (IQR): 40366.25

Descriptive statistics

Standard deviation: 171055.23
Coefficient of variation (CV): 2.5653219
Kurtosis: 21.799194
Mean: 66679.829
Median Absolute Deviation (MAD): 3435
Skewness: 4.3699447
Sum: 15603080
Variance: 2.9259891 × 10^10
Monotonicity: Not monotonic
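
Several of these statistics are tied together by simple identities: the mean is the sum divided by the number of observations, the CV is the standard deviation divided by the mean, the variance is the squared standard deviation, and the IQR is Q3 minus Q1. The following sketch recomputes them from the figures printed above. The variable names are illustrative, and small differences are expected because the report rounds its inputs.

```python
# Figures copied from the 부과건수 statistics above.
n = 234                      # number of observations
total = 15603080             # Sum
mean = 66679.829             # Mean
std = 171055.23              # Standard deviation
q1, q3 = 0.0, 40366.25       # Q1 and Q3
minimum, maximum = 0, 1114310

derived = {
    "mean": total / n,            # reported as 66679.829
    "cv": std / mean,             # reported as 2.5653219
    "variance": std ** 2,         # reported as 2.9259891 × 10^10
    "iqr": q3 - q1,               # reported as 40366.25
    "range": maximum - minimum,   # reported as 1114310
}

for name, value in derived.items():
    print(f"{name}: {value:.7g}")
```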
[Figure: Histogram with fixed size bins (bins=50)]

Common values

Value | Count | Frequency (%)
0 | 68 | 29.1%
44 | 2 | 0.9%
89196 | 1 | 0.4%
4496 | 1 | 0.4%
129 | 1 | 0.4%
39707 | 1 | 0.4%
2081 | 1 | 0.4%
1657 | 1 | 0.4%
38141 | 1 | 0.4%
8462 | 1 | 0.4%
Other values (156) | 156 | 66.7%

Minimum 10 values

Value | Count | Frequency (%)
0 | 68 | 29.1%
6 | 1 | 0.4%
7 | 1 | 0.4%
25 | 1 | 0.4%
36 | 1 | 0.4%
40 | 1 | 0.4%
44 | 2 | 0.9%
56 | 1 | 0.4%
82 | 1 | 0.4%
83 | 1 | 0.4%

Maximum 10 values

Value | Count | Frequency (%)
1114310 | 1 | 0.4%
1092552 | 1 | 0.4%
1067201 | 1 | 0.4%
1019677 | 1 | 0.4%
949133 | 1 | 0.4%
377161 | 1 | 0.4%
363834 | 1 | 0.4%
362696 | 1 | 0.4%
357542 | 1 | 0.4%
352069 | 1 | 0.4%

부과금액
Real number (ℝ)

HIGH CORRELATION, ZEROS

Distinct: 166
Distinct (%): 70.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.9122384 × 10^10
Minimum: 0
Maximum: 2.2366095 × 10^11
Zeros: 69
Zeros (%): 29.5%
Negative: 0
Negative (%): 0.0%
Memory size: 2.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 1.571911 × 10^9
Q3: 2.4175324 × 10^10
95-th percentile: 9.1102593 × 10^10
Maximum: 2.2366095 × 10^11
Range: 2.2366095 × 10^11
Interquartile range (IQR): 2.4175324 × 10^10

Descriptive statistics

Standard deviation: 3.4420987 × 10^10
Coefficient of variation (CV): 1.8000364
Kurtosis: 9.5386994
Mean: 1.9122384 × 10^10
Median Absolute Deviation (MAD): 1.571911 × 10^9
Skewness: 2.7894537
Sum: 4.4746378 × 10^12
Variance: 1.1848044 × 10^21
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=50)]

Common values

Value | Count | Frequency (%)
0 | 69 | 29.5%
29921709000 | 1 | 0.4%
2481419000 | 1 | 0.4%
1747000 | 1 | 0.4%
23439651000 | 1 | 0.4%
23460000 | 1 | 0.4%
50668000 | 1 | 0.4%
1076148000 | 1 | 0.4%
425254000 | 1 | 0.4%
58673000 | 1 | 0.4%
Other values (156) | 156 | 66.7%

Minimum 10 values

Value | Count | Frequency (%)
0 | 69 | 29.5%
904000 | 1 | 0.4%
1003000 | 1 | 0.4%
1235000 | 1 | 0.4%
1483000 | 1 | 0.4%
1747000 | 1 | 0.4%
3483000 | 1 | 0.4%
4309000 | 1 | 0.4%
7284000 | 1 | 0.4%
8945000 | 1 | 0.4%

Maximum 10 values

Value | Count | Frequency (%)
223660949000 | 1 | 0.4%
179426000000 | 1 | 0.4%
171336000000 | 1 | 0.4%
147236000000 | 1 | 0.4%
141373000000 | 1 | 0.4%
116570823000 | 1 | 0.4%
115782234000 | 1 | 0.4%
111411887000 | 1 | 0.4%
107358000000 | 1 | 0.4%
105884000000 | 1 | 0.4%

Interactions

[Figures: four interaction plots]

Correlations

Variable | 과세년도 | 세목명 | 세원 유형명 | 부과건수 | 부과금액
과세년도 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000
세목명 | 0.000 | 1.000 | 1.000 | 0.802 | 0.518
세원 유형명 | 0.000 | 1.000 | 1.000 | 0.935 | 0.850
부과건수 | 0.000 | 0.802 | 0.935 | 1.000 | 0.565
부과금액 | 0.000 | 0.518 | 0.850 | 0.565 | 1.000

Variable | 세원 유형명 | 세목명 | 과세년도
세원 유형명 | 1.000 | 0.912 | 0.000
세목명 | 0.912 | 1.000 | 0.000
과세년도 | 0.000 | 0.000 | 1.000

Variable | 부과건수 | 부과금액 | 과세년도 | 세목명 | 세원 유형명
부과건수 | 1.000 | 0.843 | 0.000 | 0.554 | 0.657
부과금액 | 0.843 | 1.000 | 0.000 | 0.242 | 0.415
과세년도 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000
세목명 | 0.554 | 0.242 | 0.000 | 1.000 | 0.912
세원 유형명 | 0.657 | 0.415 | 0.000 | 0.912 | 1.000
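
The report text does not name the method behind each matrix. The second matrix covers only the three categorical columns, which would be consistent with a categorical association measure such as Cramér's V, but that is an assumption. As an illustration of how such a measure behaves, here is a minimal sketch of the plain (uncorrected) Cramér's V in standard-library Python. The function name `cramers_v` is illustrative, and a profiling tool may apply a bias correction, so this sketch is not expected to reproduce the values above.

```python
import math
from collections import Counter


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long sequences of labels."""
    n = len(xs)
    joint = Counter(zip(xs, ys))   # observed cell counts
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    k = min(len(row_totals), len(col_totals))
    if n == 0 or k < 2:
        return 0.0  # undefined for a constant column; this sketch reports 0

    # Pearson chi-square statistic over the full contingency table.
    chi2 = 0.0
    for x, row_total in row_totals.items():
        for y, col_total in col_totals.items():
            expected = row_total * col_total / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected

    return math.sqrt(chi2 / (n * (k - 1)))


# One label fully determines the other: association of 1.
perfect = cramers_v(["a", "a", "b", "b"], ["x", "x", "y", "y"])
# Labels are independent in this toy table: association of 0.
independent = cramers_v(["a", "a", "b", "b"], ["x", "y", "x", "y"])
print(perfect, independent)
```

A value near 1 between 세목명 and 세원 유형명 is what one would expect if each tax source type belongs to a single tax item, which is how the sample rows appear to be organized.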

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

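First rows

Index | 시도명 | 시군구명 | 자치단체코드 | 과세년도 | 세목명 | 세원 유형명 | 부과건수 and 부과금액 (digits run together)
0 | 인천광역시 | 서구 | 28260 | 2017 | 지방소득세 | 지방소득세(특별징수) | 8919629921709000
1 | 인천광역시 | 서구 | 28260 | 2017 | 지방소득세 | 지방소득세(법인소득) | 587742033424000
2 | 인천광역시 | 서구 | 28260 | 2017 | 지방소득세 | 지방소득세(양도소득) | 644811639505000
3 | 인천광역시 | 서구 | 28260 | 2017 | 지방소득세 | 지방소득세(종합소득) | 5461314890045000
4 | 인천광역시 | 서구 | 28260 | 2017 | 담배소비세 | 담배소비세 | 00
5 | 인천광역시 | 서구 | 28260 | 2017 | 교육세 | 교육세 | 94913358592795000
6 | 인천광역시 | 서구 | 28260 | 2017 | 도시계획세 | 도시계획세 | 00
7 | 인천광역시 | 서구 | 28260 | 2017 | 레저세 | 소싸움 | 00
8 | 인천광역시 | 서구 | 28260 | 2017 | 레저세 | 경정 | 00
9 | 인천광역시 | 서구 | 28260 | 2017 | 레저세 | 경륜 | 00

Last rows

Index | 시도명 | 시군구명 | 자치단체코드 | 과세년도 | 세목명 | 세원 유형명 | 부과건수 and 부과금액 (digits run together)
224 | 인천광역시 | 서구 | 28260 | 2021 | 취득세 | 기계장비 | 13021657877000
225 | 인천광역시 | 서구 | 28260 | 2021 | 취득세 | 차량 | 4667859379248000
226 | 인천광역시 | 서구 | 28260 | 2021 | 취득세 | 선박 | 5624041000
227 | 인천광역시 | 서구 | 28260 | 2021 | 취득세 | 토지 | 9306223660949000
228 | 인천광역시 | 서구 | 28260 | 2021 | 등록면허세 | 등록면허세(면허) | 685152738635000
229 | 인천광역시 | 서구 | 28260 | 2021 | 등록면허세 | 등록면허세(등록) | 18238726005395000
230 | 인천광역시 | 서구 | 28260 | 2021 | 지역자원시설세 | 지역자원시설세(소방) | 36383419842193000
231 | 인천광역시 | 서구 | 28260 | 2021 | 지역자원시설세 | 지역자원시설세(시설) | 447894615000
232 | 인천광역시 | 서구 | 28260 | 2021 | 지역자원시설세 | 지역자원시설세(특자) | 303734162000
233 | 인천광역시 | 서구 | 28260 | 2021 | 체납 | 체납 | 31937336057994000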