Overview

Dataset statistics

Number of variables: 10
Number of observations: 37
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 3.2 KiB
Average record size in memory: 89.6 B

Variable types

Categorical: 6
Numeric: 4

Dataset

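Description: Local tax levies and non-taxable items by tax category for Cheoin-gu, Giheung-gu, and Suji-gu in Yongin-si, Gyeonggi-do. The data include the number of taxed cases (과세건수), the taxed amount (과세금액), the number of non-taxed cases (비과세건수), and the non-taxed amount (비과세금액). Data reference date: 2021-12-31.
URL: https://www.data.go.kr/data/15078564/fileData.do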

Alerts

시도명 (province name) has constant value "경기도" [Constant]
과세년도 (tax year) has constant value "2021" [Constant]
데이터기준일자 (data reference date) has constant value "2021-12-31" [Constant]
자치단체코드 (local government code) is highly overall correlated with 시군구명 (district name) [High correlation]
시군구명 is highly overall correlated with 자치단체코드 [High correlation]
과세건수 (number of taxed cases) is highly overall correlated with 과세금액 (taxed amount) and 2 other fields [High correlation]
과세금액 is highly overall correlated with 과세건수 and 3 other fields [High correlation]
비과세건수 (number of non-taxed cases) is highly overall correlated with 과세건수 and 2 other fields [High correlation]
비과세금액 (non-taxed amount) is highly overall correlated with 과세건수 and 2 other fields [High correlation]
세목명 (tax item name) is highly overall correlated with 과세금액 [High correlation]
과세건수 has 11 (29.7%) zeros [Zeros]
과세금액 has 11 (29.7%) zeros [Zeros]
비과세건수 has 15 (40.5%) zeros [Zeros]
비과세금액 has 15 (40.5%) zeros [Zeros]
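
The percentages in the zero alerts above follow directly from the 37 observations. The sketch below reproduces them; the helper name `zero_share` is hypothetical and is not part of ydata-profiling.

```python
def zero_share(zeros: int, total: int) -> str:
    """Format a zero count as 'n (p%)', matching the wording of the alerts."""
    return f"{zeros} ({zeros / total:.1%})"


N_OBSERVATIONS = 37  # "Number of observations" from the overview

# Zero counts as reported in the alerts.
zero_counts = {"과세건수": 11, "과세금액": 11, "비과세건수": 15, "비과세금액": 15}

for column, zeros in zero_counts.items():
    print(f"{column} has {zero_share(zeros, N_OBSERVATIONS)} zeros")
```

The two amount columns share their zero counts with the matching case-count columns, which is consistent with a row having no amount when it has no cases.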

Reproduction

Analysis started: 2023-12-12 01:45:05.725141
Analysis finished: 2023-12-12 01:45:08.155229
Duration: 2.43 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

시도명 (province name)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 2.7%
Missing: 0
Missing (%): 0.0%
Memory size: 428.0 B

Value | Count
경기도 | 37

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 경기도
2nd row: 경기도
3rd row: 경기도
4th row: 경기도
5th row: 경기도

Common Values

Value | Count | Frequency (%)
경기도 | 37 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values; 경기도: 37 (100.0%)]

시군구명 (district name)
Categorical

HIGH CORRELATION

Distinct: 3
Distinct (%): 8.1%
Missing: 0
Missing (%): 0.0%
Memory size: 428.0 B

Value | Count
용인시 처인구 | 13
용인시 기흥구 | 12
용인시 수지구 | 12

Length

Max length: 7
Median length: 7
Mean length: 7
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 용인시 처인구
2nd row: 용인시 처인구
3rd row: 용인시 처인구
4th row: 용인시 처인구
5th row: 용인시 처인구

Common Values

Value | Count | Frequency (%)
용인시 처인구 | 13 | 35.1%
용인시 기흥구 | 12 | 32.4%
용인시 수지구 | 12 | 32.4%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Value | Count | Frequency (%)
용인시 | 37 | 50.0%
처인구 | 13 | 17.6%
기흥구 | 12 | 16.2%
수지구 | 12 | 16.2%

자치단체코드 (local government code)
Categorical

HIGH CORRELATION

Distinct: 3
Distinct (%): 8.1%
Missing: 0
Missing (%): 0.0%
Memory size: 428.0 B

Value | Count
41461 | 13
41463 | 12
41465 | 12

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 41461
2nd row: 41461
3rd row: 41461
4th row: 41461
5th row: 41461

Common Values

Value | Count | Frequency (%)
41461 | 13 | 35.1%
41463 | 12 | 32.4%
41465 | 12 | 32.4%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values; same counts as the Common Values table]

과세년도 (tax year)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 2.7%
Missing: 0
Missing (%): 0.0%
Memory size: 428.0 B

Value | Count
2021 | 37

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2021
2nd row: 2021
3rd row: 2021
4th row: 2021
5th row: 2021

Common Values

Value | Count | Frequency (%)
2021 | 37 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values; 2021: 37 (100.0%)]

세목명 (tax item name)
Categorical

HIGH CORRELATION

Distinct: 13
Distinct (%): 35.1%
Missing: 0
Missing (%): 0.0%
Memory size: 428.0 B

Value | Count
자동차세 | 3
재산세 | 3
주민세 | 3
취득세 | 3
레저세 | 3
Other values (8) | 22

Length

Max length: 7
Median length: 5
Mean length: 4.2162162
Min length: 3

Unique

Unique: 1
Unique (%): 2.7%

Sample

1st row: 자동차세
2nd row: 재산세
3rd row: 주민세
4th row: 등록세
5th row: 취득세

Common Values

Value | Count | Frequency (%)
자동차세 (automobile tax) | 3 | 8.1%
재산세 (property tax) | 3 | 8.1%
주민세 (resident tax) | 3 | 8.1%
취득세 (acquisition tax) | 3 | 8.1%
레저세 (leisure tax) | 3 | 8.1%
교육세 (education tax) | 3 | 8.1%
지방소비세 (local consumption tax) | 3 | 8.1%
등록면허세 (registration and license tax) | 3 | 8.1%
도시계획세 (urban planning tax) | 3 | 8.1%
지역자원시설세 (regional resource and facility tax) | 3 | 8.1%
Other values (3) | 7 | 18.9%

Length

[Figure: Histogram of lengths of the category]

과세건수 (number of taxed cases)
Real number (ℝ)

HIGH CORRELATION, ZEROS

Distinct: 27
Distinct (%): 73.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 162307.84
Minimum: 0
Maximum: 888871
Zeros: 11
Zeros (%): 29.7%
Negative: 0
Negative (%): 0.0%
Memory size: 465.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 139465
Q3: 218175
95-th percentile: 683866
Maximum: 888871
Range: 888871
Interquartile range (IQR): 218175

Descriptive statistics

Standard deviation: 211002.2
Coefficient of variation (CV): 1.3000124
Kurtosis: 4.6034575
Mean: 162307.84
Median Absolute Deviation (MAD): 138990
Skewness: 2.0865745
Sum: 6005390
Variance: 4.4521927 × 10^10
Monotonicity: Not monotonic
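
Several of the 과세건수 figures above are tied together by simple identities: mean = sum / n, CV = standard deviation / mean, variance = standard deviation squared, IQR = Q3 - Q1, and range = maximum - minimum. The sketch below recomputes them from the reported values as a consistency check. The function name `derived_stats` is hypothetical, and because the report prints rounded inputs, the results should agree only to within rounding.

```python
def derived_stats(n, total, std, q1, q3, minimum, maximum):
    """Recompute summary figures that follow from other reported values."""
    mean = total / n
    return {
        "mean": mean,              # sum divided by number of observations
        "cv": std / mean,          # coefficient of variation
        "variance": std ** 2,      # square of the (rounded) standard deviation
        "iqr": q3 - q1,            # interquartile range
        "range": maximum - minimum,
    }


# Inputs copied from the 과세건수 statistics in this report.
stats = derived_stats(
    n=37, total=6005390, std=211002.2,
    q1=0, q3=218175, minimum=0, maximum=888871,
)

for name, value in stats.items():
    print(f"{name}: {value:.8g}")
```

The same check applies to the other three numeric variables by substituting their reported sum, standard deviation, and quantiles.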
[Figure: Histogram with fixed size bins (bins=27)]

Most frequent values

Value | Count | Frequency (%)
0 | 11 | 29.7%
220429 | 1 | 2.7%
888871 | 1 | 2.7%
151535 | 1 | 2.7%
95066 | 1 | 2.7%
305737 | 1 | 2.7%
751950 | 1 | 2.7%
228970 | 1 | 2.7%
191115 | 1 | 2.7%
146701 | 1 | 2.7%
Other values (17) | 17 | 45.9%

Smallest 10 values

Value | Count | Frequency (%)
0 | 11 | 29.7%
7 | 1 | 2.7%
475 | 1 | 2.7%
35550 | 1 | 2.7%
55822 | 1 | 2.7%
59678 | 1 | 2.7%
95066 | 1 | 2.7%
128456 | 1 | 2.7%
139465 | 1 | 2.7%
139536 | 1 | 2.7%

Largest 10 values

Value | Count | Frequency (%)
888871 | 1 | 2.7%
751950 | 1 | 2.7%
666845 | 1 | 2.7%
319622 | 1 | 2.7%
305737 | 1 | 2.7%
297890 | 1 | 2.7%
249737 | 1 | 2.7%
228970 | 1 | 2.7%
220429 | 1 | 2.7%
218175 | 1 | 2.7%

과세금액 (taxed amount)
Real number (ℝ)

HIGH CORRELATION, ZEROS

Distinct: 27
Distinct (%): 73.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6.3045931 × 10^10
Minimum: 0
Maximum: 3.05207 × 10^11
Zeros: 11
Zeros (%): 29.7%
Negative: 0
Negative (%): 0.0%
Memory size: 465.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 1.7414106 × 10^10
Q3: 9.8656789 × 10^10
95-th percentile: 2.921906 × 10^11
Maximum: 3.05207 × 10^11
Range: 3.05207 × 10^11
Interquartile range (IQR): 9.8656789 × 10^10

Descriptive statistics

Standard deviation: 9.1078186 × 10^10
Coefficient of variation (CV): 1.4446323
Kurtosis: 2.2187692
Mean: 6.3045931 × 10^10
Median Absolute Deviation (MAD): 1.7414106 × 10^10
Skewness: 1.7756722
Sum: 2.3326995 × 10^12
Variance: 8.2952359 × 10^21
Monotonicity: Not monotonic
[Figure: Histogram with fixed size bins (bins=27)]

Most frequent values

Value | Count | Frequency (%)
0 | 11 | 29.7%
98656789000 | 1 | 2.7%
55239420000 | 1 | 2.7%
118960000000 | 1 | 2.7%
14114948000 | 1 | 2.7%
9393258000 | 1 | 2.7%
47675008000 | 1 | 2.7%
37454381000 | 1 | 2.7%
110665000000 | 1 | 2.7%
5932978000 | 1 | 2.7%
Other values (17) | 17 | 45.9%

Smallest 10 values

Value | Count | Frequency (%)
0 | 11 | 29.7%
5932978000 | 1 | 2.7%
9393258000 | 1 | 2.7%
10625364000 | 1 | 2.7%
10779708000 | 1 | 2.7%
13397556000 | 1 | 2.7%
14114948000 | 1 | 2.7%
15770669000 | 1 | 2.7%
17414106000 | 1 | 2.7%
19357727000 | 1 | 2.7%

Largest 10 values

Value | Count | Frequency (%)
305207000000 | 1 | 2.7%
297109000000 | 1 | 2.7%
290961000000 | 1 | 2.7%
276000000000 | 1 | 2.7%
140979000000 | 1 | 2.7%
123840000000 | 1 | 2.7%
118960000000 | 1 | 2.7%
111158000000 | 1 | 2.7%
110665000000 | 1 | 2.7%
98656789000 | 1 | 2.7%

비과세건수 (number of non-taxed cases)
Real number (ℝ)

HIGH CORRELATION, ZEROS

Distinct: 23
Distinct (%): 62.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 13062.541
Minimum: 0
Maximum: 76753
Zeros: 15
Zeros (%): 40.5%
Negative: 0
Negative (%): 0.0%
Memory size: 465.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 248
Q3: 12580
95-th percentile: 73971.4
Maximum: 76753
Range: 76753
Interquartile range (IQR): 12580

Descriptive statistics

Standard deviation: 23947.245
Coefficient of variation (CV): 1.8332762
Kurtosis: 2.7194899
Mean: 13062.541
Median Absolute Deviation (MAD): 248
Skewness: 1.9896983
Sum: 483314
Variance: 5.7347054 × 10^8
Monotonicity: Not monotonic
[Figure: Histogram with fixed size bins (bins=23)]

Most frequent values

Value | Count | Frequency (%)
0 | 15 | 40.5%
30929 | 1 | 2.7%
39 | 1 | 2.7%
3422 | 1 | 2.7%
908 | 1 | 2.7%
62 | 1 | 2.7%
23152 | 1 | 2.7%
73333 | 1 | 2.7%
19196 | 1 | 2.7%
4671 | 1 | 2.7%
Other values (13) | 13 | 35.1%

Smallest 10 values

Value | Count | Frequency (%)
0 | 15 | 40.5%
4 | 1 | 2.7%
39 | 1 | 2.7%
62 | 1 | 2.7%
248 | 1 | 2.7%
908 | 1 | 2.7%
1252 | 1 | 2.7%
3422 | 1 | 2.7%
3575 | 1 | 2.7%
4342 | 1 | 2.7%

Largest 10 values

Value | Count | Frequency (%)
76753 | 1 | 2.7%
76525 | 1 | 2.7%
73333 | 1 | 2.7%
72456 | 1 | 2.7%
44154 | 1 | 2.7%
30929 | 1 | 2.7%
23152 | 1 | 2.7%
20069 | 1 | 2.7%
19196 | 1 | 2.7%
12580 | 1 | 2.7%

비과세금액 (non-taxed amount)
Real number (ℝ)

HIGH CORRELATION, ZEROS

Distinct: 23
Distinct (%): 62.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6.5201407 × 10^9
Minimum: 0
Maximum: 6.600461 × 10^10
Zeros: 15
Zeros (%): 40.5%
Negative: 0
Negative (%): 0.0%
Memory size: 465.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 538000
Q3: 8.16867 × 10^8
95-th percentile: 3.9351853 × 10^10
Maximum: 6.600461 × 10^10
Range: 6.600461 × 10^10
Interquartile range (IQR): 8.16867 × 10^8

Descriptive statistics

Standard deviation: 1.6036931 × 10^10
Coefficient of variation (CV): 2.4595989
Kurtosis: 6.0284006
Mean: 6.5201407 × 10^9
Median Absolute Deviation (MAD): 538000
Skewness: 2.6001711
Sum: 2.412452 × 10^11
Variance: 2.5718314 × 10^20
Monotonicity: Not monotonic
[Figure: Histogram with fixed size bins (bins=23)]

Most frequent values

Value | Count | Frequency (%)
0 | 15 | 40.5%
1225022000 | 1 | 2.7%
10000 | 1 | 2.7%
472190000 | 1 | 2.7%
633845000 | 1 | 2.7%
11000 | 1 | 2.7%
761707000 | 1 | 2.7%
33172868000 | 1 | 2.7%
238270000 | 1 | 2.7%
9470075000 | 1 | 2.7%
Other values (13) | 13 | 35.1%

Smallest 10 values

Value | Count | Frequency (%)
0 | 15 | 40.5%
10000 | 1 | 2.7%
11000 | 1 | 2.7%
65000 | 1 | 2.7%
538000 | 1 | 2.7%
45710000 | 1 | 2.7%
79406000 | 1 | 2.7%
238270000 | 1 | 2.7%
390917000 | 1 | 2.7%
472190000 | 1 | 2.7%

Largest 10 values

Value | Count | Frequency (%)
66004610000 | 1 | 2.7%
51568433000 | 1 | 2.7%
36297708000 | 1 | 2.7%
36003208000 | 1 | 2.7%
33172868000 | 1 | 2.7%
9470075000 | 1 | 2.7%
2666224000 | 1 | 2.7%
1225022000 | 1 | 2.7%
900966000 | 1 | 2.7%
816867000 | 1 | 2.7%

데이터기준일자 (data reference date)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 2.7%
Missing: 0
Missing (%): 0.0%
Memory size: 428.0 B

Value | Count
2021-12-31 | 37

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2021-12-31
2nd row: 2021-12-31
3rd row: 2021-12-31
4th row: 2021-12-31
5th row: 2021-12-31

Common Values

Value | Count | Frequency (%)
2021-12-31 | 37 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values; 2021-12-31: 37 (100.0%)]

Interactions

[Figures: 16 pairwise interaction plots]

Correlations

[Figure: correlation heatmap]

Variable | 시군구명 | 자치단체코드 | 세목명 | 과세건수 | 과세금액 | 비과세건수 | 비과세금액
시군구명 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.045
자치단체코드 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.045
세목명 | 0.000 | 0.000 | 1.000 | 0.784 | 0.837 | 0.739 | 0.513
과세건수 | 0.000 | 0.000 | 0.784 | 1.000 | 0.709 | 0.000 | 0.000
과세금액 | 0.000 | 0.000 | 0.837 | 0.709 | 1.000 | 0.533 | 0.604
비과세건수 | 0.000 | 0.000 | 0.739 | 0.000 | 0.533 | 1.000 | 0.713
비과세금액 | 0.045 | 0.045 | 0.513 | 0.000 | 0.604 | 0.713 | 1.000

[Figure: correlation heatmap]

Variable | 자치단체코드 | 시군구명 | 세목명
자치단체코드 | 1.000 | 1.000 | 0.000
시군구명 | 1.000 | 1.000 | 0.000
세목명 | 0.000 | 0.000 | 1.000

[Figure: correlation heatmap]

Variable | 과세건수 | 과세금액 | 비과세건수 | 비과세금액 | 시군구명 | 자치단체코드 | 세목명
과세건수 | 1.000 | 0.629 | 0.601 | 0.559 | 0.000 | 0.000 | 0.452
과세금액 | 0.629 | 1.000 | 0.552 | 0.581 | 0.000 | 0.000 | 0.536
비과세건수 | 0.601 | 0.552 | 1.000 | 0.940 | 0.000 | 0.000 | 0.402
비과세금액 | 0.559 | 0.581 | 0.940 | 1.000 | 0.000 | 0.000 | 0.250
시군구명 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 | 0.000
자치단체코드 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 | 0.000
세목명 | 0.452 | 0.536 | 0.402 | 0.250 | 0.000 | 0.000 | 1.000
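
The 1.000 entries between 시군구명 and 자치단체코드 reflect a one-to-one mapping: each district name corresponds to exactly one code. The sketch below illustrates this with an uncorrected Cramér's V computed from the category counts in this report. The extracted text does not say which association measure each matrix uses, so treating Cramér's V as representative is an assumption, and `cramers_v` is a hypothetical helper rather than a ydata-profiling function. The pairing of 용인시 기흥구 with 41463 is inferred from the counts and the sample rows.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys):
    """Uncorrected Cramér's V for two equal-length sequences of labels."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)

    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for x, row_total in row_totals.items():
        for y, col_total in col_totals.items():
            expected = row_total * col_total / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(row_totals), len(col_totals))
    if k < 2:
        return float("nan")  # undefined when either variable is constant
    return sqrt(chi2 / (n * (k - 1)))


# Category counts from the report: 13, 12, and 12 rows per district.
districts = ["용인시 처인구"] * 13 + ["용인시 기흥구"] * 12 + ["용인시 수지구"] * 12
codes = ["41461"] * 13 + ["41463"] * 12 + ["41465"] * 12

print(round(cramers_v(districts, codes), 3))  # prints 1.0
```

Because the two columns carry the same information, one of them could be dropped before modelling without losing anything.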

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

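First rows

Index | 시도명 | 시군구명 | 자치단체코드 | 과세년도 | 세목명 | 과세건수 | 과세금액 | 비과세건수 | 비과세금액 | 데이터기준일자
0 | 경기도 | 용인시 처인구 | 41461 | 2021 | 자동차세 | 220429 | 98656789000 | 30929 | 1225022000 | 2021-12-31
1 | 경기도 | 용인시 처인구 | 41461 | 2021 | 재산세 | 249737 | 123840000000 | 76525 | 36297708000 | 2021-12-31
2 | 경기도 | 용인시 처인구 | 41461 | 2021 | 주민세 | 128456 | 13397556000 | 11128 | 79406000 | 2021-12-31
3 | 경기도 | 용인시 처인구 | 41461 | 2021 | 등록세 | 0 | 0 | 4 | 538000 | 2021-12-31
4 | 경기도 | 용인시 처인구 | 41461 | 2021 | 취득세 | 55822 | 305207000000 | 12580 | 36003208000 | 2021-12-31
5 | 경기도 | 용인시 처인구 | 41461 | 2021 | 레저세 | 0 | 0 | 0 | 0 | 2021-12-31
6 | 경기도 | 용인시 처인구 | 41461 | 2021 | 교육세 | 666845 | 75993189000 | 248 | 65000 | 2021-12-31
7 | 경기도 | 용인시 처인구 | 41461 | 2021 | 지방소비세 | 7 | 10779708000 | 0 | 0 | 2021-12-31
8 | 경기도 | 용인시 처인구 | 41461 | 2021 | 등록면허세 | 139465 | 17414106000 | 4516 | 496555000 | 2021-12-31
9 | 경기도 | 용인시 처인구 | 41461 | 2021 | 도시계획세 | 0 | 0 | 0 | 0 | 2021-12-31

Last rows

Index | 시도명 | 시군구명 | 자치단체코드 | 과세년도 | 세목명 | 과세건수 | 과세금액 | 비과세건수 | 비과세금액 | 데이터기준일자
27 | 경기도 | 용인시 수지구 | 41465 | 2021 | 재산세 | 191115 | 110665000000 | 73333 | 33172868000 | 2021-12-31
28 | 경기도 | 용인시 수지구 | 41465 | 2021 | 자동차세 | 228970 | 37454381000 | 23152 | 761707000 | 2021-12-31
29 | 경기도 | 용인시 수지구 | 41465 | 2021 | 레저세 | 0 | 0 | 0 | 0 | 2021-12-31
30 | 경기도 | 용인시 수지구 | 41465 | 2021 | 교육세 | 751950 | 47675008000 | 62 | 11000 | 2021-12-31
31 | 경기도 | 용인시 수지구 | 41465 | 2021 | 지역자원시설세 | 305737 | 9393258000 | 908 | 633845000 | 2021-12-31
32 | 경기도 | 용인시 수지구 | 41465 | 2021 | 도시계획세 | 0 | 0 | 0 | 0 | 2021-12-31
33 | 경기도 | 용인시 수지구 | 41465 | 2021 | 등록면허세 | 95066 | 14114948000 | 3422 | 472190000 | 2021-12-31
34 | 경기도 | 용인시 수지구 | 41465 | 2021 | 지방소비세 | 0 | 0 | 0 | 0 | 2021-12-31
35 | 경기도 | 용인시 수지구 | 41465 | 2021 | 담배소비세 | 0 | 0 | 0 | 0 | 2021-12-31
36 | 경기도 | 용인시 수지구 | 41465 | 2021 | 지방소득세 | 151535 | 118960000000 | 0 | 0 | 2021-12-31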