Overview

Dataset statistics

Number of variables9
Number of observations193
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory14.8 KiB
Average record size in memory78.7 B

Variable types

Categorical5
Numeric4

Dataset

Description성남시 연도별 지방세 과세 및 비과세 현황을 세목별로 제공하고 있으며, 세목명, 과세 건수, 과세 금액, 비과세 건수, 비과세 금액 항목으로 구성되어 있습니다
URLhttps://www.data.go.kr/data/15080594/fileData.do

Alerts

시도명 has constant value ""Constant
자치단체코드 is highly overall correlated with 시군구명High correlation
시군구명 is highly overall correlated with 자치단체코드High correlation
과세건수 is highly overall correlated with 과세금액 and 2 other fieldsHigh correlation
과세금액 is highly overall correlated with 과세건수High correlation
비과세건수 is highly overall correlated with 과세건수 and 1 other fieldsHigh correlation
비과세금액 is highly overall correlated with 비과세건수High correlation
세목명 is highly overall correlated with 과세건수High correlation
과세건수 has 52 (26.9%) zerosZeros
과세금액 has 52 (26.9%) zerosZeros
비과세건수 has 78 (40.4%) zerosZeros
비과세금액 has 80 (41.5%) zerosZeros

Reproduction

Analysis started2023-12-12 18:52:42.296739
Analysis finished2023-12-12 18:52:45.556640
Duration3.26 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
경기도
193 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경기도
2nd row경기도
3rd row경기도
4th row경기도
5th row경기도

Common Values

ValueCountFrequency (%)
경기도 193
100.0%

Length

2023-12-13T03:52:45.669724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:52:45.842680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경기도 193
100.0%

시군구명
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
성남시수정구
64 
성남시중원구
63 
성남시분당구
63 
성남시
 
3

Length

Max length6
Median length6
Mean length5.9533679
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row성남시수정구
2nd row성남시수정구
3rd row성남시수정구
4th row성남시수정구
5th row성남시수정구

Common Values

ValueCountFrequency (%)
성남시수정구 64
33.2%
성남시중원구 63
32.6%
성남시분당구 63
32.6%
성남시 3
 
1.6%

Length

2023-12-13T03:52:46.121562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:52:46.319759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
성남시수정구 64
33.2%
성남시중원구 63
32.6%
성남시분당구 63
32.6%
성남시 3
 
1.6%

자치단체코드
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
41131
64 
41133
63 
41135
63 
41130
 
3

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row41131
2nd row41131
3rd row41131
4th row41131
5th row41131

Common Values

ValueCountFrequency (%)
41131 64
33.2%
41133 63
32.6%
41135 63
32.6%
41130 3
 
1.6%

Length

2023-12-13T03:52:46.513338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:52:46.713370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
41131 64
33.2%
41133 63
32.6%
41135 63
32.6%
41130 3
 
1.6%

과세년도
Categorical

Distinct5
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2022
40 
2018
39 
2017
38 
2019
38 
2021
38 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2017
2nd row2017
3rd row2017
4th row2017
5th row2017

Common Values

ValueCountFrequency (%)
2022 40
20.7%
2018 39
20.2%
2017 38
19.7%
2019 38
19.7%
2021 38
19.7%

Length

2023-12-13T03:52:46.922525image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:52:47.122222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022 40
20.7%
2018 39
20.2%
2017 38
19.7%
2019 38
19.7%
2021 38
19.7%

세목명
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
교육세
16 
지방소비세
16 
담배소비세
16 
자동차세
15 
재산세
15 
Other values (8)
115 

Length

Max length7
Median length5
Mean length4.1761658
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row자동차세
2nd row재산세
3rd row주민세
4th row등록세
5th row취득세

Common Values

ValueCountFrequency (%)
교육세 16
 
8.3%
지방소비세 16
 
8.3%
담배소비세 16
 
8.3%
자동차세 15
 
7.8%
재산세 15
 
7.8%
주민세 15
 
7.8%
취득세 15
 
7.8%
레저세 15
 
7.8%
등록면허세 15
 
7.8%
지역자원시설세 15
 
7.8%
Other values (3) 40
20.7%

Length

2023-12-13T03:52:47.381143image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
교육세 16
 
8.3%
지방소비세 16
 
8.3%
담배소비세 16
 
8.3%
자동차세 15
 
7.8%
재산세 15
 
7.8%
주민세 15
 
7.8%
취득세 15
 
7.8%
레저세 15
 
7.8%
등록면허세 15
 
7.8%
지역자원시설세 15
 
7.8%
Other values (3) 40
20.7%

과세건수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct141
Distinct (%)73.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean122937.19
Minimum0
Maximum1036481
Zeros52
Zeros (%)26.9%
Negative0
Negative (%)0.0%
Memory size1.8 KiB
2023-12-13T03:52:47.639862image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median66035
Q3125820
95-th percentile411668.8
Maximum1036481
Range1036481
Interquartile range (IQR)125820

Descriptive statistics

Standard deviation188986.64
Coefficient of variation (CV)1.5372618
Kurtosis10.537346
Mean122937.19
Median Absolute Deviation (MAD)66035
Skewness2.9227145
Sum23726877
Variance3.5715951 × 1010
MonotonicityNot monotonic
2023-12-13T03:52:47.911177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 52
 
26.9%
88 2
 
1.0%
133253 1
 
0.5%
109710 1
 
0.5%
117207 1
 
0.5%
111289 1
 
0.5%
106982 1
 
0.5%
24897 1
 
0.5%
411484 1
 
0.5%
104882 1
 
0.5%
Other values (131) 131
67.9%
ValueCountFrequency (%)
0 52
26.9%
1 1
 
0.5%
7 1
 
0.5%
8 1
 
0.5%
9 1
 
0.5%
27 1
 
0.5%
48 1
 
0.5%
53 1
 
0.5%
55 1
 
0.5%
82 1
 
0.5%
ValueCountFrequency (%)
1036481 1
0.5%
1024812 1
0.5%
1001675 1
0.5%
1001117 1
0.5%
984083 1
0.5%
445956 1
0.5%
429691 1
0.5%
415367 1
0.5%
414726 1
0.5%
411946 1
0.5%

과세금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct142
Distinct (%)73.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.845128 × 1010
Minimum0
Maximum8.50469 × 1011
Zeros52
Zeros (%)26.9%
Negative0
Negative (%)0.0%
Memory size1.8 KiB
2023-12-13T03:52:48.182693image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median2.1037012 × 1010
Q34.8484571 × 1010
95-th percentile3.109818 × 1011
Maximum8.50469 × 1011
Range8.50469 × 1011
Interquartile range (IQR)4.8484571 × 1010

Descriptive statistics

Standard deviation1.243778 × 1011
Coefficient of variation (CV)2.1278884
Kurtosis16.495851
Mean5.845128 × 1010
Median Absolute Deviation (MAD)2.1037012 × 1010
Skewness3.8298299
Sum1.1281097 × 1013
Variance1.5469838 × 1022
MonotonicityNot monotonic
2023-12-13T03:52:48.458442image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 52
 
26.9%
28031471000 1
 
0.5%
56709981000 1
 
0.5%
62753857000 1
 
0.5%
28431739000 1
 
0.5%
41532763000 1
 
0.5%
7732303000 1
 
0.5%
108283000000 1
 
0.5%
42810768000 1
 
0.5%
5324275000 1
 
0.5%
Other values (132) 132
68.4%
ValueCountFrequency (%)
0 52
26.9%
266000 1
 
0.5%
861158000 1
 
0.5%
2441597000 1
 
0.5%
2588303000 1
 
0.5%
3048927000 1
 
0.5%
3385410000 1
 
0.5%
4210534000 1
 
0.5%
4720726000 1
 
0.5%
4821646000 1
 
0.5%
ValueCountFrequency (%)
850469000000 1
0.5%
709718000000 1
0.5%
670447000000 1
0.5%
514679000000 1
0.5%
489597000000 1
0.5%
449406000000 1
0.5%
448844000000 1
0.5%
384498000000 1
0.5%
334083000000 1
0.5%
322035000000 1
0.5%

비과세건수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct109
Distinct (%)56.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6455.3057
Minimum0
Maximum100070
Zeros78
Zeros (%)40.4%
Negative0
Negative (%)0.0%
Memory size1.8 KiB
2023-12-13T03:52:49.226281image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median28
Q36793
95-th percentile31220.6
Maximum100070
Range100070
Interquartile range (IQR)6793

Descriptive statistics

Standard deviation13652.088
Coefficient of variation (CV)2.1148631
Kurtosis18.853968
Mean6455.3057
Median Absolute Deviation (MAD)28
Skewness3.84674
Sum1245874
Variance1.863795 × 108
MonotonicityNot monotonic
2023-12-13T03:52:49.500936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 78
40.4%
1 3
 
1.6%
4 3
 
1.6%
2 3
 
1.6%
3 2
 
1.0%
11196 1
 
0.5%
4284 1
 
0.5%
15028 1
 
0.5%
29101 1
 
0.5%
8149 1
 
0.5%
Other values (99) 99
51.3%
ValueCountFrequency (%)
0 78
40.4%
1 3
 
1.6%
2 3
 
1.6%
3 2
 
1.0%
4 3
 
1.6%
5 1
 
0.5%
9 1
 
0.5%
14 1
 
0.5%
15 1
 
0.5%
16 1
 
0.5%
ValueCountFrequency (%)
100070 1
0.5%
85219 1
0.5%
69298 1
0.5%
50091 1
0.5%
41357 1
0.5%
39661 1
0.5%
38865 1
0.5%
35021 1
0.5%
32815 1
0.5%
32213 1
0.5%

비과세금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct110
Distinct (%)57.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.0119009 × 109
Minimum0
Maximum1.50696 × 1011
Zeros80
Zeros (%)41.5%
Negative0
Negative (%)0.0%
Memory size1.8 KiB
2023-12-13T03:52:49.775667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median1848000
Q38.26638 × 108
95-th percentile3.6036234 × 1010
Maximum1.50696 × 1011
Range1.50696 × 1011
Interquartile range (IQR)8.26638 × 108

Descriptive statistics

Standard deviation2.1839722 × 1010
Coefficient of variation (CV)3.1146649
Kurtosis22.806757
Mean7.0119009 × 109
Median Absolute Deviation (MAD)1848000
Skewness4.555079
Sum1.3532969 × 1012
Variance4.7697344 × 1020
MonotonicityNot monotonic
2023-12-13T03:52:50.058230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 80
41.5%
1000 3
 
1.6%
3000 2
 
1.0%
8000 2
 
1.0%
861355000 1
 
0.5%
24812714000 1
 
0.5%
96586000 1
 
0.5%
470503000 1
 
0.5%
11910064000 1
 
0.5%
922000 1
 
0.5%
Other values (100) 100
51.8%
ValueCountFrequency (%)
0 80
41.5%
1000 3
 
1.6%
3000 2
 
1.0%
5000 1
 
0.5%
8000 2
 
1.0%
10000 1
 
0.5%
12000 1
 
0.5%
23000 1
 
0.5%
34000 1
 
0.5%
58000 1
 
0.5%
ValueCountFrequency (%)
150696000000 1
0.5%
136436000000 1
0.5%
119647000000 1
0.5%
111159000000 1
0.5%
105449000000 1
0.5%
54626865000 1
0.5%
51522191000 1
0.5%
40680746000 1
0.5%
37112729000 1
0.5%
37081375000 1
0.5%

Interactions

2023-12-13T03:52:44.677675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:52:42.893002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:52:43.523408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:52:44.120572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:52:44.830386image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:52:43.042323image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:52:43.665109image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:52:44.271595image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:52:44.969538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:52:43.215949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:52:43.795025image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:52:44.407357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:52:45.112551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:52:43.367807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:52:43.950842image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:52:44.528372image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T03:52:50.279833image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군구명자치단체코드과세년도세목명과세건수과세금액비과세건수비과세금액
시군구명1.0001.0000.0000.0000.4530.1700.0000.024
자치단체코드1.0001.0000.0000.0000.4530.1700.0000.024
과세년도0.0000.0001.0000.0000.0000.0000.0000.000
세목명0.0000.0000.0001.0000.7690.4920.6110.656
과세건수0.4530.4530.0000.7691.0000.5950.5260.298
과세금액0.1700.1700.0000.4920.5951.0000.5270.728
비과세건수0.0000.0000.0000.6110.5260.5271.0000.850
비과세금액0.0240.0240.0000.6560.2980.7280.8501.000
2023-12-13T03:52:50.515647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
자치단체코드시군구명세목명과세년도
자치단체코드1.0001.0000.0000.000
시군구명1.0001.0000.0000.000
세목명0.0000.0001.0000.000
과세년도0.0000.0000.0001.000
2023-12-13T03:52:50.727700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과세건수과세금액비과세건수비과세금액시군구명자치단체코드과세년도세목명
과세건수1.0000.6870.5850.4670.3060.3060.0000.509
과세금액0.6871.0000.4900.4920.1000.1000.0000.226
비과세건수0.5850.4901.0000.9140.0000.0000.0000.313
비과세금액0.4670.4920.9141.0000.0100.0100.0000.372
시군구명0.3060.1000.0000.0101.0001.0000.0000.000
자치단체코드0.3060.1000.0000.0101.0001.0000.0000.000
과세년도0.0000.0000.0000.0000.0000.0001.0000.000
세목명0.5090.2260.3130.3720.0000.0000.0001.000

Missing values

2023-12-13T03:52:45.302781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T03:52:45.484892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명자치단체코드과세년도세목명과세건수과세금액비과세건수비과세금액
0경기도성남시수정구411312017자동차세1128332803147100011196861355000
1경기도성남시수정구411312017재산세87657473052710001972426301311000
2경기도성남시수정구411312017주민세1120822441597000666663428000
3경기도성남시수정구411312017등록세00158000
4경기도성남시수정구411312017취득세28350193158000000539029032613000
5경기도성남시수정구411312017레저세0000
6경기도성남시수정구411312017교육세3889913278224600020
7경기도성남시수정구411312017지방소비세0000
8경기도성남시수정구411312017등록면허세628387777485000560150858000
9경기도성남시수정구411312017도시계획세0000
시도명시군구명자치단체코드과세년도세목명과세건수과세금액비과세건수비과세금액
183경기도성남시분당구411352022재산세303135334083000000100070150696000000
184경기도성남시분당구411352022지방소득세32512985046900000000
185경기도성남시분당구411352022지역자원시설세4153672601388200021641454923000
186경기도성남시분당구411352022교육세103648111154500000011334000
187경기도성남시분당구411352022등록면허세141982311800990009795213454000
188경기도성남시분당구411352022지방소비세0000
189경기도성남시분당구411352022담배소비세0000
190경기도성남시분당구411352022레저세1062047224900000
191경기도성남시분당구411352022자동차세30210671894060000350211056218000
192경기도성남시분당구411352022도시계획세0000