Overview

Dataset statistics

Number of variables: 8
Number of observations: 221
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 14.6 KiB
Average record size in memory: 67.6 B

Variable types

Categorical: 5
Numeric: 2
Boolean: 1

Dataset

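Description: Data on the status of local tax taxpayers. It provides the province name (시도명), city/county/district name (시군구명), local government code (자치단체코드), tax year (과세년도), tax item name (세목명), taxpayer type (납세자유형), in-jurisdiction flag (관내여부), number of taxpayers (납세자수), and related fields.
URL: https://www.data.go.kr/data/15079449/fileData.do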

Alerts

시도명 has constant value "경기도" (Constant)
시군구명 has constant value "의정부시" (Constant)
자치단체코드 has constant value "41150" (Constant)
납세자수 is highly overall correlated with 납세자유형 (High correlation)
납세자유형 is highly overall correlated with 납세자수 (High correlation)
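
The three Constant alerts mean that each of those columns holds a single value in every row. The check can be sketched as follows, using four rows copied from the Sample section of this report (rows 0, 2, 5 and 220). The helper name `constant_columns` is hypothetical, and this is an illustration of the idea rather than the way ydata-profiling implements the alert.

```python
from collections import defaultdict

# Column names as they appear in this report.
COLUMNS = ["시도명", "시군구명", "자치단체코드", "과세년도",
           "세목명", "납세자유형", "관내여부", "납세자수"]

# Rows 0, 2, 5 and 220 from the Sample section (a small subset, not the full data).
ROWS = [
    ("경기도", "의정부시", "41150", 2017, "등록세", "개인", "N", 33),
    ("경기도", "의정부시", "41150", 2017, "등록세", "법인", "Y", 2),
    ("경기도", "의정부시", "41150", 2017, "재산세", "개인", "N", 42591),
    ("경기도", "의정부시", "41150", 2022, "지역자원시설세", "법인", "Y", 18),
]


def constant_columns(columns, rows):
    """Return the columns that hold exactly one distinct value (hypothetical helper)."""
    distinct = defaultdict(set)
    for row in rows:
        for name, value in zip(columns, row):
            distinct[name].add(value)
    return [name for name in columns if len(distinct[name]) == 1]


print(constant_columns(COLUMNS, ROWS))
```

Constant columns carry no information for modelling or grouping, so they are usually dropped or moved to dataset-level metadata before further analysis.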

Reproduction

Analysis started: 2023-12-12 03:06:00.695543
Analysis finished: 2023-12-12 03:06:01.805412
Duration: 1.11 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

시도명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB
경기도  221

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 경기도
2nd row: 경기도
3rd row: 경기도
4th row: 경기도
5th row: 경기도

Common Values

Value  Count  Frequency (%)
경기도  221  100.0%

[Figure: Length, histogram of lengths of the category]
[Figure: Common Values (Plot)]

시군구명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB
의정부시  221

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 의정부시
2nd row: 의정부시
3rd row: 의정부시
4th row: 의정부시
5th row: 의정부시

Common Values

Value  Count  Frequency (%)
의정부시  221  100.0%

[Figure: Length, histogram of lengths of the category]
[Figure: Common Values (Plot)]

자치단체코드
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB
41150  221

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 41150
2nd row: 41150
3rd row: 41150
4th row: 41150
5th row: 41150

Common Values

Value  Count  Frequency (%)
41150  221  100.0%

[Figure: Length, histogram of lengths of the category]
[Figure: Common Values (Plot)]

과세년도
Real number (ℝ)

Distinct: 6
Distinct (%): 2.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2019.552
Minimum: 2017
Maximum: 2022
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.1 KiB

Quantile statistics

Minimum: 2017
5-th percentile: 2017
Q1: 2018
Median: 2020
Q3: 2021
95-th percentile: 2022
Maximum: 2022
Range: 5
Interquartile range (IQR): 3

Descriptive statistics

Standard deviation: 1.7064737
Coefficient of variation (CV): 0.00084497637
Kurtosis: -1.2621355
Mean: 2019.552
Median Absolute Deviation (MAD): 1
Skewness: -0.04261492
Sum: 446321
Variance: 2.9120527
Monotonicity: Increasing

[Figure: Histogram with fixed size bins (bins=6)]

Common values

Value  Count  Frequency (%)
2020  38  17.2%
2021  38  17.2%
2022  38  17.2%
2018  36  16.3%
2019  36  16.3%
2017  35  15.8%

Minimum 10 values

Value  Count  Frequency (%)
2017  35  15.8%
2018  36  16.3%
2019  36  16.3%
2020  38  17.2%
2021  38  17.2%
2022  38  17.2%

Maximum 10 values

Value  Count  Frequency (%)
2022  38  17.2%
2021  38  17.2%
2020  38  17.2%
2019  36  16.3%
2018  36  16.3%
2017  35  15.8%
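
Because 과세년도 has only six distinct values, its descriptive statistics can be cross-checked directly from the frequency table. The sketch below recomputes the count, sum, mean, variance and standard deviation. Dividing by n - 1 (sample variance) is an assumption that reproduces the reported figure; the function name `weighted_stats` is hypothetical.

```python
import math

# Frequency table for 과세년도 as listed in this report: year -> number of rows.
YEAR_COUNTS = {2017: 35, 2018: 36, 2019: 36, 2020: 38, 2021: 38, 2022: 38}


def weighted_stats(counts):
    """Compute n, sum, mean, sample variance and std from a value -> count table."""
    n = sum(counts.values())
    total = sum(value * count for value, count in counts.items())
    mean = total / n
    # Sum of squared deviations, weighted by how often each value occurs.
    squared_dev = sum(count * (value - mean) ** 2 for value, count in counts.items())
    variance = squared_dev / (n - 1)  # assumption: sample variance (ddof = 1)
    return {"n": n, "sum": total, "mean": mean,
            "variance": variance, "std": math.sqrt(variance)}


stats = weighted_stats(YEAR_COUNTS)
print(stats)
```

The results agree with the report: 221 rows, a sum of 446321, a mean of about 2019.552 and a variance of about 2.912.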

세목명
Categorical

Distinct: 11
Distinct (%): 5.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB
재산세  24
주민세  24
취득세  24
자동차세  24
등록면허세  24
Other values (6)  101

Length

Max length: 7
Median length: 5
Mean length: 4.1493213
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 등록세
2nd row: 등록세
3rd row: 등록세
4th row: 레저세
5th row: 레저세

Common Values

Value  Count  Frequency (%)
재산세  24  10.9%
주민세  24  10.9%
취득세  24  10.9%
자동차세  24  10.9%
등록면허세  24  10.9%
지방소득세  24  10.9%
지역자원시설세  24  10.9%
등록세  21  9.5%
담배소비세  16  7.2%
레저세  13  5.9%

[Figure: Length, histogram of lengths of the category]

납세자유형
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB
법인  117
개인  104

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 개인
2nd row: 개인
3rd row: 법인
4th row: 법인
5th row: 법인

Common Values

Value  Count  Frequency (%)
법인  117  52.9%
개인  104  47.1%

[Figure: Length, histogram of lengths of the category]
[Figure: Common Values (Plot)]

관내여부
Boolean

Distinct: 2
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Memory size: 353.0 B
False  113
True  108

Common Values

Value  Count  Frequency (%)
False  113  51.1%
True  108  48.9%

납세자수
Real number (ℝ)

HIGH CORRELATION 

Distinct: 178
Distinct (%): 80.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 18523.819
Minimum: 1
Maximum: 174777
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.1 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 33
Median: 1286
Q3: 15653
95-th percentile: 118091
Maximum: 174777
Range: 174776
Interquartile range (IQR): 15620

Descriptive statistics

Standard deviation: 37904.075
Coefficient of variation (CV): 2.0462344
Kurtosis: 5.8787843
Mean: 18523.819
Median Absolute Deviation (MAD): 1284
Skewness: 2.556277
Sum: 4093764
Variance: 1.4367189 × 10^9
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=50)]

Common values

Value  Count  Frequency (%)
2  18  8.1%
1  12  5.4%
14  5  2.3%
66  4  1.8%
28  3  1.4%
3703  2  0.9%
30  2  0.9%
50  2  0.9%
18  2  0.9%
704  2  0.9%
Other values (168)  169  76.5%

Minimum 10 values

Value  Count  Frequency (%)
1  12  5.4%
2  18  8.1%
3  2  0.9%
4  1  0.5%
6  1  0.5%
9  1  0.5%
11  1  0.5%
12  1  0.5%
14  5  2.3%
16  1  0.5%

Maximum 10 values

Value  Count  Frequency (%)
174777  1  0.5%
172539  1  0.5%
164635  1  0.5%
159965  1  0.5%
158290  1  0.5%
148295  1  0.5%
136761  1  0.5%
134004  1  0.5%
123706  1  0.5%
119387  1  0.5%
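
Several of the 납세자수 figures are derived from others in the same section, which makes them easy to sanity-check. The sketch below recomputes the mean, range, IQR, coefficient of variation and variance from the reported sum, count, quartiles and standard deviation. The function name `derived_stats` is hypothetical, and the inputs are the rounded values printed in this report, so small rounding differences are expected.

```python
# Values copied from the 납세자수 section of this report (rounded as printed).
REPORTED = {
    "count": 221,
    "sum": 4093764,
    "minimum": 1,
    "q1": 33,
    "median": 1286,
    "q3": 15653,
    "maximum": 174777,
    "std": 37904.075,
}


def derived_stats(r):
    """Recompute the statistics that follow from the reported inputs."""
    mean = r["sum"] / r["count"]
    return {
        "mean": mean,
        "range": r["maximum"] - r["minimum"],
        "iqr": r["q3"] - r["q1"],
        "cv": r["std"] / mean,               # coefficient of variation
        "variance": r["std"] ** 2,
        "mean_to_median": mean / r["median"],
    }


derived = derived_stats(REPORTED)
for name, value in derived.items():
    print(f"{name}: {value:.4f}")
```

The mean is roughly 14 times the median, which is consistent with the strong right skew reported above (skewness 2.556): a few tax items with very large taxpayer counts pull the mean far above the typical row.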

Interactions

[Figures: 4 interaction plots]

Correlations

[Figure: correlation heatmap]

Variable  과세년도  세목명  납세자유형  관내여부  납세자수
과세년도  1.000  0.000  0.000  0.000  0.000
세목명  0.000  1.000  0.108  0.000  0.465
납세자유형  0.000  0.108  1.000  0.000  0.702
관내여부  0.000  0.000  0.000  1.000  0.385
납세자수  0.000  0.465  0.702  0.385  1.000

[Figure: correlation heatmap]

Variable  납세자유형  관내여부  세목명
납세자유형  1.000  0.000  0.100
관내여부  0.000  1.000  0.000
세목명  0.100  0.000  1.000

[Figure: correlation heatmap]

Variable  과세년도  납세자수  세목명  납세자유형  관내여부
과세년도  1.000  -0.031  0.000  0.000  0.000
납세자수  -0.031  1.000  0.217  0.538  0.290
세목명  0.000  0.217  1.000  0.100  0.000
납세자유형  0.000  0.538  0.100  1.000  0.000
관내여부  0.000  0.290  0.000  0.000  1.000
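
The High correlation alert for 납세자수 and 납세자유형 is consistent with the first correlation matrix, where that pair scores 0.702. The sketch below scans the upper triangle of that matrix for pairs at or above a cut-off. The threshold of 0.5 is an assumption chosen for illustration, since the report does not state which value it used, and `high_pairs` is a hypothetical helper.

```python
# First correlation matrix from this report, in the order of LABELS.
LABELS = ["과세년도", "세목명", "납세자유형", "관내여부", "납세자수"]
MATRIX = [
    [1.000, 0.000, 0.000, 0.000, 0.000],
    [0.000, 1.000, 0.108, 0.000, 0.465],
    [0.000, 0.108, 1.000, 0.000, 0.702],
    [0.000, 0.000, 0.000, 1.000, 0.385],
    [0.000, 0.465, 0.702, 0.385, 1.000],
]

THRESHOLD = 0.5  # assumed cut-off, not taken from the report


def high_pairs(labels, matrix, threshold):
    """List variable pairs whose absolute correlation meets the threshold."""
    pairs = []
    for i in range(len(labels)):
        # Only look above the diagonal: the matrix is symmetric.
        for j in range(i + 1, len(labels)):
            if abs(matrix[i][j]) >= threshold:
                pairs.append((labels[i], labels[j], matrix[i][j]))
    return pairs


flagged = high_pairs(LABELS, MATRIX, THRESHOLD)
print(flagged)
```

With this cut-off, only the 납세자유형 and 납세자수 pair is flagged; 세목명 and 납세자수 (0.465) falls just below it.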

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

First rows

Row  시도명  시군구명  자치단체코드  과세년도  세목명  납세자유형  관내여부  납세자수
0  경기도  의정부시  41150  2017  등록세  개인  N  33
1  경기도  의정부시  41150  2017  등록세  개인  Y  66
2  경기도  의정부시  41150  2017  등록세  법인  Y  2
3  경기도  의정부시  41150  2017  레저세  법인  N  2
4  경기도  의정부시  41150  2017  레저세  법인  Y  2
5  경기도  의정부시  41150  2017  재산세  개인  N  42591
6  경기도  의정부시  41150  2017  재산세  개인  Y  92481
7  경기도  의정부시  41150  2017  재산세  법인  N  605
8  경기도  의정부시  41150  2017  재산세  법인  Y  668
9  경기도  의정부시  41150  2017  주민세  개인  N  30646

Last rows

Row  시도명  시군구명  자치단체코드  과세년도  세목명  납세자유형  관내여부  납세자수
211  경기도  의정부시  41150  2022  등록면허세  법인  Y  2821
212  경기도  의정부시  41150  2022  지방소득세  개인  N  29411
213  경기도  의정부시  41150  2022  지방소득세  개인  Y  118091
214  경기도  의정부시  41150  2022  지방소득세  법인  N  1473
215  경기도  의정부시  41150  2022  지방소득세  법인  Y  3475
216  경기도  의정부시  41150  2022  지방소비세  법인  Y  1
217  경기도  의정부시  41150  2022  지역자원시설세  개인  N  50
218  경기도  의정부시  41150  2022  지역자원시설세  개인  Y  42
219  경기도  의정부시  41150  2022  지역자원시설세  법인  N  20
220  경기도  의정부시  41150  2022  지역자원시설세  법인  Y  18