Overview

Dataset statistics

Number of variables: 8
Number of observations: 189
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 12.5 KiB
Average record size in memory: 67.7 B
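
The average record size follows from the two figures above it. The sketch below redoes that arithmetic as a consistency check; it assumes 1 KiB = 1024 bytes and uses the rounded 12.5 KiB figure, so only the first decimal is meaningful. The function name is illustrative.

```python
# Figures copied from the "Dataset statistics" table.
TOTAL_SIZE_KIB = 12.5
N_OBSERVATIONS = 189


def average_record_size_bytes(total_kib: float, n_rows: int) -> float:
    """Average bytes per row, assuming 1 KiB = 1024 bytes."""
    return total_kib * 1024 / n_rows


avg = average_record_size_bytes(TOTAL_SIZE_KIB, N_OBSERVATIONS)
print(f"{avg:.1f} B")
```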

Variable types

Categorical: 5
Numeric: 2
Boolean: 1

Dataset

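Description: Provides taxation data on Seocheon-gun (서천군) local taxes from 2017 to 2022, broken down by tax item, taxpayer type (individual or corporation), in-district/out-of-district status, and number of taxpayers.
Author: 충청남도 (Chungcheongnam-do)
URL: https://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=347&beforeMenuCd=DOM_000000201001001000&publicdatapk=15080476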

Alerts

시도명 has constant value "충청남도" (Constant)
시군구명 has constant value "서천군" (Constant)
자치단체코드 has constant value "44770" (Constant)
납세자수 is highly overall correlated with 납세자유형 (High correlation)
납세자유형 is highly overall correlated with 납세자수 (High correlation)
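
The three "Constant" alerts can be reproduced by checking which columns hold a single distinct value. The sketch below does this in plain Python on four rows transcribed from the report's Sample section (rows 0, 2, 5 and 188), not on the full 189-row file; the helper name is illustrative.

```python
COLUMNS = ["시도명", "시군구명", "자치단체코드", "과세년도",
           "세목명", "납세자유형", "관내외여부", "납세자수"]

# Rows 0, 2, 5 and 188 of the Sample section (a subset, not the full dataset).
RAW = [
    ("충청남도", "서천군", "44770", 2017, "등록세", "개인", "N", 156),
    ("충청남도", "서천군", "44770", 2017, "등록세", "법인", "Y", 4),
    ("충청남도", "서천군", "44770", 2017, "재산세", "법인", "N", 645),
    ("충청남도", "서천군", "44770", 2022, "지역자원시설세", "법인", "Y", 1),
]
ROWS = [dict(zip(COLUMNS, values)) for values in RAW]


def constant_columns(rows):
    """Return the names of columns whose value is identical in every row."""
    if not rows:
        return []
    return [col for col in rows[0] if len({row[col] for row in rows}) == 1]


print(constant_columns(ROWS))
```

Constant columns carry no information for modelling or correlation, which is why the report flags them.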

Reproduction

Analysis started: 2024-01-09 21:30:28.257438
Analysis finished: 2024-01-09 21:30:28.896040
Duration: 0.64 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
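
A report of this kind can be regenerated with ydata-profiling. The sketch below is a minimal version under stated assumptions: the CSV file name, the report title, and the choice to read 자치단체코드 as text are all hypothetical, and the original `config.json` settings are not reproduced.

```python
def build_report(csv_path: str, out_path: str = "report.html") -> None:
    """Profile a CSV file with ydata-profiling and write an HTML report."""
    # Imported lazily so the function can be defined without the packages installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: reading the municipality code as text keeps "44770"
    # categorical instead of numeric.
    df = pd.read_csv(csv_path, dtype={"자치단체코드": str})
    report = ProfileReport(df, title="Seocheon-gun local tax taxpayers, 2017-2022")
    report.to_file(out_path)


# Example call (the path is hypothetical):
# build_report("seocheon_local_tax.csv")
```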

Variables

시도명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

Value | Count
충청남도 | 189

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 충청남도
2nd row: 충청남도
3rd row: 충청남도
4th row: 충청남도
5th row: 충청남도

Common Values

Value | Count | Frequency (%)
충청남도 | 189 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

시군구명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

Value | Count
서천군 | 189

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 서천군
2nd row: 서천군
3rd row: 서천군
4th row: 서천군
5th row: 서천군

Common Values

Value | Count | Frequency (%)
서천군 | 189 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

자치단체코드
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

Value | Count
44770 | 189

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 44770
2nd row: 44770
3rd row: 44770
4th row: 44770
5th row: 44770

Common Values

Value | Count | Frequency (%)
44770 | 189 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

과세년도
Real number (ℝ)

Distinct: 6
Distinct (%): 3.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2019.545
Minimum: 2017
Maximum: 2022
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.8 KiB

Quantile statistics

Minimum: 2017
5-th percentile: 2017
Q1: 2018
Median: 2020
Q3: 2021
95-th percentile: 2022
Maximum: 2022
Range: 5
Interquartile range (IQR): 3

Descriptive statistics

Standard deviation: 1.7303113
Coefficient of variation (CV): 0.00085678276
Kurtosis: -1.2965479
Mean: 2019.545
Median Absolute Deviation (MAD): 2
Skewness: -0.026809728
Sum: 381694
Variance: 2.9939773
Monotonicity: Increasing
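
The descriptive statistics for 과세년도 can be rebuilt from the per-year row counts in this variable's frequency table. The sketch below expands those counts into 189 values and recomputes the sum, mean, variance, standard deviation and CV with Python's `statistics` module. It assumes the report uses the sample variance (dividing by n - 1), which is what matches the published figures.

```python
import statistics

# Per-year row counts, taken from the 과세년도 frequency table in this report.
YEAR_COUNTS = {2017: 31, 2018: 31, 2019: 31, 2020: 30, 2021: 32, 2022: 34}

# Expand the counts back into one value per row.
values = [year for year, n in YEAR_COUNTS.items() for _ in range(n)]

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (divides by n - 1)
std = statistics.stdev(values)          # square root of the sample variance
cv = std / mean                         # coefficient of variation

print(len(values), total)
print(f"mean={mean:.3f} variance={variance:.7f} std={std:.7f} cv={cv:.11f}")
```

The CV is tiny because the standard deviation (under two years) is divided by a mean near 2020, so it says little about a calendar-year column.
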
[Figure: Histogram with fixed size bins (bins=6)]

Common Values

Value | Count | Frequency (%)
2022 | 34 | 18.0%
2021 | 32 | 16.9%
2017 | 31 | 16.4%
2018 | 31 | 16.4%
2019 | 31 | 16.4%
2020 | 30 | 15.9%

Minimum values

Value | Count | Frequency (%)
2017 | 31 | 16.4%
2018 | 31 | 16.4%
2019 | 31 | 16.4%
2020 | 30 | 15.9%
2021 | 32 | 16.9%
2022 | 34 | 18.0%

Maximum values

Value | Count | Frequency (%)
2022 | 34 | 18.0%
2021 | 32 | 16.9%
2020 | 30 | 15.9%
2019 | 31 | 16.4%
2018 | 31 | 16.4%
2017 | 31 | 16.4%

세목명
Categorical

Distinct: 11
Distinct (%): 5.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

Value | Count
재산세 | 24
주민세 | 24
취득세 | 24
자동차세 | 24
등록면허세 | 24
Other values (6) | 69

Length

Max length: 7
Median length: 3
Mean length: 3.8888889
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 등록세
2nd row: 등록세
3rd row: 등록세
4th row: 재산세
5th row: 재산세

Common Values

Value | Count | Frequency (%)
재산세 | 24 | 12.7%
주민세 | 24 | 12.7%
취득세 | 24 | 12.7%
자동차세 | 24 | 12.7%
등록면허세 | 24 | 12.7%
지방소득세 | 24 | 12.7%
등록세 | 22 | 11.6%
담배소비세 | 15 | 7.9%
지역자원시설세 | 3 | 1.6%
지방소비세 | 3 | 1.6%
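
The Frequency (%) column is each count divided by the 189 observations. The sketch below recomputes those percentages. It also shows that the ten listed values account for 187 rows, so the eleventh distinct 세목명 value, which the table does not list, covers the remaining 2 rows. Its name is not recoverable from this report, and the variable names below are illustrative.

```python
N_ROWS = 189
N_DISTINCT = 11

# Counts of the ten 세목명 values listed in the Common Values table.
LISTED_COUNTS = {
    "재산세": 24, "주민세": 24, "취득세": 24, "자동차세": 24, "등록면허세": 24,
    "지방소득세": 24, "등록세": 22, "담배소비세": 15, "지역자원시설세": 3, "지방소비세": 3,
}


def frequency_pct(count: int, total: int) -> float:
    """Share of rows as a percentage, rounded to one decimal like the report."""
    return round(100 * count / total, 1)


percentages = {name: frequency_pct(n, N_ROWS) for name, n in LISTED_COUNTS.items()}
unlisted_categories = N_DISTINCT - len(LISTED_COUNTS)
unlisted_rows = N_ROWS - sum(LISTED_COUNTS.values())

print(percentages)
print(unlisted_categories, unlisted_rows)
```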

Length

[Figure: Histogram of lengths of the category]

납세자유형
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 1.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

Value | Count
법인 | 98
개인 | 91

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 개인
2nd row: 개인
3rd row: 법인
4th row: 개인
5th row: 개인

Common Values

Value | Count | Frequency (%)
법인 | 98 | 51.9%
개인 | 91 | 48.1%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

관내외여부
Boolean

Distinct: 2
Distinct (%): 1.1%
Missing: 0
Missing (%): 0.0%
Memory size: 321.0 B

Value | Count | Frequency (%)
False | 97 | 51.3%
True | 92 | 48.7%

납세자수
Real number (ℝ)

HIGH CORRELATION 

Distinct: 167
Distinct (%): 88.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4508.381
Minimum: 1
Maximum: 38116
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.8 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 140
Median: 701
Q3: 2668
95-th percentile: 25029
Maximum: 38116
Range: 38115
Interquartile range (IQR): 2528

Descriptive statistics

Standard deviation: 8767.7517
Coefficient of variation (CV): 1.9447673
Kurtosis: 5.0365323
Mean: 4508.381
Median Absolute Deviation (MAD): 693
Skewness: 2.4169659
Sum: 852084
Variance: 76873470
Monotonicity: Not monotonic
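
Several of the 납세자수 figures are derived from the others, so they can be cross-checked without the raw data. The sketch below recomputes the range, IQR, mean, CV and variance from the published values only. Small differences are expected because the inputs are already rounded.

```python
# Figures copied from the 납세자수 statistics above.
N_ROWS = 189
TOTAL = 852084
MINIMUM, Q1, Q3, MAXIMUM = 1, 140, 2668, 38116
STD = 8767.7517

value_range = MAXIMUM - MINIMUM  # Range
iqr = Q3 - Q1                    # Interquartile range
mean = TOTAL / N_ROWS            # Mean = Sum / number of observations
cv = STD / mean                  # Coefficient of variation
variance = STD ** 2              # Variance = standard deviation squared

print(value_range, iqr)
print(f"mean={mean:.3f} cv={cv:.7f} variance={variance:.0f}")
```

The mean (about 4508) is far above the median (701), consistent with the reported positive skewness.
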
[Figure: Histogram with fixed size bins (bins=50)]

Common Values

Value | Count | Frequency (%)
1 | 12 | 6.3%
4 | 4 | 2.1%
3 | 4 | 2.1%
9 | 3 | 1.6%
707 | 2 | 1.1%
7 | 2 | 1.1%
6 | 2 | 1.1%
156 | 1 | 0.5%
25325 | 1 | 0.5%
266 | 1 | 0.5%
Other values (157) | 157 | 83.1%

Minimum values

Value | Count | Frequency (%)
1 | 12 | 6.3%
2 | 1 | 0.5%
3 | 4 | 2.1%
4 | 4 | 2.1%
6 | 2 | 1.1%
7 | 2 | 1.1%
8 | 1 | 0.5%
9 | 3 | 1.6%
12 | 1 | 0.5%
29 | 1 | 0.5%

Maximum values

Value | Count | Frequency (%)
38116 | 1 | 0.5%
37539 | 1 | 0.5%
37084 | 1 | 0.5%
36498 | 1 | 0.5%
36105 | 1 | 0.5%
35556 | 1 | 0.5%
25506 | 1 | 0.5%
25325 | 1 | 0.5%
25087 | 1 | 0.5%
25079 | 1 | 0.5%

Interactions

[Figures: four interaction plots]

Correlations

 | 과세년도 | 세목명 | 납세자유형 | 관내외여부 | 납세자수
과세년도 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000
세목명 | 0.000 | 1.000 | 0.000 | 0.069 | 0.640
납세자유형 | 0.000 | 0.000 | 1.000 | 0.000 | 0.726
관내외여부 | 0.000 | 0.069 | 0.000 | 1.000 | 0.676
납세자수 | 0.000 | 0.640 | 0.726 | 0.676 | 1.000

 | 관내외여부 | 세목명 | 납세자유형
관내외여부 | 1.000 | 0.063 | 0.000
세목명 | 0.063 | 1.000 | 0.000
납세자유형 | 0.000 | 0.000 | 1.000

 | 과세년도 | 납세자수 | 세목명 | 납세자유형 | 관내외여부
과세년도 | 1.000 | -0.002 | 0.000 | 0.000 | 0.000
납세자수 | -0.002 | 1.000 | 0.390 | 0.532 | 0.490
세목명 | 0.000 | 0.390 | 1.000 | 0.000 | 0.063
납세자유형 | 0.000 | 0.532 | 0.000 | 1.000 | 0.000
관내외여부 | 0.000 | 0.490 | 0.063 | 0.000 | 1.000
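
The report does not say which coefficient each matrix uses. ydata-profiling offers Cramér's V among its measures for categorical pairs, so the sketch below shows the plain, uncorrected form of that statistic as an illustration; it is not necessarily the exact variant behind the numbers above. The two toy inputs are hypothetical and chosen only to show the extremes of 0 and 1.

```python
import math
from collections import Counter


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equal-length categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    k = min(len(row_totals), len(col_totals)) - 1
    if n == 0 or k == 0:
        return 0.0  # undefined for empty or constant input; report 0 here
    chi2 = 0.0
    for r, r_total in row_totals.items():
        for c, c_total in col_totals.items():
            expected = r_total * c_total / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected
    return math.sqrt(chi2 / (n * k))


# Hypothetical toy data: a balanced design (no association) and an identical pair.
taxpayer_type = ["개인", "개인", "법인", "법인"]
in_district = ["N", "Y", "N", "Y"]

print(cramers_v(taxpayer_type, in_district))
print(cramers_v(taxpayer_type, taxpayer_type))
```

A value near 0, like the 0.000 reported between 납세자유형 and 관내외여부, means the two categories are spread almost evenly across each other.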

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display which lets you quickly pick out patterns in data completion.]

Sample

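First rows

Row | 시도명 | 시군구명 | 자치단체코드 | 과세년도 | 세목명 | 납세자유형 | 관내외여부 | 납세자수
0 | 충청남도 | 서천군 | 44770 | 2017 | 등록세 | 개인 | N | 156
1 | 충청남도 | 서천군 | 44770 | 2017 | 등록세 | 개인 | Y | 148
2 | 충청남도 | 서천군 | 44770 | 2017 | 등록세 | 법인 | Y | 4
3 | 충청남도 | 서천군 | 44770 | 2017 | 재산세 | 개인 | N | 35556
4 | 충청남도 | 서천군 | 44770 | 2017 | 재산세 | 개인 | Y | 24954
5 | 충청남도 | 서천군 | 44770 | 2017 | 재산세 | 법인 | N | 645
6 | 충청남도 | 서천군 | 44770 | 2017 | 재산세 | 법인 | Y | 2580
7 | 충청남도 | 서천군 | 44770 | 2017 | 주민세 | 개인 | N | 2510
8 | 충청남도 | 서천군 | 44770 | 2017 | 주민세 | 개인 | Y | 23811
9 | 충청남도 | 서천군 | 44770 | 2017 | 주민세 | 법인 | N | 140

Last rows

Row | 시도명 | 시군구명 | 자치단체코드 | 과세년도 | 세목명 | 납세자유형 | 관내외여부 | 납세자수
179 | 충청남도 | 서천군 | 44770 | 2022 | 등록면허세 | 개인 | N | 2966
180 | 충청남도 | 서천군 | 44770 | 2022 | 등록면허세 | 개인 | Y | 10811
181 | 충청남도 | 서천군 | 44770 | 2022 | 등록면허세 | 법인 | N | 683
182 | 충청남도 | 서천군 | 44770 | 2022 | 등록면허세 | 법인 | Y | 768
183 | 충청남도 | 서천군 | 44770 | 2022 | 지방소득세 | 개인 | N | 1501
184 | 충청남도 | 서천군 | 44770 | 2022 | 지방소득세 | 개인 | Y | 8161
185 | 충청남도 | 서천군 | 44770 | 2022 | 지방소득세 | 법인 | N | 278
186 | 충청남도 | 서천군 | 44770 | 2022 | 지방소득세 | 법인 | Y | 792
187 | 충청남도 | 서천군 | 44770 | 2022 | 지방소비세 | 법인 | Y | 1
188 | 충청남도 | 서천군 | 44770 | 2022 | 지역자원시설세 | 법인 | Y | 1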
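
Each row is one (tax item, taxpayer type, in-district flag) combination for a year, so counts can be rolled up along any of those fields. The sketch below sums 납세자수 by 세목명 and by 납세자유형 using only the first ten sample rows (all from 2017). The totals therefore describe this excerpt, not the full dataset, and the helper name is illustrative.

```python
from collections import defaultdict

# First ten sample rows: (세목명, 납세자유형, 관내외여부, 납세자수).
FIRST_ROWS = [
    ("등록세", "개인", "N", 156),
    ("등록세", "개인", "Y", 148),
    ("등록세", "법인", "Y", 4),
    ("재산세", "개인", "N", 35556),
    ("재산세", "개인", "Y", 24954),
    ("재산세", "법인", "N", 645),
    ("재산세", "법인", "Y", 2580),
    ("주민세", "개인", "N", 2510),
    ("주민세", "개인", "Y", 23811),
    ("주민세", "법인", "N", 140),
]


def taxpayers_by(rows, key_index):
    """Sum the taxpayer count (last field) grouped by the field at key_index."""
    totals = defaultdict(int)
    for row in rows:
        totals[row[key_index]] += row[3]
    return dict(totals)


by_item = taxpayers_by(FIRST_ROWS, 0)
by_type = taxpayers_by(FIRST_ROWS, 1)

print(by_item)
print(by_type)
```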