Overview

Dataset statistics

Number of variables: 10
Number of observations: 198
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 16.4 KiB
Average record size in memory: 84.7 B

Variable types

Numeric: 3
Categorical: 6
Boolean: 1

Dataset

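Description: Provides counts of local-tax taxpayers by tax item, by taxpayer type (individual or corporation), and by whether the taxpayer is inside or outside the jurisdiction. It is intended as baseline data for setting assessment and collection policy for taxpayers inside and outside the jurisdiction.
Author: 전라남도 영암군 (Yeongam-gun, Jeollanam-do)
URL: https://www.data.go.kr/data/15078915/fileData.do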

Alerts

시도명 has constant value "전라남도" (Constant)
시군구명 has constant value "영암군" (Constant)
자치단체코드 has constant value "46830" (Constant)
데이터기준일 has constant value "2022-12-31" (Constant)
연번 is highly overall correlated with 과세년도 (High correlation)
과세년도 is highly overall correlated with 연번 (High correlation)
납세자수 is highly overall correlated with 납세자유형 (High correlation)
납세자유형 is highly overall correlated with 납세자수 (High correlation)
연번 has unique values (Unique)
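
The Constant and Unique alerts above come down to counting distinct values per column. The following is a minimal sketch, assuming rows are held as plain dictionaries. The function name `constant_and_unique_columns` is hypothetical and is not part of ydata-profiling. The four rows are the first four rows of the Sample section of this report, restricted to a subset of columns.

```python
from typing import Dict, List


def constant_and_unique_columns(rows: List[Dict[str, str]]) -> Dict[str, List[str]]:
    """Classify columns as constant (one distinct value) or unique (all values distinct)."""
    result: Dict[str, List[str]] = {"constant": [], "unique": []}
    if not rows:
        return result
    for column in rows[0]:
        values = [row[column] for row in rows]
        distinct = len(set(values))
        if distinct == 1:
            result["constant"].append(column)
        elif distinct == len(values):
            result["unique"].append(column)
    return result


# First four rows of the report's Sample section (subset of columns).
sample_rows = [
    {"연번": "1", "시도명": "전라남도", "자치단체코드": "46830", "납세자유형": "개인", "관내_관외": "N"},
    {"연번": "2", "시도명": "전라남도", "자치단체코드": "46830", "납세자유형": "개인", "관내_관외": "Y"},
    {"연번": "3", "시도명": "전라남도", "자치단체코드": "46830", "납세자유형": "법인", "관내_관외": "N"},
    {"연번": "4", "시도명": "전라남도", "자치단체코드": "46830", "납세자유형": "법인", "관내_관외": "Y"},
]

flags = constant_and_unique_columns(sample_rows)
print(flags)
```

On this subset, 시도명 and 자치단체코드 are flagged as constant and 연번 as unique, which agrees with the alerts listed for the full 198-row dataset.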

Reproduction

Analysis started: 2024-04-29 22:44:34.878816
Analysis finished: 2024-04-29 22:44:37.647635
Duration: 2.77 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
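
The reported duration can be checked directly from the two timestamps. This is a small sketch using only the standard library; the variable names are illustrative.

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
started = datetime.fromisoformat("2024-04-29 22:44:34.878816")
finished = datetime.fromisoformat("2024-04-29 22:44:37.647635")

duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```

The difference is 2.768819 seconds, which rounds to the 2.77 seconds shown in the report.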

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 198
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 99.5
Minimum: 1
Maximum: 198
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.9 KiB

Quantile statistics

Minimum: 1
5-th percentile: 10.85
Q1: 50.25
Median: 99.5
Q3: 148.75
95-th percentile: 188.15
Maximum: 198
Range: 197
Interquartile range (IQR): 98.5
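
The fractional percentiles above (10.85, 50.25, and so on) are what linear interpolation between neighbouring ranks produces. The sketch below assumes that 연번 takes exactly the integers 1 through 198, which is consistent with its minimum, maximum, and distinct count, and assumes linear interpolation is the method used. The helper `percentile` is hypothetical, not a ydata-profiling function.

```python
import math
from typing import Sequence


def percentile(sorted_values: Sequence[float], q: float) -> float:
    """Return the q-th percentile (0-100) using linear interpolation between ranks."""
    if not sorted_values:
        raise ValueError("percentile of an empty sequence is undefined")
    position = (len(sorted_values) - 1) * q / 100.0
    lower = math.floor(position)
    upper = math.ceil(position)
    fraction = position - lower
    return sorted_values[lower] + (sorted_values[upper] - sorted_values[lower]) * fraction


# Assumption: 연번 is the sequence 1..198.
serial_numbers = list(range(1, 199))

quantiles = {q: percentile(serial_numbers, q) for q in (5, 25, 50, 75, 95)}
iqr = quantiles[75] - quantiles[25]
print(quantiles, iqr)
```

Under these assumptions the sketch reproduces the reported Q1 of 50.25, Q3 of 148.75, and IQR of 98.5, and matches the 5th and 95th percentiles up to floating-point rounding.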

Descriptive statistics

Standard deviation: 57.301832
Coefficient of variation (CV): 0.57589781
Kurtosis: -1.2
Mean: 99.5
Median Absolute Deviation (MAD): 49.5
Skewness: 0
Sum: 19701
Variance: 3283.5
Monotonicity: Strictly increasing
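
These descriptive statistics can be reproduced with the standard library. The sketch assumes that 연번 is exactly the sequence 1 through 198 and that the variance is the sample variance (divisor n - 1); both assumptions are consistent with the reported figures but are not stated in the report.

```python
import statistics

# Assumption: 연번 is the sequence 1..198.
serial_numbers = list(range(1, 199))

total = sum(serial_numbers)
mean = statistics.mean(serial_numbers)
variance = statistics.variance(serial_numbers)  # sample variance, divisor n - 1
std_dev = statistics.stdev(serial_numbers)
cv = std_dev / mean  # coefficient of variation
median = statistics.median(serial_numbers)
mad = statistics.median(abs(x - median) for x in serial_numbers)  # median absolute deviation
strictly_increasing = all(a < b for a, b in zip(serial_numbers, serial_numbers[1:]))

print(total, mean, variance, std_dev, cv, mad, strictly_increasing)
```

Because the column is a plain row counter, its statistics describe the row count rather than the tax data.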
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
1 | 1 | 0.5%
126 | 1 | 0.5%
128 | 1 | 0.5%
129 | 1 | 0.5%
130 | 1 | 0.5%
131 | 1 | 0.5%
132 | 1 | 0.5%
133 | 1 | 0.5%
134 | 1 | 0.5%
135 | 1 | 0.5%
Other values (188) | 188 | 94.9%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 0.5%
2 | 1 | 0.5%
3 | 1 | 0.5%
4 | 1 | 0.5%
5 | 1 | 0.5%
6 | 1 | 0.5%
7 | 1 | 0.5%
8 | 1 | 0.5%
9 | 1 | 0.5%
10 | 1 | 0.5%

Maximum 10 values
Value | Count | Frequency (%)
198 | 1 | 0.5%
197 | 1 | 0.5%
196 | 1 | 0.5%
195 | 1 | 0.5%
194 | 1 | 0.5%
193 | 1 | 0.5%
192 | 1 | 0.5%
191 | 1 | 0.5%
190 | 1 | 0.5%
189 | 1 | 0.5%

시도명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

Value | Count
전라남도 | 198

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 전라남도
2nd row: 전라남도
3rd row: 전라남도
4th row: 전라남도
5th row: 전라남도

Common Values

Value | Count | Frequency (%)
전라남도 | 198 | 100.0%

Length

Histogram of lengths of the category

시군구명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

Value | Count
영암군 | 198

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 영암군
2nd row: 영암군
3rd row: 영암군
4th row: 영암군
5th row: 영암군

Common Values

Value | Count | Frequency (%)
영암군 | 198 | 100.0%

Length

Histogram of lengths of the category

자치단체코드
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

Value | Count
46830 | 198

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 46830
2nd row: 46830
3rd row: 46830
4th row: 46830
5th row: 46830

Common Values

Value | Count | Frequency (%)
46830 | 198 | 100.0%

Length

Histogram of lengths of the category

과세년도
Real number (ℝ)

HIGH CORRELATION 

Distinct: 6
Distinct (%): 3.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2019.5606
Minimum: 2017
Maximum: 2022
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.9 KiB

Quantile statistics

Minimum: 2017
5-th percentile: 2017
Q1: 2018
Median: 2020
Q3: 2021
95-th percentile: 2022
Maximum: 2022
Range: 5
Interquartile range (IQR): 3

Descriptive statistics

Standard deviation: 1.7258452
Coefficient of variation (CV): 0.00085456471
Kurtosis: -1.2847044
Mean: 2019.5606
Median Absolute Deviation (MAD): 2
Skewness: -0.04063049
Sum: 399873
Variance: 2.9785418
Monotonicity: Increasing
Histogram with fixed size bins (bins=6)
Common values
Value | Count | Frequency (%)
2022 | 36 | 18.2%
2020 | 33 | 16.7%
2021 | 33 | 16.7%
2017 | 32 | 16.2%
2018 | 32 | 16.2%
2019 | 32 | 16.2%

Minimum values
Value | Count | Frequency (%)
2017 | 32 | 16.2%
2018 | 32 | 16.2%
2019 | 32 | 16.2%
2020 | 33 | 16.7%
2021 | 33 | 16.7%
2022 | 36 | 18.2%

Maximum values
Value | Count | Frequency (%)
2022 | 36 | 18.2%
2021 | 33 | 16.7%
2020 | 33 | 16.7%
2019 | 32 | 16.2%
2018 | 32 | 16.2%
2017 | 32 | 16.2%
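
Because 과세년도 has only six distinct values, its sum, mean, and variance can be recomputed from the frequency table alone. The sketch below uses the counts listed for this variable and assumes the reported variance is the sample variance (divisor n - 1).

```python
import math

# Year -> number of rows, copied from the value table for 과세년도.
year_counts = {2017: 32, 2018: 32, 2019: 32, 2020: 33, 2021: 33, 2022: 36}

n = sum(year_counts.values())
total = sum(year * count for year, count in year_counts.items())
mean = total / n

# Weighted sum of squared deviations, then the sample variance (divisor n - 1).
sum_sq = sum(count * (year - mean) ** 2 for year, count in year_counts.items())
variance = sum_sq / (n - 1)
std_dev = math.sqrt(variance)

print(n, total, mean, variance, std_dev)
```

The recomputed values agree with the reported sum of 399873, mean of 2019.5606, variance of 2.9785418, and standard deviation of 1.7258452 to the precision shown.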

세목명
Categorical

Distinct: 11
Distinct (%): 5.6%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

Value | Count
등록세 | 24
재산세 | 24
주민세 | 24
취득세 | 24
자동차세 | 24
Other values (6) | 78

Length

Max length: 7
Median length: 5
Mean length: 3.959596
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 등록세
2nd row: 등록세
3rd row: 등록세
4th row: 등록세
5th row: 재산세

Common Values

Value | Count | Frequency (%)
등록세 | 24 | 12.1%
재산세 | 24 | 12.1%
주민세 | 24 | 12.1%
취득세 | 24 | 12.1%
자동차세 | 24 | 12.1%
등록면허세 | 24 | 12.1%
지방소득세 | 24 | 12.1%
담배소비세 | 18 | 9.1%
지역자원시설세 | 7 | 3.5%
지방소비세 | 3 | 1.5%

Length

Histogram of lengths of the category

납세자유형
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

Value | Count
법인 | 106
개인 | 92

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 개인
2nd row: 개인
3rd row: 법인
4th row: 법인
5th row: 개인

Common Values

Value | Count | Frequency (%)
법인 | 106 | 53.5%
개인 | 92 | 46.5%

Length

Histogram of lengths of the category

관내_관외
Boolean

Distinct: 2
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 330.0 B

Value | Count | Frequency (%)
False | 99 | 50.0%
True | 99 | 50.0%

납세자수
Real number (ℝ)

HIGH CORRELATION 

Distinct: 167
Distinct (%): 84.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4791.5152
Minimum: 1
Maximum: 43015
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.9 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1.85
Q1: 193
Median: 1326
Q3: 4793
95-th percentile: 23476.05
Maximum: 43015
Range: 43014
Interquartile range (IQR): 4600

Descriptive statistics

Standard deviation: 8852.1179
Coefficient of variation (CV): 1.8474569
Kurtosis: 7.7500049
Mean: 4791.5152
Median Absolute Deviation (MAD): 1321
Skewness: 2.788492
Sum: 948720
Variance: 78359991
Monotonicity: Not monotonic
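
Several of the 납세자수 figures are derived from others, so they can be cross-checked without the raw data. The sketch below uses only values printed in this report; the dictionary keys are illustrative names. Small differences are expected because the report rounds the mean and standard deviation.

```python
# Values copied from the 납세자수 statistics in this report.
reported = {
    "count": 198,
    "sum": 948720,
    "mean": 4791.5152,
    "std": 8852.1179,
    "minimum": 1,
    "maximum": 43015,
    "q1": 193,
    "q3": 4793,
}

value_range = reported["maximum"] - reported["minimum"]
iqr = reported["q3"] - reported["q1"]
mean_from_sum = reported["sum"] / reported["count"]
cv = reported["std"] / reported["mean"]  # coefficient of variation
variance = reported["std"] ** 2

print(value_range, iqr, mean_from_sum, cv, variance)
```

The mean (4791.5152) is well above the median (1326), which fits the reported positive skewness of 2.788492: a few tax items with tens of thousands of taxpayers pull the mean upward.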
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
1 | 10 | 5.1%
2 | 7 | 3.5%
3 | 6 | 3.0%
6 | 4 | 2.0%
4 | 3 | 1.5%
10 | 3 | 1.5%
561 | 2 | 1.0%
4583 | 2 | 1.0%
1692 | 2 | 1.0%
9 | 2 | 1.0%
Other values (157) | 157 | 79.3%

Minimum 10 values
Value | Count | Frequency (%)
1 | 10 | 5.1%
2 | 7 | 3.5%
3 | 6 | 3.0%
4 | 3 | 1.5%
6 | 4 | 2.0%
7 | 1 | 0.5%
8 | 1 | 0.5%
9 | 2 | 1.0%
10 | 3 | 1.5%
11 | 1 | 0.5%

Maximum 10 values
Value | Count | Frequency (%)
43015 | 1 | 0.5%
42776 | 1 | 0.5%
41980 | 1 | 0.5%
41410 | 1 | 0.5%
40898 | 1 | 0.5%
39469 | 1 | 0.5%
24581 | 1 | 0.5%
23968 | 1 | 0.5%
23788 | 1 | 0.5%
23516 | 1 | 0.5%

데이터기준일
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

Value | Count
2022-12-31 | 198

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2022-12-31
2nd row: 2022-12-31
3rd row: 2022-12-31
4th row: 2022-12-31
5th row: 2022-12-31

Common Values

Value | Count | Frequency (%)
2022-12-31 | 198 | 100.0%

Length

Histogram of lengths of the category

Interactions

[Figures: nine interaction plots; not recoverable from the extracted text]

Correlations

Variable | 연번 | 과세년도 | 세목명 | 납세자유형 | 관내_관외 | 납세자수
연번 | 1.000 | 0.943 | 0.453 | 0.000 | 0.000 | 0.000
과세년도 | 0.943 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000
세목명 | 0.453 | 0.000 | 1.000 | 0.000 | 0.058 | 0.587
납세자유형 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.848
관내_관외 | 0.000 | 0.000 | 0.058 | 0.000 | 1.000 | 0.445
납세자수 | 0.000 | 0.000 | 0.587 | 0.848 | 0.445 | 1.000

Variable | 관내_관외 | 납세자유형 | 세목명
관내_관외 | 1.000 | 0.000 | 0.051
납세자유형 | 0.000 | 1.000 | 0.000
세목명 | 0.051 | 0.000 | 1.000

Variable | 연번 | 과세년도 | 납세자수 | 세목명 | 납세자유형 | 관내_관외
연번 | 1.000 | 0.986 | -0.029 | 0.210 | 0.000 | 0.000
과세년도 | 0.986 | 1.000 | -0.011 | 0.000 | 0.000 | 0.000
납세자수 | -0.029 | -0.011 | 1.000 | 0.345 | 0.648 | 0.317
세목명 | 0.210 | 0.000 | 0.345 | 1.000 | 0.000 | 0.051
납세자유형 | 0.000 | 0.000 | 0.648 | 0.000 | 1.000 | 0.000
관내_관외 | 0.000 | 0.000 | 0.317 | 0.051 | 0.000 | 1.000
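
The High correlation alerts in the Overview can be related to the last matrix in this section by scanning it for pairs whose absolute coefficient reaches a cutoff. The sketch below assumes a cutoff of 0.5; the report does not state the threshold it used, and `highly_correlated_pairs` is a hypothetical helper, not a ydata-profiling function.

```python
from itertools import combinations
from typing import List, Sequence, Tuple

# Column order and coefficients copied from the last correlation matrix above.
columns = ["연번", "과세년도", "납세자수", "세목명", "납세자유형", "관내_관외"]
matrix = [
    [1.000, 0.986, -0.029, 0.210, 0.000, 0.000],
    [0.986, 1.000, -0.011, 0.000, 0.000, 0.000],
    [-0.029, -0.011, 1.000, 0.345, 0.648, 0.317],
    [0.210, 0.000, 0.345, 1.000, 0.000, 0.051],
    [0.000, 0.000, 0.648, 0.000, 1.000, 0.000],
    [0.000, 0.000, 0.317, 0.051, 0.000, 1.000],
]


def highly_correlated_pairs(
    names: Sequence[str],
    coefficients: Sequence[Sequence[float]],
    threshold: float = 0.5,  # assumed cutoff, not stated in the report
) -> List[Tuple[str, str, float]]:
    """Return each unordered pair of variables whose |coefficient| >= threshold."""
    pairs = []
    for i, j in combinations(range(len(names)), 2):
        if abs(coefficients[i][j]) >= threshold:
            pairs.append((names[i], names[j], coefficients[i][j]))
    return pairs


pairs = highly_correlated_pairs(columns, matrix)
print(pairs)
```

With the assumed cutoff, the only pairs returned are 연번 with 과세년도 and 납세자수 with 납세자유형, the same two pairs named in the alerts.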

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

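First rows
(index) | 연번 | 시도명 | 시군구명 | 자치단체코드 | 과세년도 | 세목명 | 납세자유형 | 관내_관외 | 납세자수 | 데이터기준일
0 | 1 | 전라남도 | 영암군 | 46830 | 2017 | 등록세 | 개인 | N | 186 | 2022-12-31
1 | 2 | 전라남도 | 영암군 | 46830 | 2017 | 등록세 | 개인 | Y | 160 | 2022-12-31
2 | 3 | 전라남도 | 영암군 | 46830 | 2017 | 등록세 | 법인 | N | 2 | 2022-12-31
3 | 4 | 전라남도 | 영암군 | 46830 | 2017 | 등록세 | 법인 | Y | 6 | 2022-12-31
4 | 5 | 전라남도 | 영암군 | 46830 | 2017 | 재산세 | 개인 | N | 39469 | 2022-12-31
5 | 6 | 전라남도 | 영암군 | 46830 | 2017 | 재산세 | 개인 | Y | 23033 | 2022-12-31
6 | 7 | 전라남도 | 영암군 | 46830 | 2017 | 재산세 | 법인 | N | 1045 | 2022-12-31
7 | 8 | 전라남도 | 영암군 | 46830 | 2017 | 재산세 | 법인 | Y | 2931 | 2022-12-31
8 | 9 | 전라남도 | 영암군 | 46830 | 2017 | 주민세 | 개인 | N | 6573 | 2022-12-31
9 | 10 | 전라남도 | 영암군 | 46830 | 2017 | 주민세 | 개인 | Y | 21032 | 2022-12-31

Last rows
(index) | 연번 | 시도명 | 시군구명 | 자치단체코드 | 과세년도 | 세목명 | 납세자유형 | 관내_관외 | 납세자수 | 데이터기준일
188 | 189 | 전라남도 | 영암군 | 46830 | 2022 | 등록면허세 | 개인 | Y | 7215 | 2022-12-31
189 | 190 | 전라남도 | 영암군 | 46830 | 2022 | 등록면허세 | 법인 | N | 1315 | 2022-12-31
190 | 191 | 전라남도 | 영암군 | 46830 | 2022 | 등록면허세 | 법인 | Y | 1573 | 2022-12-31
191 | 192 | 전라남도 | 영암군 | 46830 | 2022 | 지방소득세 | 개인 | N | 3184 | 2022-12-31
192 | 193 | 전라남도 | 영암군 | 46830 | 2022 | 지방소득세 | 개인 | Y | 8275 | 2022-12-31
193 | 194 | 전라남도 | 영암군 | 46830 | 2022 | 지방소득세 | 법인 | N | 807 | 2022-12-31
194 | 195 | 전라남도 | 영암군 | 46830 | 2022 | 지방소득세 | 법인 | Y | 1928 | 2022-12-31
195 | 196 | 전라남도 | 영암군 | 46830 | 2022 | 지방소비세 | 법인 | Y | 1 | 2022-12-31
196 | 197 | 전라남도 | 영암군 | 46830 | 2022 | 지역자원시설세 | 개인 | N | 1 | 2022-12-31
197 | 198 | 전라남도 | 영암군 | 46830 | 2022 | 지역자원시설세 | 법인 | Y | 1 | 2022-12-31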