Overview

Dataset statistics

Number of variables9
Number of observations207
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory15.5 KiB
Average record size in memory76.6 B

Variable types

Numeric3
Categorical5
Boolean1

Dataset

Description지방세 납세자 현황 데이터로 과세년도와 세목명 별로 납세자 유형, 관내, 관외, 납세자수 등의 항목 데이터를 제공합니다.
Author전라남도 장흥군
URLhttps://www.data.go.kr/data/15078653/fileData.do

Alerts

시도명 has constant value ""Constant
시군구명 has constant value ""Constant
자치단체코드 has constant value ""Constant
연번 is highly overall correlated with 과세년도High correlation
과세년도 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2024-04-21 01:48:05.398904
Analysis finished2024-04-21 01:48:07.984285
Duration2.59 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct207
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean104
Minimum1
Maximum207
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2024-04-21T10:48:08.053392image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile11.3
Q152.5
median104
Q3155.5
95-th percentile196.7
Maximum207
Range206
Interquartile range (IQR)103

Descriptive statistics

Standard deviation59.899917
Coefficient of variation (CV)0.57596074
Kurtosis-1.2
Mean104
Median Absolute Deviation (MAD)52
Skewness0
Sum21528
Variance3588
MonotonicityStrictly increasing
2024-04-21T10:48:08.203199image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.5%
2 1
 
0.5%
133 1
 
0.5%
134 1
 
0.5%
135 1
 
0.5%
136 1
 
0.5%
137 1
 
0.5%
138 1
 
0.5%
139 1
 
0.5%
140 1
 
0.5%
Other values (197) 197
95.2%
ValueCountFrequency (%)
1 1
0.5%
2 1
0.5%
3 1
0.5%
4 1
0.5%
5 1
0.5%
6 1
0.5%
7 1
0.5%
8 1
0.5%
9 1
0.5%
10 1
0.5%
ValueCountFrequency (%)
207 1
0.5%
206 1
0.5%
205 1
0.5%
204 1
0.5%
203 1
0.5%
202 1
0.5%
201 1
0.5%
200 1
0.5%
199 1
0.5%
198 1
0.5%

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
전라남도
207 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row전라남도
2nd row전라남도
3rd row전라남도
4th row전라남도
5th row전라남도

Common Values

ValueCountFrequency (%)
전라남도 207
100.0%

Length

2024-04-21T10:48:08.335970image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T10:48:08.424827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전라남도 207
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
장흥군
207 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row장흥군
2nd row장흥군
3rd row장흥군
4th row장흥군
5th row장흥군

Common Values

ValueCountFrequency (%)
장흥군 207
100.0%

Length

2024-04-21T10:48:08.516227image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T10:48:08.598289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
장흥군 207
100.0%

자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
46800
207 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row46800
2nd row46800
3rd row46800
4th row46800
5th row46800

Common Values

ValueCountFrequency (%)
46800 207
100.0%

Length

2024-04-21T10:48:08.689688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T10:48:08.798591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
46800 207
100.0%

과세년도
Real number (ℝ)

HIGH CORRELATION 

Distinct6
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2019.5217
Minimum2017
Maximum2022
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2024-04-21T10:48:08.915889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2017
5-th percentile2017
Q12018
median2020
Q32021
95-th percentile2022
Maximum2022
Range5
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.7118266
Coefficient of variation (CV)0.00084763961
Kurtosis-1.2695962
Mean2019.5217
Median Absolute Deviation (MAD)1
Skewness-0.019783528
Sum418041
Variance2.9303504
MonotonicityIncreasing
2024-04-21T10:48:09.019605image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
2020 35
16.9%
2021 35
16.9%
2022 35
16.9%
2017 34
16.4%
2018 34
16.4%
2019 34
16.4%
ValueCountFrequency (%)
2017 34
16.4%
2018 34
16.4%
2019 34
16.4%
2020 35
16.9%
2021 35
16.9%
2022 35
16.9%
ValueCountFrequency (%)
2022 35
16.9%
2021 35
16.9%
2020 35
16.9%
2019 34
16.4%
2018 34
16.4%
2017 34
16.4%

세목명
Categorical

Distinct12
Distinct (%)5.8%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
재산세
24 
주민세
24 
취득세
24 
자동차세
24 
등록면허세
24 
Other values (7)
87 

Length

Max length7
Median length5
Mean length4.1111111
Min length3

Unique

Unique1 ?
Unique (%)0.5%

Sample

1st row등록세
2nd row등록세
3rd row등록세
4th row등록세
5th row재산세

Common Values

ValueCountFrequency (%)
재산세 24
11.6%
주민세 24
11.6%
취득세 24
11.6%
자동차세 24
11.6%
등록면허세 24
11.6%
지방소득세 24
11.6%
등록세 23
11.1%
지역자원시설세 18
8.7%
담배소비세 16
7.7%
지방소비세 3
 
1.4%
Other values (2) 3
 
1.4%

Length

2024-04-21T10:48:09.135941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
재산세 24
11.6%
주민세 24
11.6%
취득세 24
11.6%
자동차세 24
11.6%
등록면허세 24
11.6%
지방소득세 24
11.6%
등록세 23
11.1%
지역자원시설세 18
8.7%
담배소비세 16
7.7%
지방소비세 3
 
1.4%
Other values (2) 3
 
1.4%

납세자유형
Categorical

Distinct2
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
법인
104 
개인
103 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row개인
2nd row개인
3rd row법인
4th row법인
5th row개인

Common Values

ValueCountFrequency (%)
법인 104
50.2%
개인 103
49.8%

Length

2024-04-21T10:48:09.246953image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T10:48:09.329083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
법인 104
50.2%
개인 103
49.8%
Distinct2
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size339.0 B
True
104 
False
103 
ValueCountFrequency (%)
True 104
50.2%
False 103
49.8%
2024-04-21T10:48:09.419734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

납세자수
Real number (ℝ)

Distinct162
Distinct (%)78.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3258.8406
Minimum1
Maximum28184
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2024-04-21T10:48:09.532003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q147
median610
Q31935.5
95-th percentile24042.5
Maximum28184
Range28183
Interquartile range (IQR)1888.5

Descriptive statistics

Standard deviation6686.694
Coefficient of variation (CV)2.0518629
Kurtosis5.5935455
Mean3258.8406
Median Absolute Deviation (MAD)607
Skewness2.5725592
Sum674580
Variance44711877
MonotonicityNot monotonic
2024-04-21T10:48:09.672503image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 14
 
6.8%
3 11
 
5.3%
2 11
 
5.3%
5 4
 
1.9%
14 3
 
1.4%
12 2
 
1.0%
735 2
 
1.0%
204 2
 
1.0%
2685 2
 
1.0%
47 2
 
1.0%
Other values (152) 154
74.4%
ValueCountFrequency (%)
1 14
6.8%
2 11
5.3%
3 11
5.3%
4 1
 
0.5%
5 4
 
1.9%
12 2
 
1.0%
13 1
 
0.5%
14 3
 
1.4%
16 1
 
0.5%
17 1
 
0.5%
ValueCountFrequency (%)
28184 1
0.5%
27756 1
0.5%
27387 1
0.5%
27019 1
0.5%
26586 1
0.5%
26306 1
0.5%
24328 1
0.5%
24289 1
0.5%
24231 1
0.5%
24133 1
0.5%

Interactions

2024-04-21T10:48:07.559988image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T10:48:06.942241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T10:48:07.331441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T10:48:07.630789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T10:48:07.062594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T10:48:07.412846image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T10:48:07.708677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T10:48:07.263908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T10:48:07.488642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T10:48:09.774385image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번과세년도세목명납세자유형관내_관외납세자수
연번1.0000.9420.3760.0000.0000.000
과세년도0.9421.0000.0000.0000.0000.000
세목명0.3760.0001.0000.0000.0940.694
납세자유형0.0000.0000.0001.0000.0000.634
관내_관외0.0000.0000.0940.0001.0000.592
납세자수0.0000.0000.6940.6340.5921.000
2024-04-21T10:48:09.879194image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
세목명납세자유형관내_관외
세목명1.0000.0000.069
납세자유형0.0001.0000.000
관내_관외0.0690.0001.000
2024-04-21T10:48:10.102380image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번과세년도납세자수세목명납세자유형관내_관외
연번1.0000.986-0.0230.1650.0000.000
과세년도0.9861.0000.0130.0000.0000.000
납세자수-0.0230.0131.0000.3760.4730.441
세목명0.1650.0000.3761.0000.0000.069
납세자유형0.0000.0000.4730.0001.0000.000
관내_관외0.0000.0000.4410.0690.0001.000

Missing values

2024-04-21T10:48:07.815724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T10:48:07.933022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번시도명시군구명자치단체코드과세년도세목명납세자유형관내_관외납세자수
01전라남도장흥군468002017등록세개인N164
12전라남도장흥군468002017등록세개인Y144
23전라남도장흥군468002017등록세법인N3
34전라남도장흥군468002017등록세법인Y3
45전라남도장흥군468002017재산세개인N26306
56전라남도장흥군468002017재산세개인Y24053
67전라남도장흥군468002017재산세법인N706
78전라남도장흥군468002017재산세법인Y2685
89전라남도장흥군468002017주민세개인N2211
910전라남도장흥군468002017주민세개인Y17942
연번시도명시군구명자치단체코드과세년도세목명납세자유형관내_관외납세자수
197198전라남도장흥군468002022등록면허세법인N641
198199전라남도장흥군468002022등록면허세법인Y838
199200전라남도장흥군468002022지방소득세개인N1332
200201전라남도장흥군468002022지방소득세개인Y5635
201202전라남도장흥군468002022지방소득세법인N287
202203전라남도장흥군468002022지방소득세법인Y758
203204전라남도장흥군468002022지방소비세법인Y1
204205전라남도장흥군468002022지역자원시설세개인N2
205206전라남도장흥군468002022지역자원시설세개인Y13
206207전라남도장흥군468002022지역자원시설세법인Y1