Overview

Dataset statistics

Number of variables9
Number of observations201
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory14.9 KiB
Average record size in memory75.7 B

Variable types

Categorical5
Numeric2
Boolean1
DateTime1

Dataset

Description지방세납세자현황으로 취득세 개인과법인 등록면허세 개인과법인 지역자원시설세 개인법인,지방교육세 개인법인 주민세 개인,법인 재산세 개인법인,자동차세 개인법인,지방소득세 개인법인
URLhttps://www.data.go.kr/data/15078939/fileData.do

Alerts

시도명 has constant value ""Constant
시군구명 has constant value ""Constant
자치단체코드 has constant value ""Constant
데이터기준일 has constant value ""Constant

Reproduction

Analysis started2023-12-12 10:11:16.170928
Analysis finished2023-12-12 10:11:17.223477
Duration1.05 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
전라남도
201 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row전라남도
2nd row전라남도
3rd row전라남도
4th row전라남도
5th row전라남도

Common Values

ValueCountFrequency (%)
전라남도 201
100.0%

Length

2023-12-12T19:11:17.318040image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:11:17.472238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전라남도 201
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
화순군
201 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row화순군
2nd row화순군
3rd row화순군
4th row화순군
5th row화순군

Common Values

ValueCountFrequency (%)
화순군 201
100.0%

Length

2023-12-12T19:11:17.639603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:11:17.812342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
화순군 201
100.0%

자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
46790
201 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row46790
2nd row46790
3rd row46790
4th row46790
5th row46790

Common Values

ValueCountFrequency (%)
46790 201
100.0%

Length

2023-12-12T19:11:17.968086image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:11:18.136567image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
46790 201
100.0%

과세년도
Real number (ℝ)

Distinct6
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2019.602
Minimum2017
Maximum2022
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-12T19:11:18.276990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2017
5-th percentile2017
Q12018
median2020
Q32021
95-th percentile2022
Maximum2022
Range5
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.7177881
Coefficient of variation (CV)0.00085055775
Kurtosis-1.2739133
Mean2019.602
Median Absolute Deviation (MAD)1
Skewness-0.076378175
Sum405940
Variance2.950796
MonotonicityIncreasing
2023-12-12T19:11:18.492253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
2022 37
18.4%
2021 35
17.4%
2020 34
16.9%
2018 32
15.9%
2019 32
15.9%
2017 31
15.4%
ValueCountFrequency (%)
2017 31
15.4%
2018 32
15.9%
2019 32
15.9%
2020 34
16.9%
2021 35
17.4%
2022 37
18.4%
ValueCountFrequency (%)
2022 37
18.4%
2021 35
17.4%
2020 34
16.9%
2019 32
15.9%
2018 32
15.9%
2017 31
15.4%

세목명
Categorical

Distinct11
Distinct (%)5.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
재산세
24 
주민세
24 
취득세
24 
자동차세
24 
등록면허세
24 
Other values (6)
81 

Length

Max length7
Median length5
Mean length4.1144279
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row등록세
2nd row등록세
3rd row등록세
4th row재산세
5th row재산세

Common Values

ValueCountFrequency (%)
재산세 24
11.9%
주민세 24
11.9%
취득세 24
11.9%
자동차세 24
11.9%
등록면허세 24
11.9%
지방소득세 24
11.9%
등록세 23
11.4%
지역자원시설세 20
10.0%
담배소비세 9
 
4.5%
지방소비세 3
 
1.5%

Length

2023-12-12T19:11:18.724059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
재산세 24
11.9%
주민세 24
11.9%
취득세 24
11.9%
자동차세 24
11.9%
등록면허세 24
11.9%
지방소득세 24
11.9%
등록세 23
11.4%
지역자원시설세 20
10.0%
담배소비세 9
 
4.5%
지방소비세 3
 
1.5%

납세자유형
Categorical

Distinct2
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
법인
101 
개인
100 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row개인
2nd row개인
3rd row법인
4th row개인
5th row개인

Common Values

ValueCountFrequency (%)
법인 101
50.2%
개인 100
49.8%

Length

2023-12-12T19:11:18.938859image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:11:19.084265image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
법인 101
50.2%
개인 100
49.8%
Distinct2
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size333.0 B
False
102 
True
99 
ValueCountFrequency (%)
False 102
50.7%
True 99
49.3%
2023-12-12T19:11:19.215847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

납세자수
Real number (ℝ)

Distinct169
Distinct (%)84.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5070.4726
Minimum1
Maximum48186
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-12T19:11:19.706239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3
Q1185
median1091
Q33253
95-th percentile28611
Maximum48186
Range48185
Interquartile range (IQR)3068

Descriptive statistics

Standard deviation10270.827
Coefficient of variation (CV)2.0256152
Kurtosis7.1307626
Mean5070.4726
Median Absolute Deviation (MAD)1083
Skewness2.7575657
Sum1019165
Variance1.0548988 × 108
MonotonicityNot monotonic
2023-12-12T19:11:19.901456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
7 8
 
4.0%
1 7
 
3.5%
8 7
 
3.5%
6 5
 
2.5%
3 5
 
2.5%
12 2
 
1.0%
1091 2
 
1.0%
5 2
 
1.0%
201 2
 
1.0%
956 2
 
1.0%
Other values (159) 159
79.1%
ValueCountFrequency (%)
1 7
3.5%
3 5
2.5%
4 1
 
0.5%
5 2
 
1.0%
6 5
2.5%
7 8
4.0%
8 7
3.5%
9 1
 
0.5%
10 1
 
0.5%
12 2
 
1.0%
ValueCountFrequency (%)
48186 1
0.5%
47505 1
0.5%
46812 1
0.5%
46337 1
0.5%
45938 1
0.5%
45425 1
0.5%
30505 1
0.5%
29836 1
0.5%
29338 1
0.5%
28823 1
0.5%

데이터기준일
Date

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
Minimum2023-06-14 00:00:00
Maximum2023-06-14 00:00:00
2023-12-12T19:11:20.035779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:11:20.146049image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-12T19:11:16.654725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:11:16.452799image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:11:16.793365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:11:16.548710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T19:11:20.236524image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과세년도세목명납세자유형관내_관외납세자수
과세년도1.0000.0000.0000.0000.000
세목명0.0001.0000.0000.1660.548
납세자유형0.0000.0001.0000.0000.646
관내_관외0.0000.1660.0001.0000.613
납세자수0.0000.5480.6460.6131.000
2023-12-12T19:11:20.396127image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
세목명관내_관외납세자유형
세목명1.0000.1540.000
관내_관외0.1541.0000.000
납세자유형0.0000.0001.000
2023-12-12T19:11:20.521995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과세년도납세자수세목명납세자유형관내_관외
과세년도1.000-0.0660.0000.0000.000
납세자수-0.0661.0000.2940.4830.457
세목명0.0000.2941.0000.0000.154
납세자유형0.0000.4830.0001.0000.000
관내_관외0.0000.4570.1540.0001.000

Missing values

2023-12-12T19:11:16.953225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T19:11:17.140505image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명자치단체코드과세년도세목명납세자유형관내_관외납세자수데이터기준일
0전라남도화순군467902017등록세개인N1972023-06-14
1전라남도화순군467902017등록세개인Y1312023-06-14
2전라남도화순군467902017등록세법인Y122023-06-14
3전라남도화순군467902017재산세개인N454252023-06-14
4전라남도화순군467902017재산세개인Y282352023-06-14
5전라남도화순군467902017재산세법인N19552023-06-14
6전라남도화순군467902017재산세법인Y26112023-06-14
7전라남도화순군467902017주민세개인N40022023-06-14
8전라남도화순군467902017주민세개인Y261622023-06-14
9전라남도화순군467902017주민세법인N3452023-06-14
시도명시군구명자치단체코드과세년도세목명납세자유형관내_관외납세자수데이터기준일
191전라남도화순군467902022등록면허세법인Y12642023-06-14
192전라남도화순군467902022지방소득세개인N24272023-06-14
193전라남도화순군467902022지방소득세개인Y112002023-06-14
194전라남도화순군467902022지방소득세법인N5272023-06-14
195전라남도화순군467902022지방소득세법인Y13442023-06-14
196전라남도화순군467902022지방소비세법인Y12023-06-14
197전라남도화순군467902022지역자원시설세개인N82023-06-14
198전라남도화순군467902022지역자원시설세개인Y52023-06-14
199전라남도화순군467902022지역자원시설세법인N12023-06-14
200전라남도화순군467902022지역자원시설세법인Y72023-06-14