Overview

Dataset statistics

Number of variables: 8
Number of observations: 73
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 4.9 KiB
Average record size in memory: 68.8 B

Variable types

Categorical: 5
Numeric: 2
Boolean: 1

Dataset

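Description: Data on the number of taxpayers by year and tax item. It provides fields such as province/city name (시도명), tax year (과세연도), tax item name (세목명), taxpayer type (납세자유형), inside/outside jurisdiction (관내_관외), and number of taxpayers (납세자수).
URL: https://www.data.go.kr/data/15079143/fileData.do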

Alerts

시도명 has constant value "대구광역시" (Constant)
시군구명 has constant value "대구광역시" (Constant)
자치단체코드 has constant value "27000" (Constant)

Reproduction

Analysis started: 2023-12-12 19:57:39.981903
Analysis finished: 2023-12-12 19:57:41.022015
Duration: 1.04 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

시도명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.4%
Missing: 0
Missing (%): 0.0%
Memory size: 716.0 B

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 대구광역시
2nd row: 대구광역시
3rd row: 대구광역시
4th row: 대구광역시
5th row: 대구광역시

Common Values

Value  Count  Frequency (%)
대구광역시  73  100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

시군구명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.4%
Missing: 0
Missing (%): 0.0%
Memory size: 716.0 B

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 대구광역시
2nd row: 대구광역시
3rd row: 대구광역시
4th row: 대구광역시
5th row: 대구광역시

Common Values

Value  Count  Frequency (%)
대구광역시  73  100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

자치단체코드
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.4%
Missing: 0
Missing (%): 0.0%
Memory size: 716.0 B

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 27000
2nd row: 27000
3rd row: 27000
4th row: 27000
5th row: 27000

Common Values

Value  Count  Frequency (%)
27000  73  100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

과세년도
Real number (ℝ)

Distinct: 6
Distinct (%): 8.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2019.3699
Minimum: 2017
Maximum: 2022
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 789.0 B

Quantile statistics

Minimum: 2017
5th percentile: 2017
Q1: 2018
Median: 2019
Q3: 2021
95th percentile: 2022
Maximum: 2022
Range: 5
Interquartile range (IQR): 3

Descriptive statistics

Standard deviation: 1.744092
Coefficient of variation (CV): 0.00086368131
Kurtosis: -1.2784909
Mean: 2019.3699
Median Absolute Deviation (MAD): 1
Skewness: 0.11921795
Sum: 147414
Variance: 3.0418569
Monotonicity: Increasing
[Figure: Histogram with fixed size bins (bins=6)]

Common Values

Value  Count  Frequency (%)
2017  14  19.2%
2018  13  17.8%
2019  12  16.4%
2020  12  16.4%
2022  12  16.4%
2021  10  13.7%
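
The descriptive statistics for 과세년도 can be cross-checked from the value counts alone. The following is a minimal sketch using only the Python standard library; it assumes the variance is the sample variance (divisor n - 1), which reproduces the reported figure. The variable names are illustrative and not part of the report.

```python
import math
import statistics

# Value counts for 과세년도 as listed in the report: value -> count.
year_counts = {2017: 14, 2018: 13, 2019: 12, 2020: 12, 2021: 10, 2022: 12}

# Expand the counts back into one observation per row.
years = [year for year, count in year_counts.items() for _ in range(count)]

n = len(years)                           # number of observations
total = sum(years)                       # "Sum" in the report
mean = total / n                         # "Mean"
variance = statistics.variance(years)    # sample variance (assumed divisor n - 1)
std_dev = math.sqrt(variance)            # "Standard deviation"
cv = std_dev / mean                      # "Coefficient of variation (CV)"

print(f"n={n} sum={total} mean={mean:.4f}")
print(f"variance={variance:.7f} std={std_dev:.6f} cv={cv:.11f}")
```

The CV is very small because the standard deviation (under 2 years) is divided by a mean near 2019, so for a year column it says little about spread.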

세목명
Categorical

Distinct: 6
Distinct (%): 8.2%
Missing: 0
Missing (%): 0.0%
Memory size: 716.0 B

Length

Max length: 5
Median length: 5
Mean length: 4.4931507
Min length: 3

Unique

Unique: 1
Unique (%): 1.4%

Sample

1st row: 취득세
2nd row: 취득세
3rd row: 취득세
4th row: 취득세
5th row: 자동차세

Common Values

Value  Count  Frequency (%)
담배소비세 (tobacco consumption tax)  24  32.9%
지방소득세 (local income tax)  20  27.4%
취득세 (acquisition tax)  15  20.5%
자동차세 (automobile tax)  7  9.6%
지방소비세 (local consumption tax)  6  8.2%
등록면허세 (registration and license tax)  1  1.4%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]
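
The report distinguishes "Distinct" (how many different values occur) from "Unique" (how many values occur exactly once). A minimal sketch of that distinction for 세목명, rebuilt from the value counts listed for this variable; the variable names are illustrative, and the NFC normalization is an assumption made so that each Hangul syllable counts as one character.

```python
import unicodedata
from collections import Counter

# Value counts for 세목명 as listed in the report.
tax_item_counts = Counter({
    "담배소비세": 24,
    "지방소득세": 20,
    "취득세": 15,
    "자동차세": 7,
    "지방소비세": 6,
    "등록면허세": 1,
})

n = sum(tax_item_counts.values())

# Distinct: number of different values present.
distinct = len(tax_item_counts)

# Unique: number of values that appear exactly once.
unique = sum(1 for count in tax_item_counts.values() if count == 1)

# Mean length: character length of each value, weighted by its count.
# NFC normalization (assumed) makes each Hangul syllable one character.
mean_length = sum(
    len(unicodedata.normalize("NFC", value)) * count
    for value, count in tax_item_counts.items()
) / n

print(distinct, round(100 * distinct / n, 1))
print(unique, round(100 * unique / n, 1))
print(round(mean_length, 7))
```

Only 등록면허세 occurs once, which is why Unique is 1 while Distinct is 6.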

납세자유형
Categorical

Distinct: 2
Distinct (%): 2.7%
Missing: 0
Missing (%): 0.0%
Memory size: 716.0 B

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 개인
2nd row: 개인
3rd row: 법인
4th row: 법인
5th row: 개인

Common Values

Value  Count  Frequency (%)
개인 (individual)  43  58.9%
법인 (corporation)  30  41.1%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

관내_관외
Boolean

Distinct: 2
Distinct (%): 2.7%
Missing: 0
Missing (%): 0.0%
Memory size: 205.0 B

Value  Count  Frequency (%)
True  45  61.6%
False  28  38.4%

납세자수
Real number (ℝ)

Distinct: 20
Distinct (%): 27.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 5.4246575
Minimum: 1
Maximum: 48
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 789.0 B

Quantile statistics

Minimum: 1
5th percentile: 1
Q1: 1
Median: 2
Q3: 6
95th percentile: 17.4
Maximum: 48
Range: 47
Interquartile range (IQR): 5
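
A non-integer percentile such as 17.4 comes from interpolating between two neighbouring observations. The sketch below rebuilds the 73 values of 납세자수 from the value counts listed for this variable and applies linear interpolation between ranks. The interpolation rule is an assumption (it reproduces the reported figures), and the `quantile` helper is hypothetical, not a function of the profiling library.

```python
import math

# Value counts for 납세자수, combined from the frequency listings in the report.
taxpayer_counts = {
    1: 27, 2: 10, 3: 10, 4: 3, 5: 3, 6: 2, 7: 3, 8: 2, 9: 1, 10: 1,
    11: 2, 12: 1, 13: 1, 14: 1, 16: 1, 17: 1, 18: 1, 27: 1, 37: 1, 48: 1,
}

# Expand into a sorted list with one entry per observation.
values = sorted(v for v, count in taxpayer_counts.items() for _ in range(count))


def quantile(sorted_values, q):
    """Return the q-quantile using linear interpolation between ranks."""
    position = q * (len(sorted_values) - 1)   # fractional 0-based rank
    lower = math.floor(position)
    upper = math.ceil(position)
    fraction = position - lower
    low_value = sorted_values[lower]
    high_value = sorted_values[upper]
    return low_value + fraction * (high_value - low_value)


q1 = quantile(values, 0.25)
median = quantile(values, 0.50)
q3 = quantile(values, 0.75)
p95 = quantile(values, 0.95)
iqr = q3 - q1

print(q1, median, q3, round(p95, 1), iqr)
```

With 73 observations the 95th percentile falls at rank 68.4, between the sorted values 17 and 18, which gives 17 + 0.4 × (18 - 17) = 17.4.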

Descriptive statistics

Standard deviation: 8.0638387
Coefficient of variation (CV): 1.4865157
Kurtosis: 13.212551
Mean: 5.4246575
Median Absolute Deviation (MAD): 1
Skewness: 3.3337546
Sum: 396
Variance: 65.025495
Monotonicity: Not monotonic
[Figure: Histogram with fixed size bins (bins=20)]

Common Values

Value  Count  Frequency (%)
1  27  37.0%
2  10  13.7%
3  10  13.7%
5  3  4.1%
7  3  4.1%
4  3  4.1%
8  2  2.7%
6  2  2.7%
11  2  2.7%
48  1  1.4%
Other values (10)  10  13.7%

Minimum 10 values

Value  Count  Frequency (%)
1  27  37.0%
2  10  13.7%
3  10  13.7%
4  3  4.1%
5  3  4.1%
6  2  2.7%
7  3  4.1%
8  2  2.7%
9  1  1.4%
10  1  1.4%

Maximum 10 values

Value  Count  Frequency (%)
48  1  1.4%
37  1  1.4%
27  1  1.4%
18  1  1.4%
17  1  1.4%
16  1  1.4%
14  1  1.4%
13  1  1.4%
12  1  1.4%
11  2  2.7%

Interactions

[Figure: interaction plots (4 panels)]

Correlations

Variable  과세년도  세목명  납세자유형  관내_관외  납세자수
과세년도  1.000  0.000  0.000  0.000  0.146
세목명  0.000  1.000  0.320  0.430  0.142
납세자유형  0.000  0.320  1.000  0.197  0.201
관내_관외  0.000  0.430  0.197  1.000  0.142
납세자수  0.146  0.142  0.201  0.142  1.000

Variable  납세자유형  세목명  관내_관외
납세자유형  1.000  0.221  0.125
세목명  0.221  1.000  0.300
관내_관외  0.125  0.300  1.000

Variable  과세년도  납세자수  세목명  납세자유형  관내_관외
과세년도  1.000  -0.051  0.000  0.000  0.000
납세자수  -0.051  1.000  0.076  0.205  0.143
세목명  0.000  0.076  1.000  0.221  0.300
납세자유형  0.000  0.205  0.221  1.000  0.125
관내_관외  0.000  0.143  0.300  0.125  1.000

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

First rows

Index  시도명  시군구명  자치단체코드  과세년도  세목명  납세자유형  관내_관외  납세자수
0  대구광역시  대구광역시  27000  2017  취득세  개인  N  1
1  대구광역시  대구광역시  27000  2017  취득세  개인  Y  3
2  대구광역시  대구광역시  27000  2017  취득세  법인  N  2
3  대구광역시  대구광역시  27000  2017  취득세  법인  Y  2
4  대구광역시  대구광역시  27000  2017  자동차세  개인  Y  1
5  대구광역시  대구광역시  27000  2017  자동차세  법인  Y  1
6  대구광역시  대구광역시  27000  2017  담배소비세  개인  N  8
7  대구광역시  대구광역시  27000  2017  담배소비세  개인  Y  5
8  대구광역시  대구광역시  27000  2017  담배소비세  법인  N  6
9  대구광역시  대구광역시  27000  2017  담배소비세  법인  Y  3

Last rows

Index  시도명  시군구명  자치단체코드  과세년도  세목명  납세자유형  관내_관외  납세자수
63  대구광역시  대구광역시  27000  2022  취득세  법인  N  2
64  대구광역시  대구광역시  27000  2022  자동차세  개인  Y  1
65  대구광역시  대구광역시  27000  2022  담배소비세  개인  N  37
66  대구광역시  대구광역시  27000  2022  담배소비세  개인  Y  3
67  대구광역시  대구광역시  27000  2022  담배소비세  법인  N  2
68  대구광역시  대구광역시  27000  2022  담배소비세  법인  Y  3
69  대구광역시  대구광역시  27000  2022  지방소득세  개인  Y  17
70  대구광역시  대구광역시  27000  2022  지방소득세  법인  N  1
71  대구광역시  대구광역시  27000  2022  지방소득세  법인  Y  2
72  대구광역시  대구광역시  27000  2022  지방소비세  개인  Y  1