Overview

Dataset statistics

Number of variables8
Number of observations207
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory13.7 KiB
Average record size in memory67.6 B

Variable types

Categorical5
Numeric2
Boolean1

Dataset

Description세목별 납세인원 현황을 제공관외 납세자에대한 부과징수 정책 수립시 기초자료로 활용납세자의 유형 : 개인, 법인, 사업자, 기타
Author경상북도 경산시
URLhttps://www.data.go.kr/data/15079702/fileData.do

Alerts

시도명 has constant value ""Constant
시군구명 has constant value ""Constant
자치단체코드 has constant value ""Constant
납세자수 is highly overall correlated with 납세자유형High correlation
납세자유형 is highly overall correlated with 납세자수High correlation

Reproduction

Analysis started2024-03-16 06:35:01.114359
Analysis finished2024-03-16 06:35:04.275991
Duration3.16 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
경상북도
207 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경상북도
2nd row경상북도
3rd row경상북도
4th row경상북도
5th row경상북도

Common Values

ValueCountFrequency (%)
경상북도 207
100.0%

Length

2024-03-16T06:35:04.493446image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-16T06:35:04.888773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경상북도 207
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
경산시
207 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경산시
2nd row경산시
3rd row경산시
4th row경산시
5th row경산시

Common Values

ValueCountFrequency (%)
경산시 207
100.0%

Length

2024-03-16T06:35:05.154840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-16T06:35:05.485263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경산시 207
100.0%

자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
47290
207 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row47290
2nd row47290
3rd row47290
4th row47290
5th row47290

Common Values

ValueCountFrequency (%)
47290 207
100.0%

Length

2024-03-16T06:35:05.913397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-16T06:35:06.288231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
47290 207
100.0%

과세년도
Real number (ℝ)

Distinct6
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2019.599
Minimum2017
Maximum2022
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2024-03-16T06:35:06.687951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2017
5-th percentile2017
Q12018
median2020
Q32021
95-th percentile2022
Maximum2022
Range5
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.7147558
Coefficient of variation (CV)0.00084905754
Kurtosis-1.2645824
Mean2019.599
Median Absolute Deviation (MAD)1
Skewness-0.076836749
Sum418057
Variance2.9403874
MonotonicityDecreasing
2024-03-16T06:35:07.209410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
2022 38
18.4%
2020 37
17.9%
2021 35
16.9%
2018 33
15.9%
2019 32
15.5%
2017 32
15.5%
ValueCountFrequency (%)
2017 32
15.5%
2018 33
15.9%
2019 32
15.5%
2020 37
17.9%
2021 35
16.9%
2022 38
18.4%
ValueCountFrequency (%)
2022 38
18.4%
2021 35
16.9%
2020 37
17.9%
2019 32
15.5%
2018 33
15.9%
2017 32
15.5%

세목명
Categorical

Distinct12
Distinct (%)5.8%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
등록세
24 
재산세
24 
주민세
24 
취득세
24 
자동차세
24 
Other values (7)
87 

Length

Max length7
Median length5
Mean length4.1014493
Min length3

Unique

Unique1 ?
Unique (%)0.5%

Sample

1st row등록세
2nd row등록세
3rd row등록세
4th row등록세
5th row레저세

Common Values

ValueCountFrequency (%)
등록세 24
11.6%
재산세 24
11.6%
주민세 24
11.6%
취득세 24
11.6%
자동차세 24
11.6%
등록면허세 24
11.6%
지방소득세 24
11.6%
지역자원시설세 20
9.7%
담배소비세 11
5.3%
레저세 4
 
1.9%
Other values (2) 4
 
1.9%

Length

2024-03-16T06:35:07.802830image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
등록세 24
11.6%
재산세 24
11.6%
주민세 24
11.6%
취득세 24
11.6%
자동차세 24
11.6%
등록면허세 24
11.6%
지방소득세 24
11.6%
지역자원시설세 20
9.7%
담배소비세 11
5.3%
레저세 4
 
1.9%
Other values (2) 4
 
1.9%

납세자유형
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
개인
105 
법인
102 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row개인
2nd row개인
3rd row법인
4th row법인
5th row개인

Common Values

ValueCountFrequency (%)
개인 105
50.7%
법인 102
49.3%

Length

2024-03-16T06:35:08.415823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-16T06:35:08.822028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
개인 105
50.7%
법인 102
49.3%
Distinct2
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size339.0 B
False
106 
True
101 
ValueCountFrequency (%)
False 106
51.2%
True 101
48.8%
2024-03-16T06:35:09.121861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

납세자수
Real number (ℝ)

HIGH CORRELATION 

Distinct178
Distinct (%)86.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13757.444
Minimum1
Maximum105707
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2024-03-16T06:35:09.717646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.3
Q1133
median2103
Q311285
95-th percentile85028.2
Maximum105707
Range105706
Interquartile range (IQR)11152

Descriptive statistics

Standard deviation26004.22
Coefficient of variation (CV)1.8901926
Kurtosis4.0204954
Mean13757.444
Median Absolute Deviation (MAD)2092
Skewness2.2562687
Sum2847791
Variance6.7621945 × 108
MonotonicityNot monotonic
2024-03-16T06:35:10.130099image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 7
 
3.4%
9 5
 
2.4%
3 5
 
2.4%
2 4
 
1.9%
12 3
 
1.4%
15 3
 
1.4%
1026 2
 
1.0%
11 2
 
1.0%
13 2
 
1.0%
4 2
 
1.0%
Other values (168) 172
83.1%
ValueCountFrequency (%)
1 7
3.4%
2 4
1.9%
3 5
2.4%
4 2
 
1.0%
5 2
 
1.0%
6 1
 
0.5%
7 2
 
1.0%
8 2
 
1.0%
9 5
2.4%
10 1
 
0.5%
ValueCountFrequency (%)
105707 1
0.5%
104964 1
0.5%
100791 1
0.5%
98438 1
0.5%
97507 1
0.5%
96394 1
0.5%
95190 1
0.5%
91651 1
0.5%
87627 1
0.5%
85701 1
0.5%

Interactions

2024-03-16T06:35:02.642915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T06:35:01.631671image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T06:35:02.898097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T06:35:02.347612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-16T06:35:10.390078image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과세년도세목명납세자유형관내 관외납세자수
과세년도1.0000.0000.0000.0000.000
세목명0.0001.0000.0000.2070.481
납세자유형0.0000.0001.0000.0000.734
관내 관외0.0000.2070.0001.0000.601
납세자수0.0000.4810.7340.6011.000
2024-03-16T06:35:10.760705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
관내 관외납세자유형세목명
관내 관외1.0000.0000.156
납세자유형0.0001.0000.000
세목명0.1560.0001.000
2024-03-16T06:35:10.947538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과세년도납세자수세목명납세자유형관내 관외
과세년도1.000-0.0620.0000.0000.000
납세자수-0.0621.0000.2230.5640.455
세목명0.0000.2231.0000.0000.156
납세자유형0.0000.5640.0001.0000.000
관내 관외0.0000.4550.1560.0001.000

Missing values

2024-03-16T06:35:03.422935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-16T06:35:04.091804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명자치단체코드과세년도세목명납세자유형관내 관외납세자수
0경상북도경산시472902022등록세개인N196
1경상북도경산시472902022등록세개인Y269
2경상북도경산시472902022등록세법인N2
3경상북도경산시472902022등록세법인Y7
4경상북도경산시472902022레저세개인N3
5경상북도경산시472902022레저세법인N2
6경상북도경산시472902022재산세개인N53145
7경상북도경산시472902022재산세개인Y85701
8경상북도경산시472902022재산세법인N1017
9경상북도경산시472902022재산세법인Y2459
시도명시군구명자치단체코드과세년도세목명납세자유형관내 관외납세자수
197경상북도경산시472902017등록면허세개인Y24946
198경상북도경산시472902017등록면허세법인N1703
199경상북도경산시472902017등록면허세법인Y2543
200경상북도경산시472902017지방소득세개인N5768
201경상북도경산시472902017지방소득세개인Y18648
202경상북도경산시472902017지방소득세법인N976
203경상북도경산시472902017지방소득세법인Y2572
204경상북도경산시472902017지역자원시설세개인N11
205경상북도경산시472902017지역자원시설세개인Y12
206경상북도경산시472902017지역자원시설세법인Y12