Overview

Dataset statistics

Number of variables: 8
Number of observations: 36
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.5 KiB
Average record size in memory: 70.7 B

Variable types

Categorical: 6
Boolean: 1
Numeric: 1

Dataset

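Description: Provides the number of taxpayers by tax item; the data can be used, for example, as baseline material for setting assessment and collection policy for taxpayers inside and outside the jurisdiction.
URL: https://www.data.go.kr/data/15078492/fileData.do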

Alerts

시도명 has constant value "대구광역시" (Constant)
시군구명 has constant value "북구" (Constant)
자치단체코드 has constant value "27230" (Constant)
과세년도 has constant value "2022" (Constant)
납세자수 is highly overall correlated with 납세자유형 (High correlation)
납세자유형 is highly overall correlated with 납세자수 (High correlation)
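
The Constant alerts are consistent with the distinct counts reported in the variable summaries: a column with exactly one distinct value is constant. A minimal Python sketch of that rule follows. The helper name `constant_columns` is hypothetical and is not part of ydata-profiling; the counts are copied from this report.

```python
# Distinct counts per column, copied from the variable summaries in this report.
distinct_counts = {
    "시도명": 1,
    "시군구명": 1,
    "자치단체코드": 1,
    "과세년도": 1,
    "세목명": 10,
    "납세자유형": 2,
    "관내외여부": 2,
    "납세자수": 32,
}


def constant_columns(counts):
    """Return the columns that hold a single distinct value (hypothetical helper)."""
    return [name for name, n_distinct in counts.items() if n_distinct == 1]


constant = constant_columns(distinct_counts)
print(constant)
```

The four columns returned are the same four that carry a Constant alert.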

Reproduction

Analysis started: 2023-12-12 11:58:21.101553
Analysis finished: 2023-12-12 11:58:21.599707
Duration: 0.5 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
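
The reported duration can be checked from the two timestamps. The sketch below uses only the Python standard library and assumes both timestamps are in the same time zone, which the report does not state.

```python
from datetime import datetime

# Timestamps copied from the Reproduction section of this report.
started = datetime.fromisoformat("2023-12-12 11:58:21.101553")
finished = datetime.fromisoformat("2023-12-12 11:58:21.599707")

# Elapsed time in seconds; the report shows it rounded to one decimal place.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.6f} s, rounded: {duration_seconds:.1f} s")
```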

Variables

시도명 (province or metropolitan city name)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 2.8%
Missing: 0
Missing (%): 0.0%
Memory size: 420.0 B

대구광역시: 36

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 대구광역시
2nd row: 대구광역시
3rd row: 대구광역시
4th row: 대구광역시
5th row: 대구광역시

Common Values

Value | Count | Frequency (%)
대구광역시 | 36 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

시군구명 (city, county, or district name)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 2.8%
Missing: 0
Missing (%): 0.0%
Memory size: 420.0 B

북구: 36

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 북구
2nd row: 북구
3rd row: 북구
4th row: 북구
5th row: 북구

Common Values

Value | Count | Frequency (%)
북구 | 36 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

자치단체코드 (local government code)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 2.8%
Missing: 0
Missing (%): 0.0%
Memory size: 420.0 B

27230: 36

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 27230
2nd row: 27230
3rd row: 27230
4th row: 27230
5th row: 27230

Common Values

Value | Count | Frequency (%)
27230 | 36 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

과세년도 (tax year)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 2.8%
Missing: 0
Missing (%): 0.0%
Memory size: 420.0 B

2022: 36

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2022
2nd row: 2022
3rd row: 2022
4th row: 2022
5th row: 2022

Common Values

Value | Count | Frequency (%)
2022 | 36 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

세목명 (tax item name)
Categorical

Distinct: 10
Distinct (%): 27.8%
Missing: 0
Missing (%): 0.0%
Memory size: 420.0 B

등록세: 4
재산세: 4
주민세: 4
취득세: 4
자동차세: 4
Other values (5): 16

Length

Max length: 7
Median length: 3
Mean length: 4.0555556
Min length: 3
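
The length statistics can be recomputed from the value counts that this report lists for 세목명 under Common Values. The sketch below is an illustrative check using the Python standard library, not the profiler's own code; it assumes length means the number of characters in each value.

```python
from statistics import mean, median

# Value counts for 세목명, copied from the Common Values table of this report.
value_counts = {
    "등록세": 4, "재산세": 4, "주민세": 4, "취득세": 4, "자동차세": 4,
    "등록면허세": 4, "지방소득세": 4, "지역자원시설세": 4, "레저세": 3, "지방소비세": 1,
}

# Expand to one character length per row (36 rows in total).
lengths = [len(value) for value, count in value_counts.items() for _ in range(count)]

length_stats = {
    "max": max(lengths),
    "median": median(lengths),
    "mean": round(mean(lengths), 7),
    "min": min(lengths),
}
print(length_stats)
```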

Unique

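Unique: 1
Unique (%): 2.8%

Sample

1st row: 등록세
2nd row: 등록세
3rd row: 등록세
4th row: 등록세
5th row: 레저세

Common Values

Value | Count | Frequency (%)
등록세 (registration tax) | 4 | 11.1%
재산세 (property tax) | 4 | 11.1%
주민세 (resident tax) | 4 | 11.1%
취득세 (acquisition tax) | 4 | 11.1%
자동차세 (automobile tax) | 4 | 11.1%
등록면허세 (registration and license tax) | 4 | 11.1%
지방소득세 (local income tax) | 4 | 11.1%
지역자원시설세 (local resource and facility tax) | 4 | 11.1%
레저세 (leisure tax) | 3 | 8.3%
지방소비세 (local consumption tax) | 1 | 2.8%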
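
The Frequency (%), Distinct, and Unique figures for 세목명 all follow from the value counts. The sketch below assumes that Distinct counts the different values and that Unique counts the values occurring exactly once, which is consistent with the numbers in this report.

```python
# Value counts for 세목명, copied from the Common Values table of this report.
value_counts = {
    "등록세": 4, "재산세": 4, "주민세": 4, "취득세": 4, "자동차세": 4,
    "등록면허세": 4, "지방소득세": 4, "지역자원시설세": 4, "레저세": 3, "지방소비세": 1,
}

n_rows = sum(value_counts.values())  # 36 observations

# Frequency (%) is each count divided by the number of rows.
frequency_pct = {v: round(100 * c / n_rows, 1) for v, c in value_counts.items()}

# Distinct: number of different values. Unique: values that occur exactly once.
n_distinct = len(value_counts)
n_unique = sum(1 for c in value_counts.values() if c == 1)

print(frequency_pct)
print(n_distinct, round(100 * n_distinct / n_rows, 1))
print(n_unique, round(100 * n_unique / n_rows, 1))
```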

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

납세자유형 (taxpayer type)
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): 5.6%
Missing: 0
Missing (%): 0.0%
Memory size: 420.0 B

법인 (corporation): 19
개인 (individual): 17

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 개인
2nd row: 개인
3rd row: 법인
4th row: 법인
5th row: 개인

Common Values

Value | Count | Frequency (%)
법인 (corporation) | 19 | 52.8%
개인 (individual) | 17 | 47.2%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

관내외여부 (inside/outside jurisdiction flag)
Boolean

Distinct: 2
Distinct (%): 5.6%
Missing: 0
Missing (%): 0.0%
Memory size: 168.0 B

False: 18
True: 18

Value | Count | Frequency (%)
False | 18 | 50.0%
True | 18 | 50.0%

납세자수 (number of taxpayers)
Real number (ℝ)

HIGH CORRELATION

Distinct: 32
Distinct (%): 88.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 20238.528
Minimum: 1
Maximum: 159306
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 456.0 B

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 46
Median: 1948.5
Q3: 17866.75
95-th percentile: 127106.5
Maximum: 159306
Range: 159305
Interquartile range (IQR): 17820.75

Descriptive statistics

Standard deviation: 41079.248
Coefficient of variation (CV): 2.0297548
Kurtosis: 5.3590262
Mean: 20238.528
Median Absolute Deviation (MAD): 1946.5
Skewness: 2.48241
Sum: 728587
Variance: 1.6875046 × 10^9
Monotonicity: Not monotonic
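
Several of the statistics for 납세자수 are derived from others, so they can be cross-checked against each other. The sketch below recomputes the mean, range, IQR, coefficient of variation, and variance from the reported sum, count, extremes, quartiles, and standard deviation. Because the reported inputs are rounded, the comparisons in the comments hold only up to rounding.

```python
# Figures copied from the 납세자수 summary in this report.
n = 36
total = 728587
minimum, maximum = 1, 159306
q1, q3 = 46, 17866.75
std = 41079.248

mean = total / n                 # reported mean: 20238.528
value_range = maximum - minimum  # reported range: 159305
iqr = q3 - q1                    # reported IQR: 17820.75
cv = std / mean                  # reported CV: 2.0297548
variance = std ** 2              # reported variance: 1.6875046 × 10^9

print(round(mean, 3), value_range, iqr, round(cv, 7), f"{variance:.7e}")
```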
[Figure: Histogram with fixed size bins (bins=32)]

Common values

Value | Count | Frequency (%)
2 | 3 | 8.3%
1 | 3 | 8.3%
40 | 1 | 2.8%
85702 | 1 | 2.8%
17710 | 1 | 2.8%
39635 | 1 | 2.8%
2595 | 1 | 2.8%
3782 | 1 | 2.8%
23798 | 1 | 2.8%
4197 | 1 | 2.8%
Other values (22) | 22 | 61.1%

Minimum 10 values

Value | Count | Frequency (%)
1 | 3 | 8.3%
2 | 3 | 8.3%
26 | 1 | 2.8%
36 | 1 | 2.8%
40 | 1 | 2.8%
48 | 1 | 2.8%
91 | 1 | 2.8%
112 | 1 | 2.8%
452 | 1 | 2.8%
581 | 1 | 2.8%

Maximum 10 values

Value | Count | Frequency (%)
159306 | 1 | 2.8%
142168 | 1 | 2.8%
122086 | 1 | 2.8%
85702 | 1 | 2.8%
45052 | 1 | 2.8%
39635 | 1 | 2.8%
29468 | 1 | 2.8%
23798 | 1 | 2.8%
18337 | 1 | 2.8%
17710 | 1 | 2.8%

Interactions

[Figure: interactions plot]

Correlations

[Figure: correlation heatmap]

Variable | 세목명 | 납세자유형 | 관내외여부 | 납세자수
세목명 | 1.000 | 0.000 | 0.000 | 0.000
납세자유형 | 0.000 | 1.000 | 0.000 | 0.523
관내외여부 | 0.000 | 0.000 | 1.000 | 0.000
납세자수 | 0.000 | 0.523 | 0.000 | 1.000

[Figure: correlation heatmap]

Variable | 납세자유형 | 관내외여부 | 세목명
납세자유형 | 1.000 | 0.000 | 0.000
관내외여부 | 0.000 | 1.000 | 0.000
세목명 | 0.000 | 0.000 | 1.000

[Figure: correlation heatmap]

Variable | 납세자수 | 세목명 | 납세자유형 | 관내외여부
납세자수 | 1.000 | 0.000 | 0.516 | 0.000
세목명 | 0.000 | 1.000 | 0.000 | 0.000
납세자유형 | 0.516 | 0.000 | 1.000 | 0.000
관내외여부 | 0.000 | 0.000 | 0.000 | 1.000
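
The High correlation alerts for 납세자수 and 납세자유형 line up with the only non-zero off-diagonal entry in the first matrix (0.523). The sketch below scans that matrix for pairs at or above a cut-off. The cut-off of 0.5 is an assumption, since this report does not state the threshold it used, and `highly_correlated_pairs` is a hypothetical helper rather than a ydata-profiling function.

```python
# First correlation matrix from this report, in the column order shown there.
columns = ["세목명", "납세자유형", "관내외여부", "납세자수"]
matrix = [
    [1.000, 0.000, 0.000, 0.000],
    [0.000, 1.000, 0.000, 0.523],
    [0.000, 0.000, 1.000, 0.000],
    [0.000, 0.523, 0.000, 1.000],
]

THRESHOLD = 0.5  # assumed cut-off; not stated in this report


def highly_correlated_pairs(columns, matrix, threshold):
    """List each off-diagonal pair once when |value| meets the threshold."""
    pairs = []
    for i in range(len(columns)):
        for j in range(i + 1, len(columns)):
            if abs(matrix[i][j]) >= threshold:
                pairs.append((columns[i], columns[j], matrix[i][j]))
    return pairs


flagged = highly_correlated_pairs(columns, matrix, THRESHOLD)
print(flagged)
```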

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.]

Sample

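First rows

Row | 시도명 | 시군구명 | 자치단체코드 | 과세년도 | 세목명 | 납세자유형 | 관내외여부 | 납세자수
0 | 대구광역시 | 북구 | 27230 | 2022 | 등록세 | 개인 | N | 40
1 | 대구광역시 | 북구 | 27230 | 2022 | 등록세 | 개인 | Y | 36
2 | 대구광역시 | 북구 | 27230 | 2022 | 등록세 | 법인 | N | 2
3 | 대구광역시 | 북구 | 27230 | 2022 | 등록세 | 법인 | Y | 2
4 | 대구광역시 | 북구 | 27230 | 2022 | 레저세 | 개인 | N | 1
5 | 대구광역시 | 북구 | 27230 | 2022 | 레저세 | 법인 | N | 2
6 | 대구광역시 | 북구 | 27230 | 2022 | 레저세 | 법인 | Y | 1
7 | 대구광역시 | 북구 | 27230 | 2022 | 재산세 | 개인 | N | 45052
8 | 대구광역시 | 북구 | 27230 | 2022 | 재산세 | 개인 | Y | 122086
9 | 대구광역시 | 북구 | 27230 | 2022 | 재산세 | 법인 | N | 957

Last rows

Row | 시도명 | 시군구명 | 자치단체코드 | 과세년도 | 세목명 | 납세자유형 | 관내외여부 | 납세자수
26 | 대구광역시 | 북구 | 27230 | 2022 | 등록면허세 | 법인 | Y | 3782
27 | 대구광역시 | 북구 | 27230 | 2022 | 지방소득세 | 개인 | N | 23798
28 | 대구광역시 | 북구 | 27230 | 2022 | 지방소득세 | 개인 | Y | 85702
29 | 대구광역시 | 북구 | 27230 | 2022 | 지방소득세 | 법인 | N | 1976
30 | 대구광역시 | 북구 | 27230 | 2022 | 지방소득세 | 법인 | Y | 4197
31 | 대구광역시 | 북구 | 27230 | 2022 | 지방소비세 | 법인 | Y | 1
32 | 대구광역시 | 북구 | 27230 | 2022 | 지역자원시설세 | 개인 | N | 91
33 | 대구광역시 | 북구 | 27230 | 2022 | 지역자원시설세 | 개인 | Y | 112
34 | 대구광역시 | 북구 | 27230 | 2022 | 지역자원시설세 | 법인 | N | 26
35 | 대구광역시 | 북구 | 27230 | 2022 | 지역자원시설세 | 법인 | Y | 48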
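
As an illustration of working with these records, the sketch below sums 납세자수 by 세목명 over the ten rows with index 0 to 9 shown in the Sample section. It covers only those rows, not the full 36-row dataset, so the totals are partial. The four constant columns are omitted for brevity.

```python
from collections import defaultdict

# Rows 0-9 of the Sample section: (세목명, 납세자유형, 관내외여부, 납세자수).
first_rows = [
    ("등록세", "개인", "N", 40),
    ("등록세", "개인", "Y", 36),
    ("등록세", "법인", "N", 2),
    ("등록세", "법인", "Y", 2),
    ("레저세", "개인", "N", 1),
    ("레저세", "법인", "N", 2),
    ("레저세", "법인", "Y", 1),
    ("재산세", "개인", "N", 45052),
    ("재산세", "개인", "Y", 122086),
    ("재산세", "법인", "N", 957),
]

# Sum the taxpayer counts per tax item across the rows above.
totals = defaultdict(int)
for tax_item, _taxpayer_type, _flag, taxpayers in first_rows:
    totals[tax_item] += taxpayers

print(dict(totals))
```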