Overview

Dataset statistics

Number of variables: 9
Number of observations: 92
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 6.9 KiB
Average record size in memory: 76.4 B
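
The two memory figures can be cross-checked with simple arithmetic. The sketch below assumes that the average record size is the total size divided by the number of observations, and that 1 KiB = 1024 B; the variable names are illustrative.

```python
# Values copied from the dataset statistics above.
n_observations = 92
avg_record_bytes = 76.4  # "Average record size in memory"

# Total size implied by the per-record average, converted to KiB (1 KiB = 1024 B).
total_bytes = avg_record_bytes * n_observations
total_kib = total_bytes / 1024

print(f"{total_bytes:.1f} B = {total_kib:.2f} KiB")
```

Rounded to one decimal place, the implied total agrees with the reported 6.9 KiB.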

Variable types

Categorical: 7
Boolean: 1
Numeric: 1

Dataset

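Description: Taxpayer status by tax item for Cheoin-gu, Giheung-gu, and Suji-gu in Yongin-si, Gyeonggi-do. Provides data such as taxpayer type and number of taxpayers. ※ Data reference date: 2021-12-31
URL: https://www.data.go.kr/data/15078573/fileData.do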

Alerts

시도명 has constant value "경기도" (Constant)
과세년도 has constant value "2021" (Constant)
데이터기준일자 has constant value "2021-12-31" (Constant)
시군구명 is highly overall correlated with 자치단체코드 (High correlation)
자치단체코드 is highly overall correlated with 시군구명 (High correlation)
납세자수 is highly overall correlated with 납세자유형 (High correlation)
납세자유형 is highly overall correlated with 납세자수 (High correlation)
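
The high-correlation alerts are consistent with a simple threshold rule applied to the first matrix in the Correlations section. The sketch below assumes a cut-off of 0.5; the report itself does not state the threshold, and `high_correlation_pairs` is a hypothetical helper, not part of the profiling tool.

```python
from itertools import combinations

# Assumed cut-off for a "High correlation" alert; the report does not state it.
THRESHOLD = 0.5

columns = ["시군구명", "자치단체코드", "세목명", "납세자유형", "관내여부", "납세자수"]
# First matrix from the Correlations section, rows in the same order as `columns`.
matrix = [
    [1.000, 1.000, 0.000, 0.000, 0.000, 0.165],
    [1.000, 1.000, 0.000, 0.000, 0.000, 0.165],
    [0.000, 0.000, 1.000, 0.000, 0.000, 0.000],
    [0.000, 0.000, 0.000, 1.000, 0.000, 0.537],
    [0.000, 0.000, 0.000, 0.000, 1.000, 0.244],
    [0.165, 0.165, 0.000, 0.537, 0.244, 1.000],
]


def high_correlation_pairs(cols, corr, threshold):
    """Return (col_a, col_b, value) for every off-diagonal pair at or above threshold."""
    pairs = []
    for i, j in combinations(range(len(cols)), 2):
        if abs(corr[i][j]) >= threshold:
            pairs.append((cols[i], cols[j], corr[i][j]))
    return pairs


flagged = high_correlation_pairs(columns, matrix, THRESHOLD)
for a, b, value in flagged:
    print(f"{a} <-> {b}: {value:.3f}")
```

Under that assumption, only the 시군구명/자치단체코드 pair (1.000) and the 납세자유형/납세자수 pair (0.537) are flagged, which matches the four alerts listed (each pair is reported once per direction).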

Reproduction

Analysis started: 2023-12-12 00:55:30.649082
Analysis finished: 2023-12-12 00:55:31.381900
Duration: 0.73 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
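
A report of this kind could be regenerated roughly as follows. This is a minimal sketch, not the exact command used for this run: the CSV file name, the `cp949` encoding (common for Korean public-data files), the report title, and the helper name `build_report` are all assumptions, and the original `config.json` settings are not reproduced.

```python
from pathlib import Path


def build_report(csv_path: str, out_path: str = "report.html", encoding: str = "cp949") -> Path:
    """Profile a CSV with ydata-profiling and write the HTML report to out_path."""
    # Imported lazily so this module can be loaded without the libraries installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # The encoding is an assumption; switch to "utf-8" if the file decodes incorrectly.
    df = pd.read_csv(csv_path, encoding=encoding)
    profile = ProfileReport(df, title="Yongin-si taxpayers by tax item (2021)")
    profile.to_file(out_path)
    return Path(out_path)
```

For example, `build_report("yongin_taxpayers_2021.csv")` would write `report.html` next to the script, where the CSV name stands for whatever file was downloaded from the URL given in the Dataset section.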

Variables

시도명
Categorical

CONSTANT

Distinct: 1
Distinct (%): 1.1%
Missing: 0
Missing (%): 0.0%
Memory size: 868.0 B
경기도: 92

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 경기도
2nd row: 경기도
3rd row: 경기도
4th row: 경기도
5th row: 경기도

Common Values

| Value | Count | Frequency (%) |
|---|---|---|
| 경기도 | 92 | 100.0% |

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

시군구명
Categorical

HIGH CORRELATION

Distinct: 3
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 868.0 B
용인시 처인구: 36
용인시 기흥구: 28
용인시 수지구: 28

Length

Max length: 7
Median length: 7
Mean length: 7
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 용인시 기흥구
2nd row: 용인시 기흥구
3rd row: 용인시 기흥구
4th row: 용인시 기흥구
5th row: 용인시 기흥구

Common Values

| Value | Count | Frequency (%) |
|---|---|---|
| 용인시 처인구 | 36 | 39.1% |
| 용인시 기흥구 | 28 | 30.4% |
| 용인시 수지구 | 28 | 30.4% |

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

| Value | Count | Frequency (%) |
|---|---|---|
| 용인시 | 92 | 50.0% |
| 처인구 | 36 | 19.6% |
| 기흥구 | 28 | 15.2% |
| 수지구 | 28 | 15.2% |
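
The last table for 시군구명 counts 용인시 92 times out of a total of 184, which suggests it tallies whitespace-separated tokens rather than whole values. The sketch below works under that assumption and rebuilds the figures from the category counts; it is an illustration, not the profiling tool's own code.

```python
from collections import Counter

# Category counts from the "Common Values" table for 시군구명.
value_counts = {"용인시 처인구": 36, "용인시 기흥구": 28, "용인시 수지구": 28}

# Split each value on whitespace and weight every token by the value's row count.
token_counts = Counter()
for value, count in value_counts.items():
    for token in value.split():
        token_counts[token] += count

total_tokens = sum(token_counts.values())
for token, count in token_counts.most_common():
    print(f"{token}\t{count}\t{count / total_tokens:.1%}")
```

Each of the 92 rows contributes two tokens, so the shares are taken over 184 tokens, which is why 용인시 appears at 50.0% rather than 100%.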

자치단체코드
Categorical

HIGH CORRELATION

Distinct: 3
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 868.0 B
41461: 36
41463: 28
41465: 28

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 41463
2nd row: 41463
3rd row: 41463
4th row: 41463
5th row: 41463

Common Values

| Value | Count | Frequency (%) |
|---|---|---|
| 41461 | 36 | 39.1% |
| 41463 | 28 | 30.4% |
| 41465 | 28 | 30.4% |

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

과세년도
Categorical

CONSTANT

Distinct: 1
Distinct (%): 1.1%
Missing: 0
Missing (%): 0.0%
Memory size: 868.0 B
2021: 92

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2021
2nd row: 2021
3rd row: 2021
4th row: 2021
5th row: 2021

Common Values

| Value | Count | Frequency (%) |
|---|---|---|
| 2021 | 92 | 100.0% |

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

세목명
Categorical

Distinct: 10
Distinct (%): 10.9%
Missing: 0
Missing (%): 0.0%
Memory size: 868.0 B
재산세: 12
주민세: 12
취득세: 12
자동차세: 12
등록면허세: 12
Other values (5): 32

Length

Max length: 7
Median length: 5
Mean length: 4.2608696
Min length: 3

Unique

Unique: 1
Unique (%): 1.1%

Sample

1st row: 재산세
2nd row: 재산세
3rd row: 재산세
4th row: 재산세
5th row: 주민세

Common Values

| Value | Count | Frequency (%) |
|---|---|---|
| 재산세 | 12 | 13.0% |
| 주민세 | 12 | 13.0% |
| 취득세 | 12 | 13.0% |
| 자동차세 | 12 | 13.0% |
| 등록면허세 | 12 | 13.0% |
| 지방소득세 | 12 | 13.0% |
| 지역자원시설세 | 12 | 13.0% |
| 등록세 | 4 | 4.3% |
| 담배소비세 | 3 | 3.3% |
| 지방소비세 | 1 | 1.1% |

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

납세자유형
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): 2.2%
Missing: 0
Missing (%): 0.0%
Memory size: 868.0 B
개인: 46
법인: 46

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 개인
2nd row: 개인
3rd row: 법인
4th row: 법인
5th row: 개인

Common Values

| Value | Count | Frequency (%) |
|---|---|---|
| 개인 | 46 | 50.0% |
| 법인 | 46 | 50.0% |

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

관내여부
Boolean

Distinct: 2
Distinct (%): 2.2%
Missing: 0
Missing (%): 0.0%
Memory size: 224.0 B
False: 46
True: 46

Common Values

| Value | Count | Frequency (%) |
|---|---|---|
| False | 46 | 50.0% |
| True | 46 | 50.0% |

[Figure: common values plot]

납세자수
Real number (ℝ)

HIGH CORRELATION

Distinct: 89
Distinct (%): 96.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 20696.261
Minimum: 1
Maximum: 149902
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 960.0 B

[Figure: histogram]

Quantile statistics

Minimum: 1
5-th percentile: 3.55
Q1: 746.75
Median: 4296
Q3: 16853.5
95-th percentile: 113762.05
Maximum: 149902
Range: 149901
Interquartile range (IQR): 16106.75

Descriptive statistics

Standard deviation: 36344.121
Coefficient of variation (CV): 1.7560718
Kurtosis: 3.5303246
Mean: 20696.261
Median Absolute Deviation (MAD): 4269.5
Skewness: 2.1275013
Sum: 1904056
Variance: 1.3208951 × 10^9
Monotonicity: Not monotonic
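
Several of the figures for 납세자수 are tied together by standard identities, so they can be checked against each other. The sketch below uses only the values printed in this section and reads the variance as 1.3208951 × 10^9; the tolerance is an arbitrary choice that allows for the rounding in the displayed numbers.

```python
import math

# Values as printed in the 납세자수 statistics.
n = 92
total = 1904056
mean = 20696.261
std = 36344.121
variance = 1.3208951e9
cv = 1.7560718
minimum, maximum = 1, 149902
q1, q3 = 746.75, 16853.5

# Each entry pairs an identity with whether the printed values satisfy it.
checks = {
    "mean = sum / n": math.isclose(total / n, mean, rel_tol=1e-6),
    "variance = std^2": math.isclose(std ** 2, variance, rel_tol=1e-6),
    "CV = std / mean": math.isclose(std / mean, cv, rel_tol=1e-6),
    "range = max - min": maximum - minimum == 149901,
    "IQR = Q3 - Q1": math.isclose(q3 - q1, 16106.75),
}

for name, ok in checks.items():
    print(f"{name}: {'ok' if ok else 'MISMATCH'}")
```

All five identities hold for the printed values. A coefficient of variation above 1 together with a skewness of about 2.1 indicates a strongly right-skewed count distribution, which fits a median (4296) far below the mean.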
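[Figure: histogram with fixed size bins (bins=50)]

Common Values

| Value | Count | Frequency (%) |
|---|---|---|
| 14 | 2 | 2.2% |
| 3 | 2 | 2.2% |
| 1 | 2 | 2.2% |
| 82089 | 1 | 1.1% |
| 8522 | 1 | 1.1% |
| 5987 | 1 | 1.1% |
| 2189 | 1 | 1.1% |
| 100907 | 1 | 1.1% |
| 8655 | 1 | 1.1% |
| 3202 | 1 | 1.1% |
| Other values (79) | 79 | 85.9% |

Minimum 10 values

| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 2 | 2.2% |
| 2 | 1 | 1.1% |
| 3 | 2 | 2.2% |
| 4 | 1 | 1.1% |
| 7 | 1 | 1.1% |
| 13 | 1 | 1.1% |
| 14 | 2 | 2.2% |
| 16 | 1 | 1.1% |
| 20 | 1 | 1.1% |
| 25 | 1 | 1.1% |

Maximum 10 values

| Value | Count | Frequency (%) |
|---|---|---|
| 149902 | 1 | 1.1% |
| 141187 | 1 | 1.1% |
| 125223 | 1 | 1.1% |
| 117909 | 1 | 1.1% |
| 114504 | 1 | 1.1% |
| 113155 | 1 | 1.1% |
| 100907 | 1 | 1.1% |
| 93534 | 1 | 1.1% |
| 85136 | 1 | 1.1% |
| 84922 | 1 | 1.1% |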
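
The 5th and 95th percentiles of 납세자수 can be recomputed from the ten smallest and ten largest values alone, because each percentile depends only on two neighbouring order statistics. The sketch below assumes the linear-interpolation definition (the default in NumPy and pandas); `percentile_linear` is a hypothetical helper written for this illustration.

```python
# Known order statistics of 납세자수 (0-based positions in the sorted column),
# taken from the ten smallest and ten largest values, repeated by their counts.
n = 92
smallest = [1, 1, 2, 3, 3, 4, 7, 13, 14, 14, 16, 20, 25]
largest = [84922, 85136, 93534, 100907, 113155, 114504, 117909, 125223, 141187, 149902]

order_stats = dict(enumerate(smallest))
order_stats.update({n - len(largest) + i: v for i, v in enumerate(largest)})


def percentile_linear(q: float) -> float:
    """Linear-interpolation percentile using the two neighbouring order statistics."""
    position = q * (n - 1)
    lower = int(position)
    fraction = position - lower
    upper = min(lower + 1, n - 1)
    return order_stats[lower] + fraction * (order_stats[upper] - order_stats[lower])


p05 = percentile_linear(0.05)
p95 = percentile_linear(0.95)
print(round(p05, 2), round(p95, 2))
```

The 5th percentile falls at position 4.55, between the values 3 and 4, and the 95th at position 86.45, between 113155 and 114504. The results agree with the reported 3.55 and 113762.05.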

데이터기준일자
Categorical

CONSTANT

Distinct: 1
Distinct (%): 1.1%
Missing: 0
Missing (%): 0.0%
Memory size: 868.0 B
2021-12-31: 92

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2021-12-31
2nd row: 2021-12-31
3rd row: 2021-12-31
4th row: 2021-12-31
5th row: 2021-12-31

Common Values

| Value | Count | Frequency (%) |
|---|---|---|
| 2021-12-31 | 92 | 100.0% |

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

Interactions

[Figure: interactions plot]

Correlations

[Figure: correlation heatmap]

| | 시군구명 | 자치단체코드 | 세목명 | 납세자유형 | 관내여부 | 납세자수 |
|---|---|---|---|---|---|---|
| 시군구명 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.165 |
| 자치단체코드 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.165 |
| 세목명 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 |
| 납세자유형 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.537 |
| 관내여부 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.244 |
| 납세자수 | 0.165 | 0.165 | 0.000 | 0.537 | 0.244 | 1.000 |

[Figure: correlation heatmap]

| | 관내여부 | 세목명 | 시군구명 | 납세자유형 | 자치단체코드 |
|---|---|---|---|---|---|
| 관내여부 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 |
| 세목명 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 |
| 시군구명 | 0.000 | 0.000 | 1.000 | 0.000 | 1.000 |
| 납세자유형 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 |
| 자치단체코드 | 0.000 | 0.000 | 1.000 | 0.000 | 1.000 |

[Figure: correlation heatmap]

| | 납세자수 | 시군구명 | 자치단체코드 | 세목명 | 납세자유형 | 관내여부 |
|---|---|---|---|---|---|---|
| 납세자수 | 1.000 | 0.063 | 0.063 | 0.000 | 0.518 | 0.232 |
| 시군구명 | 0.063 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 |
| 자치단체코드 | 0.063 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 |
| 세목명 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 |
| 납세자유형 | 0.518 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 |
| 관내여부 | 0.232 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 |
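
The value of 1.000 between 시군구명 and 자치단체코드 is what any categorical association measure gives when one column is a relabelling of the other. The sketch below illustrates this with an uncorrected Cramér's V; the report does not name the measure behind each matrix, so the choice of statistic is an assumption, and the 수지구 to 41465 pairing is inferred from the matching counts rather than shown in the sample rows.

```python
import math
from collections import Counter


def cramers_v(pairs):
    """Uncorrected Cramér's V for a list of (x, y) category pairs."""
    n = len(pairs)
    joint = Counter(pairs)
    row_totals = Counter(x for x, _ in pairs)
    col_totals = Counter(y for _, y in pairs)
    # Pearson chi-squared over every cell of the contingency table.
    chi2 = 0.0
    for x, rx in row_totals.items():
        for y, cy in col_totals.items():
            expected = rx * cy / n
            chi2 += (joint.get((x, y), 0) - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k))


# District/code pairs weighted by the row counts from the variable summaries.
# 수지구 -> 41465 is inferred; the sample rows only show the other two pairings.
pairs = (
    [("용인시 처인구", "41461")] * 36
    + [("용인시 기흥구", "41463")] * 28
    + [("용인시 수지구", "41465")] * 28
)
v = cramers_v(pairs)
print(round(v, 3))
```

Because every district maps to exactly one code, the contingency table is diagonal and the statistic reaches its maximum of 1. In practice this means one of the two columns is redundant for modelling.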

Missing values

[Figure: nullity bar chart]
A simple visualization of nullity by column.

[Figure: nullity matrix]
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

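First rows

| | 시도명 | 시군구명 | 자치단체코드 | 과세년도 | 세목명 | 납세자유형 | 관내여부 | 납세자수 | 데이터기준일자 |
|---|---|---|---|---|---|---|---|---|---|
| 0 | 경기도 | 용인시 기흥구 | 41463 | 2021 | 재산세 | 개인 | N | 43227 | 2021-12-31 |
| 1 | 경기도 | 용인시 기흥구 | 41463 | 2021 | 재산세 | 개인 | Y | 117909 | 2021-12-31 |
| 2 | 경기도 | 용인시 기흥구 | 41463 | 2021 | 재산세 | 법인 | N | 1595 | 2021-12-31 |
| 3 | 경기도 | 용인시 기흥구 | 41463 | 2021 | 재산세 | 법인 | Y | 1725 | 2021-12-31 |
| 4 | 경기도 | 용인시 기흥구 | 41463 | 2021 | 주민세 | 개인 | N | 13437 | 2021-12-31 |
| 5 | 경기도 | 용인시 기흥구 | 41463 | 2021 | 주민세 | 개인 | Y | 149902 | 2021-12-31 |
| 6 | 경기도 | 용인시 기흥구 | 41463 | 2021 | 주민세 | 법인 | N | 4313 | 2021-12-31 |
| 7 | 경기도 | 용인시 기흥구 | 41463 | 2021 | 주민세 | 법인 | Y | 8632 | 2021-12-31 |
| 8 | 경기도 | 용인시 기흥구 | 41463 | 2021 | 취득세 | 개인 | N | 7415 | 2021-12-31 |
| 9 | 경기도 | 용인시 기흥구 | 41463 | 2021 | 취득세 | 개인 | Y | 34635 | 2021-12-31 |

Last rows

| | 시도명 | 시군구명 | 자치단체코드 | 과세년도 | 세목명 | 납세자유형 | 관내여부 | 납세자수 | 데이터기준일자 |
|---|---|---|---|---|---|---|---|---|---|
| 82 | 경기도 | 용인시 처인구 | 41461 | 2021 | 등록면허세 | 법인 | Y | 5142 | 2021-12-31 |
| 83 | 경기도 | 용인시 처인구 | 41461 | 2021 | 지방소득세 | 개인 | N | 11663 | 2021-12-31 |
| 84 | 경기도 | 용인시 처인구 | 41461 | 2021 | 지방소득세 | 개인 | Y | 53200 | 2021-12-31 |
| 85 | 경기도 | 용인시 처인구 | 41461 | 2021 | 지방소득세 | 법인 | N | 2595 | 2021-12-31 |
| 86 | 경기도 | 용인시 처인구 | 41461 | 2021 | 지방소득세 | 법인 | Y | 5522 | 2021-12-31 |
| 87 | 경기도 | 용인시 처인구 | 41461 | 2021 | 지방소비세 | 법인 | Y | 1 | 2021-12-31 |
| 88 | 경기도 | 용인시 처인구 | 41461 | 2021 | 지역자원시설세 | 개인 | N | 28 | 2021-12-31 |
| 89 | 경기도 | 용인시 처인구 | 41461 | 2021 | 지역자원시설세 | 개인 | Y | 58 | 2021-12-31 |
| 90 | 경기도 | 용인시 처인구 | 41461 | 2021 | 지역자원시설세 | 법인 | N | 30 | 2021-12-31 |
| 91 | 경기도 | 용인시 처인구 | 41461 | 2021 | 지역자원시설세 | 법인 | Y | 72 | 2021-12-31 |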
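
Each combination of tax item and taxpayer type is split into two rows by 관내여부, so a common first step is to sum the pair and look at the in-district share. The sketch below does this for the first ten rows shown in the sample (all for 용인시 기흥구), reading `Y` as "in district"; that reading is an assumption based on the column name.

```python
from collections import defaultdict

# (세목명, 납세자유형, 관내여부, 납세자수) for the first ten sample rows.
rows = [
    ("재산세", "개인", "N", 43227),
    ("재산세", "개인", "Y", 117909),
    ("재산세", "법인", "N", 1595),
    ("재산세", "법인", "Y", 1725),
    ("주민세", "개인", "N", 13437),
    ("주민세", "개인", "Y", 149902),
    ("주민세", "법인", "N", 4313),
    ("주민세", "법인", "Y", 8632),
    ("취득세", "개인", "N", 7415),
    ("취득세", "개인", "Y", 34635),
]

# Total taxpayers and in-district taxpayers per (tax item, taxpayer type).
totals = defaultdict(int)
in_district = defaultdict(int)
for tax, kind, flag, count in rows:
    totals[(tax, kind)] += count
    if flag == "Y":
        in_district[(tax, kind)] += count

for key, total in totals.items():
    print(key, total, f"{in_district[key] / total:.1%}")
```

The same grouping applied to all 92 rows would give per-district totals, but only the rows displayed here are used so that no values are invented.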