Overview

Dataset statistics

Number of variables8
Number of observations126
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.4 KiB
Average record size in memory68.0 B

Variable types

Categorical6
Boolean1
Numeric1

Dataset

Description세목별 납세 인원 현황을 제공(시도명,시군구명,자치단체코드,과세년도,세목명,납세자유형,관내/관외,납세자수)
URLhttps://www.data.go.kr/data/15080174/fileData.do

Alerts

시도명 has constant value ""Constant
과세년도 has constant value ""Constant
시군구명 is highly overall correlated with 자치단체코드High correlation
자치단체코드 is highly overall correlated with 시군구명High correlation
납세자수 is highly overall correlated with 납세자유형High correlation
납세자유형 is highly overall correlated with 납세자수High correlation

Reproduction

Analysis started2023-12-12 06:41:26.200243
Analysis finished2023-12-12 06:41:26.897829
Duration0.7 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
경기도
126 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경기도
2nd row경기도
3rd row경기도
4th row경기도
5th row경기도

Common Values

ValueCountFrequency (%)
경기도 126
100.0%

Length

2023-12-12T15:41:26.977260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:41:27.090003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경기도 126
100.0%

시군구명
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)3.2%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
수원시팔달구
36 
수원시권선구
30 
수원시영통구
30 
수원시장안구
30 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row수원시권선구
2nd row수원시권선구
3rd row수원시권선구
4th row수원시권선구
5th row수원시권선구

Common Values

ValueCountFrequency (%)
수원시팔달구 36
28.6%
수원시권선구 30
23.8%
수원시영통구 30
23.8%
수원시장안구 30
23.8%

Length

2023-12-12T15:41:27.194435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:41:27.314201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
수원시팔달구 36
28.6%
수원시권선구 30
23.8%
수원시영통구 30
23.8%
수원시장안구 30
23.8%

자치단체코드
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)3.2%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
41115
36 
41113
30 
41117
30 
41111
30 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row41113
2nd row41113
3rd row41113
4th row41113
5th row41113

Common Values

ValueCountFrequency (%)
41115 36
28.6%
41113 30
23.8%
41117 30
23.8%
41111 30
23.8%

Length

2023-12-12T15:41:27.452574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:41:27.566556image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
41115 36
28.6%
41113 30
23.8%
41117 30
23.8%
41111 30
23.8%

과세년도
Categorical

CONSTANT 

Distinct1
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2022
126 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022
2nd row2022
3rd row2022
4th row2022
5th row2022

Common Values

ValueCountFrequency (%)
2022 126
100.0%

Length

2023-12-12T15:41:27.722451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:41:27.822261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022 126
100.0%

세목명
Categorical

Distinct11
Distinct (%)8.7%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
재산세
16 
주민세
16 
취득세
16 
자동차세
16 
등록면허세
16 
Other values (6)
46 

Length

Max length7
Median length5
Mean length4.1904762
Min length3

Unique

Unique1 ?
Unique (%)0.8%

Sample

1st row등록세
2nd row등록세
3rd row재산세
4th row재산세
5th row재산세

Common Values

ValueCountFrequency (%)
재산세 16
12.7%
주민세 16
12.7%
취득세 16
12.7%
자동차세 16
12.7%
등록면허세 16
12.7%
지방소득세 16
12.7%
지역자원시설세 16
12.7%
등록세 8
6.3%
레저세 3
 
2.4%
담배소비세 2
 
1.6%

Length

2023-12-12T15:41:27.958755image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
재산세 16
12.7%
주민세 16
12.7%
취득세 16
12.7%
자동차세 16
12.7%
등록면허세 16
12.7%
지방소득세 16
12.7%
지역자원시설세 16
12.7%
등록세 8
6.3%
레저세 3
 
2.4%
담배소비세 2
 
1.6%

납세자유형
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
개인
65 
법인
61 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row개인
2nd row개인
3rd row개인
4th row개인
5th row법인

Common Values

ValueCountFrequency (%)
개인 65
51.6%
법인 61
48.4%

Length

2023-12-12T15:41:28.101351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:41:28.204195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
개인 65
51.6%
법인 61
48.4%
Distinct2
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Memory size258.0 B
False
63 
True
63 
ValueCountFrequency (%)
False 63
50.0%
True 63
50.0%
2023-12-12T15:41:28.288292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

납세자수
Real number (ℝ)

HIGH CORRELATION 

Distinct118
Distinct (%)93.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean16467.151
Minimum1
Maximum141695
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.2 KiB
2023-12-12T15:41:28.395302image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.25
Q1216.5
median2049.5
Q314577
95-th percentile101328.25
Maximum141695
Range141694
Interquartile range (IQR)14360.5

Descriptive statistics

Standard deviation31791.588
Coefficient of variation (CV)1.9306065
Kurtosis4.7671464
Mean16467.151
Median Absolute Deviation (MAD)2038
Skewness2.3719257
Sum2074861
Variance1.010705 × 109
MonotonicityNot monotonic
2023-12-12T15:41:28.565338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 3
 
2.4%
13 2
 
1.6%
3 2
 
1.6%
17 2
 
1.6%
12 2
 
1.6%
6 2
 
1.6%
11 2
 
1.6%
141695 1
 
0.8%
1424 1
 
0.8%
2 1
 
0.8%
Other values (108) 108
85.7%
ValueCountFrequency (%)
1 3
2.4%
2 1
 
0.8%
3 2
1.6%
5 1
 
0.8%
6 2
1.6%
7 1
 
0.8%
8 1
 
0.8%
9 1
 
0.8%
11 2
1.6%
12 2
1.6%
ValueCountFrequency (%)
141695 1
0.8%
127419 1
0.8%
126942 1
0.8%
115290 1
0.8%
102918 1
0.8%
102574 1
0.8%
102107 1
0.8%
98992 1
0.8%
90209 1
0.8%
84198 1
0.8%

Interactions

2023-12-12T15:41:26.534697image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T15:41:28.698351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군구명자치단체코드세목명납세자유형관내(Y)_관외(N)납세자수
시군구명1.0001.0000.0000.0000.0000.203
자치단체코드1.0001.0000.0000.0000.0000.203
세목명0.0000.0001.0000.0000.0000.000
납세자유형0.0000.0000.0001.0000.0000.681
관내(Y)_관외(N)0.0000.0000.0000.0001.0000.380
납세자수0.2030.2030.0000.6810.3801.000
2023-12-12T15:41:28.836640image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
납세자유형시군구명자치단체코드세목명관내(Y)_관외(N)
납세자유형1.0000.0000.0000.0000.000
시군구명0.0001.0001.0000.0000.000
자치단체코드0.0001.0001.0000.0000.000
세목명0.0000.0000.0001.0000.000
관내(Y)_관외(N)0.0000.0000.0000.0001.000
2023-12-12T15:41:28.994701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
납세자수시군구명자치단체코드세목명납세자유형관내(Y)_관외(N)
납세자수1.0000.1170.1170.0000.5130.281
시군구명0.1171.0001.0000.0000.0000.000
자치단체코드0.1171.0001.0000.0000.0000.000
세목명0.0000.0000.0001.0000.0000.000
납세자유형0.5130.0000.0000.0001.0000.000
관내(Y)_관외(N)0.2810.0000.0000.0000.0001.000

Missing values

2023-12-12T15:41:26.680785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:41:26.843406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명자치단체코드과세년도세목명납세자유형관내(Y)_관외(N)납세자수
0경기도수원시권선구411132022등록세개인N13
1경기도수원시권선구411132022등록세개인Y17
2경기도수원시권선구411132022재산세개인N30493
3경기도수원시권선구411132022재산세개인Y98992
4경기도수원시권선구411132022재산세법인N947
5경기도수원시권선구411132022재산세법인Y1017
6경기도수원시권선구411132022주민세개인N10422
7경기도수원시권선구411132022주민세개인Y141695
8경기도수원시권선구411132022주민세법인N1424
9경기도수원시권선구411132022주민세법인Y3417
시도명시군구명자치단체코드과세년도세목명납세자유형관내(Y)_관외(N)납세자수
116경기도수원시팔달구411152022등록면허세법인Y2173
117경기도수원시팔달구411152022지방소득세개인N14199
118경기도수원시팔달구411152022지방소득세개인Y53961
119경기도수원시팔달구411152022지방소득세법인N1357
120경기도수원시팔달구411152022지방소득세법인Y2626
121경기도수원시팔달구411152022지방소비세법인Y1
122경기도수원시팔달구411152022지역자원시설세개인N17
123경기도수원시팔달구411152022지역자원시설세개인Y24
124경기도수원시팔달구411152022지역자원시설세법인N9
125경기도수원시팔달구411152022지역자원시설세법인Y11