Overview

Dataset statistics

Number of variables: 8
Number of observations: 196
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 13.0 KiB
Average record size in memory: 67.7 B

Variable types

Categorical: 5
Numeric: 2
Boolean: 1

Dataset

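Description: Data on the number of taxpayers by tax item. It provides fields such as the taxpayer type, whether the taxpayer's address is inside or outside the jurisdiction, and the number of taxpayers.
URL: https://www.data.go.kr/data/15078744/fileData.do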

Alerts

시도명 has constant value "대전광역시" [Constant]
시군구명 has constant value "대덕구" [Constant]
자치단체코드 has constant value "30230" [Constant]
납세자수 is highly overall correlated with 납세자유형 [High correlation]
납세자유형 is highly overall correlated with 납세자수 [High correlation]
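
The Constant alerts can be checked by counting the distinct values in each column. The following is a minimal sketch in plain Python, assuming that four rows copied from the Sample section stand in for the full 196-row dataset; the `constant_columns` helper is hypothetical and is not part of ydata-profiling.

```python
# Four rows copied from the report's Sample section (a subset, not the full data).
COLUMNS = ["시도명", "시군구명", "자치단체코드", "과세년도",
           "세목명", "납세자유형", "관내_관외", "납세자수"]
ROWS = [
    ("대전광역시", "대덕구", "30230", 2017, "등록세", "개인", "N", 47),
    ("대전광역시", "대덕구", "30230", 2017, "재산세", "법인", "Y", 928),
    ("대전광역시", "대덕구", "30230", 2022, "지방소득세", "개인", "Y", 35248),
    ("대전광역시", "대덕구", "30230", 2022, "지역자원시설세", "법인", "N", 15),
]


def constant_columns(columns, rows):
    """Return {column: value} for columns that hold exactly one distinct value."""
    result = {}
    for i, name in enumerate(columns):
        distinct = {row[i] for row in rows}
        if len(distinct) == 1:
            result[name] = next(iter(distinct))
    return result


constants = constant_columns(COLUMNS, ROWS)
for name, value in constants.items():
    print(f"{name} has constant value {value!r}")
```

Columns flagged this way carry no information within this file, so they can usually be dropped before modelling or kept only as metadata.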

Reproduction

Analysis started: 2023-12-12 16:05:54.293153
Analysis finished: 2023-12-12 16:05:54.930768
Duration: 0.64 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
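
The reported duration follows directly from the two timestamps. A small sketch using only the standard library, with the timestamp strings copied from this section:

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2023-12-12 16:05:54.293153", FMT)
finished = datetime.strptime("2023-12-12 16:05:54.930768", FMT)

# Elapsed wall-clock time between the start and the end of the analysis.
duration = (finished - started).total_seconds()
print(f"Duration: {duration:.2f} seconds")
```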

Variables

시도명 (province or metropolitan city name)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

대전광역시 (Daejeon Metropolitan City): 196

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 대전광역시
2nd row: 대전광역시
3rd row: 대전광역시
4th row: 대전광역시
5th row: 대전광역시

Common Values

Value | Count | Frequency (%)
대전광역시 | 196 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values of 시도명]

시군구명 (city, county, or district name)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

대덕구 (Daedeok-gu): 196

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 대덕구
2nd row: 대덕구
3rd row: 대덕구
4th row: 대덕구
5th row: 대덕구

Common Values

Value | Count | Frequency (%)
대덕구 | 196 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values of 시군구명]

자치단체코드 (local government code)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

30230: 196

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 30230
2nd row: 30230
3rd row: 30230
4th row: 30230
5th row: 30230

Common Values

Value | Count | Frequency (%)
30230 | 196 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values of 자치단체코드]

과세년도 (tax year)
Real number (ℝ)

Distinct: 6
Distinct (%): 3.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2019.551
Minimum: 2017
Maximum: 2022
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.9 KiB

Quantile statistics

Minimum: 2017
5th percentile: 2017
Q1: 2018
Median: 2020
Q3: 2021
95th percentile: 2022
Maximum: 2022
Range: 5
Interquartile range (IQR): 3

Descriptive statistics

Standard deviation: 1.725361
Coefficient of variation (CV): 0.00085432902
Kurtosis: -1.2868221
Mean: 2019.551
Median Absolute Deviation (MAD): 2
Skewness: -0.038711967
Sum: 395832
Variance: 2.9768707
Monotonicity: Increasing
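
Because 과세년도 takes only six distinct values, its descriptive statistics can be rebuilt from the value counts reported for this variable. The sketch below uses the standard library and assumes the sample (n - 1) form of the variance, which is the form that reproduces the reported figure.

```python
import statistics

# Year -> number of rows, copied from the frequency table of 과세년도.
counts = {2017: 32, 2018: 32, 2019: 31, 2020: 33, 2021: 33, 2022: 35}

# Expand the table back into one value per observation.
years = [year for year, n_rows in counts.items() for _ in range(n_rows)]

n = len(years)
total = sum(years)
mean = total / n
variance = statistics.variance(years)  # sample variance (n - 1 denominator)
std = statistics.stdev(years)
cv = std / mean                        # coefficient of variation
median = statistics.median(years)

print(n, total, round(mean, 3), round(variance, 7), round(std, 6), median)
```

The very small coefficient of variation reflects that the years differ by at most 5 around a mean near 2019.5, so it says little about this variable.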

[Figure: Histogram with fixed size bins (bins=6)]

Common values (by frequency)

Value | Count | Frequency (%)
2022 | 35 | 17.9%
2020 | 33 | 16.8%
2021 | 33 | 16.8%
2017 | 32 | 16.3%
2018 | 32 | 16.3%
2019 | 31 | 15.8%

Extreme values (lowest first)

Value | Count | Frequency (%)
2017 | 32 | 16.3%
2018 | 32 | 16.3%
2019 | 31 | 15.8%
2020 | 33 | 16.8%
2021 | 33 | 16.8%
2022 | 35 | 17.9%

Extreme values (highest first)

Value | Count | Frequency (%)
2022 | 35 | 17.9%
2021 | 33 | 16.8%
2020 | 33 | 16.8%
2019 | 31 | 15.8%
2018 | 32 | 16.3%
2017 | 32 | 16.3%

세목명 (tax item name)
Categorical

Distinct: 10
Distinct (%): 5.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

재산세: 24
주민세: 24
취득세: 24
자동차세: 24
등록면허세: 24
Other values (5): 76

Length

Max length: 7
Median length: 5
Mean length: 4.1326531
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 등록세
2nd row: 등록세
3rd row: 등록세
4th row: 등록세
5th row: 재산세

Common Values

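Value | Count | Frequency (%)
재산세 (property tax) | 24 | 12.2%
주민세 (resident tax) | 24 | 12.2%
취득세 (acquisition tax) | 24 | 12.2%
자동차세 (automobile tax) | 24 | 12.2%
등록면허세 (registration and license tax) | 24 | 12.2%
지방소득세 (local income tax) | 24 | 12.2%
지역자원시설세 (local resource and facility tax) | 24 | 12.2%
등록세 (registration tax) | 23 | 11.7%
지방소비세 (local consumption tax) | 3 | 1.5%
레저세 (leisure tax) | 2 | 1.0%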
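
The Frequency (%) column and the mean category length can both be derived from the counts alone. A short sketch, assuming the category names are stored as precomposed (NFC) Hangul so that `len` counts one character per syllable:

```python
# Count per tax item, copied from the Common Values table of 세목명.
counts = {
    "재산세": 24, "주민세": 24, "취득세": 24, "자동차세": 24,
    "등록면허세": 24, "지방소득세": 24, "지역자원시설세": 24,
    "등록세": 23, "지방소비세": 3, "레저세": 2,
}

total = sum(counts.values())

# Relative frequency of each category as a percentage with one decimal place.
percentages = {name: round(100 * n / total, 1) for name, n in counts.items()}

# Mean string length, weighting each category name by its row count.
mean_length = sum(len(name) * n for name, n in counts.items()) / total

for name, pct in percentages.items():
    print(f"{name}\t{counts[name]}\t{pct}%")
print(f"Mean length: {mean_length:.7f}")
```

Two categories, 지방소비세 and 레저세, account for only 5 of the 196 rows, so any per-category statistic for them rests on very few observations.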

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values of 세목명]

납세자유형 (taxpayer type)
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

법인 (corporation): 99
개인 (individual): 97

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 개인
2nd row: 개인
3rd row: 법인
4th row: 법인
5th row: 개인

Common Values

Value | Count | Frequency (%)
법인 | 99 | 50.5%
개인 | 97 | 49.5%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values of 납세자유형]

관내_관외 (inside or outside the jurisdiction)
Boolean

Distinct: 2
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 328.0 B

Value | Count | Frequency (%)
True | 99 | 50.5%
False | 97 | 49.5%

납세자수 (number of taxpayers)
Real number (ℝ)

HIGH CORRELATION

Distinct: 172
Distinct (%): 87.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 8921.602
Minimum: 1
Maximum: 66458
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.9 KiB

Quantile statistics

Minimum: 1
5th percentile: 2
Q1: 49.25
Median: 1652.5
Q3: 9714.5
95th percentile: 50333
Maximum: 66458
Range: 66457
Interquartile range (IQR): 9665.25

Descriptive statistics

Standard deviation: 15974.344
Coefficient of variation (CV): 1.7905241
Kurtosis: 4.34276
Mean: 8921.602
Median Absolute Deviation (MAD): 1619.5
Skewness: 2.2775858
Sum: 1748634
Variance: 2.5517965 × 10^8
Monotonicity: Not monotonic
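
Several of the figures reported for 납세자수 are determined by others, which gives a quick consistency check. A minimal sketch using only numbers copied from the report; the variable names are illustrative:

```python
# Figures copied from the report for 납세자수.
n = 196
total = 1_748_634
std = 15974.344
q1, q3 = 49.25, 9714.5
minimum, maximum = 1, 66458

mean = total / n                 # compare with the reported mean
cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance is the squared standard deviation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range

print(round(mean, 3), round(cv, 6), f"{variance:.4e}", iqr, value_range)
```

The mean (about 8922) is far above the median (1652.5) and the skewness is positive, so the distribution has a long right tail: a few tax items with tens of thousands of taxpayers dominate the total.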

[Figure: Histogram with fixed size bins (bins=50)]

Common values (by frequency)

Value | Count | Frequency (%)
1 | 7 | 3.6%
33 | 4 | 2.0%
2 | 4 | 2.0%
36 | 3 | 1.5%
3 | 3 | 1.5%
17 | 2 | 1.0%
40 | 2 | 1.0%
2660 | 2 | 1.0%
26 | 2 | 1.0%
46 | 2 | 1.0%
Other values (162) | 165 | 84.2%

Extreme values (lowest first)

Value | Count | Frequency (%)
1 | 7 | 3.6%
2 | 4 | 2.0%
3 | 3 | 1.5%
4 | 1 | 0.5%
7 | 1 | 0.5%
15 | 2 | 1.0%
17 | 2 | 1.0%
18 | 1 | 0.5%
22 | 1 | 0.5%
26 | 2 | 1.0%

Extreme values (highest first)

Value | Count | Frequency (%)
66458 | 1 | 0.5%
66015 | 1 | 0.5%
65014 | 1 | 0.5%
63717 | 1 | 0.5%
61062 | 1 | 0.5%
60771 | 1 | 0.5%
58966 | 1 | 0.5%
57738 | 1 | 0.5%
53936 | 1 | 0.5%
52277 | 1 | 0.5%

Interactions

[Figures: four interaction plots]

Correlations

[Figure: correlation heatmap]

Variable | 과세년도 | 세목명 | 납세자유형 | 관내_관외 | 납세자수
과세년도 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000
세목명 | 0.000 | 1.000 | 0.000 | 0.000 | 0.698
납세자유형 | 0.000 | 0.000 | 1.000 | 0.000 | 0.799
관내_관외 | 0.000 | 0.000 | 0.000 | 1.000 | 0.553
납세자수 | 0.000 | 0.698 | 0.799 | 0.553 | 1.000

[Figure: correlation heatmap]

Variable | 세목명 | 납세자유형 | 관내_관외
세목명 | 1.000 | 0.000 | 0.000
납세자유형 | 0.000 | 1.000 | 0.000
관내_관외 | 0.000 | 0.000 | 1.000

[Figure: correlation heatmap]

Variable | 과세년도 | 납세자수 | 세목명 | 납세자유형 | 관내_관외
과세년도 | 1.000 | -0.052 | 0.000 | 0.000 | 0.000
납세자수 | -0.052 | 1.000 | 0.280 | 0.620 | 0.417
세목명 | 0.000 | 0.280 | 1.000 | 0.000 | 0.000
납세자유형 | 0.000 | 0.620 | 0.000 | 1.000 | 0.000
관내_관외 | 0.000 | 0.417 | 0.000 | 0.000 | 1.000
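
The extracted text does not say which coefficient each matrix uses. As an illustration only, and assuming a categorical-to-categorical measure such as Cramér's V, the sketch below computes the uncorrected statistic for 납세자유형 against 관내_관외 on the 20 rows shown in the Sample section. The `cramers_v` helper is hypothetical and is not the report's implementation, and the result will differ from the matrices above because it uses 20 rows rather than all 196.

```python
import math
from collections import Counter


def cramers_v(pairs):
    """Uncorrected Cramér's V for a list of (a, b) category pairs."""
    n = len(pairs)
    joint = Counter(pairs)
    row_totals = Counter(a for a, _ in pairs)
    col_totals = Counter(b for _, b in pairs)
    chi2 = 0.0
    for a, row_n in row_totals.items():
        for b, col_n in col_totals.items():
            expected = row_n * col_n / n
            chi2 += (joint[(a, b)] - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k)) if k > 0 else 0.0


# (납세자유형, 관내_관외) pairs counted from the 20 rows in the Sample section.
pairs = (
    [("개인", "N")] * 5 + [("개인", "Y")] * 5
    + [("법인", "N")] * 4 + [("법인", "Y")] * 6
)
v = cramers_v(pairs)
print(round(v, 3))
```

On these rows the statistic is about 0.10, a weak association, which points in the same direction as the near-zero entries between the categorical variables.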

Missing values

[Figure: nullity by column] A simple visualization of nullity by column.
[Figure: nullity matrix] The nullity matrix is a data-dense display that lets you quickly spot patterns in data completion.

Sample

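First rows

Row | 시도명 | 시군구명 | 자치단체코드 | 과세년도 | 세목명 | 납세자유형 | 관내_관외 | 납세자수
0 | 대전광역시 | 대덕구 | 30230 | 2017 | 등록세 | 개인 | N | 47
1 | 대전광역시 | 대덕구 | 30230 | 2017 | 등록세 | 개인 | Y | 36
2 | 대전광역시 | 대덕구 | 30230 | 2017 | 등록세 | 법인 | N | 7
3 | 대전광역시 | 대덕구 | 30230 | 2017 | 등록세 | 법인 | Y | 4
4 | 대전광역시 | 대덕구 | 30230 | 2017 | 재산세 | 개인 | N | 24662
5 | 대전광역시 | 대덕구 | 30230 | 2017 | 재산세 | 개인 | Y | 41540
6 | 대전광역시 | 대덕구 | 30230 | 2017 | 재산세 | 법인 | N | 552
7 | 대전광역시 | 대덕구 | 30230 | 2017 | 재산세 | 법인 | Y | 928
8 | 대전광역시 | 대덕구 | 30230 | 2017 | 주민세 | 개인 | N | 20215
9 | 대전광역시 | 대덕구 | 30230 | 2017 | 주민세 | 개인 | Y | 60771

Last rows

Row | 시도명 | 시군구명 | 자치단체코드 | 과세년도 | 세목명 | 납세자유형 | 관내_관외 | 납세자수
186 | 대전광역시 | 대덕구 | 30230 | 2022 | 등록면허세 | 법인 | Y | 2645
187 | 대전광역시 | 대덕구 | 30230 | 2022 | 지방소득세 | 개인 | N | 12276
188 | 대전광역시 | 대덕구 | 30230 | 2022 | 지방소득세 | 개인 | Y | 35248
189 | 대전광역시 | 대덕구 | 30230 | 2022 | 지방소득세 | 법인 | N | 1155
190 | 대전광역시 | 대덕구 | 30230 | 2022 | 지방소득세 | 법인 | Y | 2800
191 | 대전광역시 | 대덕구 | 30230 | 2022 | 지방소비세 | 법인 | Y | 1
192 | 대전광역시 | 대덕구 | 30230 | 2022 | 지역자원시설세 | 개인 | N | 36
193 | 대전광역시 | 대덕구 | 30230 | 2022 | 지역자원시설세 | 개인 | Y | 38
194 | 대전광역시 | 대덕구 | 30230 | 2022 | 지역자원시설세 | 법인 | N | 15
195 | 대전광역시 | 대덕구 | 30230 | 2022 | 지역자원시설세 | 법인 | Y | 26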
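
The sample rows already hint at why the report flags 납세자수 as highly correlated with 납세자유형. The sketch below groups the first ten sample rows by taxpayer type and compares the mean taxpayer count. It is a rough illustration on ten rows, not a reproduction of the report's correlation measure.

```python
from collections import defaultdict
from statistics import mean

# (납세자유형, 납세자수) taken from the first ten sample rows of the report.
rows = [
    ("개인", 47), ("개인", 36), ("법인", 7), ("법인", 4),
    ("개인", 24662), ("개인", 41540), ("법인", 552), ("법인", 928),
    ("개인", 20215), ("개인", 60771),
]

# Collect the taxpayer counts for each taxpayer type.
by_type = defaultdict(list)
for taxpayer_type, count in rows:
    by_type[taxpayer_type].append(count)

averages = {t: mean(values) for t, values in by_type.items()}
for t, avg in averages.items():
    print(f"{t}: mean 납세자수 = {avg:.2f}")
```

In these rows individual taxpayers (개인) outnumber corporations (법인) by well over an order of magnitude on average, so the taxpayer type alone explains much of the variation in the count.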