Overview

Dataset statistics

Number of variables: 9
Number of observations: 32
Missing cells: 3
Missing cells (%): 1.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.6 KiB
Average record size in memory: 82.1 B
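
The missing-cell percentage above follows from the raw counts. As a minimal sketch (the variable names are illustrative and not part of the report), it is the number of missing cells divided by the total number of cells, that is, variables × observations:

```python
# Counts taken from the dataset statistics above.
n_variables = 9
n_observations = 32
missing_cells = 3

total_cells = n_variables * n_observations        # 9 * 32 cells in total
missing_pct = 100 * missing_cells / total_cells   # share of cells that are missing

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```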

Variable types

Categorical: 5
Numeric: 4

Dataset

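Description: Data on the share of local tax in Gumi-si (구미시), Gyeongsangbuk-do (경상북도) that is non-taxable or reduced. It provides the local government code (자치단체코드), tax item name (세목명), base year (과세년도), non-taxable amount (비과세금액), reduction amount (감면금액), levied amount (부과금액), and non-taxation/reduction rate (비과세감면율). ※ Shows, by year, the most recent 3 years of data for which the statistical yearbook has been finalized.
URL: https://www.data.go.kr/data/15078347/fileData.do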

Alerts

시도명 has constant value "경상북도" (Constant)
시군구명 has constant value "구미시" (Constant)
자치단체코드 has constant value "47190" (Constant)
비과세금액 is highly overall correlated with 감면금액 and 2 other fields (High correlation)
감면금액 is highly overall correlated with 비과세금액 and 2 other fields (High correlation)
부과금액 is highly overall correlated with 비과세금액 and 2 other fields (High correlation)
비과세감면율 is highly overall correlated with 비과세금액 and 1 other field (High correlation)
세목명 is highly overall correlated with 부과금액 (High correlation)
비과세금액 has 3 (9.4%) missing values (Missing)
비과세금액 has 5 (15.6%) zeros (Zeros)
부과금액 has 3 (9.4%) zeros (Zeros)
비과세감면율 has 7 (21.9%) zeros (Zeros)

Reproduction

Analysis started: 2023-12-12 01:42:15.803741
Analysis finished: 2023-12-12 01:42:18.399405
Duration: 2.6 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
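
The reported duration is the difference between the two timestamps above. A small sketch using only the Python standard library (the variable names are illustrative):

```python
from datetime import datetime

# Timestamps copied from the Reproduction block above.
started = datetime.fromisoformat("2023-12-12 01:42:15.803741")
finished = datetime.fromisoformat("2023-12-12 01:42:18.399405")

# Elapsed wall-clock time in seconds.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.1f} seconds")
```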

Variables

시도명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 3.1%
Missing: 0
Missing (%): 0.0%
Memory size: 388.0 B
경상북도: 32

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 경상북도
2nd row: 경상북도
3rd row: 경상북도
4th row: 경상북도
5th row: 경상북도

Common Values

Value | Count | Frequency (%)
경상북도 | 32 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: bar plot of value counts; 경상북도 32 (100.0%)]

시군구명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 3.1%
Missing: 0
Missing (%): 0.0%
Memory size: 388.0 B
구미시: 32

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 구미시
2nd row: 구미시
3rd row: 구미시
4th row: 구미시
5th row: 구미시

Common Values

Value | Count | Frequency (%)
구미시 | 32 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: bar plot of value counts; 구미시 32 (100.0%)]

자치단체코드
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 3.1%
Missing: 0
Missing (%): 0.0%
Memory size: 388.0 B
47190: 32

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 47190
2nd row: 47190
3rd row: 47190
4th row: 47190
5th row: 47190

Common Values

Value | Count | Frequency (%)
47190 | 32 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: bar plot of value counts; 47190 32 (100.0%)]

세목명
Categorical

HIGH CORRELATION 

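Distinct: 8
Distinct (%): 25.0%
Missing: 0
Missing (%): 0.0%
Memory size: 388.0 B
교육세, 등록세, 재산세, 주민세, 취득세: 4 each
Other values (3): 12

Length

Max length: 7
Median length: 3
Mean length: 3.875
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 교육세
2nd row: 등록세
3rd row: 재산세
4th row: 주민세
5th row: 취득세

Common Values

Value | Count | Frequency (%)
교육세 (education tax) | 4 | 12.5%
등록세 (registration tax) | 4 | 12.5%
재산세 (property tax) | 4 | 12.5%
주민세 (resident tax) | 4 | 12.5%
취득세 (acquisition tax) | 4 | 12.5%
자동차세 (automobile tax) | 4 | 12.5%
등록면허세 (registration and license tax) | 4 | 12.5%
지역자원시설세 (local resource and facility tax) | 4 | 12.5%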

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: bar plot of value counts; each of the 8 세목명 values occurs 4 times (12.5%)]

과세년도
Categorical

Distinct: 4
Distinct (%): 12.5%
Missing: 0
Missing (%): 0.0%
Memory size: 388.0 B
2018, 2019, 2020, 2021: 8 each

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2018
2nd row: 2018
3rd row: 2018
4th row: 2018
5th row: 2018

Common Values

Value | Count | Frequency (%)
2018 | 8 | 25.0%
2019 | 8 | 25.0%
2020 | 8 | 25.0%
2021 | 8 | 25.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: bar plot of value counts; 2018, 2019, 2020, and 2021 each 8 (25.0%)]
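
Taken together, the 세목명 and 과세년도 frequencies are consistent with a balanced panel: 8 tax items × 4 years = 32 observations, with 0 duplicate rows. The sketch below rebuilds only that key grid from the category values listed in the report and recomputes the frequencies. That each (year, tax item) pair occurs exactly once is an assumption consistent with the counts, not something the report states.

```python
from collections import Counter
from itertools import product

# Category values as listed in the report.
tax_items = ["교육세", "등록세", "재산세", "주민세", "취득세",
             "자동차세", "등록면허세", "지역자원시설세"]
years = ["2018", "2019", "2020", "2021"]

# Assumption: exactly one row per (year, tax item) pair.
rows = list(product(years, tax_items))

year_counts = Counter(year for year, _ in rows)
item_counts = Counter(item for _, item in rows)

for year, count in year_counts.items():
    print(year, count, f"{100 * count / len(rows):.1f}%")
for item, count in item_counts.items():
    print(item, count, f"{100 * count / len(rows):.1f}%")
```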

비과세금액
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct: 25
Distinct (%): 86.2%
Missing: 3
Missing (%): 9.4%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.5305921 × 10^9
Minimum: 0
Maximum: 2.4366917 × 10^10
Zeros: 5
Zeros (%): 15.6%
Negative: 0
Negative (%): 0.0%
Memory size: 420.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 17169000
Median: 1.7882094 × 10^8
Q3: 7.229825 × 10^9
95-th percentile: 2.2677449 × 10^10
Maximum: 2.4366917 × 10^10
Range: 2.4366917 × 10^10
Interquartile range (IQR): 7.212656 × 10^9

Descriptive statistics

Standard deviation: 8.020843 × 10^9
Coefficient of variation (CV): 1.7703741
Kurtosis: 1.4389878
Mean: 4.5305921 × 10^9
Median Absolute Deviation (MAD): 1.7882094 × 10^8
Skewness: 1.6810994
Sum: 1.3138717 × 10^11
Variance: 6.4333922 × 10^19
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=25)]
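
Several of the 비과세금액 statistics are derived from one another, which makes them easy to sanity-check. The sketch below uses only the rounded values printed in the report, so comparisons are made with a relative tolerance. The interpretation that the mean is taken over the 29 non-missing values (32 observations minus 3 missing) is an inference from the numbers, not a statement in the report.

```python
import math

# Values as reported for 비과세금액 (already rounded by the report).
n_observations = 32
n_missing = 3
total = 1.3138717e11           # Sum
mean = 4.5305921e9             # Mean
std = 8.020843e9               # Standard deviation
q1, q3 = 17169000, 7.229825e9  # Q1 and Q3

n_valid = n_observations - n_missing  # non-missing values

# Each entry pairs a recomputed value with the value printed in the report.
checks = {
    "mean = sum / n_valid": (total / n_valid, mean),
    "CV = std / mean": (std / mean, 1.7703741),
    "variance = std ** 2": (std ** 2, 6.4333922e19),
    "IQR = Q3 - Q1": (q3 - q1, 7.212656e9),
}

for name, (computed, reported) in checks.items():
    ok = math.isclose(computed, reported, rel_tol=1e-6)
    print(f"{name}: computed {computed:.7g}, reported {reported:.7g}, match={ok}")
```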
Common values

Value | Count | Frequency (%)
0 | 5 | 15.6%
53850000 | 1 | 3.1%
10182683000 | 1 | 3.1%
669478000 | 1 | 3.1%
65864000 | 1 | 3.1%
181884000 | 1 | 3.1%
7229825000 | 1 | 3.1%
26543000 | 1 | 3.1%
24366917000 | 1 | 3.1%
639289000 | 1 | 3.1%
Other values (15) | 15 | 46.9%
(Missing) | 3 | 9.4%

Minimum 10 values

Value | Count | Frequency (%)
0 | 5 | 15.6%
9550000 | 1 | 3.1%
11750000 | 1 | 3.1%
17169000 | 1 | 3.1%
17556660 | 1 | 3.1%
26543000 | 1 | 3.1%
53850000 | 1 | 3.1%
61390000 | 1 | 3.1%
65864000 | 1 | 3.1%
172883000 | 1 | 3.1%

Maximum 10 values

Value | Count | Frequency (%)
24366917000 | 1 | 3.1%
23476994850 | 1 | 3.1%
21478130000 | 1 | 3.1%
20870896000 | 1 | 3.1%
10182683000 | 1 | 3.1%
10129250000 | 1 | 3.1%
9702039460 | 1 | 3.1%
7229825000 | 1 | 3.1%
669478000 | 1 | 3.1%
658628000 | 1 | 3.1%

감면금액
Real number (ℝ)

HIGH CORRELATION 

Distinct: 31
Distinct (%): 96.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3.8696084 × 10^9
Minimum: 2000
Maximum: 2.6425936 × 10^10
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 420.0 B

Quantile statistics

Minimum: 2000
5-th percentile: 4000
Q1: 1.2661925 × 10^8
Median: 2.6919408 × 10^8
Q3: 2.5689545 × 10^9
95-th percentile: 2.3529776 × 10^10
Maximum: 2.6425936 × 10^10
Range: 2.6425934 × 10^10
Interquartile range (IQR): 2.4423352 × 10^9

Descriptive statistics

Standard deviation: 7.616617 × 10^9
Coefficient of variation (CV): 1.9683173
Kurtosis: 4.0845182
Mean: 3.8696084 × 10^9
Median Absolute Deviation (MAD): 2.6919108 × 10^8
Skewness: 2.299627
Sum: 1.2382747 × 10^11
Variance: 5.8012854 × 10^19
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=31)]
Common values

Value | Count | Frequency (%)
4000 | 2 | 6.2%
1565000 | 1 | 3.1%
18553680 | 1 | 3.1%
197380000 | 1 | 3.1%
236007000 | 1 | 3.1%
1790727000 | 1 | 3.1%
21619346000 | 1 | 3.1%
1411096000 | 1 | 3.1%
5387425000 | 1 | 3.1%
14575000 | 1 | 3.1%
Other values (21) | 21 | 65.6%

Minimum 10 values

Value | Count | Frequency (%)
2000 | 1 | 3.1%
4000 | 2 | 6.2%
1565000 | 1 | 3.1%
14575000 | 1 | 3.1%
18553680 | 1 | 3.1%
31239000 | 1 | 3.1%
32129000 | 1 | 3.1%
158116000 | 1 | 3.1%
197380000 | 1 | 3.1%
198801000 | 1 | 3.1%

Maximum 10 values

Value | Count | Frequency (%)
26425936000 | 1 | 3.1%
25864746000 | 1 | 3.1%
21619346000 | 1 | 3.1%
18015144720 | 1 | 3.1%
5387425000 | 1 | 3.1%
5068146000 | 1 | 3.1%
5041211890 | 1 | 3.1%
4903637000 | 1 | 3.1%
1790727000 | 1 | 3.1%
1678281000 | 1 | 3.1%

부과금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 30
Distinct (%): 93.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.5474745 × 10^10
Minimum: 0
Maximum: 1.35 × 10^11
Zeros: 3
Zeros (%): 9.4%
Negative: 0
Negative (%): 0.0%
Memory size: 420.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 1.2708486 × 10^10
Median: 3.5589954 × 10^10
Q3: 6.8336234 × 10^10
95-th percentile: 1.233 × 10^11
Maximum: 1.35 × 10^11
Range: 1.35 × 10^11
Interquartile range (IQR): 5.5627748 × 10^10

Descriptive statistics

Standard deviation: 3.9942998 × 10^10
Coefficient of variation (CV): 0.87835563
Kurtosis: -0.28720117
Mean: 4.5474745 × 10^10
Median Absolute Deviation (MAD): 2.5407292 × 10^10
Skewness: 0.8153981
Sum: 1.4551918 × 10^12
Variance: 1.5954431 × 10^21
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=30)]
Common values

Value | Count | Frequency (%)
0 | 3 | 9.4%
42328722000 | 1 | 3.1%
149920690 | 1 | 3.1%
14171198000 | 1 | 3.1%
10839984000 | 1 | 3.1%
74615492000 | 1 | 3.1%
135000000000 | 1 | 3.1%
28427538000 | 1 | 3.1%
67130428000 | 1 | 3.1%
44493945000 | 1 | 3.1%
Other values (20) | 20 | 62.5%

Minimum 10 values

Value | Count | Frequency (%)
0 | 3 | 9.4%
149920690 | 1 | 3.1%
9525342000 | 1 | 3.1%
10839984000 | 1 | 3.1%
10873055000 | 1 | 3.1%
10938629400 | 1 | 3.1%
13298438000 | 1 | 3.1%
14003409000 | 1 | 3.1%
14167199000 | 1 | 3.1%
14171198000 | 1 | 3.1%

Maximum 10 values

Value | Count | Frequency (%)
135000000000 | 1 | 3.1%
131000000000 | 1 | 3.1%
117000000000 | 1 | 3.1%
110000000000 | 1 | 3.1%
88310570780 | 1 | 3.1%
87258322320 | 1 | 3.1%
74615492000 | 1 | 3.1%
71953651000 | 1 | 3.1%
67130428000 | 1 | 3.1%
66298499000 | 1 | 3.1%

비과세감면율
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 26
Distinct (%): 81.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 10.288125
Minimum: 0
Maximum: 44.32
Zeros: 7
Zeros (%): 21.9%
Negative: 0
Negative (%): 0.0%
Memory size: 420.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0.31
Median: 2.88
Q3: 14.59
95-th percentile: 39.8475
Maximum: 44.32
Range: 44.32
Interquartile range (IQR): 14.28

Descriptive statistics

Standard deviation: 14.090112
Coefficient of variation (CV): 1.369551
Kurtosis: 0.38771697
Mean: 10.288125
Median Absolute Deviation (MAD): 2.88
Skewness: 1.3686654
Sum: 329.22
Variance: 198.53126
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=26)]
Common values

Value | Count | Frequency (%)
0.0 | 7 | 21.9%
32.29 | 1 | 3.1%
6.12 | 1 | 3.1%
2.78 | 1 | 3.1%
2.64 | 1 | 3.1%
21.29 | 1 | 3.1%
5.06 | 1 | 3.1%
44.32 | 1 | 3.1%
5.93 | 1 | 3.1%
2.52 | 1 | 3.1%
Other values (16) | 16 | 50.0%

Minimum 10 values

Value | Count | Frequency (%)
0.0 | 7 | 21.9%
0.28 | 1 | 3.1%
0.32 | 1 | 3.1%
2.12 | 1 | 3.1%
2.25 | 1 | 3.1%
2.52 | 1 | 3.1%
2.57 | 1 | 3.1%
2.64 | 1 | 3.1%
2.68 | 1 | 3.1%
2.78 | 1 | 3.1%

Maximum 10 values

Value | Count | Frequency (%)
44.32 | 1 | 3.1%
40.04 | 1 | 3.1%
39.69 | 1 | 3.1%
33.26 | 1 | 3.1%
32.29 | 1 | 3.1%
30.91 | 1 | 3.1%
21.29 | 1 | 3.1%
21.22 | 1 | 3.1%
12.38 | 1 | 3.1%
6.37 | 1 | 3.1%

Interactions

[Figures: 16 interaction plots; images not preserved]

Correlations

[Figure: correlation heatmap]

 | 세목명 | 과세년도 | 비과세금액 | 감면금액 | 부과금액 | 비과세감면율
세목명 | 1.000 | 0.000 | 0.673 | 0.690 | 0.928 | 0.871
과세년도 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000
비과세금액 | 0.673 | 0.000 | 1.000 | 0.997 | 0.854 | 0.851
감면금액 | 0.690 | 0.000 | 0.997 | 1.000 | 0.845 | 0.846
부과금액 | 0.928 | 0.000 | 0.854 | 0.845 | 1.000 | 0.794
비과세감면율 | 0.871 | 0.000 | 0.851 | 0.846 | 0.794 | 1.000

[Figure: correlation heatmap]

 | 과세년도 | 세목명
과세년도 | 1.000 | 0.000
세목명 | 0.000 | 1.000

[Figure: correlation heatmap]

 | 비과세금액 | 감면금액 | 부과금액 | 비과세감면율 | 세목명 | 과세년도
비과세금액 | 1.000 | 0.792 | 0.567 | 0.832 | 0.427 | 0.000
감면금액 | 0.792 | 1.000 | 0.633 | 0.742 | 0.456 | 0.000
부과금액 | 0.567 | 0.633 | 1.000 | 0.450 | 0.773 | 0.000
비과세감면율 | 0.832 | 0.742 | 0.450 | 1.000 | 0.468 | 0.000
세목명 | 0.427 | 0.456 | 0.773 | 0.468 | 1.000 | 0.000
과세년도 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000
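
The "High correlation" alerts in the Overview can be cross-checked against the last of the three correlation matrices. The sketch below flags every off-diagonal coefficient above a cutoff and lists the partner columns per variable. The cutoff of 0.5 is an assumption, chosen because it reproduces the partner counts in the alerts; the report itself does not state the threshold or which matrix drives the alerts.

```python
# Last correlation matrix above, row and column order as printed.
cols = ["비과세금액", "감면금액", "부과금액", "비과세감면율", "세목명", "과세년도"]
matrix = [
    [1.000, 0.792, 0.567, 0.832, 0.427, 0.000],
    [0.792, 1.000, 0.633, 0.742, 0.456, 0.000],
    [0.567, 0.633, 1.000, 0.450, 0.773, 0.000],
    [0.832, 0.742, 0.450, 1.000, 0.468, 0.000],
    [0.427, 0.456, 0.773, 0.468, 1.000, 0.000],
    [0.000, 0.000, 0.000, 0.000, 0.000, 1.000],
]

THRESHOLD = 0.5  # assumption: not stated in the report

# Map each column to the other columns it is highly correlated with.
high = {}
for i, col in enumerate(cols):
    partners = [other for j, other in enumerate(cols)
                if i != j and matrix[i][j] > THRESHOLD]
    if partners:
        high[col] = partners

for col, partners in high.items():
    print(col, "->", ", ".join(partners))
```

Under this assumed cutoff, 과세년도 has no partners, which agrees with the absence of a correlation alert for that column.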

Missing values

[Figure: nullity bar chart. A simple visualization of nullity by column.]
[Figure: nullity matrix. A data-dense display which lets you quickly pick out patterns in data completion.]

Sample

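First rows

Index | 시도명 | 시군구명 | 자치단체코드 | 세목명 | 과세년도 | 비과세금액 | 감면금액 | 부과금액 | 비과세감면율
0 | 경상북도 | 구미시 | 47190 | 교육세 | 2018 | 0 | 1565000 | 42328722000 | 0.0
1 | 경상북도 | 구미시 | 47190 | 등록세 | 2018 | <NA> | 770305000 | 0 | 0.0
2 | 경상북도 | 구미시 | 47190 | 재산세 | 2018 | 20870896000 | 4903637000 | 64935422000 | 39.69
3 | 경상북도 | 구미시 | 47190 | 주민세 | 2018 | 53850000 | 31239000 | 30007969000 | 0.28
4 | 경상북도 | 구미시 | 47190 | 취득세 | 2018 | 10182683000 | 25864746000 | 117000000000 | 30.91
5 | 경상북도 | 구미시 | 47190 | 자동차세 | 2018 | 658628000 | 1281776000 | 57793987000 | 3.36
6 | 경상북도 | 구미시 | 47190 | 등록면허세 | 2018 | 17169000 | 227289000 | 10873055000 | 2.25
7 | 경상북도 | 구미시 | 47190 | 지역자원시설세 | 2018 | 567340000 | 280245000 | 13298438000 | 6.37
8 | 경상북도 | 구미시 | 47190 | 교육세 | 2019 | 0 | 4000 | 41171940000 | 0.0
9 | 경상북도 | 구미시 | 47190 | 등록세 | 2019 | <NA> | 158116000 | 0 | 0.0

Last rows

Index | 시도명 | 시군구명 | 자치단체코드 | 세목명 | 과세년도 | 비과세금액 | 감면금액 | 부과금액 | 비과세감면율
22 | 경상북도 | 구미시 | 47190 | 등록면허세 | 2020 | 17556660 | 258143170 | 10938629400 | 2.52
23 | 경상북도 | 구미시 | 47190 | 지역자원시설세 | 2020 | 639289000 | 200785000 | 14167199000 | 5.93
24 | 경상북도 | 구미시 | 47190 | 교육세 | 2021 | 0 | 4000 | 44493945000 | 0.0
25 | 경상북도 | 구미시 | 47190 | 등록세 | 2021 | <NA> | 14575000 | 0 | 0.0
26 | 경상북도 | 구미시 | 47190 | 재산세 | 2021 | 24366917000 | 5387425000 | 67130428000 | 44.32
27 | 경상북도 | 구미시 | 47190 | 주민세 | 2021 | 26543000 | 1411096000 | 28427538000 | 5.06
28 | 경상북도 | 구미시 | 47190 | 취득세 | 2021 | 7229825000 | 21619346000 | 135000000000 | 21.29
29 | 경상북도 | 구미시 | 47190 | 자동차세 | 2021 | 181884000 | 1790727000 | 74615492000 | 2.64
30 | 경상북도 | 구미시 | 47190 | 등록면허세 | 2021 | 65864000 | 236007000 | 10839984000 | 2.78
31 | 경상북도 | 구미시 | 47190 | 지역자원시설세 | 2021 | 669478000 | 197380000 | 14171198000 | 6.12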
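
In the sample rows, 비과세감면율 appears to equal (비과세금액 + 감면금액) / 부과금액 × 100, rounded to two decimals. This relationship is inferred from the rows shown here, not documented by the report. The sketch below checks it on a few of them; the function name `exemption_rate` is hypothetical, and treating a missing 비과세금액 as 0 and a zero 부과금액 as a rate of 0.0 are assumptions that match the sample. Rows whose 부과금액 looks rounded in the report (for example 117000000000 in row 4) reproduce the rate only approximately and are left out.

```python
from typing import Optional


def exemption_rate(non_taxable: Optional[float], reduction: float,
                   levied: float) -> float:
    """Apparent 비과세감면율: (비과세금액 + 감면금액) / 부과금액 * 100.

    Assumptions inferred from the sample rows, not documented:
    a missing 비과세금액 counts as 0, and a zero 부과금액 yields 0.0.
    """
    if levied == 0:
        return 0.0
    total = (non_taxable or 0) + reduction
    return round(100 * total / levied, 2)


# (비과세금액, 감면금액, 부과금액, reported 비과세감면율) from the sample rows.
sample = [
    (0, 1565000, 42328722000, 0.0),                  # row 0, 교육세 2018
    (None, 770305000, 0, 0.0),                       # row 1, 등록세 2018
    (20870896000, 4903637000, 64935422000, 39.69),   # row 2, 재산세 2018
    (53850000, 31239000, 30007969000, 0.28),         # row 3, 주민세 2018
    (24366917000, 5387425000, 67130428000, 44.32),   # row 26, 재산세 2021
    (669478000, 197380000, 14171198000, 6.12),       # row 31, 지역자원시설세 2021
]

for non_taxable, reduction, levied, reported in sample:
    print(exemption_rate(non_taxable, reduction, levied), reported)
```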