Overview

Dataset statistics

Number of variables8
Number of observations188
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory12.6 KiB
Average record size in memory68.7 B

Variable types

Categorical6
Numeric2

Dataset

Description경상남도 거창군 지방세 세목별 과세현황에 대한 데이터로 2017년도, 2018년도, 2019년도. 2020년도, 2021년도, 2022년도 세목명, 세원유형명, 부과건수, 부과금액 항목을 제공합니다.
Author경상남도 거창군
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15079153

Alerts

시도명 has constant value ""Constant
시군구명 has constant value ""Constant
자치단체코드 has constant value ""Constant
부과건수 is highly overall correlated with 부과금액 and 2 other fieldsHigh correlation
부과금액 is highly overall correlated with 부과건수 and 1 other fieldsHigh correlation
세목명 is highly overall correlated with 부과건수 and 1 other fieldsHigh correlation
세원 유형명 is highly overall correlated with 부과건수 and 2 other fieldsHigh correlation
부과건수 has 50 (26.6%) zerosZeros
부과금액 has 50 (26.6%) zerosZeros

Reproduction

Analysis started2023-12-10 23:35:48.642199
Analysis finished2023-12-10 23:35:49.601049
Duration0.96 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
경상남도
188 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경상남도
2nd row경상남도
3rd row경상남도
4th row경상남도
5th row경상남도

Common Values

ValueCountFrequency (%)
경상남도 188
100.0%

Length

2023-12-11T08:35:49.941704image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:35:50.041087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경상남도 188
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
거창군
188 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row거창군
2nd row거창군
3rd row거창군
4th row거창군
5th row거창군

Common Values

ValueCountFrequency (%)
거창군 188
100.0%

Length

2023-12-11T08:35:50.130306image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:35:50.214322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
거창군 188
100.0%

자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
48880
188 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row48880
2nd row48880
3rd row48880
4th row48880
5th row48880

Common Values

ValueCountFrequency (%)
48880 188
100.0%

Length

2023-12-11T08:35:50.300944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:35:50.385401image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
48880 188
100.0%

과세년도
Categorical

Distinct4
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2017
47 
2018
47 
2019
47 
2020
47 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2017
2nd row2017
3rd row2017
4th row2017
5th row2017

Common Values

ValueCountFrequency (%)
2017 47
25.0%
2018 47
25.0%
2019 47
25.0%
2020 47
25.0%

Length

2023-12-11T08:35:50.468478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:35:50.566074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2017 47
25.0%
2018 47
25.0%
2019 47
25.0%
2020 47
25.0%

세목명
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)6.9%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
취득세
36 
주민세
36 
자동차세
28 
재산세
20 
레저세
16 
Other values (8)
52 

Length

Max length7
Median length3
Mean length3.6808511
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row담배소비세
2nd row교육세
3rd row도시계획세
4th row취득세
5th row취득세

Common Values

ValueCountFrequency (%)
취득세 36
19.1%
주민세 36
19.1%
자동차세 28
14.9%
재산세 20
10.6%
레저세 16
8.5%
지방소득세 16
8.5%
등록면허세 8
 
4.3%
지역자원시설세 8
 
4.3%
담배소비세 4
 
2.1%
교육세 4
 
2.1%
Other values (3) 12
 
6.4%

Length

2023-12-11T08:35:50.661007image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
취득세 36
19.1%
주민세 36
19.1%
자동차세 28
14.9%
재산세 20
10.6%
레저세 16
8.5%
지방소득세 16
8.5%
등록면허세 8
 
4.3%
지역자원시설세 8
 
4.3%
담배소비세 4
 
2.1%
교육세 4
 
2.1%
Other values (3) 12
 
6.4%

세원 유형명
Categorical

HIGH CORRELATION 

Distinct47
Distinct (%)25.0%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
담배소비세
 
4
소싸움
 
4
도시계획세
 
4
건축물
 
4
주택(개별)
 
4
Other values (42)
168 

Length

Max length11
Median length8
Mean length6.0425532
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row담배소비세
2nd row교육세
3rd row도시계획세
4th row건축물
5th row주택(개별)

Common Values

ValueCountFrequency (%)
담배소비세 4
 
2.1%
소싸움 4
 
2.1%
도시계획세 4
 
2.1%
건축물 4
 
2.1%
주택(개별) 4
 
2.1%
주택(단독) 4
 
2.1%
기타 4
 
2.1%
항공기 4
 
2.1%
기계장비 4
 
2.1%
차량 4
 
2.1%
Other values (37) 148
78.7%

Length

2023-12-11T08:35:50.762560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
담배소비세 4
 
2.1%
화물 4
 
2.1%
기타승용 4
 
2.1%
승용 4
 
2.1%
주민세(재산분 4
 
2.1%
주민세(종업원분 4
 
2.1%
주민세(특별징수 4
 
2.1%
주민세(법인세분 4
 
2.1%
주민세(양도소득 4
 
2.1%
주민세(종합소득 4
 
2.1%
Other values (37) 148
78.7%

부과건수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct135
Distinct (%)71.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8704.7979
Minimum0
Maximum153635
Zeros50
Zeros (%)26.6%
Negative0
Negative (%)0.0%
Memory size1.8 KiB
2023-12-11T08:35:50.867215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median451.5
Q35895.5
95-th percentile31999.95
Maximum153635
Range153635
Interquartile range (IQR)5895.5

Descriptive statistics

Standard deviation23595.039
Coefficient of variation (CV)2.7105786
Kurtosis25.69106
Mean8704.7979
Median Absolute Deviation (MAD)451.5
Skewness4.813862
Sum1636502
Variance5.5672587 × 108
MonotonicityNot monotonic
2023-12-11T08:35:51.003948image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 50
 
26.6%
12 4
 
2.1%
9 2
 
1.1%
20269 1
 
0.5%
355 1
 
0.5%
1088 1
 
0.5%
1829 1
 
0.5%
26251 1
 
0.5%
9642 1
 
0.5%
13713 1
 
0.5%
Other values (125) 125
66.5%
ValueCountFrequency (%)
0 50
26.6%
2 1
 
0.5%
3 1
 
0.5%
4 1
 
0.5%
5 1
 
0.5%
6 1
 
0.5%
8 1
 
0.5%
9 2
 
1.1%
12 4
 
2.1%
16 1
 
0.5%
ValueCountFrequency (%)
153635 1
0.5%
149320 1
0.5%
148885 1
0.5%
147832 1
0.5%
55191 1
0.5%
54539 1
0.5%
53583 1
0.5%
52780 1
0.5%
33704 1
0.5%
32246 1
0.5%

부과금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct139
Distinct (%)73.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.157671 × 109
Minimum0
Maximum7.966664 × 109
Zeros50
Zeros (%)26.6%
Negative0
Negative (%)0.0%
Memory size1.8 KiB
2023-12-11T08:35:51.158438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median2.30801 × 108
Q31.6144042 × 109
95-th percentile5.1217456 × 109
Maximum7.966664 × 109
Range7.966664 × 109
Interquartile range (IQR)1.6144042 × 109

Descriptive statistics

Standard deviation1.7418778 × 109
Coefficient of variation (CV)1.5046398
Kurtosis2.4355921
Mean1.157671 × 109
Median Absolute Deviation (MAD)2.30801 × 108
Skewness1.7662377
Sum2.1764214 × 1011
Variance3.0341383 × 1018
MonotonicityNot monotonic
2023-12-11T08:35:51.290301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 50
 
26.6%
4430121000 1
 
0.5%
879937000 1
 
0.5%
547050000 1
 
0.5%
68750000 1
 
0.5%
91669000 1
 
0.5%
262970000 1
 
0.5%
148579000 1
 
0.5%
1036443000 1
 
0.5%
33205000 1
 
0.5%
Other values (129) 129
68.6%
ValueCountFrequency (%)
0 50
26.6%
32000 1
 
0.5%
75000 1
 
0.5%
253000 1
 
0.5%
381000 1
 
0.5%
522000 1
 
0.5%
600000 1
 
0.5%
1315000 1
 
0.5%
2047000 1
 
0.5%
2439000 1
 
0.5%
ValueCountFrequency (%)
7966664000 1
0.5%
7706954000 1
0.5%
5979308000 1
0.5%
5856779000 1
0.5%
5767097000 1
0.5%
5749982000 1
0.5%
5426906000 1
0.5%
5377549000 1
0.5%
5376600000 1
0.5%
5125702000 1
0.5%

Interactions

2023-12-11T08:35:49.141142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:35:48.909850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:35:49.240304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:35:49.033286image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T08:35:51.389085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과세년도세목명세원 유형명부과건수부과금액
과세년도1.0000.0000.0000.0000.000
세목명0.0001.0001.0000.8170.731
세원 유형명0.0001.0001.0001.0000.944
부과건수0.0000.8171.0001.0000.816
부과금액0.0000.7310.9440.8161.000
2023-12-11T08:35:51.477217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
세원 유형명과세년도세목명
세원 유형명1.0000.0000.898
과세년도0.0001.0000.000
세목명0.8980.0001.000
2023-12-11T08:35:51.556652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
부과건수부과금액과세년도세목명세원 유형명
부과건수1.0000.7700.0000.6090.878
부과금액0.7701.0000.0000.4200.636
과세년도0.0000.0001.0000.0000.000
세목명0.6090.4200.0001.0000.898
세원 유형명0.8780.6360.0000.8981.000

Missing values

2023-12-11T08:35:49.395236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T08:35:49.542117image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명자치단체코드과세년도세목명세원 유형명부과건수부과금액
0경상남도거창군488802017담배소비세담배소비세1094430121000
1경상남도거창군488802017교육세교육세1478325125702000
2경상남도거창군488802017도시계획세도시계획세00
3경상남도거창군488802017취득세건축물6041862828000
4경상남도거창군488802017취득세주택(개별)11081650051000
5경상남도거창군488802017취득세주택(단독)538934756000
6경상남도거창군488802017취득세기타25120084000
7경상남도거창군488802017취득세항공기00
8경상남도거창군488802017취득세기계장비458555535000
9경상남도거창군488802017취득세차량47373661189000
시도명시군구명자치단체코드과세년도세목명세원 유형명부과건수부과금액
178경상남도거창군488802020지방소비세지방소비세65376600000
179경상남도거창군488802020등록면허세등록면허세(면허)10440158146000
180경상남도거창군488802020등록면허세등록면허세(등록)145391125304000
181경상남도거창군488802020지역자원시설세지역자원시설세(소방)20990914136000
182경상남도거창군488802020지역자원시설세지역자원시설세(특자)19629825000
183경상남도거창군488802020지방소득세지방소득세(특별징수)70542870165000
184경상남도거창군488802020지방소득세지방소득세(법인소득)9111744161000
185경상남도거창군488802020지방소득세지방소득세(양도소득)11101172981000
186경상남도거창군488802020지방소득세지방소득세(종합소득)52001388246000
187경상남도거창군488802020체납체납267281678972000