Overview

Dataset statistics

Number of variables8
Number of observations279
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory18.7 KiB
Average record size in memory68.5 B

Variable types

Categorical5
Numeric3

Dataset

Description2017.1.1.부터 2022.12.31.까지 서천군 세목별 세원 유형별 부과건수 및 부과금액에 대한 과세현황 자료에 대하여 제공합니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=347&beforeMenuCd=DOM_000000201001001000&publicdatapk=15080475

Alerts

시도명 has constant value ""Constant
시군구명 has constant value ""Constant
자치단체코드 has constant value ""Constant
세목명 is highly overall correlated with 부과건수 and 1 other fieldsHigh correlation
세원 유형명 is highly overall correlated with 부과건수 and 2 other fieldsHigh correlation
부과건수 is highly overall correlated with 부과금액 and 2 other fieldsHigh correlation
부과금액 is highly overall correlated with 부과건수 and 1 other fieldsHigh correlation
부과건수 has 71 (25.4%) zerosZeros
부과금액 has 71 (25.4%) zerosZeros

Reproduction

Analysis started2024-01-09 21:27:57.647576
Analysis finished2024-01-09 21:27:58.639370
Duration0.99 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
충청남도
279 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row충청남도
2nd row충청남도
3rd row충청남도
4th row충청남도
5th row충청남도

Common Values

ValueCountFrequency (%)
충청남도 279
100.0%

Length

2024-01-10T06:27:58.700363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T06:27:58.787438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
충청남도 279
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
서천군
279 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서천군
2nd row서천군
3rd row서천군
4th row서천군
5th row서천군

Common Values

ValueCountFrequency (%)
서천군 279
100.0%

Length

2024-01-10T06:27:58.881075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T06:27:58.955393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서천군 279
100.0%

자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
44770
279 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row44770
2nd row44770
3rd row44770
4th row44770
5th row44770

Common Values

ValueCountFrequency (%)
44770 279
100.0%

Length

2024-01-10T06:27:59.029703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T06:27:59.102574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
44770 279
100.0%

과세년도
Real number (ℝ)

Distinct6
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2019.4839
Minimum2017
Maximum2022
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.6 KiB
2024-01-10T06:27:59.169510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2017
5-th percentile2017
Q12018
median2019
Q32021
95-th percentile2022
Maximum2022
Range5
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.7108177
Coefficient of variation (CV)0.00084715591
Kurtosis-1.2694549
Mean2019.4839
Median Absolute Deviation (MAD)1
Skewness0.01465019
Sum563436
Variance2.9268972
MonotonicityIncreasing
2024-01-10T06:27:59.252370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
2017 47
16.8%
2018 47
16.8%
2019 47
16.8%
2020 46
16.5%
2021 46
16.5%
2022 46
16.5%
ValueCountFrequency (%)
2017 47
16.8%
2018 47
16.8%
2019 47
16.8%
2020 46
16.5%
2021 46
16.5%
2022 46
16.5%
ValueCountFrequency (%)
2022 46
16.5%
2021 46
16.5%
2020 46
16.5%
2019 47
16.8%
2018 47
16.8%
2017 47
16.8%

세목명
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)4.7%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
취득세
54 
주민세
50 
자동차세
42 
재산세
30 
레저세
24 
Other values (8)
79 

Length

Max length7
Median length3
Mean length3.7096774
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row지역자원시설세
2nd row지역자원시설세
3rd row담배소비세
4th row교육세
5th row도시계획세

Common Values

ValueCountFrequency (%)
취득세 54
19.4%
주민세 50
17.9%
자동차세 42
15.1%
재산세 30
10.8%
레저세 24
8.6%
지방소득세 24
8.6%
지역자원시설세 14
 
5.0%
등록면허세 12
 
4.3%
담배소비세 6
 
2.2%
교육세 6
 
2.2%
Other values (3) 17
 
6.1%

Length

2024-01-10T06:27:59.343888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
취득세 54
19.4%
주민세 50
17.9%
자동차세 42
15.1%
재산세 30
10.8%
레저세 24
8.6%
지방소득세 24
8.6%
지역자원시설세 14
 
5.0%
등록면허세 12
 
4.3%
담배소비세 6
 
2.2%
교육세 6
 
2.2%
Other values (3) 17
 
6.1%

세원 유형명
Categorical

HIGH CORRELATION 

Distinct50
Distinct (%)17.9%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
지역자원시설세(소방)
 
6
기계장비
 
6
차량
 
6
재산세(건축물)
 
6
교육세
 
6
Other values (45)
249 

Length

Max length11
Median length8
Mean length6.0394265
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row지역자원시설세(소방)
2nd row지역자원시설세(특자)
3rd row담배소비세
4th row교육세
5th row도시계획세

Common Values

ValueCountFrequency (%)
지역자원시설세(소방) 6
 
2.2%
기계장비 6
 
2.2%
차량 6
 
2.2%
재산세(건축물) 6
 
2.2%
교육세 6
 
2.2%
건축물 6
 
2.2%
주택(개별) 6
 
2.2%
주택(단독) 6
 
2.2%
기타 6
 
2.2%
재산세(항공기) 6
 
2.2%
Other values (40) 219
78.5%

Length

2024-01-10T06:27:59.440592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
지역자원시설세(소방 6
 
2.2%
승용 6
 
2.2%
3륜이하 6
 
2.2%
기계장비 6
 
2.2%
화물 6
 
2.2%
승합 6
 
2.2%
기타승용 6
 
2.2%
체납 6
 
2.2%
지방소비세 6
 
2.2%
주민세(종업원분 6
 
2.2%
Other values (40) 219
78.5%

부과건수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct195
Distinct (%)69.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8845.7419
Minimum0
Maximum149650
Zeros71
Zeros (%)25.4%
Negative0
Negative (%)0.0%
Memory size2.6 KiB
2024-01-10T06:27:59.535602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median461
Q36586.5
95-th percentile39770.2
Maximum149650
Range149650
Interquartile range (IQR)6586.5

Descriptive statistics

Standard deviation23774.73
Coefficient of variation (CV)2.6877034
Kurtosis22.774795
Mean8845.7419
Median Absolute Deviation (MAD)461
Skewness4.5584659
Sum2467962
Variance5.6523781 × 108
MonotonicityNot monotonic
2024-01-10T06:27:59.642140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 71
 
25.4%
12 6
 
2.2%
94 2
 
0.7%
11 2
 
0.7%
7 2
 
0.7%
23 2
 
0.7%
101 2
 
0.7%
20 2
 
0.7%
2 2
 
0.7%
6 2
 
0.7%
Other values (185) 186
66.7%
ValueCountFrequency (%)
0 71
25.4%
1 1
 
0.4%
2 2
 
0.7%
3 1
 
0.4%
6 2
 
0.7%
7 2
 
0.7%
9 1
 
0.4%
10 1
 
0.4%
11 2
 
0.7%
12 6
 
2.2%
ValueCountFrequency (%)
149650 1
0.4%
148416 1
0.4%
148379 1
0.4%
145355 1
0.4%
144887 1
0.4%
144679 1
0.4%
65462 1
0.4%
64590 1
0.4%
63465 1
0.4%
62965 1
0.4%

부과금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct209
Distinct (%)74.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.3448173 × 109
Minimum0
Maximum2.2510836 × 1010
Zeros71
Zeros (%)25.4%
Negative0
Negative (%)0.0%
Memory size2.6 KiB
2024-01-10T06:27:59.748899image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median2.5399 × 108
Q31.7309115 × 109
95-th percentile5.0860576 × 109
Maximum2.2510836 × 1010
Range2.2510836 × 1010
Interquartile range (IQR)1.7309115 × 109

Descriptive statistics

Standard deviation2.2778373 × 109
Coefficient of variation (CV)1.6937895
Kurtosis27.631603
Mean1.3448173 × 109
Median Absolute Deviation (MAD)2.5399 × 108
Skewness3.939207
Sum3.7520401 × 1011
Variance5.1885428 × 1018
MonotonicityNot monotonic
2024-01-10T06:27:59.851986image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 71
 
25.4%
772313000 1
 
0.4%
744085000 1
 
0.4%
1076035000 1
 
0.4%
225299000 1
 
0.4%
1359175000 1
 
0.4%
972504000 1
 
0.4%
1844899000 1
 
0.4%
4321448000 1
 
0.4%
6118785000 1
 
0.4%
Other values (199) 199
71.3%
ValueCountFrequency (%)
0 71
25.4%
370000 1
 
0.4%
1131000 1
 
0.4%
1380000 1
 
0.4%
1534000 1
 
0.4%
1563000 1
 
0.4%
1623000 1
 
0.4%
2083000 1
 
0.4%
2154000 1
 
0.4%
2226000 1
 
0.4%
ValueCountFrequency (%)
22510836000 1
0.4%
11516755000 1
0.4%
8316584000 1
0.4%
7625455000 1
0.4%
7557100000 1
0.4%
7245032000 1
0.4%
6497196000 1
0.4%
6364254000 1
0.4%
6188428000 1
0.4%
6118785000 1
0.4%

Interactions

2024-01-10T06:27:58.263534image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:27:57.855558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:27:58.053585image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:27:58.334654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:27:57.920887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:27:58.121693image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:27:58.404924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:27:57.990245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:27:58.195321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T06:28:00.141274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과세년도세목명세원 유형명부과건수부과금액
과세년도1.0000.0000.0000.0000.000
세목명0.0001.0001.0000.8350.670
세원 유형명0.0001.0001.0000.9820.843
부과건수0.0000.8350.9821.0000.701
부과금액0.0000.6700.8430.7011.000
2024-01-10T06:28:00.215904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
세목명세원 유형명
세목명1.0000.928
세원 유형명0.9281.000
2024-01-10T06:28:00.279531image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과세년도부과건수부과금액세목명세원 유형명
과세년도1.0000.0200.0710.0000.000
부과건수0.0201.0000.7390.6040.811
부과금액0.0710.7391.0000.4060.507
세목명0.0000.6040.4061.0000.928
세원 유형명0.0000.8110.5070.9281.000

Missing values

2024-01-10T06:27:58.490652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T06:27:58.589590image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명자치단체코드과세년도세목명세원 유형명부과건수부과금액
0충청남도서천군447702017지역자원시설세지역자원시설세(소방)13455772313000
1충청남도서천군447702017지역자원시설세지역자원시설세(특자)6402936000
2충청남도서천군447702017담배소비세담배소비세1074124495000
3충청남도서천군447702017교육세교육세1453554775902000
4충청남도서천군447702017도시계획세도시계획세00
5충청남도서천군447702017취득세건축물7911363633000
6충청남도서천군447702017취득세주택(개별)14091398456000
7충청남도서천군447702017취득세주택(단독)298578515000
8충청남도서천군447702017취득세기타1725331000
9충청남도서천군447702017취득세항공기00
시도명시군구명자치단체코드과세년도세목명세원 유형명부과건수부과금액
269충청남도서천군447702022지방소득세지방소득세(법인소득)9222996167000
270충청남도서천군447702022지방소득세지방소득세(양도소득)811761985000
271충청남도서천군447702022지방소득세지방소득세(종합소득)68441055052000
272충청남도서천군447702022등록면허세등록면허세(면허)19983246823000
273충청남도서천군447702022등록면허세등록면허세(등록)119201104702000
274충청남도서천군447702022지역자원시설세지역자원시설세(소방)137151886843000
275충청남도서천군447702022지역자원시설세지역자원시설세(시설)101372712000
276충청남도서천군447702022지역자원시설세지역자원시설세(특자)00
277충청남도서천군447702022체납체납306961883795000
278충청남도서천군447702022담배소비세담배소비세6364175164000