Overview

Dataset statistics

Number of variables: 10
Number of observations: 181
Missing cells: 4
Missing cells (%): 0.2%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 15.2 KiB
Average record size in memory: 85.7 B
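
The missing-cell percentage follows from the counts above. A minimal sketch of the arithmetic, assuming the percentage is taken over all cells (variables × observations); the variable names are illustrative only:

```python
# Counts taken from the dataset statistics above.
n_variables = 10
n_observations = 181
missing_cells = 4

# Assumption: the percentage is computed over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(total_cells, round(missing_pct, 1))
```

Rounded to one decimal place, this agrees with the 0.2% reported above.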

Variable types

Categorical: 6
Boolean: 1
Numeric: 3

Dataset

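Description: Provides taxation data on the number and amount of payments of Seocheon-gun (서천군) local taxes from 2017 to 2022, broken down by tax item and payment method (ARS, virtual account, Wetax, bank counter, automated machine, etc.).
Author: 충청남도 (Chungcheongnam-do)
URL: https://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=347&beforeMenuCd=DOM_000000201001001000&publicdatapk=15080478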

Alerts

세목명 is highly overall correlated with 시도명 and 2 other fields (High correlation)
납부매체전자고지여부 is highly overall correlated with 시도명 and 3 other fields (High correlation)
납부매체 is highly overall correlated with 시도명 and 3 other fields (High correlation)
시군구명 is highly overall correlated with 납부건수 and 8 other fields (High correlation)
납부년도 is highly overall correlated with 시도명 and 2 other fields (High correlation)
시도명 is highly overall correlated with 납부건수 and 8 other fields (High correlation)
자치단체코드 is highly overall correlated with 납부건수 and 8 other fields (High correlation)
납부건수 is highly overall correlated with 납부금액 and 4 other fields (High correlation)
납부금액 is highly overall correlated with 납부건수 and 4 other fields (High correlation)
납부매체비율 is highly overall correlated with 납부건수 and 4 other fields (High correlation)
시도명 is highly imbalanced (95.1%) (Imbalance)
시군구명 is highly imbalanced (95.1%) (Imbalance)
자치단체코드 is highly imbalanced (95.1%) (Imbalance)
납부매체비율 has 5 (2.8%) zeros (Zeros)
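
The 95.1% imbalance figure is consistent with a score of one minus the normalized Shannon entropy of the value counts (180 rows with one value, 1 row with another). The sketch below assumes that definition; `imbalance_score` is a hypothetical helper written for illustration, not a function taken from the profiling tool:

```python
import math

def imbalance_score(counts):
    """Return 1 - (Shannon entropy / maximum entropy) for a list of class counts.

    0 means perfectly balanced classes; values near 1 mean one class dominates.
    """
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2:
        # Assumption for this sketch: a single class is scored as 0.
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(n_classes)

# Value counts reported for 시도명, 시군구명 and 자치단체코드.
score = imbalance_score([180, 1])
print(f"{score:.1%}")
```

With counts of 180 and 1, the score rounds to the 95.1% shown in the three imbalance alerts.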

Reproduction

Analysis started: 2024-01-09 23:05:12.762529
Analysis finished: 2024-01-09 23:05:14.312034
Duration: 1.55 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

시도명
Categorical

HIGH CORRELATION, IMBALANCE

Distinct: 2
Distinct (%): 1.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.5 KiB

충청남도 | 180
<NA> | 1

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 1
Unique (%): 0.6%

Sample

1st row: 충청남도
2nd row: 충청남도
3rd row: 충청남도
4th row: 충청남도
5th row: 충청남도

Common Values

Value | Count | Frequency (%)
충청남도 | 180 | 99.4%
<NA> | 1 | 0.6%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values listed above]

시군구명
Categorical

HIGH CORRELATION, IMBALANCE

Distinct: 2
Distinct (%): 1.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.5 KiB

서천군 | 180
<NA> | 1

Length

Max length: 4
Median length: 3
Mean length: 3.0055249
Min length: 3
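
The mean length of 3.0055249 can be reproduced if the single `<NA>` entry is counted as a literal four-character string alongside the 180 three-character values. The sketch below assumes that reading; `mean_length` is a hypothetical helper, and the same calculation also matches the 4.9944751 reported for 자치단체코드:

```python
def mean_length(value_counts):
    """Average string length, weighted by how often each value occurs."""
    total = sum(value_counts.values())
    return sum(len(value) * count for value, count in value_counts.items()) / total

# Assumption: the all-missing row is stored as the literal string "<NA>".
sigungu = {"서천군": 180, "<NA>": 1}
local_gov_code = {"44770": 180, "<NA>": 1}

print(round(mean_length(sigungu), 7))
print(round(mean_length(local_gov_code), 7))
```

This would also explain why these categorical columns report 0 missing values while listing `<NA>` as a distinct category.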

Unique

Unique: 1
Unique (%): 0.6%

Sample

1st row: 서천군
2nd row: 서천군
3rd row: 서천군
4th row: 서천군
5th row: 서천군

Common Values

Value | Count | Frequency (%)
서천군 | 180 | 99.4%
<NA> | 1 | 0.6%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values listed above]

자치단체코드
Categorical

HIGH CORRELATION, IMBALANCE

Distinct: 2
Distinct (%): 1.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.5 KiB

44770 | 180
<NA> | 1

Length

Max length: 5
Median length: 5
Mean length: 4.9944751
Min length: 4

Unique

Unique: 1
Unique (%): 0.6%

Sample

1st row: 44770
2nd row: 44770
3rd row: 44770
4th row: 44770
5th row: 44770

Common Values

Value | Count | Frequency (%)
44770 | 180 | 99.4%
<NA> | 1 | 0.6%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values listed above]

납부년도
Categorical

HIGH CORRELATION

Distinct: 4
Distinct (%): 2.2%
Missing: 0
Missing (%): 0.0%
Memory size: 1.5 KiB

2017 | 68
2018 | 63
2019 | 49
<NA> | 1

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 1
Unique (%): 0.6%

Sample

1st row: 2017
2nd row: 2017
3rd row: 2017
4th row: 2017
5th row: 2017

Common Values

Value | Count | Frequency (%)
2017 | 68 | 37.6%
2018 | 63 | 34.8%
2019 | 49 | 27.1%
<NA> | 1 | 0.6%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values listed above]

세목명
Categorical

HIGH CORRELATION

Distinct: 13
Distinct (%): 7.2%
Missing: 0
Missing (%): 0.0%
Memory size: 1.5 KiB

등록면허세 | 26
자동차세 | 25
재산세 | 25
주민세 | 25
지방소득세 | 22
Other values (8) | 58

Length

Max length: 7
Median length: 3
Mean length: 3.8121547
Min length: 3

Unique

Unique: 2
Unique (%): 1.1%

Sample

1st row: 등록면허세
2nd row: 자동차세
3rd row: 재산세
4th row: 종합토지세
5th row: 주민세

Common Values

Value | Count | Frequency (%)
등록면허세 | 26 | 14.4%
자동차세 | 25 | 13.8%
재산세 | 25 | 13.8%
주민세 | 25 | 13.8%
지방소득세 | 22 | 12.2%
취득세 | 22 | 12.2%
등록세 | 14 | 7.7%
면허세 | 8 | 4.4%
종합토지세 | 6 | 3.3%
담배소비세 | 3 | 1.7%
Other values (3) | 5 | 2.8%

Length

[Figure: Histogram of lengths of the category]

납부매체
Categorical

HIGH CORRELATION

Distinct: 10
Distinct (%): 5.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.5 KiB

가상계좌 | 28
은행창구 | 28
위택스 | 21
기타 | 20
인터넷지로 | 20
Other values (5) | 64

Length

Max length: 5
Median length: 4
Mean length: 3.8453039
Min length: 2

Unique

Unique: 1
Unique (%): 0.6%

Sample

1st row: ARS
2nd row: ARS
3rd row: ARS
4th row: ARS
5th row: ARS

Common Values

Value | Count | Frequency (%)
가상계좌 | 28 | 15.5%
은행창구 | 28 | 15.5%
위택스 | 21 | 11.6%
기타 | 20 | 11.0%
인터넷지로 | 20 | 11.0%
ARS | 19 | 10.5%
자동화기기 | 16 | 8.8%
지자체방문 | 16 | 8.8%
자동이체 | 12 | 6.6%
<NA> | 1 | 0.6%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values listed above]

납부매체전자고지여부
Boolean

HIGH CORRELATION

Distinct: 2
Distinct (%): 1.1%
Missing: 1
Missing (%): 0.6%
Memory size: 494.0 B

Value | Count | Frequency (%)
False | 99 | 54.7%
True | 81 | 44.8%
(Missing) | 1 | 0.6%

납부건수
Real number (ℝ)

HIGH CORRELATION

Distinct: 161
Distinct (%): 89.4%
Missing: 1
Missing (%): 0.6%
Infinite: 0
Infinite (%): 0.0%
Mean: 3055.1111
Minimum: 1
Maximum: 26931
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.7 KiB

Quantile statistics

Minimum: 1
5-th percentile: 3
Q1: 77.5
median: 618.5
Q3: 3036
95-th percentile: 14210.55
Maximum: 26931
Range: 26930
Interquartile range (IQR): 2958.5

Descriptive statistics

Standard deviation: 5201.0012
Coefficient of variation (CV): 1.7023935
Kurtosis: 5.5472849
Mean: 3055.1111
Median Absolute Deviation (MAD): 614.5
Skewness: 2.3665985
Sum: 549920
Variance: 27050413
Monotonicity: Not monotonic
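
Several of the 납부건수 statistics above are derived from others, which makes them easy to cross-check. A minimal sketch using the reported values, assuming the sum and mean are taken over the 180 non-missing rows; the variable names are illustrative only:

```python
# Reported statistics for 납부건수.
mean = 3055.1111
std = 5201.0012
q1, q3 = 77.5, 3036
minimum, maximum = 1, 26931
n_valid = 181 - 1  # assumption: the one missing row is excluded

cv = std / mean                  # coefficient of variation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range
variance = std ** 2              # variance as the squared standard deviation
total = mean * n_valid           # sum recovered from the mean

print(round(cv, 4), iqr, value_range, round(variance), round(total))
```

Each derived value agrees with the corresponding figure in the report to within rounding.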
[Figure: Histogram with fixed size bins (bins=50)]

Common values

Value | Count | Frequency (%)
1 | 5 | 2.8%
3 | 5 | 2.8%
5 | 3 | 1.7%
2 | 3 | 1.7%
4 | 3 | 1.7%
73 | 2 | 1.1%
22 | 2 | 1.1%
107 | 2 | 1.1%
6 | 2 | 1.1%
15 | 2 | 1.1%
Other values (151) | 151 | 83.4%

Minimum 10 values

Value | Count | Frequency (%)
1 | 5 | 2.8%
2 | 3 | 1.7%
3 | 5 | 2.8%
4 | 3 | 1.7%
5 | 3 | 1.7%
6 | 2 | 1.1%
8 | 1 | 0.6%
11 | 1 | 0.6%
13 | 1 | 0.6%
14 | 1 | 0.6%

Maximum 10 values

Value | Count | Frequency (%)
26931 | 1 | 0.6%
24519 | 1 | 0.6%
22201 | 1 | 0.6%
20912 | 1 | 0.6%
19376 | 1 | 0.6%
17881 | 1 | 0.6%
17399 | 1 | 0.6%
15710 | 1 | 0.6%
14392 | 1 | 0.6%
14201 | 1 | 0.6%

납부금액
Real number (ℝ)

HIGH CORRELATION

Distinct: 180
Distinct (%): 100.0%
Missing: 1
Missing (%): 0.6%
Infinite: 0
Infinite (%): 0.0%
Mean: 7.8970756 × 10^8
Minimum: 2560
Maximum: 8.1107021 × 10^9
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.7 KiB

Quantile statistics

Minimum: 2560
5-th percentile: 98625
Q1: 8717735
median: 1.3191528 × 10^8
Q3: 7.7257284 × 10^8
95-th percentile: 4.1924021 × 10^9
Maximum: 8.1107021 × 10^9
Range: 8.1106995 × 10^9
Interquartile range (IQR): 7.638551 × 10^8

Descriptive statistics

Standard deviation: 1.5323752 × 10^9
Coefficient of variation (CV): 1.9404337
Kurtosis: 9.7438114
Mean: 7.8970756 × 10^8
Median Absolute Deviation (MAD): 1.3181548 × 10^8
Skewness: 3.0760435
Sum: 1.4214736 × 10^11
Variance: 2.3481736 × 10^18
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=50)]

Common values

Value | Count | Frequency (%)
405170 | 1 | 0.6%
944778620 | 1 | 0.6%
105216140 | 1 | 0.6%
12268560 | 1 | 0.6%
707910000 | 1 | 0.6%
828642780 | 1 | 0.6%
83601380 | 1 | 0.6%
83907310 | 1 | 0.6%
3372686290 | 1 | 0.6%
36696150 | 1 | 0.6%
Other values (170) | 170 | 93.9%

Minimum 10 values

Value | Count | Frequency (%)
2560 | 1 | 0.6%
6180 | 1 | 0.6%
12600 | 1 | 0.6%
15450 | 1 | 0.6%
24840 | 1 | 0.6%
33990 | 1 | 0.6%
43260 | 1 | 0.6%
59820 | 1 | 0.6%
80100 | 1 | 0.6%
99600 | 1 | 0.6%

Maximum 10 values

Value | Count | Frequency (%)
8110702090 | 1 | 0.6%
7629165770 | 1 | 0.6%
7615609500 | 1 | 0.6%
7023124190 | 1 | 0.6%
6291147830 | 1 | 0.6%
5918071440 | 1 | 0.6%
5856247160 | 1 | 0.6%
5623755530 | 1 | 0.6%
5206626730 | 1 | 0.6%
4139021900 | 1 | 0.6%

납부매체비율
Real number (ℝ)

HIGH CORRELATION, ZEROS

Distinct: 162
Distinct (%): 90.0%
Missing: 1
Missing (%): 0.6%
Infinite: 0
Infinite (%): 0.0%
Mean: 13.927556
Minimum: 0
Maximum: 90.88
Zeros: 5
Zeros (%): 2.8%
Negative: 0
Negative (%): 0.0%
Memory size: 1.7 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0.01
Q1: 0.9975
median: 7.375
Q3: 18.94
95-th percentile: 49.7785
Maximum: 90.88
Range: 90.88
Interquartile range (IQR): 17.9425

Descriptive statistics

Standard deviation: 17.246999
Coefficient of variation (CV): 1.2383364
Kurtosis: 4.6097735
Mean: 13.927556
Median Absolute Deviation (MAD): 7.215
Skewness: 1.9594731
Sum: 2506.96
Variance: 297.45896
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=50)]

Common values

Value | Count | Frequency (%)
0.0 | 5 | 2.8%
0.01 | 5 | 2.8%
0.04 | 3 | 1.7%
0.06 | 2 | 1.1%
1.08 | 2 | 1.1%
0.05 | 2 | 1.1%
49.77 | 2 | 1.1%
0.07 | 2 | 1.1%
0.08 | 2 | 1.1%
0.14 | 2 | 1.1%
Other values (152) | 153 | 84.5%

Minimum 10 values

Value | Count | Frequency (%)
0.0 | 5 | 2.8%
0.01 | 5 | 2.8%
0.02 | 1 | 0.6%
0.04 | 3 | 1.7%
0.05 | 2 | 1.1%
0.06 | 2 | 1.1%
0.07 | 2 | 1.1%
0.08 | 2 | 1.1%
0.14 | 2 | 1.1%
0.15 | 1 | 0.6%

Maximum 10 values

Value | Count | Frequency (%)
90.88 | 1 | 0.6%
87.76 | 1 | 0.6%
85.53 | 1 | 0.6%
56.18 | 1 | 0.6%
53.1 | 1 | 0.6%
52.45 | 1 | 0.6%
50.75 | 1 | 0.6%
50.37 | 1 | 0.6%
49.94 | 1 | 0.6%
49.77 | 2 | 1.1%

Interactions

[Figures: nine interaction plots; images not preserved]

Correlations

[Figure: correlation heatmap for the table below]

 | 납부년도 | 세목명 | 납부매체 | 납부매체전자고지여부 | 납부건수 | 납부금액 | 납부매체비율
납부년도 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000
세목명 | 0.000 | 1.000 | 0.000 | 0.000 | 0.253 | 0.563 | 0.597
납부매체 | 0.000 | 0.000 | 1.000 | 1.000 | 0.425 | 0.398 | 0.458
납부매체전자고지여부 | 0.000 | 0.000 | 1.000 | 1.000 | 0.175 | 0.215 | 0.148
납부건수 | 0.000 | 0.253 | 0.425 | 0.175 | 1.000 | 0.732 | 0.736
납부금액 | 0.000 | 0.563 | 0.398 | 0.215 | 0.732 | 1.000 | 0.408
납부매체비율 | 0.000 | 0.597 | 0.458 | 0.148 | 0.736 | 0.408 | 1.000

[Figure: correlation heatmap for the table below]

 | 세목명 | 납부매체전자고지여부 | 납부매체 | 시군구명 | 납부년도 | 시도명 | 자치단체코드
세목명 | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 1.000 | 1.000
납부매체전자고지여부 | 0.000 | 1.000 | 0.980 | 1.000 | 0.000 | 1.000 | 1.000
납부매체 | 0.000 | 0.980 | 1.000 | 1.000 | 0.000 | 1.000 | 1.000
시군구명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
납부년도 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000
시도명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
자치단체코드 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000

[Figure: correlation heatmap for the table below]

 | 납부건수 | 납부금액 | 납부매체비율 | 시도명 | 시군구명 | 자치단체코드 | 납부년도 | 세목명 | 납부매체 | 납부매체전자고지여부
납부건수 | 1.000 | 0.779 | 0.832 | 1.000 | 1.000 | 1.000 | 0.000 | 0.106 | 0.207 | 0.130
납부금액 | 0.779 | 1.000 | 0.598 | 1.000 | 1.000 | 1.000 | 0.000 | 0.275 | 0.193 | 0.160
납부매체비율 | 0.832 | 0.598 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000 | 0.297 | 0.244 | 0.108
시도명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
시군구명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
자치단체코드 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
납부년도 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000
세목명 | 0.106 | 0.275 | 0.297 | 1.000 | 1.000 | 1.000 | 0.000 | 1.000 | 0.000 | 0.000
납부매체 | 0.207 | 0.193 | 0.244 | 1.000 | 1.000 | 1.000 | 0.000 | 0.000 | 1.000 | 0.980
납부매체전자고지여부 | 0.130 | 0.160 | 0.108 | 1.000 | 1.000 | 1.000 | 0.000 | 0.000 | 0.980 | 1.000

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
[Figure] The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

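index | 시도명 | 시군구명 | 자치단체코드 | 납부년도 | 세목명 | 납부매체 | 납부매체전자고지여부 | 납부건수 | 납부금액 | 납부매체비율
0 | 충청남도 | 서천군 | 44770 | 2017 | 등록면허세 | ARS | N | 39 | 405170 | 3.17
1 | 충청남도 | 서천군 | 44770 | 2017 | 자동차세 | ARS | N | 620 | 110469480 | 50.37
2 | 충청남도 | 서천군 | 44770 | 2017 | 재산세 | ARS | N | 438 | 30202980 | 35.58
3 | 충청남도 | 서천군 | 44770 | 2017 | 종합토지세 | ARS | N | 1 | 2560 | 0.08
4 | 충청남도 | 서천군 | 44770 | 2017 | 주민세 | ARS | N | 106 | 1515020 | 8.61
5 | 충청남도 | 서천군 | 44770 | 2017 | 지방소득세 | ARS | N | 22 | 4499650 | 1.79
6 | 충청남도 | 서천군 | 44770 | 2017 | 취득세 | ARS | N | 5 | 738640 | 0.41
7 | 충청남도 | 서천군 | 44770 | 2017 | 등록면허세 | 가상계좌 | Y | 4888 | 90832230 | 9.74
8 | 충청남도 | 서천군 | 44770 | 2017 | 등록세 | 가상계좌 | Y | 2 | 239490 | 0.0
9 | 충청남도 | 서천군 | 44770 | 2017 | 면허세 | 가상계좌 | Y | 6 | 33990 | 0.01

index | 시도명 | 시군구명 | 자치단체코드 | 납부년도 | 세목명 | 납부매체 | 납부매체전자고지여부 | 납부건수 | 납부금액 | 납부매체비율
171 | 충청남도 | 서천군 | 44770 | 2019 | 주민세 | 인터넷지로 | Y | 432 | 154521190 | 12.0
172 | 충청남도 | 서천군 | 44770 | 2019 | 지방소득세 | 인터넷지로 | Y | 276 | 156158480 | 7.66
173 | 충청남도 | 서천군 | 44770 | 2019 | 취득세 | 인터넷지로 | Y | 107 | 651151610 | 2.97
174 | 충청남도 | 서천군 | 44770 | 2019 | 등록면허세 | 자동이체 | Y | 1657 | 17595150 | 6.92
175 | 충청남도 | 서천군 | 44770 | 2019 | 자동차세 | 자동이체 | Y | 4521 | 536588630 | 18.88
176 | 충청남도 | 서천군 | 44770 | 2019 | 재산세 | 자동이체 | Y | 11874 | 971637030 | 49.58
177 | 충청남도 | 서천군 | 44770 | 2019 | 주민세 | 자동이체 | Y | 5897 | 75967150 | 24.62
178 | 충청남도 | 서천군 | 44770 | 2019 | 등록면허세 | 자동화기기 | N | 1652 | 154998720 | 6.82
179 | 충청남도 | 서천군 | 44770 | 2019 | 등록세 | 자동화기기 | N | 37 | 7926980 | 0.15
180 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>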
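
The sample rows suggest that 납부매체비율 is each tax item's share of the payment count within one year and payment method. The sketch below checks this for the seven ARS rows of 2017, reading their 납부건수 values as 39, 620, 438, 1, 106, 22 and 5. Both that reading of the sample and the assumption that these seven rows are the complete ARS group for 2017 are inferences from the sample, not something the dataset description states:

```python
# 납부건수 for the ARS rows of 2017, as read from sample rows 0-6 (assumption).
ars_2017_counts = {
    "등록면허세": 39,
    "자동차세": 620,
    "재산세": 438,
    "종합토지세": 1,
    "주민세": 106,
    "지방소득세": 22,
    "취득세": 5,
}

# Assumption: the ratio is the percentage share of the group's total count.
group_total = sum(ars_2017_counts.values())
shares = {
    tax_item: round(100 * count / group_total, 2)
    for tax_item, count in ars_2017_counts.items()
}

for tax_item, share in shares.items():
    print(tax_item, share)
```

Under these assumptions the computed shares match the 납부매체비율 values in sample rows 0 to 6 (3.17, 50.37, 35.58, 0.08, 8.61, 1.79, 0.41).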