Overview

Dataset statistics

Number of variables10
Number of observations251
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory21.0 KiB
Average record size in memory85.5 B

Variable types

Categorical6
Boolean1
Numeric3

Dataset

Description2017년부터 2019년 지방세 납부방법(신용카드, 가상계좌, ARS 등)의 데이터로 전자송달 시장규모 및 분석, 수수료 산정시 기초자료로 활용
Author경상남도 양산시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15079426

Alerts

시도명 has constant value ""Constant
시군구명 has constant value ""Constant
자치단체코드 has constant value ""Constant
납부매체 is highly overall correlated with 납부매체전자고지여부High correlation
납부매체전자고지여부 is highly overall correlated with 납부매체High correlation
납부건수 is highly overall correlated with 납부금액 (원) and 1 other fieldsHigh correlation
납부금액 (원) is highly overall correlated with 납부건수 and 1 other fieldsHigh correlation
납부매체비율 is highly overall correlated with 납부건수 and 1 other fieldsHigh correlation
납부매체비율 has 26 (10.4%) zerosZeros

Reproduction

Analysis started2023-12-11 00:40:47.877506
Analysis finished2023-12-11 00:40:49.616980
Duration1.74 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
경상남도
251 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경상남도
2nd row경상남도
3rd row경상남도
4th row경상남도
5th row경상남도

Common Values

ValueCountFrequency (%)
경상남도 251
100.0%

Length

2023-12-11T09:40:49.693876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:40:49.779547image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경상남도 251
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
양산시
251 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row양산시
2nd row양산시
3rd row양산시
4th row양산시
5th row양산시

Common Values

ValueCountFrequency (%)
양산시 251
100.0%

Length

2023-12-11T09:40:49.891963image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:40:50.014672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
양산시 251
100.0%

자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
48330
251 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row48330
2nd row48330
3rd row48330
4th row48330
5th row48330

Common Values

ValueCountFrequency (%)
48330 251
100.0%

Length

2023-12-11T09:40:50.116452image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:40:50.230996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
48330 251
100.0%

납부년도
Categorical

Distinct3
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
2019
87 
2018
83 
2017
81 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2017
2nd row2017
3rd row2017
4th row2017
5th row2017

Common Values

ValueCountFrequency (%)
2019 87
34.7%
2018 83
33.1%
2017 81
32.3%

Length

2023-12-11T09:40:50.339849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:40:50.441365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2019 87
34.7%
2018 83
33.1%
2017 81
32.3%

세목명
Categorical

Distinct12
Distinct (%)4.8%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
등록면허세
31 
자동차세
31 
재산세
31 
주민세
31 
지방소득세
28 
Other values (7)
99 

Length

Max length7
Median length5
Mean length4.0717131
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row등록면허세
2nd row등록면허세
3rd row자동차세
4th row자동차세
5th row재산세

Common Values

ValueCountFrequency (%)
등록면허세 31
12.4%
자동차세 31
12.4%
재산세 31
12.4%
주민세 31
12.4%
지방소득세 28
11.2%
취득세 25
10.0%
지역자원시설세 20
8.0%
등록세 18
7.2%
면허세 14
5.6%
종합토지세 14
5.6%
Other values (2) 8
 
3.2%

Length

2023-12-11T09:40:50.583248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
등록면허세 31
12.4%
자동차세 31
12.4%
재산세 31
12.4%
주민세 31
12.4%
지방소득세 28
11.2%
취득세 25
10.0%
지역자원시설세 20
8.0%
등록세 18
7.2%
면허세 14
5.6%
종합토지세 14
5.6%
Other values (2) 8
 
3.2%

납부매체
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
ARS
35 
가상계좌
31 
은행창구
31 
자동화기기
31 
지자체방문
29 
Other values (5)
94 

Length

Max length5
Median length4
Mean length3.9043825
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowARS
2nd rowARS
3rd rowARS
4th rowARS
5th rowARS

Common Values

ValueCountFrequency (%)
ARS 35
13.9%
가상계좌 31
12.4%
은행창구 31
12.4%
자동화기기 31
12.4%
지자체방문 29
11.6%
기타 27
10.8%
위택스 25
10.0%
인터넷지로 24
9.6%
자동이체 12
 
4.8%
페이사납부 6
 
2.4%

Length

2023-12-11T09:40:50.769935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:40:50.937825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
ars 35
13.9%
가상계좌 31
12.4%
은행창구 31
12.4%
자동화기기 31
12.4%
지자체방문 29
11.6%
기타 27
10.8%
위택스 25
10.0%
인터넷지로 24
9.6%
자동이체 12
 
4.8%
페이사납부 6
 
2.4%

납부매체전자고지여부
Boolean

HIGH CORRELATION 

Distinct2
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size383.0 B
False
138 
True
113 
ValueCountFrequency (%)
False 138
55.0%
True 113
45.0%
2023-12-11T09:40:51.064631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

납부건수
Real number (ℝ)

HIGH CORRELATION 

Distinct201
Distinct (%)80.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10801.757
Minimum1
Maximum145782
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.3 KiB
2023-12-11T09:40:51.178515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q122.5
median1485
Q310720
95-th percentile45210
Maximum145782
Range145781
Interquartile range (IQR)10697.5

Descriptive statistics

Standard deviation22556.879
Coefficient of variation (CV)2.0882601
Kurtosis14.39734
Mean10801.757
Median Absolute Deviation (MAD)1482
Skewness3.5518192
Sum2711241
Variance5.0881277 × 108
MonotonicityNot monotonic
2023-12-11T09:40:51.379798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 9
 
3.6%
3 9
 
3.6%
4 9
 
3.6%
2 8
 
3.2%
5 4
 
1.6%
15 3
 
1.2%
11 3
 
1.2%
6 3
 
1.2%
7 3
 
1.2%
12 3
 
1.2%
Other values (191) 197
78.5%
ValueCountFrequency (%)
1 9
3.6%
2 8
3.2%
3 9
3.6%
4 9
3.6%
5 4
1.6%
6 3
 
1.2%
7 3
 
1.2%
8 1
 
0.4%
9 2
 
0.8%
11 3
 
1.2%
ValueCountFrequency (%)
145782 1
0.4%
133243 1
0.4%
126270 1
0.4%
112920 1
0.4%
105553 1
0.4%
105066 1
0.4%
83741 1
0.4%
81286 1
0.4%
75104 1
0.4%
66045 1
0.4%

납부금액 (원)
Real number (ℝ)

HIGH CORRELATION 

Distinct250
Distinct (%)99.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.9703209 × 109
Minimum14600
Maximum1.0488274 × 1011
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.3 KiB
2023-12-11T09:40:51.534410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum14600
5-th percentile28080
Q13416290
median3.1232534 × 108
Q34.0689562 × 109
95-th percentile3.2321858 × 1010
Maximum1.0488274 × 1011
Range1.0488273 × 1011
Interquartile range (IQR)4.0655399 × 109

Descriptive statistics

Standard deviation1.3623602 × 1010
Coefficient of variation (CV)2.2818877
Kurtosis17.828616
Mean5.9703209 × 109
Median Absolute Deviation (MAD)3.1229412 × 108
Skewness3.7803
Sum1.4985505 × 1012
Variance1.8560252 × 1020
MonotonicityNot monotonic
2023-12-11T09:40:51.717887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
15450 2
 
0.8%
1696720 1
 
0.4%
49358530 1
 
0.4%
74817300 1
 
0.4%
312325340 1
 
0.4%
615110 1
 
0.4%
5607190020 1
 
0.4%
3397630 1
 
0.4%
42450 1
 
0.4%
1295211560 1
 
0.4%
Other values (240) 240
95.6%
ValueCountFrequency (%)
14600 1
0.4%
15240 1
0.4%
15450 2
0.8%
16480 1
0.4%
18540 1
0.4%
18660 1
0.4%
19750 1
0.4%
22660 1
0.4%
22900 1
0.4%
23170 1
0.4%
ValueCountFrequency (%)
104882742370 1
0.4%
79572368450 1
0.4%
67485661880 1
0.4%
64733380140 1
0.4%
63334288340 1
0.4%
51284934560 1
0.4%
45685416550 1
0.4%
38203030640 1
0.4%
36060464240 1
0.4%
33749166450 1
0.4%

납부매체비율
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct178
Distinct (%)70.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11.155538
Minimum0
Maximum83.73
Zeros26
Zeros (%)10.4%
Negative0
Negative (%)0.0%
Memory size2.3 KiB
2023-12-11T09:40:51.861062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10.06
median4.13
Q317.33
95-th percentile39.36
Maximum83.73
Range83.73
Interquartile range (IQR)17.27

Descriptive statistics

Standard deviation14.909768
Coefficient of variation (CV)1.3365351
Kurtosis4.4701426
Mean11.155538
Median Absolute Deviation (MAD)4.13
Skewness1.8523872
Sum2800.04
Variance222.30118
MonotonicityNot monotonic
2023-12-11T09:40:51.988292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 26
 
10.4%
0.01 10
 
4.0%
0.02 8
 
3.2%
0.09 8
 
3.2%
0.05 7
 
2.8%
0.04 7
 
2.8%
0.06 6
 
2.4%
0.03 4
 
1.6%
0.1 3
 
1.2%
0.07 3
 
1.2%
Other values (168) 169
67.3%
ValueCountFrequency (%)
0.0 26
10.4%
0.01 10
 
4.0%
0.02 8
 
3.2%
0.03 4
 
1.6%
0.04 7
 
2.8%
0.05 7
 
2.8%
0.06 6
 
2.4%
0.07 3
 
1.2%
0.08 1
 
0.4%
0.09 8
 
3.2%
ValueCountFrequency (%)
83.73 1
0.4%
80.92 1
0.4%
76.45 1
0.4%
48.23 1
0.4%
47.25 1
0.4%
46.53 1
0.4%
45.84 1
0.4%
45.46 1
0.4%
43.82 1
0.4%
43.46 1
0.4%

Interactions

2023-12-11T09:40:49.069174image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:40:48.200814image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:40:48.528412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:40:49.163219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:40:48.322300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:40:48.626573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:40:49.260484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:40:48.437568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:40:48.991384image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T09:40:52.068365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
납부년도세목명납부매체납부매체전자고지여부납부건수납부금액 (원)납부매체비율
납부년도1.0000.0000.0000.0000.0000.0000.000
세목명0.0001.0000.0000.0400.3160.5220.600
납부매체0.0000.0001.0000.9930.4140.3160.457
납부매체전자고지여부0.0000.0400.9931.0000.1500.1260.182
납부건수0.0000.3160.4140.1501.0000.4760.669
납부금액 (원)0.0000.5220.3160.1260.4761.0000.335
납부매체비율0.0000.6000.4570.1820.6690.3351.000
2023-12-11T09:40:52.166770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
납부매체납부매체전자고지여부납부년도세목명
납부매체1.0000.9110.0000.000
납부매체전자고지여부0.9111.0000.0000.028
납부년도0.0000.0001.0000.000
세목명0.0000.0280.0001.000
2023-12-11T09:40:52.255025image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
납부건수납부금액 (원)납부매체비율납부년도세목명납부매체납부매체전자고지여부
납부건수1.0000.8630.8810.0000.1380.2010.147
납부금액 (원)0.8631.0000.7300.0000.2490.1560.093
납부매체비율0.8810.7301.0000.0000.3420.2500.193
납부년도0.0000.0000.0001.0000.0000.0000.000
세목명0.1380.2490.3420.0001.0000.0000.028
납부매체0.2010.1560.2500.0000.0001.0000.911
납부매체전자고지여부0.1470.0930.1930.0000.0280.9111.000

Missing values

2023-12-11T09:40:49.393143image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T09:40:49.561961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명자치단체코드납부년도세목명납부매체납부매체전자고지여부납부건수납부금액 (원)납부매체비율
0경상남도양산시483302017등록면허세ARSN12716967201.29
1경상남도양산시483302017등록면허세ARSY7957900.07
2경상남도양산시483302017자동차세ARSN459883239803046.53
3경상남도양산시483302017자동차세ARSY2522256400.25
4경상남도양산시483302017재산세ARSN339164692252034.31
5경상남도양산시483302017재산세ARSY95735300.09
6경상남도양산시483302017주민세ARSN14582202531014.75
7경상남도양산시483302017주민세ARSY487331400.49
8경상남도양산시483302017지방소득세ARSN134545674301.36
9경상남도양산시483302017지방소득세ARSY5688700.05
시도명시군구명자치단체코드납부년도세목명납부매체납부매체전자고지여부납부건수납부금액 (원)납부매체비율
241경상남도양산시483302019주민세지자체방문N37467733487016.93
242경상남도양산시483302019지방소득세지자체방문N8342684417003.77
243경상남도양산시483302019지역자원시설세지자체방문N86976700.04
244경상남도양산시483302019취득세지자체방문N159156375454007.19
245경상남도양산시483302019등록면허세페이사납부Y4510000.09
246경상남도양산시483302019자동차세페이사납부Y122618632568029.04
247경상남도양산시483302019재산세페이사납부Y160627115578038.04
248경상남도양산시483302019주민세페이사납부Y13781873690032.64
249경상남도양산시483302019지방소득세페이사납부Y43683000.09
250경상남도양산시483302019취득세페이사납부Y476450400.09