Overview

Dataset statistics

Number of variables9
Number of observations40
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.2 KiB
Average record size in memory81.3 B

Variable types

Categorical5
Numeric4

Dataset

Description과세액 중 비과세액과 감면액이 차지하는 비율 현황에 대한 데이터로 과세년도, 비과세금액, 감면금액, 부과금액, 비과세감면율 등을 제공합니다.
URLhttps://www.data.go.kr/data/15078437/fileData.do

Alerts

시도명 has constant value ""Constant
시군구명 has constant value ""Constant
자치단체코드 has constant value ""Constant
비과세금액 is highly overall correlated with 감면금액 and 3 other fieldsHigh correlation
감면금액 is highly overall correlated with 비과세금액 and 3 other fieldsHigh correlation
부과금액 is highly overall correlated with 비과세금액 and 1 other fieldsHigh correlation
비과세감면율 is highly overall correlated with 비과세금액 and 2 other fieldsHigh correlation
세목명 is highly overall correlated with 비과세금액 and 2 other fieldsHigh correlation
비과세금액 has 10 (25.0%) zerosZeros
부과금액 has 2 (5.0%) zerosZeros
비과세감면율 has 7 (17.5%) zerosZeros

Reproduction

Analysis started2023-12-12 21:47:56.146582
Analysis finished2023-12-12 21:47:58.772488
Duration2.63 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)2.5%
Missing0
Missing (%)0.0%
Memory size452.0 B
경기도
40 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경기도
2nd row경기도
3rd row경기도
4th row경기도
5th row경기도

Common Values

ValueCountFrequency (%)
경기도 40
100.0%

Length

2023-12-13T06:47:58.833940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:47:58.978235image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경기도 40
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)2.5%
Missing0
Missing (%)0.0%
Memory size452.0 B
여주시
40 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row여주시
2nd row여주시
3rd row여주시
4th row여주시
5th row여주시

Common Values

ValueCountFrequency (%)
여주시 40
100.0%

Length

2023-12-13T06:47:59.102838image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:47:59.214292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
여주시 40
100.0%

자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)2.5%
Missing0
Missing (%)0.0%
Memory size452.0 B
41670
40 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row41670
2nd row41670
3rd row41670
4th row41670
5th row41670

Common Values

ValueCountFrequency (%)
41670 40
100.0%

Length

2023-12-13T06:47:59.350282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:47:59.473487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
41670 40
100.0%

세목명
Categorical

HIGH CORRELATION 

Distinct8
Distinct (%)20.0%
Missing0
Missing (%)0.0%
Memory size452.0 B
교육세
등록세
재산세
주민세
취득세
Other values (3)
15 

Length

Max length7
Median length3
Mean length3.875
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row교육세
2nd row등록세
3rd row재산세
4th row주민세
5th row취득세

Common Values

ValueCountFrequency (%)
교육세 5
12.5%
등록세 5
12.5%
재산세 5
12.5%
주민세 5
12.5%
취득세 5
12.5%
자동차세 5
12.5%
등록면허세 5
12.5%
지역자원시설세 5
12.5%

Length

2023-12-13T06:47:59.583498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:47:59.748904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
교육세 5
12.5%
등록세 5
12.5%
재산세 5
12.5%
주민세 5
12.5%
취득세 5
12.5%
자동차세 5
12.5%
등록면허세 5
12.5%
지역자원시설세 5
12.5%

과세년도
Categorical

Distinct5
Distinct (%)12.5%
Missing0
Missing (%)0.0%
Memory size452.0 B
2018
2019
2020
2021
2022

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2018
2nd row2018
3rd row2018
4th row2018
5th row2018

Common Values

ValueCountFrequency (%)
2018 8
20.0%
2019 8
20.0%
2020 8
20.0%
2021 8
20.0%
2022 8
20.0%

Length

2023-12-13T06:47:59.907857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:48:00.042518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2018 8
20.0%
2019 8
20.0%
2020 8
20.0%
2021 8
20.0%
2022 8
20.0%

비과세금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct31
Distinct (%)77.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.9917055 × 109
Minimum0
Maximum1.3403824 × 1010
Zeros10
Zeros (%)25.0%
Negative0
Negative (%)0.0%
Memory size492.0 B
2023-12-13T06:48:00.191963image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q122500
median73904415
Q37.5030844 × 108
95-th percentile1.1987875 × 1010
Maximum1.3403824 × 1010
Range1.3403824 × 1010
Interquartile range (IQR)7.5028594 × 108

Descriptive statistics

Standard deviation4.0617103 × 109
Coefficient of variation (CV)2.0393127
Kurtosis2.7033193
Mean1.9917055 × 109
Median Absolute Deviation (MAD)73904415
Skewness2.0328591
Sum7.9668219 × 1010
Variance1.6497491 × 1019
MonotonicityNot monotonic
2023-12-13T06:48:00.334892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=31)
ValueCountFrequency (%)
0 10
25.0%
10909106000 1
 
2.5%
204790000 1
 
2.5%
78378000 1
 
2.5%
87163000 1
 
2.5%
2473290000 1
 
2.5%
650000 1
 
2.5%
13403824000 1
 
2.5%
191651380 1
 
2.5%
35107120 1
 
2.5%
Other values (21) 21
52.5%
ValueCountFrequency (%)
0 10
25.0%
30000 1
 
2.5%
250000 1
 
2.5%
650000 1
 
2.5%
5596000 1
 
2.5%
10258000 1
 
2.5%
28930000 1
 
2.5%
32760000 1
 
2.5%
35107120 1
 
2.5%
71703950 1
 
2.5%
ValueCountFrequency (%)
13403824000 1
2.5%
12660987484 1
2.5%
11952448317 1
2.5%
11262622000 1
2.5%
10909106000 1
2.5%
5507823000 1
2.5%
4987747370 1
2.5%
2477329000 1
2.5%
2473290000 1
2.5%
2386863770 1
2.5%

감면금액
Real number (ℝ)

HIGH CORRELATION 

Distinct39
Distinct (%)97.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.2652221 × 109
Minimum2000
Maximum9.3056644 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size492.0 B
2023-12-13T06:48:00.512600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2000
5-th percentile3950
Q18424445
median2.1491 × 108
Q39.304115 × 108
95-th percentile7.0063967 × 109
Maximum9.3056644 × 109
Range9.3056624 × 109
Interquartile range (IQR)9.2198706 × 108

Descriptive statistics

Standard deviation2.3024723 × 109
Coefficient of variation (CV)1.8198167
Kurtosis4.4278906
Mean1.2652221 × 109
Median Absolute Deviation (MAD)2.138015 × 108
Skewness2.2763537
Sum5.0608885 × 1010
Variance5.3013789 × 1018
MonotonicityNot monotonic
2023-12-13T06:48:00.675219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=39)
ValueCountFrequency (%)
4000 2
 
5.0%
2000 1
 
2.5%
159920500 1
 
2.5%
155248460 1
 
2.5%
5419480 1
 
2.5%
2263610144 1
 
2.5%
426706510 1
 
2.5%
6999126910 1
 
2.5%
586935750 1
 
2.5%
314277240 1
 
2.5%
Other values (29) 29
72.5%
ValueCountFrequency (%)
2000 1
2.5%
3000 1
2.5%
4000 2
5.0%
14000 1
2.5%
2203000 1
2.5%
3552000 1
2.5%
5419480 1
2.5%
5573000 1
2.5%
6607780 1
2.5%
9030000 1
2.5%
ValueCountFrequency (%)
9305664360 1
2.5%
7144522000 1
2.5%
6999126910 1
2.5%
6170798000 1
2.5%
4611139000 1
2.5%
2287556000 1
2.5%
2263610144 1
2.5%
2131937547 1
2.5%
1964343000 1
2.5%
1890455000 1
2.5%

부과금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct39
Distinct (%)97.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.8177724 × 1010
Minimum0
Maximum1.27 × 1011
Zeros2
Zeros (%)5.0%
Negative0
Negative (%)0.0%
Memory size492.0 B
2023-12-13T06:48:00.827665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile18970550
Q13.4658382 × 109
median1.3490646 × 1010
Q34.6027676 × 1010
95-th percentile8.9727379 × 1010
Maximum1.27 × 1011
Range1.27 × 1011
Interquartile range (IQR)4.2561837 × 1010

Descriptive statistics

Standard deviation3.2631578 × 1010
Coefficient of variation (CV)1.1580629
Kurtosis1.8172449
Mean2.8177724 × 1010
Median Absolute Deviation (MAD)1.3466251 × 1010
Skewness1.4282084
Sum1.127109 × 1012
Variance1.0648199 × 1021
MonotonicityNot monotonic
2023-12-13T06:48:00.959540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=39)
ValueCountFrequency (%)
0 2
 
5.0%
19939227000 1
 
2.5%
3429760920 1
 
2.5%
28514722410 1
 
2.5%
24295160 1
 
2.5%
58360680870 1
 
2.5%
3535573330 1
 
2.5%
119000000000 1
 
2.5%
46498533370 1
 
2.5%
6914618270 1
 
2.5%
Other values (29) 29
72.5%
ValueCountFrequency (%)
0 2
5.0%
19969000 1
2.5%
24295160 1
2.5%
24495320 1
2.5%
3160356000 1
2.5%
3203382000 1
2.5%
3257453360 1
2.5%
3311052000 1
2.5%
3429760920 1
2.5%
3477864000 1
2.5%
ValueCountFrequency (%)
127000000000 1
2.5%
119000000000 1
2.5%
88186715200 1
2.5%
68044543000 1
2.5%
67104022000 1
2.5%
58360680870 1
2.5%
58316297000 1
2.5%
54914645990 1
2.5%
48684769720 1
2.5%
46498533370 1
2.5%

비과세감면율
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct34
Distinct (%)85.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9.473
Minimum0
Maximum28.84
Zeros7
Zeros (%)17.5%
Negative0
Negative (%)0.0%
Memory size492.0 B
2023-12-13T06:48:01.085182image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11.42
median6.42
Q313.1025
95-th percentile27.922
Maximum28.84
Range28.84
Interquartile range (IQR)11.6825

Descriptive statistics

Standard deviation9.7297087
Coefficient of variation (CV)1.027099
Kurtosis-0.63178184
Mean9.473
Median Absolute Deviation (MAD)5.34
Skewness0.86900511
Sum378.92
Variance94.667232
MonotonicityNot monotonic
2023-12-13T06:48:01.213137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=34)
ValueCountFrequency (%)
0.0 7
 
17.5%
9.06 1
 
2.5%
22.3 1
 
2.5%
25.57 1
 
2.5%
12.07 1
 
2.5%
7.88 1
 
2.5%
1.43 1
 
2.5%
5.05 1
 
2.5%
27.9 1
 
2.5%
5.29 1
 
2.5%
Other values (24) 24
60.0%
ValueCountFrequency (%)
0.0 7
17.5%
0.76 1
 
2.5%
1.09 1
 
2.5%
1.39 1
 
2.5%
1.43 1
 
2.5%
1.65 1
 
2.5%
1.82 1
 
2.5%
1.91 1
 
2.5%
1.92 1
 
2.5%
3.9 1
 
2.5%
ValueCountFrequency (%)
28.84 1
2.5%
28.34 1
2.5%
27.9 1
2.5%
26.97 1
2.5%
25.64 1
2.5%
25.57 1
2.5%
23.38 1
2.5%
22.3 1
2.5%
20.03 1
2.5%
16.2 1
2.5%

Interactions

2023-12-13T06:47:58.058461image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:47:56.408525image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:47:56.865355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:47:57.324117image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:47:58.183664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:47:56.545910image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:47:56.974005image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:47:57.416723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:47:58.285358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:47:56.658919image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:47:57.064648image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:47:57.506493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:47:58.388197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:47:56.773217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:47:57.218231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:47:57.620700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T06:48:01.315653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
세목명과세년도비과세금액감면금액부과금액비과세감면율
세목명1.0000.0000.7320.7290.8850.880
과세년도0.0001.0000.0000.0000.0000.000
비과세금액0.7320.0001.0000.9940.8610.932
감면금액0.7290.0000.9941.0000.8960.940
부과금액0.8850.0000.8610.8961.0000.836
비과세감면율0.8800.0000.9320.9400.8361.000
2023-12-13T06:48:01.431946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
세목명과세년도
세목명1.0000.000
과세년도0.0001.000
2023-12-13T06:48:01.520192image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
비과세금액감면금액부과금액비과세감면율세목명과세년도
비과세금액1.0000.8310.6380.5640.5030.000
감면금액0.8311.0000.7280.5280.5030.000
부과금액0.6380.7281.0000.1080.4930.000
비과세감면율0.5640.5280.1081.0000.6590.000
세목명0.5030.5030.4930.6591.0000.000
과세년도0.0000.0000.0000.0000.0001.000

Missing values

2023-12-13T06:47:58.518185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T06:47:58.705700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명자치단체코드세목명과세년도비과세금액감면금액부과금액비과세감면율
0경기도여주시41670교육세201802000199392270000.0
1경기도여주시41670등록세20180220300000.0
2경기도여주시41670재산세20181090910600018904550004516456400028.34
3경기도여주시41670주민세2018327600002878300032033820001.92
4경기도여주시41670취득세2018550782300061707980005831629700020.03
5경기도여주시41670자동차세2018124238000560804000415650770001.65
6경기도여주시41670등록면허세2018559600020799600046597720004.58
7경기도여주시41670지역자원시설세2018178911000161885000316035600010.78
8경기도여주시41670교육세201904000209387610000.0
9경기도여주시41670등록세20190355200000.0
시도명시군구명자치단체코드세목명과세년도비과세금액감면금액부과금액비과세감면율
30경기도여주시41670등록면허세20213510712031427724069146182705.05
31경기도여주시41670지역자원시설세202119165138015992050038805180709.06
32경기도여주시41670교육세2022014000267190860000.0
33경기도여주시41670등록세2022055730001996900027.9
34경기도여주시41670재산세20221340382400022875560006710402200023.38
35경기도여주시41670주민세20226500002971700039574860000.76
36경기도여주시41670취득세2022247329000071445220001270000000007.55
37경기도여주시41670자동차세202287163000569361000358790780001.82
38경기도여주시41670등록면허세20227837800019647400070420660003.9
39경기도여주시41670지역자원시설세202220479000016779200040634770009.16