Overview

Dataset statistics

Number of variables9
Number of observations24
Missing cells1
Missing cells (%)0.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.0 KiB
Average record size in memory83.3 B

Variable types

Categorical5
Numeric4

Dataset

Description3년간(2020~2022) 지방세 과세액 중 비과세금액과 감면금액이 차지하는 비과세 감면 비율 현황을 제공합니다.
Author전라남도 나주시
URLhttps://www.data.go.kr/data/15126698/fileData.do

Alerts

시도명 has constant value ""Constant
시군구명 has constant value ""Constant
자치단체코드 has constant value ""Constant
비과세금액 is highly overall correlated with 감면금액 and 2 other fieldsHigh correlation
감면금액 is highly overall correlated with 비과세금액 and 2 other fieldsHigh correlation
부과금액 is highly overall correlated with 비과세금액 and 2 other fieldsHigh correlation
비과세감면율 is highly overall correlated with 비과세금액 and 2 other fieldsHigh correlation
세목명 is highly overall correlated with 부과금액 and 1 other fieldsHigh correlation
비과세금액 has 1 (4.2%) missing valuesMissing
감면금액 has unique valuesUnique
비과세금액 has 5 (20.8%) zerosZeros
부과금액 has 2 (8.3%) zerosZeros
비과세감면율 has 7 (29.2%) zerosZeros

Reproduction

Analysis started2024-03-14 18:25:12.251574
Analysis finished2024-03-14 18:25:16.344560
Duration4.09 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size320.0 B
전라남도
24 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row전라남도
2nd row전라남도
3rd row전라남도
4th row전라남도
5th row전라남도

Common Values

ValueCountFrequency (%)
전라남도 24
100.0%

Length

2024-03-15T03:25:16.461067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T03:25:16.617270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전라남도 24
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size320.0 B
나주시
24 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row나주시
2nd row나주시
3rd row나주시
4th row나주시
5th row나주시

Common Values

ValueCountFrequency (%)
나주시 24
100.0%

Length

2024-03-15T03:25:16.834927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T03:25:17.121814image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
나주시 24
100.0%

자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size320.0 B
46170
24 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row46170
2nd row46170
3rd row46170
4th row46170
5th row46170

Common Values

ValueCountFrequency (%)
46170 24
100.0%

Length

2024-03-15T03:25:17.396175image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T03:25:17.692052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
46170 24
100.0%

세목명
Categorical

HIGH CORRELATION 

Distinct8
Distinct (%)33.3%
Missing0
Missing (%)0.0%
Memory size320.0 B
교육세
등록세
재산세
주민세
취득세
Other values (3)

Length

Max length7
Median length3
Mean length3.875
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row교육세
2nd row등록세
3rd row재산세
4th row주민세
5th row취득세

Common Values

ValueCountFrequency (%)
교육세 3
12.5%
등록세 3
12.5%
재산세 3
12.5%
주민세 3
12.5%
취득세 3
12.5%
자동차세 3
12.5%
등록면허세 3
12.5%
지역자원시설세 3
12.5%

Length

2024-03-15T03:25:17.922607image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T03:25:18.145259image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
교육세 3
12.5%
등록세 3
12.5%
재산세 3
12.5%
주민세 3
12.5%
취득세 3
12.5%
자동차세 3
12.5%
등록면허세 3
12.5%
지역자원시설세 3
12.5%

과세년도
Categorical

Distinct3
Distinct (%)12.5%
Missing0
Missing (%)0.0%
Memory size320.0 B
2020
2021
2022

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020
2nd row2020
3rd row2020
4th row2020
5th row2020

Common Values

ValueCountFrequency (%)
2020 8
33.3%
2021 8
33.3%
2022 8
33.3%

Length

2024-03-15T03:25:18.433092image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T03:25:18.743375image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020 8
33.3%
2021 8
33.3%
2022 8
33.3%

비과세금액
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct18
Distinct (%)78.3%
Missing1
Missing (%)4.2%
Infinite0
Infinite (%)0.0%
Mean1.7881793 × 109
Minimum0
Maximum1.1492481 × 1010
Zeros5
Zeros (%)20.8%
Negative0
Negative (%)0.0%
Memory size344.0 B
2024-03-15T03:25:19.063985image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q16445500
median1.41209 × 108
Q39.72958 × 108
95-th percentile1.0671275 × 1010
Maximum1.1492481 × 1010
Range1.1492481 × 1010
Interquartile range (IQR)9.665125 × 108

Descriptive statistics

Standard deviation3.650754 × 109
Coefficient of variation (CV)2.041604
Kurtosis3.277192
Mean1.7881793 × 109
Median Absolute Deviation (MAD)1.41209 × 108
Skewness2.1559078
Sum4.1128123 × 1010
Variance1.3328005 × 1019
MonotonicityNot monotonic
2024-03-15T03:25:19.395292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
0 5
20.8%
6491000 2
 
8.3%
145308000 1
 
4.2%
364608000 1
 
4.2%
8775000 1
 
4.2%
151733000 1
 
4.2%
1581308000 1
 
4.2%
11492481000 1
 
4.2%
335885000 1
 
4.2%
13075000 1
 
4.2%
Other values (8) 8
33.3%
ValueCountFrequency (%)
0 5
20.8%
6400000 1
 
4.2%
6491000 2
 
8.3%
8775000 1
 
4.2%
13075000 1
 
4.2%
119326000 1
 
4.2%
141209000 1
 
4.2%
145308000 1
 
4.2%
151733000 1
 
4.2%
332892000 1
 
4.2%
ValueCountFrequency (%)
11492481000 1
4.2%
10744487000 1
4.2%
10012370000 1
4.2%
2897331000 1
4.2%
2767953000 1
4.2%
1581308000 1
4.2%
364608000 1
4.2%
335885000 1
4.2%
332892000 1
4.2%
151733000 1
4.2%

감면금액
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct24
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.1855766 × 109
Minimum9000
Maximum2.2193421 × 1010
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size344.0 B
2024-03-15T03:25:19.607621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum9000
5-th percentile12700
Q117873000
median1.755415 × 108
Q31.2119225 × 109
95-th percentile9.6028675 × 109
Maximum2.2193421 × 1010
Range2.2193412 × 1010
Interquartile range (IQR)1.1940495 × 109

Descriptive statistics

Standard deviation4.9615965 × 109
Coefficient of variation (CV)2.2701545
Kurtosis11.93871
Mean2.1855766 × 109
Median Absolute Deviation (MAD)1.753675 × 108
Skewness3.289877
Sum5.2453839 × 1010
Variance2.461744 × 1019
MonotonicityNot monotonic
2024-03-15T03:25:20.126408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
10000 1
 
4.2%
548443000 1
 
4.2%
136613000 1
 
4.2%
214470000 1
 
4.2%
526722000 1
 
4.2%
7263240000 1
 
4.2%
18118000 1
 
4.2%
3446457000 1
 
4.2%
320000 1
 
4.2%
28000 1
 
4.2%
Other values (14) 14
58.3%
ValueCountFrequency (%)
9000 1
4.2%
10000 1
4.2%
28000 1
4.2%
320000 1
4.2%
1629000 1
4.2%
17144000 1
4.2%
18116000 1
4.2%
18118000 1
4.2%
18473000 1
4.2%
122275000 1
4.2%
ValueCountFrequency (%)
22193421000 1
4.2%
10015743000 1
4.2%
7263240000 1
4.2%
3500264000 1
4.2%
3446457000 1
4.2%
3152639000 1
4.2%
565017000 1
4.2%
548443000 1
4.2%
526722000 1
4.2%
286237000 1
4.2%

부과금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct23
Distinct (%)95.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.7719508 × 1010
Minimum0
Maximum5.9486155 × 1010
Zeros2
Zeros (%)8.3%
Negative0
Negative (%)0.0%
Memory size344.0 B
2024-03-15T03:25:20.475278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile9446850
Q14.9282775 × 109
median1.076969 × 1010
Q32.6206828 × 1010
95-th percentile5.7157902 × 1010
Maximum5.9486155 × 1010
Range5.9486155 × 1010
Interquartile range (IQR)2.1278551 × 1010

Descriptive statistics

Standard deviation1.7870909 × 1010
Coefficient of variation (CV)1.0085443
Kurtosis0.73321775
Mean1.7719508 × 1010
Median Absolute Deviation (MAD)8.4797625 × 109
Skewness1.2391818
Sum4.2526819 × 1011
Variance3.1936937 × 1020
MonotonicityNot monotonic
2024-03-15T03:25:20.859662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=23)
ValueCountFrequency (%)
0 2
 
8.3%
14091643000 1
 
4.2%
27565251000 1
 
4.2%
5302137000 1
 
4.2%
4596184000 1
 
4.2%
23935783000 1
 
4.2%
59486155000 1
 
4.2%
7447736000 1
 
4.2%
31773897000 1
 
4.2%
15931623000 1
 
4.2%
Other values (13) 13
54.2%
ValueCountFrequency (%)
0 2
8.3%
62979000 1
4.2%
4516875000 1
4.2%
4596184000 1
4.2%
4883519000 1
4.2%
4943197000 1
4.2%
4962979000 1
4.2%
5302137000 1
4.2%
6331017000 1
4.2%
7232592000 1
4.2%
ValueCountFrequency (%)
59486155000 1
4.2%
58263680000 1
4.2%
50891829000 1
4.2%
31773897000 1
4.2%
29623689000 1
4.2%
27565251000 1
4.2%
25754021000 1
4.2%
23935783000 1
4.2%
22170773000 1
4.2%
15931623000 1
4.2%

비과세감면율
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct16
Distinct (%)66.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean12.015
Minimum0
Maximum52
Zeros7
Zeros (%)29.2%
Negative0
Negative (%)0.0%
Memory size344.0 B
2024-03-15T03:25:21.235690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median3.93
Q311.2175
95-th percentile47.017
Maximum52
Range52
Interquartile range (IQR)11.2175

Descriptive statistics

Standard deviation17.173528
Coefficient of variation (CV)1.4293407
Kurtosis0.91533626
Mean12.015
Median Absolute Deviation (MAD)3.93
Skewness1.5375573
Sum288.36
Variance294.93008
MonotonicityNot monotonic
2024-03-15T03:25:21.482867image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%)
0.0 7
29.2%
3.0 3
12.5%
9.0 1
 
4.2%
9.45 1
 
4.2%
4.86 1
 
4.2%
2.83 1
 
4.2%
14.87 1
 
4.2%
0.33 1
 
4.2%
47.02 1
 
4.2%
6.0 1
 
4.2%
Other values (6) 6
25.0%
ValueCountFrequency (%)
0.0 7
29.2%
0.33 1
 
4.2%
2.83 1
 
4.2%
3.0 3
12.5%
4.86 1
 
4.2%
6.0 1
 
4.2%
8.0 1
 
4.2%
9.0 1
 
4.2%
9.45 1
 
4.2%
10.0 1
 
4.2%
ValueCountFrequency (%)
52.0 1
4.2%
47.02 1
4.2%
47.0 1
4.2%
43.0 1
4.2%
25.0 1
4.2%
14.87 1
4.2%
10.0 1
4.2%
9.45 1
4.2%
9.0 1
4.2%
8.0 1
4.2%

Interactions

2024-03-15T03:25:14.549209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T03:25:12.559780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T03:25:13.226955image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T03:25:13.904912image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T03:25:14.717931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T03:25:12.786883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T03:25:13.377805image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T03:25:14.056612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T03:25:14.957154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T03:25:12.925389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T03:25:13.510566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T03:25:14.198413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T03:25:15.204218image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T03:25:13.081833image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T03:25:13.677802image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T03:25:14.381409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T03:25:21.632601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
세목명과세년도비과세금액감면금액부과금액비과세감면율
세목명1.0000.0000.7020.7020.9600.806
과세년도0.0001.0000.0000.0000.0000.000
비과세금액0.7020.0001.0000.9870.7200.890
감면금액0.7020.0000.9871.0000.8311.000
부과금액0.9600.0000.7200.8311.0000.830
비과세감면율0.8060.0000.8901.0000.8301.000
2024-03-15T03:25:21.827115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과세년도세목명
과세년도1.0000.000
세목명0.0001.000
2024-03-15T03:25:22.071878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
비과세금액감면금액부과금액비과세감면율세목명과세년도
비과세금액1.0000.8680.5870.9190.4710.000
감면금액0.8681.0000.6470.8280.4870.000
부과금액0.5870.6471.0000.4590.6730.000
비과세감면율0.9190.8280.4591.0000.5750.000
세목명0.4710.4870.6730.5751.0000.000
과세년도0.0000.0000.0000.0000.0001.000

Missing values

2024-03-15T03:25:15.581305image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T03:25:16.039943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명자치단체코드세목명과세년도비과세금액감면금액부과금액비과세감면율
0전라남도나주시46170교육세2020010000140916430000.0
1전라남도나주시46170등록세202001629000629790003.0
2전라남도나주시46170재산세20201001237000035002640002575402100052.0
3전라남도나주시46170주민세202064000001714400063310170000.0
4전라남도나주시46170취득세20202897331000100157430005089182900025.0
5전라남도나주시46170자동차세2020141209000565017000221707730003.0
6전라남도나주시46170등록면허세202011932600028623700049629790008.0
7전라남도나주시46170지역자원시설세2020332892000122275000451687500010.0
8전라남도나주시46170교육세202109000155006330000.0
9전라남도나주시46170등록세202101847300000.0
시도명시군구명자치단체코드세목명과세년도비과세금액감면금액부과금액비과세감면율
14전라남도나주시46170등록면허세20211307500028338500048835190006.0
15전라남도나주시46170지역자원시설세202133588500012506600049431970009.0
16전라남도나주시46170교육세2022028000159316230000.0
17전라남도나주시46170등록세2022<NA>32000000.0
18전라남도나주시46170재산세20221149248100034464570003177389700047.02
19전라남도나주시46170주민세202264910001811800074477360000.33
20전라남도나주시46170취득세2022158130800072632400005948615500014.87
21전라남도나주시46170자동차세2022151733000526722000239357830002.83
22전라남도나주시46170등록면허세2022877500021447000045961840004.86
23전라남도나주시46170지역자원시설세202236460800013661300053021370009.45