Overview

Dataset statistics

Number of variables9
Number of observations24
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.0 KiB
Average record size in memory83.5 B

Variable types

Categorical5
Numeric4

Dataset

Description3년간(2019~2021) 지방세 과세액 중 비과세금액과 감면금액이 차지하는 비과세 감면 비율 현황을 제공합니다
Author전라남도 나주시
URLhttps://www.data.go.kr/data/15079612/fileData.do

Alerts

시도명 has constant value ""Constant
시군구명 has constant value ""Constant
자치단체코드 has constant value ""Constant
비과세금액 is highly overall correlated with 감면금액 and 2 other fieldsHigh correlation
감면금액 is highly overall correlated with 비과세금액 and 3 other fieldsHigh correlation
부과금액 is highly overall correlated with 비과세금액 and 2 other fieldsHigh correlation
비과세감면율 is highly overall correlated with 비과세금액 and 2 other fieldsHigh correlation
세목명 is highly overall correlated with 감면금액 and 2 other fieldsHigh correlation
감면금액 has unique valuesUnique
비과세금액 has 6 (25.0%) zerosZeros
부과금액 has 2 (8.3%) zerosZeros
비과세감면율 has 7 (29.2%) zerosZeros

Reproduction

Analysis started2023-12-12 08:04:58.546863
Analysis finished2023-12-12 08:05:00.674962
Duration2.13 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size324.0 B
전라남도
24 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row전라남도
2nd row전라남도
3rd row전라남도
4th row전라남도
5th row전라남도

Common Values

ValueCountFrequency (%)
전라남도 24
100.0%

Length

2023-12-12T17:05:00.739326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:05:00.844942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전라남도 24
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size324.0 B
나주시
24 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row나주시
2nd row나주시
3rd row나주시
4th row나주시
5th row나주시

Common Values

ValueCountFrequency (%)
나주시 24
100.0%

Length

2023-12-12T17:05:00.951315image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:05:01.056382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
나주시 24
100.0%

자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size324.0 B
46170
24 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row46170
2nd row46170
3rd row46170
4th row46170
5th row46170

Common Values

ValueCountFrequency (%)
46170 24
100.0%

Length

2023-12-12T17:05:01.165826image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:05:01.258542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
46170 24
100.0%

세목명
Categorical

HIGH CORRELATION 

Distinct8
Distinct (%)33.3%
Missing0
Missing (%)0.0%
Memory size324.0 B
교육세
등록세
재산세
주민세
취득세
Other values (3)

Length

Max length7
Median length3
Mean length3.875
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row교육세
2nd row등록세
3rd row재산세
4th row주민세
5th row취득세

Common Values

ValueCountFrequency (%)
교육세 3
12.5%
등록세 3
12.5%
재산세 3
12.5%
주민세 3
12.5%
취득세 3
12.5%
자동차세 3
12.5%
등록면허세 3
12.5%
지역자원시설세 3
12.5%

Length

2023-12-12T17:05:01.397492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:05:01.586783image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
교육세 3
12.5%
등록세 3
12.5%
재산세 3
12.5%
주민세 3
12.5%
취득세 3
12.5%
자동차세 3
12.5%
등록면허세 3
12.5%
지역자원시설세 3
12.5%

과세년도
Categorical

Distinct3
Distinct (%)12.5%
Missing0
Missing (%)0.0%
Memory size324.0 B
2019
2020
2021

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2019
2nd row2019
3rd row2019
4th row2019
5th row2019

Common Values

ValueCountFrequency (%)
2019 8
33.3%
2020 8
33.3%
2021 8
33.3%

Length

2023-12-12T17:05:01.777570image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:05:01.913861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2019 8
33.3%
2020 8
33.3%
2021 8
33.3%

비과세금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct19
Distinct (%)79.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.6478362 × 109
Minimum0
Maximum1.0744487 × 1010
Zeros6
Zeros (%)25.0%
Negative0
Negative (%)0.0%
Memory size348.0 B
2023-12-12T17:05:02.023276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q14800000
median1.278105 × 108
Q37.5766625 × 108
95-th percentile9.9355539 × 109
Maximum1.0744487 × 1010
Range1.0744487 × 1010
Interquartile range (IQR)7.5286625 × 108

Descriptive statistics

Standard deviation3.3707036 × 109
Coefficient of variation (CV)2.0455331
Kurtosis3.4142105
Mean1.6478362 × 109
Median Absolute Deviation (MAD)1.278105 × 108
Skewness2.1751365
Sum3.954807 × 1010
Variance1.1361643 × 1019
MonotonicityNot monotonic
2023-12-12T17:05:02.167027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=19)
ValueCountFrequency (%)
0 6
25.0%
9500263000 1
 
4.2%
335885000 1
 
4.2%
13075000 1
 
4.2%
145308000 1
 
4.2%
2767953000 1
 
4.2%
6491000 1
 
4.2%
10744487000 1
 
4.2%
332892000 1
 
4.2%
119326000 1
 
4.2%
Other values (9) 9
37.5%
ValueCountFrequency (%)
0 6
25.0%
6400000 1
 
4.2%
6491000 1
 
4.2%
13075000 1
 
4.2%
19321000 1
 
4.2%
44450000 1
 
4.2%
119326000 1
 
4.2%
136295000 1
 
4.2%
141209000 1
 
4.2%
145308000 1
 
4.2%
ValueCountFrequency (%)
10744487000 1
4.2%
10012370000 1
4.2%
9500263000 1
4.2%
2897331000 1
4.2%
2767953000 1
4.2%
2023010000 1
4.2%
335885000 1
4.2%
332892000 1
4.2%
302004000 1
4.2%
145308000 1
4.2%

감면금액
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct24
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.2773377 × 109
Minimum7000
Maximum2.2193421 × 1010
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size348.0 B
2023-12-12T17:05:02.297173image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum7000
5-th percentile9150
Q118383750
median2.042255 × 108
Q31.2305585 × 109
95-th percentile9.8684512 × 109
Maximum2.2193421 × 1010
Range2.2193414 × 1010
Interquartile range (IQR)1.2121748 × 109

Descriptive statistics

Standard deviation5.0538783 × 109
Coefficient of variation (CV)2.2192046
Kurtosis10.706333
Mean2.2773377 × 109
Median Absolute Deviation (MAD)2.03406 × 108
Skewness3.1245561
Sum5.4656104 × 1010
Variance2.5541686 × 1019
MonotonicityNot monotonic
2023-12-12T17:05:02.443484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
7000 1
 
4.2%
565017000 1
 
4.2%
125066000 1
 
4.2%
283385000 1
 
4.2%
548443000 1
 
4.2%
22193421000 1
 
4.2%
18116000 1
 
4.2%
3152639000 1
 
4.2%
18473000 1
 
4.2%
9000 1
 
4.2%
Other values (14) 14
58.3%
ValueCountFrequency (%)
7000 1
4.2%
9000 1
4.2%
10000 1
4.2%
1629000 1
4.2%
17144000 1
4.2%
18116000 1
4.2%
18473000 1
4.2%
19022000 1
4.2%
21127000 1
4.2%
115664000 1
4.2%
ValueCountFrequency (%)
22193421000 1
4.2%
10015743000 1
4.2%
9033798000 1
4.2%
3742710000 1
4.2%
3500264000 1
4.2%
3152639000 1
4.2%
589865000 1
4.2%
565017000 1
4.2%
548443000 1
4.2%
286237000 1
4.2%

부과금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct23
Distinct (%)95.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.6880761 × 1010
Minimum0
Maximum5.826368 × 1010
Zeros2
Zeros (%)8.3%
Negative0
Negative (%)0.0%
Memory size348.0 B
2023-12-12T17:05:02.572940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile9446850
Q14.791858 × 109
median1.013788 × 1010
Q32.6206828 × 1010
95-th percentile4.9732832 × 1010
Maximum5.826368 × 1010
Range5.826368 × 1010
Interquartile range (IQR)2.141497 × 1010

Descriptive statistics

Standard deviation1.6603245 × 1010
Coefficient of variation (CV)0.98356022
Kurtosis0.47934054
Mean1.6880761 × 1010
Median Absolute Deviation (MAD)7.9721575 × 109
Skewness1.1201272
Sum4.0513826 × 1011
Variance2.7566774 × 1020
MonotonicityNot monotonic
2023-12-12T17:05:02.700191image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=23)
ValueCountFrequency (%)
0 2
 
8.3%
13043167000 1
 
4.2%
4943197000 1
 
4.2%
4883519000 1
 
4.2%
27565251000 1
 
4.2%
58263680000 1
 
4.2%
7232592000 1
 
4.2%
29623689000 1
 
4.2%
15500633000 1
 
4.2%
4516875000 1
 
4.2%
Other values (13) 13
54.2%
ValueCountFrequency (%)
0 2
8.3%
62979000 1
4.2%
4268465000 1
4.2%
4478965000 1
4.2%
4516875000 1
4.2%
4883519000 1
4.2%
4943197000 1
4.2%
4962979000 1
4.2%
6127221000 1
4.2%
6331017000 1
4.2%
ValueCountFrequency (%)
58263680000 1
4.2%
50891829000 1
4.2%
43165181000 1
4.2%
32465726000 1
4.2%
29623689000 1
4.2%
27565251000 1
4.2%
25754021000 1
4.2%
24794854000 1
4.2%
22170773000 1
4.2%
15500633000 1
4.2%

비과세감면율
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct15
Distinct (%)62.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean12.833333
Minimum0
Maximum53
Zeros7
Zeros (%)29.2%
Negative0
Negative (%)0.0%
Memory size348.0 B
2023-12-12T17:05:02.860738image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median4.5
Q313.75
95-th percentile51.25
Maximum53
Range53
Interquartile range (IQR)13.75

Descriptive statistics

Standard deviation17.91445
Coefficient of variation (CV)1.3959312
Kurtosis0.65178901
Mean12.833333
Median Absolute Deviation (MAD)4.5
Skewness1.4437276
Sum308
Variance320.92754
MonotonicityNot monotonic
2023-12-12T17:05:02.975361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
0 7
29.2%
3 3
12.5%
10 2
 
8.3%
53 1
 
4.2%
1 1
 
4.2%
26 1
 
4.2%
2 1
 
4.2%
7 1
 
4.2%
52 1
 
4.2%
25 1
 
4.2%
Other values (5) 5
20.8%
ValueCountFrequency (%)
0 7
29.2%
1 1
 
4.2%
2 1
 
4.2%
3 3
12.5%
6 1
 
4.2%
7 1
 
4.2%
8 1
 
4.2%
9 1
 
4.2%
10 2
 
8.3%
25 1
 
4.2%
ValueCountFrequency (%)
53 1
4.2%
52 1
4.2%
47 1
4.2%
43 1
4.2%
26 1
4.2%
25 1
4.2%
10 2
8.3%
9 1
4.2%
8 1
4.2%
7 1
4.2%

Interactions

2023-12-12T17:04:59.952784image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:04:58.796324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:04:59.189032image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:04:59.595647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:05:00.072723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:04:58.902348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:04:59.283864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:04:59.685018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:05:00.201423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:04:58.996459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:04:59.388372image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:04:59.762380image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:05:00.327748image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:04:59.096752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:04:59.511017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:04:59.855497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T17:05:03.050380image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
세목명과세년도비과세금액감면금액부과금액비과세감면율
세목명1.0000.0000.7120.9550.8510.844
과세년도0.0001.0000.0000.0000.0000.000
비과세금액0.7120.0001.0000.8550.8550.930
감면금액0.9550.0000.8551.0000.9270.868
부과금액0.8510.0000.8550.9271.0000.875
비과세감면율0.8440.0000.9300.8680.8751.000
2023-12-12T17:05:03.172555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
세목명과세년도
세목명1.0000.000
과세년도0.0001.000
2023-12-12T17:05:03.261484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
비과세금액감면금액부과금액비과세감면율세목명과세년도
비과세금액1.0000.8610.5810.9140.4870.000
감면금액0.8611.0000.6230.8170.6450.000
부과금액0.5810.6231.0000.3980.6060.000
비과세감면율0.9140.8170.3981.0000.6620.000
세목명0.4870.6450.6060.6621.0000.000
과세년도0.0000.0000.0000.0000.0001.000

Missing values

2023-12-12T17:05:00.469179image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:05:00.627517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명자치단체코드세목명과세년도비과세금액감면금액부과금액비과세감면율
0전라남도나주시46170교육세201907000130431670000
1전라남도나주시46170등록세201902112700000
2전라남도나주시46170재산세2019950026300037427100002479485400053
3전라남도나주시46170주민세2019444500001902200061272210001
4전라남도나주시46170취득세2019202301000090337980004316518100026
5전라남도나주시46170자동차세2019136295000589865000324657260002
6전라남도나주시46170등록면허세20191932100028604000044789650007
7전라남도나주시46170지역자원시설세2019302004000115664000426846500010
8전라남도나주시46170교육세2020010000140916430000
9전라남도나주시46170등록세202001629000629790003
시도명시군구명자치단체코드세목명과세년도비과세금액감면금액부과금액비과세감면율
14전라남도나주시46170등록면허세202011932600028623700049629790008
15전라남도나주시46170지역자원시설세2020332892000122275000451687500010
16전라남도나주시46170교육세202109000155006330000
17전라남도나주시46170등록세202101847300000
18전라남도나주시46170재산세20211074448700031526390002962368900047
19전라남도나주시46170주민세202164910001811600072325920000
20전라남도나주시46170취득세20212767953000221934210005826368000043
21전라남도나주시46170자동차세2021145308000548443000275652510003
22전라남도나주시46170등록면허세20211307500028338500048835190006
23전라남도나주시46170지역자원시설세202133588500012506600049431970009