Overview

Dataset statistics

Number of variables9
Number of observations24
Missing cells3
Missing cells (%)1.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.0 KiB
Average record size in memory83.5 B

Variable types

Categorical5
Numeric4

Dataset

Description지방세 비과/감면율 현황
Author강원도 고성군
URLhttps://www.data.go.kr/data/15079524/fileData.do

Alerts

시도명 has constant value ""Constant
시군구명 has constant value ""Constant
자치단체코드 has constant value ""Constant
비과세금액 is highly overall correlated with 감면금액 and 1 other fieldsHigh correlation
감면금액 is highly overall correlated with 비과세금액 and 2 other fieldsHigh correlation
부과금액 is highly overall correlated with 감면금액 and 1 other fieldsHigh correlation
비과세감면율 is highly overall correlated with 비과세금액 and 1 other fieldsHigh correlation
세목명 is highly overall correlated with 부과금액High correlation
비과세금액 has 3 (12.5%) missing valuesMissing
감면금액 has unique valuesUnique
비과세금액 has 3 (12.5%) zerosZeros
부과금액 has 3 (12.5%) zerosZeros
비과세감면율 has 3 (12.5%) zerosZeros

Reproduction

Analysis started2023-12-12 18:55:34.766380
Analysis finished2023-12-12 18:55:38.328006
Duration3.56 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size324.0 B
강원도
24 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row강원도
2nd row강원도
3rd row강원도
4th row강원도
5th row강원도

Common Values

ValueCountFrequency (%)
강원도 24
100.0%

Length

2023-12-13T03:55:38.459104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:55:38.633410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
강원도 24
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size324.0 B
고성군
24 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row고성군
2nd row고성군
3rd row고성군
4th row고성군
5th row고성군

Common Values

ValueCountFrequency (%)
고성군 24
100.0%

Length

2023-12-13T03:55:38.814136image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:55:38.996603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
고성군 24
100.0%

자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size324.0 B
42820
24 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row42820
2nd row42820
3rd row42820
4th row42820
5th row42820

Common Values

ValueCountFrequency (%)
42820 24
100.0%

Length

2023-12-13T03:55:39.174001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:55:39.336903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
42820 24
100.0%

세목명
Categorical

HIGH CORRELATION 

Distinct8
Distinct (%)33.3%
Missing0
Missing (%)0.0%
Memory size324.0 B
교육세
등록세
재산세
주민세
취득세
Other values (3)

Length

Max length7
Median length3
Mean length3.875
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row교육세
2nd row등록세
3rd row재산세
4th row주민세
5th row취득세

Common Values

ValueCountFrequency (%)
교육세 3
12.5%
등록세 3
12.5%
재산세 3
12.5%
주민세 3
12.5%
취득세 3
12.5%
자동차세 3
12.5%
등록면허세 3
12.5%
지역자원시설세 3
12.5%

Length

2023-12-13T03:55:39.547491image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:55:39.783489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
교육세 3
12.5%
등록세 3
12.5%
재산세 3
12.5%
주민세 3
12.5%
취득세 3
12.5%
자동차세 3
12.5%
등록면허세 3
12.5%
지역자원시설세 3
12.5%

과세년도
Categorical

Distinct3
Distinct (%)12.5%
Missing0
Missing (%)0.0%
Memory size324.0 B
2017
2018
2019

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2017
2nd row2017
3rd row2017
4th row2017
5th row2017

Common Values

ValueCountFrequency (%)
2017 8
33.3%
2018 8
33.3%
2019 8
33.3%

Length

2023-12-13T03:55:40.012221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:55:40.202263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2017 8
33.3%
2018 8
33.3%
2019 8
33.3%

비과세금액
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct19
Distinct (%)90.5%
Missing3
Missing (%)12.5%
Infinite0
Infinite (%)0.0%
Mean6.3420329 × 108
Minimum0
Maximum4.423154 × 109
Zeros3
Zeros (%)12.5%
Negative0
Negative (%)0.0%
Memory size348.0 B
2023-12-13T03:55:40.376931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q16955000
median42104000
Q33.01148 × 108
95-th percentile3.660657 × 109
Maximum4.423154 × 109
Range4.423154 × 109
Interquartile range (IQR)2.94193 × 108

Descriptive statistics

Standard deviation1.3686732 × 109
Coefficient of variation (CV)2.1580985
Kurtosis3.4662015
Mean6.3420329 × 108
Median Absolute Deviation (MAD)42104000
Skewness2.2231943
Sum1.3318269 × 1010
Variance1.8732663 × 1018
MonotonicityNot monotonic
2023-12-13T03:55:40.588708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=19)
ValueCountFrequency (%)
0 3
 
12.5%
19270000 1
 
4.2%
354985000 1
 
4.2%
149153000 1
 
4.2%
5218000 1
 
4.2%
42308000 1
 
4.2%
301148000 1
 
4.2%
12590000 1
 
4.2%
4423154000 1
 
4.2%
137110000 1
 
4.2%
Other values (9) 9
37.5%
(Missing) 3
 
12.5%
ValueCountFrequency (%)
0 3
12.5%
5218000 1
 
4.2%
5513000 1
 
4.2%
6955000 1
 
4.2%
9670000 1
 
4.2%
12590000 1
 
4.2%
19270000 1
 
4.2%
41807000 1
 
4.2%
42104000 1
 
4.2%
42308000 1
 
4.2%
ValueCountFrequency (%)
4423154000 1
4.2%
3660657000 1
4.2%
3530958000 1
4.2%
445890000 1
4.2%
354985000 1
4.2%
301148000 1
4.2%
149153000 1
4.2%
137110000 1
4.2%
129779000 1
4.2%
42308000 1
4.2%

감면금액
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct24
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.3828054 × 108
Minimum50000
Maximum1.692725 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size348.0 B
2023-12-13T03:55:40.819847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum50000
5-th percentile102550
Q1842750
median53726500
Q31.895585 × 108
95-th percentile8.843102 × 108
Maximum1.692725 × 109
Range1.692675 × 109
Interquartile range (IQR)1.8871575 × 108

Descriptive statistics

Standard deviation4.1702187 × 108
Coefficient of variation (CV)1.7501298
Kurtosis5.7504914
Mean2.3828054 × 108
Median Absolute Deviation (MAD)53487500
Skewness2.3474841
Sum5.718733 × 109
Variance1.7390724 × 1017
MonotonicityNot monotonic
2023-12-13T03:55:41.081817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
293000 1
 
4.2%
118203000 1
 
4.2%
59077000 1
 
4.2%
49765000 1
 
4.2%
115476000 1
 
4.2%
887069000 1
 
4.2%
13579000 1
 
4.2%
758455000 1
 
4.2%
185000 1
 
4.2%
893000 1
 
4.2%
Other values (14) 14
58.3%
ValueCountFrequency (%)
50000 1
4.2%
88000 1
4.2%
185000 1
4.2%
293000 1
4.2%
562000 1
4.2%
692000 1
4.2%
893000 1
4.2%
5250000 1
4.2%
13579000 1
4.2%
41558000 1
4.2%
ValueCountFrequency (%)
1692725000 1
4.2%
887069000 1
4.2%
868677000 1
4.2%
758455000 1
4.2%
409424000 1
4.2%
388022000 1
4.2%
123404000 1
4.2%
118203000 1
4.2%
115476000 1
4.2%
77833000 1
4.2%

부과금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct22
Distinct (%)91.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.3073389 × 109
Minimum0
Maximum1.4253332 × 1010
Zeros3
Zeros (%)12.5%
Negative0
Negative (%)0.0%
Memory size348.0 B
2023-12-13T03:55:41.327391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q16.1987125 × 108
median2.3906425 × 109
Q33.908326 × 109
95-th percentile1.2723009 × 1010
Maximum1.4253332 × 1010
Range1.4253332 × 1010
Interquartile range (IQR)3.2884548 × 109

Descriptive statistics

Standard deviation3.9937718 × 109
Coefficient of variation (CV)1.2075484
Kurtosis2.6413192
Mean3.3073389 × 109
Median Absolute Deviation (MAD)1.674433 × 109
Skewness1.7910962
Sum7.9376134 × 1010
Variance1.5950213 × 1019
MonotonicityNot monotonic
2023-12-13T03:55:41.548268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=22)
ValueCountFrequency (%)
0 3
 
12.5%
3876301000 1
 
4.2%
3627822000 1
 
4.2%
643949000 1
 
4.2%
1177135000 1
 
4.2%
3617393000 1
 
4.2%
10849793000 1
 
4.2%
547638000 1
 
4.2%
4125750000 1
 
4.2%
3701303000 1
 
4.2%
Other values (12) 12
50.0%
ValueCountFrequency (%)
0 3
12.5%
505024000 1
 
4.2%
507178000 1
 
4.2%
547638000 1
 
4.2%
643949000 1
 
4.2%
787534000 1
 
4.2%
820278000 1
 
4.2%
839263000 1
 
4.2%
879535000 1
 
4.2%
1177135000 1
 
4.2%
ValueCountFrequency (%)
14253332000 1
4.2%
13053576000 1
4.2%
10849793000 1
4.2%
4158597000 1
4.2%
4125750000 1
4.2%
4004401000 1
4.2%
3876301000 1
4.2%
3796182000 1
4.2%
3701303000 1
4.2%
3627822000 1
4.2%

비과세감면율
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct21
Distinct (%)87.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20.26125
Minimum0
Maximum125.59
Zeros3
Zeros (%)12.5%
Negative0
Negative (%)0.0%
Memory size348.0 B
2023-12-13T03:55:41.783668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11.445
median4.805
Q317.0325
95-th percentile104.865
Maximum125.59
Range125.59
Interquartile range (IQR)15.5875

Descriptive statistics

Standard deviation35.567794
Coefficient of variation (CV)1.755459
Kurtosis4.0147567
Mean20.26125
Median Absolute Deviation (MAD)4.8
Skewness2.2586901
Sum486.27
Variance1265.068
MonotonicityNot monotonic
2023-12-13T03:55:41.998813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=21)
ValueCountFrequency (%)
0.0 3
 
12.5%
0.02 2
 
8.3%
0.01 1
 
4.2%
4.41 1
 
4.2%
32.34 1
 
4.2%
4.67 1
 
4.2%
4.36 1
 
4.2%
10.95 1
 
4.2%
4.78 1
 
4.2%
125.59 1
 
4.2%
Other values (11) 11
45.8%
ValueCountFrequency (%)
0.0 3
12.5%
0.01 1
 
4.2%
0.02 2
8.3%
1.92 1
 
4.2%
4.36 1
 
4.2%
4.41 1
 
4.2%
4.59 1
 
4.2%
4.67 1
 
4.2%
4.78 1
 
4.2%
4.83 1
 
4.2%
ValueCountFrequency (%)
125.59 1
4.2%
106.65 1
4.2%
94.75 1
4.2%
32.34 1
4.2%
23.43 1
4.2%
23.13 1
4.2%
15.0 1
4.2%
10.95 1
4.2%
10.1 1
4.2%
9.37 1
4.2%

Interactions

2023-12-13T03:55:37.202731image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:55:35.161882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:55:35.924924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:55:36.606721image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:55:37.357846image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:55:35.292445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:55:36.082623image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:55:36.743766image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:55:37.516917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:55:35.547997image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:55:36.230550image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:55:36.883598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:55:37.689412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:55:35.764066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:55:36.427222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:55:37.025892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T03:55:42.148586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
세목명과세년도비과세금액감면금액부과금액비과세감면율
세목명1.0000.0000.3460.7020.9570.534
과세년도0.0001.0000.0790.0000.0000.134
비과세금액0.3460.0791.0000.9860.2630.919
감면금액0.7020.0000.9861.0000.6830.794
부과금액0.9570.0000.2630.6831.0000.000
비과세감면율0.5340.1340.9190.7940.0001.000
2023-12-13T03:55:42.322941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과세년도세목명
과세년도1.0000.000
세목명0.0001.000
2023-12-13T03:55:42.488060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
비과세금액감면금액부과금액비과세감면율세목명과세년도
비과세금액1.0000.8340.3720.8310.1710.000
감면금액0.8341.0000.7250.7440.4870.000
부과금액0.3720.7251.0000.4160.6450.000
비과세감면율0.8310.7440.4161.0000.2890.000
세목명0.1710.4870.6450.2891.0000.000
과세년도0.0000.0000.0000.0000.0001.000

Missing values

2023-12-13T03:55:37.926081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T03:55:38.221287image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명자치단체코드세목명과세년도비과세금액감면금액부과금액비과세감면율
0강원도고성군42820교육세2017029300038763010000.01
1강원도고성군42820등록세2017<NA>56200000.0
2강원도고성군42820재산세201736606570003880220003796182000106.65
3강원도고성군42820주민세20171927000052500005071780004.83
4강원도고성군42820취득세2017354985000868677000130535760009.37
5강원도고성군42820자동차세20174210400012340400036041500004.59
6강원도고성군42820등록면허세201769550007783300083926300010.1
7강원도고성군42820지역자원시설세20171297790005241200078753400023.13
8강원도고성군42820교육세2018069200040044010000.02
9강원도고성군42820등록세2018<NA>8800000.0
시도명시군구명자치단체코드세목명과세년도비과세금액감면금액부과금액비과세감면율
14강원도고성군42820등록면허세20185513000415580008795350005.35
15강원도고성군42820지역자원시설세20181371100005504100082027800023.43
16강원도고성군42820교육세2019089300037013030000.02
17강원도고성군42820등록세2019<NA>18500000.0
18강원도고성군42820재산세201944231540007584550004125750000125.59
19강원도고성군42820주민세201912590000135790005476380004.78
20강원도고성군42820취득세20193011480008870690001084979300010.95
21강원도고성군42820자동차세20194230800011547600036173930004.36
22강원도고성군42820등록면허세201952180004976500011771350004.67
23강원도고성군42820지역자원시설세20191491530005907700064394900032.34