Overview

Dataset statistics

Number of variables9
Number of observations48
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.8 KiB
Average record size in memory80.8 B

Variable types

Categorical4
Numeric5

Dataset

Description2017년부터 2022년까지 지방세 비과, 감면율 현황에 대한 정보(세목별, 과세년도, 비과세 금액, 감면금액, 부과금액, 비과세 감면율)
Author경상남도 통영시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15078255

Alerts

시도명 has constant value ""Constant
시군구명 has constant value ""Constant
자치단체코드 has constant value ""Constant
비과세금액(원) is highly overall correlated with 감면금액(원) and 1 other fieldsHigh correlation
감면금액(원) is highly overall correlated with 비과세금액(원) and 2 other fieldsHigh correlation
부과금액(원) is highly overall correlated with 세목명High correlation
비과세감면율 is highly overall correlated with 비과세금액(원) and 2 other fieldsHigh correlation
세목명 is highly overall correlated with 감면금액(원) and 2 other fieldsHigh correlation
비과세금액(원) has 12 (25.0%) zerosZeros
부과금액(원) has 9 (18.8%) zerosZeros
비과세감면율 has 11 (22.9%) zerosZeros

Reproduction

Analysis started2024-04-20 18:37:52.094958
Analysis finished2024-04-20 18:37:55.405373
Duration3.31 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size516.0 B
경상남도
48 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경상남도
2nd row경상남도
3rd row경상남도
4th row경상남도
5th row경상남도

Common Values

ValueCountFrequency (%)
경상남도 48
100.0%

Length

2024-04-21T03:37:55.456952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T03:37:55.527769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경상남도 48
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size516.0 B
통영시
48 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row통영시
2nd row통영시
3rd row통영시
4th row통영시
5th row통영시

Common Values

ValueCountFrequency (%)
통영시 48
100.0%

Length

2024-04-21T03:37:55.602839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T03:37:55.674073image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
통영시 48
100.0%

자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size516.0 B
48220
48 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row48220
2nd row48220
3rd row48220
4th row48220
5th row48220

Common Values

ValueCountFrequency (%)
48220 48
100.0%

Length

2024-04-21T03:37:55.744511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T03:37:55.812779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
48220 48
100.0%

세목명
Categorical

HIGH CORRELATION 

Distinct8
Distinct (%)16.7%
Missing0
Missing (%)0.0%
Memory size516.0 B
교육세
등록세
재산세
주민세
취득세
Other values (3)
18 

Length

Max length7
Median length3
Mean length3.875
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row교육세
2nd row등록세
3rd row재산세
4th row주민세
5th row취득세

Common Values

ValueCountFrequency (%)
교육세 6
12.5%
등록세 6
12.5%
재산세 6
12.5%
주민세 6
12.5%
취득세 6
12.5%
자동차세 6
12.5%
등록면허세 6
12.5%
지역자원시설세 6
12.5%

Length

2024-04-21T03:37:55.895417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T03:37:56.016025image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
교육세 6
12.5%
등록세 6
12.5%
재산세 6
12.5%
주민세 6
12.5%
취득세 6
12.5%
자동차세 6
12.5%
등록면허세 6
12.5%
지역자원시설세 6
12.5%

과세년도
Real number (ℝ)

Distinct6
Distinct (%)12.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2019.5
Minimum2017
Maximum2022
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size564.0 B
2024-04-21T03:37:56.138008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2017
5-th percentile2017
Q12018
median2019.5
Q32021
95-th percentile2022
Maximum2022
Range5
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.7258979
Coefficient of variation (CV)0.00085461642
Kurtosis-1.2751304
Mean2019.5
Median Absolute Deviation (MAD)1.5
Skewness0
Sum96936
Variance2.9787234
MonotonicityIncreasing
2024-04-21T03:37:56.223496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
2017 8
16.7%
2018 8
16.7%
2019 8
16.7%
2020 8
16.7%
2021 8
16.7%
2022 8
16.7%
ValueCountFrequency (%)
2017 8
16.7%
2018 8
16.7%
2019 8
16.7%
2020 8
16.7%
2021 8
16.7%
2022 8
16.7%
ValueCountFrequency (%)
2022 8
16.7%
2021 8
16.7%
2020 8
16.7%
2019 8
16.7%
2018 8
16.7%
2017 8
16.7%

비과세금액(원)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct37
Distinct (%)77.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.0825662 × 109
Minimum0
Maximum7.914607 × 109
Zeros12
Zeros (%)25.0%
Negative0
Negative (%)0.0%
Memory size564.0 B
2024-04-21T03:37:56.319741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q17215000
median62737000
Q32.7945025 × 108
95-th percentile6.8910924 × 109
Maximum7.914607 × 109
Range7.914607 × 109
Interquartile range (IQR)2.7223525 × 108

Descriptive statistics

Standard deviation2.3027265 × 109
Coefficient of variation (CV)2.1271
Kurtosis3.2487145
Mean1.0825662 × 109
Median Absolute Deviation (MAD)62737000
Skewness2.1892228
Sum5.1963177 × 1010
Variance5.3025493 × 1018
MonotonicityNot monotonic
2024-04-21T03:37:56.584682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=37)
ValueCountFrequency (%)
0 12
25.0%
55846000 1
 
2.1%
1866461000 1
 
2.1%
55671000 1
 
2.1%
13764000 1
 
2.1%
199935000 1
 
2.1%
7257622000 1
 
2.1%
68361000 1
 
2.1%
1495508000 1
 
2.1%
9620000 1
 
2.1%
Other values (27) 27
56.2%
ValueCountFrequency (%)
0 12
25.0%
9620000 1
 
2.1%
11223000 1
 
2.1%
12423000 1
 
2.1%
13304000 1
 
2.1%
13764000 1
 
2.1%
14434000 1
 
2.1%
52721000 1
 
2.1%
53749000 1
 
2.1%
55671000 1
 
2.1%
ValueCountFrequency (%)
7914607000 1
2.1%
7257622000 1
2.1%
6931657000 1
2.1%
6815758000 1
2.1%
6504679000 1
2.1%
6241341000 1
2.1%
2555433000 1
2.1%
1866461000 1
2.1%
1495508000 1
2.1%
1059273000 1
2.1%

감면금액(원)
Real number (ℝ)

HIGH CORRELATION 

Distinct45
Distinct (%)93.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.3879269 × 108
Minimum2000
Maximum8.208883 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size564.0 B
2024-04-21T03:37:56.721805image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2000
5-th percentile2350
Q133581500
median94116500
Q35.709065 × 108
95-th percentile2.896279 × 109
Maximum8.208883 × 109
Range8.208881 × 109
Interquartile range (IQR)5.37325 × 108

Descriptive statistics

Standard deviation1.5152464 × 109
Coefficient of variation (CV)2.0509765
Kurtosis13.420118
Mean7.3879269 × 108
Median Absolute Deviation (MAD)94114000
Skewness3.411438
Sum3.5462049 × 1010
Variance2.2959717 × 1018
MonotonicityNot monotonic
2024-04-21T03:37:56.861057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=45)
ValueCountFrequency (%)
2000 3
 
6.2%
3000 2
 
4.2%
370777000 1
 
2.1%
2906535000 1
 
2.1%
384004000 1
 
2.1%
94839000 1
 
2.1%
83573000 1
 
2.1%
640000 1
 
2.1%
1220933000 1
 
2.1%
217549000 1
 
2.1%
Other values (35) 35
72.9%
ValueCountFrequency (%)
2000 3
6.2%
3000 2
4.2%
23000 1
 
2.1%
74000 1
 
2.1%
125000 1
 
2.1%
610000 1
 
2.1%
640000 1
 
2.1%
3445000 1
 
2.1%
23260000 1
 
2.1%
37022000 1
 
2.1%
ValueCountFrequency (%)
8208883000 1
2.1%
5323341000 1
2.1%
2906535000 1
2.1%
2877232000 1
2.1%
2607495000 1
2.1%
2340758000 1
2.1%
1288775000 1
2.1%
1271737000 1
2.1%
1220933000 1
2.1%
1162461000 1
2.1%

부과금액(원)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct40
Distinct (%)83.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9.8426984 × 109
Minimum0
Maximum4.2969276 × 1010
Zeros9
Zeros (%)18.8%
Negative0
Negative (%)0.0%
Memory size564.0 B
2024-04-21T03:37:56.991566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q12.2170275 × 109
median4.8680175 × 109
Q31.5480578 × 1010
95-th percentile2.7654474 × 1010
Maximum4.2969276 × 1010
Range4.2969276 × 1010
Interquartile range (IQR)1.326355 × 1010

Descriptive statistics

Standard deviation1.0239568 × 1010
Coefficient of variation (CV)1.0403212
Kurtosis0.97075873
Mean9.8426984 × 109
Median Absolute Deviation (MAD)4.8680175 × 109
Skewness1.144107
Sum4.7244952 × 1011
Variance1.0484875 × 1020
MonotonicityNot monotonic
2024-04-21T03:37:57.091229image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=40)
ValueCountFrequency (%)
0 9
 
18.8%
16508743000 1
 
2.1%
25923413000 1
 
2.1%
15137856000 1
 
2.1%
3244778000 1
 
2.1%
5056720000 1
 
2.1%
12187396000 1
 
2.1%
20632500000 1
 
2.1%
2113367000 1
 
2.1%
28038509000 1
 
2.1%
Other values (30) 30
62.5%
ValueCountFrequency (%)
0 9
18.8%
34894000 1
 
2.1%
2011936000 1
 
2.1%
2113367000 1
 
2.1%
2251581000 1
 
2.1%
2268935000 1
 
2.1%
2280887000 1
 
2.1%
2927221000 1
 
2.1%
3046957000 1
 
2.1%
3244778000 1
 
2.1%
ValueCountFrequency (%)
42969276000 1
2.1%
31771634000 1
2.1%
28038509000 1
2.1%
26941267000 1
2.1%
25923413000 1
2.1%
20915098000 1
2.1%
20802205000 1
2.1%
20632500000 1
2.1%
20530854000 1
2.1%
19827564000 1
2.1%

비과세감면율
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct18
Distinct (%)37.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10.166667
Minimum0
Maximum44
Zeros11
Zeros (%)22.9%
Negative0
Negative (%)0.0%
Memory size564.0 B
2024-04-21T03:37:57.182106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q12.75
median5.5
Q313.25
95-th percentile39
Maximum44
Range44
Interquartile range (IQR)10.5

Descriptive statistics

Standard deviation12.831698
Coefficient of variation (CV)1.2621343
Kurtosis1.3751791
Mean10.166667
Median Absolute Deviation (MAD)5.5
Skewness1.5924139
Sum488
Variance164.65248
MonotonicityNot monotonic
2024-04-21T03:37:57.268587image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
0 11
22.9%
3 9
18.8%
6 5
10.4%
7 3
 
6.2%
14 3
 
6.2%
39 3
 
6.2%
12 2
 
4.2%
5 2
 
4.2%
38 1
 
2.1%
20 1
 
2.1%
Other values (8) 8
16.7%
ValueCountFrequency (%)
0 11
22.9%
2 1
 
2.1%
3 9
18.8%
4 1
 
2.1%
5 2
 
4.2%
6 5
10.4%
7 3
 
6.2%
10 1
 
2.1%
12 2
 
4.2%
13 1
 
2.1%
ValueCountFrequency (%)
44 1
 
2.1%
41 1
 
2.1%
39 3
6.2%
38 1
 
2.1%
27 1
 
2.1%
20 1
 
2.1%
18 1
 
2.1%
14 3
6.2%
13 1
 
2.1%
12 2
4.2%

Interactions

2024-04-21T03:37:54.893014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:37:53.340515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:37:53.837093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:37:54.203351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:37:54.547925image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:37:54.957811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:37:53.454433image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:37:53.908094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:37:54.265657image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:37:54.629513image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:37:55.033365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:37:53.652319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:37:53.998868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:37:54.337884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:37:54.717043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:37:55.106825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:37:53.715666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:37:54.070450image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:37:54.395796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:37:54.776090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:37:55.169001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:37:53.777460image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:37:54.135824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:37:54.469253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:37:54.835467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T03:37:57.333611image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
세목명과세년도비과세금액(원)감면금액(원)부과금액(원)비과세감면율
세목명1.0000.0000.6100.7460.9610.904
과세년도0.0001.0000.0000.0000.0000.000
비과세금액(원)0.6100.0001.0000.8630.6640.894
감면금액(원)0.7460.0000.8631.0000.8510.897
부과금액(원)0.9610.0000.6640.8511.0000.874
비과세감면율0.9040.0000.8940.8970.8741.000
2024-04-21T03:37:57.433549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과세년도비과세금액(원)감면금액(원)부과금액(원)비과세감면율세목명
과세년도1.0000.0090.0180.2530.0030.000
비과세금액(원)0.0091.0000.7860.4400.9030.375
감면금액(원)0.0180.7861.0000.4680.7590.530
부과금액(원)0.2530.4400.4681.0000.2870.685
비과세감면율0.0030.9030.7590.2871.0000.530
세목명0.0000.3750.5300.6850.5301.000

Missing values

2024-04-21T03:37:55.253109image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T03:37:55.359170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명자치단체코드세목명과세년도비과세금액(원)감면금액(원)부과금액(원)비과세감면율
0경상남도통영시48220교육세201702000126043350000
1경상남도통영시48220등록세201702326000000
2경상남도통영시48220재산세201762413410001013909000039
3경상남도통영시48220주민세20171269620003702200007
4경상남도통영시48220취득세201725554330005323341000027
5경상남도통영시48220자동차세20175374900042323900003
6경상남도통영시48220등록면허세20171242300014004700029272210005
7경상남도통영시48220지역자원시설세20171814460008641400046041960006
8경상남도통영시48220교육세201802000134848390000
9경상남도통영시48220등록세2018061000000
시도명시군구명자치단체코드세목명과세년도비과세금액(원)감면금액(원)부과금액(원)비과세감면율
38경상남도통영시48220등록면허세202196200009324700033564380003
39경상남도통영시48220지역자원시설세20212043180008193600049363740006
40경상남도통영시48220교육세2022023000121567010000
41경상남도통영시48220등록세202207400000
42경상남도통영시48220재산세2022791460700012887750002091509800044
43경상남도통영시48220주민세202267028000230663000225158100013
44경상남도통영시48220취득세202269965400026074950002694126700012
45경상남도통영시48220자동차세202257250000351320000147385750003
46경상남도통영시48220등록면허세2022112230006315000033613640002
47경상남도통영시48220지역자원시설세20222158870008373600048315010006