Overview

Dataset statistics

Number of variables: 9
Number of observations: 41
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 3.3 KiB
Average record size in memory: 81.2 B

Variable types

Categorical: 4
Numeric: 5

Dataset

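Description: 부산광역시동래구_지방세비과및감면율현황_20221231 (Busan Metropolitan City, Dongnae-gu: local tax exemption and reduction rates, dated 2022-12-31)
Author: 부산광역시 동래구 (Dongnae-gu, Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15087154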

Alerts

시도명 has constant value "부산광역시" (Constant)
시군구명 has constant value "동래구" (Constant)
자치단체코드 has constant value "26260" (Constant)
비과세금액 is highly overall correlated with 감면금액 and 1 other field (High correlation)
감면금액 is highly overall correlated with 비과세금액 and 2 other fields (High correlation)
부과금액 is highly overall correlated with 감면금액 and 1 other field (High correlation)
비과세감면율 is highly overall correlated with 비과세금액 and 2 other fields (High correlation)
세목명 is highly overall correlated with 부과금액 and 1 other field (High correlation)
감면금액 has unique values (Unique)
부과금액 has unique values (Unique)
비과세금액 has 5 (12.2%) zeros (Zeros)
비과세감면율 has 5 (12.2%) zeros (Zeros)
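
The percentages in the alerts appear to be plain counts divided by the 41 observations and rounded to one decimal place. A minimal sketch of that arithmetic follows; the helper name `pct` is hypothetical and not part of the report.

```python
def pct(count: int, total: int) -> float:
    """Return count as a percentage of total, rounded to one decimal place."""
    return round(100 * count / total, 1)


N_OBSERVATIONS = 41  # "Number of observations" in the dataset statistics

# 5 zero rows are reported for both 비과세금액 and 비과세감면율.
zeros_pct = pct(5, N_OBSERVATIONS)

# Each constant column has exactly 1 distinct value.
constant_distinct_pct = pct(1, N_OBSERVATIONS)

print(zeros_pct, constant_distinct_pct)
```

Both results agree with the figures shown in the report (12.2% zeros, 2.4% distinct for the constant columns).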

Reproduction

Analysis started: 2023-12-10 16:46:05.734981
Analysis finished: 2023-12-10 16:46:10.764321
Duration: 5.03 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
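
The reported duration can be checked against the two timestamps. The sketch below assumes both timestamps are in the same time zone, which the report does not state explicitly.

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-12-10 16:46:05.734981", FMT)
finished = datetime.strptime("2023-12-10 16:46:10.764321", FMT)

# Elapsed wall-clock time between the start and end of the analysis.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```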

Variables

시도명 (province or metropolitan city name)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 2.4%
Missing: 0
Missing (%): 0.0%
Memory size: 460.0 B
부산광역시: 41

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 부산광역시
2nd row: 부산광역시
3rd row: 부산광역시
4th row: 부산광역시
5th row: 부산광역시

Common Values

Value | Count | Frequency (%)
부산광역시 | 41 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

시군구명 (city, county, or district name)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 2.4%
Missing: 0
Missing (%): 0.0%
Memory size: 460.0 B
동래구: 41

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 동래구
2nd row: 동래구
3rd row: 동래구
4th row: 동래구
5th row: 동래구

Common Values

Value | Count | Frequency (%)
동래구 | 41 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

자치단체코드 (local government code)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 2.4%
Missing: 0
Missing (%): 0.0%
Memory size: 460.0 B
26260: 41

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 26260
2nd row: 26260
3rd row: 26260
4th row: 26260
5th row: 26260

Common Values

Value | Count | Frequency (%)
26260 | 41 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

세목명 (tax item name)
Categorical

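HIGH CORRELATION

Distinct: 7
Distinct (%): 17.1%
Missing: 0
Missing (%): 0.0%
Memory size: 460.0 B
재산세: 6
주민세: 6
취득세: 6
자동차세: 6
등록면허세: 6
Other values (2): 11

Length

Max length: 7
Median length: 3
Mean length: 4.0243902
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 재산세
2nd row: 주민세
3rd row: 취득세
4th row: 자동차세
5th row: 등록면허세

Common Values

Value | Count | Frequency (%)
재산세 (property tax) | 6 | 14.6%
주민세 (resident tax) | 6 | 14.6%
취득세 (acquisition tax) | 6 | 14.6%
자동차세 (automobile tax) | 6 | 14.6%
등록면허세 (registration and license tax) | 6 | 14.6%
지역자원시설세 (local resource and facility tax) | 6 | 14.6%
교육세 (education tax) | 5 | 12.2%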
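
The length statistics for 세목명 follow directly from the value counts. The sketch below rebuilds them, assuming that length means the number of Unicode code points (what Python's `len` returns for these precomposed Hangul strings).

```python
from statistics import mean, median

# Value counts taken from the Common Values table of 세목명.
counts = {
    "재산세": 6,
    "주민세": 6,
    "취득세": 6,
    "자동차세": 6,
    "등록면허세": 6,
    "지역자원시설세": 6,
    "교육세": 5,
}

# One length per observation: each name repeated as often as it occurs.
lengths = [len(name) for name, n in counts.items() for _ in range(n)]

mean_len = mean(lengths)
median_len = median(lengths)
print(len(lengths), min(lengths), median_len, max(lengths), round(mean_len, 7))
```

The 41 lengths sum to 165, so the mean is 165 / 41, which matches the reported 4.0243902.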

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

과세년도 (tax year)
Real number (ℝ)

Distinct: 6
Distinct (%): 14.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2019.561
Minimum: 2017
Maximum: 2022
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 501.0 B
[Figure: histogram of 과세년도]

Quantile statistics

Minimum: 2017
5-th percentile: 2017
Q1: 2018
Median: 2020
Q3: 2021
95-th percentile: 2022
Maximum: 2022
Range: 5
Interquartile range (IQR): 3

Descriptive statistics

Standard deviation: 1.7036546
Coefficient of variation (CV): 0.00084357671
Kurtosis: -1.2445947
Mean: 2019.561
Median Absolute Deviation (MAD): 1
Skewness: -0.02987987
Sum: 82802
Variance: 2.902439
Monotonicity: Increasing
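
Several of the descriptive statistics for 과세년도 can be recomputed from its value counts (6 rows for 2017, 7 rows for each of 2018 to 2022). The sketch below uses the sample variance (divisor n - 1); that choice is an assumption, made because it reproduces the reported figures.

```python
from statistics import mean, stdev, variance

# Value counts of 과세년도 as listed in the report.
year_counts = {2017: 6, 2018: 7, 2019: 7, 2020: 7, 2021: 7, 2022: 7}
years = [year for year, n in year_counts.items() for _ in range(n)]

total = sum(years)
avg = mean(years)
var = variance(years)  # sample variance, divisor n - 1
sd = stdev(years)      # square root of the sample variance
cv = sd / avg          # coefficient of variation

print(total, round(avg, 3), round(var, 6), round(sd, 7), cv)
```

The coefficient of variation is tiny here only because the mean is a calendar year around 2020 while the spread is under two years.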
[Figure: histogram with fixed size bins (bins=6)]

Common values

Value | Count | Frequency (%)
2018 | 7 | 17.1%
2019 | 7 | 17.1%
2020 | 7 | 17.1%
2021 | 7 | 17.1%
2022 | 7 | 17.1%
2017 | 6 | 14.6%

Minimum values

Value | Count | Frequency (%)
2017 | 6 | 14.6%
2018 | 7 | 17.1%
2019 | 7 | 17.1%
2020 | 7 | 17.1%
2021 | 7 | 17.1%
2022 | 7 | 17.1%

Maximum values

Value | Count | Frequency (%)
2022 | 7 | 17.1%
2021 | 7 | 17.1%
2020 | 7 | 17.1%
2019 | 7 | 17.1%
2018 | 7 | 17.1%
2017 | 6 | 14.6%

비과세금액 (tax-exempt amount)
Real number (ℝ)

HIGH CORRELATION, ZEROS

Distinct: 37
Distinct (%): 90.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.9267554 × 10^9
Minimum: 0
Maximum: 1.9026074 × 10^10
Zeros: 5
Zeros (%): 12.2%
Negative: 0
Negative (%): 0.0%
Memory size: 501.0 B
[Figure: histogram of 비과세금액]

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 24674000
Median: 1.82535 × 10^8
Q3: 3.35935 × 10^9
95-th percentile: 1.5578706 × 10^10
Maximum: 1.9026074 × 10^10
Range: 1.9026074 × 10^10
Interquartile range (IQR): 3.334676 × 10^9
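
The range and interquartile range are simple differences of the quantiles above. A short check, with the scientific-notation figures written out as integers (the expansions of Q3 and the maximum are taken from values that also appear in the frequency listings of this variable):

```python
# Quantiles of 비과세금액 as reported.
minimum = 0
q1 = 24_674_000
q3 = 3_359_350_000         # 3.35935 × 10^9
maximum = 19_026_074_000   # 1.9026074 × 10^10

value_range = maximum - minimum  # Range = Maximum - Minimum
iqr = q3 - q1                    # IQR = Q3 - Q1

print(f"range = {value_range:.7e}, IQR = {iqr:.6e}")
```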

Descriptive statistics

Standard deviation: 5.4735044 × 10^9
Coefficient of variation (CV): 1.8701612
Kurtosis: 2.4149423
Mean: 2.9267554 × 10^9
Median Absolute Deviation (MAD): 1.75596 × 10^8
Skewness: 1.9222611
Sum: 1.1999697 × 10^11
Variance: 2.995925 × 10^19
Monotonicity: Not monotonic
[Figure: histogram with fixed size bins (bins=37)]

Common values

Value | Count | Frequency (%)
0 | 5 | 12.2%
11524085000 | 1 | 2.4%
5038000 | 1 | 2.4%
182535000 | 1 | 2.4%
6818000 | 1 | 2.4%
216391000 | 1 | 2.4%
17395971000 | 1 | 2.4%
65056000 | 1 | 2.4%
3359350000 | 1 | 2.4%
172039000 | 1 | 2.4%
Other values (27) | 27 | 65.9%

Minimum values

Value | Count | Frequency (%)
0 | 5 | 12.2%
5038000 | 1 | 2.4%
6225000 | 1 | 2.4%
6818000 | 1 | 2.4%
6939000 | 1 | 2.4%
9405000 | 1 | 2.4%
24674000 | 1 | 2.4%
64443000 | 1 | 2.4%
65056000 | 1 | 2.4%
87730000 | 1 | 2.4%

Maximum values

Value | Count | Frequency (%)
19026074000 | 1 | 2.4%
17395971000 | 1 | 2.4%
15578706000 | 1 | 2.4%
14733325000 | 1 | 2.4%
13202540000 | 1 | 2.4%
11524085000 | 1 | 2.4%
6335350000 | 1 | 2.4%
5126286000 | 1 | 2.4%
4524592000 | 1 | 2.4%
3433795000 | 1 | 2.4%

감면금액 (tax reduction amount)
Real number (ℝ)

HIGH CORRELATION, UNIQUE

Distinct: 41
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.9400782 × 10^9
Minimum: 1000
Maximum: 3.0503164 × 10^10
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 501.0 B
[Figure: histogram of 감면금액]

Quantile statistics

Minimum: 1000
5-th percentile: 3000
Q1: 1.0766 × 10^8
Median: 1.93684 × 10^8
Q3: 2.611195 × 10^9
95-th percentile: 1.5990639 × 10^10
Maximum: 3.0503164 × 10^10
Range: 3.0503163 × 10^10
Interquartile range (IQR): 2.503535 × 10^9

Descriptive statistics

Standard deviation: 6.4429083 × 10^9
Coefficient of variation (CV): 2.1914071
Kurtosis: 10.035207
Mean: 2.9400782 × 10^9
Median Absolute Deviation (MAD): 1.93681 × 10^8
Skewness: 3.1303288
Sum: 1.2054321 × 10^11
Variance: 4.1511067 × 10^19
Monotonicity: Not monotonic
[Figure: histogram with fixed size bins (bins=41)]

Common values

Value | Count | Frequency (%)
2329254000 | 1 | 2.4%
923096000 | 1 | 2.4%
11542350000 | 1 | 2.4%
946827000 | 1 | 2.4%
66610000 | 1 | 2.4%
128679000 | 1 | 2.4%
4000 | 1 | 2.4%
3269569000 | 1 | 2.4%
144528000 | 1 | 2.4%
7246184000 | 1 | 2.4%
Other values (31) | 31 | 75.6%

Minimum values

Value | Count | Frequency (%)
1000 | 1 | 2.4%
2000 | 1 | 2.4%
3000 | 1 | 2.4%
4000 | 1 | 2.4%
7000 | 1 | 2.4%
34533000 | 1 | 2.4%
66610000 | 1 | 2.4%
76069000 | 1 | 2.4%
82827000 | 1 | 2.4%
104264000 | 1 | 2.4%

Maximum values

Value | Count | Frequency (%)
30503164000 | 1 | 2.4%
23354909000 | 1 | 2.4%
15990639000 | 1 | 2.4%
11542350000 | 1 | 2.4%
7246184000 | 1 | 2.4%
6347969000 | 1 | 2.4%
3388152000 | 1 | 2.4%
3269569000 | 1 | 2.4%
2973136000 | 1 | 2.4%
2856569000 | 1 | 2.4%

부과금액 (levied amount)
Real number (ℝ)

HIGH CORRELATION, UNIQUE

Distinct: 41
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.7788041 × 10^10
Minimum: 4.43171 × 10^9
Maximum: 1.1608308 × 10^11
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 501.0 B
[Figure: histogram of 부과금액]

Quantile statistics

Minimum: 4.43171 × 10^9
5-th percentile: 4.717851 × 10^9
Q1: 5.480137 × 10^9
Median: 1.8451139 × 10^10
Q3: 4.2334885 × 10^10
95-th percentile: 9.7431022 × 10^10
Maximum: 1.1608308 × 10^11
Range: 1.1165137 × 10^11
Interquartile range (IQR): 3.6854748 × 10^10

Descriptive statistics

Standard deviation: 2.9882307 × 10^10
Coefficient of variation (CV): 1.0753658
Kurtosis: 1.4137824
Mean: 2.7788041 × 10^10
Median Absolute Deviation (MAD): 1.3125392 × 10^10
Skewness: 1.4896513
Sum: 1.1393097 × 10^12
Variance: 8.929523 × 10^20
Monotonicity: Not monotonic
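
The descriptive statistics for 부과금액 are linked by a few identities: mean = sum / n, variance = (standard deviation)^2, and CV = standard deviation / mean. The sketch below checks them using only the rounded figures printed in the report, so agreement is expected to about seven significant digits rather than exactly.

```python
import math

n = 41                     # number of observations
total = 1.1393097e12       # Sum
mean_value = 2.7788041e10  # Mean
std = 2.9882307e10         # Standard deviation
var = 8.929523e20          # Variance
cv = 1.0753658             # Coefficient of variation (CV)

# Relative tolerance loose enough to absorb the rounding in the report.
REL_TOL = 1e-6

mean_ok = math.isclose(total / n, mean_value, rel_tol=REL_TOL)
var_ok = math.isclose(std ** 2, var, rel_tol=REL_TOL)
cv_ok = math.isclose(std / mean_value, cv, rel_tol=REL_TOL)

print(mean_ok, var_ok, cv_ok)
```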
[Figure: histogram with fixed size bins (bins=41)]

Common values

Value | Count | Frequency (%)
37892051000 | 1 | 2.4%
19720835000 | 1 | 2.4%
97431022000 | 1 | 2.4%
19301372000 | 1 | 2.4%
9568423000 | 1 | 2.4%
5325747000 | 1 | 2.4%
21947040000 | 1 | 2.4%
54106717000 | 1 | 2.4%
5141894000 | 1 | 2.4%
97974242000 | 1 | 2.4%
Other values (31) | 31 | 75.6%

Minimum values

Value | Count | Frequency (%)
4431710000 | 1 | 2.4%
4554473000 | 1 | 2.4%
4717851000 | 1 | 2.4%
4751779000 | 1 | 2.4%
4968850000 | 1 | 2.4%
5023859000 | 1 | 2.4%
5141894000 | 1 | 2.4%
5204641000 | 1 | 2.4%
5325747000 | 1 | 2.4%
5451571000 | 1 | 2.4%

Maximum values

Value | Count | Frequency (%)
116083082000 | 1 | 2.4%
97974242000 | 1 | 2.4%
97431022000 | 1 | 2.4%
82808130000 | 1 | 2.4%
63778148000 | 1 | 2.4%
63580434000 | 1 | 2.4%
61828398000 | 1 | 2.4%
54106717000 | 1 | 2.4%
51019335000 | 1 | 2.4%
47020094000 | 1 | 2.4%

비과세감면율 (exemption and reduction rate, %)
Real number (ℝ)

HIGH CORRELATION, ZEROS

Distinct: 36
Distinct (%): 87.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 11.720244
Minimum: 0
Maximum: 38.19
Zeros: 5
Zeros (%): 12.2%
Negative: 0
Negative (%): 0.0%
Memory size: 501.0 B
[Figure: histogram of 비과세감면율]

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 1.78
Median: 6.19
Q3: 15.37
95-th percentile: 37.35
Maximum: 38.19
Range: 38.19
Interquartile range (IQR): 13.59

Descriptive statistics

Standard deviation: 13.210074
Coefficient of variation (CV): 1.127116
Kurtosis: -0.29039527
Mean: 11.720244
Median Absolute Deviation (MAD): 4.63
Skewness: 1.16238
Sum: 480.53
Variance: 174.50605
Monotonicity: Not monotonic
[Figure: histogram with fixed size bins (bins=36)]

Common values

Value | Count | Frequency (%)
0.0 | 5 | 12.2%
6.19 | 2 | 4.9%
36.56 | 1 | 2.4%
5.85 | 1 | 2.4%
0.77 | 1 | 2.4%
6.48 | 1 | 2.4%
38.19 | 1 | 2.4%
4.08 | 1 | 2.4%
10.82 | 1 | 2.4%
5.55 | 1 | 2.4%
Other values (26) | 26 | 63.4%

Minimum values

Value | Count | Frequency (%)
0.0 | 5 | 12.2%
0.65 | 1 | 2.4%
0.77 | 1 | 2.4%
1.2 | 1 | 2.4%
1.53 | 1 | 2.4%
1.59 | 1 | 2.4%
1.78 | 1 | 2.4%
3.86 | 1 | 2.4%
4.08 | 1 | 2.4%
5.45 | 1 | 2.4%

Maximum values

Value | Count | Frequency (%)
38.19 | 1 | 2.4%
37.41 | 1 | 2.4%
37.35 | 1 | 2.4%
36.56 | 1 | 2.4%
36.36 | 1 | 2.4%
35.14 | 1 | 2.4%
32.27 | 1 | 2.4%
31.73 | 1 | 2.4%
31.53 | 1 | 2.4%
18.56 | 1 | 2.4%

Interactions

[Figures: 25 pairwise interaction plots of the numeric variables (과세년도, 비과세금액, 감면금액, 부과금액, 비과세감면율)]

Correlations

[Figure: correlation heatmap]

Variable | 세목명 | 과세년도 | 비과세금액 | 감면금액 | 부과금액 | 비과세감면율
세목명 | 1.000 | 0.000 | 0.672 | 0.652 | 0.786 | 0.865
과세년도 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000
비과세금액 | 0.672 | 0.000 | 1.000 | 0.877 | 0.925 | 0.757
감면금액 | 0.652 | 0.000 | 0.877 | 1.000 | 0.871 | 0.759
부과금액 | 0.786 | 0.000 | 0.925 | 0.871 | 1.000 | 0.808
비과세감면율 | 0.865 | 0.000 | 0.757 | 0.759 | 0.808 | 1.000

[Figure: correlation heatmap]

Variable | 과세년도 | 비과세금액 | 감면금액 | 부과금액 | 비과세감면율 | 세목명
과세년도 | 1.000 | -0.013 | -0.059 | 0.221 | -0.210 | 0.000
비과세금액 | -0.013 | 1.000 | 0.841 | 0.461 | 0.940 | 0.432
감면금액 | -0.059 | 0.841 | 1.000 | 0.572 | 0.859 | 0.278
부과금액 | 0.221 | 0.461 | 0.572 | 1.000 | 0.365 | 0.556
비과세감면율 | -0.210 | 0.940 | 0.859 | 0.365 | 1.000 | 0.722
세목명 | 0.000 | 0.432 | 0.278 | 0.556 | 0.722 | 1.000
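
The "High correlation" alerts can be cross-checked against the second correlation matrix (the one whose first row is 과세년도). The sketch below flags every pair whose absolute coefficient exceeds 0.5. That threshold is an assumption, chosen because it reproduces the partner counts in the alerts; the report itself does not state the cut-off.

```python
THRESHOLD = 0.5  # assumed cut-off, not stated in the report

names = ["과세년도", "비과세금액", "감면금액", "부과금액", "비과세감면율", "세목명"]
matrix = [
    [1.000, -0.013, -0.059, 0.221, -0.210, 0.000],
    [-0.013, 1.000, 0.841, 0.461, 0.940, 0.432],
    [-0.059, 0.841, 1.000, 0.572, 0.859, 0.278],
    [0.221, 0.461, 0.572, 1.000, 0.365, 0.556],
    [-0.210, 0.940, 0.859, 0.365, 1.000, 0.722],
    [0.000, 0.432, 0.278, 0.556, 0.722, 1.000],
]

# For each variable, list the other variables it is strongly correlated with.
partners = {
    name: [
        other
        for other, value in zip(names, row)
        if other != name and abs(value) > THRESHOLD
    ]
    for name, row in zip(names, matrix)
}

for name, others in partners.items():
    print(name, len(others), others)
```

Under this assumption, 과세년도 has no strongly correlated partner, which is consistent with it being the only non-constant variable without a High correlation alert.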

Missing values

[Figure: nullity count by column] A simple visualization of nullity by column.
[Figure: nullity matrix] The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

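First rows

Row | 시도명 | 시군구명 | 자치단체코드 | 세목명 | 과세년도 | 비과세금액 | 감면금액 | 부과금액 | 비과세감면율
0 | 부산광역시 | 동래구 | 26260 | 재산세 | 2017 | 11524085000 | 2329254000 | 37892051000 | 36.56
1 | 부산광역시 | 동래구 | 26260 | 주민세 | 2017 | 126250000 | 191340000 | 4431710000 | 7.17
2 | 부산광역시 | 동래구 | 26260 | 취득세 | 2017 | 4524592000 | 15990639000 | 63580434000 | 32.27
3 | 부산광역시 | 동래구 | 26260 | 자동차세 | 2017 | 158774000 | 997721000 | 18329554000 | 6.31
4 | 부산광역시 | 동래구 | 26260 | 등록면허세 | 2017 | 6225000 | 34533000 | 6240255000 | 0.65
5 | 부산광역시 | 동래구 | 26260 | 지역자원시설세 | 2017 | 203420000 | 176091000 | 4554473000 | 8.33
6 | 부산광역시 | 동래구 | 26260 | 교육세 | 2018 | 0 | 1000 | 16996195000 | 0.0
7 | 부산광역시 | 동래구 | 26260 | 재산세 | 2018 | 13202540000 | 2611195000 | 42334885000 | 37.35
8 | 부산광역시 | 동래구 | 26260 | 주민세 | 2018 | 125834000 | 193684000 | 4717851000 | 6.77
9 | 부산광역시 | 동래구 | 26260 | 취득세 | 2018 | 5126286000 | 6347969000 | 61828398000 | 18.56

Last rows

Row | 시도명 | 시군구명 | 자치단체코드 | 세목명 | 과세년도 | 비과세금액 | 감면금액 | 부과금액 | 비과세감면율
31 | 부산광역시 | 동래구 | 26260 | 자동차세 | 2021 | 172039000 | 923096000 | 19720835000 | 5.55
32 | 부산광역시 | 동래구 | 26260 | 등록면허세 | 2021 | 5038000 | 104264000 | 7141827000 | 1.53
33 | 부산광역시 | 동래구 | 26260 | 지역자원시설세 | 2021 | 213866000 | 125213000 | 5480137000 | 6.19
34 | 부산광역시 | 동래구 | 26260 | 교육세 | 2022 | 0 | 7000 | 24454761000 | 0.0
35 | 부산광역시 | 동래구 | 26260 | 재산세 | 2022 | 19026074000 | 3388152000 | 63778148000 | 35.14
36 | 부산광역시 | 동래구 | 26260 | 주민세 | 2022 | 64443000 | 146154000 | 5451571000 | 3.86
37 | 부산광역시 | 동래구 | 26260 | 취득세 | 2022 | 6335350000 | 30503164000 | 116083082000 | 31.73
38 | 부산광역시 | 동래구 | 26260 | 자동차세 | 2022 | 201645000 | 920863000 | 19985618000 | 5.62
39 | 부산광역시 | 동래구 | 26260 | 등록면허세 | 2022 | 9405000 | 107660000 | 7383827000 | 1.59
40 | 부산광역시 | 동래구 | 26260 | 지역자원시설세 | 2022 | 219751000 | 135081000 | 5737429000 | 6.18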
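
The sample rows suggest that 비과세감면율 is derived from the three amount columns as (비과세금액 + 감면금액) / 부과금액 × 100, rounded to two decimals. This formula is inferred from the sample values, not documented in the report, so treat the sketch below as a consistency check under that assumption. The function name `exemption_reduction_rate` is hypothetical.

```python
# A few sample rows:
# (세목명, 과세년도, 비과세금액, 감면금액, 부과금액, reported 비과세감면율)
rows = [
    ("재산세", 2017, 11_524_085_000, 2_329_254_000, 37_892_051_000, 36.56),
    ("주민세", 2017, 126_250_000, 191_340_000, 4_431_710_000, 7.17),
    ("교육세", 2018, 0, 1_000, 16_996_195_000, 0.0),
    ("취득세", 2022, 6_335_350_000, 30_503_164_000, 116_083_082_000, 31.73),
]


def exemption_reduction_rate(exempt: int, reduced: int, levied: int) -> float:
    """Assumed formula: (exempt + reduced) / levied, as a percentage."""
    return round(100 * (exempt + reduced) / levied, 2)


# Compare the recomputed rate with the rate printed in the sample.
computed = [
    exemption_reduction_rate(exempt, reduced, levied)
    for _, _, exempt, reduced, levied, _ in rows
]
for (name, year, *_amounts, reported), rate in zip(rows, computed):
    print(name, year, rate, reported)
```

If the formula holds, it would also explain the strong correlation between 비과세감면율 and the two amount columns in its numerator.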