Overview

Dataset statistics

Number of variables9
Number of observations33
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.6 KiB
Average record size in memory82.0 B

Variable types

Categorical5
Numeric4

Dataset

Description부산광역시해운대구_지방세과세현황_20191231
Author부산광역시 해운대구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15078914

Alerts

시도명 has constant value ""Constant
시군구명 has constant value ""Constant
자치단체코드 has constant value ""Constant
과세건수 is highly overall correlated with 비과세건수 and 1 other fieldsHigh correlation
비과세건수 is highly overall correlated with 과세건수 and 1 other fieldsHigh correlation
비과세금액 is highly overall correlated with 비과세건수 High correlation
세목명 is highly overall correlated with 과세건수 High correlation
과세건수 has 9 (27.3%) zerosZeros
과세금액 has 9 (27.3%) zerosZeros
비과세건수 has 12 (36.4%) zerosZeros
비과세금액 has 13 (39.4%) zerosZeros

Reproduction

Analysis started2023-12-10 16:59:50.824803
Analysis finished2023-12-10 16:59:54.096696
Duration3.27 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size396.0 B
부산광역시
33 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부산광역시
2nd row부산광역시
3rd row부산광역시
4th row부산광역시
5th row부산광역시

Common Values

ValueCountFrequency (%)
부산광역시 33
100.0%

Length

2023-12-11T01:59:54.206944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:59:54.347046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부산광역시 33
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size396.0 B
해운대구
33 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row해운대구
2nd row해운대구
3rd row해운대구
4th row해운대구
5th row해운대구

Common Values

ValueCountFrequency (%)
해운대구 33
100.0%

Length

2023-12-11T01:59:54.861178image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:59:55.019988image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
해운대구 33
100.0%

자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size396.0 B
26350
33 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row26350
2nd row26350
3rd row26350
4th row26350
5th row26350

Common Values

ValueCountFrequency (%)
26350 33
100.0%

Length

2023-12-11T01:59:55.187560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:59:55.343075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
26350 33
100.0%

과세년도
Categorical

Distinct3
Distinct (%)9.1%
Missing0
Missing (%)0.0%
Memory size396.0 B
2017
12 
2019
12 
2018

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2017
2nd row2017
3rd row2017
4th row2017
5th row2017

Common Values

ValueCountFrequency (%)
2017 12
36.4%
2019 12
36.4%
2018 9
27.3%

Length

2023-12-11T01:59:55.524694image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:59:55.721633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2017 12
36.4%
2019 12
36.4%
2018 9
27.3%

세목명
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)39.4%
Missing0
Missing (%)0.0%
Memory size396.0 B
취득세
주민세
재산세
자동차세
등록면허세
Other values (8)
18 

Length

Max length7
Median length5
Mean length4.1818182
Min length3

Unique

Unique1 ?
Unique (%)3.0%

Sample

1st row취득세
2nd row주민세
3rd row재산세
4th row자동차세
5th row레저세

Common Values

ValueCountFrequency (%)
취득세 3
9.1%
주민세 3
9.1%
재산세 3
9.1%
자동차세 3
9.1%
등록면허세 3
9.1%
지역자원시설세 3
9.1%
지방소득세 3
9.1%
교육세 3
9.1%
레저세 2
 
6.1%
담배소비세 2
 
6.1%
Other values (3) 5
15.2%

Length

2023-12-11T01:59:55.937854image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
취득세 3
9.1%
주민세 3
9.1%
재산세 3
9.1%
자동차세 3
9.1%
등록면허세 3
9.1%
지역자원시설세 3
9.1%
지방소득세 3
9.1%
교육세 3
9.1%
레저세 2
 
6.1%
담배소비세 2
 
6.1%
Other values (3) 5
15.2%

과세건수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct25
Distinct (%)75.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean191315.18
Minimum0
Maximum866095
Zeros9
Zeros (%)27.3%
Negative0
Negative (%)0.0%
Memory size429.0 B
2023-12-11T01:59:56.203108image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median126559
Q3250822
95-th percentile857840.4
Maximum866095
Range866095
Interquartile range (IQR)250822

Descriptive statistics

Standard deviation240754.37
Coefficient of variation (CV)1.2584175
Kurtosis3.6309081
Mean191315.18
Median Absolute Deviation (MAD)125586
Skewness1.9685923
Sum6313401
Variance5.7962665 × 1010
MonotonicityNot monotonic
2023-12-11T01:59:56.408298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%)
0 9
27.3%
21414 1
 
3.0%
308519 1
 
3.0%
866095 1
 
3.0%
132299 1
 
3.0%
323196 1
 
3.0%
120864 1
 
3.0%
247936 1
 
3.0%
252145 1
 
3.0%
173322 1
 
3.0%
Other values (15) 15
45.5%
ValueCountFrequency (%)
0 9
27.3%
18014 1
 
3.0%
18549 1
 
3.0%
21414 1
 
3.0%
113983 1
 
3.0%
118252 1
 
3.0%
119382 1
 
3.0%
120864 1
 
3.0%
126559 1
 
3.0%
132299 1
 
3.0%
ValueCountFrequency (%)
866095 1
3.0%
858693 1
3.0%
857272 1
3.0%
323196 1
3.0%
308519 1
3.0%
300400 1
3.0%
252596 1
3.0%
252145 1
3.0%
250822 1
3.0%
247936 1
3.0%

과세금액
Real number (ℝ)

ZEROS 

Distinct25
Distinct (%)75.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.1455682 × 1010
Minimum0
Maximum1.4465198 × 1011
Zeros9
Zeros (%)27.3%
Negative0
Negative (%)0.0%
Memory size429.0 B
2023-12-11T01:59:56.591824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median1.6823571 × 1010
Q38.564285 × 1010
95-th percentile1.2754572 × 1011
Maximum1.4465198 × 1011
Range1.4465198 × 1011
Interquartile range (IQR)8.564285 × 1010

Descriptive statistics

Standard deviation4.7764754 × 1010
Coefficient of variation (CV)1.1521884
Kurtosis-0.59993613
Mean4.1455682 × 1010
Median Absolute Deviation (MAD)1.6823571 × 1010
Skewness0.98541198
Sum1.3680375 × 1012
Variance2.2814717 × 1021
MonotonicityNot monotonic
2023-12-11T01:59:56.772374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%)
0 9
27.3%
144651984000 1
 
3.0%
16823571000 1
 
3.0%
37017918000 1
 
3.0%
121083380000 1
 
3.0%
15821521000 1
 
3.0%
13780335000 1
 
3.0%
33622958000 1
 
3.0%
102611502000 1
 
3.0%
10463821000 1
 
3.0%
Other values (15) 15
45.5%
ValueCountFrequency (%)
0 9
27.3%
8975520000 1
 
3.0%
9573594000 1
 
3.0%
10463821000 1
 
3.0%
13223527000 1
 
3.0%
13780335000 1
 
3.0%
15641788000 1
 
3.0%
15821521000 1
 
3.0%
16823571000 1
 
3.0%
23016990000 1
 
3.0%
ValueCountFrequency (%)
144651984000 1
3.0%
128546786000 1
3.0%
126878338000 1
3.0%
121083380000 1
3.0%
115536222000 1
3.0%
108838525000 1
3.0%
102611502000 1
3.0%
94720413000 1
3.0%
85642850000 1
3.0%
38121807000 1
3.0%

비과세건수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct22
Distinct (%)66.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11165.212
Minimum0
Maximum53708
Zeros12
Zeros (%)36.4%
Negative0
Negative (%)0.0%
Memory size429.0 B
2023-12-11T01:59:56.946814image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median3037
Q314480
95-th percentile45990.4
Maximum53708
Range53708
Interquartile range (IQR)14480

Descriptive statistics

Standard deviation16062.409
Coefficient of variation (CV)1.4386121
Kurtosis1.2047247
Mean11165.212
Median Absolute Deviation (MAD)3037
Skewness1.5246466
Sum368452
Variance2.5800099 × 108
MonotonicityNot monotonic
2023-12-11T01:59:57.166469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=22)
ValueCountFrequency (%)
0 12
36.4%
13540 1
 
3.0%
4509 1
 
3.0%
9 1
 
3.0%
1881 1
 
3.0%
5302 1
 
3.0%
47593 1
 
3.0%
53708 1
 
3.0%
21755 1
 
3.0%
14480 1
 
3.0%
Other values (12) 12
36.4%
ValueCountFrequency (%)
0 12
36.4%
2 1
 
3.0%
7 1
 
3.0%
9 1
 
3.0%
1881 1
 
3.0%
3037 1
 
3.0%
4509 1
 
3.0%
5302 1
 
3.0%
7833 1
 
3.0%
9233 1
 
3.0%
ValueCountFrequency (%)
53708 1
3.0%
47593 1
3.0%
44922 1
3.0%
41776 1
3.0%
37487 1
3.0%
21755 1
3.0%
18962 1
3.0%
17250 1
3.0%
14480 1
3.0%
13973 1
3.0%

비과세금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct21
Distinct (%)63.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.3484868 × 109
Minimum0
Maximum4.4687492 × 1010
Zeros13
Zeros (%)39.4%
Negative0
Negative (%)0.0%
Memory size429.0 B
2023-12-11T01:59:57.386810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median2.97973 × 108
Q32.071916 × 109
95-th percentile3.7506622 × 1010
Maximum4.4687492 × 1010
Range4.4687492 × 1010
Interquartile range (IQR)2.071916 × 109

Descriptive statistics

Standard deviation1.3049452 × 1010
Coefficient of variation (CV)2.0555216
Kurtosis2.7455265
Mean6.3484868 × 109
Median Absolute Deviation (MAD)2.97973 × 108
Skewness2.0228635
Sum2.0950006 × 1011
Variance1.7028819 × 1020
MonotonicityNot monotonic
2023-12-11T01:59:57.593430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=21)
ValueCountFrequency (%)
0 13
39.4%
22254713000 1
 
3.0%
297973000 1
 
3.0%
1000 1
 
3.0%
961889000 1
 
3.0%
273017000 1
 
3.0%
2079026000 1
 
3.0%
44687492000 1
 
3.0%
1415929000 1
 
3.0%
23533099000 1
 
3.0%
Other values (11) 11
33.3%
ValueCountFrequency (%)
0 13
39.4%
1000 1
 
3.0%
14273000 1
 
3.0%
273017000 1
 
3.0%
297973000 1
 
3.0%
303944000 1
 
3.0%
961889000 1
 
3.0%
1026445000 1
 
3.0%
1087393000 1
 
3.0%
1415929000 1
 
3.0%
ValueCountFrequency (%)
44687492000 1
3.0%
40270556000 1
3.0%
35664000000 1
3.0%
28582176000 1
3.0%
23533099000 1
3.0%
22254713000 1
3.0%
2112273000 1
3.0%
2079026000 1
3.0%
2071916000 1
3.0%
1434401000 1
3.0%

Interactions

2023-12-11T01:59:53.119816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:59:51.253886image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:59:51.808579image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:59:52.458168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:59:53.271698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:59:51.382143image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:59:51.961389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:59:52.597775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:59:53.415994image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:59:51.545022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:59:52.120962image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:59:52.773539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:59:53.553018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:59:51.679522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:59:52.294768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:59:52.937766image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T01:59:57.754744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과세년도세목명과세건수과세금액비과세건수비과세금액
과세년도1.0000.0000.0000.0000.0000.000
세목명0.0001.0000.9810.8150.7240.446
과세건수0.0000.9811.0000.7200.6880.000
과세금액0.0000.8150.7201.0000.7830.887
비과세건수0.0000.7240.6880.7831.0000.768
비과세금액0.0000.4460.0000.8870.7681.000
2023-12-11T01:59:57.955538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
세목명과세년도
세목명1.0000.000
과세년도0.0001.000
2023-12-11T01:59:58.114754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과세건수과세금액비과세건수비과세금액과세년도세목명
과세건수1.0000.4960.5110.3960.0000.814
과세금액0.4961.0000.4270.4860.0000.482
비과세건수0.5110.4271.0000.9500.0000.365
비과세금액0.3960.4860.9501.0000.0000.172
과세년도0.0000.0000.0000.0001.0000.000
세목명0.8140.4820.3650.1720.0001.000

Missing values

2023-12-11T01:59:53.761015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:59:54.013015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명자치단체코드과세년도세목명과세건수과세금액비과세건수비과세금액
0부산광역시해운대구263502017취득세214141446519840001354022254713000
1부산광역시해운대구263502017주민세1729308975520000111931429547000
2부산광역시해운대구263502017재산세237910856428500001896235664000000
3부산광역시해운대구263502017자동차세25259633252280000374872071916000
4부산광역시해운대구263502017레저세0000
5부산광역시해운대구263502017담배소비세0000
6부산광역시해운대구263502017지방소비세0000
7부산광역시해운대구263502017등록면허세119382132235270003037303944000
8부산광역시해운대구263502017도시계획세0000
9부산광역시해운대구263502017지역자원시설세3004001564178800078331026445000
시도명시군구명자치단체코드과세년도세목명과세건수과세금액비과세건수비과세금액
23부산광역시해운대구263502019재산세2521451026115020005370844687492000
24부산광역시해운대구263502019자동차세24793633622958000475932079026000
25부산광역시해운대구263502019레저세0000
26부산광역시해운대구263502019담배소비세0000
27부산광역시해운대구263502019지방소비세0000
28부산광역시해운대구263502019등록면허세120864137803350005302273017000
29부산광역시해운대구263502019도시계획세0000
30부산광역시해운대구263502019지역자원시설세323196158215210001881961889000
31부산광역시해운대구263502019지방소득세13229912108338000000
32부산광역시해운대구263502019교육세8660953701791800091000