Overview

Dataset statistics

Number of variables9
Number of observations59
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.6 KiB
Average record size in memory80.2 B

Variable types

Categorical5
Numeric4

Dataset

Description부산광역시해운대구_1인당지방세부담액_20211231
Author부산광역시 해운대구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15078886

Alerts

시도명 has constant value ""Constant
시군구명 has constant value ""Constant
자치단체코드 has constant value ""Constant
과세건수 is highly overall correlated with 과세금액 and 2 other fieldsHigh correlation
과세금액 is highly overall correlated with 과세건수 and 1 other fieldsHigh correlation
비과세건수 is highly overall correlated with 과세건수 and 2 other fieldsHigh correlation
비과세금액 is highly overall correlated with 비과세건수High correlation
세목명 is highly overall correlated with 과세건수High correlation
과세건수 has 17 (28.8%) zerosZeros
과세금액 has 18 (30.5%) zerosZeros
비과세건수 has 22 (37.3%) zerosZeros
비과세금액 has 25 (42.4%) zerosZeros

Reproduction

Analysis started2023-12-10 17:20:51.937869
Analysis finished2023-12-10 17:20:56.167736
Duration4.23 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size604.0 B
부산광역시
59 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부산광역시
2nd row부산광역시
3rd row부산광역시
4th row부산광역시
5th row부산광역시

Common Values

ValueCountFrequency (%)
부산광역시 59
100.0%

Length

2023-12-11T02:20:56.328695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T02:20:56.554555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부산광역시 59
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size604.0 B
해운대구
59 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row해운대구
2nd row해운대구
3rd row해운대구
4th row해운대구
5th row해운대구

Common Values

ValueCountFrequency (%)
해운대구 59
100.0%

Length

2023-12-11T02:20:56.804617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T02:20:57.046159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
해운대구 59
100.0%

자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size604.0 B
26350
59 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row26350
2nd row26350
3rd row26350
4th row26350
5th row26350

Common Values

ValueCountFrequency (%)
26350 59
100.0%

Length

2023-12-11T02:20:57.285331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T02:20:58.056305image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
26350 59
100.0%

과세년도
Categorical

Distinct5
Distinct (%)8.5%
Missing0
Missing (%)0.0%
Memory size604.0 B
2020
13 
2021
13 
2017
12 
2019
12 
2018

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2017
2nd row2017
3rd row2017
4th row2017
5th row2017

Common Values

ValueCountFrequency (%)
2020 13
22.0%
2021 13
22.0%
2017 12
20.3%
2019 12
20.3%
2018 9
15.3%

Length

2023-12-11T02:20:58.273771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T02:20:58.504659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020 13
22.0%
2021 13
22.0%
2017 12
20.3%
2019 12
20.3%
2018 9
15.3%

세목명
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)22.0%
Missing0
Missing (%)0.0%
Memory size604.0 B
취득세
주민세
재산세
자동차세
등록면허세
Other values (8)
34 

Length

Max length7
Median length5
Mean length4.1694915
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row취득세
2nd row주민세
3rd row재산세
4th row자동차세
5th row레저세

Common Values

ValueCountFrequency (%)
취득세 5
8.5%
주민세 5
8.5%
재산세 5
8.5%
자동차세 5
8.5%
등록면허세 5
8.5%
지역자원시설세 5
8.5%
지방소득세 5
8.5%
교육세 5
8.5%
레저세 4
 
6.8%
담배소비세 4
 
6.8%
Other values (3) 11
18.6%

Length

2023-12-11T02:20:58.756539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
취득세 5
8.5%
주민세 5
8.5%
재산세 5
8.5%
자동차세 5
8.5%
등록면허세 5
8.5%
지역자원시설세 5
8.5%
지방소득세 5
8.5%
교육세 5
8.5%
레저세 4
 
6.8%
담배소비세 4
 
6.8%
Other values (3) 11
18.6%

과세건수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct42
Distinct (%)71.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean183515.14
Minimum0
Maximum919855
Zeros17
Zeros (%)28.8%
Negative0
Negative (%)0.0%
Memory size663.0 B
2023-12-11T02:20:59.024611image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median128761
Q3251021.5
95-th percentile859433.2
Maximum919855
Range919855
Interquartile range (IQR)251021.5

Descriptive statistics

Standard deviation241209.73
Coefficient of variation (CV)1.314386
Kurtosis3.7228003
Mean183515.14
Median Absolute Deviation (MAD)128757
Skewness2.00732
Sum10827393
Variance5.8182133 × 1010
MonotonicityNot monotonic
2023-12-11T02:20:59.264476image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=42)
ValueCountFrequency (%)
0 17
28.8%
4 2
 
3.4%
21414 1
 
1.7%
144298 1
 
1.7%
866095 1
 
1.7%
30687 1
 
1.7%
175051 1
 
1.7%
260861 1
 
1.7%
251221 1
 
1.7%
150432 1
 
1.7%
Other values (32) 32
54.2%
ValueCountFrequency (%)
0 17
28.8%
4 2
 
3.4%
18014 1
 
1.7%
18549 1
 
1.7%
21414 1
 
1.7%
24399 1
 
1.7%
30687 1
 
1.7%
113983 1
 
1.7%
118252 1
 
1.7%
119382 1
 
1.7%
ValueCountFrequency (%)
919855 1
1.7%
902192 1
1.7%
866095 1
1.7%
858693 1
1.7%
857272 1
1.7%
336026 1
1.7%
333187 1
1.7%
323196 1
1.7%
308519 1
1.7%
300400 1
1.7%

과세금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct42
Distinct (%)71.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.8895028 × 1010
Minimum0
Maximum4.1303 × 1011
Zeros18
Zeros (%)30.5%
Negative0
Negative (%)0.0%
Memory size663.0 B
2023-12-11T02:20:59.515416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median1.7937813 × 1010
Q37.6122843 × 1010
95-th percentile1.458065 × 1011
Maximum4.1303 × 1011
Range4.1303 × 1011
Interquartile range (IQR)7.6122843 × 1010

Descriptive statistics

Standard deviation7.4002386 × 1010
Coefficient of variation (CV)1.5134951
Kurtosis9.9187415
Mean4.8895028 × 1010
Median Absolute Deviation (MAD)1.7937813 × 1010
Skewness2.7251079
Sum2.8848067 × 1012
Variance5.4763532 × 1021
MonotonicityNot monotonic
2023-12-11T02:20:59.765859image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=42)
ValueCountFrequency (%)
0 18
30.5%
144652000000 1
 
1.7%
66602836000 1
 
1.7%
37017918000 1
 
1.7%
413030000000 1
 
1.7%
10382138000 1
 
1.7%
117074000000 1
 
1.7%
33951687000 1
 
1.7%
23799908000 1
 
1.7%
17937813000 1
 
1.7%
Other values (32) 32
54.2%
ValueCountFrequency (%)
0 18
30.5%
2382596000 1
 
1.7%
8975520000 1
 
1.7%
9573594000 1
 
1.7%
10382138000 1
 
1.7%
10463821000 1
 
1.7%
10672476000 1
 
1.7%
13223527000 1
 
1.7%
13780335000 1
 
1.7%
15641788000 1
 
1.7%
ValueCountFrequency (%)
413030000000 1
1.7%
267823000000 1
1.7%
156197000000 1
1.7%
144652000000 1
1.7%
141427000000 1
1.7%
130688000000 1
1.7%
128547000000 1
1.7%
126878000000 1
1.7%
121083000000 1
1.7%
117074000000 1
1.7%

비과세건수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct36
Distinct (%)61.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11356.915
Minimum0
Maximum64359
Zeros22
Zeros (%)37.3%
Negative0
Negative (%)0.0%
Memory size663.0 B
2023-12-11T02:21:00.046089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median1881
Q314702
95-th percentile49667
Maximum64359
Range64359
Interquartile range (IQR)14702

Descriptive statistics

Standard deviation17753.003
Coefficient of variation (CV)1.5631888
Kurtosis1.7439099
Mean11356.915
Median Absolute Deviation (MAD)1881
Skewness1.6954733
Sum670058
Variance3.1516911 × 108
MonotonicityNot monotonic
2023-12-11T02:21:00.301337image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=36)
ValueCountFrequency (%)
0 22
37.3%
9 2
 
3.4%
1 2
 
3.4%
13540 1
 
1.7%
2079 1
 
1.7%
15094 1
 
1.7%
14455 1
 
1.7%
60254 1
 
1.7%
47764 1
 
1.7%
6631 1
 
1.7%
Other values (26) 26
44.1%
ValueCountFrequency (%)
0 22
37.3%
1 2
 
3.4%
2 1
 
1.7%
5 1
 
1.7%
7 1
 
1.7%
9 2
 
3.4%
1881 1
 
1.7%
2079 1
 
1.7%
2254 1
 
1.7%
3037 1
 
1.7%
ValueCountFrequency (%)
64359 1
1.7%
60254 1
1.7%
53708 1
1.7%
49218 1
1.7%
47764 1
1.7%
47593 1
1.7%
44922 1
1.7%
41776 1
1.7%
37487 1
1.7%
21755 1
1.7%

비과세금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct35
Distinct (%)59.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.1074391 × 109
Minimum0
Maximum5.4181463 × 1010
Zeros25
Zeros (%)42.4%
Negative0
Negative (%)0.0%
Memory size663.0 B
2023-12-11T02:21:00.594244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median2.97973 × 108
Q31.9994035 × 109
95-th percentile4.071225 × 1010
Maximum5.4181463 × 1010
Range5.4181463 × 1010
Interquartile range (IQR)1.9994035 × 109

Descriptive statistics

Standard deviation1.3479606 × 1010
Coefficient of variation (CV)2.2070799
Kurtosis4.5716209
Mean6.1074391 × 109
Median Absolute Deviation (MAD)2.97973 × 108
Skewness2.3663864
Sum3.603389 × 1011
Variance1.8169978 × 1020
MonotonicityNot monotonic
2023-12-11T02:21:00.937877image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=35)
ValueCountFrequency (%)
0 25
42.4%
22254713000 1
 
1.7%
977891000 1
 
1.7%
16919248000 1
 
1.7%
3091830000 1
 
1.7%
1006573000 1
 
1.7%
48253426000 1
 
1.7%
1926891000 1
 
1.7%
301035000 1
 
1.7%
19695467000 1
 
1.7%
Other values (25) 25
42.4%
ValueCountFrequency (%)
0 25
42.4%
1000 1
 
1.7%
683000 1
 
1.7%
14273000 1
 
1.7%
273017000 1
 
1.7%
297973000 1
 
1.7%
301035000 1
 
1.7%
303944000 1
 
1.7%
400394000 1
 
1.7%
961889000 1
 
1.7%
ValueCountFrequency (%)
54181463000 1
1.7%
48253426000 1
1.7%
44687492000 1
1.7%
40270556000 1
1.7%
35664000000 1
1.7%
28582176000 1
1.7%
23533099000 1
1.7%
22254713000 1
1.7%
19695467000 1
1.7%
16919248000 1
1.7%

Interactions

2023-12-11T02:20:54.893722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:20:52.499534image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:20:53.252122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:20:54.070652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:20:55.085011image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:20:52.673018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:20:53.458648image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:20:54.270349image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:20:55.265334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:20:52.861267image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:20:53.662865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:20:54.474928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:20:55.465062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:20:53.061844image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:20:53.861688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:20:54.693182image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T02:21:01.140438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과세년도세목명과세건수과세금액비과세건수비과세금액
과세년도1.0000.0000.0000.0000.0000.000
세목명0.0001.0001.0000.7400.7460.597
과세건수0.0001.0001.0000.4140.6330.272
과세금액0.0000.7400.4141.0000.1220.793
비과세건수0.0000.7460.6330.1221.0000.741
비과세금액0.0000.5970.2720.7930.7411.000
2023-12-11T02:21:01.363343image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
세목명과세년도
세목명1.0000.000
과세년도0.0001.000
2023-12-11T02:21:01.574187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과세건수과세금액비과세건수비과세금액과세년도세목명
과세건수1.0000.5960.5730.3980.0000.923
과세금액0.5961.0000.5060.4810.0000.448
비과세건수0.5730.5061.0000.9240.0000.420
비과세금액0.3980.4810.9241.0000.0000.299
과세년도0.0000.0000.0000.0001.0000.000
세목명0.9230.4480.4200.2990.0001.000

Missing values

2023-12-11T02:20:55.733451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T02:20:56.052847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명자치단체코드과세년도세목명과세건수과세금액비과세건수비과세금액
0부산광역시해운대구263502017취득세214141446520000001354022254713000
1부산광역시해운대구263502017주민세1729308975520000111931429547000
2부산광역시해운대구263502017재산세237910856428500001896235664000000
3부산광역시해운대구263502017자동차세25259633252280000374872071916000
4부산광역시해운대구263502017레저세0000
5부산광역시해운대구263502017담배소비세0000
6부산광역시해운대구263502017지방소비세0000
7부산광역시해운대구263502017등록면허세119382132235270003037303944000
8부산광역시해운대구263502017도시계획세0000
9부산광역시해운대구263502017지역자원시설세3004001564178800078331026445000
시도명시군구명자치단체코드과세년도세목명과세건수과세금액비과세건수비과세금액
49부산광역시해운대구263502021재산세2670801306880000006435954181463000
50부산광역시해운대구263502021자동차세24915934414180000492182116754000
51부산광역시해운대구263502021레저세0000
52부산광역시해운대구263502021담배소비세0000
53부산광역시해운대구263502021지방소비세4238259600000
54부산광역시해운대구263502021등록면허세128761182970830008480400394000
55부산광역시해운대구263502021도시계획세0000
56부산광역시해운대구263502021지역자원시설세336026182731430002254977038000
57부산광역시해운대구263502021지방소득세16582415619700000000
58부산광역시해운대구263502021교육세9021925381603900050