Overview

Dataset statistics

Number of variables8
Number of observations185
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory12.4 KiB
Average record size in memory68.7 B

Variable types

Categorical6
Numeric2

Dataset

Description부산광역시중구_세원유형별과세현황_20211231
Author부산광역시 중구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15078401

Alerts

시도명 has constant value ""Constant
시군구명 has constant value ""Constant
자치단체코드 has constant value ""Constant
부과건수 is highly overall correlated with 부과금액 and 2 other fieldsHigh correlation
부과금액 is highly overall correlated with 부과건수High correlation
세목명 is highly overall correlated with 부과건수 and 1 other fieldsHigh correlation
세원 유형명 is highly overall correlated with 부과건수 and 1 other fieldsHigh correlation
부과건수 has 44 (23.8%) zerosZeros
부과금액 has 44 (23.8%) zerosZeros

Reproduction

Analysis started2023-12-10 16:53:25.406320
Analysis finished2023-12-10 16:53:26.320023
Duration0.91 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
부산광역시
185 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부산광역시
2nd row부산광역시
3rd row부산광역시
4th row부산광역시
5th row부산광역시

Common Values

ValueCountFrequency (%)
부산광역시 185
100.0%

Length

2023-12-11T01:53:26.402938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:53:26.516243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부산광역시 185
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
중구
185 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row중구
2nd row중구
3rd row중구
4th row중구
5th row중구

Common Values

ValueCountFrequency (%)
중구 185
100.0%

Length

2023-12-11T01:53:26.618084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:53:26.727755image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
중구 185
100.0%

자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
26110
185 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row26110
2nd row26110
3rd row26110
4th row26110
5th row26110

Common Values

ValueCountFrequency (%)
26110 185
100.0%

Length

2023-12-11T01:53:26.837072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:53:26.929599image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
26110 185
100.0%

과세년도
Categorical

Distinct4
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2018
47 
2019
47 
2020
47 
2021
44 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2018
2nd row2018
3rd row2018
4th row2018
5th row2018

Common Values

ValueCountFrequency (%)
2018 47
25.4%
2019 47
25.4%
2020 47
25.4%
2021 44
23.8%

Length

2023-12-11T01:53:27.020438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:53:27.139727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2018 47
25.4%
2019 47
25.4%
2020 47
25.4%
2021 44
23.8%

세목명
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)7.0%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
취득세
36 
주민세
34 
자동차세
28 
재산세
20 
레저세
16 
Other values (8)
51 

Length

Max length7
Median length3
Mean length3.6918919
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row지방소비세
2nd row교육세
3rd row도시계획세
4th row취득세
5th row취득세

Common Values

ValueCountFrequency (%)
취득세 36
19.5%
주민세 34
18.4%
자동차세 28
15.1%
재산세 20
10.8%
레저세 16
8.6%
지방소득세 16
8.6%
지역자원시설세 9
 
4.9%
등록면허세 8
 
4.3%
지방소비세 4
 
2.2%
교육세 4
 
2.2%
Other values (3) 10
 
5.4%

Length

2023-12-11T01:53:27.255637image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
취득세 36
19.5%
주민세 34
18.4%
자동차세 28
15.1%
재산세 20
10.8%
레저세 16
8.6%
지방소득세 16
8.6%
지역자원시설세 9
 
4.9%
등록면허세 8
 
4.3%
지방소비세 4
 
2.2%
교육세 4
 
2.2%
Other values (3) 10
 
5.4%

세원 유형명
Categorical

HIGH CORRELATION 

Distinct50
Distinct (%)27.0%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
지방소비세
 
4
선박
 
4
재산세(토지)
 
4
건축물
 
4
경마
 
4
Other values (45)
165 

Length

Max length11
Median length8
Mean length6.0486486
Min length2

Unique

Unique3 ?
Unique (%)1.6%

Sample

1st row지방소비세
2nd row교육세
3rd row도시계획세
4th row건축물
5th row주택(개별)

Common Values

ValueCountFrequency (%)
지방소비세 4
 
2.2%
선박 4
 
2.2%
재산세(토지) 4
 
2.2%
건축물 4
 
2.2%
경마 4
 
2.2%
주택(단독) 4
 
2.2%
기타 4
 
2.2%
항공기 4
 
2.2%
기계장비 4
 
2.2%
차량 4
 
2.2%
Other values (40) 145
78.4%

Length

2023-12-11T01:53:27.378502image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
지방소비세 4
 
2.2%
경륜 4
 
2.2%
지방소득세(특별징수 4
 
2.2%
교육세 4
 
2.2%
선박 4
 
2.2%
주민세(법인세분 4
 
2.2%
주민세(양도소득 4
 
2.2%
체납 4
 
2.2%
소싸움 4
 
2.2%
경정 4
 
2.2%
Other values (40) 145
78.4%

부과건수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct138
Distinct (%)74.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8073.1405
Minimum0
Maximum118398
Zeros44
Zeros (%)23.8%
Negative0
Negative (%)0.0%
Memory size1.8 KiB
2023-12-11T01:53:27.506283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q12
median538
Q36364
95-th percentile32884.2
Maximum118398
Range118398
Interquartile range (IQR)6362

Descriptive statistics

Standard deviation19459.69
Coefficient of variation (CV)2.4104238
Kurtosis19.136323
Mean8073.1405
Median Absolute Deviation (MAD)538
Skewness4.1435015
Sum1493531
Variance3.7867954 × 108
MonotonicityNot monotonic
2023-12-11T01:53:27.657600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 44
 
23.8%
360 2
 
1.1%
1 2
 
1.1%
12 2
 
1.1%
7 2
 
1.1%
568 1
 
0.5%
226 1
 
0.5%
78 1
 
0.5%
33282 1
 
0.5%
682 1
 
0.5%
Other values (128) 128
69.2%
ValueCountFrequency (%)
0 44
23.8%
1 2
 
1.1%
2 1
 
0.5%
3 1
 
0.5%
4 1
 
0.5%
6 1
 
0.5%
7 2
 
1.1%
9 1
 
0.5%
11 1
 
0.5%
12 2
 
1.1%
ValueCountFrequency (%)
118398 1
0.5%
116213 1
0.5%
114457 1
0.5%
113579 1
0.5%
61865 1
0.5%
61531 1
0.5%
58254 1
0.5%
53514 1
0.5%
35365 1
0.5%
33282 1
0.5%

부과금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct142
Distinct (%)76.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.2752811 × 109
Minimum0
Maximum2.2056792 × 1010
Zeros44
Zeros (%)23.8%
Negative0
Negative (%)0.0%
Memory size1.8 KiB
2023-12-11T01:53:27.834018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11294000
median3.46018 × 108
Q32.70185 × 109
95-th percentile1.4032385 × 1010
Maximum2.2056792 × 1010
Range2.2056792 × 1010
Interquartile range (IQR)2.700556 × 109

Descriptive statistics

Standard deviation4.1569386 × 109
Coefficient of variation (CV)1.827
Kurtosis7.1295193
Mean2.2752811 × 109
Median Absolute Deviation (MAD)3.46018 × 108
Skewness2.6770414
Sum4.20927 × 1011
Variance1.7280138 × 1019
MonotonicityNot monotonic
2023-12-11T01:53:27.989024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 44
 
23.8%
15596237000 1
 
0.5%
3063825000 1
 
0.5%
341719000 1
 
0.5%
393301000 1
 
0.5%
182900000 1
 
0.5%
114056000 1
 
0.5%
367610000 1
 
0.5%
14968945000 1
 
0.5%
2363717000 1
 
0.5%
Other values (132) 132
71.4%
ValueCountFrequency (%)
0 44
23.8%
616000 1
 
0.5%
789000 1
 
0.5%
1294000 1
 
0.5%
2254000 1
 
0.5%
2385000 1
 
0.5%
2438000 1
 
0.5%
2445000 1
 
0.5%
2927000 1
 
0.5%
3722000 1
 
0.5%
ValueCountFrequency (%)
22056792000 1
0.5%
19485518000 1
0.5%
17801027000 1
0.5%
17253475000 1
0.5%
15596237000 1
0.5%
15284871000 1
0.5%
15047379000 1
0.5%
14968945000 1
0.5%
14859505000 1
0.5%
14236867000 1
0.5%

Interactions

2023-12-11T01:53:25.893856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:53:25.702646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:53:25.983379image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:53:25.799969image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T01:53:28.072947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과세년도세목명세원 유형명부과건수부과금액
과세년도1.0000.0000.0000.0000.000
세목명0.0001.0001.0000.8940.578
세원 유형명0.0001.0001.0000.9730.863
부과건수0.0000.8940.9731.0000.589
부과금액0.0000.5780.8630.5891.000
2023-12-11T01:53:28.182597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
세원 유형명세목명과세년도
세원 유형명1.0000.8860.000
세목명0.8861.0000.000
과세년도0.0000.0001.000
2023-12-11T01:53:28.292001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
부과건수부과금액과세년도세목명세원 유형명
부과건수1.0000.7740.0000.7030.735
부과금액0.7741.0000.0000.2800.420
과세년도0.0000.0001.0000.0000.000
세목명0.7030.2800.0001.0000.886
세원 유형명0.7350.4200.0000.8861.000

Missing values

2023-12-11T01:53:26.113171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:53:26.264535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명자치단체코드과세년도세목명세원 유형명부과건수부과금액
0부산광역시중구261102018지방소비세지방소비세00
1부산광역시중구261102018교육세교육세1144575780675000
2부산광역시중구261102018도시계획세도시계획세00
3부산광역시중구261102018취득세건축물3863220758000
4부산광역시중구261102018취득세주택(개별)2711720146000
5부산광역시중구261102018취득세주택(단독)4991150973000
6부산광역시중구261102018취득세기타34237717000
7부산광역시중구261102018취득세항공기00
8부산광역시중구261102018취득세기계장비2789000
9부산광역시중구261102018취득세차량520106167000
시도명시군구명자치단체코드과세년도세목명세원 유형명부과건수부과금액
175부산광역시중구261102021지방소득세지방소득세(특별징수)2696617801027000
176부산광역시중구261102021지방소득세지방소득세(법인소득)245817253475000
177부산광역시중구261102021지방소득세지방소득세(양도소득)5383890833000
178부산광역시중구261102021지방소득세지방소득세(종합소득)6115869410000
179부산광역시중구261102021등록면허세등록면허세(면허)14473589682000
180부산광역시중구261102021등록면허세등록면허세(등록)150062563548000
181부산광역시중구261102021지역자원시설세지역자원시설세(소방)353653571593000
182부산광역시중구261102021지역자원시설세지역자원시설세(시설)00
183부산광역시중구261102021지역자원시설세지역자원시설세(특자)734873000
184부산광역시중구261102021체납체납535142667684000