Overview

Dataset statistics

Number of variables8
Number of observations231
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory15.5 KiB
Average record size in memory68.6 B

Variable types

Categorical6
Numeric2

Dataset

Description부산광역시 중구 세원유형별 과세현황에 관한 자료 (시도명, 시군구명, 자치단체코드, 과세년도, 세목명, 세원유형명, 부과건수, 부과금액 포함)
Author부산광역시 중구
URLhttps://www.data.go.kr/data/15078401/fileData.do

Alerts

시도명 has constant value ""Constant
시군구명 has constant value ""Constant
자치단체코드 has constant value ""Constant
세원 유형명 is highly overall correlated with 부과건수 and 1 other fieldsHigh correlation
세목명 is highly overall correlated with 부과건수 and 1 other fieldsHigh correlation
부과건수 is highly overall correlated with 부과금액 and 2 other fieldsHigh correlation
부과금액 is highly overall correlated with 부과건수High correlation
부과건수 has 55 (23.8%) zerosZeros
부과금액 has 56 (24.2%) zerosZeros

Reproduction

Analysis started2024-04-29 22:42:31.655136
Analysis finished2024-04-29 22:42:34.077967
Duration2.42 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
부산광역시
231 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부산광역시
2nd row부산광역시
3rd row부산광역시
4th row부산광역시
5th row부산광역시

Common Values

ValueCountFrequency (%)
부산광역시 231
100.0%

Length

2024-04-30T07:42:34.142417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T07:42:34.232922image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부산광역시 231
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
중구
231 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row중구
2nd row중구
3rd row중구
4th row중구
5th row중구

Common Values

ValueCountFrequency (%)
중구 231
100.0%

Length

2024-04-30T07:42:34.327290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T07:42:34.414724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
중구 231
100.0%

자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
26110
231 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row26110
2nd row26110
3rd row26110
4th row26110
5th row26110

Common Values

ValueCountFrequency (%)
26110 231
100.0%

Length

2024-04-30T07:42:34.508893image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T07:42:34.587547image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
26110 231
100.0%

과세년도
Categorical

Distinct5
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2018
47 
2019
47 
2020
47 
2022
46 
2021
44 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2018
2nd row2018
3rd row2018
4th row2018
5th row2018

Common Values

ValueCountFrequency (%)
2018 47
20.3%
2019 47
20.3%
2020 47
20.3%
2022 46
19.9%
2021 44
19.0%

Length

2024-04-30T07:42:34.678322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T07:42:34.773749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2018 47
20.3%
2019 47
20.3%
2020 47
20.3%
2022 46
19.9%
2021 44
19.0%

세목명
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)5.6%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
취득세
45 
주민세
41 
자동차세
35 
재산세
25 
레저세
20 
Other values (8)
65 

Length

Max length7
Median length3
Mean length3.7099567
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row지방소비세
2nd row교육세
3rd row도시계획세
4th row취득세
5th row취득세

Common Values

ValueCountFrequency (%)
취득세 45
19.5%
주민세 41
17.7%
자동차세 35
15.2%
재산세 25
10.8%
레저세 20
8.7%
지방소득세 20
8.7%
지역자원시설세 12
 
5.2%
등록면허세 10
 
4.3%
지방소비세 5
 
2.2%
교육세 5
 
2.2%
Other values (3) 13
 
5.6%

Length

2024-04-30T07:42:34.888770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
취득세 45
19.5%
주민세 41
17.7%
자동차세 35
15.2%
재산세 25
10.8%
레저세 20
8.7%
지방소득세 20
8.7%
지역자원시설세 12
 
5.2%
등록면허세 10
 
4.3%
지방소비세 5
 
2.2%
교육세 5
 
2.2%
Other values (3) 13
 
5.6%

세원 유형명
Categorical

HIGH CORRELATION 

Distinct50
Distinct (%)21.6%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
지방소비세
 
5
선박
 
5
재산세(토지)
 
5
건축물
 
5
경마
 
5
Other values (45)
206 

Length

Max length11
Median length8
Mean length6.04329
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row지방소비세
2nd row교육세
3rd row도시계획세
4th row건축물
5th row주택(개별)

Common Values

ValueCountFrequency (%)
지방소비세 5
 
2.2%
선박 5
 
2.2%
재산세(토지) 5
 
2.2%
건축물 5
 
2.2%
경마 5
 
2.2%
주택(단독) 5
 
2.2%
기타 5
 
2.2%
항공기 5
 
2.2%
기계장비 5
 
2.2%
차량 5
 
2.2%
Other values (40) 181
78.4%

Length

2024-04-30T07:42:35.010577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
지방소비세 5
 
2.2%
경륜 5
 
2.2%
지방소득세(특별징수 5
 
2.2%
교육세 5
 
2.2%
선박 5
 
2.2%
주민세(법인세분 5
 
2.2%
주민세(양도소득 5
 
2.2%
체납 5
 
2.2%
소싸움 5
 
2.2%
경정 5
 
2.2%
Other values (40) 181
78.4%

부과건수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct169
Distinct (%)73.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8094.4329
Minimum0
Maximum118398
Zeros55
Zeros (%)23.8%
Negative0
Negative (%)0.0%
Memory size2.2 KiB
2024-04-30T07:42:35.153465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11.5
median520
Q37339
95-th percentile34323.5
Maximum118398
Range118398
Interquartile range (IQR)7337.5

Descriptive statistics

Standard deviation19425.12
Coefficient of variation (CV)2.3998123
Kurtosis19.195206
Mean8094.4329
Median Absolute Deviation (MAD)520
Skewness4.1452066
Sum1869814
Variance3.7733528 × 108
MonotonicityNot monotonic
2024-04-30T07:42:35.301601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 55
 
23.8%
1 3
 
1.3%
3 2
 
0.9%
75 2
 
0.9%
12 2
 
0.9%
7 2
 
0.9%
360 2
 
0.9%
9 2
 
0.9%
13 1
 
0.4%
18156 1
 
0.4%
Other values (159) 159
68.8%
ValueCountFrequency (%)
0 55
23.8%
1 3
 
1.3%
2 1
 
0.4%
3 2
 
0.9%
4 1
 
0.4%
6 1
 
0.4%
7 2
 
0.9%
9 2
 
0.9%
11 1
 
0.4%
12 2
 
0.9%
ValueCountFrequency (%)
118398 1
0.4%
116734 1
0.4%
116213 1
0.4%
114457 1
0.4%
113579 1
0.4%
61865 1
0.4%
61531 1
0.4%
58254 1
0.4%
53514 1
0.4%
49722 1
0.4%

부과금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct176
Distinct (%)76.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.3499441 × 109
Minimum0
Maximum2.6144811 × 1010
Zeros56
Zeros (%)24.2%
Negative0
Negative (%)0.0%
Memory size2.2 KiB
2024-04-30T07:42:35.460389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11041500
median3.4707 × 108
Q32.684767 × 109
95-th percentile1.4548186 × 1010
Maximum2.6144811 × 1010
Range2.6144811 × 1010
Interquartile range (IQR)2.6837255 × 109

Descriptive statistics

Standard deviation4.3832169 × 109
Coefficient of variation (CV)1.865243
Kurtosis8.5254539
Mean2.3499441 × 109
Median Absolute Deviation (MAD)3.4707 × 108
Skewness2.8450114
Sum5.4283709 × 1011
Variance1.921259 × 1019
MonotonicityNot monotonic
2024-04-30T07:42:35.601300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 56
 
24.2%
3722000 1
 
0.4%
5298000 1
 
0.4%
107006000 1
 
0.4%
2044413000 1
 
0.4%
9498440000 1
 
0.4%
1857253000 1
 
0.4%
15284871000 1
 
0.4%
677503000 1
 
0.4%
4396711000 1
 
0.4%
Other values (166) 166
71.9%
ValueCountFrequency (%)
0 56
24.2%
616000 1
 
0.4%
789000 1
 
0.4%
1294000 1
 
0.4%
2254000 1
 
0.4%
2385000 1
 
0.4%
2438000 1
 
0.4%
2445000 1
 
0.4%
2927000 1
 
0.4%
2942000 1
 
0.4%
ValueCountFrequency (%)
26144811000 1
0.4%
22056792000 1
0.4%
19485518000 1
0.4%
18166032000 1
0.4%
17801027000 1
0.4%
17253475000 1
0.4%
16467345000 1
0.4%
15596237000 1
0.4%
15284871000 1
0.4%
15047379000 1
0.4%

Interactions

2024-04-30T07:42:33.526005image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T07:42:33.167973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T07:42:33.728462image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T07:42:33.442466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-30T07:42:35.696483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과세년도세목명세원 유형명부과건수부과금액
과세년도1.0000.0000.0000.0000.000
세목명0.0001.0001.0000.8740.591
세원 유형명0.0001.0001.0000.9550.866
부과건수0.0000.8740.9551.0000.574
부과금액0.0000.5910.8660.5741.000
2024-04-30T07:42:35.815117image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과세년도세원 유형명세목명
과세년도1.0000.0000.000
세원 유형명0.0001.0000.911
세목명0.0000.9111.000
2024-04-30T07:42:35.908985image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
부과건수부과금액과세년도세목명세원 유형명
부과건수1.0000.7760.0000.6470.692
부과금액0.7761.0000.0000.2900.438
과세년도0.0000.0001.0000.0000.000
세목명0.6470.2900.0001.0000.911
세원 유형명0.6920.4380.0000.9111.000

Missing values

2024-04-30T07:42:33.898462image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-30T07:42:34.024704image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명자치단체코드과세년도세목명세원 유형명부과건수부과금액
0부산광역시중구261102018지방소비세지방소비세00
1부산광역시중구261102018교육세교육세1144575780675000
2부산광역시중구261102018도시계획세도시계획세00
3부산광역시중구261102018취득세건축물3863220758000
4부산광역시중구261102018취득세주택(개별)2711720146000
5부산광역시중구261102018취득세주택(단독)4991150973000
6부산광역시중구261102018취득세기타34237717000
7부산광역시중구261102018취득세항공기00
8부산광역시중구261102018취득세기계장비2789000
9부산광역시중구261102018취득세차량520106167000
시도명시군구명자치단체코드과세년도세목명세원 유형명부과건수부과금액
221부산광역시중구261102022지방소득세지방소득세(법인소득)244926144811000
222부산광역시중구261102022지방소득세지방소득세(양도소득)3891271557000
223부산광역시중구261102022지방소득세지방소득세(종합소득)8352739683000
224부산광역시중구261102022담배소비세담배소비세00
225부산광역시중구261102022등록면허세등록면허세(면허)15326603215000
226부산광역시중구261102022등록면허세등록면허세(등록)130312305088000
227부산광역시중구261102022지역자원시설세지역자원시설세(소방)361233726342000
228부산광역시중구261102022지역자원시설세지역자원시설세(시설)00
229부산광역시중구261102022지역자원시설세지역자원시설세(특자)755215000
230부산광역시중구261102022체납체납497223366391000