Overview

Dataset statistics

Number of variables8
Number of observations46
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.2 KiB
Average record size in memory70.8 B

Variable types

Categorical6
Numeric2

Dataset

Description대구광역시 달서구_세원유형별 과세 현황_20221231
Author대구광역시 달서구
URLhttp://data.daegu.go.kr/open/data/dataView.do?dataSetId=15079368&dataSetDetailId=150793681d25980dee13b&provdMethod=FILE

Alerts

시도명 has constant value "대구광역시"Constant
시군구명 has constant value "달서구"Constant
자치단체코드 has constant value "27290"Constant
과세년도 has constant value "2022"Constant
세원 유형명 is highly overall correlated with 부과건수 and 2 other fieldsHigh correlation
세목명 is highly overall correlated with 부과건수 and 1 other fieldsHigh correlation
부과건수 is highly overall correlated with 부과금액 and 2 other fieldsHigh correlation
부과금액 is highly overall correlated with 부과건수 and 1 other fieldsHigh correlation
세원 유형명 has unique valuesUnique
부과건수 has 11 (23.9%) zerosZeros
부과금액 has 11 (23.9%) zerosZeros

Reproduction

Analysis started2023-07-15 13:22:33.663679
Analysis finished2023-07-15 13:22:35.244722
Duration1.58 second
Software versionpandas-profiling v3.6.6
Download configurationconfig.json

Variables

시도명
Categorical

Distinct1
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size496.0 B
대구광역시
46 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대구광역시
2nd row대구광역시
3rd row대구광역시
4th row대구광역시
5th row대구광역시

Common Values

ValueCountFrequency (%)
대구광역시 46
100.0%

Length

2023-07-15T22:22:35.339070image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-07-15T22:22:35.542995image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
ValueCountFrequency (%)
대구광역시 46
100.0%

시군구명
Categorical

Distinct1
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size496.0 B
달서구
46 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row달서구
2nd row달서구
3rd row달서구
4th row달서구
5th row달서구

Common Values

ValueCountFrequency (%)
달서구 46
100.0%

Length

2023-07-15T22:22:36.013162image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-07-15T22:22:36.213716image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
ValueCountFrequency (%)
달서구 46
100.0%
Distinct1
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size496.0 B
27290
46 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row27290
2nd row27290
3rd row27290
4th row27290
5th row27290

Common Values

ValueCountFrequency (%)
27290 46
100.0%

Length

2023-07-15T22:22:36.368386image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-07-15T22:22:36.565104image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
ValueCountFrequency (%)
27290 46
100.0%

과세년도
Categorical

Distinct1
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size496.0 B
2022
46 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022
2nd row2022
3rd row2022
4th row2022
5th row2022

Common Values

ValueCountFrequency (%)
2022 46
100.0%

Length

2023-07-15T22:22:36.706341image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-07-15T22:22:36.908054image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
ValueCountFrequency (%)
2022 46
100.0%

세목명
Categorical

Distinct13
Distinct (%)28.3%
Missing0
Missing (%)0.0%
Memory size496.0 B
취득세
자동차세
주민세
재산세
레저세
Other values (8)
14 

Length

Max length7
Median length3
Mean length3.7826087
Min length2

Unique

Unique5 ?
Unique (%)10.9%

Sample

1st row교육세
2nd row도시계획세
3rd row취득세
4th row취득세
5th row취득세

Common Values

ValueCountFrequency (%)
취득세 9
19.6%
자동차세 7
15.2%
주민세 7
15.2%
재산세 5
10.9%
레저세 4
8.7%
지방소득세 4
8.7%
지역자원시설세 3
 
6.5%
등록면허세 2
 
4.3%
교육세 1
 
2.2%
도시계획세 1
 
2.2%
Other values (3) 3
 
6.5%

Length

2023-07-15T22:22:37.073738image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
취득세 9
19.6%
자동차세 7
15.2%
주민세 7
15.2%
재산세 5
10.9%
레저세 4
8.7%
지방소득세 4
8.7%
지역자원시설세 3
 
6.5%
등록면허세 2
 
4.3%
교육세 1
 
2.2%
도시계획세 1
 
2.2%
Other values (3) 3
 
6.5%

세원 유형명
Categorical

HIGH CORRELATION  UNIQUE 

Distinct46
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size496.0 B
교육세
 
1
토지
 
1
3륜이하
 
1
건축물
 
1
주택(개별)
 
1
Other values (41)
41 

Length

Max length11
Median length8
Mean length6.0217391
Min length2

Unique

Unique46 ?
Unique (%)100.0%

Sample

1st row교육세
2nd row도시계획세
3rd row건축물
4th row주택(개별)
5th row주택(단독)

Common Values

ValueCountFrequency (%)
교육세 1
 
2.2%
토지 1
 
2.2%
3륜이하 1
 
2.2%
건축물 1
 
2.2%
주택(개별) 1
 
2.2%
주택(단독) 1
 
2.2%
기타 1
 
2.2%
항공기 1
 
2.2%
기계장비 1
 
2.2%
차량 1
 
2.2%
Other values (36) 36
78.3%

Length

2023-07-15T22:22:37.280285image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
교육세 1
 
2.2%
주민세(종합소득 1
 
2.2%
담배소비세 1
 
2.2%
기타승용 1
 
2.2%
승용 1
 
2.2%
주민세(사업소분 1
 
2.2%
주민세(개인분 1
 
2.2%
주민세(종업원분 1
 
2.2%
주민세(특별징수 1
 
2.2%
주민세(법인세분 1
 
2.2%
Other values (36) 36
78.3%

부과건수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct35
Distinct (%)76.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean67383.652
Minimum0
Maximum1116418
Zeros11
Zeros (%)23.9%
Negative0
Negative (%)0.0%
Memory size542.0 B
2023-07-15T22:22:37.448678image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q19.5
median1691.5
Q336543.5
95-th percentile335291
Maximum1116418
Range1116418
Interquartile range (IQR)36534

Descriptive statistics

Standard deviation182541.82
Coefficient of variation (CV)2.7089927
Kurtosis24.96778
Mean67383.652
Median Absolute Deviation (MAD)1691.5
Skewness4.6428017
Sum3099648
Variance3.3321516 × 1010
MonotonicityNot monotonic
2023-07-15T22:22:37.653561image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
Histogram with fixed size bins (bins=35)
ValueCountFrequency (%)
0 11
23.9%
11 2
 
4.3%
3117 1
 
2.2%
34400 1
 
2.2%
196 1
 
2.2%
32728 1
 
2.2%
191596 1
 
2.2%
34 1
 
2.2%
1618 1
 
2.2%
36 1
 
2.2%
Other values (25) 25
54.3%
ValueCountFrequency (%)
0 11
23.9%
9 1
 
2.2%
11 2
 
4.3%
34 1
 
2.2%
36 1
 
2.2%
76 1
 
2.2%
196 1
 
2.2%
351 1
 
2.2%
983 1
 
2.2%
1182 1
 
2.2%
ValueCountFrequency (%)
1116418 1
2.2%
390822 1
2.2%
367960 1
2.2%
237284 1
2.2%
197753 1
2.2%
191596 1
2.2%
114238 1
2.2%
95636 1
2.2%
87980 1
2.2%
64641 1
2.2%

부과금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct36
Distinct (%)78.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.4480937 × 1010
Minimum0
Maximum8.9361521 × 1010
Zeros11
Zeros (%)23.9%
Negative0
Negative (%)0.0%
Memory size542.0 B
2023-07-15T22:22:37.861000image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q13998000
median1.2767335 × 109
Q31.9029515 × 1010
95-th percentile5.7847333 × 1010
Maximum8.9361521 × 1010
Range8.9361521 × 1010
Interquartile range (IQR)1.9025517 × 1010

Descriptive statistics

Standard deviation2.3131641 × 1010
Coefficient of variation (CV)1.5973856
Kurtosis3.2239509
Mean1.4480937 × 1010
Median Absolute Deviation (MAD)1.2767335 × 109
Skewness1.9032624
Sum6.6612311 × 1011
Variance5.3507282 × 1020
MonotonicityNot monotonic
2023-07-15T22:22:38.043143image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
Histogram with fixed size bins (bins=36)
ValueCountFrequency (%)
0 11
23.9%
50616000 1
 
2.2%
19554748000 1
 
2.2%
8222371000 1
 
2.2%
88820456000 1
 
2.2%
540941000 1
 
2.2%
1570458000 1
 
2.2%
89361521000 1
 
2.2%
35733154000 1
 
2.2%
34086000 1
 
2.2%
Other values (26) 26
56.5%
ValueCountFrequency (%)
0 11
23.9%
2988000 1
 
2.2%
7028000 1
 
2.2%
25578000 1
 
2.2%
26724000 1
 
2.2%
34086000 1
 
2.2%
50616000 1
 
2.2%
83874000 1
 
2.2%
211491000 1
 
2.2%
279080000 1
 
2.2%
ValueCountFrequency (%)
89361521000 1
2.2%
88820456000 1
2.2%
60000389000 1
2.2%
51388165000 1
2.2%
46509926000 1
2.2%
46178162000 1
2.2%
42337821000 1
2.2%
40673682000 1
2.2%
35733154000 1
2.2%
21878155000 1
2.2%

Interactions

2023-07-15T22:22:34.412745image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-07-15T22:22:34.024009image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-07-15T22:22:34.610114image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-07-15T22:22:34.224824image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/

Correlations

2023-07-15T22:22:38.251823image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
자치단체코드과세년도부과건수부과금액
자치단체코드NaNNaNNaNNaN
과세년도NaNNaNNaNNaN
부과건수NaNNaN1.0000.334
부과금액NaNNaN0.3341.000
2023-07-15T22:22:38.493869image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
자치단체코드과세년도부과건수부과금액
자치단체코드NaNNaNNaNNaN
과세년도NaNNaNNaNNaN
부과건수NaNNaN1.0000.826
부과금액NaNNaN0.8261.000
2023-07-15T22:22:38.733026image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
자치단체코드과세년도부과건수부과금액
자치단체코드1.000NaNNaNNaN
과세년도NaN1.000NaNNaN
부과건수NaNNaN1.0000.658
부과금액NaNNaN0.6581.000
2023-07-15T22:22:38.969099image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
세목명세원 유형명부과건수부과금액
세목명1.0001.0000.8410.550
세원 유형명1.0001.0001.0001.000
부과건수0.8411.0001.0000.602
부과금액0.5501.0000.6021.000
2023-07-15T22:22:39.164628image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
세원 유형명세목명
세원 유형명1.0001.000
세목명1.0001.000
2023-07-15T22:22:39.345073image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
부과건수부과금액세목명세원 유형명
부과건수1.0000.8260.5901.000
부과금액0.8261.0000.2551.000
세목명0.5900.2551.0001.000
세원 유형명1.0001.0001.0001.000

Missing values

2023-07-15T22:22:34.867101image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
A simple visualization of nullity by column.
2023-07-15T22:22:35.137921image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명자치단체코드과세년도세목명세원 유형명부과건수부과금액
0대구광역시달서구272902022교육세교육세111641846178162000
1대구광역시달서구272902022도시계획세도시계획세00
2대구광역시달서구272902022취득세건축물176519554748000
3대구광역시달서구272902022취득세주택(개별)9838222371000
4대구광역시달서구272902022취득세주택(단독)886888820456000
5대구광역시달서구272902022취득세기타76540941000
6대구광역시달서구272902022취득세항공기00
7대구광역시달서구272902022취득세기계장비11821570458000
8대구광역시달서구272902022취득세차량5165589361521000
9대구광역시달서구272902022취득세선박367028000
시도명시군구명자치단체코드과세년도세목명세원 유형명부과건수부과금액
36대구광역시달서구272902022지방소득세지방소득세(양도소득)739121642060000
37대구광역시달서구272902022지방소득세지방소득세(종합소득)11423821878155000
38대구광역시달서구272902022지방소비세지방소비세917453817000
39대구광역시달서구272902022등록면허세등록면허세(면허)646412284693000
40대구광역시달서구272902022등록면허세등록면허세(등록)9563611759534000
41대구광역시달서구272902022지역자원시설세지역자원시설세(소방)36796013052620000
42대구광역시달서구272902022지역자원시설세지역자원시설세(시설)1125578000
43대구광역시달서구272902022지역자원시설세지역자원시설세(특자)35126724000
44대구광역시달서구272902022담배소비세담배소비세00
45대구광역시달서구272902022체납체납23728414254480000