Overview

Dataset statistics

Number of variables9
Number of observations24
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.0 KiB
Average record size in memory83.5 B

Variable types

Categorical7
Numeric2

Dataset

Description대구광역시_지방세과세현황_20211231
Author대구광역시
URLhttp://data.daegu.go.kr/open/data/dataView.do?dataSetId=15079120&dataSetDetailId=150791201d9aebe287ba1&provdMethod=FILE

Alerts

시도명 has constant value ""Constant
시군구명 has constant value ""Constant
자치단체코드 has constant value ""Constant
비과세건수 has constant value ""Constant
비과세금액 has constant value ""Constant
과세금액 is highly overall correlated with 세목명High correlation
세목명 is highly overall correlated with 과세금액High correlation
과세건수 has 4 (16.7%) zerosZeros
과세금액 has 4 (16.7%) zerosZeros

Reproduction

Analysis started2023-12-10 19:31:35.159648
Analysis finished2023-12-10 19:31:37.755121
Duration2.6 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size324.0 B
대구광역시
24 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대구광역시
2nd row대구광역시
3rd row대구광역시
4th row대구광역시
5th row대구광역시

Common Values

ValueCountFrequency (%)
대구광역시 24
100.0%

Length

2023-12-11T04:31:37.847445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T04:31:37.980478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대구광역시 24
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size324.0 B
대구광역시
24 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대구광역시
2nd row대구광역시
3rd row대구광역시
4th row대구광역시
5th row대구광역시

Common Values

ValueCountFrequency (%)
대구광역시 24
100.0%

Length

2023-12-11T04:31:38.120747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T04:31:38.244758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대구광역시 24
100.0%

자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size324.0 B
27000
24 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row27000
2nd row27000
3rd row27000
4th row27000
5th row27000

Common Values

ValueCountFrequency (%)
27000 24
100.0%

Length

2023-12-11T04:31:38.375640image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T04:31:38.512330image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
27000 24
100.0%

과세년도
Categorical

Distinct5
Distinct (%)20.8%
Missing0
Missing (%)0.0%
Memory size324.0 B
2017
2018
2019
2020
2021

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2017
2nd row2017
3rd row2017
4th row2017
5th row2017

Common Values

ValueCountFrequency (%)
2017 6
25.0%
2018 6
25.0%
2019 4
16.7%
2020 4
16.7%
2021 4
16.7%

Length

2023-12-11T04:31:38.639599image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T04:31:38.750806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2017 6
25.0%
2018 6
25.0%
2019 4
16.7%
2020 4
16.7%
2021 4
16.7%

세목명
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)25.0%
Missing0
Missing (%)0.0%
Memory size324.0 B
자동차세
담배소비세
지방소비세
교육세
취득세

Length

Max length5
Median length4.5
Mean length4.2083333
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row취득세
2nd row자동차세
3rd row담배소비세
4th row지방소비세
5th row등록면허세

Common Values

ValueCountFrequency (%)
자동차세 5
20.8%
담배소비세 5
20.8%
지방소비세 5
20.8%
교육세 5
20.8%
취득세 2
 
8.3%
등록면허세 2
 
8.3%

Length

2023-12-11T04:31:38.908565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T04:31:39.072662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
자동차세 5
20.8%
담배소비세 5
20.8%
지방소비세 5
20.8%
교육세 5
20.8%
취득세 2
 
8.3%
등록면허세 2
 
8.3%

과세건수
Real number (ℝ)

ZEROS 

Distinct13
Distinct (%)54.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean94.541667
Minimum0
Maximum482
Zeros4
Zeros (%)16.7%
Negative0
Negative (%)0.0%
Memory size348.0 B
2023-12-11T04:31:39.212800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q18
median12
Q3108.75
95-th percentile452.15
Maximum482
Range482
Interquartile range (IQR)100.75

Descriptive statistics

Standard deviation144.14485
Coefficient of variation (CV)1.5246701
Kurtosis2.8853374
Mean94.541667
Median Absolute Deviation (MAD)12
Skewness1.8988878
Sum2269
Variance20777.737
MonotonicityNot monotonic
2023-12-11T04:31:39.397587image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%)
12 5
20.8%
0 4
16.7%
111 2
 
8.3%
8 2
 
8.3%
103 2
 
8.3%
482 2
 
8.3%
106 1
 
4.2%
108 1
 
4.2%
6 1
 
4.2%
283 1
 
4.2%
Other values (3) 3
12.5%
ValueCountFrequency (%)
0 4
16.7%
6 1
 
4.2%
8 2
 
8.3%
9 1
 
4.2%
10 1
 
4.2%
12 5
20.8%
103 2
 
8.3%
106 1
 
4.2%
108 1
 
4.2%
111 2
 
8.3%
ValueCountFrequency (%)
482 2
 
8.3%
283 1
 
4.2%
279 1
 
4.2%
111 2
 
8.3%
108 1
 
4.2%
106 1
 
4.2%
103 2
 
8.3%
12 5
20.8%
10 1
 
4.2%
9 1
 
4.2%

과세금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct21
Distinct (%)87.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.7882596 × 1011
Minimum0
Maximum7.59788 × 1011
Zeros4
Zeros (%)16.7%
Negative0
Negative (%)0.0%
Memory size348.0 B
2023-12-11T04:31:39.540570image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q15.9523064 × 1010
median1.3494365 × 1011
Q31.7454859 × 1011
95-th percentile5.5990267 × 1011
Maximum7.59788 × 1011
Range7.59788 × 1011
Interquartile range (IQR)1.1502553 × 1011

Descriptive statistics

Standard deviation1.9197548 × 1011
Coefficient of variation (CV)1.0735325
Kurtosis2.9045691
Mean1.7882596 × 1011
Median Absolute Deviation (MAD)7.4807534 × 1010
Skewness1.7426214
Sum4.2918229 × 1012
Variance3.6854587 × 1022
MonotonicityNot monotonic
2023-12-11T04:31:39.688273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=21)
ValueCountFrequency (%)
0 4
 
16.7%
179811406000 1
 
4.2%
59684587000 1
 
4.2%
759788000000 1
 
4.2%
135678000000 1
 
4.2%
160127000000 1
 
4.2%
60587644000 1
 
4.2%
415799661000 1
 
4.2%
137730474000 1
 
4.2%
129130280000 1
 
4.2%
Other values (11) 11
45.8%
ValueCountFrequency (%)
0 4
16.7%
57362197000 1
 
4.2%
59038497000 1
 
4.2%
59684587000 1
 
4.2%
60587644000 1
 
4.2%
61919886000 1
 
4.2%
129130280000 1
 
4.2%
130398468000 1
 
4.2%
134209300000 1
 
4.2%
135678000000 1
 
4.2%
ValueCountFrequency (%)
759788000000 1
4.2%
585332609000 1
4.2%
415799661000 1
4.2%
376652038000 1
4.2%
374548608000 1
4.2%
179811406000 1
4.2%
172794323000 1
4.2%
160127000000 1
4.2%
157603922000 1
4.2%
143626048000 1
4.2%

비과세건수
Categorical

CONSTANT 

Distinct1
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size324.0 B
0
24 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 24
100.0%

Length

2023-12-11T04:31:39.865217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T04:31:39.992965image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 24
100.0%

비과세금액
Categorical

CONSTANT 

Distinct1
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size324.0 B
0
24 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 24
100.0%

Length

2023-12-11T04:31:40.101184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T04:31:40.198153image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 24
100.0%

Interactions

2023-12-11T04:31:37.195455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T04:31:36.874774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T04:31:37.314753image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T04:31:37.053002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T04:31:40.265003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과세년도세목명과세건수과세금액
과세년도1.0000.0000.5120.000
세목명0.0001.0000.5620.785
과세건수0.5120.5621.0000.000
과세금액0.0000.7850.0001.000
2023-12-11T04:31:40.402799image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과세년도세목명
과세년도1.0000.000
세목명0.0001.000
2023-12-11T04:31:40.514290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과세건수과세금액과세년도세목명
과세건수1.000-0.0150.4210.365
과세금액-0.0151.0000.0000.590
과세년도0.4210.0001.0000.000
세목명0.3650.5900.0001.000

Missing values

2023-12-11T04:31:37.481576image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T04:31:37.675394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명자치단체코드과세년도세목명과세건수과세금액비과세건수비과세금액
0대구광역시대구광역시270002017취득세0000
1대구광역시대구광역시270002017자동차세1217981140600000
2대구광역시대구광역시270002017담배소비세11114362604800000
3대구광역시대구광역시270002017지방소비세837665203800000
4대구광역시대구광역시270002017등록면허세0000
5대구광역시대구광역시270002017교육세1116191988600000
6대구광역시대구광역시270002018취득세0000
7대구광역시대구광역시270002018자동차세1217279432300000
8대구광역시대구광역시270002018담배소비세10613420930000000
9대구광역시대구광역시270002018지방소비세837454860800000
시도명시군구명자치단체코드과세년도세목명과세건수과세금액비과세건수비과세금액
14대구광역시대구광역시270002019지방소비세658533260900000
15대구광역시대구광역시270002019교육세1035736219700000
16대구광역시대구광역시270002020자동차세1212913028000000
17대구광역시대구광역시270002020담배소비세28313773047400000
18대구광역시대구광역시270002020지방소비세941579966100000
19대구광역시대구광역시270002020교육세2796058764400000
20대구광역시대구광역시270002021자동차세1216012700000000
21대구광역시대구광역시270002021담배소비세48213567800000000
22대구광역시대구광역시270002021지방소비세1075978800000000
23대구광역시대구광역시270002021교육세4825968458700000