Overview

Dataset statistics

Number of variables8
Number of observations35
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.4 KiB
Average record size in memory70.8 B

Variable types

Categorical6
Boolean1
Numeric1

Dataset

Description대구광역시 달서구_지방세 납세자 현황_20201231
Author대구광역시 달서구
URLhttp://data.daegu.go.kr/open/data/dataView.do?dataSetId=15079379&dataSetDetailId=150793791a8f1b364919e&provdMethod=FILE

Alerts

시도명 has constant value ""Constant
시군구명 has constant value ""Constant
자치단체코드 has constant value ""Constant
과세년도 has constant value ""Constant

Reproduction

Analysis started2024-04-19 05:15:59.174465
Analysis finished2024-04-19 05:15:59.675368
Duration0.5 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size412.0 B
대구광역시
35 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대구광역시
2nd row대구광역시
3rd row대구광역시
4th row대구광역시
5th row대구광역시

Common Values

ValueCountFrequency (%)
대구광역시 35
100.0%

Length

2024-04-19T14:15:59.739528image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-19T14:15:59.837020image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대구광역시 35
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size412.0 B
달서구
35 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row달서구
2nd row달서구
3rd row달서구
4th row달서구
5th row달서구

Common Values

ValueCountFrequency (%)
달서구 35
100.0%

Length

2024-04-19T14:15:59.963334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-19T14:16:00.085845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
달서구 35
100.0%

자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size412.0 B
27290
35 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row27290
2nd row27290
3rd row27290
4th row27290
5th row27290

Common Values

ValueCountFrequency (%)
27290 35
100.0%

Length

2024-04-19T14:16:00.186605image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-19T14:16:00.286678image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
27290 35
100.0%

과세년도
Categorical

CONSTANT 

Distinct1
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size412.0 B
2020
35 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020
2nd row2020
3rd row2020
4th row2020
5th row2020

Common Values

ValueCountFrequency (%)
2020 35
100.0%

Length

2024-04-19T14:16:00.393376image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-19T14:16:00.480792image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020 35
100.0%

세목명
Categorical

Distinct10
Distinct (%)28.6%
Missing0
Missing (%)0.0%
Memory size412.0 B
등록세
재산세
주민세
취득세
자동차세
Other values (5)
15 

Length

Max length7
Median length5
Mean length4.2
Min length3

Unique

Unique1 ?
Unique (%)2.9%

Sample

1st row등록세
2nd row등록세
3rd row등록세
4th row등록세
5th row재산세

Common Values

ValueCountFrequency (%)
등록세 4
11.4%
재산세 4
11.4%
주민세 4
11.4%
취득세 4
11.4%
자동차세 4
11.4%
등록면허세 4
11.4%
지방소득세 4
11.4%
지역자원시설세 4
11.4%
담배소비세 2
5.7%
지방소비세 1
 
2.9%

Length

2024-04-19T14:16:00.582263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-19T14:16:00.708396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
등록세 4
11.4%
재산세 4
11.4%
주민세 4
11.4%
취득세 4
11.4%
자동차세 4
11.4%
등록면허세 4
11.4%
지방소득세 4
11.4%
지역자원시설세 4
11.4%
담배소비세 2
5.7%
지방소비세 1
 
2.9%

납세자유형
Categorical

Distinct2
Distinct (%)5.7%
Missing0
Missing (%)0.0%
Memory size412.0 B
개인
18 
법인
17 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row개인
2nd row개인
3rd row법인
4th row법인
5th row개인

Common Values

ValueCountFrequency (%)
개인 18
51.4%
법인 17
48.6%

Length

2024-04-19T14:16:00.840276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-19T14:16:00.927897image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
개인 18
51.4%
법인 17
48.6%
Distinct2
Distinct (%)5.7%
Missing0
Missing (%)0.0%
Memory size167.0 B
True
18 
False
17 
ValueCountFrequency (%)
True 18
51.4%
False 17
48.6%
2024-04-19T14:16:01.005527image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

납세자수
Real number (ℝ)

Distinct33
Distinct (%)94.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean24692.286
Minimum1
Maximum186597
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size447.0 B
2024-04-19T14:16:01.095667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q140
median1942
Q325191.5
95-th percentile152049
Maximum186597
Range186596
Interquartile range (IQR)25151.5

Descriptive statistics

Standard deviation48140.213
Coefficient of variation (CV)1.9496054
Kurtosis5.4178865
Mean24692.286
Median Absolute Deviation (MAD)1941
Skewness2.4666463
Sum864230
Variance2.3174801 × 109
MonotonicityNot monotonic
2024-04-19T14:16:01.203983image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=33)
ValueCountFrequency (%)
1 3
 
8.6%
38 1
 
2.9%
79612 1
 
2.9%
11 1
 
2.9%
23879 1
 
2.9%
54747 1
 
2.9%
2789 1
 
2.9%
4563 1
 
2.9%
18293 1
 
2.9%
1847 1
 
2.9%
Other values (23) 23
65.7%
ValueCountFrequency (%)
1 3
8.6%
2 1
 
2.9%
11 1
 
2.9%
16 1
 
2.9%
21 1
 
2.9%
24 1
 
2.9%
38 1
 
2.9%
42 1
 
2.9%
65 1
 
2.9%
514 1
 
2.9%
ValueCountFrequency (%)
186597 1
2.9%
167365 1
2.9%
145485 1
2.9%
79612 1
2.9%
54747 1
2.9%
45933 1
2.9%
45273 1
2.9%
28239 1
2.9%
26504 1
2.9%
23879 1
2.9%

Interactions

2024-04-19T14:15:59.377110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-19T14:16:01.282118image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
세목명납세자유형관내_관외납세자수
세목명1.0000.0000.0000.000
납세자유형0.0001.0000.0000.465
관내_관외0.0000.0001.0000.202
납세자수0.0000.4650.2021.000
2024-04-19T14:16:01.362358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
납세자유형관내_관외세목명
납세자유형1.0000.0000.000
관내_관외0.0001.0000.000
세목명0.0000.0001.000
2024-04-19T14:16:01.436307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
납세자수세목명납세자유형관내_관외
납세자수1.0000.0000.4550.186
세목명0.0001.0000.0000.000
납세자유형0.4550.0001.0000.000
관내_관외0.1860.0000.0001.000

Missing values

2024-04-19T14:15:59.503677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-19T14:15:59.623591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명자치단체코드과세년도세목명납세자유형관내_관외납세자수
0대구광역시달서구272902020등록세개인N38
1대구광역시달서구272902020등록세개인Y42
2대구광역시달서구272902020등록세법인N1
3대구광역시달서구272902020등록세법인Y2
4대구광역시달서구272902020재산세개인N45273
5대구광역시달서구272902020재산세개인Y145485
6대구광역시달서구272902020재산세법인N910
7대구광역시달서구272902020재산세법인Y1706
8대구광역시달서구272902020주민세개인N28239
9대구광역시달서구272902020주민세개인Y186597
시도명시군구명자치단체코드과세년도세목명납세자유형관내_관외납세자수
25대구광역시달서구272902020등록면허세법인Y4563
26대구광역시달서구272902020지방소득세개인N18293
27대구광역시달서구272902020지방소득세개인Y79612
28대구광역시달서구272902020지방소득세법인N1847
29대구광역시달서구272902020지방소득세법인Y4982
30대구광역시달서구272902020지방소비세법인Y1
31대구광역시달서구272902020지역자원시설세개인N21
32대구광역시달서구272902020지역자원시설세개인Y65
33대구광역시달서구272902020지역자원시설세법인N16
34대구광역시달서구272902020지역자원시설세법인Y24