Overview

Dataset statistics

Number of variables: 8
Number of observations: 97
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 6.5 KiB
Average record size in memory: 68.4 B

Variable types

Categorical: 6
Boolean: 1
Numeric: 1

Dataset

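Description: Busan Metropolitan City Yeonje-gu, local tax taxpayer status, 20191231 (부산광역시연제구_지방세납세자현황_20191231)
Author: Yeonje-gu, Busan Metropolitan City (부산광역시 연제구)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15079343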

Alerts

시도명 has constant value "부산광역시" (Constant)
시군구명 has constant value "연제구" (Constant)
자치단체코드 has constant value "26470" (Constant)
납세자수 is highly overall correlated with 납세자유형 (High correlation)
납세자유형 is highly overall correlated with 납세자수 (High correlation)

Reproduction

Analysis started: 2023-12-10 16:50:04.747725
Analysis finished: 2023-12-10 16:50:08.246975
Duration: 3.5 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
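
A report of this kind could be regenerated roughly as follows. This is a minimal sketch, not the configuration actually used (that is in config.json): the CSV file name, the cp949 encoding, the report title, and the output path are all assumptions, and only the dataset metadata is taken from this report. It assumes pandas and ydata-profiling are installed.

```python
# Hypothetical input path and encoding; neither appears in the report.
CSV_PATH = "yeonje_local_tax_taxpayers_20191231.csv"
CSV_ENCODING = "cp949"

# Dataset metadata copied from the "Dataset" section of this report.
DATASET_META = {
    "description": "부산광역시연제구_지방세납세자현황_20191231",
    "author": "부산광역시 연제구",
    "url": "http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15079343",
}


def build_report(csv_path=CSV_PATH, encoding=CSV_ENCODING, out_path="report.html"):
    """Profile the CSV and write an HTML report; returns the output path."""
    # Imported lazily so this module loads even where the packages are absent.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path, encoding=encoding)
    # The title is a placeholder; the original report title is not known.
    report = ProfileReport(df, title="Profiling Report", dataset=DATASET_META)
    report.to_file(out_path)
    return out_path
```

Calling build_report() with the real file path would write the HTML report; timings and memory figures will differ from the ones listed above.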

Variables

시도명 (province/city name)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 908.0 B
부산광역시: 97

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 부산광역시
2nd row: 부산광역시
3rd row: 부산광역시
4th row: 부산광역시
5th row: 부산광역시

Common Values

Value | Count | Frequency (%)
부산광역시 | 97 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

시군구명 (city/county/district name)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 908.0 B
연제구: 97

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 연제구
2nd row: 연제구
3rd row: 연제구
4th row: 연제구
5th row: 연제구

Common Values

Value | Count | Frequency (%)
연제구 | 97 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

자치단체코드 (local government code)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 908.0 B
26470: 97

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 26470
2nd row: 26470
3rd row: 26470
4th row: 26470
5th row: 26470

Common Values

Value | Count | Frequency (%)
26470 | 97 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

과세년도 (tax year)
Categorical

Distinct: 3
Distinct (%): 3.1%
Missing: 0
Missing (%): 0.0%
Memory size: 908.0 B
2017: 33
2018: 32
2019: 32

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2017
2nd row: 2017
3rd row: 2017
4th row: 2017
5th row: 2017

Common Values

Value | Count | Frequency (%)
2017 | 33 | 34.0%
2018 | 32 | 33.0%
2019 | 32 | 33.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

세목명 (tax item name)
Categorical

Distinct: 9
Distinct (%): 9.3%
Missing: 0
Missing (%): 0.0%
Memory size: 908.0 B
재산세: 12
주민세: 12
취득세: 12
자동차세: 12
등록면허세: 12
Other values (4): 37

Length

Max length: 7
Median length: 3
Mean length: 4.1134021
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 등록세
2nd row: 등록세
3rd row: 등록세
4th row: 레저세
5th row: 레저세

Common Values

Value | Count | Frequency (%)
재산세 (property tax) | 12 | 12.4%
주민세 (resident tax) | 12 | 12.4%
취득세 (acquisition tax) | 12 | 12.4%
자동차세 (automobile tax) | 12 | 12.4%
등록면허세 (registration and license tax) | 12 | 12.4%
지방소득세 (local income tax) | 12 | 12.4%
지역자원시설세 (local resource and facility tax) | 12 | 12.4%
등록세 (registration tax) | 7 | 7.2%
레저세 (leisure tax) | 6 | 6.2%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]
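
The frequency percentages and length statistics reported for 세목명 follow directly from the value counts. The sketch below recomputes them with the Python standard library; the function names are illustrative, and it assumes lengths are counted in characters (one per Hangul syllable), which matches the figures in this section.

```python
from statistics import mean, median

# Counts copied from the Common Values table for 세목명.
COUNTS = {
    "재산세": 12, "주민세": 12, "취득세": 12, "자동차세": 12,
    "등록면허세": 12, "지방소득세": 12, "지역자원시설세": 12,
    "등록세": 7, "레저세": 6,
}


def frequency_percent(counts):
    """Share of each value among all observations, rounded to one decimal."""
    total = sum(counts.values())
    return {value: round(100 * n / total, 1) for value, n in counts.items()}


def length_stats(counts):
    """Length statistics over every observation, not over distinct values."""
    lengths = [len(value) for value, n in counts.items() for _ in range(n)]
    return {
        "max": max(lengths),
        "median": median(lengths),
        "mean": mean(lengths),
        "min": min(lengths),
    }


freq = frequency_percent(COUNTS)
stats = length_stats(COUNTS)
print(freq["재산세"], freq["등록세"], freq["레저세"])
print(stats)
```

The mean length is weighted by count, which is why it is not simply the average of the nine distinct name lengths.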

납세자유형 (taxpayer type)
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): 2.1%
Missing: 0
Missing (%): 0.0%
Memory size: 908.0 B
개인: 51
법인: 46

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 개인
2nd row: 개인
3rd row: 법인
4th row: 개인
5th row: 법인

Common Values

Value | Count | Frequency (%)
개인 (individual) | 51 | 52.6%
법인 (corporation) | 46 | 47.4%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

관내_관외 (within/outside jurisdiction)
Boolean

Distinct: 2
Distinct (%): 2.1%
Missing: 0
Missing (%): 0.0%
Memory size: 229.0 B
False: 52
True: 45

Value | Count | Frequency (%)
False | 52 | 53.6%
True | 45 | 46.4%

[Figure: common values plot]

납세자수 (number of taxpayers)
Real number (ℝ)

HIGH CORRELATION

Distinct: 89
Distinct (%): 91.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 9548.1546
Minimum: 1
Maximum: 73925
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1005.0 B

Quantile statistics

Minimum: 1
5-th percentile: 2
Q1: 69
Median: 1682
Q3: 12373
95-th percentile: 51134.6
Maximum: 73925
Range: 73924
Interquartile range (IQR): 12304

Descriptive statistics

Standard deviation: 16952.496
Coefficient of variation (CV): 1.7754735
Kurtosis: 4.588203
Mean: 9548.1546
Median Absolute Deviation (MAD): 1659
Skewness: 2.2669801
Sum: 926171
Variance: 2.8738711 × 10^8
Monotonicity: Not monotonic
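
Several of the figures above are derived from one another: the mean is the sum divided by the number of observations, the CV is the standard deviation divided by the mean, the variance is the squared standard deviation, and the IQR and range are differences of quantiles. The following sketch checks those relationships using only values printed in this report; the variable and function names are illustrative.

```python
import math

# Values copied from the 납세자수 statistics in this report.
N = 97                 # number of observations
TOTAL = 926171         # Sum
MEAN = 9548.1546
STD = 16952.496
Q1, Q3 = 69, 12373
MINIMUM, MAXIMUM = 1, 73925


def derived_statistics():
    """Recompute the statistics that follow from the reported base values."""
    return {
        "mean": TOTAL / N,
        "cv": STD / MEAN,
        "variance": STD ** 2,
        "iqr": Q3 - Q1,
        "range": MAXIMUM - MINIMUM,
    }


derived = derived_statistics()
for name, value in derived.items():
    print(f"{name}: {value:.8g}")
```

Small differences in the last digits are expected, because the inputs are themselves rounded as displayed.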
[Figure: Histogram with fixed size bins (bins=50)]

Common values

Value | Count | Frequency (%)
1 | 4 | 4.1%
869 | 2 | 2.1%
69 | 2 | 2.1%
23 | 2 | 2.1%
2 | 2 | 2.1%
22 | 2 | 2.1%
55 | 1 | 1.0%
9 | 1 | 1.0%
439 | 1 | 1.0%
55202 | 1 | 1.0%
Other values (79) | 79 | 81.4%

Smallest values

Value | Count | Frequency (%)
1 | 4 | 4.1%
2 | 2 | 2.1%
3 | 1 | 1.0%
7 | 1 | 1.0%
9 | 1 | 1.0%
10 | 1 | 1.0%
11 | 1 | 1.0%
15 | 1 | 1.0%
22 | 2 | 2.1%
23 | 2 | 2.1%

Largest values

Value | Count | Frequency (%)
73925 | 1 | 1.0%
69696 | 1 | 1.0%
66002 | 1 | 1.0%
55202 | 1 | 1.0%
52585 | 1 | 1.0%
50772 | 1 | 1.0%
49062 | 1 | 1.0%
45220 | 1 | 1.0%
42524 | 1 | 1.0%
28117 | 1 | 1.0%

Interactions

[Figure: interactions plot]

Correlations

[Figure: correlation heatmap]

 | 과세년도 | 세목명 | 납세자유형 | 관내_관외 | 납세자수
과세년도 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000
세목명 | 0.000 | 1.000 | 0.000 | 0.000 | 0.598
납세자유형 | 0.000 | 0.000 | 1.000 | 0.000 | 0.588
관내_관외 | 0.000 | 0.000 | 0.000 | 1.000 | 0.263
납세자수 | 0.000 | 0.598 | 0.588 | 0.263 | 1.000

[Figure: correlation heatmap]

 | 과세년도 | 세목명 | 납세자유형 | 관내_관외
과세년도 | 1.000 | 0.000 | 0.000 | 0.000
세목명 | 0.000 | 1.000 | 0.000 | 0.000
납세자유형 | 0.000 | 0.000 | 1.000 | 0.000
관내_관외 | 0.000 | 0.000 | 0.000 | 1.000

[Figure: correlation heatmap]

 | 납세자수 | 과세년도 | 세목명 | 납세자유형 | 관내_관외
납세자수 | 1.000 | 0.000 | 0.227 | 0.569 | 0.251
과세년도 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000
세목명 | 0.227 | 0.000 | 1.000 | 0.000 | 0.000
납세자유형 | 0.569 | 0.000 | 0.000 | 1.000 | 0.000
관내_관외 | 0.251 | 0.000 | 0.000 | 0.000 | 1.000

Missing values

[Figure: nullity by column] A simple visualization of nullity by column.
[Figure: nullity matrix] The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

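First rows

 | 시도명 | 시군구명 | 자치단체코드 | 과세년도 | 세목명 | 납세자유형 | 관내_관외 | 납세자수
0 | 부산광역시 | 연제구 | 26470 | 2017 | 등록세 | 개인 | N | 55
1 | 부산광역시 | 연제구 | 26470 | 2017 | 등록세 | 개인 | Y | 30
2 | 부산광역시 | 연제구 | 26470 | 2017 | 등록세 | 법인 | N | 1
3 | 부산광역시 | 연제구 | 26470 | 2017 | 레저세 | 개인 | N | 3
4 | 부산광역시 | 연제구 | 26470 | 2017 | 레저세 | 법인 | N | 1
5 | 부산광역시 | 연제구 | 26470 | 2017 | 재산세 | 개인 | N | 27747
6 | 부산광역시 | 연제구 | 26470 | 2017 | 재산세 | 개인 | Y | 50772
7 | 부산광역시 | 연제구 | 26470 | 2017 | 재산세 | 법인 | N | 372
8 | 부산광역시 | 연제구 | 26470 | 2017 | 재산세 | 법인 | Y | 462
9 | 부산광역시 | 연제구 | 26470 | 2017 | 주민세 | 개인 | N | 21552

Last rows

 | 시도명 | 시군구명 | 자치단체코드 | 과세년도 | 세목명 | 납세자유형 | 관내_관외 | 납세자수
87 | 부산광역시 | 연제구 | 26470 | 2019 | 등록면허세 | 법인 | N | 1682
88 | 부산광역시 | 연제구 | 26470 | 2019 | 등록면허세 | 법인 | Y | 2255
89 | 부산광역시 | 연제구 | 26470 | 2019 | 지방소득세 | 개인 | N | 7862
90 | 부산광역시 | 연제구 | 26470 | 2019 | 지방소득세 | 개인 | Y | 26667
91 | 부산광역시 | 연제구 | 26470 | 2019 | 지방소득세 | 법인 | N | 869
92 | 부산광역시 | 연제구 | 26470 | 2019 | 지방소득세 | 법인 | Y | 2270
93 | 부산광역시 | 연제구 | 26470 | 2019 | 지역자원시설세 | 개인 | N | 36
94 | 부산광역시 | 연제구 | 26470 | 2019 | 지역자원시설세 | 개인 | Y | 67
95 | 부산광역시 | 연제구 | 26470 | 2019 | 지역자원시설세 | 법인 | N | 7
96 | 부산광역시 | 연제구 | 26470 | 2019 | 지역자원시설세 | 법인 | Y | 22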
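
The sample rows show 관내_관외 as Y/N, while the variable is profiled as Boolean. The sketch below works on the first ten rows only (rows 0 to 9 of the sample, with the three constant columns omitted), converts the flag, and totals 납세자수 per 납세자유형. Treating Y as True is an assumption, and the function names are illustrative.

```python
from collections import defaultdict

# Rows 0-9 as displayed in the sample:
# (과세년도, 세목명, 납세자유형, 관내_관외, 납세자수)
HEAD_ROWS = [
    ("2017", "등록세", "개인", "N", 55),
    ("2017", "등록세", "개인", "Y", 30),
    ("2017", "등록세", "법인", "N", 1),
    ("2017", "레저세", "개인", "N", 3),
    ("2017", "레저세", "법인", "N", 1),
    ("2017", "재산세", "개인", "N", 27747),
    ("2017", "재산세", "개인", "Y", 50772),
    ("2017", "재산세", "법인", "N", 372),
    ("2017", "재산세", "법인", "Y", 462),
    ("2017", "주민세", "개인", "N", 21552),
]


def to_bool(flag):
    """Map the Y/N flag to a boolean (Y -> True is an assumption)."""
    if flag not in ("Y", "N"):
        raise ValueError(f"unexpected flag: {flag!r}")
    return flag == "Y"


def taxpayers_by_type(rows):
    """Sum 납세자수 for each 납세자유형 in the given rows."""
    totals = defaultdict(int)
    for _year, _tax_item, taxpayer_type, _flag, count in rows:
        totals[taxpayer_type] += count
    return dict(totals)


flags = [to_bool(row[3]) for row in HEAD_ROWS]
totals = taxpayers_by_type(HEAD_ROWS)
print(sum(flags), len(flags) - sum(flags))
print(totals)
```

These totals cover only the ten displayed rows, so they say nothing about the full 97-row dataset.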