Overview

Dataset statistics

Number of variables: 8
Number of observations: 30
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.1 KiB
Average record size in memory: 71.4 B
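As a quick consistency check, the two memory figures can be related to each other. The sketch below assumes that the average record size is the total memory divided by the number of observations, which the report does not state explicitly. The variable names are illustrative only.

```python
# Figures copied from the Dataset statistics table.
n_variables = 8
n_observations = 30
missing_cells = 0
avg_record_bytes = 71.4  # "Average record size in memory"

# Assumption: average record size = total memory / number of observations.
total_bytes = avg_record_bytes * n_observations
total_kib = total_bytes / 1024  # 1 KiB = 1024 bytes

# Total number of cells, the denominator for "Missing cells (%)".
n_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / n_cells

print(f"total size is about {total_kib:.1f} KiB")
print(f"cells = {n_cells}, missing = {missing_pct:.1f}%")
```

Under that assumption the rounded total agrees with the 2.1 KiB shown in the table.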

Variable types

Categorical: 6
Boolean: 1
Numeric: 1

Dataset

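Description: 부산광역시수영구_지방세납세자현황_20211231 (Suyeong-gu, Busan Metropolitan City: local tax taxpayer status, as of 2021-12-31)
Author: 부산광역시 수영구 (Suyeong-gu, Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15078638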

Alerts

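시도명 (province/metropolitan city name) has constant value "부산광역시" [Constant]
시군구명 (city/county/district name) has constant value "수영구" [Constant]
자치단체코드 (local government code) has constant value "26500" [Constant]
과세년도 (tax year) has constant value "2021" [Constant]
납세자수 (number of taxpayers) is highly overall correlated with 납세자유형 (taxpayer type) [High correlation]
납세자유형 (taxpayer type) is highly overall correlated with 납세자수 (number of taxpayers) [High correlation]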
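The Constant alerts flag columns that hold a single distinct value. A minimal sketch of that check follows, using the first five rows from the report's Sample section. The function name `constant_columns` is hypothetical, and this is not ydata-profiling's own implementation.

```python
COLUMNS = ["시도명", "시군구명", "자치단체코드", "과세년도",
           "세목명", "납세자유형", "관내_관외", "납세자수"]

# First five rows, copied from the report's Sample section.
ROWS = [
    ("부산광역시", "수영구", "26500", "2021", "등록세", "법인", "Y", 1),
    ("부산광역시", "수영구", "26500", "2021", "재산세", "개인", "N", 27113),
    ("부산광역시", "수영구", "26500", "2021", "재산세", "개인", "Y", 48334),
    ("부산광역시", "수영구", "26500", "2021", "재산세", "법인", "N", 641),
    ("부산광역시", "수영구", "26500", "2021", "재산세", "법인", "Y", 495),
]


def constant_columns(columns, rows):
    """Return the names of columns that hold exactly one distinct value."""
    constant = []
    for i, name in enumerate(columns):
        distinct_values = {row[i] for row in rows}
        if len(distinct_values) == 1:
            constant.append(name)
    return constant


print(constant_columns(COLUMNS, ROWS))
```

Columns flagged this way carry no information for comparisons within the dataset, so they are usually dropped before further analysis.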

Reproduction

Analysis started: 2023-12-10 16:23:57.699577
Analysis finished: 2023-12-10 16:23:58.305697
Duration: 0.61 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
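A report of this kind can be regenerated roughly as sketched below. It uses `pandas.read_csv` together with `ProfileReport` and its `to_file` method from ydata-profiling. The function name `build_report`, the file paths, the report title, and the encoding are assumptions for illustration, since the report does not record how the source file was loaded.

```python
def build_report(csv_path, html_path="report.html", encoding="utf-8"):
    """Profile a CSV file and write the result as an HTML report (sketch)."""
    # Imported lazily so this module still loads where the
    # third-party packages are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path, encoding=encoding)
    # The title is a hypothetical label, not taken from the original run.
    report = ProfileReport(df, title="Local tax taxpayer profile")
    report.to_file(html_path)
    return html_path
```

A call might look like `build_report("taxpayers.csv", encoding="cp949")`, where the file name is hypothetical and the encoding is a guess that should be checked against the actual file.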

Variables

시도명 (province/metropolitan city name)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

부산광역시 (Busan Metropolitan City): 30

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 부산광역시
2nd row: 부산광역시
3rd row: 부산광역시
4th row: 부산광역시
5th row: 부산광역시

Common Values

Value    Count    Frequency (%)
부산광역시    30    100.0%

Length

Figure: Histogram of lengths of the category

Common Values (Plot)

Figure: plot of the value counts (부산광역시: 30, 100.0%)

시군구명 (city/county/district name)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

수영구 (Suyeong-gu): 30

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 수영구
2nd row: 수영구
3rd row: 수영구
4th row: 수영구
5th row: 수영구

Common Values

Value    Count    Frequency (%)
수영구    30    100.0%

Length

Figure: Histogram of lengths of the category

Common Values (Plot)

Figure: plot of the value counts (수영구: 30, 100.0%)

자치단체코드 (local government code)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

26500: 30

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 26500
2nd row: 26500
3rd row: 26500
4th row: 26500
5th row: 26500

Common Values

Value    Count    Frequency (%)
26500    30    100.0%

Length

Figure: Histogram of lengths of the category

Common Values (Plot)

Figure: plot of the value counts (26500: 30, 100.0%)

과세년도 (tax year)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

2021: 30

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2021
2nd row: 2021
3rd row: 2021
4th row: 2021
5th row: 2021

Common Values

Value    Count    Frequency (%)
2021    30    100.0%

Length

Figure: Histogram of lengths of the category

Common Values (Plot)

Figure: plot of the value counts (2021: 30, 100.0%)

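세목명 (tax item name)
Categorical

Distinct: 9
Distinct (%): 30.0%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

재산세: 4
주민세: 4
취득세: 4
자동차세: 4
등록면허세: 4
Other values (4): 10

Length

Max length: 7
Median length: 5
Mean length: 4.2666667
Min length: 3

Unique

Unique: 2
Unique (%): 6.7%

Sample

1st row: 등록세
2nd row: 재산세
3rd row: 재산세
4th row: 재산세
5th row: 재산세

Common Values

Value    Count    Frequency (%)
재산세 (property tax)    4    13.3%
주민세 (resident tax)    4    13.3%
취득세 (acquisition tax)    4    13.3%
자동차세 (automobile tax)    4    13.3%
등록면허세 (registration and license tax)    4    13.3%
지방소득세 (local income tax)    4    13.3%
지역자원시설세 (local resource and facility tax)    4    13.3%
등록세 (registration tax)    1    3.3%
지방소비세 (local consumption tax)    1    3.3%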
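The Distinct, Unique, and Mean length figures for 세목명 can be re-derived from the value counts in the Common Values table. The sketch below assumes that "Distinct" counts different values, "Unique" counts values that occur exactly once, and length is measured in characters. The variable names are illustrative.

```python
# Value counts copied from the Common Values table for 세목명.
counts = {
    "재산세": 4,
    "주민세": 4,
    "취득세": 4,
    "자동차세": 4,
    "등록면허세": 4,
    "지방소득세": 4,
    "지역자원시설세": 4,
    "등록세": 1,
    "지방소비세": 1,
}

n = sum(counts.values())  # number of observations
distinct = len(counts)    # number of different values
unique = sum(1 for c in counts.values() if c == 1)  # values seen exactly once

# Character length of each value, weighted by how often it occurs.
mean_length = sum(len(value) * c for value, c in counts.items()) / n

print(f"n = {n}, distinct = {distinct}, unique = {unique}")
print(f"distinct (%) = {100 * distinct / n:.1f}")
print(f"unique (%) = {100 * unique / n:.1f}")
print(f"mean length = {mean_length:.7f}")
```

The two values that occur once are 등록세 and 지방소비세, which accounts for the Unique count of 2.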

Length

Figure: Histogram of lengths of the category

Common Values (Plot)

Figure: plot of the value counts listed under Common Values

납세자유형 (taxpayer type)
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): 6.7%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

법인 (corporation): 16
개인 (individual): 14

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 법인
2nd row: 개인
3rd row: 개인
4th row: 법인
5th row: 법인

Common Values

Value    Count    Frequency (%)
법인    16    53.3%
개인    14    46.7%

Length

Figure: Histogram of lengths of the category

Common Values (Plot)

Figure: plot of the value counts (법인: 16, 53.3%; 개인: 14, 46.7%)

관내_관외 (inside/outside the jurisdiction)
Boolean

Distinct: 2
Distinct (%): 6.7%
Missing: 0
Missing (%): 0.0%
Memory size: 162.0 B

True: 16
False: 14

Value    Count    Frequency (%)
True    16    53.3%
False    14    46.7%

납세자수 (number of taxpayers)
Real number (ℝ)

HIGH CORRELATION

Distinct: 29
Distinct (%): 96.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 9708.4
Minimum: 1
Maximum: 67344
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 402.0 B

Quantile statistics

Minimum: 1
5-th percentile: 5.5
Q1: 258.75
Median: 1600.5
Q3: 8354.5
95-th percentile: 47445.7
Maximum: 67344
Range: 67343
Interquartile range (IQR): 8095.75
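The Range and IQR rows follow directly from the other quantile statistics. A small worked check, using only figures from the table (the variable names are illustrative):

```python
# Figures copied from the Quantile statistics table for 납세자수.
minimum, maximum = 1, 67344
q1, q3 = 258.75, 8354.5

# Range is the spread between the extremes.
value_range = maximum - minimum

# Interquartile range is the spread of the middle half of the data.
iqr = q3 - q1

print(f"range = {value_range}")
print(f"IQR = {iqr}")
```

The range is more than eight times the IQR, which fits the strong right skew reported for this variable.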

Descriptive statistics

Standard deviation: 17205.871
Coefficient of variation (CV): 1.7722664
Kurtosis: 4.1957989
Mean: 9708.4
Median Absolute Deviation (MAD): 1580
Skewness: 2.1906686
Sum: 291252
Variance: 2.9604201 × 10^8
Monotonicity: Not monotonic
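Several of the descriptive statistics are tied together by simple identities: the mean is the sum divided by the number of observations, the coefficient of variation is the standard deviation divided by the mean, and the variance is the square of the standard deviation. The sketch below checks those relations using the rounded figures from the report, so small rounding differences are expected.

```python
import math

# Figures copied from the report for 납세자수.
n = 30
total = 291252           # "Sum"
std = 17205.871          # "Standard deviation"
reported_cv = 1.7722664
reported_variance = 2.9604201e8

mean = total / n         # sum divided by number of observations
cv = std / mean          # coefficient of variation
variance = std ** 2      # variance is the squared standard deviation

print(f"mean = {mean:.1f}")
print(f"CV = {cv:.4f}")
print(f"variance = {variance:.4e}")

# The derived values agree with the reported ones up to rounding.
print(math.isclose(cv, reported_cv, rel_tol=1e-6))
print(math.isclose(variance, reported_variance, rel_tol=1e-6))
```

A CV above 1 means the standard deviation exceeds the mean, which is typical of a heavily right-skewed count such as this one.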

Figure: Histogram with fixed size bins (bins=29)

Common Values

Value    Count    Frequency (%)
1    2    6.7%
27113    1    3.3%
30    1    3.3%
11    1    3.3%
120    1    3.3%
42    1    3.3%
1631    1    3.3%
716    1    3.3%
32692    1    3.3%
8363    1    3.3%
Other values (19)    19    63.3%

Minimum 10 values

Value    Count    Frequency (%)
1    2    6.7%
11    1    3.3%
30    1    3.3%
42    1    3.3%
120    1    3.3%
177    1    3.3%
180    1    3.3%
495    1    3.3%
641    1    3.3%
683    1    3.3%

Maximum 10 values

Value    Count    Frequency (%)
67344    1    3.3%
48334    1    3.3%
46360    1    3.3%
32692    1    3.3%
27113    1    3.3%
15814    1    3.3%
10427    1    3.3%
8363    1    3.3%
8329    1    3.3%
6288    1    3.3%

Interactions

Figure: interactions plot

Correlations

Figure: correlation heatmap

Variable    세목명    납세자유형    관내_관외    납세자수
세목명    1.000    0.000    0.000    0.000
납세자유형    0.000    1.000    0.000    0.560
관내_관외    0.000    0.000    1.000    0.185
납세자수    0.000    0.560    0.185    1.000

Figure: correlation heatmap

Variable    납세자유형    세목명    관내_관외
납세자유형    1.000    0.000    0.000
세목명    0.000    1.000    0.000
관내_관외    0.000    0.000    1.000

Figure: correlation heatmap

Variable    납세자수    세목명    납세자유형    관내_관외
납세자수    1.000    0.000    0.541    0.160
세목명    0.000    1.000    0.000    0.000
납세자유형    0.541    0.000    1.000    0.000
관내_관외    0.160    0.000    0.000    1.000

Missing values

Figure: A simple visualization of nullity by column.
Figure: Nullity matrix, a data-dense display that lets you quickly pick out patterns in data completion.

Sample

First rows

Row    시도명    시군구명    자치단체코드    과세년도    세목명    납세자유형    관내_관외    납세자수
0    부산광역시    수영구    26500    2021    등록세    법인    Y    1
1    부산광역시    수영구    26500    2021    재산세    개인    N    27113
2    부산광역시    수영구    26500    2021    재산세    개인    Y    48334
3    부산광역시    수영구    26500    2021    재산세    법인    N    641
4    부산광역시    수영구    26500    2021    재산세    법인    Y    495
5    부산광역시    수영구    26500    2021    주민세    개인    N    8329
6    부산광역시    수영구    26500    2021    주민세    개인    Y    67344
7    부산광역시    수영구    26500    2021    주민세    법인    N    832
8    부산광역시    수영구    26500    2021    주민세    법인    Y    1969
9    부산광역시    수영구    26500    2021    취득세    개인    N    3629

Last rows

Row    시도명    시군구명    자치단체코드    과세년도    세목명    납세자유형    관내_관외    납세자수
20    부산광역시    수영구    26500    2021    등록면허세    법인    Y    1579
21    부산광역시    수영구    26500    2021    지방소득세    개인    N    8363
22    부산광역시    수영구    26500    2021    지방소득세    개인    Y    32692
23    부산광역시    수영구    26500    2021    지방소득세    법인    N    716
24    부산광역시    수영구    26500    2021    지방소득세    법인    Y    1631
25    부산광역시    수영구    26500    2021    지방소비세    법인    Y    1
26    부산광역시    수영구    26500    2021    지역자원시설세    개인    N    42
27    부산광역시    수영구    26500    2021    지역자원시설세    개인    Y    120
28    부산광역시    수영구    26500    2021    지역자원시설세    법인    N    11
29    부산광역시    수영구    26500    2021    지역자원시설세    법인    Y    30