Overview

Dataset statistics

Number of variables8
Number of observations139
Missing cells40
Missing cells (%)3.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.4 KiB
Average record size in memory68.9 B

Variable types

Categorical6
Numeric2

Dataset

Description3년간(2020~2022) 지방세 과세를 위해 세원이 되는 과세 대상 유형별 부과된 부과건수 및 부과금액 현황을 제공합니다.
Author전라남도 나주시
URLhttps://www.data.go.kr/data/15126699/fileData.do

Alerts

시도명 has constant value ""Constant
시군구명 has constant value ""Constant
자치단체코드 has constant value ""Constant
부과건수 is highly overall correlated with 부과금액 and 2 other fieldsHigh correlation
부과금액 is highly overall correlated with 부과건수High correlation
세목명 is highly overall correlated with 부과건수 and 1 other fieldsHigh correlation
세원 유형명 is highly overall correlated with 부과건수 and 1 other fieldsHigh correlation
부과건수 has 20 (14.4%) missing valuesMissing
부과금액 has 20 (14.4%) missing valuesMissing
부과건수 has 7 (5.0%) zerosZeros
부과금액 has 7 (5.0%) zerosZeros

Reproduction

Analysis started2024-03-14 17:05:06.888847
Analysis finished2024-03-14 17:05:08.938722
Duration2.05 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
전라남도
139 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row전라남도
2nd row전라남도
3rd row전라남도
4th row전라남도
5th row전라남도

Common Values

ValueCountFrequency (%)
전라남도 139
100.0%

Length

2024-03-15T02:05:09.164810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T02:05:09.593805image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전라남도 139
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
나주시
139 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row나주시
2nd row나주시
3rd row나주시
4th row나주시
5th row나주시

Common Values

ValueCountFrequency (%)
나주시 139
100.0%

Length

2024-03-15T02:05:09.922919image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T02:05:10.090707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
나주시 139
100.0%

자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
46170
139 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row46170
2nd row46170
3rd row46170
4th row46170
5th row46170

Common Values

ValueCountFrequency (%)
46170 139
100.0%

Length

2024-03-15T02:05:10.373437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T02:05:10.756604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
46170 139
100.0%

과세년도
Categorical

Distinct3
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2020
47 
2021
46 
2022
46 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020
2nd row2020
3rd row2020
4th row2020
5th row2020

Common Values

ValueCountFrequency (%)
2020 47
33.8%
2021 46
33.1%
2022 46
33.1%

Length

2024-03-15T02:05:10.983779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T02:05:11.163818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020 47
33.8%
2021 46
33.1%
2022 46
33.1%

세목명
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)9.4%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
취득세
27 
주민세
23 
자동차세
21 
재산세
15 
레저세
12 
Other values (8)
41 

Length

Max length7
Median length3
Mean length3.7482014
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row담배소비세
2nd row교육세
3rd row도시계획세
4th row취득세
5th row취득세

Common Values

ValueCountFrequency (%)
취득세 27
19.4%
주민세 23
16.5%
자동차세 21
15.1%
재산세 15
10.8%
레저세 12
8.6%
지방소득세 12
8.6%
지역자원시설세 8
 
5.8%
등록면허세 6
 
4.3%
담배소비세 3
 
2.2%
교육세 3
 
2.2%
Other values (3) 9
 
6.5%

Length

2024-03-15T02:05:11.430377image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
취득세 27
19.4%
주민세 23
16.5%
자동차세 21
15.1%
재산세 15
10.8%
레저세 12
8.6%
지방소득세 12
8.6%
지역자원시설세 8
 
5.8%
등록면허세 6
 
4.3%
담배소비세 3
 
2.2%
교육세 3
 
2.2%
Other values (3) 9
 
6.5%

세원 유형명
Categorical

HIGH CORRELATION 

Distinct50
Distinct (%)36.0%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
담배소비세
 
3
승합
 
3
건축물
 
3
3륜이하
 
3
기타
 
3
Other values (45)
124 

Length

Max length11
Median length8
Mean length6.028777
Min length2

Unique

Unique4 ?
Unique (%)2.9%

Sample

1st row담배소비세
2nd row교육세
3rd row도시계획세
4th row건축물
5th row주택(개별)

Common Values

ValueCountFrequency (%)
담배소비세 3
 
2.2%
승합 3
 
2.2%
건축물 3
 
2.2%
3륜이하 3
 
2.2%
기타 3
 
2.2%
항공기 3
 
2.2%
기계장비 3
 
2.2%
차량 3
 
2.2%
선박 3
 
2.2%
토지 3
 
2.2%
Other values (40) 109
78.4%

Length

2024-03-15T02:05:11.837466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
담배소비세 3
 
2.2%
주택(단독 3
 
2.2%
주민세(종합소득 3
 
2.2%
승합 3
 
2.2%
교육세 3
 
2.2%
기타승용 3
 
2.2%
승용 3
 
2.2%
주민세(종업원분 3
 
2.2%
주민세(특별징수 3
 
2.2%
체납 3
 
2.2%
Other values (40) 109
78.4%

부과건수
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct108
Distinct (%)90.8%
Missing20
Missing (%)14.4%
Infinite0
Infinite (%)0.0%
Mean23798.697
Minimum0
Maximum327549
Zeros7
Zeros (%)5.0%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2024-03-15T02:05:12.232414image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q1277
median2376
Q322346
95-th percentile97522.2
Maximum327549
Range327549
Interquartile range (IQR)22069

Descriptive statistics

Standard deviation54981.397
Coefficient of variation (CV)2.3102692
Kurtosis20.718461
Mean23798.697
Median Absolute Deviation (MAD)2370
Skewness4.2923002
Sum2832045
Variance3.022954 × 109
MonotonicityNot monotonic
2024-03-15T02:05:12.671804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 7
 
5.0%
12 4
 
2.9%
6 2
 
1.4%
2 2
 
1.4%
98271 1
 
0.7%
10478 1
 
0.7%
737 1
 
0.7%
1 1
 
0.7%
250 1
 
0.7%
2376 1
 
0.7%
Other values (98) 98
70.5%
(Missing) 20
 
14.4%
ValueCountFrequency (%)
0 7
5.0%
1 1
 
0.7%
2 2
 
1.4%
3 1
 
0.7%
5 1
 
0.7%
6 2
 
1.4%
7 1
 
0.7%
9 1
 
0.7%
10 1
 
0.7%
11 1
 
0.7%
ValueCountFrequency (%)
327549 1
0.7%
326385 1
0.7%
315753 1
0.7%
100594 1
0.7%
98526 1
0.7%
98271 1
0.7%
97439 1
0.7%
95609 1
0.7%
93650 1
0.7%
78796 1
0.7%

부과금액
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct113
Distinct (%)95.0%
Missing20
Missing (%)14.4%
Infinite0
Infinite (%)0.0%
Mean5.3178037 × 109
Minimum0
Maximum2.993979 × 1010
Zeros7
Zeros (%)5.0%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2024-03-15T02:05:13.360693image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q158869500
median2.233085 × 109
Q31.0112690 × 1010
95-th percentile1.6605964 × 1010
Maximum2.993979 × 1010
Range2.993979 × 1010
Interquartile range (IQR)1.0053821 × 1010

Descriptive statistics

Standard deviation6.3133665 × 109
Coefficient of variation (CV)1.1872131
Kurtosis0.84466894
Mean5.3178037 × 109
Median Absolute Deviation (MAD)2.232949 × 109
Skewness1.161789
Sum6.3281864 × 1011
Variance3.9858596 × 1019
MonotonicityNot monotonic
2024-03-15T02:05:13.807053image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 7
 
5.0%
17856000 1
 
0.7%
742000 1
 
0.7%
11841252000 1
 
0.7%
681033000 1
 
0.7%
263000 1
 
0.7%
1114093000 1
 
0.7%
10918861000 1
 
0.7%
2813154000 1
 
0.7%
15198748000 1
 
0.7%
Other values (103) 103
74.1%
(Missing) 20
 
14.4%
ValueCountFrequency (%)
0 7
5.0%
110000 1
 
0.7%
136000 1
 
0.7%
203000 1
 
0.7%
263000 1
 
0.7%
742000 1
 
0.7%
803000 1
 
0.7%
826000 1
 
0.7%
1185000 1
 
0.7%
2253000 1
 
0.7%
ValueCountFrequency (%)
29939790000 1
0.7%
20194651000 1
0.7%
19122235000 1
0.7%
18166856000 1
0.7%
17614205000 1
0.7%
16918009000 1
0.7%
16571292000 1
0.7%
16243683000 1
0.7%
15931623000 1
0.7%
15834212000 1
0.7%

Interactions

2024-03-15T02:05:07.746116image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T02:05:07.218670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T02:05:07.944615image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T02:05:07.473610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T02:05:14.101939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과세년도세목명세원 유형명부과건수부과금액
과세년도1.0000.0000.0000.0000.000
세목명0.0001.0001.0000.8990.591
세원 유형명0.0001.0001.0001.0000.825
부과건수0.0000.8991.0001.0000.570
부과금액0.0000.5910.8250.5701.000
2024-03-15T02:05:14.324754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과세년도세원 유형명세목명
과세년도1.0000.0000.000
세원 유형명0.0001.0000.840
세목명0.0000.8401.000
2024-03-15T02:05:14.483610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
부과건수부과금액과세년도세목명세원 유형명
부과건수1.0000.5880.0000.7380.769
부과금액0.5881.0000.0000.3080.380
과세년도0.0000.0001.0000.0000.000
세목명0.7380.3080.0001.0000.840
세원 유형명0.7690.3800.0000.8401.000

Missing values

2024-03-15T02:05:08.258197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T02:05:08.555006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-15T02:05:08.812882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

시도명시군구명자치단체코드과세년도세목명세원 유형명부과건수부과금액
0전라남도나주시461702020담배소비세담배소비세2747878134000
1전라남도나주시461702020교육세교육세31575314091643000
2전라남도나주시461702020도시계획세도시계획세<NA><NA>
3전라남도나주시461702020취득세건축물17347549027000
4전라남도나주시461702020취득세주택(개별)17552104541000
5전라남도나주시461702020취득세주택(단독)239611515749000
6전라남도나주시461702020취득세기타227851796000
7전라남도나주시461702020취득세항공기<NA><NA>
8전라남도나주시461702020취득세기계장비752733260000
9전라남도나주시461702020취득세차량1176311626890000
시도명시군구명자치단체코드과세년도세목명세원 유형명부과건수부과금액
129전라남도나주시461702022등록면허세등록면허세(면허)35191741009000
130전라남도나주시461702022등록면허세등록면허세(등록)344253855175000
131전라남도나주시461702022지역자원시설세지역자원시설세(소방)531505280479000
132전라남도나주시461702022지역자원시설세지역자원시설세(시설)62794000
133전라남도나주시461702022지역자원시설세지역자원시설세(특자)57218864000
134전라남도나주시461702022지방소득세지방소득세(특별징수)2652229939790000
135전라남도나주시461702022지방소득세지방소득세(법인소득)364416243683000
136전라남도나주시461702022지방소득세지방소득세(양도소득)24773167273000
137전라남도나주시461702022지방소득세지방소득세(종합소득)198312233085000
138전라남도나주시461702022체납체납985269335356000