Overview

Dataset statistics

Number of variables: 8
Number of observations: 184
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 12.3 KiB
Average record size in memory: 68.7 B
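
As a rough cross-check, the average record size should be the total memory divided by the number of observations. The sketch below assumes the displayed 12.3 KiB is a rounded figure, so it can only approximately reproduce the reported 68.7 B.

```python
# Rough cross-check of "Average record size in memory".
# Inputs are the rounded figures displayed in the report, so the
# result is approximate rather than an exact reproduction.
total_kib = 12.3        # "Total size in memory" as displayed
n_observations = 184    # "Number of observations"

total_bytes = total_kib * 1024
avg_record_bytes = total_bytes / n_observations

print(f"{avg_record_bytes:.1f} B per record")
```

The small gap to 68.7 B is consistent with the total having been rounded to one decimal place before display.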

Variable types

Categorical: 6
Numeric: 2

Dataset

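Description: Provides the status of local tax assessments, broken down by the type of taxable object that serves as the tax source. Fields: 시도명 (province name), 시군구명 (city/county/district name), 자치단체코드 (local government code), 과세년도 (tax year), 세목명 (tax item name), 세원 유형명 (tax source type name), 부과건수 (number of assessments), 부과금액 (assessed amount).
URL: https://www.data.go.kr/data/15080171/fileData.do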

Alerts

시도명 has constant value "경기도" (Constant)
과세년도 has constant value "2022" (Constant)
자치단체코드 is highly overall correlated with 시군구명 (High correlation)
시군구명 is highly overall correlated with 자치단체코드 (High correlation)
부과건수 is highly overall correlated with 부과금액 (High correlation)
부과금액 is highly overall correlated with 부과건수 (High correlation)
세목명 is highly overall correlated with 세원 유형명 (High correlation)
세원 유형명 is highly overall correlated with 세목명 (High correlation)
부과건수 has 53 (28.8%) zeros (Zeros)
부과금액 has 53 (28.8%) zeros (Zeros)
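
The percentages in the alerts and in the per-variable summaries follow directly from the counts and the 184 observations. A minimal sketch of that arithmetic; the helper name `pct` is only illustrative:

```python
# Recompute percentages shown in the alerts and variable summaries
# from the raw counts reported in this profile.
n_rows = 184


def pct(count: int, total: int = n_rows) -> str:
    """Format a count as a percentage of total, to one decimal place."""
    return f"{100 * count / total:.1f}%"


zeros_pct = pct(53)      # zeros in 부과건수 and in 부과금액
constant_pct = pct(1)    # one distinct value (시도명, 과세년도)
district_pct = pct(46)   # rows per 시군구명 value

print(zeros_pct, constant_pct, district_pct)
```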

Reproduction

Analysis started: 2023-12-12 19:32:54.924414
Analysis finished: 2023-12-12 19:32:56.214224
Duration: 1.29 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
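
The duration can be recovered from the two timestamps. A small sketch using only the standard library, assuming both timestamps are in the same time zone:

```python
from datetime import datetime

# Timestamps copied from the Reproduction block.
FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2023-12-12 19:32:54.924414", FMT)
finished = datetime.strptime("2023-12-12 19:32:56.214224", FMT)

duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```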

Variables

시도명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

경기도: 184

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 경기도
2nd row: 경기도
3rd row: 경기도
4th row: 경기도
5th row: 경기도

Common Values

Value | Count | Frequency (%)
경기도 | 184 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

시군구명
Categorical

HIGH CORRELATION 

Distinct: 4
Distinct (%): 2.2%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

수원시영통구: 46
수원시팔달구: 46
수원시권선구: 46
수원시장안구: 46

Length

Max length: 6
Median length: 6
Mean length: 6
Min length: 6

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 수원시영통구
2nd row: 수원시팔달구
3rd row: 수원시권선구
4th row: 수원시장안구
5th row: 수원시영통구

Common Values

Value | Count | Frequency (%)
수원시영통구 | 46 | 25.0%
수원시팔달구 | 46 | 25.0%
수원시권선구 | 46 | 25.0%
수원시장안구 | 46 | 25.0%

Length

Histogram of lengths of the category

Common Values (Plot)

자치단체코드
Categorical

HIGH CORRELATION 

Distinct: 4
Distinct (%): 2.2%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

41117: 46
41115: 46
41113: 46
41111: 46

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 41117
2nd row: 41115
3rd row: 41113
4th row: 41111
5th row: 41117

Common Values

Value | Count | Frequency (%)
41117 | 46 | 25.0%
41115 | 46 | 25.0%
41113 | 46 | 25.0%
41111 | 46 | 25.0%

Length

Histogram of lengths of the category

Common Values (Plot)

과세년도
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

2022: 184

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2022
2nd row: 2022
3rd row: 2022
4th row: 2022
5th row: 2022

Common Values

Value | Count | Frequency (%)
2022 | 184 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

세목명
Categorical

HIGH CORRELATION 

Distinct: 13
Distinct (%): 7.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

취득세: 36
자동차세: 28
주민세: 28
재산세: 20
레저세: 16
Other values (8): 56

Length

Max length: 7
Median length: 3
Mean length: 3.7826087
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 담배소비세
2nd row: 담배소비세
3rd row: 담배소비세
4th row: 담배소비세
5th row: 도시계획세

Common Values

Value | Count | Frequency (%)
취득세 | 36 | 19.6%
자동차세 | 28 | 15.2%
주민세 | 28 | 15.2%
재산세 | 20 | 10.9%
레저세 | 16 | 8.7%
지방소득세 | 16 | 8.7%
지역자원시설세 | 12 | 6.5%
등록면허세 | 8 | 4.3%
담배소비세 | 4 | 2.2%
도시계획세 | 4 | 2.2%
Other values (3) | 12 | 6.5%

Length

Histogram of lengths of the category

세원 유형명
Categorical

HIGH CORRELATION 

Distinct: 46
Distinct (%): 25.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

담배소비세: 4
토지: 4
3륜이하: 4
주택(개별): 4
주택(단독): 4
Other values (41): 164

Length

Max length: 11
Median length: 8
Mean length: 6.0217391
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 담배소비세
2nd row: 담배소비세
3rd row: 담배소비세
4th row: 담배소비세
5th row: 도시계획세

Common Values

Value | Count | Frequency (%)
담배소비세 | 4 | 2.2%
토지 | 4 | 2.2%
3륜이하 | 4 | 2.2%
주택(개별) | 4 | 2.2%
주택(단독) | 4 | 2.2%
기타 | 4 | 2.2%
항공기 | 4 | 2.2%
기계장비 | 4 | 2.2%
차량 | 4 | 2.2%
선박 | 4 | 2.2%
Other values (36) | 144 | 78.3%

Length

Histogram of lengths of the category

Words

Value | Count | Frequency (%)
담배소비세 | 4 | 2.2%
주민세(종합소득 | 4 | 2.2%
교육세 | 4 | 2.2%
기타승용 | 4 | 2.2%
승용 | 4 | 2.2%
주민세(사업소분 | 4 | 2.2%
주민세(개인분 | 4 | 2.2%
주민세(종업원분 | 4 | 2.2%
주민세(특별징수 | 4 | 2.2%
주민세(법인세분 | 4 | 2.2%
Other values (36) | 144 | 78.3%

부과건수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 127
Distinct (%): 69.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 39251.788
Minimum: 0
Maximum: 758753
Zeros: 53
Zeros (%): 28.8%
Negative: 0
Negative (%): 0.0%
Memory size: 1.7 KiB

Quantile statistics

Minimum: 0
5th percentile: 0
Q1: 0
Median: 1169
Q3: 23019.75
95th percentile: 210617.2
Maximum: 758753
Range: 758753
Interquartile range (IQR): 23019.75

Descriptive statistics

Standard deviation: 103236.51
Coefficient of variation (CV): 2.6301099
Kurtosis: 25.289062
Mean: 39251.788
Median Absolute Deviation (MAD): 1169
Skewness: 4.5777931
Sum: 7222329
Variance: 1.0657778 × 10^10
Monotonicity: Not monotonic
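
Several of the figures above are derived from others, which gives a quick consistency check. The sketch below recomputes them from the reported (rounded) values, so small differences in the last digits are expected.

```python
# Reported summary values for 부과건수, copied from the profile above.
n = 184
mean = 39251.788
std = 103236.51
q1, q3 = 0.0, 23019.75
minimum, maximum = 0, 758753

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance is the squared standard deviation
total = mean * n                 # sum recovered from mean and row count
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum

print(f"CV={cv:.4f} variance={variance:.4e} sum={total:.0f}")
print(f"IQR={iqr} range={value_range}")
```

A CV well above 1 together with a median (1169) far below the mean is what a heavily right-skewed, zero-heavy column looks like.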
Histogram with fixed size bins (bins=50)
Common values

Value | Count | Frequency (%)
0 | 53 | 28.8%
12 | 2 | 1.1%
24 | 2 | 1.1%
63 | 2 | 1.1%
32 | 2 | 1.1%
10 | 2 | 1.1%
31533 | 1 | 0.5%
1358 | 1 | 0.5%
9 | 1 | 0.5%
36025 | 1 | 0.5%
Other values (117) | 117 | 63.6%

Minimum 10 values

Value | Count | Frequency (%)
0 | 53 | 28.8%
7 | 1 | 0.5%
9 | 1 | 0.5%
10 | 2 | 1.1%
11 | 1 | 0.5%
12 | 2 | 1.1%
17 | 1 | 0.5%
19 | 1 | 0.5%
24 | 2 | 1.1%
31 | 1 | 0.5%

Maximum 10 values

Value | Count | Frequency (%)
758753 | 1 | 0.5%
729602 | 1 | 0.5%
492622 | 1 | 0.5%
393208 | 1 | 0.5%
305536 | 1 | 0.5%
295676 | 1 | 0.5%
240939 | 1 | 0.5%
240576 | 1 | 0.5%
236627 | 1 | 0.5%
215710 | 1 | 0.5%

부과금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 132
Distinct (%): 71.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.2146748 × 10^10
Minimum: 0
Maximum: 2.7116909 × 10^11
Zeros: 53
Zeros (%): 28.8%
Negative: 0
Negative (%): 0.0%
Memory size: 1.7 KiB

Quantile statistics

Minimum: 0
5th percentile: 0
Q1: 0
Median: 4.187355 × 10^8
Q3: 1.59311 × 10^10
95th percentile: 4.7821098 × 10^10
Maximum: 2.7116909 × 10^11
Range: 2.7116909 × 10^11
Interquartile range (IQR): 1.59311 × 10^10

Descriptive statistics

Standard deviation: 2.7206925 × 10^10
Coefficient of variation (CV): 2.2398526
Kurtosis: 48.018898
Mean: 1.2146748 × 10^10
Median Absolute Deviation (MAD): 4.187355 × 10^8
Skewness: 5.8489276
Sum: 2.2350016 × 10^12
Variance: 7.4021677 × 10^20
Monotonicity: Not monotonic
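
The reported MAD is equal to the median here (4.187355 × 10^8), and the same holds for 부과건수 (1169). One plausible reading, offered as an assumption and not something the report states, is that the many zeros each lie exactly one median away from the median, so that distance becomes the middle absolute deviation. The toy sample below is hypothetical and is not taken from the dataset; it only illustrates the effect.

```python
from statistics import median


def mad(values):
    """Median absolute deviation: median of |x - median(values)|."""
    m = median(values)
    return median(abs(x - m) for x in values)


# Hypothetical zero-heavy, right-skewed sample (not from the dataset).
toy = [0, 0, 0, 4, 10, 50, 900]

print("median:", median(toy))
print("MAD:", mad(toy))
```

In this toy sample the three zeros each deviate from the median by exactly the median, and they occupy the middle of the sorted deviations, so the MAD matches the median.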
Histogram with fixed size bins (bins=50)
Common values

Value | Count | Frequency (%)
0 | 53 | 28.8%
1766754000 | 1 | 0.5%
5794395000 | 1 | 0.5%
878103000 | 1 | 0.5%
12711194000 | 1 | 0.5%
1441319000 | 1 | 0.5%
3964754000 | 1 | 0.5%
965757000 | 1 | 0.5%
10551737000 | 1 | 0.5%
1438491000 | 1 | 0.5%
Other values (122) | 122 | 66.3%

Minimum 10 values

Value | Count | Frequency (%)
0 | 53 | 28.8%
644000 | 1 | 0.5%
1058000 | 1 | 0.5%
1434000 | 1 | 0.5%
1590000 | 1 | 0.5%
2004000 | 1 | 0.5%
3585000 | 1 | 0.5%
3639000 | 1 | 0.5%
5634000 | 1 | 0.5%
6006000 | 1 | 0.5%

Maximum 10 values

Value | Count | Frequency (%)
271169088000 | 1 | 0.5%
152072534000 | 1 | 0.5%
81269707000 | 1 | 0.5%
72782970000 | 1 | 0.5%
65423881000 | 1 | 0.5%
55676723000 | 1 | 0.5%
54931554000 | 1 | 0.5%
53708436000 | 1 | 0.5%
53093278000 | 1 | 0.5%
47840943000 | 1 | 0.5%

Interactions


Correlations

Variable | 시군구명 | 자치단체코드 | 세목명 | 세원 유형명 | 부과건수 | 부과금액
시군구명 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.140
자치단체코드 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.140
세목명 | 0.000 | 0.000 | 1.000 | 1.000 | 0.689 | 0.284
세원 유형명 | 0.000 | 0.000 | 1.000 | 1.000 | 0.747 | 0.510
부과건수 | 0.000 | 0.000 | 0.689 | 0.747 | 1.000 | 0.197
부과금액 | 0.140 | 0.140 | 0.284 | 0.510 | 0.197 | 1.000

Variable | 세목명 | 자치단체코드 | 시군구명 | 세원 유형명
세목명 | 1.000 | 0.000 | 0.000 | 0.898
자치단체코드 | 0.000 | 1.000 | 1.000 | 0.000
시군구명 | 0.000 | 1.000 | 1.000 | 0.000
세원 유형명 | 0.898 | 0.000 | 0.000 | 1.000

Variable | 부과건수 | 부과금액 | 시군구명 | 자치단체코드 | 세목명 | 세원 유형명
부과건수 | 1.000 | 0.827 | 0.000 | 0.000 | 0.394 | 0.349
부과금액 | 0.827 | 1.000 | 0.114 | 0.114 | 0.153 | 0.227
시군구명 | 0.000 | 0.114 | 1.000 | 1.000 | 0.000 | 0.000
자치단체코드 | 0.000 | 0.114 | 1.000 | 1.000 | 0.000 | 0.000
세목명 | 0.394 | 0.153 | 0.000 | 0.000 | 1.000 | 0.898
세원 유형명 | 0.349 | 0.227 | 0.000 | 0.000 | 0.898 | 1.000
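
The 1.000 between 시군구명 and 자치단체코드 reflects a one-to-one mapping: each district name always appears with the same code. The sketch below illustrates this with an uncorrected Cramér's V, a common association measure for categorical pairs. Which measure each matrix above uses is not stated in this text, and the report may apply a bias correction, so this illustrates the idea and is not a reproduction of the report's computation. The function name `cramers_v` is illustrative.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys):
    """Uncorrected Cramér's V for two equal-length categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row = Counter(xs)
    col = Counter(ys)
    chi2 = 0.0
    for r, r_count in row.items():
        for c, c_count in col.items():
            expected = r_count * c_count / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row), len(col)) - 1
    return sqrt(chi2 / (n * k)) if k > 0 else 0.0


# District-to-code pairs as they appear in the sample rows; 46 rows each.
code_of = {
    "수원시영통구": "41117",
    "수원시팔달구": "41115",
    "수원시권선구": "41113",
    "수원시장안구": "41111",
}
names = list(code_of) * 46
codes = [code_of[name] for name in names]

v = cramers_v(names, codes)
print(f"Cramér's V = {v:.3f}")
```

Because the two columns carry the same information, one of them could be dropped before modelling without loss.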

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

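First rows

Index | 시도명 | 시군구명 | 자치단체코드 | 과세년도 | 세목명 | 세원 유형명 | 부과건수 | 부과금액
0 | 경기도 | 수원시영통구 | 41117 | 2022 | 담배소비세 | 담배소비세 | 0 | 0
1 | 경기도 | 수원시팔달구 | 41115 | 2022 | 담배소비세 | 담배소비세 | 640 | 72782970000
2 | 경기도 | 수원시권선구 | 41113 | 2022 | 담배소비세 | 담배소비세 | 0 | 0
3 | 경기도 | 수원시장안구 | 41111 | 2022 | 담배소비세 | 담배소비세 | 0 | 0
4 | 경기도 | 수원시영통구 | 41117 | 2022 | 도시계획세 | 도시계획세 | 0 | 0
5 | 경기도 | 수원시팔달구 | 41115 | 2022 | 도시계획세 | 도시계획세 | 0 | 0
6 | 경기도 | 수원시권선구 | 41113 | 2022 | 도시계획세 | 도시계획세 | 0 | 0
7 | 경기도 | 수원시장안구 | 41111 | 2022 | 도시계획세 | 도시계획세 | 0 | 0
8 | 경기도 | 수원시영통구 | 41117 | 2022 | 취득세 | 주택(개별) | 2725586363000
9 | 경기도 | 수원시영통구 | 41117 | 2022 | 취득세 | 주택(단독) | 4576 | 65423881000

Last rows

Index | 시도명 | 시군구명 | 자치단체코드 | 과세년도 | 세목명 | 세원 유형명 | 부과건수 | 부과금액
174 | 경기도 | 수원시장안구 | 41111 | 2022 | 지방소득세 | 지방소득세(양도소득) | 364712263728000
175 | 경기도 | 수원시장안구 | 41111 | 2022 | 지방소득세 | 지방소득세(종합소득) | 78613 | 10068425000
176 | 경기도 | 수원시영통구 | 41117 | 2022 | 교육세 | 교육세 | 758753 | 47840943000
177 | 경기도 | 수원시팔달구 | 41115 | 2022 | 교육세 | 교육세 | 393208 | 53093278000
178 | 경기도 | 수원시권선구 | 41113 | 2022 | 교육세 | 교육세 | 729602 | 32555939000
179 | 경기도 | 수원시장안구 | 41111 | 2022 | 교육세 | 교육세 | 492622 | 21446213000
180 | 경기도 | 수원시영통구 | 41117 | 2022 | 체납 | 체납 | 148653 | 17791468000
181 | 경기도 | 수원시팔달구 | 41115 | 2022 | 체납 | 체납 | 181758 | 18845665000
182 | 경기도 | 수원시권선구 | 41113 | 2022 | 체납 | 체납 | 240939 | 22760610000
183 | 경기도 | 수원시장안구 | 41111 | 2022 | 체납 | 체납 | 139606 | 12098701000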
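
As a small illustration of working with these rows, the sketch below totals 부과건수 and 부과금액 per 세목명 for sample rows 176 to 183. It assumes each row's trailing digits split into a count and an amount in the only way that stays within the maximum values reported for the two numeric columns; the variable names are illustrative.

```python
from collections import defaultdict

# (시군구명, 세목명, 부과건수, 부과금액) for sample rows 176-183.
# Assumption: count/amount boundaries are inferred from the reported
# maxima and extreme-value lists, not read from the raw file.
rows = [
    ("수원시영통구", "교육세", 758753, 47840943000),
    ("수원시팔달구", "교육세", 393208, 53093278000),
    ("수원시권선구", "교육세", 729602, 32555939000),
    ("수원시장안구", "교육세", 492622, 21446213000),
    ("수원시영통구", "체납", 148653, 17791468000),
    ("수원시팔달구", "체납", 181758, 18845665000),
    ("수원시권선구", "체납", 240939, 22760610000),
    ("수원시장안구", "체납", 139606, 12098701000),
]

count_by_tax = defaultdict(int)
amount_by_tax = defaultdict(int)
for _district, tax, count, amount in rows:
    count_by_tax[tax] += count
    amount_by_tax[tax] += amount

for tax in amount_by_tax:
    avg = amount_by_tax[tax] / count_by_tax[tax]
    print(tax, count_by_tax[tax], amount_by_tax[tax], round(avg))
```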