Overview

Dataset statistics

Number of variables8
Number of observations235
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory15.7 KiB
Average record size in memory68.6 B

Variable types

Categorical6
Numeric2

Dataset

Description지방세 과세를 위해 세원이 되는 과세 대상 유형별 부과된 현황을 제공
Author경상남도 창원시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15078681

Alerts

시도명 has constant value ""Constant
과세년도 has constant value ""Constant
세원 유형명 is highly overall correlated with 세목명High correlation
시군구명 is highly overall correlated with 자치단체코드High correlation
자치단체코드 is highly overall correlated with 시군구명High correlation
세목명 is highly overall correlated with 세원 유형명High correlation
부과건수 is highly overall correlated with 부과금액High correlation
부과금액 is highly overall correlated with 부과건수High correlation
부과건수 has 50 (21.3%) zerosZeros
부과금액 has 50 (21.3%) zerosZeros

Reproduction

Analysis started2023-12-10 22:45:20.032075
Analysis finished2023-12-10 22:45:20.992015
Duration0.96 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
창원시
235 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row창원시
2nd row창원시
3rd row창원시
4th row창원시
5th row창원시

Common Values

ValueCountFrequency (%)
창원시 235
100.0%

Length

2023-12-11T07:45:21.040269image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:45:21.106885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
창원시 235
100.0%

시군구명
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
진해구
47 
마산회원구
47 
마산합포구
47 
성산구
47 
의창구
47 

Length

Max length5
Median length3
Mean length3.8
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row진해구
2nd row마산회원구
3rd row마산합포구
4th row성산구
5th row의창구

Common Values

ValueCountFrequency (%)
진해구 47
20.0%
마산회원구 47
20.0%
마산합포구 47
20.0%
성산구 47
20.0%
의창구 47
20.0%

Length

2023-12-11T07:45:21.192370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:45:21.284551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
진해구 47
20.0%
마산회원구 47
20.0%
마산합포구 47
20.0%
성산구 47
20.0%
의창구 47
20.0%

자치단체코드
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
48129
47 
48127
47 
48125
47 
48123
47 
48121
47 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row48129
2nd row48127
3rd row48125
4th row48123
5th row48121

Common Values

ValueCountFrequency (%)
48129 47
20.0%
48127 47
20.0%
48125 47
20.0%
48123 47
20.0%
48121 47
20.0%

Length

2023-12-11T07:45:21.371004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:45:21.453924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
48129 47
20.0%
48127 47
20.0%
48125 47
20.0%
48123 47
20.0%
48121 47
20.0%

과세년도
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2020
235 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020
2nd row2020
3rd row2020
4th row2020
5th row2020

Common Values

ValueCountFrequency (%)
2020 235
100.0%

Length

2023-12-11T07:45:21.543500image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:45:21.617093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020 235
100.0%

세목명
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)5.5%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
취득세
45 
주민세
45 
자동차세
35 
재산세
25 
레저세
20 
Other values (8)
65 

Length

Max length7
Median length3
Mean length3.6808511
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row담배소비세
2nd row담배소비세
3rd row담배소비세
4th row담배소비세
5th row담배소비세

Common Values

ValueCountFrequency (%)
취득세 45
19.1%
주민세 45
19.1%
자동차세 35
14.9%
재산세 25
10.6%
레저세 20
8.5%
지방소득세 20
8.5%
등록면허세 10
 
4.3%
지역자원시설세 10
 
4.3%
담배소비세 5
 
2.1%
교육세 5
 
2.1%
Other values (3) 15
 
6.4%

Length

2023-12-11T07:45:21.694917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
취득세 45
19.1%
주민세 45
19.1%
자동차세 35
14.9%
재산세 25
10.6%
레저세 20
8.5%
지방소득세 20
8.5%
등록면허세 10
 
4.3%
지역자원시설세 10
 
4.3%
담배소비세 5
 
2.1%
교육세 5
 
2.1%
Other values (3) 15
 
6.4%

세원 유형명
Categorical

HIGH CORRELATION 

Distinct47
Distinct (%)20.0%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
담배소비세
 
5
소싸움
 
5
도시계획세
 
5
주택(개별)
 
5
주택(단독)
 
5
Other values (42)
210 

Length

Max length11
Median length8
Mean length6.0425532
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row담배소비세
2nd row담배소비세
3rd row담배소비세
4th row담배소비세
5th row담배소비세

Common Values

ValueCountFrequency (%)
담배소비세 5
 
2.1%
소싸움 5
 
2.1%
도시계획세 5
 
2.1%
주택(개별) 5
 
2.1%
주택(단독) 5
 
2.1%
기타 5
 
2.1%
항공기 5
 
2.1%
기계장비 5
 
2.1%
차량 5
 
2.1%
선박 5
 
2.1%
Other values (37) 185
78.7%

Length

2023-12-11T07:45:21.789085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
담배소비세 5
 
2.1%
화물 5
 
2.1%
기타승용 5
 
2.1%
승용 5
 
2.1%
지방소득세(특별징수 5
 
2.1%
지방소득세(법인소득 5
 
2.1%
지방소득세(양도소득 5
 
2.1%
지방소득세(종합소득 5
 
2.1%
지방소비세 5
 
2.1%
등록면허세(면허 5
 
2.1%
Other values (37) 185
78.7%

부과건수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct172
Distinct (%)73.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean26599.191
Minimum0
Maximum560435
Zeros50
Zeros (%)21.3%
Negative0
Negative (%)0.0%
Memory size2.2 KiB
2023-12-11T07:45:21.892885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q16
median1189
Q318991
95-th percentile139037.4
Maximum560435
Range560435
Interquartile range (IQR)18985

Descriptive statistics

Standard deviation71140.097
Coefficient of variation (CV)2.674521
Kurtosis24.753136
Mean26599.191
Median Absolute Deviation (MAD)1189
Skewness4.5954856
Sum6250810
Variance5.0609133 × 109
MonotonicityNot monotonic
2023-12-11T07:45:21.995570image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 50
 
21.3%
6 9
 
3.8%
16 5
 
2.1%
376 2
 
0.9%
324 2
 
0.9%
19729 1
 
0.4%
47612 1
 
0.4%
3621 1
 
0.4%
3371 1
 
0.4%
27462 1
 
0.4%
Other values (162) 162
68.9%
ValueCountFrequency (%)
0 50
21.3%
1 1
 
0.4%
2 1
 
0.4%
3 1
 
0.4%
4 1
 
0.4%
6 9
 
3.8%
15 1
 
0.4%
16 5
 
2.1%
35 1
 
0.4%
36 1
 
0.4%
ValueCountFrequency (%)
560435 1
0.4%
450997 1
0.4%
395612 1
0.4%
363668 1
0.4%
359464 1
0.4%
225148 1
0.4%
216924 1
0.4%
170855 1
0.4%
157014 1
0.4%
148893 1
0.4%

부과금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct186
Distinct (%)79.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.5625615 × 109
Minimum-1228000
Maximum7.5822932 × 1010
Zeros50
Zeros (%)21.3%
Negative1
Negative (%)0.4%
Memory size2.2 KiB
2023-12-11T07:45:22.126331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-1228000
5-th percentile0
Q19221500
median6.27613 × 108
Q38.910117 × 109
95-th percentile2.6450752 × 1010
Maximum7.5822932 × 1010
Range7.582416 × 1010
Interquartile range (IQR)8.9008955 × 109

Descriptive statistics

Standard deviation1.1931729 × 1010
Coefficient of variation (CV)1.8181512
Kurtosis11.945822
Mean6.5625615 × 109
Median Absolute Deviation (MAD)6.27613 × 108
Skewness3.1058843
Sum1.5422019 × 1012
Variance1.4236616 × 1020
MonotonicityNot monotonic
2023-12-11T07:45:22.241516image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 50
 
21.3%
45206000 1
 
0.4%
5638056000 1
 
0.4%
5072942000 1
 
0.4%
3742970000 1
 
0.4%
4814564000 1
 
0.4%
52990127000 1
 
0.4%
42314633000 1
 
0.4%
3766793000 1
 
0.4%
10224249000 1
 
0.4%
Other values (176) 176
74.9%
ValueCountFrequency (%)
-1228000 1
 
0.4%
0 50
21.3%
1249000 1
 
0.4%
2320000 1
 
0.4%
5655000 1
 
0.4%
5774000 1
 
0.4%
6161000 1
 
0.4%
7412000 1
 
0.4%
7476000 1
 
0.4%
9038000 1
 
0.4%
ValueCountFrequency (%)
75822932000 1
0.4%
71385153000 1
0.4%
62982712000 1
0.4%
58468776000 1
0.4%
52990127000 1
0.4%
45623606000 1
0.4%
42314633000 1
0.4%
30208568000 1
0.4%
29150818000 1
0.4%
27953253000 1
0.4%

Interactions

2023-12-11T07:45:20.651462image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:45:20.303501image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:45:20.729833image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:45:20.570507image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T07:45:22.320280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군구명자치단체코드세목명세원 유형명부과건수부과금액
시군구명1.0001.0000.0000.0000.0000.143
자치단체코드1.0001.0000.0000.0000.0000.143
세목명0.0000.0001.0001.0000.7340.442
세원 유형명0.0000.0001.0001.0000.8310.657
부과건수0.0000.0000.7340.8311.0000.828
부과금액0.1430.1430.4420.6570.8281.000
2023-12-11T07:45:22.404102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
세원 유형명시군구명자치단체코드세목명
세원 유형명1.0000.0000.0000.920
시군구명0.0001.0001.0000.000
자치단체코드0.0001.0001.0000.000
세목명0.9200.0000.0001.000
2023-12-11T07:45:22.484067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
부과건수부과금액시군구명자치단체코드세목명세원 유형명
부과건수1.0000.7780.0000.0000.4250.437
부과금액0.7781.0000.0830.0830.2060.276
시군구명0.0000.0831.0001.0000.0000.000
자치단체코드0.0000.0831.0001.0000.0000.000
세목명0.4250.2060.0000.0001.0000.920
세원 유형명0.4370.2760.0000.0000.9201.000

Missing values

2023-12-11T07:45:20.850987image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T07:45:20.956002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명자치단체코드과세년도세목명세원 유형명부과건수부과금액
0창원시진해구481292020담배소비세담배소비세645206000
1창원시마산회원구481272020담배소비세담배소비세648742000
2창원시마산합포구481252020담배소비세담배소비세643438000
3창원시성산구481232020담배소비세담배소비세654551000
4창원시의창구481212020담배소비세담배소비세27671385153000
5창원시진해구481292020교육세교육세35946415449455000
6창원시마산회원구481272020교육세교육세36366812631569000
7창원시마산합포구481252020교육세교육세39561214799805000
8창원시성산구481232020교육세교육세45099725696232000
9창원시의창구481212020교육세교육세56043558468776000
시도명시군구명자치단체코드과세년도세목명세원 유형명부과건수부과금액
225창원시의창구481212020주민세주민세(양도소득)00
226창원시의창구481212020주민세주민세(종합소득)00
227창원시의창구481212020주민세주민세(법인균등)5100210546000
228창원시의창구481212020주민세주민세(개인사업)11941300228000
229창원시의창구481212020주민세주민세(개인균등)1002791003495000
230창원시진해구481292020체납체납14889311476208000
231창원시마산회원구481272020체납체납14847510743190000
232창원시마산합포구481252020체납체납15701410894919000
233창원시성산구481232020체납체납12142915750518000
234창원시의창구481212020체납체납22514820625665000