Overview

Dataset statistics

Number of variables9
Number of observations135
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory10.1 KiB
Average record size in memory77.0 B

Variable types

Categorical6
Numeric2
DateTime1

Dataset

Description과세년도 2017~2019에 대하여 세목별로 납세자 유형(개인/법인), 관내/관외 유형, 납세자 수 등에 대한 자료
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=346&beforeMenuCd=DOM_000000201001001000&publicdatapk=15080736

Alerts

시도명 has constant value ""Constant
시군구명 has constant value ""Constant
자치단체코드 has constant value ""Constant
데이터기준일 has constant value ""Constant
부과건수 is highly overall correlated with 부과금액 and 2 other fieldsHigh correlation
부과금액 is highly overall correlated with 부과건수High correlation
세목명 is highly overall correlated with 부과건수 and 1 other fieldsHigh correlation
세원 유형명 is highly overall correlated with 부과건수 and 1 other fieldsHigh correlation
부과건수 has 29 (21.5%) zerosZeros
부과금액 has 29 (21.5%) zerosZeros

Reproduction

Analysis started2024-01-09 20:53:11.001317
Analysis finished2024-01-09 20:53:11.739208
Duration0.74 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
충청남도
135 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row충청남도
2nd row충청남도
3rd row충청남도
4th row충청남도
5th row충청남도

Common Values

ValueCountFrequency (%)
충청남도 135
100.0%

Length

2024-01-10T05:53:11.791930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T05:53:11.868544image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
충청남도 135
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
공주시
135 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공주시
2nd row공주시
3rd row공주시
4th row공주시
5th row공주시

Common Values

ValueCountFrequency (%)
공주시 135
100.0%

Length

2024-01-10T05:53:11.949513image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T05:53:12.025651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공주시 135
100.0%

자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
44150
135 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row44150
2nd row44150
3rd row44150
4th row44150
5th row44150

Common Values

ValueCountFrequency (%)
44150 135
100.0%

Length

2024-01-10T05:53:12.106827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T05:53:12.182868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
44150 135
100.0%

과세년도
Categorical

Distinct3
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2017
47 
2018
47 
2019
41 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2017
2nd row2017
3rd row2017
4th row2017
5th row2017

Common Values

ValueCountFrequency (%)
2017 47
34.8%
2018 47
34.8%
2019 41
30.4%

Length

2024-01-10T05:53:12.265595image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T05:53:12.360712image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2017 47
34.8%
2018 47
34.8%
2019 41
30.4%

세목명
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)9.6%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
취득세
27 
주민세
27 
자동차세
21 
재산세
15 
지방소득세
12 
Other values (8)
33 

Length

Max length7
Median length3
Mean length3.6814815
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row담배소비세
2nd row교육세
3rd row도시계획세
4th row취득세
5th row취득세

Common Values

ValueCountFrequency (%)
취득세 27
20.0%
주민세 27
20.0%
자동차세 21
15.6%
재산세 15
11.1%
지방소득세 12
8.9%
레저세 8
 
5.9%
등록면허세 6
 
4.4%
지역자원시설세 6
 
4.4%
담배소비세 3
 
2.2%
교육세 3
 
2.2%
Other values (3) 7
 
5.2%

Length

2024-01-10T05:53:12.456181image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
취득세 27
20.0%
주민세 27
20.0%
자동차세 21
15.6%
재산세 15
11.1%
지방소득세 12
8.9%
레저세 8
 
5.9%
등록면허세 6
 
4.4%
지역자원시설세 6
 
4.4%
담배소비세 3
 
2.2%
교육세 3
 
2.2%
Other values (3) 7
 
5.2%

세원 유형명
Categorical

HIGH CORRELATION 

Distinct47
Distinct (%)34.8%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
담배소비세
 
3
기계장비
 
3
주민세(법인균등)
 
3
3륜이하
 
3
자동차세(주행)
 
3
Other values (42)
120 

Length

Max length11
Median length8
Mean length6.1703704
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row담배소비세
2nd row교육세
3rd row도시계획세
4th row건축물
5th row주택(개별)

Common Values

ValueCountFrequency (%)
담배소비세 3
 
2.2%
기계장비 3
 
2.2%
주민세(법인균등) 3
 
2.2%
3륜이하 3
 
2.2%
자동차세(주행) 3
 
2.2%
건축물 3
 
2.2%
주택(개별) 3
 
2.2%
주택(단독) 3
 
2.2%
기타 3
 
2.2%
항공기 3
 
2.2%
Other values (37) 105
77.8%

Length

2024-01-10T05:53:12.788493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
담배소비세 3
 
2.2%
주민세(양도소득 3
 
2.2%
기계장비 3
 
2.2%
승합 3
 
2.2%
기타승용 3
 
2.2%
승용 3
 
2.2%
주민세(재산분 3
 
2.2%
주민세(종업원분 3
 
2.2%
주민세(특별징수 3
 
2.2%
지방소득세(법인소득 3
 
2.2%
Other values (37) 105
77.8%

부과건수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct102
Distinct (%)75.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15933.378
Minimum0
Maximum258460
Zeros29
Zeros (%)21.5%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2024-01-10T05:53:12.890043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q15.5
median1151
Q310122.5
95-th percentile68658
Maximum258460
Range258460
Interquartile range (IQR)10117

Descriptive statistics

Standard deviation41041.292
Coefficient of variation (CV)2.5758061
Kurtosis24.114435
Mean15933.378
Median Absolute Deviation (MAD)1151
Skewness4.6337659
Sum2151006
Variance1.6843877 × 109
MonotonicityNot monotonic
2024-01-10T05:53:13.010695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 29
 
21.5%
12 4
 
3.0%
60 2
 
1.5%
1849 2
 
1.5%
85 1
 
0.7%
9 1
 
0.7%
9275 1
 
0.7%
457 1
 
0.7%
2 1
 
0.7%
36 1
 
0.7%
Other values (92) 92
68.1%
ValueCountFrequency (%)
0 29
21.5%
1 1
 
0.7%
2 1
 
0.7%
3 1
 
0.7%
4 1
 
0.7%
5 1
 
0.7%
6 1
 
0.7%
9 1
 
0.7%
12 4
 
3.0%
30 1
 
0.7%
ValueCountFrequency (%)
258460 1
0.7%
253402 1
0.7%
252635 1
0.7%
85902 1
0.7%
83966 1
0.7%
82454 1
0.7%
71374 1
0.7%
67494 1
0.7%
63582 1
0.7%
56416 1
0.7%

부과금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct106
Distinct (%)78.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.9546535 × 109
Minimum0
Maximum1.9005448 × 1010
Zeros29
Zeros (%)21.5%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2024-01-10T05:53:13.127575image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q12865500
median4.44987 × 108
Q34.2216655 × 109
95-th percentile1.1701818 × 1010
Maximum1.9005448 × 1010
Range1.9005448 × 1010
Interquartile range (IQR)4.2188 × 109

Descriptive statistics

Standard deviation4.2547541 × 109
Coefficient of variation (CV)1.4400179
Kurtosis1.4341549
Mean2.9546535 × 109
Median Absolute Deviation (MAD)4.44987 × 108
Skewness1.5128276
Sum3.9887823 × 1011
Variance1.8102932 × 1019
MonotonicityNot monotonic
2024-01-10T05:53:13.252821image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 29
 
21.5%
3445000 2
 
1.5%
8837983000 1
 
0.7%
4659715000 1
 
0.7%
2286000 1
 
0.7%
8134172000 1
 
0.7%
536684000 1
 
0.7%
2045000 1
 
0.7%
535546000 1
 
0.7%
2243599000 1
 
0.7%
Other values (96) 96
71.1%
ValueCountFrequency (%)
0 29
21.5%
5000 1
 
0.7%
384000 1
 
0.7%
574000 1
 
0.7%
2045000 1
 
0.7%
2286000 1
 
0.7%
3445000 2
 
1.5%
3764000 1
 
0.7%
3954000 1
 
0.7%
4482000 1
 
0.7%
ValueCountFrequency (%)
19005448000 1
0.7%
15420494000 1
0.7%
14210122000 1
0.7%
13814603000 1
0.7%
12897497000 1
0.7%
11869625000 1
0.7%
11827628000 1
0.7%
11647899000 1
0.7%
11397854000 1
0.7%
11170513000 1
0.7%

데이터기준일
Date

CONSTANT 

Distinct1
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
Minimum2019-12-31 00:00:00
Maximum2019-12-31 00:00:00
2024-01-10T05:53:13.352647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:53:13.431431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2024-01-10T05:53:11.413321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:53:11.263245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:53:11.481585image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:53:11.331994image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T05:53:13.500832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과세년도세목명세원 유형명부과건수부과금액
과세년도1.0000.0000.0000.0000.000
세목명0.0001.0001.0000.8470.638
세원 유형명0.0001.0001.0001.0000.886
부과건수0.0000.8471.0001.0000.785
부과금액0.0000.6380.8860.7851.000
2024-01-10T05:53:13.595540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과세년도세목명세원 유형명
과세년도1.0000.0000.000
세목명0.0001.0000.849
세원 유형명0.0000.8491.000
2024-01-10T05:53:13.675546image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
부과건수부과금액과세년도세목명세원 유형명
부과건수1.0000.7510.0000.6490.823
부과금액0.7511.0000.0000.3230.465
과세년도0.0000.0001.0000.0000.000
세목명0.6490.3230.0001.0000.849
세원 유형명0.8230.4650.0000.8491.000

Missing values

2024-01-10T05:53:11.581122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T05:53:11.696091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명자치단체코드과세년도세목명세원 유형명부과건수부과금액데이터기준일
0충청남도공주시441502017담배소비세담배소비세10788379830002019-12-31
1충청남도공주시441502017교육세교육세252635116478990002019-12-31
2충청남도공주시441502017도시계획세도시계획세002019-12-31
3충청남도공주시441502017취득세건축물113332487710002019-12-31
4충청남도공주시441502017취득세주택(개별)154035571040002019-12-31
5충청남도공주시441502017취득세주택(단독)175161057920002019-12-31
6충청남도공주시441502017취득세기타304449870002019-12-31
7충청남도공주시441502017취득세항공기367980002019-12-31
8충청남도공주시441502017취득세기계장비5467906000002019-12-31
9충청남도공주시441502017취득세차량929278022780002019-12-31
시도명시군구명자치단체코드과세년도세목명세원 유형명부과건수부과금액데이터기준일
125충청남도공주시441502019주민세주민세(개인균등)462704637820002019-12-31
126충청남도공주시441502019등록면허세등록면허세(면허)227033931250002019-12-31
127충청남도공주시441502019등록면허세등록면허세(등록)2764726584170002019-12-31
128충청남도공주시441502019지역자원시설세지역자원시설세(소방)3352920515100002019-12-31
129충청남도공주시441502019지역자원시설세지역자원시설세(특자)59403250002019-12-31
130충청남도공주시441502019지방소득세지방소득세(특별징수)20422102019330002019-12-31
131충청남도공주시441502019지방소득세지방소득세(법인소득)2000128974970002019-12-31
132충청남도공주시441502019지방소득세지방소득세(양도소득)139919695920002019-12-31
133충청남도공주시441502019지방소득세지방소득세(종합소득)980622102960002019-12-31
134충청남도공주시441502019체납체납7137444071080002019-12-31