Overview

Dataset statistics

Number of variables9
Number of observations40
Missing cells2
Missing cells (%)0.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.2 KiB
Average record size in memory81.3 B

Variable types

Categorical4
Numeric5

Dataset

Description과세액 중 비과세액과 감면액이 차지하는 비율 현황에 대한 데이터로 비과세 금액, 감면 금액, 비과세 감면율 등의 항목을 제공합니다.
URLhttps://www.data.go.kr/data/15078628/fileData.do

Alerts

시도명 has constant value ""Constant
시군구명 has constant value ""Constant
자치단체코드 has constant value ""Constant
비과세금액 is highly overall correlated with 감면금액 and 2 other fieldsHigh correlation
감면금액 is highly overall correlated with 비과세금액 and 3 other fieldsHigh correlation
부과금액 is highly overall correlated with 비과세금액 and 3 other fieldsHigh correlation
비과세감면율 is highly overall correlated with 비과세금액 and 3 other fieldsHigh correlation
세목명 is highly overall correlated with 감면금액 and 2 other fieldsHigh correlation
비과세금액 has 2 (5.0%) missing valuesMissing
감면금액 has unique valuesUnique
비과세금액 has 2 (5.0%) zerosZeros
부과금액 has 2 (5.0%) zerosZeros
비과세감면율 has 3 (7.5%) zerosZeros

Reproduction

Analysis started2023-12-12 08:11:53.918161
Analysis finished2023-12-12 08:11:57.528949
Duration3.61 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)2.5%
Missing0
Missing (%)0.0%
Memory size452.0 B
대전광역시
40 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대전광역시
2nd row대전광역시
3rd row대전광역시
4th row대전광역시
5th row대전광역시

Common Values

ValueCountFrequency (%)
대전광역시 40
100.0%

Length

2023-12-12T17:11:57.587939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:11:57.674779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대전광역시 40
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)2.5%
Missing0
Missing (%)0.0%
Memory size452.0 B
대덕구
40 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대덕구
2nd row대덕구
3rd row대덕구
4th row대덕구
5th row대덕구

Common Values

ValueCountFrequency (%)
대덕구 40
100.0%

Length

2023-12-12T17:11:57.793351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:11:57.911924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대덕구 40
100.0%

자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)2.5%
Missing0
Missing (%)0.0%
Memory size452.0 B
30230
40 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row30230
2nd row30230
3rd row30230
4th row30230
5th row30230

Common Values

ValueCountFrequency (%)
30230 40
100.0%

Length

2023-12-12T17:11:58.030785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:11:58.129697image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
30230 40
100.0%

세목명
Categorical

HIGH CORRELATION 

Distinct8
Distinct (%)20.0%
Missing0
Missing (%)0.0%
Memory size452.0 B
재산세
주민세
취득세
자동차세
등록면허세
Other values (3)
10 

Length

Max length7
Median length3
Mean length4.05
Min length3

Unique

Unique1 ?
Unique (%)2.5%

Sample

1st row재산세
2nd row주민세
3rd row취득세
4th row자동차세
5th row등록면허세

Common Values

ValueCountFrequency (%)
재산세 6
15.0%
주민세 6
15.0%
취득세 6
15.0%
자동차세 6
15.0%
등록면허세 6
15.0%
지역자원시설세 6
15.0%
등록세 3
7.5%
교육세 1
 
2.5%

Length

2023-12-12T17:11:58.259499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:11:58.437069image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
재산세 6
15.0%
주민세 6
15.0%
취득세 6
15.0%
자동차세 6
15.0%
등록면허세 6
15.0%
지역자원시설세 6
15.0%
등록세 3
7.5%
교육세 1
 
2.5%

과세년도
Real number (ℝ)

Distinct6
Distinct (%)15.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2019.575
Minimum2017
Maximum2022
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size492.0 B
2023-12-12T17:11:58.590721image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2017
5-th percentile2017
Q12018
median2020
Q32021
95-th percentile2022
Maximum2022
Range5
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.7229596
Coefficient of variation (CV)0.00085312979
Kurtosis-1.2813987
Mean2019.575
Median Absolute Deviation (MAD)1.5
Skewness-0.054438778
Sum80783
Variance2.9685897
MonotonicityIncreasing
2023-12-12T17:11:58.717350image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
2018 7
17.5%
2020 7
17.5%
2021 7
17.5%
2022 7
17.5%
2017 6
15.0%
2019 6
15.0%
ValueCountFrequency (%)
2017 6
15.0%
2018 7
17.5%
2019 6
15.0%
2020 7
17.5%
2021 7
17.5%
2022 7
17.5%
ValueCountFrequency (%)
2022 7
17.5%
2021 7
17.5%
2020 7
17.5%
2019 6
15.0%
2018 7
17.5%
2017 6
15.0%

비과세금액
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct37
Distinct (%)97.4%
Missing2
Missing (%)5.0%
Infinite0
Infinite (%)0.0%
Mean2.3012277 × 109
Minimum0
Maximum1.332034 × 1010
Zeros2
Zeros (%)5.0%
Negative0
Negative (%)0.0%
Memory size492.0 B
2023-12-12T17:11:58.870013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile27132000
Q11.6191625 × 108
median7.216595 × 108
Q31.2970015 × 109
95-th percentile1.151578 × 1010
Maximum1.332034 × 1010
Range1.332034 × 1010
Interquartile range (IQR)1.1350852 × 109

Descriptive statistics

Standard deviation4.078103 × 109
Coefficient of variation (CV)1.7721424
Kurtosis2.0541979
Mean2.3012277 × 109
Median Absolute Deviation (MAD)5.740865 × 108
Skewness1.9253386
Sum8.7446651 × 1010
Variance1.6630924 × 1019
MonotonicityNot monotonic
2023-12-12T17:11:59.027761image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=37)
ValueCountFrequency (%)
0 2
 
5.0%
229811000 1
 
2.5%
2465900000 1
 
2.5%
871097000 1
 
2.5%
153323000 1
 
2.5%
31920000 1
 
2.5%
199272000 1
 
2.5%
12394028000 1
 
2.5%
949943000 1
 
2.5%
187696000 1
 
2.5%
Other values (27) 27
67.5%
(Missing) 2
 
5.0%
ValueCountFrequency (%)
0 2
5.0%
31920000 1
2.5%
32009000 1
2.5%
32591000 1
2.5%
32636000 1
2.5%
32840000 1
2.5%
33872000 1
2.5%
141823000 1
2.5%
153323000 1
2.5%
187696000 1
2.5%
ValueCountFrequency (%)
13320340000 1
2.5%
12394028000 1
2.5%
11360795000 1
2.5%
10955760000 1
2.5%
10528394000 1
2.5%
10119369000 1
2.5%
2465900000 1
2.5%
1822250000 1
2.5%
1617392000 1
2.5%
1367741000 1
2.5%

감면금액
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct40
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.8691598 × 109
Minimum3000
Maximum1.810265 × 1010
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size492.0 B
2023-12-12T17:11:59.203606image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3000
5-th percentile23350
Q11.45307 × 108
median4.82627 × 108
Q33.1096948 × 109
95-th percentile1.4034868 × 1010
Maximum1.810265 × 1010
Range1.8102647 × 1010
Interquartile range (IQR)2.9643878 × 109

Descriptive statistics

Standard deviation5.0328715 × 109
Coefficient of variation (CV)1.7541273
Kurtosis2.5604148
Mean2.8691598 × 109
Median Absolute Deviation (MAD)4.53367 × 108
Skewness1.9770217
Sum1.1476639 × 1011
Variance2.5329796 × 1019
MonotonicityNot monotonic
2023-12-12T17:11:59.348699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=40)
ValueCountFrequency (%)
3000768000 1
 
2.5%
544559000 1
 
2.5%
1120677000 1
 
2.5%
32247000 1
 
2.5%
174818000 1
 
2.5%
24000 1
 
2.5%
3271693000 1
 
2.5%
455566000 1
 
2.5%
15368421000 1
 
2.5%
963905000 1
 
2.5%
Other values (30) 30
75.0%
ValueCountFrequency (%)
3000 1
2.5%
11000 1
2.5%
24000 1
2.5%
11817000 1
2.5%
26273000 1
2.5%
32247000 1
2.5%
38731000 1
2.5%
41726000 1
2.5%
47592000 1
2.5%
80204000 1
2.5%
ValueCountFrequency (%)
18102650000 1
2.5%
15368421000 1
2.5%
13964681000 1
2.5%
13465424000 1
2.5%
13319432000 1
2.5%
11420138000 1
2.5%
3466900000 1
2.5%
3271693000 1
2.5%
3266743000 1
2.5%
3172841000 1
2.5%

부과금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct39
Distinct (%)97.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.5746261 × 1010
Minimum0
Maximum4.6374492 × 1010
Zeros2
Zeros (%)5.0%
Negative0
Negative (%)0.0%
Memory size492.0 B
2023-12-12T17:11:59.576184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile21475700
Q15.85626 × 109
median1.2759749 × 1010
Q32.4761778 × 1010
95-th percentile4.0949131 × 1010
Maximum4.6374492 × 1010
Range4.6374492 × 1010
Interquartile range (IQR)1.8905518 × 1010

Descriptive statistics

Standard deviation1.2564476 × 1010
Coefficient of variation (CV)0.79793394
Kurtosis-0.10869345
Mean1.5746261 × 1010
Median Absolute Deviation (MAD)8.3194695 × 109
Skewness0.88117519
Sum6.2985045 × 1011
Variance1.5786606 × 1020
MonotonicityNot monotonic
2023-12-12T17:11:59.776631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=39)
ValueCountFrequency (%)
0 2
 
5.0%
27369120000 1
 
2.5%
7008908000 1
 
2.5%
15126492000 1
 
2.5%
6511144000 1
 
2.5%
4148051000 1
 
2.5%
32596251000 1
 
2.5%
12421801000 1
 
2.5%
40802813000 1
 
2.5%
15629288000 1
 
2.5%
Other values (29) 29
72.5%
ValueCountFrequency (%)
0 2
5.0%
22606000 1
2.5%
4040279000 1
2.5%
4142946000 1
2.5%
4148051000 1
2.5%
4176036000 1
2.5%
4263387000 1
2.5%
4424012000 1
2.5%
5491511000 1
2.5%
5977843000 1
2.5%
ValueCountFrequency (%)
46374492000 1
2.5%
43729172000 1
2.5%
40802813000 1
2.5%
34954267000 1
2.5%
32596251000 1
2.5%
30704564000 1
2.5%
29771128000 1
2.5%
28844724000 1
2.5%
28720494000 1
2.5%
27369120000 1
2.5%

비과세감면율
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct37
Distinct (%)92.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20.8395
Minimum0
Maximum70.42
Zeros3
Zeros (%)7.5%
Negative0
Negative (%)0.0%
Memory size492.0 B
2023-12-12T17:11:59.956721image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q17.35
median9.9
Q344.635
95-th percentile61.6725
Maximum70.42
Range70.42
Interquartile range (IQR)37.285

Descriptive statistics

Standard deviation21.589008
Coefficient of variation (CV)1.0359658
Kurtosis-0.491117
Mean20.8395
Median Absolute Deviation (MAD)8.635
Skewness0.97059145
Sum833.58
Variance466.08528
MonotonicityNot monotonic
2023-12-12T17:12:00.119018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=37)
ValueCountFrequency (%)
0.0 3
 
7.5%
9.06 2
 
5.0%
47.94 1
 
2.5%
43.71 1
 
2.5%
26.5 1
 
2.5%
8.42 1
 
2.5%
0.99 1
 
2.5%
9.02 1
 
2.5%
48.06 1
 
2.5%
11.31 1
 
2.5%
Other values (27) 27
67.5%
ValueCountFrequency (%)
0.0 3
7.5%
0.84 1
 
2.5%
0.99 1
 
2.5%
1.11 1
 
2.5%
1.23 1
 
2.5%
1.3 1
 
2.5%
1.91 1
 
2.5%
7.29 1
 
2.5%
7.37 1
 
2.5%
8.42 1
 
2.5%
ValueCountFrequency (%)
70.42 1
2.5%
68.37 1
2.5%
61.32 1
2.5%
52.27 1
2.5%
48.06 1
2.5%
48.03 1
2.5%
47.94 1
2.5%
47.64 1
2.5%
47.46 1
2.5%
47.41 1
2.5%

Interactions

2023-12-12T17:11:56.492821image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:11:54.176778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:11:54.680999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:11:55.321698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:11:55.959675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:11:56.583886image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:11:54.275748image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:11:54.792894image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:11:55.451136image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:11:56.056998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:11:56.683019image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:11:54.367417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:11:54.963620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:11:55.583598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:11:56.174808image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:11:56.793869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:11:54.453484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:11:55.081174image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:11:55.724437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:11:56.292908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:11:57.203928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:11:54.556783image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:11:55.178969image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:11:55.827473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:11:56.384086image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T17:12:00.245603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
세목명과세년도비과세금액감면금액부과금액비과세감면율
세목명1.0000.0000.6930.7240.9090.913
과세년도0.0001.0000.0000.0000.0000.000
비과세금액0.6930.0001.0000.7350.9590.699
감면금액0.7240.0000.7351.0000.8870.897
부과금액0.9090.0000.9590.8871.0000.843
비과세감면율0.9130.0000.6990.8970.8431.000
2023-12-12T17:12:00.367101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과세년도비과세금액감면금액부과금액비과세감면율세목명
과세년도1.000-0.043-0.0700.136-0.1940.000
비과세금액-0.0431.0000.7760.6990.7800.492
감면금액-0.0700.7761.0000.8760.7730.503
부과금액0.1360.6990.8761.0000.6120.719
비과세감면율-0.1940.7800.7730.6121.0000.547
세목명0.0000.4920.5030.7190.5471.000

Missing values

2023-12-12T17:11:57.336748image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:11:57.479089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명자치단체코드세목명과세년도비과세금액감면금액부과금액비과세감면율
0대전광역시대덕구30230재산세20171011936900030007680002736912000047.94
1대전광역시대덕구30230주민세201710535660004816620001109015000013.84
2대전광역시대덕구30230취득세20171617392000181026500002884472400068.37
3대전광역시대덕구30230자동차세20178769580006852610001520811100010.27
4대전광역시대덕구30230등록면허세2017338720008020400059778430001.91
5대전광역시대덕구30230지역자원시설세201719309700019174200040402790009.53
6대전광역시대덕구30230등록세2018<NA>1100000.0
7대전광역시대덕구30230재산세20181052839400030886460002872049400047.41
8대전광역시대덕구30230주민세201810027400004768330001155839500012.8
9대전광역시대덕구30230취득세2018687458000139646810002389266400061.32
시도명시군구명자치단체코드세목명과세년도비과세금액감면금액부과금액비과세감면율
30대전광역시대덕구30230자동차세2021187696000963905000156292880007.37
31대전광역시대덕구30230등록면허세2021328400002627300070089080000.84
32대전광역시대덕구30230지역자원시설세202122055900017138900042633870009.19
33대전광역시대덕구30230교육세202203000134199060000.0
34대전광역시대덕구30230재산세20221332034000034669000003495426700048.03
35대전광역시대덕구30230주민세202210847830004835920001309769700011.97
36대전광역시대덕구30230취득세20221822250000133194320004372917200034.63
37대전광역시대덕구30230자동차세2022227188000924116000158024880007.29
38대전광역시대덕구30230등록면허세2022320090004759200071888400001.11
39대전광역시대덕구30230지역자원시설세202222981100017121200044240120009.06