Overview

Dataset statistics

Number of variables10
Number of observations45
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.9 KiB
Average record size in memory88.9 B

Variable types

Categorical6
Numeric4

Dataset

Description지방세 체납액 규모별 체납 건수를 납세자 유형별(체납액 구간별, 체납금액, 누적체납금액 등)로 제공하는 정보입니다.
URLhttps://www.data.go.kr/data/15079429/fileData.do

Alerts

시도명 has constant value ""Constant
시군구명 has constant value ""Constant
자치단체코드 has constant value ""Constant
체납건수 is highly overall correlated with 누적체납건수High correlation
체납금액 is highly overall correlated with 누적체납금액High correlation
누적체납건수 is highly overall correlated with 체납건수High correlation
누적체납금액 is highly overall correlated with 체납금액High correlation
과세년도 is highly imbalanced (56.7%)Imbalance
체납금액 has unique valuesUnique
누적체납금액 has unique valuesUnique

Reproduction

Analysis started2023-12-12 22:46:47.254193
Analysis finished2023-12-12 22:46:49.699511
Duration2.45 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size492.0 B
경기도
45 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경기도
2nd row경기도
3rd row경기도
4th row경기도
5th row경기도

Common Values

ValueCountFrequency (%)
경기도 45
100.0%

Length

2023-12-13T07:46:49.758646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:46:49.843082image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경기도 45
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size492.0 B
하남시
45 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row하남시
2nd row하남시
3rd row하남시
4th row하남시
5th row하남시

Common Values

ValueCountFrequency (%)
하남시 45
100.0%

Length

2023-12-13T07:46:49.948580image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:46:50.045783image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
하남시 45
100.0%

자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size492.0 B
41450
45 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row41450
2nd row41450
3rd row41450
4th row41450
5th row41450

Common Values

ValueCountFrequency (%)
41450 45
100.0%

Length

2023-12-13T07:46:50.155886image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:46:50.253991image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
41450 45
100.0%

과세년도
Categorical

IMBALANCE 

Distinct2
Distinct (%)4.4%
Missing0
Missing (%)0.0%
Memory size492.0 B
2022
41 
2021
 
4

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022
2nd row2022
3rd row2022
4th row2022
5th row2022

Common Values

ValueCountFrequency (%)
2022 41
91.1%
2021 4
 
8.9%

Length

2023-12-13T07:46:50.357574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:46:50.456954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022 41
91.1%
2021 4
 
8.9%

세목명
Categorical

Distinct7
Distinct (%)15.6%
Missing0
Missing (%)0.0%
Memory size492.0 B
취득세
14 
지방소득세
11 
재산세
10 
자동차세
주민세
Other values (2)

Length

Max length7
Median length3
Mean length3.7111111
Min length3

Unique

Unique2 ?
Unique (%)4.4%

Sample

1st row등록면허세
2nd row자동차세
3rd row자동차세
4th row자동차세
5th row자동차세

Common Values

ValueCountFrequency (%)
취득세 14
31.1%
지방소득세 11
24.4%
재산세 10
22.2%
자동차세 4
 
8.9%
주민세 4
 
8.9%
등록면허세 1
 
2.2%
지역자원시설세 1
 
2.2%

Length

2023-12-13T07:46:50.581167image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:46:50.710626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
취득세 14
31.1%
지방소득세 11
24.4%
재산세 10
22.2%
자동차세 4
 
8.9%
주민세 4
 
8.9%
등록면허세 1
 
2.2%
지역자원시설세 1
 
2.2%

체납액구간
Categorical

Distinct13
Distinct (%)28.9%
Missing0
Missing (%)0.0%
Memory size492.0 B
10만원 미만
50만원~1백만원미만
10만원~30만원미만
30만원~50만원미만
5백만원~1천만원미만
Other values (8)
19 

Length

Max length11
Median length11
Mean length10.177778
Min length7

Unique

Unique3 ?
Unique (%)6.7%

Sample

1st row10만원 미만
2nd row10만원 미만
3rd row10만원~30만원미만
4th row30만원~50만원미만
5th row50만원~1백만원미만

Common Values

ValueCountFrequency (%)
10만원 미만 7
15.6%
50만원~1백만원미만 6
13.3%
10만원~30만원미만 5
11.1%
30만원~50만원미만 4
8.9%
5백만원~1천만원미만 4
8.9%
5천만원~1억원미만 4
8.9%
1백만원~3백만원미만 3
6.7%
1천만원~3천만원미만 3
6.7%
3백만원~5백만원미만 3
6.7%
3천만원~5천만원미만 3
6.7%
Other values (3) 3
6.7%

Length

2023-12-13T07:46:50.844262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
10만원 7
13.5%
미만 7
13.5%
50만원~1백만원미만 6
11.5%
10만원~30만원미만 5
9.6%
30만원~50만원미만 4
7.7%
5백만원~1천만원미만 4
7.7%
5천만원~1억원미만 4
7.7%
1백만원~3백만원미만 3
5.8%
1천만원~3천만원미만 3
5.8%
3백만원~5백만원미만 3
5.8%
Other values (4) 6
11.5%

체납건수
Real number (ℝ)

HIGH CORRELATION 

Distinct34
Distinct (%)75.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean583.86667
Minimum1
Maximum10118
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size537.0 B
2023-12-13T07:46:50.966437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q14
median13
Q3227
95-th percentile2226.2
Maximum10118
Range10117
Interquartile range (IQR)223

Descriptive statistics

Standard deviation1611.289
Coefficient of variation (CV)2.7596865
Kurtosis28.847204
Mean583.86667
Median Absolute Deviation (MAD)12
Skewness5.004812
Sum26274
Variance2596252.1
MonotonicityNot monotonic
2023-12-13T07:46:51.090543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=34)
ValueCountFrequency (%)
1 4
 
8.9%
2 4
 
8.9%
3 3
 
6.7%
5 3
 
6.7%
8 2
 
4.4%
12 1
 
2.2%
608 1
 
2.2%
177 1
 
2.2%
50 1
 
2.2%
227 1
 
2.2%
Other values (24) 24
53.3%
ValueCountFrequency (%)
1 4
8.9%
2 4
8.9%
3 3
6.7%
4 1
 
2.2%
5 3
6.7%
6 1
 
2.2%
7 1
 
2.2%
8 2
4.4%
10 1
 
2.2%
11 1
 
2.2%
ValueCountFrequency (%)
10118 1
2.2%
2271 1
2.2%
2227 1
2.2%
2223 1
2.2%
2159 1
2.2%
1655 1
2.2%
1363 1
2.2%
1078 1
2.2%
1024 1
2.2%
608 1
2.2%

체납금액
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct45
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.2138679 × 108
Minimum64390
Maximum1.2193512 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size537.0 B
2023-12-13T07:46:51.217244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum64390
5-th percentile524644
Q129155680
median95384100
Q32.9299254 × 108
95-th percentile7.3150273 × 108
Maximum1.2193512 × 109
Range1.2192868 × 109
Interquartile range (IQR)2.6383686 × 108

Descriptive statistics

Standard deviation2.7186362 × 108
Coefficient of variation (CV)1.2280029
Kurtosis3.4089693
Mean2.2138679 × 108
Median Absolute Deviation (MAD)93945240
Skewness1.8116607
Sum9.9624055 × 109
Variance7.390983 × 1016
MonotonicityNot monotonic
2023-12-13T07:46:51.342808image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=45)
ValueCountFrequency (%)
25779480 1
 
2.2%
68158750 1
 
2.2%
149763960 1
 
2.2%
481332840 1
 
2.2%
154071210 1
 
2.2%
277021260 1
 
2.2%
713114880 1
 
2.2%
64390 1
 
2.2%
343280 1
 
2.2%
482320 1
 
2.2%
Other values (35) 35
77.8%
ValueCountFrequency (%)
64390 1
2.2%
343280 1
2.2%
482320 1
2.2%
693940 1
2.2%
1010760 1
2.2%
1438860 1
2.2%
2396530 1
2.2%
4663650 1
2.2%
7421770 1
2.2%
11340690 1
2.2%
ValueCountFrequency (%)
1219351240 1
2.2%
866587360 1
2.2%
736099690 1
2.2%
713114880 1
2.2%
689895230 1
2.2%
514009130 1
2.2%
481332840 1
2.2%
417122600 1
2.2%
415974160 1
2.2%
401055660 1
2.2%

누적체납건수
Real number (ℝ)

HIGH CORRELATION 

Distinct39
Distinct (%)86.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1409
Minimum2
Maximum21326
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size537.0 B
2023-12-13T07:46:51.464479image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile3
Q114
median40
Q3605
95-th percentile8331.4
Maximum21326
Range21324
Interquartile range (IQR)591

Descriptive statistics

Standard deviation3736.5088
Coefficient of variation (CV)2.651887
Kurtosis19.063215
Mean1409
Median Absolute Deviation (MAD)37
Skewness4.0960964
Sum63405
Variance13961498
MonotonicityNot monotonic
2023-12-13T07:46:51.573320image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=39)
ValueCountFrequency (%)
14 2
 
4.4%
2 2
 
4.4%
11 2
 
4.4%
3 2
 
4.4%
23 2
 
4.4%
4 2
 
4.4%
162 1
 
2.2%
160 1
 
2.2%
420 1
 
2.2%
179 1
 
2.2%
Other values (29) 29
64.4%
ValueCountFrequency (%)
2 2
4.4%
3 2
4.4%
4 2
4.4%
9 1
2.2%
11 2
4.4%
13 1
2.2%
14 2
4.4%
15 1
2.2%
17 1
2.2%
18 1
2.2%
ValueCountFrequency (%)
21326 1
2.2%
9946 1
2.2%
9237 1
2.2%
4709 1
2.2%
3808 1
2.2%
3246 1
2.2%
3240 1
2.2%
1345 1
2.2%
1320 1
2.2%
1295 1
2.2%

누적체납금액
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct45
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.1229086 × 108
Minimum111850
Maximum2.7622028 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size537.0 B
2023-12-13T07:46:51.690929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum111850
5-th percentile4331338
Q163875690
median2.4482493 × 108
Q37.0372516 × 108
95-th percentile1.8746933 × 109
Maximum2.7622028 × 109
Range2.762091 × 109
Interquartile range (IQR)6.3984947 × 108

Descriptive statistics

Standard deviation6.4677733 × 108
Coefficient of variation (CV)1.2625197
Kurtosis2.7567479
Mean5.1229086 × 108
Median Absolute Deviation (MAD)2.1620659 × 108
Skewness1.7616857
Sum2.3053089 × 1010
Variance4.1832091 × 1017
MonotonicityNot monotonic
2023-12-13T07:46:51.819777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=45)
ValueCountFrequency (%)
63875690 1
 
2.2%
193441170 1
 
2.2%
704354860 1
 
2.2%
1581723630 1
 
2.2%
409029010 1
 
2.2%
1155872650 1
 
2.2%
2045541100 1
 
2.2%
111850 1
 
2.2%
1308810 1
 
2.2%
4841690 1
 
2.2%
Other values (35) 35
77.8%
ValueCountFrequency (%)
111850 1
2.2%
1308810 1
2.2%
4203750 1
2.2%
4841690 1
2.2%
10708440 1
2.2%
19245090 1
2.2%
27043520 1
2.2%
28618340 1
2.2%
34260380 1
2.2%
45019920 1
2.2%
ValueCountFrequency (%)
2762202810 1
2.2%
2045541100 1
2.2%
1916876190 1
2.2%
1705961980 1
2.2%
1581723630 1
2.2%
1281484160 1
2.2%
1219351240 1
2.2%
1155872650 1
2.2%
1035439370 1
2.2%
882190550 1
2.2%

Interactions

2023-12-13T07:46:48.764969image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:46:47.583453image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:46:48.001444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:46:48.389881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:46:49.186164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:46:47.703759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:46:48.107824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:46:48.497603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:46:49.280715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:46:47.805566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:46:48.211318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:46:48.600185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:46:49.364039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:46:47.904584image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:46:48.294214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:46:48.669914image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T07:46:51.904747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과세년도세목명체납액구간체납건수체납금액누적체납건수누적체납금액
과세년도1.0000.2890.3820.0000.3430.0000.000
세목명0.2891.0000.0000.5700.0000.5960.000
체납액구간0.3820.0001.0000.0000.8060.0000.592
체납건수0.0000.5700.0001.0000.0000.8500.000
체납금액0.3430.0000.8060.0001.0000.0000.935
누적체납건수0.0000.5960.0000.8500.0001.0000.000
누적체납금액0.0000.0000.5920.0000.9350.0001.000
2023-12-13T07:46:52.001707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과세년도체납액구간세목명
과세년도1.0000.2980.286
체납액구간0.2981.0000.000
세목명0.2860.0001.000
2023-12-13T07:46:52.092742image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
체납건수체납금액누적체납건수누적체납금액과세년도세목명체납액구간
체납건수1.0000.2800.9240.2930.0000.4110.000
체납금액0.2801.0000.1520.9500.3100.0000.490
누적체납건수0.9240.1521.0000.2220.0000.4230.000
누적체납금액0.2930.9500.2221.0000.0000.0000.285
과세년도0.0000.3100.0000.0001.0000.2860.298
세목명0.4110.0000.4230.0000.2861.0000.000
체납액구간0.0000.4900.0000.2850.2980.0001.000

Missing values

2023-12-13T07:46:49.485504image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T07:46:49.649795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명자치단체코드과세년도세목명체납액구간체납건수체납금액누적체납건수누적체납금액
0경기도하남시414502022등록면허세10만원 미만136325779480324063875690
1경기도하남시414502022자동차세10만원 미만2227953841009237394479220
2경기도하남시414502022자동차세10만원~30만원미만227140105566099461705961980
3경기도하남시414502022자동차세30만원~50만원미만18867003910657230640380
4경기도하남시414502022자동차세50만원~1백만원미만846636505534260380
5경기도하남시414502022재산세10만원 미만1655675047203808147404700
6경기도하남시414502022재산세10만원~30만원미만21594171226003246614376220
7경기도하남시414502022재산세1백만원~3백만원미만357514009130477703725160
8경기도하남시414502022재산세1천만원~3천만원미만2435753578034499626620
9경기도하남시414502022재산세30만원~50만원미만10784159741601320508318570
시도명시군구명자치단체코드과세년도세목명체납액구간체납건수체납금액누적체납건수누적체납금액
35경기도하남시414502022취득세3백만원~5백만원미만8291556801347889090
36경기도하남시414502022취득세3억원~5억원미만3121935124031219351240
37경기도하남시414502022취득세3천만원~5천만원미만2766509504154402250
38경기도하남시414502022취득세50만원~1백만원미만16939401510708440
39경기도하남시414502022취득세5백만원~1천만원미만138896961018123398110
40경기도하남시414502022취득세5천만원~1억원미만1894842603244824930
41경기도하남시414502021취득세50만원~1백만원미만323965302719245090
42경기도하남시414502021취득세5백만원~1천만원미만7524764801185709810
43경기도하남시414502021취득세5억원~10억원미만173609969021281484160
44경기도하남시414502021취득세5천만원~1억원미만1863394702148246510