Overview

Dataset statistics

Number of variables10
Number of observations238
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)0.4%
Total size in memory20.1 KiB
Average record size in memory86.6 B

Variable types

Categorical6
Numeric4

Dataset

Description체납액 규모별 체납 건수에 대한 데이터로 세목명, 체납액구간, 체납건수, 체납금액, 누적체납건수, 누적체납금액 등을 제공합니다
URLhttps://www.data.go.kr/data/15078434/fileData.do

Alerts

시도명 has constant value ""Constant
시군구명 has constant value ""Constant
자치단체코드 has constant value ""Constant
Dataset has 1 (0.4%) duplicate rowsDuplicates
체납건수 is highly overall correlated with 체납금액 and 1 other fieldsHigh correlation
체납금액 is highly overall correlated with 체납건수 and 1 other fieldsHigh correlation
누적체납건수 is highly overall correlated with 체납건수High correlation
누적체납금액 is highly overall correlated with 체납금액High correlation
체납건수 has 24 (10.1%) zerosZeros
체납금액 has 24 (10.1%) zerosZeros

Reproduction

Analysis started2023-12-12 13:02:32.079356
Analysis finished2023-12-12 13:02:33.964233
Duration1.88 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
경기도
238 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경기도
2nd row경기도
3rd row경기도
4th row경기도
5th row경기도

Common Values

ValueCountFrequency (%)
경기도 238
100.0%

Length

2023-12-12T22:02:34.025560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:02:34.119809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경기도 238
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
여주시
238 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row여주시
2nd row여주시
3rd row여주시
4th row여주시
5th row여주시

Common Values

ValueCountFrequency (%)
여주시 238
100.0%

Length

2023-12-12T22:02:34.232483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:02:34.318124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
여주시 238
100.0%

자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
41670
238 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row41670
2nd row41670
3rd row41670
4th row41670
5th row41670

Common Values

ValueCountFrequency (%)
41670 238
100.0%

Length

2023-12-12T22:02:34.413488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:02:34.820203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
41670 238
100.0%

과세년도
Categorical

Distinct5
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2021
89 
2019
44 
2020
44 
2018
31 
2017
30 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2017
2nd row2017
3rd row2017
4th row2017
5th row2017

Common Values

ValueCountFrequency (%)
2021 89
37.4%
2019 44
18.5%
2020 44
18.5%
2018 31
 
13.0%
2017 30
 
12.6%

Length

2023-12-12T22:02:34.931013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:02:35.072945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021 89
37.4%
2019 44
18.5%
2020 44
18.5%
2018 31
 
13.0%
2017 30
 
12.6%

세목명
Categorical

Distinct7
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
지방소득세
62 
취득세
52 
재산세
48 
주민세
35 
자동차세
23 
Other values (2)
18 

Length

Max length7
Median length3
Mean length3.7773109
Min length3

Unique

Unique1 ?
Unique (%)0.4%

Sample

1st row등록면허세
2nd row자동차세
3rd row자동차세
4th row자동차세
5th row자동차세

Common Values

ValueCountFrequency (%)
지방소득세 62
26.1%
취득세 52
21.8%
재산세 48
20.2%
주민세 35
14.7%
자동차세 23
 
9.7%
등록면허세 17
 
7.1%
지역자원시설세 1
 
0.4%

Length

2023-12-12T22:02:35.232950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:02:35.391743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
지방소득세 62
26.1%
취득세 52
21.8%
재산세 48
20.2%
주민세 35
14.7%
자동차세 23
 
9.7%
등록면허세 17
 
7.1%
지역자원시설세 1
 
0.4%

체납액구간
Categorical

Distinct20
Distinct (%)8.4%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
10만원 미만
37 
10만원 ~ 30만원 미만
24 
50만원 ~ 1백만원 미만
22 
30만원 ~ 50만원 미만
20 
3백만원 ~ 5백만원 미만
19 
Other values (15)
116 

Length

Max length14
Median length14
Mean length12.071429
Min length6

Unique

Unique1 ?
Unique (%)0.4%

Sample

1st row10만원 미만
2nd row10만원 미만
3rd row10만원~30만원미만
4th row30만원~50만원미만
5th row50만원~1백만원미만

Common Values

ValueCountFrequency (%)
10만원 미만 37
15.5%
10만원 ~ 30만원 미만 24
10.1%
50만원 ~ 1백만원 미만 22
9.2%
30만원 ~ 50만원 미만 20
8.4%
3백만원 ~ 5백만원 미만 19
 
8.0%
1백만원 ~ 3백만원 미만 17
 
7.1%
5백만원 ~ 1천만원 미만 15
 
6.3%
3천만원 ~ 5천만원 미만 12
 
5.0%
1천만원 ~ 3천만원 미만 12
 
5.0%
10만원~30만원미만 11
 
4.6%
Other values (10) 49
20.6%

Length

2023-12-12T22:02:35.551955image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
미만 184
25.5%
147
20.4%
10만원 61
 
8.4%
30만원 44
 
6.1%
50만원 42
 
5.8%
1백만원 39
 
5.4%
3백만원 36
 
5.0%
5백만원 34
 
4.7%
1천만원 27
 
3.7%
3천만원 24
 
3.3%
Other values (12) 84
11.6%

체납건수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct107
Distinct (%)45.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean341.87395
Minimum0
Maximum6509
Zeros24
Zeros (%)10.1%
Negative0
Negative (%)0.0%
Memory size2.2 KiB
2023-12-12T22:02:35.724105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q13
median14
Q391.75
95-th percentile1611.5
Maximum6509
Range6509
Interquartile range (IQR)88.75

Descriptive statistics

Standard deviation1070.1156
Coefficient of variation (CV)3.1301467
Kurtosis19.690196
Mean341.87395
Median Absolute Deviation (MAD)13
Skewness4.4104449
Sum81366
Variance1145147.5
MonotonicityNot monotonic
2023-12-12T22:02:35.887525image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 24
 
10.1%
0 24
 
10.1%
3 13
 
5.5%
7 10
 
4.2%
2 10
 
4.2%
4 8
 
3.4%
6 7
 
2.9%
14 5
 
2.1%
18 4
 
1.7%
24 4
 
1.7%
Other values (97) 129
54.2%
ValueCountFrequency (%)
0 24
10.1%
1 24
10.1%
2 10
4.2%
3 13
5.5%
4 8
 
3.4%
5 2
 
0.8%
6 7
 
2.9%
7 10
4.2%
8 2
 
0.8%
9 2
 
0.8%
ValueCountFrequency (%)
6509 1
0.4%
6333 1
0.4%
5780 1
0.4%
5775 1
0.4%
5664 1
0.4%
5274 1
0.4%
5207 1
0.4%
4820 1
0.4%
2254 1
0.4%
2240 1
0.4%

체납금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct214
Distinct (%)89.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean54357740
Minimum0
Maximum6.0016932 × 108
Zeros24
Zeros (%)10.1%
Negative0
Negative (%)0.0%
Memory size2.2 KiB
2023-12-12T22:02:36.063975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q14004225
median19427730
Q386620695
95-th percentile1.8689634 × 108
Maximum6.0016932 × 108
Range6.0016932 × 108
Interquartile range (IQR)82616470

Descriptive statistics

Standard deviation78526187
Coefficient of variation (CV)1.4446183
Kurtosis13.019617
Mean54357740
Median Absolute Deviation (MAD)19129170
Skewness2.9049874
Sum1.2937142 × 1010
Variance6.1663621 × 1015
MonotonicityNot monotonic
2023-12-12T22:02:36.221405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 24
 
10.1%
33808110 2
 
0.8%
3203080 1
 
0.4%
121647790 1
 
0.4%
103277010 1
 
0.4%
8114980 1
 
0.4%
89068880 1
 
0.4%
178309710 1
 
0.4%
4839150 1
 
0.4%
186258120 1
 
0.4%
Other values (204) 204
85.7%
ValueCountFrequency (%)
0 24
10.1%
14610 1
 
0.4%
148320 1
 
0.4%
448800 1
 
0.4%
470110 1
 
0.4%
474170 1
 
0.4%
500420 1
 
0.4%
537950 1
 
0.4%
964940 1
 
0.4%
1204600 1
 
0.4%
ValueCountFrequency (%)
600169320 1
0.4%
457319170 1
0.4%
381574540 1
0.4%
326001140 1
0.4%
237241230 1
0.4%
234716120 1
0.4%
210051360 1
0.4%
208685990 1
0.4%
208494710 1
0.4%
195603530 1
0.4%

누적체납건수
Real number (ℝ)

HIGH CORRELATION 

Distinct146
Distinct (%)61.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1084.8992
Minimum0
Maximum14674
Zeros2
Zeros (%)0.8%
Negative0
Negative (%)0.0%
Memory size2.2 KiB
2023-12-12T22:02:36.406020image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q18.25
median54.5
Q3220.75
95-th percentile8365.3
Maximum14674
Range14674
Interquartile range (IQR)212.5

Descriptive statistics

Standard deviation3012.7
Coefficient of variation (CV)2.7769401
Kurtosis9.7347104
Mean1084.8992
Median Absolute Deviation (MAD)52.5
Skewness3.2642096
Sum258206
Variance9076361.4
MonotonicityNot monotonic
2023-12-12T22:02:36.555034image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 18
 
7.6%
2 16
 
6.7%
3 7
 
2.9%
24 5
 
2.1%
5 5
 
2.1%
4 5
 
2.1%
9 5
 
2.1%
8 5
 
2.1%
18 4
 
1.7%
44 4
 
1.7%
Other values (136) 164
68.9%
ValueCountFrequency (%)
0 2
 
0.8%
1 18
7.6%
2 16
6.7%
3 7
 
2.9%
4 5
 
2.1%
5 5
 
2.1%
6 1
 
0.4%
7 1
 
0.4%
8 5
 
2.1%
9 5
 
2.1%
ValueCountFrequency (%)
14674 1
0.4%
13871 1
0.4%
13183 1
0.4%
13138 1
0.4%
13045 1
0.4%
12972 1
0.4%
12602 1
0.4%
12563 1
0.4%
12343 1
0.4%
12053 1
0.4%

누적체납금액
Real number (ℝ)

HIGH CORRELATION 

Distinct232
Distinct (%)97.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.3755083 × 108
Minimum0
Maximum9.8206199 × 108
Zeros2
Zeros (%)0.8%
Negative0
Negative (%)0.0%
Memory size2.2 KiB
2023-12-12T22:02:36.736773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile3698393.5
Q120764850
median62498665
Q31.7581601 × 108
95-th percentile5.2690418 × 108
Maximum9.8206199 × 108
Range9.8206199 × 108
Interquartile range (IQR)1.5505116 × 108

Descriptive statistics

Standard deviation1.8712403 × 108
Coefficient of variation (CV)1.3603991
Kurtosis6.6116938
Mean1.3755083 × 108
Median Absolute Deviation (MAD)51775890
Skewness2.4418154
Sum3.2737098 × 1010
Variance3.5015402 × 1016
MonotonicityNot monotonic
2023-12-12T22:02:36.896276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
148320 5
 
2.1%
0 2
 
0.8%
118879100 2
 
0.8%
525891900 1
 
0.4%
946854680 1
 
0.4%
24799560 1
 
0.4%
14642890 1
 
0.4%
367289420 1
 
0.4%
234952650 1
 
0.4%
86721160 1
 
0.4%
Other values (222) 222
93.3%
ValueCountFrequency (%)
0 2
 
0.8%
14610 1
 
0.4%
148320 5
2.1%
747940 1
 
0.4%
1204600 1
 
0.4%
1928690 1
 
0.4%
3604460 1
 
0.4%
3714970 1
 
0.4%
3768400 1
 
0.4%
4104880 1
 
0.4%
ValueCountFrequency (%)
982061990 1
0.4%
946854680 1
0.4%
928793770 1
0.4%
925764580 1
0.4%
847611830 1
0.4%
846958250 1
0.4%
740541810 1
0.4%
725927720 1
0.4%
600169320 1
0.4%
546129430 1
0.4%

Interactions

2023-12-12T22:02:33.405582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:02:32.377321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:02:32.707007image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:02:33.045445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:02:33.492245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:02:32.461044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:02:32.787886image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:02:33.138045image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:02:33.576891image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:02:32.545864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:02:32.867919image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:02:33.223222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:02:33.658502image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:02:32.629458image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:02:32.950937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:02:33.319513image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T22:02:37.019732image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과세년도세목명체납액구간체납건수체납금액누적체납건수누적체납금액
과세년도1.0000.0000.7050.0000.1840.0860.062
세목명0.0001.0000.0000.5750.2630.4280.459
체납액구간0.7050.0001.0000.2190.4300.2900.563
체납건수0.0000.5750.2191.0000.4230.7980.694
체납금액0.1840.2630.4300.4231.0000.1060.865
누적체납건수0.0860.4280.2900.7980.1061.0000.661
누적체납금액0.0620.4590.5630.6940.8650.6611.000
2023-12-12T22:02:37.135176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
체납액구간과세년도세목명
체납액구간1.0000.3720.000
과세년도0.3721.0000.000
세목명0.0000.0001.000
2023-12-12T22:02:37.233496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
체납건수체납금액누적체납건수누적체납금액과세년도세목명체납액구간
체납건수1.0000.5120.9480.4410.0000.2310.091
체납금액0.5121.0000.3410.8710.1120.1430.180
누적체납건수0.9480.3411.0000.3530.0470.2410.114
누적체납금액0.4410.8710.3531.0000.0220.2510.206
과세년도0.0000.1120.0470.0221.0000.0000.372
세목명0.2310.1430.2410.2510.0001.0000.000
체납액구간0.0910.1800.1140.2060.3720.0001.000

Missing values

2023-12-12T22:02:33.769949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T22:02:33.908749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명자치단체코드과세년도세목명체납액구간체납건수체납금액누적체납건수누적체납금액
0경기도여주시416702017등록면허세10만원 미만22632030806789365490
1경기도여주시416702017자동차세10만원 미만884383341206773277594870
2경기도여주시416702017자동차세10만원~30만원미만6911122614304451725927720
3경기도여주시416702017자동차세30만원~50만원미만25840641012042197350
4경기도여주시416702017자동차세50만원~1백만원미만15379503019952320
5경기도여주시416702017재산세10만원 미만101027659720297384656390
6경기도여주시416702017재산세10만원~30만원미만881409820037459887720
7경기도여주시416702017재산세1백만원~3백만원미만16287714305896131350
8경기도여주시416702017재산세30만원~50만원미만625520303813515180
9경기도여주시416702017재산세3백만원~5백만원미만310632230827853350
시도명시군구명자치단체코드과세년도세목명체납액구간체납건수체납금액누적체납건수누적체납금액
228경기도여주시416702021취득세10만원 미만3415599401355774900
229경기도여주시416702021취득세10만원 ~ 30만원 미만1833261009216691560
230경기도여주시416702021취득세30만원 ~ 50만원 미만83166500207989840
231경기도여주시416702021취득세50만원 ~ 1백만원 미만16123016306546730050
232경기도여주시416702021취득세1백만원 ~ 3백만원 미만18285464205693635620
233경기도여주시416702021취득세3백만원 ~ 5백만원 미만6208663201555486710
234경기도여주시416702021취득세5백만원 ~ 1천만원 미만743588800959080570
235경기도여주시416702021취득세1천만원 ~ 3천만원 미만59732940010176861760
236경기도여주시416702021취득세3천만원 ~ 5천만원 미만146919110146919110
237경기도여주시416702021취득세5천만원 ~ 1억원 미만1969601002188555920

Duplicate rows

Most frequently occurring

시도명시군구명자치단체코드과세년도세목명체납액구간체납건수체납금액누적체납건수누적체납금액# duplicates
0경기도여주시416702021등록면허세10만원 ~ 30만원 미만0011483202