Overview

Dataset statistics

Number of variables10
Number of observations80
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.8 KiB
Average record size in memory87.7 B

Variable types

Categorical8
Numeric2

Dataset

Description체납액 규모별 체납건수를 납세자 유형별로 제공, 체납정책 수립시 기초자료로 활용(과세년도, 세목명, 체납액구간, 체납건수, 체납금액, 누적체납건수, 누적 체납금액 데이터 제공)
Author인천광역시
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15079761&srcSe=7661IVAWM27C61E190

Alerts

시도명 has constant value ""Constant
시군구명 has constant value ""Constant
자치단체코드 has constant value ""Constant
체납건수 has constant value ""Constant
체납금액 has constant value ""Constant
누적체납금액 is highly overall correlated with 체납액구간High correlation
체납액구간 is highly overall correlated with 누적체납금액High correlation
누적체납금액 has unique valuesUnique

Reproduction

Analysis started2024-03-18 05:47:09.807939
Analysis finished2024-03-18 05:47:10.798674
Duration0.99 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size772.0 B
인천광역시
80 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row인천광역시
2nd row인천광역시
3rd row인천광역시
4th row인천광역시
5th row인천광역시

Common Values

ValueCountFrequency (%)
인천광역시 80
100.0%

Length

2024-03-18T14:47:10.863265image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-18T14:47:10.963467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
인천광역시 80
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size772.0 B
인천광역시
80 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row인천광역시
2nd row인천광역시
3rd row인천광역시
4th row인천광역시
5th row인천광역시

Common Values

ValueCountFrequency (%)
인천광역시 80
100.0%

Length

2024-03-18T14:47:11.047675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-18T14:47:11.125035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
인천광역시 80
100.0%

자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size772.0 B
28000
80 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row28000
2nd row28000
3rd row28000
4th row28000
5th row28000

Common Values

ValueCountFrequency (%)
28000 80
100.0%

Length

2024-03-18T14:47:11.207325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-18T14:47:11.288969image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
28000 80
100.0%

과세년도
Categorical

Distinct4
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size772.0 B
2019
26 
2018
25 
2021
15 
2020
14 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2018
2nd row2018
3rd row2018
4th row2018
5th row2018

Common Values

ValueCountFrequency (%)
2019 26
32.5%
2018 25
31.2%
2021 15
18.8%
2020 14
17.5%

Length

2024-03-18T14:47:11.381547image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-18T14:47:11.475709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2019 26
32.5%
2018 25
31.2%
2021 15
18.8%
2020 14
17.5%

세목명
Categorical

Distinct6
Distinct (%)7.5%
Missing0
Missing (%)0.0%
Memory size772.0 B
지방소득세
38 
취득세
30 
주민세
등록면허세
 
2
지역자원시설세
 
2

Length

Max length7
Median length5
Mean length4.15
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row등록면허세
2nd row주민세
3rd row주민세
4th row주민세
5th row지방소득세

Common Values

ValueCountFrequency (%)
지방소득세 38
47.5%
취득세 30
37.5%
주민세 6
 
7.5%
등록면허세 2
 
2.5%
지역자원시설세 2
 
2.5%
담배소비세 2
 
2.5%

Length

2024-03-18T14:47:11.612260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-18T14:47:11.729468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
지방소득세 38
47.5%
취득세 30
37.5%
주민세 6
 
7.5%
등록면허세 2
 
2.5%
지역자원시설세 2
 
2.5%
담배소비세 2
 
2.5%

체납액구간
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)16.2%
Missing0
Missing (%)0.0%
Memory size772.0 B
10만원 미만
11 
1천만원~3천만원미만
3천만원~5천만원미만
5백만원~1천만원미만
30만원~50만원미만
Other values (8)
38 

Length

Max length11
Median length11
Mean length10.1
Min length6

Unique

Unique1 ?
Unique (%)1.2%

Sample

1st row10만원 미만
2nd row10만원 미만
3rd row10만원~30만원미만
4th row30만원~50만원미만
5th row10만원 미만

Common Values

ValueCountFrequency (%)
10만원 미만 11
13.8%
1천만원~3천만원미만 8
10.0%
3천만원~5천만원미만 8
10.0%
5백만원~1천만원미만 8
10.0%
30만원~50만원미만 7
8.8%
5천만원~1억원미만 7
8.8%
10만원~30만원미만 6
7.5%
1억원~3억원미만 6
7.5%
3백만원~5백만원미만 6
7.5%
1백만원~3백만원미만 5
6.2%
Other values (3) 8
10.0%

Length

2024-03-18T14:47:11.853628image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
10만원 11
12.1%
미만 11
12.1%
1천만원~3천만원미만 8
8.8%
3천만원~5천만원미만 8
8.8%
5백만원~1천만원미만 8
8.8%
30만원~50만원미만 7
7.7%
5천만원~1억원미만 7
7.7%
10만원~30만원미만 6
6.6%
1억원~3억원미만 6
6.6%
3백만원~5백만원미만 6
6.6%
Other values (4) 13
14.3%

체납건수
Categorical

CONSTANT 

Distinct1
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size772.0 B
0
80 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 80
100.0%

Length

2024-03-18T14:47:11.984622image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-18T14:47:12.088437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 80
100.0%

체납금액
Categorical

CONSTANT 

Distinct1
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size772.0 B
0
80 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 80
100.0%

Length

2024-03-18T14:47:12.182819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-18T14:47:12.267733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 80
100.0%

누적체납건수
Real number (ℝ)

Distinct67
Distinct (%)83.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7967.125
Minimum1
Maximum303934
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size852.0 B
2024-03-18T14:47:12.361670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4
Q123.25
median176
Q3748.25
95-th percentile9179.35
Maximum303934
Range303933
Interquartile range (IQR)725

Descriptive statistics

Standard deviation42555.294
Coefficient of variation (CV)5.3413614
Kurtosis39.51529
Mean7967.125
Median Absolute Deviation (MAD)171.5
Skewness6.2837058
Sum637370
Variance1.8109531 × 109
MonotonicityNot monotonic
2024-03-18T14:47:12.479896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5 5
 
6.2%
4 4
 
5.0%
10 2
 
2.5%
25 2
 
2.5%
1 2
 
2.5%
36 2
 
2.5%
7 2
 
2.5%
60 2
 
2.5%
1127 1
 
1.2%
3 1
 
1.2%
Other values (57) 57
71.2%
ValueCountFrequency (%)
1 2
 
2.5%
3 1
 
1.2%
4 4
5.0%
5 5
6.2%
7 2
 
2.5%
9 1
 
1.2%
10 2
 
2.5%
13 1
 
1.2%
16 1
 
1.2%
18 1
 
1.2%
ValueCountFrequency (%)
303934 1
1.2%
232989 1
1.2%
29692 1
1.2%
20795 1
1.2%
8568 1
1.2%
5985 1
1.2%
3791 1
1.2%
2762 1
1.2%
2478 1
1.2%
2361 1
1.2%

누적체납금액
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct80
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.4645289 × 109
Minimum213590
Maximum2.4114127 × 1010
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size852.0 B
2024-03-18T14:47:12.619892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum213590
5-th percentile15463199
Q12.5356583 × 108
median6.0947032 × 108
Q31.8049313 × 109
95-th percentile3.9378935 × 109
Maximum2.4114127 × 1010
Range2.4113914 × 1010
Interquartile range (IQR)1.5513655 × 109

Descriptive statistics

Standard deviation2.8737156 × 109
Coefficient of variation (CV)1.9622116
Kurtosis49.7536
Mean1.4645289 × 109
Median Absolute Deviation (MAD)5.2075999 × 108
Skewness6.4323957
Sum1.1716231 × 1011
Variance8.2582415 × 1018
MonotonicityNot monotonic
2024-03-18T14:47:12.750466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
12124500 1
 
1.2%
47360740 1
 
1.2%
501051320 1
 
1.2%
2371859710 1
 
1.2%
3518067870 1
 
1.2%
3093866730 1
 
1.2%
489442420 1
 
1.2%
2182853590 1
 
1.2%
832997600 1
 
1.2%
5755533400 1
 
1.2%
Other values (70) 70
87.5%
ValueCountFrequency (%)
213590 1
1.2%
3207330 1
1.2%
4236140 1
1.2%
12124500 1
1.2%
15638920 1
1.2%
39430440 1
1.2%
40688190 1
1.2%
44317810 1
1.2%
47360740 1
1.2%
57454220 1
1.2%
ValueCountFrequency (%)
24114127340 1
1.2%
5755533400 1
1.2%
5413157510 1
1.2%
3974457670 1
1.2%
3935969070 1
1.2%
3744652720 1
1.2%
3518067870 1
1.2%
3339945960 1
1.2%
3093866730 1
1.2%
3084699580 1
1.2%

Interactions

2024-03-18T14:47:10.450225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T14:47:10.310745image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T14:47:10.520845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T14:47:10.379598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-18T14:47:12.831027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과세년도세목명체납액구간누적체납건수누적체납금액
과세년도1.0000.0400.0000.0000.082
세목명0.0401.0000.5250.6210.767
체납액구간0.0000.5251.0000.0000.772
누적체납건수0.0000.6210.0001.0000.504
누적체납금액0.0820.7670.7720.5041.000
2024-03-18T14:47:12.912860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
세목명과세년도체납액구간
세목명1.0000.0000.275
과세년도0.0001.0000.000
체납액구간0.2750.0001.000
2024-03-18T14:47:13.202136image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
누적체납건수누적체납금액과세년도세목명체납액구간
누적체납건수1.0000.2150.0000.3120.000
누적체납금액0.2151.0000.0000.4290.542
과세년도0.0000.0001.0000.0000.000
세목명0.3120.4290.0001.0000.275
체납액구간0.0000.5420.0000.2751.000

Missing values

2024-03-18T14:47:10.627069image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-18T14:47:10.751182image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명자치단체코드과세년도세목명체납액구간체납건수체납금액누적체납건수누적체납금액
0인천광역시인천광역시280002018등록면허세10만원 미만00112712124500
1인천광역시인천광역시280002018주민세10만원 미만002329893935969070
2인천광역시인천광역시280002018주민세10만원~30만원미만00746132407570
3인천광역시인천광역시280002018주민세30만원~50만원미만0010239430440
4인천광역시인천광역시280002018지방소득세10만원 미만0020795571705260
5인천광역시인천광역시280002018지방소득세10만원~30만원미만0059851107715080
6인천광역시인천광역시280002018지방소득세1백만원~3백만원미만0016992847427120
7인천광역시인천광역시280002018지방소득세1억원~3억원미만004624389650
8인천광역시인천광역시280002018지방소득세1천만원~3천만원미만001111761583890
9인천광역시인천광역시280002018지방소득세30만원~50만원미만001074440780140
시도명시군구명자치단체코드과세년도세목명체납액구간체납건수체납금액누적체납건수누적체납금액
70인천광역시인천광역시280002021지방소득세3백만원~5백만원미만004211622731650
71인천광역시인천광역시280002021지방소득세3천만원~5천만원미만00361305647280
72인천광역시인천광역시280002021지방소득세50만원~1백만원미만0024781737510822
73인천광역시인천광역시280002021지방소득세5백만원~1천만원미만003962756381180
74인천광역시인천광역시280002021지방소득세5천만원~1억원미만00251659598250
75인천광역시인천광역시280002021취득세1억원~3억원미만005699766090
76인천광역시인천광역시280002021취득세1천만원~3천만원미만0036594550980
77인천광역시인천광역시280002021취득세3천만원~5천만원미만0010400446340
78인천광역시인천광역시280002021취득세5백만원~1천만원미만0047349032670
79인천광역시인천광역시280002021취득세5천만원~1억원미만005349293130