Overview

Dataset statistics

Number of variables9
Number of observations28
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.2 KiB
Average record size in memory80.7 B

Variable types

Categorical3
Text2
Numeric4

Dataset

DescriptionARS간편납부 서비스를 통해 경기도 구리시에서 수납된 세금에 대한 현황정보(일자, 세목명, 건수, 납부금액 등)를 제공합니다.
Author경기도 구리시
URLhttps://www.data.go.kr/data/15090464/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
본세합계금액계 is highly overall correlated with 가산금합계금액계 and 2 other fieldsHigh correlation
가산금합계금액계 is highly overall correlated with 본세합계금액계 and 1 other fieldsHigh correlation
전체건수계 is highly overall correlated with 본세합계금액계 and 1 other fieldsHigh correlation
전체합계금액계 is highly overall correlated with 본세합계금액계 and 2 other fieldsHigh correlation
본세합계금액계 has unique valuesUnique
가산금합계금액계 has unique valuesUnique
전체합계금액계 has unique valuesUnique
가산금합계금액계 has 1 (3.6%) zerosZeros

Reproduction

Analysis started2024-04-06 08:22:10.690825
Analysis finished2024-04-06 08:22:14.999380
Duration4.31 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

일자
Categorical

Distinct3
Distinct (%)10.7%
Missing0
Missing (%)0.0%
Memory size356.0 B
2023년 상반기
11 
2022년 상반기
2022년 하반기

Length

Max length9
Median length9
Mean length9
Min length9

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022년 상반기
2nd row2022년 상반기
3rd row2022년 상반기
4th row2022년 상반기
5th row2022년 상반기

Common Values

ValueCountFrequency (%)
2023년 상반기 11
39.3%
2022년 상반기 9
32.1%
2022년 하반기 8
28.6%

Length

2024-04-06T17:22:15.128381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:22:15.327302image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
상반기 20
35.7%
2022년 17
30.4%
2023년 11
19.6%
하반기 8
 
14.3%

주세
Categorical

Distinct12
Distinct (%)42.9%
Missing0
Missing (%)0.0%
Memory size356.0 B
취득세
등록면허세
지역자원시설세
주민세
재산세
Other values (7)
13 

Length

Max length7
Median length5
Mean length4.3214286
Min length3

Unique

Unique4 ?
Unique (%)14.3%

Sample

1st row취득세
2nd row등록세
3rd row등록면허세
4th row지역자원시설세
5th row주민세

Common Values

ValueCountFrequency (%)
취득세 3
10.7%
등록면허세 3
10.7%
지역자원시설세 3
10.7%
주민세 3
10.7%
재산세 3
10.7%
자동차세 3
10.7%
지방소득세 3
10.7%
지방교육세 3
10.7%
등록세 1
 
3.6%
면허세 1
 
3.6%
Other values (2) 2
7.1%

Length

2024-04-06T17:22:15.612517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
취득세 3
10.7%
등록면허세 3
10.7%
지역자원시설세 3
10.7%
주민세 3
10.7%
재산세 3
10.7%
자동차세 3
10.7%
지방소득세 3
10.7%
지방교육세 3
10.7%
등록세 1
 
3.6%
면허세 1
 
3.6%
Other values (2) 2
7.1%
Distinct26
Distinct (%)92.9%
Missing0
Missing (%)0.0%
Memory size356.0 B
2024-04-06T17:22:16.096110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length3
Mean length2.9642857
Min length1

Characters and Unicode

Total characters83
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)89.3%

Sample

1st row18
2nd row1
3rd row305
4th row226
5th row347
ValueCountFrequency (%)
1 3
 
10.7%
18 1
 
3.6%
143 1
 
3.6%
3,086 1
 
3.6%
325 1
 
3.6%
257 1
 
3.6%
272 1
 
3.6%
262 1
 
3.6%
4 1
 
3.6%
21 1
 
3.6%
Other values (16) 16
57.1%
2024-04-06T17:22:16.860532image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 14
16.9%
1 12
14.5%
3 12
14.5%
5 8
9.6%
4 8
9.6%
6 7
8.4%
, 7
8.4%
7 6
7.2%
8 4
 
4.8%
9 3
 
3.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 76
91.6%
Other Punctuation 7
 
8.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 14
18.4%
1 12
15.8%
3 12
15.8%
5 8
10.5%
4 8
10.5%
6 7
9.2%
7 6
7.9%
8 4
 
5.3%
9 3
 
3.9%
0 2
 
2.6%
Other Punctuation
ValueCountFrequency (%)
, 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 83
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 14
16.9%
1 12
14.5%
3 12
14.5%
5 8
9.6%
4 8
9.6%
6 7
8.4%
, 7
8.4%
7 6
7.2%
8 4
 
4.8%
9 3
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 83
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 14
16.9%
1 12
14.5%
3 12
14.5%
5 8
9.6%
4 8
9.6%
6 7
8.4%
, 7
8.4%
7 6
7.2%
8 4
 
4.8%
9 3
 
3.6%
Distinct26
Distinct (%)92.9%
Missing0
Missing (%)0.0%
Memory size356.0 B
2024-04-06T17:22:17.241361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length3
Mean length2.4285714
Min length1

Characters and Unicode

Total characters68
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)89.3%

Sample

1st row2
2nd row1
3rd row126
4th row227
5th row341
ValueCountFrequency (%)
1 3
 
10.7%
2 1
 
3.6%
87 1
 
3.6%
927 1
 
3.6%
321 1
 
3.6%
257 1
 
3.6%
277 1
 
3.6%
117 1
 
3.6%
4 1
 
3.6%
5 1
 
3.6%
Other values (16) 16
57.1%
2024-04-06T17:22:18.109385image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 12
17.6%
2 11
16.2%
3 11
16.2%
7 7
10.3%
4 7
10.3%
5 6
8.8%
9 4
 
5.9%
6 3
 
4.4%
8 3
 
4.4%
, 2
 
2.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 66
97.1%
Other Punctuation 2
 
2.9%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 12
18.2%
2 11
16.7%
3 11
16.7%
7 7
10.6%
4 7
10.6%
5 6
9.1%
9 4
 
6.1%
6 3
 
4.5%
8 3
 
4.5%
0 2
 
3.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 68
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 12
17.6%
2 11
16.2%
3 11
16.2%
7 7
10.3%
4 7
10.3%
5 6
8.8%
9 4
 
5.9%
6 3
 
4.4%
8 3
 
4.4%
, 2
 
2.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 68
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 12
17.6%
2 11
16.2%
3 11
16.2%
7 7
10.3%
4 7
10.3%
5 6
8.8%
9 4
 
5.9%
6 3
 
4.4%
8 3
 
4.4%
, 2
 
2.9%

본세합계금액계
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct28
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean92769573
Minimum4070
Maximum6.1818046 × 108
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size384.0 B
2024-04-06T17:22:18.462429image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum4070
5-th percentile30087.5
Q14409035
median26188560
Q386670575
95-th percentile4.7190679 × 108
Maximum6.1818046 × 108
Range6.1817639 × 108
Interquartile range (IQR)82261540

Descriptive statistics

Standard deviation1.5967154 × 108
Coefficient of variation (CV)1.7211628
Kurtosis5.3340661
Mean92769573
Median Absolute Deviation (MAD)26056060
Skewness2.3872773
Sum2.597548 × 109
Variance2.5495001 × 1016
MonotonicityNot monotonic
2024-04-06T17:22:18.769320image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=28)
ValueCountFrequency (%)
101969690 1
 
3.6%
20772750 1
 
3.6%
170786650 1
 
3.6%
81570870 1
 
3.6%
24750 1
 
3.6%
543136870 1
 
3.6%
59096280 1
 
3.6%
5729640 1
 
3.6%
4193140 1
 
3.6%
4070 1
 
3.6%
Other values (18) 18
64.3%
ValueCountFrequency (%)
4070 1
3.6%
24750 1
3.6%
40000 1
3.6%
225000 1
3.6%
563650 1
3.6%
1096000 1
3.6%
4193140 1
3.6%
4481000 1
3.6%
4930000 1
3.6%
5389440 1
3.6%
ValueCountFrequency (%)
618180460 1
3.6%
543136870 1
3.6%
339622360 1
3.6%
195368300 1
3.6%
192350990 1
3.6%
170786650 1
3.6%
101969690 1
3.6%
81570870 1
3.6%
59599880 1
3.6%
59096280 1
3.6%

가산금합계금액계
Real number (ℝ)

HIGH CORRELATION  UNIQUE  ZEROS 

Distinct28
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1147097.9
Minimum0
Maximum5066820
Zeros1
Zeros (%)3.6%
Negative0
Negative (%)0.0%
Memory size384.0 B
2024-04-06T17:22:19.095159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile560.5
Q160465
median389015
Q31604905
95-th percentile3739499
Maximum5066820
Range5066820
Interquartile range (IQR)1544440

Descriptive statistics

Standard deviation1509241.8
Coefficient of variation (CV)1.3157045
Kurtosis0.48590216
Mean1147097.9
Median Absolute Deviation (MAD)387400
Skewness1.3169485
Sum32118740
Variance2.2778107 × 1012
MonotonicityNot monotonic
2024-04-06T17:22:19.927078image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=28)
ValueCountFrequency (%)
592250 1
 
3.6%
819210 1
 
3.6%
1276350 1
 
3.6%
3686410 1
 
3.6%
1230 1
 
3.6%
3649030 1
 
3.6%
2736210 1
 
3.6%
1776760 1
 
3.6%
144970 1
 
3.6%
200 1
 
3.6%
Other values (18) 18
64.3%
ValueCountFrequency (%)
0 1
3.6%
200 1
3.6%
1230 1
3.6%
2000 1
3.6%
6500 1
3.6%
6960 1
3.6%
32880 1
3.6%
69660 1
3.6%
72230 1
3.6%
138650 1
3.6%
ValueCountFrequency (%)
5066820 1
3.6%
3753450 1
3.6%
3713590 1
3.6%
3686410 1
3.6%
3649030 1
3.6%
2736210 1
3.6%
1776760 1
3.6%
1547620 1
3.6%
1276350 1
3.6%
1138510 1
3.6%

전체건수계
Real number (ℝ)

HIGH CORRELATION 

Distinct26
Distinct (%)92.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean786.14286
Minimum1
Maximum3792
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size384.0 B
2024-04-06T17:22:20.394314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q120.25
median259.5
Q3639
95-th percentile3436.15
Maximum3792
Range3791
Interquartile range (IQR)618.75

Descriptive statistics

Standard deviation1210.5308
Coefficient of variation (CV)1.5398356
Kurtosis1.2075527
Mean786.14286
Median Absolute Deviation (MAD)240
Skewness1.6232187
Sum22012
Variance1465384.8
MonotonicityNot monotonic
2024-04-06T17:22:20.669781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%)
1 3
 
10.7%
18 1
 
3.6%
342 1
 
3.6%
3496 1
 
3.6%
143 1
 
3.6%
3087 1
 
3.6%
325 1
 
3.6%
257 1
 
3.6%
278 1
 
3.6%
262 1
 
3.6%
Other values (16) 16
57.1%
ValueCountFrequency (%)
1 3
10.7%
4 1
 
3.6%
5 1
 
3.6%
14 1
 
3.6%
18 1
 
3.6%
21 1
 
3.6%
55 1
 
3.6%
68 1
 
3.6%
143 1
 
3.6%
228 1
 
3.6%
ValueCountFrequency (%)
3792 1
3.6%
3496 1
3.6%
3325 1
3.6%
3087 1
3.6%
1978 1
3.6%
1614 1
3.6%
1515 1
3.6%
347 1
3.6%
342 1
3.6%
325 1
3.6%

전체합계금액계
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct28
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean93916671
Minimum4270
Maximum6.2189405 × 108
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size384.0 B
2024-04-06T17:22:20.898769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum4270
5-th percentile31587
Q14666677.5
median26601645
Q389583445
95-th percentile4.7432719 × 108
Maximum6.2189405 × 108
Range6.2188978 × 108
Interquartile range (IQR)84916768

Descriptive statistics

Standard deviation1.6053799 × 108
Coefficient of variation (CV)1.7093663
Kurtosis5.3161263
Mean93916671
Median Absolute Deviation (MAD)26464895
Skewness2.3807895
Sum2.6296668 × 109
Variance2.5772447 × 1016
MonotonicityNot monotonic
2024-04-06T17:22:21.171439image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=28)
ValueCountFrequency (%)
102561940 1
 
3.6%
21591960 1
 
3.6%
172063000 1
 
3.6%
85257280 1
 
3.6%
25980 1
 
3.6%
546785900 1
 
3.6%
61832490 1
 
3.6%
7506400 1
 
3.6%
4338110 1
 
3.6%
4270 1
 
3.6%
Other values (18) 18
64.3%
ValueCountFrequency (%)
4270 1
3.6%
25980 1
3.6%
42000 1
3.6%
231500 1
3.6%
997430 1
3.6%
1128880 1
3.6%
4338110 1
3.6%
4776200 1
3.6%
5002230 1
3.6%
5459100 1
3.6%
ValueCountFrequency (%)
621894050 1
3.6%
546785900 1
3.6%
339761010 1
3.6%
197417810 1
3.6%
196915920 1
3.6%
172063000 1
3.6%
102561940 1
3.6%
85257280 1
3.6%
63353330 1
3.6%
61832490 1
3.6%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)3.6%
Missing0
Missing (%)0.0%
Memory size356.0 B
2024-03-22
28 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2024-03-22
2nd row2024-03-22
3rd row2024-03-22
4th row2024-03-22
5th row2024-03-22

Common Values

ValueCountFrequency (%)
2024-03-22 28
100.0%

Length

2024-04-06T17:22:21.470663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:22:21.790714image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2024-03-22 28
100.0%

Interactions

2024-04-06T17:22:13.700322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:22:11.192227image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:22:12.047013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:22:12.950316image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:22:13.867845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:22:11.363560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:22:12.307768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:22:13.134683image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:22:14.046827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:22:11.543362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:22:12.484127image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:22:13.336464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:22:14.247104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:22:11.841349image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:22:12.697319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:22:13.516511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-06T17:22:21.964128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일자주세본세건수계가산금건수계본세합계금액계가산금합계금액계전체건수계전체합계금액계
일자1.0000.0000.8060.8060.2480.0000.0000.362
주세0.0001.0000.0000.0000.0000.0000.6340.000
본세건수계0.8060.0001.0001.0001.0001.0001.0001.000
가산금건수계0.8060.0001.0001.0001.0001.0001.0001.000
본세합계금액계0.2480.0001.0001.0001.0000.7620.8090.999
가산금합계금액계0.0000.0001.0001.0000.7621.0000.0000.813
전체건수계0.0000.6341.0001.0000.8090.0001.0000.810
전체합계금액계0.3620.0001.0001.0000.9990.8130.8101.000
2024-04-06T17:22:22.278094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일자주세
일자1.0000.000
주세0.0001.000
2024-04-06T17:22:22.477902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
본세합계금액계가산금합계금액계전체건수계전체합계금액계일자주세
본세합계금액계1.0000.7430.6940.9990.1330.000
가산금합계금액계0.7431.0000.4840.7470.0000.000
전체건수계0.6940.4841.0000.6860.0000.222
전체합계금액계0.9990.7470.6861.0000.2280.000
일자0.1330.0000.0000.2281.0000.000
주세0.0000.0000.2220.0000.0001.000

Missing values

2024-04-06T17:22:14.498043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-06T17:22:14.910551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

일자주세본세건수계가산금건수계본세합계금액계가산금합계금액계전체건수계전체합계금액계데이터기준일자
02022년 상반기취득세182101969690592250181025619402024-03-22
12022년 상반기등록세1156365043378019974302024-03-22
22022년 상반기등록면허세30512653894406966030554591002024-03-22
32022년 상반기지역자원시설세226227448100029520022847762002024-03-22
42022년 상반기주민세347341546443015723034756216602024-03-22
52022년 상반기재산세289289595998803753450290633533302024-03-22
62022년 상반기자동차세3,325933618180460371359033256218940502024-03-22
72022년 상반기지방소득세23914619235099050668202401974178102024-03-22
82022년 상반기지방교육세3,7771,506195368300154762037921969159202024-03-22
92022년 하반기취득세501138899005113889902024-03-22
일자주세본세건수계가산금건수계본세합계금액계가산금합계금액계전체건수계전체합계금액계데이터기준일자
182023년 상반기면허세444000020004420002024-03-22
192023년 상반기등록면허세26211749300007223026250022302024-03-22
202023년 상반기공동시설세114070200142702024-03-22
212023년 상반기지역자원시설세272277419314014497027843381102024-03-22
222023년 상반기주민세2572575729640177676025775064002024-03-22
232023년 상반기재산세325321590962802736210325618324902024-03-22
242023년 상반기자동차세3,086927543136870364903030875467859002024-03-22
252023년 상반기도시계획세112475012301259802024-03-22
262023년 상반기지방소득세14387815708703686410143852572802024-03-22
272023년 상반기지방교육세3,4631,445170786650127635034961720630002024-03-22