Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 27 |
Missing cells | 18 |
Missing cells (%) | 8.3% |
Duplicate rows | 1 |
Duplicate rows (%) | 3.7% |
Total size in memory | 2.0 KiB |
Average record size in memory | 75.7 B |
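The two percentages in this table follow from the counts alone. As a minimal sketch, assuming the report divides missing cells by rows × columns and duplicate rows by the row count (the variable names are illustrative, not taken from the report), the arithmetic can be checked like this:

```python
# Counts copied from the "Dataset statistics" table.
n_variables = 8
n_observations = 27
missing_cells = 18
duplicate_rows = 1

# Assumption: percentages are taken over all cells and all rows respectively.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Duplicate rows: {duplicate_pct:.1f}%")
```

The 18 missing cells are also consistent with the alerts in this report: six numeric columns with 3 missing values each.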
Variable types
Categorical | 2 |
---|---|
Numeric | 6 |
Dataset
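Description | Data on the collection rate of delinquent local taxes in Hanam-si, Gyeonggi-do, covering items such as the adjusted carried-over delinquent amount (이월체납조정액), amount collected (징수액), amount written off (결손액), outstanding amount (미수액), and clearance rate (정리율). (Unit: million KRW) |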
---|---|
Author | Hanam-si, Gyeonggi-do (경기도 하남시) |
URL | https://www.data.go.kr/data/15107161/fileData.do |
Alerts
Dataset has 1 (3.7%) duplicate rows | Duplicates |
연도 is highly overall correlated with 월 and 6 other fields | High correlation |
구분 is highly overall correlated with 이월체납조정액 and 3 other fields | High correlation |
월 is highly overall correlated with 정리액(징수액) and 3 other fields | High correlation |
이월체납조정액 is highly overall correlated with 정리액(징수액) and 4 other fields | High correlation |
정리액(징수액) is highly overall correlated with 월 and 4 other fields | High correlation |
정리액(결손액) is highly overall correlated with 월 and 4 other fields | High correlation |
미수액 is highly overall correlated with 이월체납조정액 and 3 other fields | High correlation |
정리율 is highly overall correlated with 월 and 3 other fields | High correlation |
월 has 3 (11.1%) missing values | Missing |
이월체납조정액 has 3 (11.1%) missing values | Missing |
정리액(징수액) has 3 (11.1%) missing values | Missing |
정리액(결손액) has 3 (11.1%) missing values | Missing |
미수액 has 3 (11.1%) missing values | Missing |
정리율 has 3 (11.1%) missing values | Missing |
Reproduction
Analysis started | 2024-04-13 13:06:00.016406 |
---|---|
Analysis finished | 2024-04-13 13:06:10.673783 |
Duration | 10.66 seconds |
Software version | ydata-profiling v4.5.1 |
연도
Categorical
HIGH CORRELATION
Distinct | 2 |
---|---|
Distinct (%) | 7.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 344.0 B |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2023 |
---|---|
2nd row | 2023 |
3rd row | 2023 |
4th row | 2023 |
5th row | 2023 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
2023 | 24 | 88.9% |
<NA> | 3 | 11.1% |
월
Real number (ℝ)
HIGH CORRELATION
MISSING
Distinct | 12 |
---|---|
Distinct (%) | 50.0% |
Missing | 3 |
Missing (%) | 11.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6.5 |
Minimum | 1 |
---|---|
Maximum | 12 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 371.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1.15 |
Q1 | 3.75 |
median | 6.5 |
Q3 | 9.25 |
95-th percentile | 11.85 |
Maximum | 12 |
Range | 11 |
Interquartile range (IQR) | 5.5 |
Descriptive statistics
Standard deviation | 3.5262987 |
---|---|
Coefficient of variation (CV) | 0.54250749 |
Kurtosis | -1.2156934 |
Mean | 6.5 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 0 |
Sum | 156 |
Variance | 12.434783 |
Monotonicity | Increasing |
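The statistics for 월 can be cross-checked with the Python standard library. This is a sketch under stated assumptions: that the 24 non-missing values are the months 1 to 12, each appearing twice (as the value counts suggest), that quantiles use linear interpolation between order statistics, and that the kurtosis is the bias-corrected excess kurtosis. None of these choices is confirmed by the report itself.

```python
import statistics

# Assumption: months 1..12, each present twice (24 non-missing values).
months = sorted(float(m) for m in range(1, 13) for _ in range(2))
n = len(months)

mean = statistics.mean(months)
variance = statistics.variance(months)  # sample variance, n - 1 denominator
std_dev = statistics.stdev(months)
cv = std_dev / mean                     # coefficient of variation

# Quartiles by linear interpolation between order statistics.
q1, median, q3 = statistics.quantiles(months, n=4, method="inclusive")

# Median absolute deviation around the median.
mad = statistics.median(abs(m - median) for m in months)

# Bias-corrected excess kurtosis (assumed estimator).
m2 = sum((m - mean) ** 2 for m in months) / n
m4 = sum((m - mean) ** 4 for m in months) / n
g2 = m4 / m2 ** 2 - 3
kurtosis = (n - 1) / ((n - 2) * (n - 3)) * ((n + 1) * g2 + 6)

print(mean, variance, std_dev, cv)
print(q1, median, q3, q3 - q1, mad)
print(kurtosis)
```

Under these assumptions the results should agree with the reported figures to the precision shown, which suggests the variance uses the n − 1 denominator rather than n.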
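Common values
Value | Count | Frequency (%) |
---|---|---|
1 | 2 | 7.4% |
2 | 2 | 7.4% |
3 | 2 | 7.4% |
4 | 2 | 7.4% |
5 | 2 | 7.4% |
6 | 2 | 7.4% |
7 | 2 | 7.4% |
8 | 2 | 7.4% |
9 | 2 | 7.4% |
10 | 2 | 7.4% |
Other values (2) | 4 | 14.8% |
(Missing) | 3 | 11.1% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
1 | 2 | 7.4% |
2 | 2 | 7.4% |
3 | 2 | 7.4% |
4 | 2 | 7.4% |
5 | 2 | 7.4% |
6 | 2 | 7.4% |
7 | 2 | 7.4% |
8 | 2 | 7.4% |
9 | 2 | 7.4% |
10 | 2 | 7.4% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
12 | 2 | 7.4% |
11 | 2 | 7.4% |
10 | 2 | 7.4% |
9 | 2 | 7.4% |
8 | 2 | 7.4% |
7 | 2 | 7.4% |
6 | 2 | 7.4% |
5 | 2 | 7.4% |
4 | 2 | 7.4% |
3 | 2 | 7.4% |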
구분
Categorical
HIGH CORRELATION
Distinct | 3 |
---|---|
Distinct (%) | 11.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 344.0 B |
Length
Max length | 4 |
---|---|
Median length | 2 |
Mean length | 2.2222222 |
Min length | 2 |
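The length figures hint at why 구분 reports zero missing values even though `<NA>` appears among its categories. A minimal sketch, assuming the three `<NA>` entries are stored as literal four-character strings and the remaining 24 entries are the two-character labels 시세 and 도세 (12 each, as listed under Common Values):

```python
import statistics

# Assumption: "<NA>" is a literal string in the column, not a true null.
values = ["시세"] * 12 + ["도세"] * 12 + ["<NA>"] * 3
lengths = [len(v) for v in values]

max_len = max(lengths)
median_len = statistics.median(lengths)
mean_len = statistics.mean(lengths)
min_len = min(lengths)

print(max_len, median_len, round(mean_len, 7), min_len)
```

If that reading is right, the three all-`<NA>` rows count as present for 연도 and 구분, which would match their missing count of 0 while the numeric columns report 3 missing values each.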
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 시세 |
---|---|
2nd row | 도세 |
3rd row | 시세 |
4th row | 도세 |
5th row | 시세 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
시세 | 12 | 44.4% |
도세 | 12 | 44.4% |
<NA> | 3 | 11.1% |
이월체납조정액
Real number (ℝ)
HIGH CORRELATION
MISSING
Distinct | 24 |
---|---|
Distinct (%) | 100.0% |
Missing | 3 |
Missing (%) | 11.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 14008.083 |
Minimum | 6235 |
---|---|
Maximum | 21973 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 371.0 B |
Quantile statistics
Minimum | 6235 |
---|---|
5-th percentile | 6245.6 |
Q1 | 6280 |
median | 13902.5 |
Q3 | 21727.25 |
95-th percentile | 21925.25 |
Maximum | 21973 |
Range | 15738 |
Interquartile range (IQR) | 15447.25 |
Descriptive statistics
Standard deviation | 7889.2383 |
---|---|
Coefficient of variation (CV) | 0.56319184 |
Kurtosis | -2.1892408 |
Mean | 14008.083 |
Median Absolute Deviation (MAD) | 7655 |
Skewness | 0.00071484203 |
Sum | 336194 |
Variance | 62240081 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
---|---|---|
21791 | 1 | 3.7% |
21973 | 1 | 3.7% |
6358 | 1 | 3.7% |
21932 | 1 | 3.7% |
6346 | 1 | 3.7% |
21887 | 1 | 3.7% |
6326 | 1 | 3.7% |
21840 | 1 | 3.7% |
6314 | 1 | 3.7% |
21844 | 1 | 3.7% |
Other values (14) | 14 | 51.9% |
(Missing) | 3 | 11.1% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
6235 | 1 | 3.7% |
6245 | 1 | 3.7% |
6249 | 1 | 3.7% |
6255 | 1 | 3.7% |
6264 | 1 | 3.7% |
6268 | 1 | 3.7% |
6284 | 1 | 3.7% |
6287 | 1 | 3.7% |
6314 | 1 | 3.7% |
6326 | 1 | 3.7% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
21973 | 1 | 3.7% |
21932 | 1 | 3.7% |
21887 | 1 | 3.7% |
21844 | 1 | 3.7% |
21840 | 1 | 3.7% |
21791 | 1 | 3.7% |
21706 | 1 | 3.7% |
21678 | 1 | 3.7% |
21624 | 1 | 3.7% |
21559 | 1 | 3.7% |
정리액(징수액)
Real number (ℝ)
HIGH CORRELATION
MISSING
Distinct | 24 |
---|---|
Distinct (%) | 100.0% |
Missing | 3 |
Missing (%) | 11.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3718.9167 |
Minimum | 605 |
---|---|
Maximum | 7572 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 371.0 B |
Quantile statistics
Minimum | 605 |
---|---|
5-th percentile | 1035.4 |
Q1 | 2168.75 |
median | 2580 |
Q3 | 5933.75 |
95-th percentile | 7282.5 |
Maximum | 7572 |
Range | 6967 |
Interquartile range (IQR) | 3765 |
Descriptive statistics
Standard deviation | 2291.7067 |
---|---|
Coefficient of variation (CV) | 0.61622963 |
Kurtosis | -1.3373447 |
Mean | 3718.9167 |
Median Absolute Deviation (MAD) | 1376.5 |
Skewness | 0.46531999 |
Sum | 89254 |
Variance | 5251919.4 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
---|---|---|
6236 | 1 | 3.7% |
7572 | 1 | 3.7% |
2591 | 1 | 3.7% |
7317 | 1 | 3.7% |
2569 | 1 | 3.7% |
7087 | 1 | 3.7% |
2532 | 1 | 3.7% |
6857 | 1 | 3.7% |
2499 | 1 | 3.7% |
6651 | 1 | 3.7% |
Other values (14) | 14 | 51.9% |
(Missing) | 3 | 11.1% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
605 | 1 | 3.7% |
997 | 1 | 3.7% |
1253 | 1 | 3.7% |
1453 | 1 | 3.7% |
1544 | 1 | 3.7% |
1655 | 1 | 3.7% |
2340 | 1 | 3.7% |
2404 | 1 | 3.7% |
2463 | 1 | 3.7% |
2499 | 1 | 3.7% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
7572 | 1 | 3.7% |
7317 | 1 | 3.7% |
7087 | 1 | 3.7% |
6857 | 1 | 3.7% |
6651 | 1 | 3.7% |
6236 | 1 | 3.7% |
5833 | 1 | 3.7% |
5264 | 1 | 3.7% |
4744 | 1 | 3.7% |
4006 | 1 | 3.7% |
정리액(결손액)
Real number (ℝ)
HIGH CORRELATION
MISSING
Distinct | 21 |
---|---|
Distinct (%) | 87.5% |
Missing | 3 |
Missing (%) | 11.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 366.91667 |
Minimum | 1 |
---|---|
Maximum | 1242 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 371.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 11 |
median | 186.5 |
Q3 | 786.25 |
95-th percentile | 857.7 |
Maximum | 1242 |
Range | 1241 |
Interquartile range (IQR) | 775.25 |
Descriptive statistics
Standard deviation | 395.21507 |
---|---|
Coefficient of variation (CV) | 1.0771249 |
Kurtosis | -1.0237773 |
Mean | 366.91667 |
Median Absolute Deviation (MAD) | 184.5 |
Skewness | 0.62267453 |
Sum | 8806 |
Variance | 156194.95 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
---|---|---|
2 | 3 | 11.1% |
823 | 2 | 7.4% |
15 | 1 | 3.7% |
1242 | 1 | 3.7% |
858 | 1 | 3.7% |
831 | 1 | 3.7% |
856 | 1 | 3.7% |
774 | 1 | 3.7% |
674 | 1 | 3.7% |
524 | 1 | 3.7% |
Other values (11) | 11 | 40.7% |
(Missing) | 3 | 11.1% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
1 | 1 | 3.7% |
2 | 3 | 11.1% |
3 | 1 | 3.7% |
5 | 1 | 3.7% |
13 | 1 | 3.7% |
15 | 1 | 3.7% |
33 | 1 | 3.7% |
48 | 1 | 3.7% |
67 | 1 | 3.7% |
170 | 1 | 3.7% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
1242 | 1 | 3.7% |
858 | 1 | 3.7% |
856 | 1 | 3.7% |
831 | 1 | 3.7% |
823 | 2 | 7.4% |
774 | 1 | 3.7% |
674 | 1 | 3.7% |
524 | 1 | 3.7% |
490 | 1 | 3.7% |
347 | 1 | 3.7% |
미수액
Real number (ℝ)
HIGH CORRELATION
MISSING
Distinct | 24 |
---|---|
Distinct (%) | 100.0% |
Missing | 3 |
Missing (%) | 11.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 9922.25 |
Minimum | 2909 |
---|---|
Maximum | 19870 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 371.0 B |
Quantile statistics
Minimum | 2909 |
---|---|
5-th percentile | 2928.5 |
Q1 | 3909.75 |
median | 9399 |
Q3 | 15180.25 |
95-th percentile | 18477.1 |
Maximum | 19870 |
Range | 16961 |
Interquartile range (IQR) | 11270.5 |
Descriptive statistics
Standard deviation | 6198.4354 |
---|---|
Coefficient of variation (CV) | 0.62470059 |
Kurtosis | -1.8223221 |
Mean | 9922.25 |
Median Absolute Deviation (MAD) | 5585.5 |
Skewness | 0.13869043 |
Sum | 238134 |
Variance | 38420601 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
---|---|---|
15065 | 1 | 3.7% |
13159 | 1 | 3.7% |
2909 | 1 | 3.7% |
13784 | 1 | 3.7% |
2921 | 1 | 3.7% |
14026 | 1 | 3.7% |
2971 | 1 | 3.7% |
14309 | 1 | 3.7% |
2992 | 1 | 3.7% |
14669 | 1 | 3.7% |
Other values (14) | 14 | 51.9% |
(Missing) | 3 | 11.1% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
2909 | 1 | 3.7% |
2921 | 1 | 3.7% |
2971 | 1 | 3.7% |
2992 | 1 | 3.7% |
3757 | 1 | 3.7% |
3870 | 1 | 3.7% |
3923 | 1 | 3.7% |
4597 | 1 | 3.7% |
4794 | 1 | 3.7% |
5029 | 1 | 3.7% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
19870 | 1 | 3.7% |
18652 | 1 | 3.7% |
17486 | 1 | 3.7% |
16710 | 1 | 3.7% |
16211 | 1 | 3.7% |
15526 | 1 | 3.7% |
15065 | 1 | 3.7% |
14669 | 1 | 3.7% |
14309 | 1 | 3.7% |
14026 | 1 | 3.7% |
정리율
Real number (ℝ)
HIGH CORRELATION
MISSING
Distinct | 24 |
---|---|
Distinct (%) | 100.0% |
Missing | 3 |
Missing (%) | 11.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 31.33625 |
Minimum | 7.35 |
---|---|
Maximum | 54.25 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 371.0 B |
Quantile statistics
Minimum | 7.35 |
---|---|
5-th percentile | 10.2205 |
Q1 | 22.0325 |
median | 31.86 |
Q3 | 38.765 |
95-th percentile | 53.8305 |
Maximum | 54.25 |
Range | 46.9 |
Interquartile range (IQR) | 16.7325 |
Descriptive statistics
Standard deviation | 13.776929 |
---|---|
Coefficient of variation (CV) | 0.4396483 |
Kurtosis | -0.69395668 |
Mean | 31.33625 |
Median Absolute Deviation (MAD) | 8.415 |
Skewness | 0.12997533 |
Sum | 752.07 |
Variance | 189.80377 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
---|---|---|
30.87 | 1 | 3.7% |
40.11 | 1 | 3.7% |
54.25 | 1 | 3.7% |
37.15 | 1 | 3.7% |
53.97 | 1 | 3.7% |
35.92 | 1 | 3.7% |
53.04 | 1 | 3.7% |
34.48 | 1 | 3.7% |
52.61 | 1 | 3.7% |
32.85 | 1 | 3.7% |
Other values (14) | 14 | 51.9% |
(Missing) | 3 | 11.1% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
7.35 | 1 | 3.7% |
9.7 | 1 | 3.7% |
13.17 | 1 | 3.7% |
15.95 | 1 | 3.7% |
18.89 | 1 | 3.7% |
19.97 | 1 | 3.7% |
22.72 | 1 | 3.7% |
23.28 | 1 | 3.7% |
25.22 | 1 | 3.7% |
26.51 | 1 | 3.7% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
54.25 | 1 | 3.7% |
53.97 | 1 | 3.7% |
53.04 | 1 | 3.7% |
52.61 | 1 | 3.7% |
40.11 | 1 | 3.7% |
39.74 | 1 | 3.7% |
38.44 | 1 | 3.7% |
37.41 | 1 | 3.7% |
37.15 | 1 | 3.7% |
35.92 | 1 | 3.7% |
Correlations
| | 월 | 구분 | 이월체납조정액 | 정리액(징수액) | 정리액(결손액) | 미수액 | 정리율 |
---|---|---|---|---|---|---|---|
월 | 1.000 | 0.000 | 0.000 | 0.509 | 0.666 | 0.310 | 0.782 |
구분 | 0.000 | 1.000 | 0.990 | 0.987 | 0.551 | 1.000 | 0.187 |
이월체납조정액 | 0.000 | 0.990 | 1.000 | 0.984 | 0.497 | 1.000 | 0.236 |
정리액(징수액) | 0.509 | 0.987 | 0.984 | 1.000 | 0.755 | 0.926 | 0.865 |
정리액(결손액) | 0.666 | 0.551 | 0.497 | 0.755 | 1.000 | 0.530 | 0.821 |
미수액 | 0.310 | 1.000 | 1.000 | 0.926 | 0.530 | 1.000 | 0.600 |
정리율 | 0.782 | 0.187 | 0.236 | 0.865 | 0.821 | 0.600 | 1.000 |
| | 연도 | 구분 |
---|---|---|
연도 | 1.000 | 1.000 |
구분 | 1.000 | 1.000 |
| | 월 | 이월체납조정액 | 정리액(징수액) | 정리액(결손액) | 미수액 | 정리율 | 연도 | 구분 |
---|---|---|---|---|---|---|---|---|
월 | 1.000 | 0.429 | 0.603 | 0.846 | -0.499 | 0.926 | 1.000 | 0.000 |
이월체납조정액 | 0.429 | 1.000 | 0.928 | 0.704 | 0.537 | 0.128 | 1.000 | 0.913 |
정리액(징수액) | 0.603 | 0.928 | 1.000 | 0.768 | 0.367 | 0.338 | 1.000 | 0.721 |
정리액(결손액) | 0.846 | 0.704 | 0.768 | 1.000 | -0.108 | 0.691 | 1.000 | 0.339 |
미수액 | -0.499 | 0.537 | 0.367 | -0.108 | 1.000 | -0.712 | 1.000 | 0.905 |
정리율 | 0.926 | 0.128 | 0.338 | 0.691 | -0.712 | 1.000 | 1.000 | 0.036 |
연도 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
구분 | 0.000 | 0.913 | 0.721 | 0.339 | 0.905 | 0.036 | 1.000 | 1.000 |
First rows
| | 연도 | 월 | 구분 | 이월체납조정액 | 정리액(징수액) | 정리액(결손액) | 미수액 | 정리율 |
---|---|---|---|---|---|---|---|---|
0 | 2023 | 1 | 시세 | 6245 | 605 | 1 | 5639 | 9.7 |
1 | 2023 | 1 | 도세 | 21447 | 1544 | 33 | 19870 | 7.35 |
2 | 2023 | 2 | 시세 | 6264 | 997 | 2 | 5265 | 15.95 |
3 | 2023 | 2 | 도세 | 21482 | 2782 | 48 | 18652 | 13.17 |
4 | 2023 | 3 | 시세 | 6284 | 1253 | 2 | 5029 | 19.97 |
5 | 2023 | 3 | 도세 | 21559 | 4006 | 67 | 17486 | 18.89 |
6 | 2023 | 4 | 시세 | 6249 | 1453 | 2 | 4794 | 23.28 |
7 | 2023 | 4 | 도세 | 21624 | 4744 | 170 | 16710 | 22.72 |
8 | 2023 | 5 | 시세 | 6255 | 1655 | 3 | 4597 | 26.51 |
9 | 2023 | 5 | 도세 | 21678 | 5264 | 203 | 16211 | 25.22 |
Last rows
| | 연도 | 월 | 구분 | 이월체납조정액 | 정리액(징수액) | 정리액(결손액) | 미수액 | 정리율 |
---|---|---|---|---|---|---|---|---|
17 | 2023 | 9 | 도세 | 21840 | 6857 | 674 | 14309 | 34.48 |
18 | 2023 | 10 | 시세 | 6326 | 2532 | 823 | 2971 | 53.04 |
19 | 2023 | 10 | 도세 | 21887 | 7087 | 774 | 14026 | 35.92 |
20 | 2023 | 11 | 시세 | 6346 | 2569 | 856 | 2921 | 53.97 |
21 | 2023 | 11 | 도세 | 21932 | 7317 | 831 | 13784 | 37.15 |
22 | 2023 | 12 | 시세 | 6358 | 2591 | 858 | 2909 | 54.25 |
23 | 2023 | 12 | 도세 | 21973 | 7572 | 1242 | 13159 | 40.11 |
24 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
25 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
26 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
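The sample rows suggest how the amount columns relate to one another. The sketch below checks two relationships that are assumptions inferred from the displayed rows, not documented by the data provider: 미수액 = 이월체납조정액 − 정리액(징수액) − 정리액(결손액), and 정리율 = (정리액(징수액) + 정리액(결손액)) / 이월체납조정액 × 100. Only the 17 fully populated rows shown in this report (index 0 to 9 and 17 to 23) are used, and the English variable names are illustrative translations.

```python
# Rows copied from the first-rows and last-rows tables:
# (이월체납조정액, 정리액(징수액), 정리액(결손액), 미수액, 정리율)
rows = [
    (6245, 605, 1, 5639, 9.7),
    (21447, 1544, 33, 19870, 7.35),
    (6264, 997, 2, 5265, 15.95),
    (21482, 2782, 48, 18652, 13.17),
    (6284, 1253, 2, 5029, 19.97),
    (21559, 4006, 67, 17486, 18.89),
    (6249, 1453, 2, 4794, 23.28),
    (21624, 4744, 170, 16710, 22.72),
    (6255, 1655, 3, 4597, 26.51),
    (21678, 5264, 203, 16211, 25.22),
    (21840, 6857, 674, 14309, 34.48),
    (6326, 2532, 823, 2971, 53.04),
    (21887, 7087, 774, 14026, 35.92),
    (6346, 2569, 856, 2921, 53.97),
    (21932, 7317, 831, 13784, 37.15),
    (6358, 2591, 858, 2909, 54.25),
    (21973, 7572, 1242, 13159, 40.11),
]


def is_consistent(row, tolerance=0.01):
    """Return True if a row satisfies both assumed relationships."""
    carried, collected, written_off, outstanding, rate = row
    balance_ok = carried - collected - written_off == outstanding
    derived_rate = 100 * (collected + written_off) / carried
    rate_ok = abs(derived_rate - rate) < tolerance
    return balance_ok and rate_ok


mismatches = [row for row in rows if not is_consistent(row)]
print(f"{len(rows) - len(mismatches)} of {len(rows)} rows consistent")
```

Relationships of this kind would also help explain why so many columns trigger the high-correlation alert: 미수액 and 정리율 would be determined by the other three amount columns rather than measured independently.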
Most frequently occurring
| | 연도 | 월 | 구분 | 이월체납조정액 | 정리액(징수액) | 정리액(결손액) | 미수액 | 정리율 | # duplicates |
---|---|---|---|---|---|---|---|---|---|
0 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 3 |