Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 269 |
Missing cells | 236 |
Missing cells (%) | 14.6% |
Duplicate rows | 19 |
Duplicate rows (%) | 7.1% |
Total size in memory | 13.5 KiB |
Average record size in memory | 51.5 B |
Variable types
Categorical | 2 |
---|---|
Numeric | 2 |
DateTime | 2 |
Dataset
Description | 오산시 지방세 ARS카드납부시스템의 선택납부에 대한 데이터로 총합계금액, 선택납부 횟수 등의 항목을 제공합니다. |
---|---|
URL | https://www.data.go.kr/data/15081647/fileData.do |
Dataset has 19 (7.1%) duplicate rows | Duplicates |
과세구분 is highly overall correlated with 총합계금액 and 2 other fields | High correlation |
시구분 is highly overall correlated with 총합계금액 and 2 other fields | High correlation |
총합계금액 is highly overall correlated with 과세구분 and 1 other fields | High correlation |
횟수 is highly overall correlated with 과세구분 and 1 other fields | High correlation |
총합계금액 has 59 (21.9%) missing values | Missing |
횟수 has 59 (21.9%) missing values | Missing |
등록일자 has 59 (21.9%) missing values | Missing |
유지만료일자 has 59 (21.9%) missing values | Missing |
Reproduction
Analysis started | 2023-12-12 12:31:03.538167 |
---|---|
Analysis finished | 2023-12-12 12:31:04.626686 |
Duration | 1.09 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
과세구분
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | 0.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.2 KiB |
통합 | |
---|---|
<NA> |
Length
Max length | 4 |
---|---|
Median length | 2 |
Mean length | 2.4386617 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 통합 |
---|---|
2nd row | 통합 |
3rd row | 통합 |
4th row | 통합 |
5th row | 통합 |
Common Values
Value | Count | Frequency (%) |
통합 | 210 | |
<NA> | 59 | 21.9% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
통합 | 210 | |
na | 59 | 21.9% |
총합계금액
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 99 |
---|---|
Distinct (%) | 47.1% |
Missing | 59 |
Missing (%) | 21.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 801470.24 |
Minimum | 15450 |
---|---|
Maximum | 8506860 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.5 KiB |
Quantile statistics
Minimum | 15450 |
---|---|
5-th percentile | 100000 |
Q1 | 181540 |
median | 300000 |
Q3 | 747657.5 |
95-th percentile | 3000000 |
Maximum | 8506860 |
Range | 8491410 |
Interquartile range (IQR) | 566117.5 |
Descriptive statistics
Standard deviation | 1355436.2 |
---|---|
Coefficient of variation (CV) | 1.6911872 |
Kurtosis | 14.242091 |
Mean | 801470.24 |
Median Absolute Deviation (MAD) | 200000 |
Skewness | 3.5978903 |
Sum | 1.6830875 × 108 |
Variance | 1.8372074 × 1012 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
100000 | 27 | 10.0% |
1000000 | 18 | 6.7% |
300000 | 16 | 5.9% |
500000 | 12 | 4.5% |
2000000 | 11 | 4.1% |
200000 | 6 | 2.2% |
560310 | 4 | 1.5% |
3000000 | 3 | 1.1% |
243040 | 3 | 1.1% |
4700000 | 3 | 1.1% |
Other values (89) | 107 | |
(Missing) | 59 |
Value | Count | Frequency (%) |
15450 | 1 | 0.4% |
40000 | 1 | 0.4% |
42410 | 1 | 0.4% |
55760 | 1 | 0.4% |
66580 | 1 | 0.4% |
71310 | 1 | 0.4% |
88290 | 1 | 0.4% |
100000 | 27 | |
104230 | 1 | 0.4% |
105810 | 1 | 0.4% |
Value | Count | Frequency (%) |
8506860 | 1 | 0.4% |
8000000 | 2 | 0.7% |
6000000 | 1 | 0.4% |
5628860 | 2 | 0.7% |
5000000 | 1 | 0.4% |
4700000 | 3 | 1.1% |
3000000 | 3 | 1.1% |
2889800 | 1 | 0.4% |
2135950 | 1 | 0.4% |
2000000 | 11 |
횟수
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 12 |
---|---|
Distinct (%) | 5.7% |
Missing | 59 |
Missing (%) | 21.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.0047619 |
Minimum | 1 |
---|---|
Maximum | 27 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.5 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 1 |
Q3 | 2 |
95-th percentile | 6 |
Maximum | 27 |
Range | 26 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 2.7728996 |
---|---|
Coefficient of variation (CV) | 1.3831566 |
Kurtosis | 40.31584 |
Mean | 2.0047619 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 5.5931787 |
Sum | 421 |
Variance | 7.6889724 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 146 | |
2 | 25 | 9.3% |
3 | 15 | 5.6% |
4 | 11 | 4.1% |
10 | 3 | 1.1% |
6 | 3 | 1.1% |
9 | 2 | 0.7% |
11 | 1 | 0.4% |
20 | 1 | 0.4% |
7 | 1 | 0.4% |
Other values (2) | 2 | 0.7% |
(Missing) | 59 |
Value | Count | Frequency (%) |
1 | 146 | |
2 | 25 | 9.3% |
3 | 15 | 5.6% |
4 | 11 | 4.1% |
5 | 1 | 0.4% |
6 | 3 | 1.1% |
7 | 1 | 0.4% |
9 | 2 | 0.7% |
10 | 3 | 1.1% |
11 | 1 | 0.4% |
Value | Count | Frequency (%) |
27 | 1 | 0.4% |
20 | 1 | 0.4% |
11 | 1 | 0.4% |
10 | 3 | 1.1% |
9 | 2 | 0.7% |
7 | 1 | 0.4% |
6 | 3 | 1.1% |
5 | 1 | 0.4% |
4 | 11 | |
3 | 15 |
시구분
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | 0.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.2 KiB |
41370 | |
---|---|
<NA> |
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 4.7806691 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 41370 |
---|---|
2nd row | 41370 |
3rd row | 41370 |
4th row | 41370 |
5th row | 41370 |
Common Values
Value | Count | Frequency (%) |
41370 | 210 | |
<NA> | 59 | 21.9% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
41370 | 210 | |
na | 59 | 21.9% |
등록일자
Date
MISSING
 
Distinct | 115 |
---|---|
Distinct (%) | 54.8% |
Missing | 59 |
Missing (%) | 21.9% |
Memory size | 2.2 KiB |
Minimum | 2022-01-03 00:00:00 |
---|---|
Maximum | 2022-12-30 00:00:00 |
유지만료일자
Date
MISSING
 
Distinct | 113 |
---|---|
Distinct (%) | 53.8% |
Missing | 59 |
Missing (%) | 21.9% |
Memory size | 2.2 KiB |
Minimum | 2022-01-04 00:00:00 |
---|---|
Maximum | 2022-12-31 00:00:00 |
총합계금액 | 횟수 | |
---|---|---|
총합계금액 | 1.000 | 0.411 |
횟수 | 0.411 | 1.000 |
과세구분 | 시구분 | |
---|---|---|
과세구분 | 1.000 | 1.000 |
시구분 | 1.000 | 1.000 |
총합계금액 | 횟수 | 과세구분 | 시구분 | |
---|---|---|---|---|
총합계금액 | 1.000 | 0.187 | 1.000 | 1.000 |
횟수 | 0.187 | 1.000 | 1.000 | 1.000 |
과세구분 | 1.000 | 1.000 | 1.000 | 1.000 |
시구분 | 1.000 | 1.000 | 1.000 | 1.000 |
과세구분 | 총합계금액 | 횟수 | 시구분 | 등록일자 | 유지만료일자 | |
---|---|---|---|---|---|---|
0 | 통합 | 133750 | 1 | 41370 | 2022-01-03 | 2022-01-04 |
1 | 통합 | 577210 | 11 | 41370 | 2022-01-03 | 2022-01-04 |
2 | 통합 | 461640 | 1 | 41370 | 2022-01-03 | 2022-01-04 |
3 | 통합 | 461640 | 1 | 41370 | 2022-01-03 | 2022-01-04 |
4 | 통합 | 133280 | 1 | 41370 | 2022-01-04 | 2022-01-05 |
5 | 통합 | 461640 | 1 | 41370 | 2022-01-10 | 2022-01-11 |
6 | 통합 | 267120 | 1 | 41370 | 2022-01-11 | 2022-01-12 |
7 | 통합 | 294950 | 1 | 41370 | 2022-01-13 | 2022-01-14 |
8 | 통합 | 560310 | 1 | 41370 | 2022-01-13 | 2022-01-14 |
9 | 통합 | 560310 | 1 | 41370 | 2022-01-13 | 2022-01-14 |
과세구분 | 총합계금액 | 횟수 | 시구분 | 등록일자 | 유지만료일자 | |
---|---|---|---|---|---|---|
259 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
260 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
261 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
262 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
263 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
264 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
265 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
266 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
267 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
268 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
Most frequently occurring
과세구분 | 총합계금액 | 횟수 | 시구분 | 등록일자 | 유지만료일자 | # duplicates | |
---|---|---|---|---|---|---|---|
18 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 59 |
13 | 통합 | 2000000 | 4 | 41370 | 2022-01-24 | 2022-01-25 | 5 |
5 | 통합 | 560310 | 1 | 41370 | 2022-01-13 | 2022-01-14 | 4 |
2 | 통합 | 300000 | 1 | 41370 | 2022-04-01 | 2022-04-02 | 3 |
11 | 통합 | 1000000 | 10 | 41370 | 2022-05-03 | 2022-05-04 | 3 |
12 | 통합 | 2000000 | 1 | 41370 | 2022-12-20 | 2022-12-21 | 3 |
15 | 통합 | 4700000 | 1 | 41370 | 2022-02-17 | 2022-02-17 | 3 |
0 | 통합 | 131520 | 1 | 41370 | 2022-01-21 | 2022-01-22 | 2 |
1 | 통합 | 249700 | 1 | 41370 | 2022-02-21 | 2022-02-21 | 2 |
3 | 통합 | 461640 | 1 | 41370 | 2022-01-03 | 2022-01-04 | 2 |