Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 2983 |
Missing cells | 1468 |
Missing cells (%) | 5.5% |
Duplicate rows | 17 |
Duplicate rows (%) | 0.6% |
Total size in memory | 224.4 KiB |
Average record size in memory | 77.0 B |
Variable types
Categorical | 1 |
---|---|
Numeric | 4 |
Text | 1 |
DateTime | 2 |
Boolean | 1 |
Dataset
Description | 오산시 지방세ARS카드납부시스템 분납내역 세목명,내용,등록일자, 기간설정,금액설정 등의 항목을 제공합니다 |
---|---|
Author | 경기도 오산시 |
URL | https://www.data.go.kr/data/15090250/fileData.do |
Dataset has 17 (0.6%) duplicate rows | Duplicates |
수정일자 has 1468 (49.2%) missing values | Missing |
달설정 is highly skewed (γ1 = 37.71166911) | Skewed |
Reproduction
Analysis started | 2023-12-12 17:23:16.414494 |
---|---|
Analysis finished | 2023-12-12 17:23:19.396993 |
Duration | 2.98 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
분납구분
Categorical
Distinct | 2 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 23.4 KiB |
1 | |
---|---|
2 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 1686 | |
2 | 1297 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1 | 1686 | |
2 | 1297 |
총금액
Real number (ℝ)
Distinct | 2671 |
---|---|
Distinct (%) | 89.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1921184.6 |
Minimum | 10000 |
---|---|
Maximum | 1.23 × 108 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 26.3 KiB |
Quantile statistics
Minimum | 10000 |
---|---|
5-th percentile | 236839 |
Q1 | 600510 |
median | 1093000 |
Q3 | 2135760 |
95-th percentile | 4944702 |
Maximum | 1.23 × 108 |
Range | 1.2299 × 108 |
Interquartile range (IQR) | 1535250 |
Descriptive statistics
Standard deviation | 4005507.3 |
---|---|
Coefficient of variation (CV) | 2.0849153 |
Kurtosis | 331.71864 |
Mean | 1921184.6 |
Median Absolute Deviation (MAD) | 629970 |
Skewness | 14.464479 |
Sum | 5.7308937 × 109 |
Variance | 1.6044089 × 1013 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
200000 | 8 | 0.3% |
1410630 | 6 | 0.2% |
4893410 | 6 | 0.2% |
1200000 | 6 | 0.2% |
1628370 | 5 | 0.2% |
2293570 | 5 | 0.2% |
747710 | 5 | 0.2% |
4164700 | 5 | 0.2% |
828080 | 5 | 0.2% |
904330 | 4 | 0.1% |
Other values (2661) | 2928 |
Value | Count | Frequency (%) |
10000 | 1 | |
17460 | 1 | |
20000 | 2 | |
23540 | 1 | |
31000 | 1 | |
39380 | 1 | |
40000 | 1 | |
50000 | 1 | |
50770 | 1 | |
51070 | 1 |
Value | Count | Frequency (%) |
123000000 | 1 | |
53186000 | 2 | |
52092690 | 1 | |
48969080 | 1 | |
46611470 | 1 | |
46226820 | 1 | |
40964625 | 1 | |
35310810 | 1 | |
35125600 | 1 | |
33025840 | 1 |
설정금액
Real number (ℝ)
Distinct | 210 |
---|---|
Distinct (%) | 7.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 217365.38 |
Minimum | 0 |
---|---|
Maximum | 12000000 |
Zeros | 7 |
Zeros (%) | 0.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 26.3 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 50000 |
Q1 | 100000 |
median | 150000 |
Q3 | 200000 |
95-th percentile | 500000 |
Maximum | 12000000 |
Range | 12000000 |
Interquartile range (IQR) | 100000 |
Descriptive statistics
Standard deviation | 392138.89 |
---|---|
Coefficient of variation (CV) | 1.8040541 |
Kurtosis | 422.94402 |
Mean | 217365.38 |
Median Absolute Deviation (MAD) | 50000 |
Skewness | 16.855822 |
Sum | 6.4840092 × 108 |
Variance | 1.5377291 × 1011 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
100000 | 927 | |
200000 | 576 | |
50000 | 272 | 9.1% |
300000 | 269 | 9.0% |
150000 | 200 | 6.7% |
500000 | 96 | 3.2% |
250000 | 62 | 2.1% |
400000 | 54 | 1.8% |
30000 | 26 | 0.9% |
70000 | 23 | 0.8% |
Other values (200) | 478 |
Value | Count | Frequency (%) |
0 | 7 | |
3 | 2 | 0.1% |
10 | 2 | 0.1% |
20 | 8 | |
22 | 2 | 0.1% |
30 | 8 | |
34 | 4 | 0.1% |
5000 | 1 | < 0.1% |
10000 | 11 | |
12889 | 2 | 0.1% |
Value | Count | Frequency (%) |
12000000 | 1 | < 0.1% |
10000000 | 1 | < 0.1% |
5000000 | 2 | 0.1% |
3000000 | 4 | 0.1% |
2581140 | 1 | < 0.1% |
2511090 | 1 | < 0.1% |
2047090 | 1 | < 0.1% |
2000000 | 13 | |
1900000 | 1 | < 0.1% |
1800000 | 1 | < 0.1% |
세목
Text
Distinct | 278 |
---|---|
Distinct (%) | 9.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 23.4 KiB |
Value | Count | Frequency (%) |
자동차세 | 780 | |
세외수입 | 613 | |
지방세 | 346 | 8.9% |
자동차세외 | 334 | 8.6% |
외 | 298 | 7.7% |
환경개선부담금 | 136 | 3.5% |
지방소득세 | 115 | 3.0% |
과태료 | 95 | 2.5% |
재산세 | 80 | 2.1% |
및 | 76 | 2.0% |
Other values (188) | 998 |
Most occurring characters
Value | Count | Frequency (%) |
세 | 3005 | |
외 | 1687 | 9.4% |
차 | 1386 | 7.8% |
동 | 1219 | 6.8% |
자 | 1215 | 6.8% |
894 | 5.0% | |
입 | 841 | 4.7% |
수 | 840 | 4.7% |
지 | 764 | 4.3% |
방 | 734 | 4.1% |
Other values (117) | 5297 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 16253 | |
Space Separator | 894 | 5.0% |
Other Punctuation | 618 | 3.5% |
Decimal Number | 70 | 0.4% |
Math Symbol | 19 | 0.1% |
Close Punctuation | 14 | 0.1% |
Open Punctuation | 14 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
세 | 3005 | |
외 | 1687 | 10.4% |
차 | 1386 | 8.5% |
동 | 1219 | 7.5% |
자 | 1215 | 7.5% |
입 | 841 | 5.2% |
수 | 840 | 5.2% |
지 | 764 | 4.7% |
방 | 734 | 4.5% |
환 | 287 | 1.8% |
Other values (100) | 4275 |
Decimal Number
Value | Count | Frequency (%) |
1 | 18 | |
5 | 14 | |
3 | 12 | |
4 | 6 | 8.6% |
6 | 6 | 8.6% |
2 | 6 | 8.6% |
7 | 5 | 7.1% |
0 | 2 | 2.9% |
8 | 1 | 1.4% |
Other Punctuation
Value | Count | Frequency (%) |
, | 582 | |
. | 34 | 5.5% |
& | 1 | 0.2% |
/ | 1 | 0.2% |
Space Separator
Value | Count | Frequency (%) |
894 |
Math Symbol
Value | Count | Frequency (%) |
+ | 19 |
Close Punctuation
Value | Count | Frequency (%) |
) | 14 |
Open Punctuation
Value | Count | Frequency (%) |
( | 14 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 16253 | |
Common | 1629 | 9.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
세 | 3005 | |
외 | 1687 | 10.4% |
차 | 1386 | 8.5% |
동 | 1219 | 7.5% |
자 | 1215 | 7.5% |
입 | 841 | 5.2% |
수 | 840 | 5.2% |
지 | 764 | 4.7% |
방 | 734 | 4.5% |
환 | 287 | 1.8% |
Other values (100) | 4275 |
Common
Value | Count | Frequency (%) |
894 | ||
, | 582 | |
. | 34 | 2.1% |
+ | 19 | 1.2% |
1 | 18 | 1.1% |
) | 14 | 0.9% |
5 | 14 | 0.9% |
( | 14 | 0.9% |
3 | 12 | 0.7% |
4 | 6 | 0.4% |
Other values (7) | 22 | 1.4% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 16252 | |
ASCII | 1629 | 9.1% |
Compat Jamo | 1 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
세 | 3005 | |
외 | 1687 | 10.4% |
차 | 1386 | 8.5% |
동 | 1219 | 7.5% |
자 | 1215 | 7.5% |
입 | 841 | 5.2% |
수 | 840 | 5.2% |
지 | 764 | 4.7% |
방 | 734 | 4.5% |
환 | 287 | 1.8% |
Other values (99) | 4274 |
ASCII
Value | Count | Frequency (%) |
894 | ||
, | 582 | |
. | 34 | 2.1% |
+ | 19 | 1.2% |
1 | 18 | 1.1% |
) | 14 | 0.9% |
5 | 14 | 0.9% |
( | 14 | 0.9% |
3 | 12 | 0.7% |
4 | 6 | 0.4% |
Other values (7) | 22 | 1.4% |
Compat Jamo
Value | Count | Frequency (%) |
ㅌ | 1 |
달설정
Real number (ℝ)
SKEWED
 
Distinct | 19 |
---|---|
Distinct (%) | 0.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.1716393 |
Minimum | 1 |
---|---|
Maximum | 130 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 26.3 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 1 |
Q3 | 1 |
95-th percentile | 1 |
Maximum | 130 |
Range | 129 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 2.7180323 |
---|---|
Coefficient of variation (CV) | 2.3198542 |
Kurtosis | 1713.456 |
Mean | 1.1716393 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 37.711669 |
Sum | 3495 |
Variance | 7.3876998 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 2926 | |
3 | 10 | 0.3% |
4 | 9 | 0.3% |
2 | 8 | 0.3% |
10 | 5 | 0.2% |
14 | 5 | 0.2% |
5 | 4 | 0.1% |
9 | 2 | 0.1% |
11 | 2 | 0.1% |
6 | 2 | 0.1% |
Other values (9) | 10 | 0.3% |
Value | Count | Frequency (%) |
1 | 2926 | |
2 | 8 | 0.3% |
3 | 10 | 0.3% |
4 | 9 | 0.3% |
5 | 4 | 0.1% |
6 | 2 | 0.1% |
7 | 2 | 0.1% |
8 | 1 | < 0.1% |
9 | 2 | 0.1% |
10 | 5 | 0.2% |
Value | Count | Frequency (%) |
130 | 1 | < 0.1% |
39 | 1 | < 0.1% |
30 | 1 | < 0.1% |
22 | 1 | < 0.1% |
19 | 1 | < 0.1% |
18 | 1 | < 0.1% |
15 | 1 | < 0.1% |
14 | 5 | |
11 | 2 | 0.1% |
10 | 5 |
일설정
Real number (ℝ)
Distinct | 33 |
---|---|
Distinct (%) | 1.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 16.266845 |
Minimum | 1 |
---|---|
Maximum | 256 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 26.3 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 7 |
median | 15 |
Q3 | 25 |
95-th percentile | 30 |
Maximum | 256 |
Range | 255 |
Interquartile range (IQR) | 18 |
Descriptive statistics
Standard deviation | 12.026653 |
---|---|
Coefficient of variation (CV) | 0.73933527 |
Kurtosis | 103.1205 |
Mean | 16.266845 |
Median Absolute Deviation (MAD) | 10 |
Skewness | 5.1760825 |
Sum | 48524 |
Variance | 144.64037 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 520 | |
30 | 447 | |
15 | 302 | |
25 | 290 | |
20 | 275 | |
10 | 264 | |
5 | 102 | 3.4% |
29 | 62 | 2.1% |
28 | 59 | 2.0% |
11 | 58 | 1.9% |
Other values (23) | 604 |
Value | Count | Frequency (%) |
1 | 520 | |
2 | 28 | 0.9% |
3 | 22 | 0.7% |
4 | 19 | 0.6% |
5 | 102 | 3.4% |
6 | 45 | 1.5% |
7 | 29 | 1.0% |
8 | 20 | 0.7% |
9 | 19 | 0.6% |
10 | 264 |
Value | Count | Frequency (%) |
256 | 1 | < 0.1% |
255 | 1 | < 0.1% |
31 | 19 | 0.6% |
30 | 447 | |
29 | 62 | 2.1% |
28 | 59 | 2.0% |
27 | 46 | 1.5% |
26 | 50 | 1.7% |
25 | 290 | |
24 | 19 | 0.6% |
등록일자
Date
Distinct | 687 |
---|---|
Distinct (%) | 23.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 23.4 KiB |
Minimum | 2014-10-07 00:00:00 |
---|---|
Maximum | 2023-08-11 00:00:00 |
수정일자
Date
MISSING
 
Distinct | 469 |
---|---|
Distinct (%) | 31.0% |
Missing | 1468 |
Missing (%) | 49.2% |
Memory size | 23.4 KiB |
Minimum | 2014-10-08 00:00:00 |
---|---|
Maximum | 2023-08-11 00:00:00 |
삭제여부
Boolean
Distinct | 2 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.0 KiB |
False | |
---|---|
True |
Value | Count | Frequency (%) |
False | 1640 | |
True | 1343 |
분납구분 | 총금액 | 설정금액 | 달설정 | 일설정 | 삭제여부 | |
---|---|---|---|---|---|---|
분납구분 | 1.000 | 0.039 | 0.057 | 0.000 | 0.057 | 0.238 |
총금액 | 0.039 | 1.000 | 0.559 | 0.000 | 0.000 | 0.000 |
설정금액 | 0.057 | 0.559 | 1.000 | 0.000 | 0.000 | 0.000 |
달설정 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.041 |
일설정 | 0.057 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 |
삭제여부 | 0.238 | 0.000 | 0.000 | 0.041 | 0.000 | 1.000 |
분납구분 | 삭제여부 | |
---|---|---|
분납구분 | 1.000 | 0.153 |
삭제여부 | 0.153 | 1.000 |
총금액 | 설정금액 | 달설정 | 일설정 | 분납구분 | 삭제여부 | |
---|---|---|---|---|---|---|
총금액 | 1.000 | 0.279 | -0.001 | 0.027 | 0.028 | 0.000 |
설정금액 | 0.279 | 1.000 | 0.079 | 0.022 | 0.041 | 0.000 |
달설정 | -0.001 | 0.079 | 1.000 | 0.009 | 0.000 | 0.027 |
일설정 | 0.027 | 0.022 | 0.009 | 1.000 | 0.095 | 0.000 |
분납구분 | 0.028 | 0.041 | 0.000 | 0.095 | 1.000 | 0.153 |
삭제여부 | 0.000 | 0.000 | 0.027 | 0.000 | 0.153 | 1.000 |
분납구분 | 총금액 | 설정금액 | 세목 | 달설정 | 일설정 | 등록일자 | 수정일자 | 삭제여부 | |
---|---|---|---|---|---|---|---|---|---|
0 | 1 | 2382900 | 100000 | 자동차세 | 1 | 1 | 2014-10-07 | 2015-06-01 | Y |
1 | 1 | 5951000 | 500000 | 취득세 | 1 | 1 | 2014-10-07 | 2015-05-14 | Y |
2 | 1 | 188620 | 100000 | 자동차세 | 1 | 1 | 2014-10-07 | 2015-04-23 | N |
3 | 1 | 839080 | 100000 | 자동차세 | 1 | 1 | 2014-10-07 | 2015-08-25 | Y |
4 | 1 | 173600 | 100000 | 등록세 | 1 | 1 | 2014-10-07 | <NA> | N |
5 | 2 | 901560 | 300000 | 과태료 | 1 | 1 | 2014-10-07 | <NA> | N |
6 | 1 | 212210 | 50000 | 지방소득세 | 1 | 1 | 2014-10-07 | <NA> | N |
7 | 1 | 325080 | 100000 | 자동차세 | 1 | 1 | 2014-10-07 | <NA> | N |
8 | 1 | 1042420 | 100000 | 재산세 | 1 | 1 | 2014-10-07 | <NA> | N |
9 | 1 | 810440 | 200000 | 자동차세 | 1 | 1 | 2014-10-07 | <NA> | N |
분납구분 | 총금액 | 설정금액 | 세목 | 달설정 | 일설정 | 등록일자 | 수정일자 | 삭제여부 | |
---|---|---|---|---|---|---|---|---|---|
2973 | 1 | 414490 | 300000 | 지방세 | 2 | 1 | 2023-06-22 | <NA> | N |
2974 | 1 | 2575000 | 150000 | 자동차세 | 1 | 25 | 2023-08-11 | <NA> | N |
2975 | 1 | 3400000 | 100000 | 자동차 | 1 | 25 | 2023-08-11 | <NA> | N |
2976 | 1 | 7716860 | 100000 | 차 | 1 | 25 | 2023-08-11 | <NA> | N |
2977 | 1 | 3887280 | 200000 | 차 | 1 | 25 | 2023-08-11 | <NA> | N |
2978 | 1 | 1966430 | 100000 | 차 | 1 | 20 | 2023-08-11 | <NA> | N |
2979 | 1 | 4447410 | 100000 | 차 | 1 | 25 | 2023-08-11 | <NA> | N |
2980 | 1 | 4300000 | 100000 | 차 | 1 | 25 | 2023-08-11 | 2023-08-11 | Y |
2981 | 1 | 3598820 | 200000 | 결손 | 1 | 25 | 2023-08-11 | <NA> | N |
2982 | 1 | 4060300 | 300000 | 차 | 1 | 14 | 2023-08-11 | <NA> | N |
Most frequently occurring
분납구분 | 총금액 | 설정금액 | 세목 | 달설정 | 일설정 | 등록일자 | 수정일자 | 삭제여부 | # duplicates | |
---|---|---|---|---|---|---|---|---|---|---|
13 | 2 | 1779680 | 30 | 손해배상 | 1 | 30 | 2014-12-15 | <NA> | N | 4 |
15 | 2 | 4164700 | 34 | 세외수입, 지방세 | 1 | 11 | 2015-01-14 | <NA> | N | 4 |
16 | 2 | 4893410 | 30 | 손해배상,주정차 | 1 | 30 | 2014-12-15 | <NA> | N | 4 |
0 | 1 | 200000 | 100000 | 테스트 | 1 | 13 | 2014-10-15 | 2014-10-16 | Y | 3 |
7 | 1 | 2085340 | 20 | 자동차세 | 1 | 15 | 2015-07-01 | <NA> | N | 3 |
10 | 2 | 536400 | 0 | 주정차위반 | 1 | 10 | 2017-04-07 | <NA> | N | 3 |
1 | 1 | 690030 | 690030 | 자동차세 | 4 | 15 | 2015-02-13 | 2015-02-13 | Y | 2 |
2 | 1 | 747710 | 10 | 재산세 | 1 | 11 | 2015-05-12 | <NA> | N | 2 |
3 | 1 | 768500 | 0 | 자동차세 | 1 | 30 | 2015-06-11 | <NA> | N | 2 |
4 | 1 | 904330 | 20 | 지방세등 | 1 | 15 | 2015-05-19 | <NA> | N | 2 |