Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 45 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 6 |
Duplicate rows (%) | 13.3% |
Total size in memory | 2.7 KiB |
Average record size in memory | 61.9 B |
Variable types
DateTime | 3 |
---|---|
Categorical | 2 |
Numeric | 2 |
Dataset
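Description | Information on forest-tending (숲가꾸기) projects carried out in public and private forests and recorded in the Private Forest Work Support Portal (사유림업무지원포털), including contract start date (계약시작일), contract end date (계약종료일), contract deposit (계약보증금), delay penalty rate (지체상금율), and service amount (용역금액) |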
---|---|
Author | Korea Forest Service (산림청) |
URL | https://www.data.go.kr/data/15071634/fileData.do |
Dataset has 6 (13.3%) duplicate rows | Duplicates |
계약보증금 is highly overall correlated with 용역금액 and 1 other field | High correlation |
용역금액 is highly overall correlated with 계약보증금 and 1 other field | High correlation |
지체상금율 is highly overall correlated with 계약보증금 and 1 other field | High correlation |
지체상금율 is highly imbalanced (84.6%) | Imbalance |
Reproduction
Analysis started | 2023-12-12 09:43:10.343657 |
---|---|
Analysis finished | 2023-12-12 09:43:11.247270 |
Duration | 0.9 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
계약일자 (contract date)
Date
Distinct | 17 |
---|---|
Distinct (%) | 37.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 492.0 B |
Minimum | 2018-06-15 00:00:00 |
---|---|
Maximum | 2020-06-10 00:00:00 |
계약시작일 (contract start date)
Date
Distinct | 19 |
---|---|
Distinct (%) | 42.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 492.0 B |
Minimum | 2018-06-15 00:00:00 |
---|---|
Maximum | 2020-06-11 00:00:00 |
계약종료일 (contract end date)
Date
Distinct | 16 |
---|---|
Distinct (%) | 35.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 492.0 B |
Minimum | 2018-06-29 00:00:00 |
---|---|
Maximum | 2020-08-12 00:00:00 |
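The three date variables cover nearly the same two-year window. As a small illustration (the dictionary and variable names are made up for this sketch, and only the minimum and maximum values reported above are used), the span of each variable in days can be computed with the standard library:

```python
from datetime import date

# (minimum, maximum) per date variable, copied from the summaries above.
ranges = {
    "계약일자": (date(2018, 6, 15), date(2020, 6, 10)),
    "계약시작일": (date(2018, 6, 15), date(2020, 6, 11)),
    "계약종료일": (date(2018, 6, 29), date(2020, 8, 12)),
}

# Number of days between the earliest and latest value of each variable.
spans = {name: (latest - earliest).days for name, (earliest, latest) in ranges.items()}

for name, days in spans.items():
    print(f"{name}: {days} days")
```

Because 2020 is a leap year, each span includes 29 February 2020.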
용역업체대표자명 (name of the service contractor's representative)
Categorical
Distinct | 16 |
---|---|
Distinct (%) | 35.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 492.0 B |
Value | Count |
---|---|
이** | 13 |
김** | 13 |
오** | 3 |
용** | 2 |
박** | 2 |
Other values (11) | 12 |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 10 |
---|---|
Unique (%) | 22.2% |
Sample
1st row | 신** |
---|---|
2nd row | 이** |
3rd row | 이** |
4th row | 노** |
5th row | 이** |
Common Values
Value | Count | Frequency (%) |
이** | 13 | 28.9% |
김** | 13 | 28.9% |
오** | 3 | 6.7% |
용** | 2 | 4.4% |
박** | 2 | 4.4% |
황** | 2 | 4.4% |
신** | 1 | 2.2% |
노** | 1 | 2.2% |
유** | 1 | 2.2% |
홍** | 1 | 2.2% |
Other values (6) | 6 | 13.3% |
Words
Value | Count | Frequency (%) |
이 | 13 | 28.9% |
김 | 13 | 28.9% |
오 | 3 | 6.7% |
용 | 2 | 4.4% |
박 | 2 | 4.4% |
황 | 2 | 4.4% |
신 | 1 | 2.2% |
노 | 1 | 2.2% |
유 | 1 | 2.2% |
홍 | 1 | 2.2% |
Other values (6) | 6 | 13.3% |
계약보증금 (contract deposit)
Real number (ℝ)
HIGH CORRELATION
Distinct | 14 |
---|---|
Distinct (%) | 31.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 386003.93 |
Minimum | 50000 |
---|---|
Maximum | 3200000 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 537.0 B |
Quantile statistics
Minimum | 50000 |
---|---|
5-th percentile | 50000 |
Q1 | 50000 |
median | 50000 |
Q3 | 500000 |
95-th percentile | 1749580 |
Maximum | 3200000 |
Range | 3150000 |
Interquartile range (IQR) | 450000 |
Descriptive statistics
Standard deviation | 651666.25 |
---|---|
Coefficient of variation (CV) | 1.6882373 |
Kurtosis | 7.8982436 |
Mean | 386003.93 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 2.6767455 |
Sum | 17370177 |
Variance | 4.246689 × 10¹¹ |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
50000 | 23 | 51.1% |
100000 | 7 | 15.6% |
500000 | 4 | 8.9% |
1547900 | 1 | 2.2% |
1000000 | 1 | 2.2% |
459895 | 1 | 2.2% |
3200000 | 1 | 2.2% |
245000 | 1 | 2.2% |
1800000 | 1 | 2.2% |
2023500 | 1 | 2.2% |
Other values (4) | 4 | 8.9% |
Minimum 10 values
Value | Count | Frequency (%) |
50000 | 23 | 51.1% |
100000 | 7 | 15.6% |
245000 | 1 | 2.2% |
459895 | 1 | 2.2% |
479870 | 1 | 2.2% |
500000 | 4 | 8.9% |
611855 | 1 | 2.2% |
852157 | 1 | 2.2% |
1000000 | 1 | 2.2% |
1300000 | 1 | 2.2% |
Maximum 10 values
Value | Count | Frequency (%) |
3200000 | 1 | 2.2% |
2023500 | 1 | 2.2% |
1800000 | 1 | 2.2% |
1547900 | 1 | 2.2% |
1300000 | 1 | 2.2% |
1000000 | 1 | 2.2% |
852157 | 1 | 2.2% |
611855 | 1 | 2.2% |
500000 | 4 | 8.9% |
479870 | 1 | 2.2% |
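The value tables above list all 14 distinct 계약보증금 values with their counts, so the whole column can be rebuilt and the summary figures recomputed. The sketch below is a cross-check rather than the profiler's own code. It assumes the reported standard deviation is the sample (n − 1) estimate and that the percentiles use linear interpolation, which is what `statistics.quantiles(..., method="inclusive")` does. The variable names are illustrative.

```python
import statistics

# Value -> count for 계약보증금, read off the common and extreme value tables.
deposit_counts = {
    50000: 23, 100000: 7, 500000: 4,
    245000: 1, 459895: 1, 479870: 1, 611855: 1, 852157: 1,
    1000000: 1, 1300000: 1, 1547900: 1, 1800000: 1, 2023500: 1, 3200000: 1,
}

# Expand to one entry per observation, sorted ascending.
deposits = sorted(v for v, c in deposit_counts.items() for _ in range(c))

n = len(deposits)
total = sum(deposits)
mean = total / n
std = statistics.stdev(deposits)   # sample standard deviation (assumption)
cv = std / mean                    # coefficient of variation
q1, median, q3 = statistics.quantiles(deposits, n=4, method="inclusive")
p95 = statistics.quantiles(deposits, n=20, method="inclusive")[-1]

print(n, total, round(mean, 2))
print(round(std, 2), round(cv, 4))
print(q1, median, q3, q3 - q1, p95)
```

More than half of the observations equal the minimum of 50000, which is why Q1 and the median coincide with it and the median absolute deviation is 0.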
용역금액 (service amount)
Real number (ℝ)
HIGH CORRELATION
Distinct | 14 |
---|---|
Distinct (%) | 31.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3790972.7 |
Minimum | 500000 |
---|---|
Maximum | 32000000 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 537.0 B |
Quantile statistics
Minimum | 500000 |
---|---|
5-th percentile | 500000 |
Q1 | 500000 |
median | 500000 |
Q3 | 4798700 |
95-th percentile | 17495800 |
Maximum | 32000000 |
Range | 31500000 |
Interquartile range (IQR) | 4298700 |
Descriptive statistics
Standard deviation | 6557976 |
---|---|
Coefficient of variation (CV) | 1.7298927 |
Kurtosis | 7.7888979 |
Mean | 3790972.7 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 2.6781949 |
Sum | 1.7059377 × 10⁸ |
Variance | 4.300705 × 10¹³ |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
500000 | 23 | 51.1% |
1000000 | 8 | 17.8% |
5000000 | 3 | 6.7% |
15479000 | 1 | 2.2% |
10000000 | 1 | 2.2% |
4598950 | 1 | 2.2% |
32000000 | 1 | 2.2% |
2450000 | 1 | 2.2% |
18000000 | 1 | 2.2% |
20235000 | 1 | 2.2% |
Other values (4) | 4 | 8.9% |
Minimum 10 values
Value | Count | Frequency (%) |
500000 | 23 | 51.1% |
1000000 | 8 | 17.8% |
2450000 | 1 | 2.2% |
4598950 | 1 | 2.2% |
4798700 | 1 | 2.2% |
5000000 | 3 | 6.7% |
6118550 | 1 | 2.2% |
8521570 | 1 | 2.2% |
10000000 | 1 | 2.2% |
13892000 | 1 | 2.2% |
Maximum 10 values
Value | Count | Frequency (%) |
32000000 | 1 | 2.2% |
20235000 | 1 | 2.2% |
18000000 | 1 | 2.2% |
15479000 | 1 | 2.2% |
13892000 | 1 | 2.2% |
10000000 | 1 | 2.2% |
8521570 | 1 | 2.2% |
6118550 | 1 | 2.2% |
5000000 | 3 | 6.7% |
4798700 | 1 | 2.2% |
지체상금율 (delay penalty rate)
Categorical
HIGH CORRELATION
IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | 4.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 492.0 B |
Value | Count |
---|---|
0.13 | 44 |
0.8 | 1 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.9777778 |
Min length | 3 |
Unique
Unique | 1 |
---|---|
Unique (%) | 2.2% |
Sample
1st row | 0.13 |
---|---|
2nd row | 0.13 |
3rd row | 0.13 |
4th row | 0.13 |
5th row | 0.13 |
Common Values
Value | Count | Frequency (%) |
0.13 | 44 | 97.8% |
0.8 | 1 | 2.2% |
Words
Value | Count | Frequency (%) |
0.13 | 44 | 97.8% |
0.8 | 1 | 2.2% |
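The report does not say how the 84.6% imbalance figure for 지체상금율 is derived. One definition that reproduces it from the 44 / 1 split above is one minus the Shannon entropy of the class shares, normalized by the maximum entropy for that number of classes. The sketch below assumes that definition. The function name `imbalance_score` is illustrative, not taken from the report.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (in bits)
    of the class shares and k is the number of classes."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # a single class has no balance to measure
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)

# 44 observations of 0.13 and 1 observation of 0.8.
score = imbalance_score([44, 1])
print(f"{score:.1%}")  # prints 84.6%
```

Under this definition a perfectly balanced variable scores 0 and a variable dominated by one class approaches 1.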
Correlations
Variable | 계약일자 | 계약시작일 | 계약종료일 | 용역업체대표자명 | 계약보증금 | 용역금액 | 지체상금율 |
---|---|---|---|---|---|---|---|
계약일자 | 1.000 | 0.996 | 0.985 | 0.000 | 1.000 | 1.000 | 1.000 |
계약시작일 | 0.996 | 1.000 | 0.984 | 0.000 | 0.999 | 0.999 | 1.000 |
계약종료일 | 0.985 | 0.984 | 1.000 | 0.000 | 0.962 | 0.945 | 1.000 |
용역업체대표자명 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 |
계약보증금 | 1.000 | 0.999 | 0.962 | 0.000 | 1.000 | 0.996 | 0.635 |
용역금액 | 1.000 | 0.999 | 0.945 | 0.000 | 0.996 | 1.000 | 0.635 |
지체상금율 | 1.000 | 1.000 | 1.000 | 0.000 | 0.635 | 0.635 | 1.000 |
Variable | 용역업체대표자명 | 지체상금율 |
---|---|---|
용역업체대표자명 | 1.000 | 0.000 |
지체상금율 | 0.000 | 1.000 |
Variable | 계약보증금 | 용역금액 | 용역업체대표자명 | 지체상금율 |
---|---|---|---|---|
계약보증금 | 1.000 | 0.995 | 0.000 | 0.581 |
용역금액 | 0.995 | 1.000 | 0.000 | 0.581 |
용역업체대표자명 | 0.000 | 0.000 | 1.000 | 0.000 |
지체상금율 | 0.581 | 0.581 | 0.000 | 1.000 |
First rows
Index | 계약일자 | 계약시작일 | 계약종료일 | 용역업체대표자명 | 계약보증금 | 용역금액 | 지체상금율 |
---|---|---|---|---|---|---|---|
0 | 2018-11-26 | 2018-11-26 | 2018-12-20 | 신** | 1547900 | 15479000 | 0.13 |
1 | 2020-05-26 | 2020-05-26 | 2020-05-31 | 이** | 500000 | 5000000 | 0.13 |
2 | 2020-05-26 | 2020-05-26 | 2020-05-31 | 이** | 50000 | 500000 | 0.13 |
3 | 2020-05-26 | 2020-05-26 | 2020-05-31 | 노** | 50000 | 500000 | 0.13 |
4 | 2020-02-25 | 2020-03-01 | 2020-03-31 | 이** | 1000000 | 10000000 | 0.13 |
5 | 2020-03-13 | 2020-03-13 | 2020-04-21 | 유** | 459895 | 4598950 | 0.13 |
6 | 2020-04-15 | 2020-04-20 | 2020-08-12 | 이** | 3200000 | 32000000 | 0.13 |
7 | 2020-05-26 | 2020-05-26 | 2020-05-31 | 이** | 50000 | 500000 | 0.13 |
8 | 2020-05-27 | 2020-05-28 | 2020-06-30 | 용** | 245000 | 2450000 | 0.13 |
9 | 2020-02-03 | 2020-02-03 | 2020-03-31 | 이** | 1800000 | 18000000 | 0.13 |
Last rows
Index | 계약일자 | 계약시작일 | 계약종료일 | 용역업체대표자명 | 계약보증금 | 용역금액 | 지체상금율 |
---|---|---|---|---|---|---|---|
35 | 2020-05-27 | 2020-05-27 | 2020-05-31 | 강** | 50000 | 500000 | 0.13 |
36 | 2020-05-27 | 2020-05-27 | 2020-05-31 | 조** | 50000 | 500000 | 0.13 |
37 | 2020-06-10 | 2020-06-10 | 2020-06-30 | 이** | 100000 | 1000000 | 0.13 |
38 | 2020-06-10 | 2020-06-10 | 2020-06-30 | 황** | 100000 | 1000000 | 0.13 |
39 | 2018-06-15 | 2018-06-15 | 2018-06-29 | 이** | 1300000 | 13892000 | 0.8 |
40 | 2020-05-27 | 2020-05-27 | 2020-05-31 | 오** | 50000 | 500000 | 0.13 |
41 | 2020-05-27 | 2020-05-27 | 2020-05-31 | 김** | 50000 | 500000 | 0.13 |
42 | 2020-05-27 | 2020-05-27 | 2020-05-31 | 김** | 50000 | 500000 | 0.13 |
43 | 2020-05-27 | 2020-05-27 | 2020-05-30 | 박** | 50000 | 500000 | 0.13 |
44 | 2020-05-27 | 2020-05-27 | 2020-05-31 | 오** | 50000 | 500000 | 0.13 |
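The high correlation between 계약보증금 and 용역금액 is visible directly in the rows shown above: in almost every displayed row the deposit is exactly one tenth of the service amount. The sketch below checks that relationship on the 20 displayed rows only. It says nothing about the 25 rows not shown, and the tuple layout is an illustrative choice.

```python
# (row index, 계약보증금, 용역금액), copied from the first and last rows shown above.
rows = [
    (0, 1547900, 15479000), (1, 500000, 5000000), (2, 50000, 500000),
    (3, 50000, 500000), (4, 1000000, 10000000), (5, 459895, 4598950),
    (6, 3200000, 32000000), (7, 50000, 500000), (8, 245000, 2450000),
    (9, 1800000, 18000000), (35, 50000, 500000), (36, 50000, 500000),
    (37, 100000, 1000000), (38, 100000, 1000000), (39, 1300000, 13892000),
    (40, 50000, 500000), (41, 50000, 500000), (42, 50000, 500000),
    (43, 50000, 500000), (44, 50000, 500000),
]

# Integer comparison avoids floating-point noise when checking the 10% rule.
exceptions = [
    (i, deposit / amount)
    for i, deposit, amount in rows
    if deposit * 10 != amount
]

for i, ratio in exceptions:
    print(f"row {i}: deposit is {ratio:.2%} of the service amount")
```

The only displayed row that breaks the pattern is row 39, which is also the single row with a 지체상금율 of 0.8 and the earliest contract date.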
Most frequently occurring
Index | 계약일자 | 계약시작일 | 계약종료일 | 용역업체대표자명 | 계약보증금 | 용역금액 | 지체상금율 | # duplicates |
---|---|---|---|---|---|---|---|---|
1 | 2020-05-26 | 2020-05-26 | 2020-05-31 | 이** | 50000 | 500000 | 0.13 | 3 |
2 | 2020-05-27 | 2020-05-27 | 2020-05-31 | 김** | 50000 | 500000 | 0.13 | 3 |
0 | 2020-05-26 | 2020-05-26 | 2020-05-31 | 김** | 50000 | 500000 | 0.13 | 2 |
3 | 2020-05-27 | 2020-05-27 | 2020-05-31 | 오** | 50000 | 500000 | 0.13 | 2 |
4 | 2020-05-27 | 2020-05-28 | 2020-05-31 | 김** | 50000 | 500000 | 0.13 | 2 |
5 | 2020-06-10 | 2020-06-10 | 2020-06-30 | 이** | 100000 | 1000000 | 0.13 | 2 |
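The "Duplicate rows | 6" figure in the overview matches the six row patterns in this table, not the number of redundant records. The arithmetic below makes the distinction explicit. It assumes each `# duplicates` value is the number of times that identical row occurs in the data. The variable names are illustrative.

```python
# "# duplicates" column from the table above, one entry per duplicated row pattern.
# Assumption: each number counts how often that identical row occurs in the data.
occurrences = [3, 3, 2, 2, 2, 2]
n_rows = 45  # number of observations reported in the overview

duplicated_patterns = len(occurrences)                  # 6, reported as "Duplicate rows"
share = duplicated_patterns / n_rows                    # 6 / 45
rows_involved = sum(occurrences)                        # rows that have at least one twin
redundant_copies = rows_involved - duplicated_patterns  # copies beyond the first
rows_after_dedup = n_rows - redundant_copies

print(f"{duplicated_patterns} patterns ({share:.1%}), "
      f"{rows_involved} rows involved, {redundant_copies} redundant, "
      f"{rows_after_dedup} rows after de-duplication")
```

Under that assumption, dropping exact duplicates would remove 8 records and leave 37.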