Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 31 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 2.5 KiB |
Average record size in memory | 82.3 B |
Variable types
Categorical | 1 |
---|---|
Numeric | 5 |
Boolean | 3 |
Dataset
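Description | Data on forest resource service progress-rate reports under forest project service management. It provides the progress report round (진척보고회차), service project number (용역사업번호), planned process area (공정계획면적), actual process area (공정실적면적), and other fields. |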
---|---|
Author | Korea Forest Service (산림청) |
URL | https://www.data.go.kr/data/15124729/fileData.do |
공정계획면적 is highly overall correlated with 공정실적면적 | High correlation |
공정실적면적 is highly overall correlated with 공정계획면적 | High correlation |
공정진척율 is highly overall correlated with 공정적합여부 | High correlation |
공정적합여부 is highly overall correlated with 공정진척율 and 2 other fields | High correlation |
규격적합여부 is highly overall correlated with 공정적합여부 and 1 other field | High correlation |
품질적합여부 is highly overall correlated with 공정적합여부 and 1 other field | High correlation |
공정적합여부 is highly imbalanced (65.5%) | Imbalance |
규격적합여부 is highly imbalanced (54.1%) | Imbalance |
품질적합여부 is highly imbalanced (54.1%) | Imbalance |
알림아이디 has unique values | Unique |
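The imbalance percentages in the alerts above are consistent with one minus the normalized Shannon entropy of the class counts. The sketch below assumes that definition (the report does not state the formula) and uses the True/False counts listed in the boolean variable tables; `imbalance_score` is a hypothetical helper name.

```python
import math


def imbalance_score(counts):
    """Return 1 - (Shannon entropy / log2(number of classes)).

    0 means perfectly balanced classes; values near 1 mean one class dominates.
    Assumed definition, chosen because it matches the reported percentages.
    """
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2:
        # Normalization is undefined for a single class; treated as 0 here (assumption).
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(n_classes)


# 공정적합여부: 29 True, 2 False
print(round(imbalance_score([29, 2]) * 100, 1))  # 65.5
# 규격적합여부 and 품질적합여부: 28 True, 3 False
print(round(imbalance_score([28, 3]) * 100, 1))  # 54.1
```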
Reproduction
Analysis started | 2023-12-12 17:57:21.949248 |
---|---|
Analysis finished | 2023-12-12 17:57:27.287757 |
Duration | 5.34 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
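As a quick check of the table above, the reported duration follows from the two timestamps. This is a minimal sketch using only the standard library; the format string is an assumption based on how the timestamps are printed.

```python
from datetime import datetime

# Timestamp layout as printed in the Reproduction table (assumed format string).
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-12-12 17:57:21.949248", FMT)
finished = datetime.strptime("2023-12-12 17:57:27.287757", FMT)

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")  # 5.34 seconds
```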
진척보고회차
Categorical
Distinct | 5 |
---|---|
Distinct (%) | 16.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 380.0 B |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 1 |
---|---|
Unique (%) | 3.2% |
Sample
1st row | 1 |
---|---|
2nd row | 2 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 14 | 45.2% |
2 | 6 | 19.4% |
3 | 5 | 16.1% |
4 | 5 | 16.1% |
5 | 1 | 3.2% |
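Each frequency in the table above is the value's count divided by the 31 observations. A small sketch of that arithmetic for 진척보고회차, which also recovers the Distinct and Unique figures; the variable names are illustrative.

```python
# Value counts for 진척보고회차 as listed in the Common Values table.
counts = {"1": 14, "2": 6, "3": 5, "4": 5, "5": 1}

total = sum(counts.values())  # number of observations
frequency = {value: round(100 * n / total, 1) for value, n in counts.items()}

distinct = len(counts)                               # different values present
unique = sum(1 for n in counts.values() if n == 1)   # values that occur exactly once

print(total)                             # 31
print(frequency)
print(round(100 * distinct / total, 1))  # 16.1
print(round(100 * unique / total, 1))    # 3.2
```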
용역사업번호
Real number (ℝ)
Distinct | 14 |
---|---|
Distinct (%) | 45.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.6212654 × 10⁸ |
Minimum | 1.2018003 × 10⁸ |
---|---|
Maximum | 2.2019024 × 10⁸ |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 411.0 B |
Quantile statistics
Minimum | 1.2018003 × 10⁸ |
---|---|
5-th percentile | 1.2018004 × 10⁸ |
Q1 | 1.2020004 × 10⁸ |
median | 1.2020006 × 10⁸ |
Q3 | 2.2018002 × 10⁸ |
95-th percentile | 2.2019024 × 10⁸ |
Maximum | 2.2019024 × 10⁸ |
Range | 1.0001021 × 10⁸ |
Interquartile range (IQR) | 99979990 |
Descriptive statistics
Standard deviation | 50156252 |
---|---|
Coefficient of variation (CV) | 0.30936485 |
Kurtosis | -2.0165466 |
Mean | 1.6212654 × 10⁸ |
Median Absolute Deviation (MAD) | 20027 |
Skewness | 0.34372056 |
Sum | 5.0259227 × 10⁹ |
Variance | 2.5156496 × 10¹⁵ |
Monotonicity | Increasing |
Common values
Value | Count | Frequency (%) |
220180021 | 5 | 16.1% |
120200035 | 4 | 12.9% |
120200056 | 4 | 12.9% |
120200063 | 4 | 12.9% |
220190245 | 4 | 12.9% |
120180034 | 2 | 6.5% |
120180036 | 1 | 3.2% |
120190057 | 1 | 3.2% |
120190060 | 1 | 3.2% |
120190070 | 1 | 3.2% |
Other values (4) | 4 | 12.9% |
Minimum 10 values
Value | Count | Frequency (%) |
120180034 | 2 | 6.5% |
120180036 | 1 | 3.2% |
120190057 | 1 | 3.2% |
120190060 | 1 | 3.2% |
120190070 | 1 | 3.2% |
120200035 | 4 | 12.9% |
120200056 | 4 | 12.9% |
120200063 | 4 | 12.9% |
220180021 | 5 | 16.1% |
220180028 | 1 | 3.2% |
Maximum 10 values
Value | Count | Frequency (%) |
220190245 | 4 | 12.9% |
220190238 | 1 | 3.2% |
220190237 | 1 | 3.2% |
220190205 | 1 | 3.2% |
220180028 | 1 | 3.2% |
220180021 | 5 | 16.1% |
120200063 | 4 | 12.9% |
120200056 | 4 | 12.9% |
120200035 | 4 | 12.9% |
120190070 | 1 | 3.2% |
공정계획면적
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 18 |
---|---|
Distinct (%) | 58.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3166.8065 |
Minimum | 1 |
---|---|
Maximum | 20000 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 411.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 52 |
median | 560 |
Q3 | 4500 |
95-th percentile | 16492.5 |
Maximum | 20000 |
Range | 19999 |
Interquartile range (IQR) | 4448 |
Descriptive statistics
Standard deviation | 5482.4733 |
---|---|
Coefficient of variation (CV) | 1.7312309 |
Kurtosis | 4.5045379 |
Mean | 3166.8065 |
Median Absolute Deviation (MAD) | 558 |
Skewness | 2.2340166 |
Sum | 98171 |
Variance | 30057514 |
Monotonicity | Not monotonic |
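Several of the 공정계획면적 figures above are derived from one another. The sketch below recomputes them from the reported inputs (sum, quartiles, extremes, standard deviation) as a consistency check; it uses the rounded values printed in the report, so the results agree only up to rounding.

```python
# Inputs copied from the 공정계획면적 tables above.
n_observations = 31
total = 98171
q1, q3 = 52, 4500
minimum, maximum = 1, 20000
std = 5482.4733  # reported standard deviation (rounded)

mean = total / n_observations   # Mean = Sum / number of observations
iqr = q3 - q1                   # Interquartile range = Q3 - Q1
value_range = maximum - minimum # Range = Maximum - Minimum
cv = std / mean                 # Coefficient of variation = std / mean
variance = std ** 2             # Variance = std squared

print(round(mean, 4))   # 3166.8065
print(iqr)              # 4448
print(value_range)      # 19999
print(round(cv, 4))     # 1.7312
```

A coefficient of variation above 1 together with a median (560) far below the mean reflects the strong right skew reported for this variable.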
Common values
Value | Count | Frequency (%) |
100 | 5 | 16.1% |
1 | 3 | 9.7% |
1000 | 3 | 9.7% |
2 | 3 | 9.7% |
5000 | 3 | 9.7% |
20000 | 2 | 6.5% |
4 | 1 | 3.2% |
2000 | 1 | 3.2% |
3000 | 1 | 3.2% |
4000 | 1 | 3.2% |
Other values (8) | 8 | 25.8% |
Minimum 10 values
Value | Count | Frequency (%) |
1 | 3 | 9.7% |
2 | 3 | 9.7% |
3 | 1 | 3.2% |
4 | 1 | 3.2% |
100 | 5 | 16.1% |
150 | 1 | 3.2% |
200 | 1 | 3.2% |
560 | 1 | 3.2% |
760 | 1 | 3.2% |
1000 | 3 | 9.7% |
Maximum 10 values
Value | Count | Frequency (%) |
20000 | 2 | 6.5% |
12985 | 1 | 3.2% |
10000 | 1 | 3.2% |
6000 | 1 | 3.2% |
5000 | 3 | 9.7% |
4000 | 1 | 3.2% |
3000 | 1 | 3.2% |
2000 | 1 | 3.2% |
1000 | 3 | 9.7% |
760 | 1 | 3.2% |
공정실적면적
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 19 |
---|---|
Distinct (%) | 61.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2721.129 |
Minimum | 1 |
---|---|
Maximum | 20000 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 411.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 4.5 |
median | 200 |
Q3 | 3500 |
95-th percentile | 13992.5 |
Maximum | 20000 |
Range | 19999 |
Interquartile range (IQR) | 3495.5 |
Descriptive statistics
Standard deviation | 5029.972 |
---|---|
Coefficient of variation (CV) | 1.8484871 |
Kurtosis | 4.7381543 |
Mean | 2721.129 |
Median Absolute Deviation (MAD) | 199 |
Skewness | 2.2825438 |
Sum | 84355 |
Variance | 25300619 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
100 | 4 | 12.9% |
1 | 3 | 9.7% |
900 | 2 | 6.5% |
5000 | 2 | 6.5% |
4000 | 2 | 6.5% |
1000 | 2 | 6.5% |
3 | 2 | 6.5% |
2 | 2 | 6.5% |
200 | 2 | 6.5% |
360 | 1 | 3.2% |
Other values (9) | 9 | 29.0% |
Minimum 10 values
Value | Count | Frequency (%) |
1 | 3 | 9.7% |
2 | 2 | 6.5% |
3 | 2 | 6.5% |
4 | 1 | 3.2% |
5 | 1 | 3.2% |
100 | 4 | 12.9% |
148 | 1 | 3.2% |
200 | 2 | 6.5% |
240 | 1 | 3.2% |
360 | 1 | 3.2% |
Maximum 10 values
Value | Count | Frequency (%) |
20000 | 1 | 3.2% |
15000 | 1 | 3.2% |
12985 | 1 | 3.2% |
10000 | 1 | 3.2% |
5000 | 2 | 6.5% |
4000 | 2 | 6.5% |
3000 | 1 | 3.2% |
1000 | 2 | 6.5% |
900 | 2 | 6.5% |
360 | 1 | 3.2% |
공정진척율
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 12 |
---|---|
Distinct (%) | 38.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 84.084194 |
Minimum | 5 |
---|---|
Maximum | 100 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 411.0 B |
Quantile statistics
Minimum | 5 |
---|---|
5-th percentile | 22 |
Q1 | 81.665 |
median | 100 |
Q3 | 100 |
95-th percentile | 100 |
Maximum | 100 |
Range | 95 |
Interquartile range (IQR) | 18.335 |
Descriptive statistics
Standard deviation | 28.317783 |
---|---|
Coefficient of variation (CV) | 0.33677891 |
Kurtosis | 1.9264126 |
Mean | 84.084194 |
Median Absolute Deviation (MAD) | 0 |
Skewness | -1.7817722 |
Sum | 2606.61 |
Variance | 801.89682 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
100.0 | 19 | 61.3% |
90.0 | 2 | 6.5% |
98.67 | 1 | 3.2% |
5.0 | 1 | 3.2% |
75.0 | 1 | 3.2% |
24.0 | 1 | 3.2% |
26.32 | 1 | 3.2% |
64.29 | 1 | 3.2% |
20.0 | 1 | 3.2% |
80.0 | 1 | 3.2% |
Other values (2) | 2 | 6.5% |
Minimum 10 values
Value | Count | Frequency (%) |
5.0 | 1 | 3.2% |
20.0 | 1 | 3.2% |
24.0 | 1 | 3.2% |
26.32 | 1 | 3.2% |
50.0 | 1 | 3.2% |
64.29 | 1 | 3.2% |
75.0 | 1 | 3.2% |
80.0 | 1 | 3.2% |
83.33 | 1 | 3.2% |
90.0 | 2 | 6.5% |
Maximum 10 values
Value | Count | Frequency (%) |
100.0 | 19 | 61.3% |
98.67 | 1 | 3.2% |
90.0 | 2 | 6.5% |
83.33 | 1 | 3.2% |
80.0 | 1 | 3.2% |
75.0 | 1 | 3.2% |
64.29 | 1 | 3.2% |
50.0 | 1 | 3.2% |
26.32 | 1 | 3.2% |
24.0 | 1 | 3.2% |
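Because the value tables for 공정진척율 together list all 12 distinct values (the two "other values" are 50.0 and 83.33, which appear in the extreme-value tables), the full column can be rebuilt from its counts and the summary statistics recomputed. This is a sketch with the standard `statistics` module, assuming the sample (n − 1) variance, which matches the reported figure.

```python
import statistics

# Value -> count for 공정진척율, taken from the tables above.
value_counts = {
    100.0: 19, 90.0: 2, 98.67: 1, 5.0: 1, 75.0: 1, 24.0: 1,
    26.32: 1, 64.29: 1, 20.0: 1, 80.0: 1, 50.0: 1, 83.33: 1,
}

# Expand the counts back into a flat list of 31 observations.
values = [value for value, count in value_counts.items() for _ in range(count)]

mean = statistics.fmean(values)
median = statistics.median(values)
variance = statistics.variance(values)  # sample variance (divides by n - 1)
# Median absolute deviation: median distance of each value from the median.
mad = statistics.median(abs(v - median) for v in values)

print(len(values), round(sum(values), 2))  # 31 2606.61
print(round(mean, 6), median, mad)
print(round(variance, 3))
```

The median absolute deviation is 0 because 19 of the 31 observations equal the median of 100, so more than half of the absolute deviations are zero.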
공정적합여부
Boolean
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 6.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 163.0 B |
Value | Count | Frequency (%) |
True | 29 | 93.5% |
False | 2 | 6.5% |
규격적합여부
Boolean
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 6.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 163.0 B |
Value | Count | Frequency (%) |
True | 28 | 90.3% |
False | 3 | 9.7% |
품질적합여부
Boolean
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 6.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 163.0 B |
Value | Count | Frequency (%) |
True | 28 | 90.3% |
False | 3 | 9.7% |
알림아이디
Real number (ℝ)
UNIQUE
 
Distinct | 31 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.0191411 × 10⁹ |
Minimum | 2.0180201 × 10⁹ |
---|---|
Maximum | 2.0200302 × 10⁹ |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 411.0 B |
Quantile statistics
Minimum | 2.0180201 × 10⁹ |
---|---|
5-th percentile | 2.0180201 × 10⁹ |
Q1 | 2.01803 × 10⁹ |
median | 2.0190802 × 10⁹ |
Q3 | 2.0200301 × 10⁹ |
95-th percentile | 2.0200302 × 10⁹ |
Maximum | 2.0200302 × 10⁹ |
Range | 2010076 |
Interquartile range (IQR) | 2000061.5 |
Descriptive statistics
Standard deviation | 831537.58 |
---|---|
Coefficient of variation (CV) | 0.00041182738 |
Kurtosis | -1.4949833 |
Mean | 2.0191411 × 10⁹ |
Median Absolute Deviation (MAD) | 949926 |
Skewness | -0.25408464 |
Sum | 6.2593374 × 10¹⁰ |
Variance | 6.9145474 × 10¹¹ |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
2018030037 | 1 | 3.2% |
2018030038 | 1 | 3.2% |
2019080189 | 1 | 3.2% |
2019080188 | 1 | 3.2% |
2019080187 | 1 | 3.2% |
2019080186 | 1 | 3.2% |
2019080122 | 1 | 3.2% |
2019080126 | 1 | 3.2% |
2019070125 | 1 | 3.2% |
2018030008 | 1 | 3.2% |
Other values (21) | 21 | 67.7% |
Minimum 10 values
Value | Count | Frequency (%) |
2018020081 | 1 | 3.2% |
2018020082 | 1 | 3.2% |
2018020083 | 1 | 3.2% |
2018020084 | 1 | 3.2% |
2018020096 | 1 | 3.2% |
2018030008 | 1 | 3.2% |
2018030037 | 1 | 3.2% |
2018030038 | 1 | 3.2% |
2018030062 | 1 | 3.2% |
2019070125 | 1 | 3.2% |
Maximum 10 values
Value | Count | Frequency (%) |
2020030157 | 1 | 3.2% |
2020030156 | 1 | 3.2% |
2020030155 | 1 | 3.2% |
2020030154 | 1 | 3.2% |
2020030152 | 1 | 3.2% |
2020030114 | 1 | 3.2% |
2020030113 | 1 | 3.2% |
2020030112 | 1 | 3.2% |
2020030111 | 1 | 3.2% |
2020030017 | 1 | 3.2% |
Variable | 진척보고회차 | 용역사업번호 | 공정계획면적 | 공정실적면적 | 공정진척율 | 공정적합여부 | 규격적합여부 | 품질적합여부 | 알림아이디 |
---|---|---|---|---|---|---|---|---|---|
진척보고회차 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.422 | 0.256 | 0.256 | 0.000 |
용역사업번호 | 0.000 | 1.000 | 0.688 | 0.369 | 0.000 | 0.000 | 0.293 | 0.293 | 0.479 |
공정계획면적 | 0.000 | 0.688 | 1.000 | 0.948 | 0.000 | 0.000 | 0.000 | 0.000 | 0.777 |
공정실적면적 | 0.000 | 0.369 | 0.948 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.453 |
공정진척율 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.762 | 0.909 | 0.909 | 0.473 |
공정적합여부 | 0.422 | 0.000 | 0.000 | 0.000 | 0.762 | 1.000 | 0.771 | 0.771 | 0.000 |
규격적합여부 | 0.256 | 0.293 | 0.000 | 0.000 | 0.909 | 0.771 | 1.000 | 0.955 | 0.081 |
품질적합여부 | 0.256 | 0.293 | 0.000 | 0.000 | 0.909 | 0.771 | 0.955 | 1.000 | 0.081 |
알림아이디 | 0.000 | 0.479 | 0.777 | 0.453 | 0.473 | 0.000 | 0.081 | 0.081 | 1.000 |
Variable | 공정적합여부 | 규격적합여부 | 진척보고회차 | 품질적합여부 |
---|---|---|---|---|
공정적합여부 | 1.000 | 0.560 | 0.483 | 0.560 |
규격적합여부 | 0.560 | 1.000 | 0.290 | 0.808 |
진척보고회차 | 0.483 | 0.290 | 1.000 | 0.290 |
품질적합여부 | 0.560 | 0.808 | 0.290 | 1.000 |
Variable | 용역사업번호 | 공정계획면적 | 공정실적면적 | 공정진척율 | 알림아이디 | 진척보고회차 | 공정적합여부 | 규격적합여부 | 품질적합여부 |
---|---|---|---|---|---|---|---|---|---|
용역사업번호 | 1.000 | 0.359 | 0.373 | -0.084 | -0.182 | 0.000 | 0.000 | 0.209 | 0.209 |
공정계획면적 | 0.359 | 1.000 | 0.992 | -0.355 | 0.470 | 0.000 | 0.000 | 0.000 | 0.000 |
공정실적면적 | 0.373 | 0.992 | 1.000 | -0.288 | 0.458 | 0.000 | 0.000 | 0.000 | 0.000 |
공정진척율 | -0.084 | -0.355 | -0.288 | 1.000 | -0.404 | 0.000 | 0.518 | 0.489 | 0.489 |
알림아이디 | -0.182 | 0.470 | 0.458 | -0.404 | 1.000 | 0.000 | 0.000 | 0.124 | 0.124 |
진척보고회차 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.483 | 0.290 | 0.290 |
공정적합여부 | 0.000 | 0.000 | 0.000 | 0.518 | 0.000 | 0.483 | 1.000 | 0.560 | 0.560 |
규격적합여부 | 0.209 | 0.000 | 0.000 | 0.489 | 0.124 | 0.290 | 0.560 | 1.000 | 0.808 |
품질적합여부 | 0.209 | 0.000 | 0.000 | 0.489 | 0.124 | 0.290 | 0.560 | 0.808 | 1.000 |
First rows
Index | 진척보고회차 | 용역사업번호 | 공정계획면적 | 공정실적면적 | 공정진척율 | 공정적합여부 | 규격적합여부 | 품질적합여부 | 알림아이디 |
---|---|---|---|---|---|---|---|---|---|
0 | 1 | 120180034 | 1 | 1 | 100.0 | Y | Y | Y | 2018030037 |
1 | 2 | 120180034 | 2 | 2 | 100.0 | Y | Y | Y | 2018030038 |
2 | 1 | 120180036 | 100 | 100 | 100.0 | Y | Y | Y | 2018030062 |
3 | 1 | 120190057 | 1000 | 900 | 90.0 | Y | Y | Y | 2019080148 |
4 | 1 | 120190060 | 1 | 1 | 100.0 | Y | Y | Y | 2019080161 |
5 | 1 | 120190070 | 1000 | 900 | 90.0 | Y | Y | Y | 2019080268 |
6 | 1 | 120200035 | 100 | 100 | 100.0 | Y | Y | Y | 2020030014 |
7 | 2 | 120200035 | 150 | 148 | 98.67 | Y | Y | Y | 2020030015 |
8 | 3 | 120200035 | 2 | 2 | 100.0 | Y | Y | Y | 2020030017 |
9 | 4 | 120200035 | 100 | 5 | 5.0 | Y | Y | Y | 2020030152 |
Last rows
Index | 진척보고회차 | 용역사업번호 | 공정계획면적 | 공정실적면적 | 공정진척율 | 공정적합여부 | 규격적합여부 | 품질적합여부 | 알림아이디 |
---|---|---|---|---|---|---|---|---|---|
21 | 4 | 220180021 | 4 | 4 | 100.0 | N | N | N | 2018020084 |
22 | 5 | 220180021 | 100 | 100 | 100.0 | Y | Y | Y | 2018020096 |
23 | 1 | 220180028 | 100 | 100 | 100.0 | Y | Y | Y | 2018030008 |
24 | 1 | 220190205 | 5000 | 1000 | 20.0 | Y | N | N | 2019070125 |
25 | 1 | 220190237 | 5000 | 4000 | 80.0 | Y | Y | Y | 2019080126 |
26 | 1 | 220190238 | 6000 | 5000 | 83.33 | Y | Y | Y | 2019080122 |
27 | 1 | 220190245 | 5000 | 5000 | 100.0 | Y | Y | Y | 2019080186 |
28 | 2 | 220190245 | 4000 | 4000 | 100.0 | Y | Y | Y | 2019080187 |
29 | 3 | 220190245 | 3000 | 3000 | 100.0 | Y | Y | Y | 2019080188 |
30 | 4 | 220190245 | 2000 | 1000 | 50.0 | N | N | N | 2019080189 |
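In the sample rows above, 공정진척율 is consistent with 공정실적면적 divided by 공정계획면적, expressed as a percentage rounded to two decimals. The dataset description does not state this formula, so the sketch below treats it as an assumption and checks it against a few of the listed rows; `progress_rate` is a hypothetical helper name.

```python
def progress_rate(planned_area, actual_area):
    """Assumed formula: actual / planned * 100, rounded to two decimals."""
    return round(actual_area / planned_area * 100, 2)


# (공정계획면적, 공정실적면적, reported 공정진척율) taken from the sample rows.
sample_rows = [
    (1000, 900, 90.0),
    (150, 148, 98.67),
    (100, 5, 5.0),
    (5000, 1000, 20.0),
    (6000, 5000, 83.33),
    (2000, 1000, 50.0),
]

for planned, actual, reported in sample_rows:
    computed = progress_rate(planned, actual)
    print(planned, actual, reported, computed, computed == reported)
```

Under this reading the rate carries no information about project size, which fits its near-zero reported correlation with the two area columns in the first correlation matrix.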