Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 7118 |
Missing cells | 1790 |
Missing cells (%) | 3.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 486.7 KiB |
Average record size in memory | 70.0 B |
Variable types
Numeric | 5 |
---|---|
Categorical | 2 |
Text | 1 |
Dataset
Description | 경상남도_개발공채 데이터입니다. (공사년도, 공사구분, 공사번호, 지급일자, 소요액 , 실적금액 등의 데이터를 포함하고있습니다.) |
---|---|
Author | 경상남도 |
URL | https://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15049535 |
부서코드 has constant value "" | Constant |
소요액 is highly overall correlated with 실적금액 | High correlation |
실적금액 is highly overall correlated with 소요액 | High correlation |
소요액 has 1155 (16.2%) missing values | Missing |
실적금액 has 635 (8.9%) missing values | Missing |
소요액 is highly skewed (γ1 = 68.79262527) | Skewed |
실적금액 is highly skewed (γ1 = 59.15766638) | Skewed |
Reproduction
Analysis started | 2023-12-11 00:36:50.158409 |
---|---|
Analysis finished | 2023-12-11 00:36:52.957827 |
Duration | 2.8 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
공사년도
Real number (ℝ)
Distinct | 24 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2002.7201 |
Minimum | 1990 |
---|---|
Maximum | 2013 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 62.7 KiB |
Quantile statistics
Minimum | 1990 |
---|---|
5-th percentile | 1991 |
Q1 | 1999 |
median | 2004 |
Q3 | 2009 |
95-th percentile | 2011 |
Maximum | 2013 |
Range | 23 |
Interquartile range (IQR) | 10 |
Descriptive statistics
Standard deviation | 6.4661822 |
---|---|
Coefficient of variation (CV) | 0.0032286998 |
Kurtosis | -0.97022669 |
Mean | 2002.7201 |
Median Absolute Deviation (MAD) | 5 |
Skewness | -0.40839173 |
Sum | 14255362 |
Variance | 41.811512 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2010 | 697 | 9.8% |
2011 | 547 | 7.7% |
2000 | 442 | 6.2% |
2004 | 433 | 6.1% |
2003 | 404 | 5.7% |
2007 | 386 | 5.4% |
2001 | 373 | 5.2% |
2006 | 368 | 5.2% |
2009 | 365 | 5.1% |
2005 | 352 | 4.9% |
Other values (14) | 2751 |
Value | Count | Frequency (%) |
1990 | 207 | |
1991 | 275 | |
1992 | 195 | |
1993 | 267 | |
1994 | 204 | |
1995 | 256 | |
1996 | 185 | |
1997 | 37 | 0.5% |
1998 | 90 | 1.3% |
1999 | 332 |
Value | Count | Frequency (%) |
2013 | 10 | 0.1% |
2012 | 197 | 2.8% |
2011 | 547 | |
2010 | 697 | |
2009 | 365 | |
2008 | 243 | 3.4% |
2007 | 386 | |
2006 | 368 | |
2005 | 352 | |
2004 | 433 |
공사구분
Categorical
Distinct | 4 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 55.7 KiB |
공사 | |
---|---|
용역 | |
기타 | |
구매 | 43 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 공사 |
---|---|
2nd row | 공사 |
3rd row | 공사 |
4th row | 공사 |
5th row | 공사 |
Common Values
Value | Count | Frequency (%) |
공사 | 5238 | |
용역 | 1291 | 18.1% |
기타 | 546 | 7.7% |
구매 | 43 | 0.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
공사 | 5238 | |
용역 | 1291 | 18.1% |
기타 | 546 | 7.7% |
구매 | 43 | 0.6% |
공사번호
Real number (ℝ)
Distinct | 531 |
---|---|
Distinct (%) | 7.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 97.908401 |
Minimum | 1 |
---|---|
Maximum | 623 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 62.7 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 6 |
Q1 | 29 |
median | 59 |
Q3 | 100 |
95-th percentile | 413 |
Maximum | 623 |
Range | 622 |
Interquartile range (IQR) | 71 |
Descriptive statistics
Standard deviation | 118.71299 |
---|---|
Coefficient of variation (CV) | 1.2124904 |
Kurtosis | 4.186912 |
Mean | 97.908401 |
Median Absolute Deviation (MAD) | 34 |
Skewness | 2.1870508 |
Sum | 696912 |
Variance | 14092.775 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
29 | 79 | 1.1% |
8 | 77 | 1.1% |
30 | 73 | 1.0% |
21 | 73 | 1.0% |
7 | 71 | 1.0% |
5 | 70 | 1.0% |
33 | 69 | 1.0% |
31 | 69 | 1.0% |
1 | 68 | 1.0% |
20 | 68 | 1.0% |
Other values (521) | 6401 |
Value | Count | Frequency (%) |
1 | 68 | |
2 | 61 | |
3 | 63 | |
4 | 54 | |
5 | 70 | |
6 | 67 | |
7 | 71 | |
8 | 77 | |
9 | 65 | |
10 | 62 |
Value | Count | Frequency (%) |
623 | 1 | |
620 | 1 | |
619 | 1 | |
618 | 1 | |
617 | 2 | |
616 | 1 | |
615 | 1 | |
614 | 1 | |
607 | 1 | |
604 | 1 |
부서코드
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 55.7 KiB |
1 |
---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 7118 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1 | 7118 |
순번
Real number (ℝ)
Distinct | 31 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.0973588 |
Minimum | 1 |
---|---|
Maximum | 31 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 62.7 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 1 |
Q3 | 2 |
95-th percentile | 5 |
Maximum | 31 |
Range | 30 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 2.1467036 |
---|---|
Coefficient of variation (CV) | 1.0235271 |
Kurtosis | 52.509832 |
Mean | 2.0973588 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 5.9527792 |
Sum | 14929 |
Variance | 4.6083364 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 3591 | |
2 | 1784 | |
3 | 933 | 13.1% |
4 | 395 | 5.5% |
5 | 162 | 2.3% |
6 | 84 | 1.2% |
7 | 38 | 0.5% |
8 | 26 | 0.4% |
9 | 13 | 0.2% |
10 | 12 | 0.2% |
Other values (21) | 80 | 1.1% |
Value | Count | Frequency (%) |
1 | 3591 | |
2 | 1784 | |
3 | 933 | 13.1% |
4 | 395 | 5.5% |
5 | 162 | 2.3% |
6 | 84 | 1.2% |
7 | 38 | 0.5% |
8 | 26 | 0.4% |
9 | 13 | 0.2% |
10 | 12 | 0.2% |
Value | Count | Frequency (%) |
31 | 1 | < 0.1% |
30 | 1 | < 0.1% |
29 | 1 | < 0.1% |
28 | 2 | |
27 | 2 | |
26 | 2 | |
25 | 2 | |
24 | 2 | |
23 | 3 | |
22 | 3 |
지급일자
Text
Distinct | 3079 |
---|---|
Distinct (%) | 43.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 55.7 KiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 9.9519528 |
Min length | 4 |
Characters and Unicode
Total characters | 70838 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 1474 ? |
---|---|
Unique (%) | 20.7% |
Sample
1st row | 1990-03-28 |
---|---|
2nd row | 1990-05-22 |
3rd row | 2002-10-30 |
4th row | 2002-10-30 |
5th row | 1990-04-09 |
Value | Count | Frequency (%) |
2003-12-30 | 34 | 0.5% |
2010-12-31 | 27 | 0.4% |
2005-12-28 | 27 | 0.4% |
2003-09-05 | 25 | 0.4% |
2010-09-17 | 25 | 0.4% |
2010-06-29 | 24 | 0.3% |
2006-12-27 | 22 | 0.3% |
2010-06-28 | 22 | 0.3% |
2004-03-03 | 17 | 0.2% |
2011-06-29 | 17 | 0.2% |
Other values (3069) | 6878 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 17995 | |
- | 14068 | |
2 | 10966 | |
1 | 10416 | |
9 | 5846 | 8.3% |
6 | 2071 | 2.9% |
4 | 2037 | 2.9% |
3 | 2018 | 2.8% |
5 | 1977 | 2.8% |
7 | 1850 | 2.6% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 56770 | |
Dash Punctuation | 14068 | 19.9% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 17995 | |
2 | 10966 | |
1 | 10416 | |
9 | 5846 | 10.3% |
6 | 2071 | 3.6% |
4 | 2037 | 3.6% |
3 | 2018 | 3.6% |
5 | 1977 | 3.5% |
7 | 1850 | 3.3% |
8 | 1594 | 2.8% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 14068 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 70838 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 17995 | |
- | 14068 | |
2 | 10966 | |
1 | 10416 | |
9 | 5846 | 8.3% |
6 | 2071 | 2.9% |
4 | 2037 | 2.9% |
3 | 2018 | 2.8% |
5 | 1977 | 2.8% |
7 | 1850 | 2.6% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 70838 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 17995 | |
- | 14068 | |
2 | 10966 | |
1 | 10416 | |
9 | 5846 | 8.3% |
6 | 2071 | 2.9% |
4 | 2037 | 2.9% |
3 | 2018 | 2.8% |
5 | 1977 | 2.8% |
7 | 1850 | 2.6% |
소요액
Real number (ℝ)
HIGH CORRELATION
  MISSING
  SKEWED
 
Distinct | 2642 |
---|---|
Distinct (%) | 44.3% |
Missing | 1155 |
Missing (%) | 16.2% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 38144867 |
Minimum | 0 |
---|---|
Maximum | 8.118 × 1010 |
Zeros | 5 |
Zeros (%) | 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 62.7 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 45000 |
Q1 | 685000 |
median | 2715000 |
Q3 | 7647500 |
95-th percentile | 35500000 |
Maximum | 8.118 × 1010 |
Range | 8.118 × 1010 |
Interquartile range (IQR) | 6962500 |
Descriptive statistics
Standard deviation | 1.0963646 × 109 |
---|---|
Coefficient of variation (CV) | 28.742127 |
Kurtosis | 5042.9965 |
Mean | 38144867 |
Median Absolute Deviation (MAD) | 2480000 |
Skewness | 68.792625 |
Sum | 2.2745784 × 1011 |
Variance | 1.2020154 × 1018 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
55000 | 55 | 0.8% |
20000 | 53 | 0.7% |
50000 | 46 | 0.6% |
45000 | 46 | 0.6% |
30000 | 45 | 0.6% |
35000 | 42 | 0.6% |
15000 | 40 | 0.6% |
115000 | 40 | 0.6% |
25000 | 38 | 0.5% |
60000 | 38 | 0.5% |
Other values (2632) | 5520 | |
(Missing) | 1155 | 16.2% |
Value | Count | Frequency (%) |
0 | 5 | 0.1% |
5000 | 5 | 0.1% |
10000 | 30 | |
15000 | 40 | |
20000 | 53 | |
25000 | 38 | |
27000 | 1 | < 0.1% |
30000 | 45 | |
35000 | 42 | |
40000 | 34 |
Value | Count | Frequency (%) |
81180000000 | 1 | |
13692095000 | 1 | |
12750000000 | 1 | |
10539105000 | 1 | |
4922070000 | 1 | |
3519865000 | 1 | |
3256385000 | 1 | |
3162415000 | 1 | |
2735625000 | 1 | |
2638090000 | 1 |
실적금액
Real number (ℝ)
HIGH CORRELATION
  MISSING
  SKEWED
 
Distinct | 2718 |
---|---|
Distinct (%) | 41.9% |
Missing | 635 |
Missing (%) | 8.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 11068677 |
Minimum | 0 |
---|---|
Maximum | 1.275 × 1010 |
Zeros | 23 |
Zeros (%) | 0.3% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 62.7 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 45000 |
Q1 | 765000 |
median | 2920000 |
Q3 | 7655000 |
95-th percentile | 29718500 |
Maximum | 1.275 × 1010 |
Range | 1.275 × 1010 |
Interquartile range (IQR) | 6890000 |
Descriptive statistics
Standard deviation | 1.904491 × 108 |
---|---|
Coefficient of variation (CV) | 17.20613 |
Kurtosis | 3654.5187 |
Mean | 11068677 |
Median Absolute Deviation (MAD) | 2620000 |
Skewness | 59.157666 |
Sum | 7.1758234 × 1010 |
Variance | 3.6270858 × 1016 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
55000 | 62 | 0.9% |
20000 | 55 | 0.8% |
30000 | 50 | 0.7% |
50000 | 47 | 0.7% |
45000 | 44 | 0.6% |
15000 | 44 | 0.6% |
115000 | 40 | 0.6% |
35000 | 40 | 0.6% |
60000 | 38 | 0.5% |
25000 | 37 | 0.5% |
Other values (2708) | 6026 | |
(Missing) | 635 | 8.9% |
Value | Count | Frequency (%) |
0 | 23 | |
5000 | 5 | 0.1% |
10000 | 28 | |
15000 | 44 | |
20000 | 55 | |
20555 | 1 | < 0.1% |
25000 | 37 | |
27000 | 1 | < 0.1% |
30000 | 50 | |
35000 | 40 |
Value | Count | Frequency (%) |
12750000000 | 1 | |
8345000000 | 1 | |
1192000000 | 1 | |
441575000 | 1 | |
367180000 | 1 | |
353310000 | 1 | |
249425000 | 1 | |
213680000 | 1 | |
191040000 | 1 | |
176855000 | 1 |
공사년도 | 공사구분 | 공사번호 | 순번 | 소요액 | 실적금액 | |
---|---|---|---|---|---|---|
공사년도 | 1.000 | 0.574 | 0.677 | 0.292 | 0.036 | 0.000 |
공사구분 | 0.574 | 1.000 | 0.554 | 0.102 | 0.000 | 0.000 |
공사번호 | 0.677 | 0.554 | 1.000 | 0.150 | 0.000 | 0.072 |
순번 | 0.292 | 0.102 | 0.150 | 1.000 | 0.041 | 0.000 |
소요액 | 0.036 | 0.000 | 0.000 | 0.041 | 1.000 | 0.750 |
실적금액 | 0.000 | 0.000 | 0.072 | 0.000 | 0.750 | 1.000 |
공사년도 | 공사번호 | 순번 | 소요액 | 실적금액 | 공사구분 | |
---|---|---|---|---|---|---|
공사년도 | 1.000 | 0.460 | -0.167 | -0.392 | -0.387 | 0.385 |
공사번호 | 0.460 | 1.000 | -0.078 | -0.193 | -0.189 | 0.365 |
순번 | -0.167 | -0.078 | 1.000 | 0.248 | 0.289 | 0.035 |
소요액 | -0.392 | -0.193 | 0.248 | 1.000 | 0.960 | 0.000 |
실적금액 | -0.387 | -0.189 | 0.289 | 0.960 | 1.000 | 0.000 |
공사구분 | 0.385 | 0.365 | 0.035 | 0.000 | 0.000 | 1.000 |
공사년도 | 공사구분 | 공사번호 | 부서코드 | 순번 | 지급일자 | 소요액 | 실적금액 | |
---|---|---|---|---|---|---|---|---|
0 | 1990 | 공사 | 5 | 1 | 1 | 1990-03-28 | 2475000 | <NA> |
1 | 1990 | 공사 | 5 | 1 | 2 | 1990-05-22 | <NA> | 2475000 |
2 | 1990 | 공사 | 1 | 1 | 3 | 2002-10-30 | 665000 | <NA> |
3 | 1990 | 공사 | 1 | 1 | 4 | 2002-10-30 | 1350000 | <NA> |
4 | 1990 | 공사 | 2 | 1 | 2 | 1990-04-09 | <NA> | 23985000 |
5 | 1990 | 공사 | 4 | 1 | 1 | 1990-06-27 | <NA> | 10662000 |
6 | 1990 | 공사 | 1 | 1 | 2 | 2002-10-30 | 1320000 | 10686000 |
7 | 1990 | 공사 | 1 | 1 | 1 | 2002-10-30 | 1110000 | <NA> |
8 | 1990 | 공사 | 2 | 1 | 1 | 1990-03-30 | 23985000 | <NA> |
9 | 1990 | 공사 | 6 | 1 | 2 | 1990-06-05 | 534000 | 534000 |
공사년도 | 공사구분 | 공사번호 | 부서코드 | 순번 | 지급일자 | 소요액 | 실적금액 | |
---|---|---|---|---|---|---|---|---|
7108 | 2013 | 용역 | 149 | 1 | 1 | 2013-05-29 | 760000 | 760000 |
7109 | 2013 | 용역 | 155 | 1 | 1 | 2013-06-25 | 75000 | 75000 |
7110 | 1996 | 공사 | 48 | 1 | 1 | 1996-07-04 | <NA> | 8600000 |
7111 | 2000 | 용역 | 56 | 1 | 1 | 2000-12-04 | 1250000 | 1250000 |
7112 | 2004 | 용역 | 2 | 1 | 1 | 2004-03-03 | 3256385000 | <NA> |
7113 | 2004 | 용역 | 2 | 1 | 2 | 2004-04-07 | 240350000 | <NA> |
7114 | 2011 | 용역 | 483 | 1 | 1 | 2012-01-17 | 220000 | 220000 |
7115 | 2004 | 용역 | 2 | 1 | 4 | 2004-05-17 | 1162995000 | <NA> |
7116 | 2010 | 용역 | 97 | 1 | 1 | 2011-02-24 | 255000 | 255000 |
7117 | 2004 | 용역 | 2 | 1 | 3 | 2004-05-06 | 232595000 | <NA> |