Dataset statistics
Number of variables | 10 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 8.9 KiB |
Average record size in memory | 91.3 B |
Variable types
Categorical | 6 |
---|---|
Numeric | 4 |
Dataset
Description | Sample data |
---|---|
Author | 한국수자원공사 (Korea Water Resources Corporation, K-water) |
URL | https://www.bigdata-environment.kr/user/data_market/detail.do?id=c6697020-38bb-11ea-be28-4fa0eb812a46 |
협잡물함수율 has constant value "0" | Constant |
액상슬러지함수율 has constant value "0" | Constant |
권역 is highly overall correlated with 탈수슬러지처리량 and 3 other fields | High correlation |
하수처리시설명 is highly overall correlated with 탈수슬러지처리량 and 4 other fields | High correlation |
관리단 is highly overall correlated with 탈수슬러지처리량 and 3 other fields | High correlation |
협잡물처리량 is highly overall correlated with 탈수슬러지처리량 and 3 other fields | High correlation |
탈수슬러지처리량 is highly overall correlated with 탈수슬러지함수율 and 4 other fields | High correlation |
탈수슬러지함수율 is highly overall correlated with 탈수슬러지처리량 and 1 other fields | High correlation |
권역 is highly imbalanced (71.4%) | Imbalance |
관리단 is highly imbalanced (71.4%) | Imbalance |
협잡물처리량 is highly imbalanced (82.3%) | Imbalance |
탈수슬러지처리량 has 44 (44.0%) zeros | Zeros |
액상슬러지처리량 has 89 (89.0%) zeros | Zeros |
탈수슬러지함수율 has 56 (56.0%) zeros | Zeros |
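The imbalance percentages in the alerts above are consistent with an entropy-based score: one minus the Shannon entropy of the class frequencies, normalized by the maximum entropy for that number of classes. The sketch below assumes that definition (the report itself does not state the formula) and uses the class counts listed in the Common Values tables; `imbalance_score` is a hypothetical helper name.

```python
from math import log2


def imbalance_score(counts):
    """Return 1 - H / log2(k) for a list of class counts.

    0.0 means perfectly balanced classes, values near 1.0 mean one class
    dominates. Assumed definition, not taken from the report.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # a single class has no entropy to normalize
    entropy = -sum((c / total) * log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / log2(k)


region_counts = [95, 5]            # 권역: "90" x 95, "91" x 5
screenings_counts = [95, 3, 1, 1]  # 협잡물처리량: "0.0", "83.5", "83.4", "82.8"

print(f"{imbalance_score(region_counts):.1%}")      # 71.4%
print(f"{imbalance_score(screenings_counts):.1%}")  # 82.3%
```

Under this assumption both values agree with the 71.4% and 82.3% reported in the alerts.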
Reproduction
Analysis started | 2023-12-10 13:03:21.265545 |
---|---|
Analysis finished | 2023-12-10 13:03:23.718166 |
Duration | 2.45 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
권역
Categorical
HIGH CORRELATION, IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Value | Count |
---|---|
90 | 95 |
91 | 5 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 90 |
---|---|
2nd row | 90 |
3rd row | 90 |
4th row | 90 |
5th row | 90 |
Common Values
Value | Count | Frequency (%) |
90 | 95 | 95.0% |
91 | 5 | 5.0% |
하수처리시설명
Categorical
HIGH CORRELATION
Distinct | 4 |
---|---|
Distinct (%) | 4.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Value | Count |
---|---|
50001 | 51 |
50002 | 23 |
50003 | 21 |
60001 | 5 |
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 5 |
Min length | 5 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 50002 |
---|---|
2nd row | 50001 |
3rd row | 50002 |
4th row | 50002 |
5th row | 50001 |
Common Values
Value | Count | Frequency (%) |
50001 | 51 | 51.0% |
50002 | 23 | 23.0% |
50003 | 21 | 21.0% |
60001 | 5 | 5.0% |
처리일자
Real number (ℝ)
Distinct | 31 |
---|---|
Distinct (%) | 31.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20190516 |
Minimum | 20190501 |
---|---|
Maximum | 20190531 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 20190501 |
---|---|
5-th percentile | 20190502 |
Q1 | 20190508 |
median | 20190516 |
Q3 | 20190523 |
95-th percentile | 20190530 |
Maximum | 20190531 |
Range | 30 |
Interquartile range (IQR) | 15.25 |
Descriptive statistics
Standard deviation | 8.8986209 |
---|---|
Coefficient of variation (CV) | 4.4073271 × 10⁻⁷ |
Kurtosis | -1.1767057 |
Mean | 20190516 |
Median Absolute Deviation (MAD) | 8 |
Skewness | 0.014283806 |
Sum | 2.0190516 × 10⁹ |
Variance | 79.185455 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
20190507 | 5 | 5.0% |
20190524 | 5 | 5.0% |
20190522 | 5 | 5.0% |
20190515 | 5 | 5.0% |
20190531 | 4 | 4.0% |
20190520 | 4 | 4.0% |
20190514 | 4 | 4.0% |
20190503 | 4 | 4.0% |
20190502 | 4 | 4.0% |
20190513 | 4 | 4.0% |
Other values (21) | 56 | 56.0% |
Minimum 10 values
Value | Count | Frequency (%) |
20190501 | 2 | 2.0% |
20190502 | 4 | 4.0% |
20190503 | 4 | 4.0% |
20190504 | 2 | 2.0% |
20190505 | 2 | 2.0% |
20190506 | 3 | 3.0% |
20190507 | 5 | 5.0% |
20190508 | 4 | 4.0% |
20190509 | 4 | 4.0% |
20190510 | 4 | 4.0% |
Maximum 10 values
Value | Count | Frequency (%) |
20190531 | 4 | 4.0% |
20190530 | 3 | 3.0% |
20190529 | 3 | 3.0% |
20190528 | 3 | 3.0% |
20190527 | 4 | 4.0% |
20190526 | 1 | 1.0% |
20190525 | 2 | 2.0% |
20190524 | 5 | 5.0% |
20190523 | 3 | 3.0% |
20190522 | 5 | 5.0% |
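처리일자 is profiled as a real number, but its values (20190501 through 20190531, 31 distinct) look like calendar dates encoded as YYYYMMDD integers. If that reading is right, statistics such as the variance or CV are not meaningful as dates, and the column is better parsed before analysis. A minimal sketch, assuming the YYYYMMDD encoding; `parse_yyyymmdd` is a hypothetical helper, not part of the dataset or the profiling tool.

```python
from datetime import date, datetime


def parse_yyyymmdd(value: int) -> date:
    """Convert an integer such as 20190501 into a datetime.date."""
    return datetime.strptime(str(value), "%Y%m%d").date()


first = parse_yyyymmdd(20190501)  # reported Minimum
last = parse_yyyymmdd(20190531)   # reported Maximum

# 30 days between the first and last date, matching the reported Range of 30.
span_days = (last - first).days
# 31 calendar days in total, matching the 31 distinct values.
calendar_days = span_days + 1

print(first.isoformat(), last.isoformat(), span_days, calendar_days)
```

Within a single month the integer range and the day span coincide; across a month boundary they would not (20190601 minus 20190531 is 70, not 1), which is the main reason to parse the column.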
관리단
Categorical
HIGH CORRELATION, IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Value | Count |
---|---|
500 | 95 |
600 | 5 |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 500 |
---|---|
2nd row | 500 |
3rd row | 500 |
4th row | 500 |
5th row | 500 |
Common Values
Value | Count | Frequency (%) |
500 | 95 | 95.0% |
600 | 5 | 5.0% |
탈수슬러지처리량
Real number (ℝ)
HIGH CORRELATION, ZEROS
Distinct | 51 |
---|---|
Distinct (%) | 51.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3212.79 |
Minimum | 0 |
---|---|
Maximum | 10291 |
Zeros | 44 |
Zeros (%) | 44.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 4675 |
Q3 | 5892.5 |
95-th percentile | 6606.5 |
Maximum | 10291 |
Range | 10291 |
Interquartile range (IQR) | 5892.5 |
Descriptive statistics
Standard deviation | 2976.1388 |
---|---|
Coefficient of variation (CV) | 0.9263409 |
Kurtosis | -1.6617748 |
Mean | 3212.79 |
Median Absolute Deviation (MAD) | 1895 |
Skewness | -0.0088044262 |
Sum | 321279 |
Variance | 8857402.1 |
Monotonicity | Not monotonic |
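Several of the figures above are derived from others in the same tables, so they can be cross-checked with simple arithmetic. The sketch below uses only the rounded values printed in this report for 탈수슬러지처리량; the variable names are illustrative.

```python
# Values copied from the tables above (rounded as printed).
mean = 3212.79
std = 2976.1388
variance = 8857402.1
q1, q3 = 0.0, 5892.5
minimum, maximum = 0, 10291
total, n = 321279, 100

cv = std / mean                # coefficient of variation
iqr = q3 - q1                  # interquartile range
value_range = maximum - minimum
mean_from_sum = total / n

print(round(cv, 7))            # close to the reported 0.9263409
print(iqr, value_range, mean_from_sum)
print(round(std ** 2, 1))      # close to the reported variance
```

Small differences in the last digit are expected because the inputs are already rounded.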
Common Values
Value | Count | Frequency (%) |
0 | 44 | 44.0% |
6480 | 3 | 3.0% |
5550 | 2 | 2.0% |
3423 | 2 | 2.0% |
6440 | 2 | 2.0% |
5890 | 2 | 2.0% |
6850 | 1 | 1.0% |
4450 | 1 | 1.0% |
4900 | 1 | 1.0% |
5920 | 1 | 1.0% |
Other values (41) | 41 | 41.0% |
Minimum 10 values
Value | Count | Frequency (%) |
0 | 44 | 44.0% |
2923 | 1 | 1.0% |
3249 | 1 | 1.0% |
3423 | 2 | 2.0% |
4450 | 1 | 1.0% |
4600 | 1 | 1.0% |
4750 | 1 | 1.0% |
4870 | 1 | 1.0% |
4900 | 1 | 1.0% |
4930 | 1 | 1.0% |
Maximum 10 values
Value | Count | Frequency (%) |
10291 | 1 | 1.0% |
7300 | 1 | 1.0% |
6870 | 1 | 1.0% |
6850 | 1 | 1.0% |
6730 | 1 | 1.0% |
6600 | 1 | 1.0% |
6540 | 1 | 1.0% |
6520 | 1 | 1.0% |
6480 | 3 | 3.0% |
6440 | 2 | 2.0% |
액상슬러지처리량
Real number (ℝ)
ZEROS
Distinct | 11 |
---|---|
Distinct (%) | 11.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 140.4 |
Minimum | 0 |
---|---|
Maximum | 1670 |
Zeros | 89 |
Zeros (%) | 89.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 1214.5 |
Maximum | 1670 |
Range | 1670 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 408.08476 |
---|---|
Coefficient of variation (CV) | 2.9065866 |
Kurtosis | 5.6992412 |
Mean | 140.4 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 2.6914033 |
Sum | 14040 |
Variance | 166533.17 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
0 | 89 | 89.0% |
1520 | 2 | 2.0% |
960 | 1 | 1.0% |
1160 | 1 | 1.0% |
1440 | 1 | 1.0% |
1170 | 1 | 1.0% |
1210 | 1 | 1.0% |
1670 | 1 | 1.0% |
1300 | 1 | 1.0% |
1030 | 1 | 1.0% |
Minimum 10 values
Value | Count | Frequency (%) |
0 | 89 | 89.0% |
960 | 1 | 1.0% |
1030 | 1 | 1.0% |
1060 | 1 | 1.0% |
1160 | 1 | 1.0% |
1170 | 1 | 1.0% |
1210 | 1 | 1.0% |
1300 | 1 | 1.0% |
1440 | 1 | 1.0% |
1520 | 2 | 2.0% |
Maximum 10 values
Value | Count | Frequency (%) |
1670 | 1 | 1.0% |
1520 | 2 | 2.0% |
1440 | 1 | 1.0% |
1300 | 1 | 1.0% |
1210 | 1 | 1.0% |
1170 | 1 | 1.0% |
1160 | 1 | 1.0% |
1060 | 1 | 1.0% |
1030 | 1 | 1.0% |
960 | 1 | 1.0% |
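Because 액상슬러지처리량 has only 11 distinct values, the value tables above list every one of them (89 zeros plus ten non-zero values), so the whole column can be rebuilt and the summary statistics recomputed. The sketch below assumes the sample variance (n − 1 denominator) and linear interpolation between order statistics for quantiles; both are assumptions about how the report was computed, though they reproduce its figures. `linear_quantile` is a hypothetical helper.

```python
import statistics

# Non-zero values and their counts, taken from the tables above.
nonzero_counts = {960: 1, 1030: 1, 1060: 1, 1160: 1, 1170: 1,
                  1210: 1, 1300: 1, 1440: 1, 1520: 2, 1670: 1}

values = [0] * 89
for value, count in nonzero_counts.items():
    values.extend([value] * count)
values.sort()


def linear_quantile(sorted_values, q):
    """Quantile with linear interpolation between neighbouring order statistics."""
    position = q * (len(sorted_values) - 1)
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])


total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance, n - 1 denominator
std = statistics.stdev(values)
p95 = linear_quantile(values, 0.95)

print(len(values), total, mean)               # 100 14040 140.4
print(round(variance, 2), round(std, 5))      # 166533.17 408.08476
print(round(p95, 1))                          # 1214.5
```

The recomputed sum, mean, variance, standard deviation, and 95th percentile agree with the values reported for this variable.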
탈수슬러지함수율
Real number (ℝ)
HIGH CORRELATION, ZEROS
Distinct | 31 |
---|---|
Distinct (%) | 31.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6938.8 |
Minimum | 0 |
---|---|
Maximum | 35860 |
Zeros | 56 |
Zeros (%) | 56.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 11952.5 |
95-th percentile | 21229 |
Maximum | 35860 |
Range | 35860 |
Interquartile range (IQR) | 11952.5 |
Descriptive statistics
Standard deviation | 8859.3209 |
---|---|
Coefficient of variation (CV) | 1.27678 |
Kurtosis | -0.20089228 |
Mean | 6938.8 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 0.93287479 |
Sum | 693880 |
Variance | 78487566 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
0 | 56 | 56.0% |
21060 | 4 | 4.0% |
11970 | 3 | 3.0% |
23900 | 2 | 2.0% |
17920 | 2 | 2.0% |
11930 | 2 | 2.0% |
11940 | 2 | 2.0% |
21040 | 2 | 2.0% |
11950 | 2 | 2.0% |
10510 | 2 | 2.0% |
Other values (21) | 23 | 23.0% |
Minimum 10 values
Value | Count | Frequency (%) |
0 | 56 | 56.0% |
5970 | 1 | 1.0% |
5980 | 1 | 1.0% |
10480 | 1 | 1.0% |
10510 | 2 | 2.0% |
10520 | 2 | 2.0% |
10530 | 2 | 2.0% |
10540 | 1 | 1.0% |
10550 | 1 | 1.0% |
10570 | 1 | 1.0% |
Maximum 10 values
Value | Count | Frequency (%) |
35860 | 1 | 1.0% |
23900 | 2 | 2.0% |
23890 | 1 | 1.0% |
23870 | 1 | 1.0% |
21090 | 1 | 1.0% |
21080 | 1 | 1.0% |
21060 | 4 | 4.0% |
21050 | 1 | 1.0% |
21040 | 2 | 2.0% |
20880 | 1 | 1.0% |
협잡물처리량
Categorical
HIGH CORRELATION, IMBALANCE
Distinct | 4 |
---|---|
Distinct (%) | 4.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Value | Count |
---|---|
0.0 | 95 |
83.5 | 3 |
83.4 | 1 |
82.8 | 1 |
Length
Max length | 4 |
---|---|
Median length | 3 |
Mean length | 3.05 |
Min length | 3 |
Unique
Unique | 2 |
---|---|
Unique (%) | 2.0% |
Sample
1st row | 0.0 |
---|---|
2nd row | 0.0 |
3rd row | 0.0 |
4th row | 0.0 |
5th row | 0.0 |
Common Values
Value | Count | Frequency (%) |
0.0 | 95 | 95.0% |
83.5 | 3 | 3.0% |
83.4 | 1 | 1.0% |
82.8 | 1 | 1.0% |
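협잡물처리량 is profiled as Categorical, which is why the report gives string-length statistics for it rather than numeric ones. The sketch below assumes the values are stored as the text shown in the Common Values table and recomputes the Length and Unique figures from the listed counts; it also shows that the same strings convert cleanly to floats if a numeric treatment is preferred.

```python
from collections import Counter
from statistics import mean, median

# Column rebuilt from the Common Values table (assumed to be stored as text).
values = ["0.0"] * 95 + ["83.5"] * 3 + ["83.4", "82.8"]

lengths = [len(v) for v in values]
counts = Counter(values)

# "Unique" in the report counts values that occur exactly once.
unique_once = sum(1 for c in counts.values() if c == 1)

print(max(lengths), median(lengths), mean(lengths), min(lengths))  # 4 3.0 3.05 3
print(len(counts), unique_once)                                    # 4 2

# Numeric view of the same column.
as_floats = [float(v) for v in values]
print(max(as_floats))                                              # 83.5
```

The results agree with the reported max length 4, median length 3, mean length 3.05, min length 3, 4 distinct values, and 2 unique values.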
협잡물함수율
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Value | Count |
---|---|
0 | 100 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 100 | 100.0% |
액상슬러지함수율
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Value | Count |
---|---|
0 | 100 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 100 | 100.0% |
Correlations
Variable | 권역 | 하수처리시설명 | 처리일자 | 관리단 | 탈수슬러지처리량 | 액상슬러지처리량 | 탈수슬러지함수율 | 협잡물처리량 |
---|---|---|---|---|---|---|---|---|
권역 | 1.000 | 1.000 | 0.000 | 0.986 | 1.000 | 0.000 | 0.000 | 1.000 |
하수처리시설명 | 1.000 | 1.000 | 0.000 | 1.000 | 0.978 | 0.000 | 0.968 | 0.885 |
처리일자 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
관리단 | 0.986 | 1.000 | 0.000 | 1.000 | 1.000 | 0.000 | 0.000 | 1.000 |
탈수슬러지처리량 | 1.000 | 0.978 | 0.000 | 1.000 | 1.000 | 0.191 | 0.670 | 0.983 |
액상슬러지처리량 | 0.000 | 0.000 | 0.000 | 0.000 | 0.191 | 1.000 | 0.000 | 0.000 |
탈수슬러지함수율 | 0.000 | 0.968 | 0.000 | 0.000 | 0.670 | 0.000 | 1.000 | 0.000 |
협잡물처리량 | 1.000 | 0.885 | 0.000 | 1.000 | 0.983 | 0.000 | 0.000 | 1.000 |
Variable | 권역 | 하수처리시설명 | 관리단 | 협잡물처리량 |
---|---|---|---|---|
권역 | 1.000 | 0.990 | 0.894 | 0.990 |
하수처리시설명 | 0.990 | 1.000 | 0.990 | 0.559 |
관리단 | 0.894 | 0.990 | 1.000 | 0.990 |
협잡물처리량 | 0.990 | 0.559 | 0.990 | 1.000 |
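The report does not label this four-variable matrix, but its entries are consistent with a bias-corrected Cramér's V computed from a chi-squared statistic (with Yates' continuity correction for 2×2 tables). The sketch below is written under that assumption and under the further assumption, suggested by the sample rows, that 권역 "90" always pairs with 관리단 "500" and "91" with "600". `cramers_v_corrected` is a hypothetical helper, not the profiling tool's API.

```python
from collections import Counter
from math import sqrt


def cramers_v_corrected(xs, ys):
    """Bias-corrected Cramér's V for two categorical sequences of equal length."""
    n = len(xs)
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    observed = Counter(zip(xs, ys))
    r, k = len(row_totals), len(col_totals)

    # Yates' continuity correction applies only with one degree of freedom (2x2).
    use_yates = (r - 1) * (k - 1) == 1

    chi2 = 0.0
    for a, row_total in row_totals.items():
        for b, col_total in col_totals.items():
            expected = row_total * col_total / n
            diff = abs(observed[(a, b)] - expected)
            if use_yates:
                diff -= min(0.5, diff)
            chi2 += diff * diff / expected

    phi2 = chi2 / n
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    denominator = min(k_corr - 1, r_corr - 1)
    return sqrt(phi2_corr / denominator) if denominator > 0 else 0.0


# Assumed pairing: 95 rows of ("90", "500") and 5 rows of ("91", "600").
region = ["90"] * 95 + ["91"] * 5
office = ["500"] * 95 + ["600"] * 5

print(round(cramers_v_corrected(region, office), 3))  # 0.894
```

Under these assumptions the result matches the 0.894 shown for 권역 and 관리단. It stays below 1.0 despite the perfect association because both corrections shrink the statistic when one class has only five rows.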
Variable | 처리일자 | 탈수슬러지처리량 | 액상슬러지처리량 | 탈수슬러지함수율 | 권역 | 하수처리시설명 | 관리단 | 협잡물처리량 |
---|---|---|---|---|---|---|---|---|
처리일자 | 1.000 | -0.000 | -0.044 | 0.075 | 0.000 | 0.000 | 0.000 | 0.000 |
탈수슬러지처리량 | -0.000 | 1.000 | 0.411 | -0.851 | 0.969 | 0.784 | 0.969 | 0.808 |
액상슬러지처리량 | -0.044 | 0.411 | 1.000 | -0.294 | 0.000 | 0.000 | 0.000 | 0.000 |
탈수슬러지함수율 | 0.075 | -0.851 | -0.294 | 1.000 | 0.000 | 0.747 | 0.000 | 0.000 |
권역 | 0.000 | 0.969 | 0.000 | 0.000 | 1.000 | 0.990 | 0.894 | 0.990 |
하수처리시설명 | 0.000 | 0.784 | 0.000 | 0.747 | 0.990 | 1.000 | 0.990 | 0.559 |
관리단 | 0.000 | 0.969 | 0.000 | 0.000 | 0.894 | 0.990 | 1.000 | 0.990 |
협잡물처리량 | 0.000 | 0.808 | 0.000 | 0.000 | 0.990 | 0.559 | 0.990 | 1.000 |
First rows
Row | 권역 | 하수처리시설명 | 처리일자 | 관리단 | 탈수슬러지처리량 | 액상슬러지처리량 | 탈수슬러지함수율 | 협잡물처리량 | 협잡물함수율 | 액상슬러지함수율 |
---|---|---|---|---|---|---|---|---|---|---|
0 | 90 | 50002 | 20190527 | 500 | 0 | 0 | 21080 | 0.0 | 0 | 0 |
1 | 90 | 50001 | 20190502 | 500 | 5660 | 0 | 0 | 0.0 | 0 | 0 |
2 | 90 | 50002 | 20190524 | 500 | 0 | 0 | 21060 | 0.0 | 0 | 0 |
3 | 90 | 50002 | 20190525 | 500 | 0 | 0 | 10530 | 0.0 | 0 | 0 |
4 | 90 | 50001 | 20190522 | 500 | 6040 | 0 | 0 | 0.0 | 0 | 0 |
5 | 90 | 50001 | 20190516 | 500 | 6060 | 0 | 0 | 0.0 | 0 | 0 |
6 | 90 | 50002 | 20190521 | 500 | 0 | 0 | 10520 | 0.0 | 0 | 0 |
7 | 90 | 50002 | 20190523 | 500 | 0 | 0 | 10510 | 0.0 | 0 | 0 |
8 | 90 | 50003 | 20190528 | 500 | 0 | 0 | 11960 | 0.0 | 0 | 0 |
9 | 90 | 50002 | 20190508 | 500 | 0 | 0 | 10580 | 0.0 | 0 | 0 |
Last rows
Row | 권역 | 하수처리시설명 | 처리일자 | 관리단 | 탈수슬러지처리량 | 액상슬러지처리량 | 탈수슬러지함수율 | 협잡물처리량 | 협잡물함수율 | 액상슬러지함수율 |
---|---|---|---|---|---|---|---|---|---|---|
90 | 90 | 50001 | 20190514 | 500 | 5610 | 0 | 0 | 0.0 | 0 | 0 |
91 | 90 | 50001 | 20190501 | 500 | 5760 | 0 | 0 | 0.0 | 0 | 0 |
92 | 90 | 50001 | 20190515 | 500 | 6440 | 1060 | 0 | 0.0 | 0 | 0 |
93 | 90 | 50002 | 20190503 | 500 | 0 | 0 | 21060 | 0.0 | 0 | 0 |
94 | 90 | 50001 | 20190515 | 500 | 5520 | 0 | 0 | 0.0 | 0 | 0 |
95 | 91 | 60001 | 20190506 | 600 | 3423 | 0 | 0 | 83.5 | 0 | 0 |
96 | 91 | 60001 | 20190524 | 600 | 3249 | 0 | 0 | 83.4 | 0 | 0 |
97 | 91 | 60001 | 20190507 | 600 | 10291 | 0 | 0 | 82.8 | 0 | 0 |
98 | 91 | 60001 | 20190515 | 600 | 3423 | 0 | 0 | 83.5 | 0 | 0 |
99 | 91 | 60001 | 20190522 | 600 | 2923 | 0 | 0 | 83.5 | 0 | 0 |