Dataset statistics
Number of variables | 10 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 8.9 KiB |
Average record size in memory | 91.3 B |
Variable types
Categorical | 5 |
---|---|
Numeric | 5 |
Dataset
Description | Sample data |
---|---|
Author | 한국수자원공사 (Korea Water Resources Corporation, K-water) |
URL | https://www.bigdata-environment.kr/user/data_market/detail.do?id=c6697020-38bb-11ea-be28-4fa0eb812a46 |
협잡물함수율 has constant value "0" | Constant |
액상슬러지함수율 has constant value "0" | Constant |
권역 is highly overall correlated with 하수처리시설명 and 2 other fields | High correlation |
관리단 is highly overall correlated with 하수처리시설명 and 4 other fields | High correlation |
하수처리시설명 is highly overall correlated with 탈수슬러지처리량 and 4 other fields | High correlation |
탈수슬러지처리량 is highly overall correlated with 하수처리시설명 and 3 other fields | High correlation |
탈수슬러지함수율 is highly overall correlated with 탈수슬러지처리량 | High correlation |
협잡물처리량 is highly overall correlated with 하수처리시설명 and 2 other fields | High correlation |
액상슬러지처리량 is highly overall correlated with 하수처리시설명 and 2 other fields | High correlation |
액상슬러지처리량 is highly imbalanced (89.8%) | Imbalance |
탈수슬러지처리량 has 28 (28.0%) zeros | Zeros |
탈수슬러지함수율 has 72 (72.0%) zeros | Zeros |
협잡물처리량 has 71 (71.0%) zeros | Zeros |
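The 89.8% imbalance figure for 액상슬러지처리량 can be reproduced from its value counts (98, 1, 1). The sketch below assumes the score is one minus the Shannon entropy normalized by the maximum entropy for the number of classes. That definition is an assumption here, although it does yield the reported value.

```python
from math import log2

# Value counts for 액상슬러지처리량, taken from its Common Values table.
counts = {"0": 98, "560": 1, "14180": 1}
total = sum(counts.values())

# Shannon entropy (base 2) of the observed class distribution.
entropy = -sum((c / total) * log2(c / total) for c in counts.values())

# Assumed definition: 1 - entropy / maximum possible entropy for k classes.
imbalance = 1 - entropy / log2(len(counts))
print(f"{imbalance:.1%}")
```

A score near 100% means almost every row falls in one class, which is the case here: 98 of the 100 rows are 0.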
Reproduction
Analysis started | 2023-12-10 13:03:43.524677 |
---|---|
Analysis finished | 2023-12-10 13:03:46.243609 |
Duration | 2.72 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
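As a quick check, the reported duration follows from the two timestamps in the table above. This is a minimal sketch using only the standard library; the variable names are illustrative.

```python
from datetime import datetime

# Timestamps copied from the Reproduction table.
started = datetime.fromisoformat("2023-12-10 13:03:43.524677")
finished = datetime.fromisoformat("2023-12-10 13:03:46.243609")

# Elapsed wall-clock time in seconds, formatted to two decimals as in the report.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```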
권역
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 90 |
---|---|
2nd row | 90 |
3rd row | 90 |
4th row | 90 |
5th row | 90 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
90 | 69 | 69.0% |
91 | 31 | 31.0% |
하수처리시설명
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 6 |
---|---|
Distinct (%) | 6.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 53001.38 |
Minimum | 40001 |
---|---|
Maximum | 70001 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 40001 |
---|---|
5-th percentile | 50001 |
Q1 | 50001 |
median | 50002 |
Q3 | 60001 |
95-th percentile | 60001 |
Maximum | 70001 |
Range | 30000 |
Interquartile range (IQR) | 10000 |
Descriptive statistics
Standard deviation | 5024.96 |
---|---|
Coefficient of variation (CV) | 0.094808097 |
Kurtosis | 0.059350478 |
Mean | 53001.38 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 0.87718521 |
Sum | 5300138 |
Variance | 25250223 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
---|---|---|
50001 | 41 | 41.0% |
60001 | 29 | 29.0% |
50002 | 18 | 18.0% |
50003 | 10 | 10.0% |
70001 | 1 | 1.0% |
40001 | 1 | 1.0% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
40001 | 1 | 1.0% |
50001 | 41 | 41.0% |
50002 | 18 | 18.0% |
50003 | 10 | 10.0% |
60001 | 29 | 29.0% |
70001 | 1 | 1.0% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
70001 | 1 | 1.0% |
60001 | 29 | 29.0% |
50003 | 10 | 10.0% |
50002 | 18 | 18.0% |
50001 | 41 | 41.0% |
40001 | 1 | 1.0% |
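Because 하수처리시설명 has only six distinct values, the descriptive statistics above can be rebuilt from the frequency table alone. The sketch below assumes the report uses the sample standard deviation (n - 1 in the denominator) and an unscaled median absolute deviation; both assumptions agree with the reported figures.

```python
import statistics

# Frequency table for 하수처리시설명 (value -> count), as listed in the report.
freq = {50001: 41, 60001: 29, 50002: 18, 50003: 10, 70001: 1, 40001: 1}

# Expand the table back into the 100 individual observations.
values = [v for v, n in freq.items() for _ in range(n)]

mean = statistics.fmean(values)
std = statistics.stdev(values)          # sample standard deviation (n - 1)
cv = std / mean                         # coefficient of variation
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)

print(f"mean={mean:.2f} std={std:.2f} cv={cv:.6f} median={median} mad={mad}")
```

The MAD of 1 next to a standard deviation of about 5025 reflects that these values behave like facility codes clustered at 5000x and 60001 rather than a continuous measurement.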
처리일자
Real number (ℝ)
Distinct | 31 |
---|---|
Distinct (%) | 31.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20190117 |
Minimum | 20190101 |
---|---|
Maximum | 20190131 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 20190101 |
---|---|
5-th percentile | 20190103 |
Q1 | 20190110 |
median | 20190117 |
Q3 | 20190125 |
95-th percentile | 20190131 |
Maximum | 20190131 |
Range | 30 |
Interquartile range (IQR) | 15.25 |
Descriptive statistics
Standard deviation | 9.0775781 |
---|---|
Coefficient of variation (CV) | 4.4960503 × 10⁻⁷ |
Kurtosis | -1.1925558 |
Mean | 20190117 |
Median Absolute Deviation (MAD) | 8 |
Skewness | -0.09445672 |
Sum | 2.0190117 × 10⁹ |
Variance | 82.402424 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
---|---|---|
20190131 | 6 | 6.0% |
20190115 | 5 | 5.0% |
20190128 | 5 | 5.0% |
20190107 | 5 | 5.0% |
20190121 | 5 | 5.0% |
20190114 | 4 | 4.0% |
20190123 | 4 | 4.0% |
20190125 | 4 | 4.0% |
20190129 | 4 | 4.0% |
20190124 | 4 | 4.0% |
Other values (21) | 54 | 54.0% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
20190101 | 2 | 2.0% |
20190102 | 3 | 3.0% |
20190103 | 3 | 3.0% |
20190104 | 4 | 4.0% |
20190105 | 2 | 2.0% |
20190106 | 1 | 1.0% |
20190107 | 5 | 5.0% |
20190108 | 3 | 3.0% |
20190109 | 2 | 2.0% |
20190110 | 4 | 4.0% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
20190131 | 6 | 6.0% |
20190130 | 3 | 3.0% |
20190129 | 4 | 4.0% |
20190128 | 5 | 5.0% |
20190127 | 2 | 2.0% |
20190126 | 2 | 2.0% |
20190125 | 4 | 4.0% |
20190124 | 4 | 4.0% |
20190123 | 4 | 4.0% |
20190122 | 2 | 2.0% |
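처리일자 holds dates encoded as YYYYMMDD integers, which is why it is profiled as a real number and why statistics such as the mean and sum are not meaningful as dates. A minimal sketch of decoding the values before analysis follows; the helper name `parse_yyyymmdd` is hypothetical.

```python
from datetime import date, datetime

def parse_yyyymmdd(value: int) -> date:
    """Convert an integer such as 20190131 into a calendar date."""
    return datetime.strptime(str(value), "%Y%m%d").date()

first = parse_yyyymmdd(20190101)   # reported minimum
last = parse_yyyymmdd(20190131)    # reported maximum
span_days = (last - first).days

# Integer subtraction only matches the day count inside a single month.
# 20190201 is an illustrative value, not one found in this dataset.
int_gap = 20190201 - 20190131
real_gap = (parse_yyyymmdd(20190201) - parse_yyyymmdd(20190131)).days
print(span_days, int_gap, real_gap)
```

The reported range of 30 happens to equal the true span in days only because every observation falls in January 2019.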
관리단
Categorical
HIGH CORRELATION
 
Distinct | 4 |
---|---|
Distinct (%) | 4.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 2 |
---|---|
Unique (%) | 2.0% |
Sample
1st row | 500 |
---|---|
2nd row | 500 |
3rd row | 500 |
4th row | 500 |
5th row | 500 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
500 | 69 | 69.0% |
600 | 29 | 29.0% |
700 | 1 | 1.0% |
400 | 1 | 1.0% |
탈수슬러지처리량
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 69 |
---|---|
Distinct (%) | 69.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 9675.09 |
Minimum | 0 |
---|---|
Maximum | 626980 |
Zeros | 28 |
Zeros (%) | 28.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 2646 |
Q3 | 5497.5 |
95-th percentile | 6143.5 |
Maximum | 626980 |
Range | 626980 |
Interquartile range (IQR) | 5497.5 |
Descriptive statistics
Standard deviation | 62526.446 |
---|---|
Coefficient of variation (CV) | 6.4626217 |
Kurtosis | 98.868593 |
Mean | 9675.09 |
Median Absolute Deviation (MAD) | 2646 |
Skewness | 9.9184069 |
Sum | 967509 |
Variance | 3.9095565 × 10⁹ |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
---|---|---|
0 | 28 | 28.0% |
6090 | 3 | 3.0% |
2736 | 2 | 2.0% |
2774 | 2 | 2.0% |
2141 | 1 | 1.0% |
2606 | 1 | 1.0% |
2710 | 1 | 1.0% |
2600 | 1 | 1.0% |
2654 | 1 | 1.0% |
1624 | 1 | 1.0% |
Other values (59) | 59 | 59.0% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
0 | 28 | 28.0% |
1624 | 1 | 1.0% |
1851 | 1 | 1.0% |
1938 | 1 | 1.0% |
2040 | 1 | 1.0% |
2052 | 1 | 1.0% |
2078 | 1 | 1.0% |
2090 | 1 | 1.0% |
2128 | 1 | 1.0% |
2141 | 1 | 1.0% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
626980 | 1 | 1.0% |
43020 | 1 | 1.0% |
6960 | 1 | 1.0% |
6350 | 1 | 1.0% |
6210 | 1 | 1.0% |
6140 | 1 | 1.0% |
6090 | 3 | 3.0% |
6070 | 1 | 1.0% |
5990 | 1 | 1.0% |
5960 | 1 | 1.0% |
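The gap between the mean (9675.09) and the median (2646) of 탈수슬러지처리량 is driven largely by the single maximum of 626980. The sketch below works only from the summary figures reported above. The 1.5 × IQR upper fence is a common rule of thumb introduced here for illustration and is not something the report itself applies.

```python
# Summary figures copied from the 탈수슬러지처리량 statistics.
n = 100
total = 967509
maximum = 626980
q1, q3 = 0.0, 5497.5

mean_all = total / n
mean_without_max = (total - maximum) / (n - 1)

# Tukey-style upper fence (an assumption for this sketch).
upper_fence = q3 + 1.5 * (q3 - q1)

# Largest values from the Maximum 10 values table.
top_values = [626980, 43020, 6960, 6350]
flagged = [v for v in top_values if v > upper_fence]

print(f"mean with max:    {mean_all:.2f}")
print(f"mean without max: {mean_without_max:.2f}")
print(f"upper fence:      {upper_fence:.2f}, flagged: {flagged}")
```

Dropping one observation cuts the mean by almost two thirds, which fits the reported skewness (9.92) and kurtosis (98.87).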
액상슬러지처리량
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | 3.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 5 |
---|---|
Median length | 1 |
Mean length | 1.06 |
Min length | 1 |
Unique
Unique | 2 |
---|---|
Unique (%) | 2.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
0 | 98 | 98.0% |
560 | 1 | 1.0% |
14180 | 1 | 1.0% |
탈수슬러지함수율
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 22 |
---|---|
Distinct (%) | 22.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4756.4 |
Minimum | 0 |
---|---|
Maximum | 59540 |
Zeros | 72 |
Zeros (%) | 72.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 10497.5 |
95-th percentile | 21050.5 |
Maximum | 59540 |
Range | 59540 |
Interquartile range (IQR) | 10497.5 |
Descriptive statistics
Standard deviation | 9210.3739 |
---|---|
Coefficient of variation (CV) | 1.936417 |
Kurtosis | 11.742661 |
Mean | 4756.4 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 2.8102819 |
Sum | 475640 |
Variance | 84830987 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
---|---|---|
0 | 72 | 72.0% |
11900 | 3 | 3.0% |
10490 | 3 | 3.0% |
21060 | 2 | 2.0% |
10530 | 2 | 2.0% |
10520 | 2 | 2.0% |
10540 | 1 | 1.0% |
10580 | 1 | 1.0% |
10560 | 1 | 1.0% |
20770 | 1 | 1.0% |
Other values (12) | 12 | 12.0% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
0 | 72 | 72.0% |
10490 | 3 | 3.0% |
10520 | 2 | 2.0% |
10530 | 2 | 2.0% |
10540 | 1 | 1.0% |
10560 | 1 | 1.0% |
10580 | 1 | 1.0% |
11900 | 3 | 3.0% |
11910 | 1 | 1.0% |
11920 | 1 | 1.0% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
59540 | 1 | 1.0% |
23830 | 1 | 1.0% |
23820 | 1 | 1.0% |
21060 | 2 | 2.0% |
21050 | 1 | 1.0% |
21040 | 1 | 1.0% |
21000 | 1 | 1.0% |
20990 | 1 | 1.0% |
20960 | 1 | 1.0% |
20770 | 1 | 1.0% |
협잡물처리량
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 6 |
---|---|
Distinct (%) | 6.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 24.147 |
Minimum | 0 |
---|---|
Maximum | 83.5 |
Zeros | 71 |
Zeros (%) | 71.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 83.2 |
95-th percentile | 83.4 |
Maximum | 83.5 |
Range | 83.5 |
Interquartile range (IQR) | 83.2 |
Descriptive statistics
Standard deviation | 37.973261 |
---|---|
Coefficient of variation (CV) | 1.5725871 |
Kurtosis | -1.1399668 |
Mean | 24.147 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 0.93978348 |
Sum | 2414.7 |
Variance | 1441.9686 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
---|---|---|
0.0 | 71 | 71.0% |
83.3 | 14 | 14.0% |
83.2 | 8 | 8.0% |
83.4 | 3 | 3.0% |
83.5 | 3 | 3.0% |
82.2 | 1 | 1.0% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
0.0 | 71 | 71.0% |
82.2 | 1 | 1.0% |
83.2 | 8 | 8.0% |
83.3 | 14 | 14.0% |
83.4 | 3 | 3.0% |
83.5 | 3 | 3.0% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
83.5 | 3 | 3.0% |
83.4 | 3 | 3.0% |
83.3 | 14 | 14.0% |
83.2 | 8 | 8.0% |
82.2 | 1 | 1.0% |
0.0 | 71 | 71.0% |
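With 71% zeros, the quartiles of 협잡물처리량 jump straight from 0 to about 83, and the overall mean of 24.147 describes neither group. The sketch below rebuilds the observations from the frequency table and recomputes the quartiles, assuming linear interpolation between order statistics (the `inclusive` method in Python's `statistics` module), along with the mean of the non-zero readings.

```python
import statistics

# Frequency table for 협잡물처리량 (value -> count), as listed in the report.
freq = {0.0: 71, 83.3: 14, 83.2: 8, 83.4: 3, 83.5: 3, 82.2: 1}
values = sorted(v for v, n in freq.items() for _ in range(n))

# Quartiles with linear interpolation over the full 100 observations.
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")

# Separate the zero readings from the actual measurements.
nonzero = [v for v in values if v != 0]
nonzero_mean = statistics.fmean(nonzero)

print(f"Q1={q1} median={median} Q3={q3:.1f}")
print(f"non-zero rows: {len(nonzero)}, mean of non-zero rows: {nonzero_mean:.2f}")
```

Summarizing the zero share and the non-zero readings separately is usually more informative for a column like this than a single mean or standard deviation.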
협잡물함수율
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
0 | 100 | 100.0% |
액상슬러지함수율
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
0 | 100 | 100.0% |
Variable | 권역 | 하수처리시설명 | 처리일자 | 관리단 | 탈수슬러지처리량 | 액상슬러지처리량 | 탈수슬러지함수율 | 협잡물처리량 |
---|---|---|---|---|---|---|---|---|
권역 | 1.000 | 1.000 | 0.000 | 1.000 | 0.000 | 0.050 | 0.502 | 0.994 |
하수처리시설명 | 1.000 | 1.000 | 0.000 | 1.000 | 1.000 | 0.941 | 0.422 | 1.000 |
처리일자 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
관리단 | 1.000 | 1.000 | 0.000 | 1.000 | 1.000 | 0.669 | 0.144 | 1.000 |
탈수슬러지처리량 | 0.000 | 1.000 | 0.000 | 1.000 | 1.000 | 1.000 | 0.000 | 0.000 |
액상슬러지처리량 | 0.050 | 0.941 | 0.000 | 0.669 | 1.000 | 1.000 | 0.000 | 0.000 |
탈수슬러지함수율 | 0.502 | 0.422 | 0.000 | 0.144 | 0.000 | 0.000 | 1.000 | 0.469 |
협잡물처리량 | 0.994 | 1.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.469 | 1.000 |
Variable | 액상슬러지처리량 | 권역 | 관리단 |
---|---|---|---|
액상슬러지처리량 | 1.000 | 0.082 | 0.694 |
권역 | 0.082 | 1.000 | 0.990 |
관리단 | 0.694 | 0.990 | 1.000 |
Variable | 하수처리시설명 | 처리일자 | 탈수슬러지처리량 | 탈수슬러지함수율 | 협잡물처리량 | 권역 | 관리단 | 액상슬러지처리량 |
---|---|---|---|---|---|---|---|---|
하수처리시설명 | 1.000 | -0.047 | -0.584 | 0.142 | 0.792 | 0.990 | 1.000 | 0.694 |
처리일자 | -0.047 | 1.000 | 0.051 | -0.036 | -0.018 | 0.000 | 0.000 | 0.000 |
탈수슬러지처리량 | -0.584 | 0.051 | 1.000 | -0.773 | -0.159 | 0.000 | 0.990 | 0.995 |
탈수슬러지함수율 | 0.142 | -0.036 | -0.773 | 1.000 | -0.385 | 0.354 | 0.089 | 0.000 |
협잡물처리량 | 0.792 | -0.018 | -0.159 | -0.385 | 1.000 | 0.929 | 0.990 | 0.000 |
권역 | 0.990 | 0.000 | 0.000 | 0.354 | 0.929 | 1.000 | 0.990 | 0.082 |
관리단 | 1.000 | 0.000 | 0.990 | 0.089 | 0.990 | 0.990 | 1.000 | 0.694 |
액상슬러지처리량 | 0.694 | 0.000 | 0.995 | 0.000 | 0.000 | 0.082 | 0.694 | 1.000 |
First rows
Index | 권역 | 하수처리시설명 | 처리일자 | 관리단 | 탈수슬러지처리량 | 액상슬러지처리량 | 탈수슬러지함수율 | 협잡물처리량 | 협잡물함수율 | 액상슬러지함수율 |
---|---|---|---|---|---|---|---|---|---|---|
0 | 90 | 50001 | 20190122 | 500 | 5760 | 0 | 0 | 0.0 | 0 | 0 |
1 | 90 | 50002 | 20190116 | 500 | 0 | 0 | 10490 | 0.0 | 0 | 0 |
2 | 90 | 50001 | 20190108 | 500 | 6140 | 0 | 0 | 0.0 | 0 | 0 |
3 | 90 | 50001 | 20190109 | 500 | 6090 | 0 | 0 | 0.0 | 0 | 0 |
4 | 90 | 50003 | 20190114 | 500 | 0 | 0 | 23820 | 0.0 | 0 | 0 |
5 | 90 | 50003 | 20190115 | 500 | 0 | 0 | 59540 | 0.0 | 0 | 0 |
6 | 90 | 50003 | 20190117 | 500 | 0 | 0 | 23830 | 0.0 | 0 | 0 |
7 | 90 | 50001 | 20190102 | 500 | 5690 | 0 | 0 | 0.0 | 0 | 0 |
8 | 90 | 50001 | 20190103 | 500 | 5590 | 0 | 0 | 0.0 | 0 | 0 |
9 | 90 | 50001 | 20190104 | 500 | 5290 | 0 | 0 | 0.0 | 0 | 0 |
Last rows
Index | 권역 | 하수처리시설명 | 처리일자 | 관리단 | 탈수슬러지처리량 | 액상슬러지처리량 | 탈수슬러지함수율 | 협잡물처리량 | 협잡물함수율 | 액상슬러지함수율 |
---|---|---|---|---|---|---|---|---|---|---|
90 | 91 | 60001 | 20190109 | 600 | 2308 | 0 | 0 | 83.5 | 0 | 0 |
91 | 91 | 60001 | 20190119 | 600 | 2774 | 0 | 0 | 83.3 | 0 | 0 |
92 | 91 | 60001 | 20190113 | 600 | 2191 | 0 | 0 | 83.2 | 0 | 0 |
93 | 91 | 60001 | 20190114 | 600 | 2052 | 0 | 0 | 83.3 | 0 | 0 |
94 | 91 | 60001 | 20190103 | 600 | 2128 | 0 | 0 | 83.3 | 0 | 0 |
95 | 91 | 60001 | 20190107 | 600 | 2090 | 0 | 0 | 83.3 | 0 | 0 |
96 | 91 | 40001 | 20190131 | 400 | 43020 | 0 | 0 | 0.0 | 0 | 0 |
97 | 91 | 60001 | 20190120 | 600 | 2774 | 0 | 0 | 83.3 | 0 | 0 |
98 | 91 | 60001 | 20190115 | 600 | 2166 | 0 | 0 | 83.3 | 0 | 0 |
99 | 91 | 60001 | 20190123 | 600 | 2622 | 0 | 0 | 83.3 | 0 | 0 |