Dataset statistics
Number of variables | 10 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 8.9 KiB |
Average record size in memory | 91.3 B |
Variable types
Categorical | 4 |
---|---|
Numeric | 6 |
Dataset
Description | Sample data |
---|---|
Author | Korea Water Resources Corporation (한국수자원공사) |
URL | https://www.bigdata-environment.kr/user/data_market/detail.do?id=c6697020-38bb-11ea-be28-4fa0eb812a46 |
협잡물함수율 has constant value "" | Constant |
액상슬러지함수율 has constant value "" | Constant |
권역 is highly overall correlated with 하수처리시설명 and 2 other fields | High correlation |
관리단 is highly overall correlated with 하수처리시설명 and 4 other fields | High correlation |
하수처리시설명 is highly overall correlated with 협잡물처리량 and 2 other fields | High correlation |
탈수슬러지처리량 is highly overall correlated with 탈수슬러지함수율 and 1 other fields | High correlation |
액상슬러지처리량 is highly overall correlated with 관리단 | High correlation |
탈수슬러지함수율 is highly overall correlated with 탈수슬러지처리량 | High correlation |
협잡물처리량 is highly overall correlated with 하수처리시설명 and 2 other fields | High correlation |
관리단 is highly imbalanced (53.4%) | Imbalance |
탈수슬러지처리량 has 32 (32.0%) zeros | Zeros |
액상슬러지처리량 has 94 (94.0%) zeros | Zeros |
탈수슬러지함수율 has 68 (68.0%) zeros | Zeros |
협잡물처리량 has 77 (77.0%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-10 13:03:36.931428 |
---|---|
Analysis finished | 2023-12-10 13:03:40.514179 |
Duration | 3.58 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
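A report of this kind could be regenerated roughly as follows. This is a minimal sketch that assumes the data is available as a CSV file and that `pandas` and `ydata-profiling` are installed. The function name `build_report` and both file paths are hypothetical. It uses the library defaults rather than the `config.json` listed above, so the output may differ in detail.

```python
from pathlib import Path


def build_report(csv_path: str, html_path: str = "report.html") -> Path:
    """Profile a CSV file and write an HTML report; return the output path."""
    # Third-party imports are kept local so the module loads even when
    # the profiling dependencies are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    # Default settings only; the original run's configuration is not reproduced.
    profile = ProfileReport(df, title="Dataset statistics")
    profile.to_file(html_path)
    return Path(html_path)


# Example call (the file name is a placeholder, not the dataset's real name):
# build_report("sample.csv")
```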
권역
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 90 |
---|---|
2nd row | 90 |
3rd row | 90 |
4th row | 90 |
5th row | 90 |
Common Values
Value | Count | Frequency (%) |
90 | 75 | 75.0% |
91 | 25 | 25.0% |
하수처리시설명
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 6 |
---|---|
Distinct (%) | 6.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 52401.46 |
Minimum | 40001 |
---|---|
Maximum | 70001 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 40001 |
---|---|
5-th percentile | 50001 |
Q1 | 50001 |
median | 50002 |
Q3 | 50003 |
95-th percentile | 60001 |
Maximum | 70001 |
Range | 30000 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 4739.4532 |
---|---|
Coefficient of variation (CV) | 0.09044506 |
Kurtosis | 1.1352198 |
Mean | 52401.46 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 1.2201454 |
Sum | 5240146 |
Variance | 22462417 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
50001 | 43 | 43.0% |
60001 | 23 | 23.0% |
50002 | 18 | 18.0% |
50003 | 14 | 14.0% |
70001 | 1 | 1.0% |
40001 | 1 | 1.0% |
Value | Count | Frequency (%) |
40001 | 1 | 1.0% |
50001 | 43 | 43.0% |
50002 | 18 | 18.0% |
50003 | 14 | 14.0% |
60001 | 23 | 23.0% |
70001 | 1 | 1.0% |
Value | Count | Frequency (%) |
70001 | 1 | 1.0% |
60001 | 23 | 23.0% |
50003 | 14 | 14.0% |
50002 | 18 | 18.0% |
50001 | 43 | 43.0% |
40001 | 1 | 1.0% |
처리일자
Real number (ℝ)
Distinct | 28 |
---|---|
Distinct (%) | 28.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20190215 |
Minimum | 20190201 |
---|---|
Maximum | 20190228 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 20190201 |
---|---|
5-th percentile | 20190202 |
Q1 | 20190208 |
median | 20190215 |
Q3 | 20190222 |
95-th percentile | 20190228 |
Maximum | 20190228 |
Range | 27 |
Interquartile range (IQR) | 14 |
Descriptive statistics
Standard deviation | 8.1786307 |
---|---|
Coefficient of variation (CV) | 4.0507892 × 10⁻⁷ |
Kurtosis | -1.1000885 |
Mean | 20190215 |
Median Absolute Deviation (MAD) | 7 |
Skewness | -0.10272856 |
Sum | 2.0190215 × 10⁹ |
Variance | 66.89 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20190228 | 7 | 7.0% |
20190201 | 5 | 5.0% |
20190214 | 5 | 5.0% |
20190212 | 5 | 5.0% |
20190218 | 5 | 5.0% |
20190222 | 5 | 5.0% |
20190208 | 4 | 4.0% |
20190213 | 4 | 4.0% |
20190219 | 4 | 4.0% |
20190206 | 4 | 4.0% |
Other values (18) | 52 | 52.0% |
Value | Count | Frequency (%) |
20190201 | 5 | 5.0% |
20190202 | 2 | 2.0% |
20190203 | 2 | 2.0% |
20190204 | 4 | 4.0% |
20190205 | 1 | 1.0% |
20190206 | 4 | 4.0% |
20190207 | 4 | 4.0% |
20190208 | 4 | 4.0% |
20190209 | 2 | 2.0% |
20190210 | 2 | 2.0% |
Value | Count | Frequency (%) |
20190228 | 7 | 7.0% |
20190227 | 4 | 4.0% |
20190226 | 4 | 4.0% |
20190225 | 3 | 3.0% |
20190224 | 1 | 1.0% |
20190223 | 3 | 3.0% |
20190222 | 5 | 5.0% |
20190221 | 4 | 4.0% |
20190220 | 4 | 4.0% |
20190219 | 4 | 4.0% |
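처리일자 is stored as an integer in YYYYMMDD form, so the profiler treats it as a real number and statistics such as the mean or coefficient of variation have no calendar meaning. A minimal sketch of converting such values to dates before analysis, using only the standard library (the helper name `parse_yyyymmdd` is hypothetical, and the three sample values are taken from the frequency tables above):

```python
from datetime import date, datetime


def parse_yyyymmdd(value: int) -> date:
    """Convert an integer such as 20190201 into a datetime.date."""
    return datetime.strptime(str(value), "%Y%m%d").date()


# A few values that appear in the frequency tables of this report.
raw_values = [20190201, 20190214, 20190228]
dates = [parse_yyyymmdd(v) for v in raw_values]

# Calendar span between the earliest and latest date, in days.
span_days = (max(dates) - min(dates)).days
print(min(dates), max(dates), span_days)
```

Because all values fall within February 2019, the span in days happens to equal the numeric range reported above; for data crossing a month boundary the two would differ.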
관리단
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | 4.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 2 |
---|---|
Unique (%) | 2.0% |
Sample
1st row | 500 |
---|---|
2nd row | 500 |
3rd row | 500 |
4th row | 500 |
5th row | 500 |
Common Values
Value | Count | Frequency (%) |
500 | 75 | 75.0% |
600 | 23 | 23.0% |
700 | 1 | 1.0% |
400 | 1 | 1.0% |
탈수슬러지처리량
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 61 |
---|---|
Distinct (%) | 61.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 10521.83 |
Minimum | 0 |
---|---|
Maximum | 653610 |
Zeros | 32 |
Zeros (%) | 32.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 3315.5 |
Q3 | 5920 |
95-th percentile | 9673.05 |
Maximum | 653610 |
Range | 653610 |
Interquartile range (IQR) | 5920 |
Descriptive statistics
Standard deviation | 65248.078 |
---|---|
Coefficient of variation (CV) | 6.2012101 |
Kurtosis | 98.184446 |
Mean | 10521.83 |
Median Absolute Deviation (MAD) | 3184.5 |
Skewness | 9.8703502 |
Sum | 1052183 |
Variance | 4.2573117 × 10⁹ |
Monotonicity | Not monotonic |
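The derived figures for 탈수슬러지처리량 can be cross-checked from the reported mean, standard deviation, and quantiles. The sketch below assumes the usual definitions (CV = standard deviation / mean, variance = standard deviation squared, IQR = Q3 − Q1, range = maximum − minimum). All inputs are copied from the tables above, and the variable names are illustrative.

```python
import math

# Values copied from the statistics tables above.
mean = 10521.83
std = 65248.078
q1, q3 = 0.0, 5920.0
minimum, maximum = 0.0, 653610.0

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance from the standard deviation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range

# Compare against the figures printed in the report.
print(math.isclose(cv, 6.2012101, abs_tol=1e-6))
print(math.isclose(variance, 4.2573117e9, rel_tol=1e-6))
print(iqr == 5920.0, value_range == 653610.0)
```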
Value | Count | Frequency (%) |
0 | 32 | 32.0% |
4580 | 3 | 3.0% |
6870 | 3 | 3.0% |
2561 | 2 | 2.0% |
2752 | 2 | 2.0% |
5920 | 2 | 2.0% |
2615 | 2 | 2.0% |
2523 | 1 | 1.0% |
2923 | 1 | 1.0% |
4320 | 1 | 1.0% |
Other values (51) | 51 | 51.0% |
Value | Count | Frequency (%) |
0 | 32 | 32.0% |
2485 | 1 | 1.0% |
2508 | 1 | 1.0% |
2523 | 1 | 1.0% |
2561 | 2 | 2.0% |
2600 | 1 | 1.0% |
2615 | 2 | 2.0% |
2692 | 1 | 1.0% |
2752 | 2 | 2.0% |
2791 | 1 | 1.0% |
Value | Count | Frequency (%) |
653610 | 1 | 1.0% |
57500 | 1 | 1.0% |
10841 | 1 | 1.0% |
10342 | 1 | 1.0% |
10339 | 1 | 1.0% |
9638 | 1 | 1.0% |
8792 | 1 | 1.0% |
7040 | 1 | 1.0% |
6910 | 1 | 1.0% |
6900 | 1 | 1.0% |
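The two largest values of 탈수슬러지처리량 are far above the 95th percentile (9673.05), which is consistent with the high skewness and kurtosis reported above. One common way to flag such values is Tukey's rule (Q3 + 1.5 × IQR). The report itself does not apply this rule; the sketch below uses it only as an illustration, with the quartiles and the ten largest values copied from the tables above.

```python
# Quartiles reported for this variable.
q1, q3 = 0.0, 5920.0
iqr = q3 - q1

# Tukey's upper fence: values above it are treated as potential outliers.
upper_fence = q3 + 1.5 * iqr

# The ten largest values listed in the report.
largest_values = [653610, 57500, 10841, 10342, 10339, 9638, 8792, 7040, 6910, 6900]

flagged = [v for v in largest_values if v > upper_fence]
print(upper_fence, flagged)
```

Under this rule only 653610 and 57500 are flagged. Whether they are data-entry errors or genuine measurements cannot be determined from the report alone.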
액상슬러지처리량
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 7 |
---|---|
Distinct (%) | 7.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 236.7 |
Minimum | 0 |
---|---|
Maximum | 12780 |
Zeros | 94 |
Zeros (%) | 94.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 1648.5 |
Maximum | 12780 |
Range | 12780 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 1355.7599 |
---|---|
Coefficient of variation (CV) | 5.7277563 |
Kurtosis | 75.801647 |
Mean | 236.7 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 8.3208935 |
Sum | 23670 |
Variance | 1838085 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 94 | 94.0% |
2460 | 1 | 1.0% |
1630 | 1 | 1.0% |
2490 | 1 | 1.0% |
2000 | 1 | 1.0% |
2310 | 1 | 1.0% |
12780 | 1 | 1.0% |
Value | Count | Frequency (%) |
0 | 94 | 94.0% |
1630 | 1 | 1.0% |
2000 | 1 | 1.0% |
2310 | 1 | 1.0% |
2460 | 1 | 1.0% |
2490 | 1 | 1.0% |
12780 | 1 | 1.0% |
Value | Count | Frequency (%) |
12780 | 1 | 1.0% |
2490 | 1 | 1.0% |
2460 | 1 | 1.0% |
2310 | 1 | 1.0% |
2000 | 1 | 1.0% |
1630 | 1 | 1.0% |
0 | 94 | 94.0% |
탈수슬러지함수율
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 31 |
---|---|
Distinct (%) | 31.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5953.4 |
Minimum | 0 |
---|---|
Maximum | 39010 |
Zeros | 68 |
Zeros (%) | 68.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 10535 |
95-th percentile | 26001 |
Maximum | 39010 |
Range | 39010 |
Interquartile range (IQR) | 10535 |
Descriptive statistics
Standard deviation | 9710.1155 |
---|---|
Coefficient of variation (CV) | 1.6310202 |
Kurtosis | 0.91219055 |
Mean | 5953.4 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 1.4216827 |
Sum | 595340 |
Variance | 94286344 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 68 | 68.0% |
10510 | 2 | 2.0% |
11950 | 2 | 2.0% |
21060 | 1 | 1.0% |
10530 | 1 | 1.0% |
10550 | 1 | 1.0% |
21020 | 1 | 1.0% |
13020 | 1 | 1.0% |
26000 | 1 | 1.0% |
21010 | 1 | 1.0% |
Other values (21) | 21 | 21.0% |
Value | Count | Frequency (%) |
0 | 68 | 68.0% |
5970 | 1 | 1.0% |
10480 | 1 | 1.0% |
10490 | 1 | 1.0% |
10510 | 2 | 2.0% |
10520 | 1 | 1.0% |
10530 | 1 | 1.0% |
10550 | 1 | 1.0% |
10560 | 1 | 1.0% |
11950 | 2 | 2.0% |
Value | Count | Frequency (%) |
39010 | 1 | 1.0% |
31760 | 1 | 1.0% |
31550 | 1 | 1.0% |
26050 | 1 | 1.0% |
26020 | 1 | 1.0% |
26000 | 1 | 1.0% |
23830 | 1 | 1.0% |
21130 | 1 | 1.0% |
21090 | 1 | 1.0% |
21080 | 1 | 1.0% |
협잡물처리량
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 8 |
---|---|
Distinct (%) | 8.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 19.16 |
Minimum | 0 |
---|---|
Maximum | 83.5 |
Zeros | 77 |
Zeros (%) | 77.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 83.5 |
Maximum | 83.5 |
Range | 83.5 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 35.23404 |
---|---|
Coefficient of variation (CV) | 1.8389374 |
Kurtosis | -0.30909765 |
Mean | 19.16 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 1.3028461 |
Sum | 1916 |
Variance | 1241.4376 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0 | 77 | 77.0% |
83.4 | 9 | 9.0% |
83.5 | 7 | 7.0% |
83.3 | 3 | 3.0% |
82.6 | 1 | 1.0% |
82.9 | 1 | 1.0% |
83.0 | 1 | 1.0% |
82.5 | 1 | 1.0% |
Value | Count | Frequency (%) |
0.0 | 77 | 77.0% |
82.5 | 1 | 1.0% |
82.6 | 1 | 1.0% |
82.9 | 1 | 1.0% |
83.0 | 1 | 1.0% |
83.3 | 3 | 3.0% |
83.4 | 9 | 9.0% |
83.5 | 7 | 7.0% |
Value | Count | Frequency (%) |
83.5 | 7 | 7.0% |
83.4 | 9 | 9.0% |
83.3 | 3 | 3.0% |
83.0 | 1 | 1.0% |
82.9 | 1 | 1.0% |
82.6 | 1 | 1.0% |
82.5 | 1 | 1.0% |
0.0 | 77 | 77.0% |
협잡물함수율
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 100 | 100.0% |
액상슬러지함수율
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 100 | 100.0% |
Variable | 권역 | 하수처리시설명 | 처리일자 | 관리단 | 탈수슬러지처리량 | 액상슬러지처리량 | 탈수슬러지함수율 | 협잡물처리량 |
---|---|---|---|---|---|---|---|---|
권역 | 1.000 | 1.000 | 0.000 | 1.000 | 0.000 | 0.100 | 0.289 | 0.992 |
하수처리시설명 | 1.000 | 1.000 | 0.000 | 1.000 | 1.000 | 0.942 | 0.000 | 1.000 |
처리일자 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
관리단 | 1.000 | 1.000 | 0.000 | 1.000 | 1.000 | 0.673 | 0.000 | 1.000 |
탈수슬러지처리량 | 0.000 | 1.000 | 0.000 | 1.000 | 1.000 | 1.000 | 0.000 | 0.000 |
액상슬러지처리량 | 0.100 | 0.942 | 0.000 | 0.673 | 1.000 | 1.000 | 0.000 | 0.000 |
탈수슬러지함수율 | 0.289 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.257 |
협잡물처리량 | 0.992 | 1.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.257 | 1.000 |
Variable | 권역 | 관리단 |
---|---|---|
권역 | 1.000 | 0.990 |
관리단 | 0.990 | 1.000 |
Variable | 하수처리시설명 | 처리일자 | 탈수슬러지처리량 | 액상슬러지처리량 | 탈수슬러지함수율 | 협잡물처리량 | 권역 | 관리단 |
---|---|---|---|---|---|---|---|---|
하수처리시설명 | 1.000 | -0.071 | -0.435 | -0.129 | 0.254 | 0.741 | 0.990 | 1.000 |
처리일자 | -0.071 | 1.000 | 0.042 | 0.222 | 0.026 | -0.143 | 0.000 | 0.000 |
탈수슬러지처리량 | -0.435 | 0.042 | 1.000 | 0.229 | -0.802 | 0.017 | 0.000 | 0.990 |
액상슬러지처리량 | -0.129 | 0.222 | 0.229 | 1.000 | -0.169 | -0.137 | 0.164 | 0.699 |
탈수슬러지함수율 | 0.254 | 0.026 | -0.802 | -0.169 | 1.000 | -0.362 | 0.277 | 0.000 |
협잡물처리량 | 0.741 | -0.143 | 0.017 | -0.137 | -0.362 | 1.000 | 0.918 | 0.990 |
권역 | 0.990 | 0.000 | 0.000 | 0.164 | 0.277 | 0.918 | 1.000 | 0.990 |
관리단 | 1.000 | 0.000 | 0.990 | 0.699 | 0.000 | 0.990 | 0.990 | 1.000 |
Index | 권역 | 하수처리시설명 | 처리일자 | 관리단 | 탈수슬러지처리량 | 액상슬러지처리량 | 탈수슬러지함수율 | 협잡물처리량 | 협잡물함수율 | 액상슬러지함수율 |
---|---|---|---|---|---|---|---|---|---|---|
0 | 90 | 50002 | 20190207 | 500 | 0 | 0 | 5970 | 0.0 | 0 | 0 |
1 | 90 | 50001 | 20190219 | 500 | 5420 | 0 | 0 | 0.0 | 0 | 0 |
2 | 90 | 50001 | 20190219 | 500 | 4880 | 0 | 0 | 0.0 | 0 | 0 |
3 | 90 | 50001 | 20190220 | 500 | 6270 | 0 | 0 | 0.0 | 0 | 0 |
4 | 90 | 50001 | 20190220 | 500 | 4580 | 0 | 0 | 0.0 | 0 | 0 |
5 | 90 | 50001 | 20190221 | 500 | 6750 | 0 | 0 | 0.0 | 0 | 0 |
6 | 90 | 50001 | 20190224 | 500 | 6620 | 0 | 0 | 0.0 | 0 | 0 |
7 | 90 | 50003 | 20190228 | 500 | 0 | 0 | 19490 | 0.0 | 0 | 0 |
8 | 90 | 50002 | 20190223 | 500 | 0 | 0 | 21080 | 0.0 | 0 | 0 |
9 | 90 | 50002 | 20190226 | 500 | 0 | 0 | 10490 | 0.0 | 0 | 0 |
Index | 권역 | 하수처리시설명 | 처리일자 | 관리단 | 탈수슬러지처리량 | 액상슬러지처리량 | 탈수슬러지함수율 | 협잡물처리량 | 협잡물함수율 | 액상슬러지함수율 |
---|---|---|---|---|---|---|---|---|---|---|
90 | 91 | 60001 | 20190226 | 600 | 2961 | 0 | 0 | 83.5 | 0 | 0 |
91 | 91 | 60001 | 20190228 | 600 | 8792 | 0 | 0 | 83.0 | 0 | 0 |
92 | 91 | 60001 | 20190223 | 600 | 2523 | 0 | 0 | 83.4 | 0 | 0 |
93 | 91 | 60001 | 20190221 | 600 | 2561 | 0 | 0 | 83.4 | 0 | 0 |
94 | 91 | 60001 | 20190219 | 600 | 2485 | 0 | 0 | 83.4 | 0 | 0 |
95 | 91 | 60001 | 20190203 | 600 | 2752 | 0 | 0 | 83.4 | 0 | 0 |
96 | 91 | 60001 | 20190217 | 600 | 2561 | 0 | 0 | 83.4 | 0 | 0 |
97 | 91 | 60001 | 20190218 | 600 | 10339 | 0 | 0 | 82.5 | 0 | 0 |
98 | 91 | 60001 | 20190202 | 600 | 2615 | 0 | 0 | 83.5 | 0 | 0 |
99 | 91 | 40001 | 20190228 | 400 | 57500 | 0 | 0 | 0.0 | 0 | 0 |