Dataset statistics
Number of variables | 10 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 8.9 KiB |
Average record size in memory | 91.3 B |
Variable types
Categorical | 6 |
---|---|
Numeric | 4 |
Dataset
Description | Sample data |
---|---|
Author | Korea Water Resources Corporation (한국수자원공사) |
URL | https://www.bigdata-environment.kr/user/data_market/detail.do?id=c6697020-38bb-11ea-be28-4fa0eb812a46 |
액상슬러지처리량 has constant value "0" | Constant |
협잡물함수율 has constant value "0" | Constant |
액상슬러지함수율 has constant value "0" | Constant |
권역 is highly overall correlated with 탈수슬러지처리량 and 3 other fields | High correlation |
하수처리시설명 is highly overall correlated with 탈수슬러지처리량 and 4 other fields | High correlation |
관리단 is highly overall correlated with 탈수슬러지처리량 and 3 other fields | High correlation |
탈수슬러지처리량 is highly overall correlated with 탈수슬러지함수율 and 3 other fields | High correlation |
탈수슬러지함수율 is highly overall correlated with 탈수슬러지처리량 and 1 other fields | High correlation |
협잡물처리량 is highly overall correlated with 권역 and 2 other fields | High correlation |
권역 is highly imbalanced (56.4%) | Imbalance |
관리단 is highly imbalanced (56.4%) | Imbalance |
탈수슬러지처리량 has 39 (39.0%) zeros | Zeros |
탈수슬러지함수율 has 61 (61.0%) zeros | Zeros |
협잡물처리량 has 91 (91.0%) zeros | Zeros |
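The 56.4% imbalance figure reported for 권역 and 관리단 can be reproduced from their class counts (91 and 9). The sketch below assumes the score is one minus the Shannon entropy of the class distribution, normalized by log2 of the number of classes. That formula is inferred from the numbers in this report rather than quoted from it, and the function name `imbalance_score` is hypothetical.

```python
import math

def imbalance_score(counts):
    """Return 1 - normalized Shannon entropy.

    0 means perfectly balanced classes; the score approaches 1 as one
    class dominates. The formula is an assumption, not taken from the report.
    """
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0  # assumption: a single-class column gets score 0 here
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1.0 - entropy / math.log2(n_classes)

# Class counts of 권역 (values 90 and 91) from the Common Values table.
score = imbalance_score([91, 9])
print(f"{score:.1%}")
```

With counts of 91 and 9 this evaluates to roughly 0.5635, which rounds to the 56.4% shown in the alerts.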
Reproduction
Analysis started | 2023-12-10 13:03:26.942578 |
---|---|
Analysis finished | 2023-12-10 13:03:28.892968 |
Duration | 1.95 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
권역
Categorical
HIGH CORRELATION, IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 90 |
---|---|
2nd row | 90 |
3rd row | 90 |
4th row | 90 |
5th row | 90 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
90 | 91 | 91.0% |
91 | 9 | 9.0% |
하수처리시설명
Categorical
HIGH CORRELATION
Distinct | 4 |
---|---|
Distinct (%) | 4.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 5 |
Min length | 5 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 50003 |
---|---|
2nd row | 50001 |
3rd row | 50001 |
4th row | 50003 |
5th row | 50001 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
50001 | 52 | 52.0% |
50002 | 26 | 26.0% |
50003 | 13 | 13.0% |
60001 | 9 | 9.0% |
처리일자
Real number (ℝ)
Distinct | 30 |
---|---|
Distinct (%) | 30.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20190416 |
Minimum | 20190401 |
---|---|
Maximum | 20190430 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 20190401 |
---|---|
5-th percentile | 20190402 |
Q1 | 20190409 |
median | 20190416 |
Q3 | 20190424 |
95-th percentile | 20190429 |
Maximum | 20190430 |
Range | 29 |
Interquartile range (IQR) | 15 |
Descriptive statistics
Standard deviation | 8.7572135 |
---|---|
Coefficient of variation (CV) | 4.337312 × 10⁻⁷ |
Kurtosis | -1.150563 |
Mean | 20190416 |
Median Absolute Deviation (MAD) | 7.5 |
Skewness | -0.12129166 |
Sum | 2.0190416 × 10⁹ |
Variance | 76.688788 |
Monotonicity | Not monotonic |
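처리일자 is profiled as a real number, but its values look like calendar dates packed as YYYYMMDD integers, so statistics such as the mean and the coefficient of variation are of limited use for it. The sketch below assumes that encoding and converts the reported minimum and maximum into dates. The helper name `parse_yyyymmdd` is hypothetical.

```python
from datetime import date, datetime

def parse_yyyymmdd(value: int) -> date:
    """Interpret an integer such as 20190401 as the date 2019-04-01."""
    return datetime.strptime(str(value), "%Y%m%d").date()

# Minimum and maximum taken from the quantile statistics above.
first = parse_yyyymmdd(20190401)
last = parse_yyyymmdd(20190430)
span_days = (last - first).days
print(first, last, span_days)
```

Under this reading the data covers 2019-04-01 through 2019-04-30. The span in days equals the reported range of 29 only because all values fall within one month; across a month boundary the integer difference and the day difference would diverge.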
Common Values
Value | Count | Frequency (%) |
---|---|---|
20190429 | 5 | 5.0% |
20190415 | 5 | 5.0% |
20190422 | 5 | 5.0% |
20190419 | 5 | 5.0% |
20190424 | 5 | 5.0% |
20190430 | 4 | 4.0% |
20190425 | 4 | 4.0% |
20190401 | 4 | 4.0% |
20190417 | 4 | 4.0% |
20190416 | 4 | 4.0% |
Other values (20) | 55 | 55.0% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
20190401 | 4 | 4.0% |
20190402 | 3 | 3.0% |
20190403 | 3 | 3.0% |
20190404 | 4 | 4.0% |
20190405 | 3 | 3.0% |
20190406 | 2 | 2.0% |
20190407 | 2 | 2.0% |
20190408 | 2 | 2.0% |
20190409 | 3 | 3.0% |
20190410 | 4 | 4.0% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
20190430 | 4 | 4.0% |
20190429 | 5 | 5.0% |
20190428 | 2 | 2.0% |
20190427 | 3 | 3.0% |
20190426 | 3 | 3.0% |
20190425 | 4 | 4.0% |
20190424 | 5 | 5.0% |
20190423 | 3 | 3.0% |
20190422 | 5 | 5.0% |
20190421 | 2 | 2.0% |
관리단
Categorical
HIGH CORRELATION, IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 500 |
---|---|
2nd row | 500 |
3rd row | 500 |
4th row | 500 |
5th row | 500 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
500 | 91 | 91.0% |
600 | 9 | 9.0% |
탈수슬러지처리량
Real number (ℝ)
HIGH CORRELATION, ZEROS
Distinct | 57 |
---|---|
Distinct (%) | 57.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3410.45 |
Minimum | 0 |
---|---|
Maximum | 11264 |
Zeros | 39 |
Zeros (%) | 39.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 4715 |
Q3 | 5565 |
95-th percentile | 6954.5 |
Maximum | 11264 |
Range | 11264 |
Interquartile range (IQR) | 5565 |
Descriptive statistics
Standard deviation | 2981.2539 |
---|---|
Coefficient of variation (CV) | 0.87415266 |
Kurtosis | -0.87596746 |
Mean | 3410.45 |
Median Absolute Deviation (MAD) | 1550 |
Skewness | 0.14249652 |
Sum | 341045 |
Variance | 8887875 |
Monotonicity | Not monotonic |
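The summary figures for 탈수슬러지처리량 are tied together by simple identities: mean = sum / n, CV = standard deviation / mean, variance = standard deviation squared, IQR = Q3 - Q1, and range = maximum - minimum. The sketch below recomputes the derived values from the reported inputs. All numbers are copied from the tables above, and the variable names are illustrative only.

```python
import math

# Values copied from the report for 탈수슬러지처리량.
n = 100
total = 341045
std = 2981.2539
q1, q3 = 0, 5565
minimum, maximum = 0, 11264

mean = total / n            # reported: 3410.45
cv = std / mean             # reported: 0.87415266
variance = std ** 2         # reported: 8887875
iqr = q3 - q1               # reported: 5565
value_range = maximum - minimum  # reported: 11264

print(mean, round(cv, 8), round(variance), iqr, value_range)
```

Because Q1 is 0 (39% of the rows are zeros), the IQR collapses to Q3 itself.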
Common Values
Value | Count | Frequency (%) |
---|---|---|
0 | 39 | 39.0% |
5860 | 2 | 2.0% |
5370 | 2 | 2.0% |
5000 | 2 | 2.0% |
5830 | 2 | 2.0% |
5040 | 2 | 2.0% |
5960 | 1 | 1.0% |
4620 | 1 | 1.0% |
5170 | 1 | 1.0% |
5560 | 1 | 1.0% |
Other values (47) | 47 | 47.0% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
0 | 39 | 39.0% |
3115 | 1 | 1.0% |
3269 | 1 | 1.0% |
3423 | 1 | 1.0% |
3496 | 1 | 1.0% |
3648 | 1 | 1.0% |
4430 | 1 | 1.0% |
4500 | 1 | 1.0% |
4530 | 1 | 1.0% |
4620 | 1 | 1.0% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
11264 | 1 | 1.0% |
10427 | 1 | 1.0% |
10255 | 1 | 1.0% |
9258 | 1 | 1.0% |
7040 | 1 | 1.0% |
6950 | 1 | 1.0% |
6540 | 1 | 1.0% |
6390 | 1 | 1.0% |
6360 | 1 | 1.0% |
6270 | 1 | 1.0% |
액상슬러지처리량
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
0 | 100 | 100.0% |
탈수슬러지함수율
Real number (ℝ)
HIGH CORRELATION, ZEROS
Distinct | 28 |
---|---|
Distinct (%) | 28.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6513.3 |
Minimum | 0 |
---|---|
Maximum | 42180 |
Zeros | 61 |
Zeros (%) | 61.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 10917.5 |
95-th percentile | 23890.5 |
Maximum | 42180 |
Range | 42180 |
Interquartile range (IQR) | 10917.5 |
Descriptive statistics
Standard deviation | 9600.4945 |
---|---|
Coefficient of variation (CV) | 1.4739832 |
Kurtosis | 1.8303463 |
Mean | 6513.3 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 1.4923631 |
Sum | 651330 |
Variance | 92169495 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
---|---|---|
0 | 61 | 61.0% |
11930 | 5 | 5.0% |
10530 | 3 | 3.0% |
11940 | 3 | 3.0% |
10520 | 2 | 2.0% |
10570 | 2 | 2.0% |
10540 | 2 | 2.0% |
10510 | 2 | 2.0% |
31650 | 1 | 1.0% |
42180 | 1 | 1.0% |
Other values (18) | 18 | 18.0% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
0 | 61 | 61.0% |
10510 | 2 | 2.0% |
10520 | 2 | 2.0% |
10530 | 3 | 3.0% |
10540 | 2 | 2.0% |
10550 | 1 | 1.0% |
10560 | 1 | 1.0% |
10570 | 2 | 2.0% |
10580 | 1 | 1.0% |
11930 | 5 | 5.0% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
42180 | 1 | 1.0% |
35820 | 1 | 1.0% |
31650 | 1 | 1.0% |
31630 | 1 | 1.0% |
23900 | 1 | 1.0% |
23890 | 1 | 1.0% |
23880 | 1 | 1.0% |
21130 | 1 | 1.0% |
21120 | 1 | 1.0% |
21100 | 1 | 1.0% |
협잡물처리량
Real number (ℝ)
HIGH CORRELATION, ZEROS
Distinct | 6 |
---|---|
Distinct (%) | 6.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 7.503 |
Minimum | 0 |
---|---|
Maximum | 83.7 |
Zeros | 91 |
Zeros (%) | 91.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 83.4 |
Maximum | 83.7 |
Range | 83.7 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 23.978342 |
---|---|
Coefficient of variation (CV) | 3.195834 |
Kurtosis | 6.5950441 |
Mean | 7.503 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 2.9091897 |
Sum | 750.3 |
Variance | 574.9609 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
---|---|---|
0.0 | 91 | 91.0% |
83.5 | 3 | 3.0% |
83.4 | 2 | 2.0% |
83.3 | 2 | 2.0% |
82.7 | 1 | 1.0% |
83.7 | 1 | 1.0% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
0.0 | 91 | 91.0% |
82.7 | 1 | 1.0% |
83.3 | 2 | 2.0% |
83.4 | 2 | 2.0% |
83.5 | 3 | 3.0% |
83.7 | 1 | 1.0% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
83.7 | 1 | 1.0% |
83.5 | 3 | 3.0% |
83.4 | 2 | 2.0% |
83.3 | 2 | 2.0% |
82.7 | 1 | 1.0% |
0.0 | 91 | 91.0% |
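Because 협잡물처리량 has only six distinct values, the whole column can be rebuilt from the value counts listed above and its descriptive statistics recomputed. The sketch below uses only the Python standard library and assumes the reported variance is the sample variance (n - 1 denominator), which is what reproduces the reported figure. The name `value_counts` is illustrative.

```python
import statistics

# Value -> count, copied from the value tables for 협잡물처리량.
value_counts = {0.0: 91, 83.5: 3, 83.4: 2, 83.3: 2, 82.7: 1, 83.7: 1}

# Expand the counts back into a flat list of 100 observations.
values = [v for v, count in value_counts.items() for _ in range(count)]

mean = statistics.mean(values)          # reported: 7.503
variance = statistics.variance(values)  # sample variance; reported: 574.9609
std = statistics.stdev(values)          # reported: 23.978342
zeros_pct = 100 * value_counts[0.0] / len(values)  # reported: 91.0%

print(len(values), round(mean, 3), round(variance, 4), round(std, 6), zeros_pct)
```

With 91 of the 100 rows at zero, the median and every quartile are 0, so the mean and standard deviation are driven entirely by the nine non-zero rows.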
협잡물함수율
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
0 | 100 | 100.0% |
액상슬러지함수율
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
0 | 100 | 100.0% |
Variable | 권역 | 하수처리시설명 | 처리일자 | 관리단 | 탈수슬러지처리량 | 탈수슬러지함수율 | 협잡물처리량 |
---|---|---|---|---|---|---|---|
권역 | 1.000 | 1.000 | 0.000 | 0.995 | 0.991 | 0.055 | 0.995 |
하수처리시설명 | 1.000 | 1.000 | 0.000 | 1.000 | 0.969 | 0.706 | 1.000 |
처리일자 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 |
관리단 | 0.995 | 1.000 | 0.000 | 1.000 | 0.991 | 0.055 | 0.995 |
탈수슬러지처리량 | 0.991 | 0.969 | 0.000 | 0.991 | 1.000 | 0.535 | 0.991 |
탈수슬러지함수율 | 0.055 | 0.706 | 0.000 | 0.055 | 0.535 | 1.000 | 0.055 |
협잡물처리량 | 0.995 | 1.000 | 0.000 | 0.995 | 0.991 | 0.055 | 1.000 |
Variable | 권역 | 하수처리시설명 | 관리단 |
---|---|---|---|
권역 | 1.000 | 0.990 | 0.938 |
하수처리시설명 | 0.990 | 1.000 | 0.990 |
관리단 | 0.938 | 0.990 | 1.000 |
Variable | 처리일자 | 탈수슬러지처리량 | 탈수슬러지함수율 | 협잡물처리량 | 권역 | 하수처리시설명 | 관리단 |
---|---|---|---|---|---|---|---|
처리일자 | 1.000 | 0.028 | 0.062 | 0.100 | 0.000 | 0.000 | 0.000 |
탈수슬러지처리량 | 0.028 | 1.000 | -0.837 | 0.185 | 0.889 | 0.749 | 0.889 |
탈수슬러지함수율 | 0.062 | -0.837 | 1.000 | -0.241 | 0.052 | 0.565 | 0.052 |
협잡물처리량 | 0.100 | 0.185 | -0.241 | 1.000 | 0.938 | 0.990 | 0.938 |
권역 | 0.000 | 0.889 | 0.052 | 0.938 | 1.000 | 0.990 | 0.938 |
하수처리시설명 | 0.000 | 0.749 | 0.565 | 0.990 | 0.990 | 1.000 | 0.990 |
관리단 | 0.000 | 0.889 | 0.052 | 0.938 | 0.938 | 0.990 | 1.000 |
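The "High correlation" alerts near the top of the report can be cross-checked against the matrix directly above. The sketch below counts, for each variable, how many other variables have an absolute coefficient of at least 0.5. That threshold is an assumption rather than something the report states, although it does reproduce the field counts in the alerts (for example, 권역 with four fields and 하수처리시설명 with five).

```python
THRESHOLD = 0.5  # assumption: cutoff for flagging a pair as highly correlated

names = ["처리일자", "탈수슬러지처리량", "탈수슬러지함수율", "협잡물처리량",
         "권역", "하수처리시설명", "관리단"]

# Rows copied from the matrix above, in the same variable order as `names`.
matrix = [
    [1.000, 0.028, 0.062, 0.100, 0.000, 0.000, 0.000],
    [0.028, 1.000, -0.837, 0.185, 0.889, 0.749, 0.889],
    [0.062, -0.837, 1.000, -0.241, 0.052, 0.565, 0.052],
    [0.100, 0.185, -0.241, 1.000, 0.938, 0.990, 0.938],
    [0.000, 0.889, 0.052, 0.938, 1.000, 0.990, 0.938],
    [0.000, 0.749, 0.565, 0.990, 0.990, 1.000, 0.990],
    [0.000, 0.889, 0.052, 0.938, 0.938, 0.990, 1.000],
]

# For each variable, collect the other variables at or above the threshold.
high = {
    name: [other for other, r in zip(names, row)
           if other != name and abs(r) >= THRESHOLD]
    for name, row in zip(names, matrix)
}

for name, others in high.items():
    print(name, len(others), others)
```

처리일자 ends up with no partners, which matches its absence from the alert list.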
First rows
Index | 권역 | 하수처리시설명 | 처리일자 | 관리단 | 탈수슬러지처리량 | 액상슬러지처리량 | 탈수슬러지함수율 | 협잡물처리량 | 협잡물함수율 | 액상슬러지함수율 |
---|---|---|---|---|---|---|---|---|---|---|
0 | 90 | 50003 | 20190430 | 500 | 0 | 0 | 35820 | 0.0 | 0 | 0 |
1 | 90 | 50001 | 20190425 | 500 | 5010 | 0 | 0 | 0.0 | 0 | 0 |
2 | 90 | 50001 | 20190401 | 500 | 4530 | 0 | 0 | 0.0 | 0 | 0 |
3 | 90 | 50003 | 20190429 | 500 | 0 | 0 | 17910 | 0.0 | 0 | 0 |
4 | 90 | 50001 | 20190427 | 500 | 5850 | 0 | 0 | 0.0 | 0 | 0 |
5 | 90 | 50001 | 20190426 | 500 | 5180 | 0 | 0 | 0.0 | 0 | 0 |
6 | 90 | 50001 | 20190402 | 500 | 4900 | 0 | 0 | 0.0 | 0 | 0 |
7 | 90 | 50001 | 20190401 | 500 | 6390 | 0 | 0 | 0.0 | 0 | 0 |
8 | 90 | 50001 | 20190423 | 500 | 5730 | 0 | 0 | 0.0 | 0 | 0 |
9 | 90 | 50001 | 20190430 | 500 | 4780 | 0 | 0 | 0.0 | 0 | 0 |
Last rows
Index | 권역 | 하수처리시설명 | 처리일자 | 관리단 | 탈수슬러지처리량 | 액상슬러지처리량 | 탈수슬러지함수율 | 협잡물처리량 | 협잡물함수율 | 액상슬러지함수율 |
---|---|---|---|---|---|---|---|---|---|---|
90 | 90 | 50001 | 20190425 | 500 | 5930 | 0 | 0 | 0.0 | 0 | 0 |
91 | 91 | 60001 | 20190424 | 600 | 3423 | 0 | 0 | 83.5 | 0 | 0 |
92 | 91 | 60001 | 20190419 | 600 | 9258 | 0 | 0 | 83.4 | 0 | 0 |
93 | 91 | 60001 | 20190406 | 600 | 3269 | 0 | 0 | 83.5 | 0 | 0 |
94 | 91 | 60001 | 20190427 | 600 | 3648 | 0 | 0 | 83.3 | 0 | 0 |
95 | 91 | 60001 | 20190416 | 600 | 3496 | 0 | 0 | 83.3 | 0 | 0 |
96 | 91 | 60001 | 20190413 | 600 | 3115 | 0 | 0 | 83.5 | 0 | 0 |
97 | 91 | 60001 | 20190415 | 600 | 11264 | 0 | 0 | 83.4 | 0 | 0 |
98 | 91 | 60001 | 20190422 | 600 | 10255 | 0 | 0 | 82.7 | 0 | 0 |
99 | 91 | 60001 | 20190429 | 600 | 10427 | 0 | 0 | 83.7 | 0 | 0 |