Dataset statistics
Number of variables | 10 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 8.9 KiB |
Average record size in memory | 91.3 B |
Variable types
Categorical | 6 |
---|---|
Numeric | 4 |
Dataset
Description | Sample data |
---|---|
Author | 한국수자원공사 (Korea Water Resources Corporation, K-water) |
URL | https://www.bigdata-environment.kr/user/data_market/detail.do?id=c6697020-38bb-11ea-be28-4fa0eb812a46 |
Alerts
협잡물함수율 has constant value "0" | Constant |
액상슬러지함수율 has constant value "0" | Constant |
권역 is highly overall correlated with 탈수슬러지처리량 and 3 other fields | High correlation |
하수처리시설명 is highly overall correlated with 탈수슬러지처리량 and 4 other fields | High correlation |
관리단 is highly overall correlated with 탈수슬러지처리량 and 3 other fields | High correlation |
협잡물처리량 is highly overall correlated with 탈수슬러지처리량 and 3 other fields | High correlation |
탈수슬러지처리량 is highly overall correlated with 탈수슬러지함수율 and 4 other fields | High correlation |
탈수슬러지함수율 is highly overall correlated with 탈수슬러지처리량 and 1 other field | High correlation |
권역 is highly imbalanced (63.4%) | Imbalance |
관리단 is highly imbalanced (63.4%) | Imbalance |
협잡물처리량 is highly imbalanced (73.1%) | Imbalance |
탈수슬러지처리량 has 44 (44.0%) zeros | Zeros |
액상슬러지처리량 has 91 (91.0%) zeros | Zeros |
탈수슬러지함수율 has 56 (56.0%) zeros | Zeros |
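The imbalance percentages reported above are consistent with a normalized-entropy score, 1 - H / log2(k), where H is the Shannon entropy of the category frequencies and k is the number of distinct categories. The sketch below is an assumption about how such a score can be derived, not a copy of the profiler's code; `imbalance_score` is a hypothetical helper name. The counts are taken from the Common Values tables for 권역 and 협잡물처리량.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k) for a list of category counts (hypothetical helper)."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # a single category has no class balance to measure
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)  # Shannon entropy in bits
    return 1.0 - entropy / math.log2(k)

region = imbalance_score([93, 7])      # 권역: value 90 (93 rows), value 91 (7 rows)
debris = imbalance_score([93, 5, 2])   # 협잡물처리량: 0.0, 83.5, 83.4
print(f"{region:.1%} {debris:.1%}")    # prints "63.4% 73.1%"
```

A score of 0 means perfectly balanced categories and a score near 1 means almost all rows fall into one category.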
Reproduction
Analysis started | 2023-12-10 13:03:16.126413 |
---|---|
Analysis finished | 2023-12-10 13:03:18.222502 |
Duration | 2.1 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
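A report of this kind can be regenerated along the following lines. This is a minimal sketch, assuming the sample data is available as a local CSV file; the function name `build_report`, the file names, and the report title are hypothetical and do not come from the original run.

```python
def build_report(csv_path, html_path="report.html"):
    """Profile a CSV file and write an HTML report (paths are hypothetical)."""
    # Imported lazily so this module still loads where the packages are absent.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)                           # load the sample data
    report = ProfileReport(df, title="Dataset profile")  # compute the statistics
    report.to_file(html_path)                            # render to HTML
    return html_path
```

Calling `build_report("sludge_sample.csv")` would write `report.html` next to the script; the exact numbers may differ from this report if the library version or configuration differs.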
권역
Categorical
Alerts: High correlation, Imbalance
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
90 | 93 |
---|---|
91 | 7 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 90 |
---|---|
2nd row | 90 |
3rd row | 90 |
4th row | 90 |
5th row | 90 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
90 | 93 | 93.0% |
91 | 7 | 7.0% |
하수처리시설명
Categorical
Alerts: High correlation
Distinct | 4 |
---|---|
Distinct (%) | 4.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
50001 | 49 |
---|---|
50002 | 25 |
50003 | 19 |
60001 | 7 |
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 5 |
Min length | 5 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 50003 |
---|---|
2nd row | 50001 |
3rd row | 50003 |
4th row | 50003 |
5th row | 50001 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
50001 | 49 | 49.0% |
50002 | 25 | 25.0% |
50003 | 19 | 19.0% |
60001 | 7 | 7.0% |
처리일자
Real number (ℝ)
Distinct | 30 |
---|---|
Distinct (%) | 30.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20190616 |
Minimum | 20190601 |
---|---|
Maximum | 20190630 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 20190601 |
---|---|
5-th percentile | 20190603 |
Q1 | 20190609 |
median | 20190617 |
Q3 | 20190624 |
95-th percentile | 20190629 |
Maximum | 20190630 |
Range | 29 |
Interquartile range (IQR) | 15.25 |
Descriptive statistics
Standard deviation | 8.5070143 |
---|---|
Coefficient of variation (CV) | 4.2133505 × 10⁻⁷ |
Kurtosis | -1.1958068 |
Mean | 20190616 |
Median Absolute Deviation (MAD) | 7 |
Skewness | -0.036856072 |
Sum | 2.0190616 × 10⁹ |
Variance | 72.369293 |
Monotonicity | Not monotonic |
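처리일자 is profiled as a real number because the dates are stored as YYYYMMDD integers, so the mean, standard deviation, and CV above are computed on the raw integers rather than on calendar dates. The following is a minimal sketch, assuming calendar semantics are wanted instead; `parse_yyyymmdd` is a hypothetical helper, not part of the report.

```python
from datetime import date, datetime

def parse_yyyymmdd(value):
    """Convert an integer such as 20190601 into a datetime.date."""
    return datetime.strptime(str(value), "%Y%m%d").date()

first = parse_yyyymmdd(20190601)   # reported minimum
last = parse_yyyymmdd(20190630)    # reported maximum
span_days = (last - first).days    # number of days between the two dates
print(first.isoformat(), last.isoformat(), span_days)
```

Here the day span equals the reported integer range of 29 only because all values fall within a single month; across a month boundary the integer difference would no longer be a day count.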
Common values
Value | Count | Frequency (%) |
---|---|---|
20190619 | 5 | 5.0% |
20190614 | 5 | 5.0% |
20190624 | 4 | 4.0% |
20190606 | 4 | 4.0% |
20190626 | 4 | 4.0% |
20190630 | 4 | 4.0% |
20190613 | 4 | 4.0% |
20190610 | 4 | 4.0% |
20190612 | 4 | 4.0% |
20190618 | 4 | 4.0% |
Other values (20) | 58 | 58.0% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
20190601 | 2 | 2.0% |
20190602 | 1 | 1.0% |
20190603 | 4 | 4.0% |
20190604 | 3 | 3.0% |
20190605 | 4 | 4.0% |
20190606 | 4 | 4.0% |
20190607 | 4 | 4.0% |
20190608 | 3 | 3.0% |
20190609 | 1 | 1.0% |
20190610 | 4 | 4.0% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
20190630 | 4 | 4.0% |
20190629 | 3 | 3.0% |
20190628 | 4 | 4.0% |
20190627 | 3 | 3.0% |
20190626 | 4 | 4.0% |
20190625 | 4 | 4.0% |
20190624 | 4 | 4.0% |
20190623 | 3 | 3.0% |
20190622 | 1 | 1.0% |
20190621 | 4 | 4.0% |
관리단
Categorical
Alerts: High correlation, Imbalance
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
500 | 93 |
---|---|
600 | 7 |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 500 |
---|---|
2nd row | 500 |
3rd row | 500 |
4th row | 500 |
5th row | 500 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
500 | 93 | 93.0% |
600 | 7 | 7.0% |
탈수슬러지처리량
Real number (ℝ)
Alerts: High correlation, Zeros
Distinct | 52 |
---|---|
Distinct (%) | 52.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3066.74 |
Minimum | 0 |
---|---|
Maximum | 7410 |
Zeros | 44 |
Zeros (%) | 44.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 3431 |
Q3 | 5840 |
95-th percentile | 6600 |
Maximum | 7410 |
Range | 7410 |
Interquartile range (IQR) | 5840 |
Descriptive statistics
Standard deviation | 2855.256 |
---|---|
Coefficient of variation (CV) | 0.93103949 |
Kurtosis | -1.8568527 |
Mean | 3066.74 |
Median Absolute Deviation (MAD) | 3149 |
Skewness | -0.033752482 |
Sum | 306674 |
Variance | 8152487.1 |
Monotonicity | Not monotonic |
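The descriptive statistics for 탈수슬러지처리량 are related by simple identities: mean = sum / n, CV = standard deviation / mean, and variance = standard deviation squared. The sketch below re-derives them from the reported figures as a consistency check; the variable names are illustrative, and small differences are expected because the reported standard deviation is rounded.

```python
import math

n = 100            # number of observations
total = 306674     # reported Sum
std = 2855.256     # reported standard deviation (rounded)

mean = total / n        # arithmetic mean
cv = std / mean         # coefficient of variation
variance = std ** 2     # slightly off the report because std is rounded

print(round(mean, 2), round(cv, 4), round(variance))
```

The recomputed values agree with the reported mean (3066.74), CV (0.93103949), and variance (8152487.1) to within rounding.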
Common values
Value | Count | Frequency (%) |
---|---|---|
0 | 44 | 44.0% |
5650 | 2 | 2.0% |
5940 | 2 | 2.0% |
5770 | 2 | 2.0% |
6130 | 2 | 2.0% |
6600 | 2 | 2.0% |
7410 | 1 | 1.0% |
4810 | 1 | 1.0% |
3930 | 1 | 1.0% |
4910 | 1 | 1.0% |
Other values (42) | 42 | 42.0% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
0 | 44 | 44.0% |
2424 | 1 | 1.0% |
2728 | 1 | 1.0% |
3018 | 1 | 1.0% |
3224 | 1 | 1.0% |
3328 | 1 | 1.0% |
3348 | 1 | 1.0% |
3514 | 1 | 1.0% |
3930 | 1 | 1.0% |
4540 | 1 | 1.0% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
7410 | 1 | 1.0% |
6990 | 1 | 1.0% |
6750 | 1 | 1.0% |
6730 | 1 | 1.0% |
6600 | 2 | 2.0% |
6560 | 1 | 1.0% |
6510 | 1 | 1.0% |
6440 | 1 | 1.0% |
6410 | 1 | 1.0% |
6370 | 1 | 1.0% |
액상슬러지처리량
Real number (ℝ)
Alerts: Zeros
Distinct | 9 |
---|---|
Distinct (%) | 9.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 134.4 |
Minimum | 0 |
---|---|
Maximum | 2830 |
Zeros | 91 |
Zeros (%) | 91.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 1292 |
Maximum | 2830 |
Range | 2830 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 453.64365 |
---|---|
Coefficient of variation (CV) | 3.3753248 |
Kurtosis | 14.599807 |
Mean | 134.4 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 3.6726928 |
Sum | 13440 |
Variance | 205792.57 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
---|---|---|
0 | 91 | 91.0% |
1290 | 2 | 2.0% |
1430 | 1 | 1.0% |
1500 | 1 | 1.0% |
1420 | 1 | 1.0% |
1330 | 1 | 1.0% |
1200 | 1 | 1.0% |
1150 | 1 | 1.0% |
2830 | 1 | 1.0% |
Minimum values
Value | Count | Frequency (%) |
---|---|---|
0 | 91 | 91.0% |
1150 | 1 | 1.0% |
1200 | 1 | 1.0% |
1290 | 2 | 2.0% |
1330 | 1 | 1.0% |
1420 | 1 | 1.0% |
1430 | 1 | 1.0% |
1500 | 1 | 1.0% |
2830 | 1 | 1.0% |
Maximum values
Value | Count | Frequency (%) |
---|---|---|
2830 | 1 | 1.0% |
1500 | 1 | 1.0% |
1430 | 1 | 1.0% |
1420 | 1 | 1.0% |
1330 | 1 | 1.0% |
1290 | 2 | 2.0% |
1200 | 1 | 1.0% |
1150 | 1 | 1.0% |
0 | 91 | 91.0% |
탈수슬러지함수율
Real number (ℝ)
Alerts: High correlation, Zeros
Distinct | 35 |
---|---|
Distinct (%) | 35.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5677.7 |
Minimum | 0 |
---|---|
Maximum | 28370 |
Zeros | 56 |
Zeros (%) | 56.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 11932.5 |
95-th percentile | 18954 |
Maximum | 28370 |
Range | 28370 |
Interquartile range (IQR) | 11932.5 |
Descriptive statistics
Standard deviation | 7400.1448 |
---|---|
Coefficient of variation (CV) | 1.3033702 |
Kurtosis | 0.5243413 |
Mean | 5677.7 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 1.1100744 |
Sum | 567770 |
Variance | 54762143 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
---|---|---|
0 | 56 | 56.0% |
11940 | 4 | 4.0% |
11950 | 3 | 3.0% |
17930 | 2 | 2.0% |
5970 | 2 | 2.0% |
9490 | 2 | 2.0% |
9420 | 2 | 2.0% |
9470 | 2 | 2.0% |
13160 | 1 | 1.0% |
18280 | 1 | 1.0% |
Other values (25) | 25 | 25.0% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
0 | 56 | 56.0% |
5570 | 1 | 1.0% |
5970 | 2 | 2.0% |
5980 | 1 | 1.0% |
6410 | 1 | 1.0% |
6520 | 1 | 1.0% |
9390 | 1 | 1.0% |
9420 | 2 | 2.0% |
9430 | 1 | 1.0% |
9440 | 1 | 1.0% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
28370 | 1 | 1.0% |
28280 | 1 | 1.0% |
26320 | 1 | 1.0% |
19070 | 1 | 1.0% |
19030 | 1 | 1.0% |
18950 | 1 | 1.0% |
18280 | 1 | 1.0% |
17930 | 2 | 2.0% |
17910 | 1 | 1.0% |
17890 | 1 | 1.0% |
협잡물처리량
Categorical
Alerts: High correlation, Imbalance
Distinct | 3 |
---|---|
Distinct (%) | 3.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
0.0 | 93 |
---|---|
83.5 | 5 |
83.4 | 2 |
Length
Max length | 4 |
---|---|
Median length | 3 |
Mean length | 3.07 |
Min length | 3 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0.0 |
---|---|
2nd row | 0.0 |
3rd row | 0.0 |
4th row | 0.0 |
5th row | 0.0 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
0.0 | 93 | 93.0% |
83.5 | 5 | 5.0% |
83.4 | 2 | 2.0% |
협잡물함수율
Categorical
Alerts: Constant
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
0 | 100 |
---|---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
0 | 100 | 100.0% |
액상슬러지함수율
Categorical
Alerts: Constant
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
0 | 100 |
---|---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
0 | 100 | 100.0% |
Correlations
Variable | 권역 | 하수처리시설명 | 처리일자 | 관리단 | 탈수슬러지처리량 | 액상슬러지처리량 | 탈수슬러지함수율 | 협잡물처리량 |
---|---|---|---|---|---|---|---|---|
권역 | 1.000 | 1.000 | 0.000 | 0.993 | 1.000 | 0.495 | 0.000 | 1.000 |
하수처리시설명 | 1.000 | 1.000 | 0.000 | 1.000 | 0.978 | 0.501 | 0.776 | 0.667 |
처리일자 | 0.000 | 0.000 | 1.000 | 0.000 | 0.063 | 0.000 | 0.000 | 0.079 |
관리단 | 0.993 | 1.000 | 0.000 | 1.000 | 1.000 | 0.495 | 0.000 | 1.000 |
탈수슬러지처리량 | 1.000 | 0.978 | 0.063 | 1.000 | 1.000 | 0.819 | 0.535 | 0.793 |
액상슬러지처리량 | 0.495 | 0.501 | 0.000 | 0.495 | 0.819 | 1.000 | 0.000 | 0.277 |
탈수슬러지함수율 | 0.000 | 0.776 | 0.000 | 0.000 | 0.535 | 0.000 | 1.000 | 0.000 |
협잡물처리량 | 1.000 | 0.667 | 0.079 | 1.000 | 0.793 | 0.277 | 0.000 | 1.000 |
Variable | 권역 | 하수처리시설명 | 관리단 | 협잡물처리량 |
---|---|---|---|---|
권역 | 1.000 | 0.990 | 0.922 | 0.995 |
하수처리시설명 | 0.990 | 1.000 | 0.990 | 0.692 |
관리단 | 0.922 | 0.990 | 1.000 | 0.995 |
협잡물처리량 | 0.995 | 0.692 | 0.995 | 1.000 |
Variable | 처리일자 | 탈수슬러지처리량 | 액상슬러지처리량 | 탈수슬러지함수율 | 권역 | 하수처리시설명 | 관리단 | 협잡물처리량 |
---|---|---|---|---|---|---|---|---|
처리일자 | 1.000 | -0.097 | 0.014 | -0.080 | 0.000 | 0.000 | 0.000 | 0.061 |
탈수슬러지처리량 | -0.097 | 1.000 | 0.251 | -0.851 | 0.969 | 0.784 | 0.969 | 0.696 |
액상슬러지처리량 | 0.014 | 0.251 | 1.000 | -0.264 | 0.332 | 0.213 | 0.332 | 0.264 |
탈수슬러지함수율 | -0.080 | -0.851 | -0.264 | 1.000 | 0.000 | 0.653 | 0.000 | 0.000 |
권역 | 0.000 | 0.969 | 0.332 | 0.000 | 1.000 | 0.990 | 0.922 | 0.995 |
하수처리시설명 | 0.000 | 0.784 | 0.213 | 0.653 | 0.990 | 1.000 | 0.990 | 0.692 |
관리단 | 0.000 | 0.969 | 0.332 | 0.000 | 0.922 | 0.990 | 1.000 | 0.995 |
협잡물처리량 | 0.061 | 0.696 | 0.264 | 0.000 | 0.995 | 0.692 | 0.995 | 1.000 |
First rows
Index | 권역 | 하수처리시설명 | 처리일자 | 관리단 | 탈수슬러지처리량 | 액상슬러지처리량 | 탈수슬러지함수율 | 협잡물처리량 | 협잡물함수율 | 액상슬러지함수율 |
---|---|---|---|---|---|---|---|---|---|---|
0 | 90 | 50003 | 20190624 | 500 | 0 | 0 | 11950 | 0.0 | 0 | 0 |
1 | 90 | 50001 | 20190623 | 500 | 5800 | 0 | 0 | 0.0 | 0 | 0 |
2 | 90 | 50003 | 20190607 | 500 | 0 | 0 | 11940 | 0.0 | 0 | 0 |
3 | 90 | 50003 | 20190621 | 500 | 0 | 0 | 17930 | 0.0 | 0 | 0 |
4 | 90 | 50001 | 20190605 | 500 | 5650 | 0 | 0 | 0.0 | 0 | 0 |
5 | 90 | 50001 | 20190608 | 500 | 6310 | 0 | 0 | 0.0 | 0 | 0 |
6 | 90 | 50003 | 20190605 | 500 | 0 | 0 | 11950 | 0.0 | 0 | 0 |
7 | 90 | 50001 | 20190614 | 500 | 5410 | 0 | 0 | 0.0 | 0 | 0 |
8 | 90 | 50003 | 20190617 | 500 | 0 | 0 | 17930 | 0.0 | 0 | 0 |
9 | 90 | 50003 | 20190620 | 500 | 0 | 0 | 5970 | 0.0 | 0 | 0 |
Last rows
Index | 권역 | 하수처리시설명 | 처리일자 | 관리단 | 탈수슬러지처리량 | 액상슬러지처리량 | 탈수슬러지함수율 | 협잡물처리량 | 협잡물함수율 | 액상슬러지함수율 |
---|---|---|---|---|---|---|---|---|---|---|
90 | 90 | 50001 | 20190607 | 500 | 5650 | 0 | 0 | 0.0 | 0 | 0 |
91 | 90 | 50001 | 20190621 | 500 | 6410 | 0 | 0 | 0.0 | 0 | 0 |
92 | 90 | 50001 | 20190622 | 500 | 5950 | 0 | 0 | 0.0 | 0 | 0 |
93 | 91 | 60001 | 20190620 | 600 | 3328 | 0 | 0 | 83.4 | 0 | 0 |
94 | 91 | 60001 | 20190619 | 600 | 2424 | 0 | 0 | 83.4 | 0 | 0 |
95 | 91 | 60001 | 20190608 | 600 | 3224 | 0 | 0 | 83.5 | 0 | 0 |
96 | 91 | 60001 | 20190614 | 600 | 3348 | 0 | 0 | 83.5 | 0 | 0 |
97 | 91 | 60001 | 20190629 | 600 | 2728 | 2830 | 0 | 83.5 | 0 | 0 |
98 | 91 | 60001 | 20190606 | 600 | 3018 | 0 | 0 | 83.5 | 0 | 0 |
99 | 91 | 60001 | 20190618 | 600 | 3514 | 0 | 0 | 83.5 | 0 | 0 |