Dataset statistics
Number of variables | 10 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 8.9 KiB |
Average record size in memory | 91.3 B |
Variable types
Categorical | 6 |
---|---|
Numeric | 4 |
Dataset
Description | Sample data |
---|---|
Author | 한국수자원공사 |
URL | https://www.bigdata-environment.kr/user/data_market/detail.do?id=c6697020-38bb-11ea-be28-4fa0eb812a46 |
협잡물함수율 has constant value "0" | Constant |
액상슬러지함수율 has constant value "0" | Constant |
권역 is highly overall correlated with 하수처리시설명 and 2 other fields | High correlation |
하수처리시설명 is highly overall correlated with 탈수슬러지처리량 and 3 other fields | High correlation |
관리단 is highly overall correlated with 탈수슬러지처리량 and 3 other fields | High correlation |
협잡물처리량 is highly overall correlated with 권역 and 1 other field | High correlation |
탈수슬러지처리량 is highly overall correlated with 탈수슬러지함수율 and 2 other fields | High correlation |
탈수슬러지함수율 is highly overall correlated with 탈수슬러지처리량 and 1 other field | High correlation |
관리단 is highly imbalanced (56.6%) | Imbalance |
협잡물처리량 is highly imbalanced (64.9%) | Imbalance |
탈수슬러지처리량 has 38 (38.0%) zeros | Zeros |
액상슬러지처리량 has 93 (93.0%) zeros | Zeros |
탈수슬러지함수율 has 62 (62.0%) zeros | Zeros |
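The two imbalance percentages in the alerts above are consistent with a normalised-entropy score, 1 - H / log2(k), where H is the Shannon entropy of the class frequencies and k is the number of distinct classes. The sketch below assumes that definition (the report itself does not state the formula) and applies it to the counts listed in the Common Values tables for 관리단 and 협잡물처리량. The function name `imbalance_score` is illustrative only.

```python
from math import log2


def imbalance_score(counts):
    """Return 1 - H / log2(k): 0.0 for a uniform column, close to 1.0 when one class dominates."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        # Assumption: a single-class column is reported as Constant, not Imbalance.
        return 0.0
    entropy = -sum((c / total) * log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / log2(k)


# Counts copied from the Common Values tables in this report.
management_unit_counts = [84, 15, 1]      # 관리단: 500, 600, 400
screenings_counts = [85, 10, 3, 1, 1]     # 협잡물처리량: 0.0, 83.4, 83.3, 83.0, 82.9

score_management_unit = 100 * imbalance_score(management_unit_counts)
score_screenings = 100 * imbalance_score(screenings_counts)

print(f"관리단: {score_management_unit:.1f}%")        # about 56.6%
print(f"협잡물처리량: {score_screenings:.1f}%")        # about 64.9%
```

Under this assumption both alert values are reproduced from the counts alone, which suggests the alert reflects how far the class distribution is from uniform rather than a simple majority share.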
Reproduction
Analysis started | 2023-12-10 13:03:32.088287 |
---|---|
Analysis finished | 2023-12-10 13:03:33.886182 |
Duration | 1.8 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
권역
Categorical
HIGH CORRELATION
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 90 |
---|---|
2nd row | 90 |
3rd row | 90 |
4th row | 90 |
5th row | 90 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
90 | 84 | 84.0% |
91 | 16 | 16.0% |
하수처리시설명
Categorical
HIGH CORRELATION
Distinct | 5 |
---|---|
Distinct (%) | 5.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 5 |
Min length | 5 |
Unique
Unique | 1 |
---|---|
Unique (%) | 1.0% |
Sample
1st row | 50001 |
---|---|
2nd row | 50002 |
3rd row | 50001 |
4th row | 50001 |
5th row | 50002 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
50001 | 46 | 46.0% |
50002 | 24 | 24.0% |
60001 | 15 | 15.0% |
50003 | 14 | 14.0% |
40001 | 1 | 1.0% |
처리일자
Real number (ℝ)
Distinct | 31 |
---|---|
Distinct (%) | 31.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20190316 |
Minimum | 20190301 |
---|---|
Maximum | 20190331 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 20190301 |
---|---|
5-th percentile | 20190303 |
Q1 | 20190308 |
median | 20190317 |
Q3 | 20190323 |
95-th percentile | 20190330 |
Maximum | 20190331 |
Range | 30 |
Interquartile range (IQR) | 15.25 |
Descriptive statistics
Standard deviation | 8.7604252 |
---|---|
Coefficient of variation (CV) | 4.3389242 × 10⁻⁷ |
Kurtosis | -1.1674996 |
Mean | 20190316 |
Median Absolute Deviation (MAD) | 7 |
Skewness | -0.020959062 |
Sum | 2.0190316 × 10⁹ |
Variance | 76.745051 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
---|---|---|
20190307 | 5 | 5.0% |
20190304 | 5 | 5.0% |
20190318 | 5 | 5.0% |
20190305 | 4 | 4.0% |
20190322 | 4 | 4.0% |
20190319 | 4 | 4.0% |
20190327 | 4 | 4.0% |
20190325 | 4 | 4.0% |
20190320 | 4 | 4.0% |
20190331 | 4 | 4.0% |
Other values (21) | 57 | 57.0% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
20190301 | 2 | 2.0% |
20190302 | 2 | 2.0% |
20190303 | 2 | 2.0% |
20190304 | 5 | 5.0% |
20190305 | 4 | 4.0% |
20190306 | 3 | 3.0% |
20190307 | 5 | 5.0% |
20190308 | 3 | 3.0% |
20190309 | 1 | 1.0% |
20190310 | 2 | 2.0% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
20190331 | 4 | 4.0% |
20190330 | 3 | 3.0% |
20190329 | 3 | 3.0% |
20190328 | 2 | 2.0% |
20190327 | 4 | 4.0% |
20190326 | 2 | 2.0% |
20190325 | 4 | 4.0% |
20190324 | 3 | 3.0% |
20190323 | 3 | 3.0% |
20190322 | 4 | 4.0% |
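The 처리일자 column is profiled as a real number, but its values (20190301 to 20190331) look like calendar dates encoded as YYYYMMDD integers, so statistics such as the mean or standard deviation only make sense within a single month. The sketch below is one possible way to convert such integers to dates before analysis; the YYYYMMDD interpretation and the helper name `parse_yyyymmdd` are assumptions, not something the report states.

```python
from datetime import date, datetime


def parse_yyyymmdd(value: int) -> date:
    """Convert an integer such as 20190307 into a date (assumes YYYYMMDD encoding)."""
    return datetime.strptime(str(value), "%Y%m%d").date()


# Minimum and maximum taken from the quantile statistics of 처리일자.
first_day = parse_yyyymmdd(20190301)
last_day = parse_yyyymmdd(20190331)

span_days = (last_day - first_day).days
print(first_day.isoformat(), last_day.isoformat(), span_days)
```

Within one month the day difference coincides with the reported Range of 30, but across a month boundary the integer difference would no longer equal the number of days, which is the main reason to convert first.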
관리단
Categorical
HIGH CORRELATION, IMBALANCE
Distinct | 3 |
---|---|
Distinct (%) | 3.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 1 |
---|---|
Unique (%) | 1.0% |
Sample
1st row | 500 |
---|---|
2nd row | 500 |
3rd row | 500 |
4th row | 500 |
5th row | 500 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
500 | 84 | 84.0% |
600 | 15 | 15.0% |
400 | 1 | 1.0% |
탈수슬러지처리량
Real number (ℝ)
HIGH CORRELATION, ZEROS
Distinct | 58 |
---|---|
Distinct (%) | 58.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3842.51 |
Minimum | 0 |
---|---|
Maximum | 64150 |
Zeros | 38 |
Zeros (%) | 38.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 3644 |
Q3 | 5780 |
95-th percentile | 6939.5 |
Maximum | 64150 |
Range | 64150 |
Interquartile range (IQR) | 5780 |
Descriptive statistics
Standard deviation | 6764.1927 |
---|---|
Coefficient of variation (CV) | 1.7603579 |
Kurtosis | 64.803484 |
Mean | 3842.51 |
Median Absolute Deviation (MAD) | 2841 |
Skewness | 7.2783706 |
Sum | 384251 |
Variance | 45754303 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
---|---|---|
0 | 38 | 38.0% |
4900 | 3 | 3.0% |
5840 | 2 | 2.0% |
2676 | 2 | 2.0% |
4370 | 2 | 2.0% |
4520 | 1 | 1.0% |
5670 | 1 | 1.0% |
4480 | 1 | 1.0% |
6820 | 1 | 1.0% |
5810 | 1 | 1.0% |
Other values (48) | 48 | 48.0% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
0 | 38 | 38.0% |
988 | 1 | 1.0% |
2370 | 1 | 1.0% |
2394 | 1 | 1.0% |
2408 | 1 | 1.0% |
2676 | 2 | 2.0% |
2752 | 1 | 1.0% |
3058 | 1 | 1.0% |
3135 | 1 | 1.0% |
3173 | 1 | 1.0% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
64150 | 1 | 1.0% |
11098 | 1 | 1.0% |
10666 | 1 | 1.0% |
10600 | 1 | 1.0% |
7500 | 1 | 1.0% |
6910 | 1 | 1.0% |
6830 | 1 | 1.0% |
6820 | 1 | 1.0% |
6760 | 1 | 1.0% |
6590 | 1 | 1.0% |
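The skewness (7.28) and kurtosis (64.8) reported for 탈수슬러지처리량 suggest that a few large values dominate the distribution. One common way to check this, which the report itself does not apply, is Tukey's upper fence, Q3 + 1.5 × IQR. The sketch below uses the quartiles and the ten largest values listed above; the helper name `upper_fence` and the multiplier 1.5 are conventional choices rather than anything taken from the report.

```python
def upper_fence(q1: float, q3: float, k: float = 1.5) -> float:
    """Tukey's upper fence: values above q3 + k * IQR are flagged as potential outliers."""
    return q3 + k * (q3 - q1)


# Quartiles from the quantile statistics of 탈수슬러지처리량.
Q1, Q3 = 0.0, 5780.0

# The ten largest values listed for the column.
top_values = [64150, 11098, 10666, 10600, 7500, 6910, 6830, 6820, 6760, 6590]

fence = upper_fence(Q1, Q3)
outliers = [v for v in top_values if v > fence]

print(fence)     # 14450.0
print(outliers)  # [64150]
```

By this rule only the maximum, 64150, is flagged; the next largest value, 11098, sits below the fence.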
액상슬러지처리량
Real number (ℝ)
ZEROS
Distinct | 8 |
---|---|
Distinct (%) | 8.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 168.4 |
Minimum | 0 |
---|---|
Maximum | 3410 |
Zeros | 93 |
Zeros (%) | 93.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 1953.5 |
Maximum | 3410 |
Range | 3410 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 631.38571 |
---|---|
Coefficient of variation (CV) | 3.7493213 |
Kurtosis | 12.840404 |
Mean | 168.4 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 3.7088666 |
Sum | 16840 |
Variance | 398647.92 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
---|---|---|
0 | 93 | 93.0% |
1730 | 1 | 1.0% |
2580 | 1 | 1.0% |
2600 | 1 | 1.0% |
2210 | 1 | 1.0% |
1940 | 1 | 1.0% |
2370 | 1 | 1.0% |
3410 | 1 | 1.0% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
0 | 93 | 93.0% |
1730 | 1 | 1.0% |
1940 | 1 | 1.0% |
2210 | 1 | 1.0% |
2370 | 1 | 1.0% |
2580 | 1 | 1.0% |
2600 | 1 | 1.0% |
3410 | 1 | 1.0% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
3410 | 1 | 1.0% |
2600 | 1 | 1.0% |
2580 | 1 | 1.0% |
2370 | 1 | 1.0% |
2210 | 1 | 1.0% |
1940 | 1 | 1.0% |
1730 | 1 | 1.0% |
0 | 93 | 93.0% |
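Because 액상슬러지처리량 has only 8 distinct values, the frequency table above determines the whole column, and the summary statistics can be recomputed from it. The sketch below rebuilds the 100 observations and recomputes the mean, sample variance, standard deviation, coefficient of variation and 95th percentile with Python's `statistics` module. Using the sample (n - 1) variance and linear interpolation for the percentile is an assumption about how the report computes these figures.

```python
import statistics

# Non-zero values from the frequency table; the remaining 93 observations are 0.
nonzero = [1730, 1940, 2210, 2370, 2580, 2600, 3410]
values = [0] * 93 + nonzero

mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance (divides by n - 1)
std = statistics.stdev(values)
cv = std / mean                          # coefficient of variation = std / mean

# 95th percentile: last of the 19 cut points that split the data into 20 parts.
p95 = statistics.quantiles(values, n=20, method="inclusive")[-1]

print(mean, round(variance, 2), round(std, 5), round(cv, 7), p95)
```

Under these assumptions the results agree with the reported Mean (168.4), Variance (398647.92), Standard deviation (631.38571), CV (3.7493213) and 95-th percentile (1953.5). The mean of the seven non-zero observations alone is about 2405.7, which may be the more informative figure for a column that is 93% zeros.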
탈수슬러지함수율
Real number (ℝ)
HIGH CORRELATION, ZEROS
Distinct | 30 |
---|---|
Distinct (%) | 30.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6140.5 |
Minimum | 0 |
---|---|
Maximum | 41750 |
Zeros | 62 |
Zeros (%) | 62.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 10560 |
95-th percentile | 24177 |
Maximum | 41750 |
Range | 41750 |
Interquartile range (IQR) | 10560 |
Descriptive statistics
Standard deviation | 9409.4024 |
---|---|
Coefficient of variation (CV) | 1.5323512 |
Kurtosis | 2.1014971 |
Mean | 6140.5 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 1.5956329 |
Sum | 614050 |
Variance | 88536853 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
---|---|---|
0 | 62 | 62.0% |
10560 | 4 | 4.0% |
10540 | 2 | 2.0% |
10570 | 2 | 2.0% |
10520 | 2 | 2.0% |
10510 | 2 | 2.0% |
11930 | 2 | 2.0% |
10530 | 2 | 2.0% |
5960 | 1 | 1.0% |
29820 | 1 | 1.0% |
Other values (20) | 20 | 20.0% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
0 | 62 | 62.0% |
5960 | 1 | 1.0% |
5970 | 1 | 1.0% |
10370 | 1 | 1.0% |
10490 | 1 | 1.0% |
10510 | 2 | 2.0% |
10520 | 2 | 2.0% |
10530 | 2 | 2.0% |
10540 | 2 | 2.0% |
10560 | 4 | 4.0% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
41750 | 1 | 1.0% |
31710 | 1 | 1.0% |
31630 | 1 | 1.0% |
31550 | 1 | 1.0% |
29820 | 1 | 1.0% |
23880 | 1 | 1.0% |
23850 | 1 | 1.0% |
21190 | 1 | 1.0% |
21070 | 1 | 1.0% |
21060 | 1 | 1.0% |
협잡물처리량
Categorical
HIGH CORRELATION, IMBALANCE
Distinct | 5 |
---|---|
Distinct (%) | 5.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 4 |
---|---|
Median length | 3 |
Mean length | 3.15 |
Min length | 3 |
Unique
Unique | 2 |
---|---|
Unique (%) | 2.0% |
Sample
1st row | 0.0 |
---|---|
2nd row | 0.0 |
3rd row | 0.0 |
4th row | 0.0 |
5th row | 0.0 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
0.0 | 85 | 85.0% |
83.4 | 10 | 10.0% |
83.3 | 3 | 3.0% |
83.0 | 1 | 1.0% |
82.9 | 1 | 1.0% |
협잡물함수율
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
0 | 100 | 100.0% |
액상슬러지함수율
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
0 | 100 | 100.0% |
Variable | 권역 | 하수처리시설명 | 처리일자 | 관리단 | 탈수슬러지처리량 | 액상슬러지처리량 | 탈수슬러지함수율 | 협잡물처리량 |
---|---|---|---|---|---|---|---|---|
권역 | 1.000 | 1.000 | 0.000 | 1.000 | 0.127 | 0.000 | 0.230 | 0.841 |
하수처리시설명 | 1.000 | 1.000 | 0.000 | 1.000 | 0.730 | 0.000 | 0.673 | 0.836 |
처리일자 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.151 | 0.000 | 0.000 |
관리단 | 1.000 | 1.000 | 0.000 | 1.000 | 0.941 | 0.000 | 0.000 | 0.707 |
탈수슬러지처리량 | 0.127 | 0.730 | 0.000 | 0.941 | 1.000 | 0.000 | 0.000 | 0.253 |
액상슬러지처리량 | 0.000 | 0.000 | 0.151 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 |
탈수슬러지함수율 | 0.230 | 0.673 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 |
협잡물처리량 | 0.841 | 0.836 | 0.000 | 0.707 | 0.253 | 0.000 | 0.000 | 1.000 |
Variable | 권역 | 하수처리시설명 | 관리단 | 협잡물처리량 |
---|---|---|---|---|
권역 | 1.000 | 0.985 | 0.995 | 0.946 |
하수처리시설명 | 0.985 | 1.000 | 0.990 | 0.467 |
관리단 | 0.995 | 0.990 | 1.000 | 0.685 |
협잡물처리량 | 0.946 | 0.467 | 0.685 | 1.000 |
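The categorical-only matrix above is unlabelled. Its value of 0.995 for 권역 versus 관리단 is what a bias-corrected Cramér's V would give if the two columns were perfectly associated, so the sketch below assumes that statistic. The contingency table is also an assumption: it is inferred from the marginal counts (권역: 84 and 16; 관리단: 84, 15 and 1) and the sample rows, with 권역 90 mapping to 관리단 500 and 권역 91 mapping to 600 and 400. The function name `cramers_v_corrected` is illustrative.

```python
from math import sqrt


def cramers_v_corrected(table):
    """Bias-corrected Cramér's V for a contingency table given as a list of rows."""
    n = sum(sum(row) for row in table)
    row_sums = [sum(row) for row in table]
    col_sums = [sum(col) for col in zip(*table)]

    # Pearson chi-squared statistic (no continuity correction).
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_sums[i] * col_sums[j] / n
            chi2 += (observed - expected) ** 2 / expected

    r, k = len(table), len(table[0])
    phi2 = chi2 / n
    # Small-sample bias correction of phi^2 and of the table dimensions.
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    return sqrt(phi2_corr / min(k_corr - 1, r_corr - 1))


# Assumed contingency table: rows are 권역 (90, 91), columns are 관리단 (500, 600, 400).
region_by_unit = [
    [84, 0, 0],
    [0, 15, 1],
]

v = cramers_v_corrected(region_by_unit)
print(round(v, 3))  # 0.995
```

The uncorrected statistic would be exactly 1.0 for this table; the correction is what pulls it slightly below 1 with only 100 observations.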
Variable | 처리일자 | 탈수슬러지처리량 | 액상슬러지처리량 | 탈수슬러지함수율 | 권역 | 하수처리시설명 | 관리단 | 협잡물처리량 |
---|---|---|---|---|---|---|---|---|
처리일자 | 1.000 | -0.008 | 0.072 | 0.049 | 0.000 | 0.000 | 0.000 | 0.000 |
탈수슬러지처리량 | -0.008 | 1.000 | 0.163 | -0.833 | 0.208 | 0.716 | 0.704 | 0.194 |
액상슬러지처리량 | 0.072 | 0.163 | 1.000 | -0.133 | 0.000 | 0.000 | 0.000 | 0.000 |
탈수슬러지함수율 | 0.049 | -0.833 | -0.133 | 1.000 | 0.238 | 0.512 | 0.000 | 0.000 |
권역 | 0.000 | 0.208 | 0.000 | 0.238 | 1.000 | 0.985 | 0.995 | 0.946 |
하수처리시설명 | 0.000 | 0.716 | 0.000 | 0.512 | 0.985 | 1.000 | 0.990 | 0.467 |
관리단 | 0.000 | 0.704 | 0.000 | 0.000 | 0.995 | 0.990 | 1.000 | 0.685 |
협잡물처리량 | 0.000 | 0.194 | 0.000 | 0.000 | 0.946 | 0.467 | 0.685 | 1.000 |
Row | 권역 | 하수처리시설명 | 처리일자 | 관리단 | 탈수슬러지처리량 | 액상슬러지처리량 | 탈수슬러지함수율 | 협잡물처리량 | 협잡물함수율 | 액상슬러지함수율 |
---|---|---|---|---|---|---|---|---|---|---|
0 | 90 | 50001 | 20190307 | 500 | 5000 | 0 | 0 | 0.0 | 0 | 0 |
1 | 90 | 50002 | 20190303 | 500 | 0 | 0 | 10540 | 0.0 | 0 | 0 |
2 | 90 | 50001 | 20190304 | 500 | 5880 | 0 | 0 | 0.0 | 0 | 0 |
3 | 90 | 50001 | 20190307 | 500 | 5920 | 0 | 0 | 0.0 | 0 | 0 |
4 | 90 | 50002 | 20190311 | 500 | 0 | 0 | 31550 | 0.0 | 0 | 0 |
5 | 90 | 50002 | 20190304 | 500 | 0 | 0 | 20160 | 0.0 | 0 | 0 |
6 | 90 | 50001 | 20190302 | 500 | 5770 | 0 | 0 | 0.0 | 0 | 0 |
7 | 90 | 50001 | 20190303 | 500 | 5910 | 0 | 0 | 0.0 | 0 | 0 |
8 | 90 | 50001 | 20190306 | 500 | 4370 | 0 | 0 | 0.0 | 0 | 0 |
9 | 90 | 50002 | 20190321 | 500 | 0 | 0 | 10570 | 0.0 | 0 | 0 |
Row | 권역 | 하수처리시설명 | 처리일자 | 관리단 | 탈수슬러지처리량 | 액상슬러지처리량 | 탈수슬러지함수율 | 협잡물처리량 | 협잡물함수율 | 액상슬러지함수율 |
---|---|---|---|---|---|---|---|---|---|---|
90 | 91 | 60001 | 20190324 | 600 | 2370 | 0 | 0 | 83.4 | 0 | 0 |
91 | 91 | 60001 | 20190323 | 600 | 2394 | 0 | 0 | 83.3 | 0 | 0 |
92 | 91 | 60001 | 20190322 | 600 | 3249 | 0 | 0 | 83.4 | 0 | 0 |
93 | 91 | 60001 | 20190307 | 600 | 3135 | 0 | 0 | 83.4 | 0 | 0 |
94 | 91 | 60001 | 20190305 | 600 | 3268 | 0 | 0 | 83.3 | 0 | 0 |
95 | 91 | 60001 | 20190318 | 600 | 11098 | 0 | 0 | 83.4 | 0 | 0 |
96 | 91 | 60001 | 20190310 | 600 | 2676 | 0 | 0 | 83.4 | 0 | 0 |
97 | 91 | 60001 | 20190304 | 600 | 10666 | 0 | 0 | 82.9 | 0 | 0 |
98 | 91 | 60001 | 20190330 | 600 | 2408 | 0 | 0 | 83.4 | 0 | 0 |
99 | 91 | 60001 | 20190312 | 600 | 2676 | 0 | 0 | 83.4 | 0 | 0 |