Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 25 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 3 |
Duplicate rows (%) | 12.0% |
Total size in memory | 1.4 KiB |
Average record size in memory | 57.3 B |
Variable types
DateTime | 1 |
---|---|
Numeric | 1 |
Categorical | 4 |
Dataset
Description | 한국남동발전 환경화학 시스템 내 용수 관리 정보입니다. 분석기간에 따른 원수비용, 여과수비용, 음용수비용 등의 데이터를 포함하고 있습니다. |
---|---|
Author | 한국남동발전㈜ |
URL | https://www.data.go.kr/data/15093003/fileData.do |
통화단위 has constant value "" | Constant |
Dataset has 3 (12.0%) duplicate rows | Duplicates |
순수비용 is highly overall correlated with 여과수비용 and 1 other fields | High correlation |
음용수비용 is highly overall correlated with 원수비용 and 2 other fields | High correlation |
여과수비용 is highly overall correlated with 음용수비용 and 1 other fields | High correlation |
원수비용 is highly overall correlated with 음용수비용 | High correlation |
여과수비용 is highly imbalanced (64.0%) | Imbalance |
음용수비용 is highly imbalanced (75.8%) | Imbalance |
순수비용 is highly imbalanced (64.0%) | Imbalance |
원수비용 has 17 (68.0%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-12 18:19:02.801680 |
---|---|
Analysis finished | 2023-12-12 18:19:03.368244 |
Duration | 0.57 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
분석일자
Date
Distinct | 4 |
---|---|
Distinct (%) | 16.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 332.0 B |
Minimum | 2020-01-11 00:00:00 |
---|---|
Maximum | 2020-07-10 00:00:00 |
원수비용
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 9 |
---|---|
Distinct (%) | 36.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 8058855.4 |
Minimum | 0 |
---|---|
Maximum | 65554521 |
Zeros | 17 |
Zeros (%) | 68.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 357.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 6825654 |
95-th percentile | 43275860 |
Maximum | 65554521 |
Range | 65554521 |
Interquartile range (IQR) | 6825654 |
Descriptive statistics
Standard deviation | 17026267 |
---|---|
Coefficient of variation (CV) | 2.1127401 |
Kurtosis | 5.5162727 |
Mean | 8058855.4 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 2.4557808 |
Sum | 2.0147138 × 108 |
Variance | 2.8989377 × 1014 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 17 | |
42843300 | 1 | 4.0% |
13721290 | 1 | 4.0% |
4075826 | 1 | 4.0% |
43384000 | 1 | 4.0% |
12135580 | 1 | 4.0% |
6825654 | 1 | 4.0% |
65554521 | 1 | 4.0% |
12931213 | 1 | 4.0% |
Value | Count | Frequency (%) |
0 | 17 | |
4075826 | 1 | 4.0% |
6825654 | 1 | 4.0% |
12135580 | 1 | 4.0% |
12931213 | 1 | 4.0% |
13721290 | 1 | 4.0% |
42843300 | 1 | 4.0% |
43384000 | 1 | 4.0% |
65554521 | 1 | 4.0% |
Value | Count | Frequency (%) |
65554521 | 1 | 4.0% |
43384000 | 1 | 4.0% |
42843300 | 1 | 4.0% |
13721290 | 1 | 4.0% |
12931213 | 1 | 4.0% |
12135580 | 1 | 4.0% |
6825654 | 1 | 4.0% |
4075826 | 1 | 4.0% |
0 | 17 |
여과수비용
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | 16.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 332.0 B |
0 | |
---|---|
1165887 | 1 |
2226968 | 1 |
277 | 1 |
Length
Max length | 7 |
---|---|
Median length | 1 |
Mean length | 1.56 |
Min length | 1 |
Unique
Unique | 3 ? |
---|---|
Unique (%) | 12.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 1165887 |
5th row | 2226968 |
Common Values
Value | Count | Frequency (%) |
0 | 22 | |
1165887 | 1 | 4.0% |
2226968 | 1 | 4.0% |
277 | 1 | 4.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 22 | |
1165887 | 1 | 4.0% |
2226968 | 1 | 4.0% |
277 | 1 | 4.0% |
음용수비용
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 8.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 332.0 B |
0 | |
---|---|
268 | 1 |
Length
Max length | 3 |
---|---|
Median length | 1 |
Mean length | 1.08 |
Min length | 1 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 4.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 24 | |
268 | 1 | 4.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 24 | |
268 | 1 | 4.0% |
순수비용
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | 16.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 332.0 B |
0 | |
---|---|
1318124 | 1 |
2249928 | 1 |
467 | 1 |
Length
Max length | 7 |
---|---|
Median length | 1 |
Mean length | 1.56 |
Min length | 1 |
Unique
Unique | 3 ? |
---|---|
Unique (%) | 12.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 1318124 |
5th row | 2249928 |
Common Values
Value | Count | Frequency (%) |
0 | 22 | |
1318124 | 1 | 4.0% |
2249928 | 1 | 4.0% |
467 | 1 | 4.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 22 | |
1318124 | 1 | 4.0% |
2249928 | 1 | 4.0% |
467 | 1 | 4.0% |
통화단위
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 4.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 332.0 B |
KRW |
---|
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | KRW |
---|---|
2nd row | KRW |
3rd row | KRW |
4th row | KRW |
5th row | KRW |
Common Values
Value | Count | Frequency (%) |
KRW | 25 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
krw | 25 |
분석일자 | 원수비용 | 여과수비용 | 음용수비용 | 순수비용 | |
---|---|---|---|---|---|
분석일자 | 1.000 | 0.378 | 0.545 | 0.345 | 0.545 |
원수비용 | 0.378 | 1.000 | 0.540 | 1.000 | 0.540 |
여과수비용 | 0.545 | 0.540 | 1.000 | 1.000 | 1.000 |
음용수비용 | 0.345 | 1.000 | 1.000 | 1.000 | 1.000 |
순수비용 | 0.545 | 0.540 | 1.000 | 1.000 | 1.000 |
순수비용 | 음용수비용 | 여과수비용 | |
---|---|---|---|
순수비용 | 1.000 | 0.956 | 1.000 |
음용수비용 | 0.956 | 1.000 | 0.956 |
여과수비용 | 1.000 | 0.956 | 1.000 |
원수비용 | 여과수비용 | 음용수비용 | 순수비용 | |
---|---|---|---|---|
원수비용 | 1.000 | 0.449 | 0.933 | 0.449 |
여과수비용 | 0.449 | 1.000 | 0.956 | 1.000 |
음용수비용 | 0.933 | 0.956 | 1.000 | 0.956 |
순수비용 | 0.449 | 1.000 | 0.956 | 1.000 |
분석일자 | 원수비용 | 여과수비용 | 음용수비용 | 순수비용 | 통화단위 | |
---|---|---|---|---|---|---|
0 | 2020-07-10 | 42843300 | 0 | 0 | 0 | KRW |
1 | 2020-07-10 | 13721290 | 0 | 0 | 0 | KRW |
2 | 2020-07-10 | 4075826 | 0 | 0 | 0 | KRW |
3 | 2020-07-10 | 0 | 1165887 | 0 | 1318124 | KRW |
4 | 2020-07-10 | 0 | 2226968 | 0 | 2249928 | KRW |
5 | 2020-01-11 | 0 | 0 | 0 | 0 | KRW |
6 | 2020-01-11 | 0 | 0 | 0 | 0 | KRW |
7 | 2020-01-11 | 0 | 0 | 0 | 0 | KRW |
8 | 2020-01-11 | 0 | 0 | 0 | 0 | KRW |
9 | 2020-01-11 | 0 | 0 | 0 | 0 | KRW |
분석일자 | 원수비용 | 여과수비용 | 음용수비용 | 순수비용 | 통화단위 | |
---|---|---|---|---|---|---|
15 | 2020-01-12 | 43384000 | 0 | 0 | 0 | KRW |
16 | 2020-01-12 | 12135580 | 0 | 0 | 0 | KRW |
17 | 2020-01-12 | 6825654 | 0 | 0 | 0 | KRW |
18 | 2020-01-12 | 0 | 0 | 0 | 0 | KRW |
19 | 2020-01-12 | 0 | 0 | 0 | 0 | KRW |
20 | 2020-01-16 | 65554521 | 277 | 268 | 467 | KRW |
21 | 2020-01-16 | 12931213 | 0 | 0 | 0 | KRW |
22 | 2020-01-16 | 0 | 0 | 0 | 0 | KRW |
23 | 2020-01-16 | 0 | 0 | 0 | 0 | KRW |
24 | 2020-01-16 | 0 | 0 | 0 | 0 | KRW |
Most frequently occurring
분석일자 | 원수비용 | 여과수비용 | 음용수비용 | 순수비용 | 통화단위 | # duplicates | |
---|---|---|---|---|---|---|---|
0 | 2020-01-11 | 0 | 0 | 0 | 0 | KRW | 10 |
2 | 2020-01-16 | 0 | 0 | 0 | 0 | KRW | 3 |
1 | 2020-01-12 | 0 | 0 | 0 | 0 | KRW | 2 |