Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 6422 |
Missing cells | 12345 |
Missing cells (%) | 21.4% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 483.0 KiB |
Average record size in memory | 77.0 B |
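The derived figures in the table above follow from the raw counts. A minimal Python sketch of that arithmetic, assuming the missing-cell percentage is taken over every cell (variables × observations) and that 1 KiB is 1024 bytes; the variable names are illustrative:

```python
# Raw counts copied from the "Dataset statistics" table.
n_variables = 9
n_observations = 6422
missing_cells = 12345
total_size_kib = 483.0

# Assumption: the percentage is relative to every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

Under these assumptions the rounded results agree with the reported 21.4% and 77.0 B.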
Variable types
Categorical | 3 |
---|---|
DateTime | 1 |
Numeric | 5 |
Dataset
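Description | Monthly air-pollutant emissions and power generation by unit for Korea Western Power (한국서부발전) plants. Data period: 2002-01 to 2022-07. Contents: plant name (발전소), unit name (호기), power generation (발전량, MWh), sulfur oxides (SOx), nitrogen oxides (NOx), dust (먼지, TSP). Air-pollutant quantities are in tonnes; for dust, only TSP is provided. |
---|---|
Author | Korea Western Power Co., Ltd. (한국서부발전(주)) |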
URL | https://www.data.go.kr/data/15099592/fileData.do |
Alerts
발전소 is highly overall correlated with 발전용량(MW) and 3 other fields | High correlation |
호기 is highly overall correlated with 발전용량(MW) and 2 other fields | High correlation |
비고 is highly overall correlated with 발전용량(MW) and 2 other fields | High correlation |
발전용량(MW) is highly overall correlated with 발전량(MWh) and 3 other fields | High correlation |
발전량(MWh) is highly overall correlated with 발전용량(MW) and 3 other fields | High correlation |
SOx is highly overall correlated with 발전량(MWh) and 2 other fields | High correlation |
NOx is highly overall correlated with 발전량(MWh) and 2 other fields | High correlation |
먼지(TSP) is highly overall correlated with SOx and 1 other field | High correlation |
비고 is highly imbalanced (92.9%) | Imbalance |
발전량(MWh) has 1451 (22.6%) missing values | Missing |
SOx has 4282 (66.7%) missing values | Missing |
NOx has 4357 (67.8%) missing values | Missing |
먼지(TSP) has 2255 (35.1%) missing values | Missing |
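The report does not say how the 92.9% imbalance figure for 비고 is derived. One formula that reproduces it is one minus the Shannon entropy of the class counts, normalized by the maximum entropy for that number of classes. The sketch below assumes this formula; `imbalance_score` is an illustrative helper, not a function taken from the report.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k): 0 for a uniform column, near 1 when one class dominates."""
    k = len(counts)
    if k <= 1:
        return 0.0
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1 - entropy / math.log2(k)

# Class counts for 비고 as listed in its "Common Values" table.
score = imbalance_score([6367, 55])
print(f"{score:.1%}")
```

A perfectly balanced two-class column scores 0 under the same formula, so the score measures how far the column is from uniform.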
Reproduction
Analysis started | 2023-12-12 19:21:47.356799 |
---|---|
Analysis finished | 2023-12-12 19:21:51.525173 |
Duration | 4.17 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
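The reported duration is the difference between the two timestamps above. A short check using only the Python standard library:

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" table.
started = datetime.fromisoformat("2023-12-12 19:21:47.356799")
finished = datetime.fromisoformat("2023-12-12 19:21:51.525173")

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```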
발전소 (power plant)
Categorical
HIGH CORRELATION
 
Distinct | 4 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 50.3 KiB |
태안 | 2717 |
---|---|
서인천 | 1976 |
평택 | 1482 |
군산 | 247 |
Length
Max length | 3 |
---|---|
Median length | 2 |
Mean length | 2.3076923 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 태안 |
---|---|
2nd row | 태안 |
3rd row | 태안 |
4th row | 태안 |
5th row | 태안 |
Common Values
Value | Count | Frequency (%) |
태안 | 2717 | 42.3% |
서인천 | 1976 | 30.8% |
평택 | 1482 | 23.1% |
군산 | 247 | 3.8% |
Words
Value | Count | Frequency (%) |
태안 | 2717 | 42.3% |
서인천 | 1976 | 30.8% |
평택 | 1482 | 23.1% |
군산 | 247 | 3.8% |
호기 (generating unit)
Categorical
HIGH CORRELATION
 
Distinct | 26 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 50.3 KiB |
1호기 | 247 |
---|---|
2호기 | 247 |
3호기 | 247 |
4호기 | 247 |
5호기 | 247 |
Other values (21) | 5187 |
Length
Max length | 8 |
---|---|
Median length | 7 |
Mean length | 5.4615385 |
Min length | 3 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1호기 |
---|---|
2nd row | 1호기 |
3rd row | 1호기 |
4th row | 1호기 |
5th row | 1호기 |
Common Values
Value | Count | Frequency (%) |
1호기 | 247 | 3.8% |
2호기 | 247 | 3.8% |
3호기 | 247 | 3.8% |
4호기 | 247 | 3.8% |
5호기 | 247 | 3.8% |
6호기 | 247 | 3.8% |
7호기 | 247 | 3.8% |
8호기 | 247 | 3.8% |
9호기 | 247 | 3.8% |
10호기 | 247 | 3.8% |
Other values (16) | 3952 | 61.5% |
Words
Value | Count | Frequency (%) |
복합 | 2717 | 26.8% |
기력 | 988 | 9.8% |
2호기 | 494 | 4.9% |
2cc | 494 | 4.9% |
1cc | 494 | 4.9% |
1호기 | 494 | 4.9% |
4호기 | 494 | 4.9% |
3호기 | 494 | 4.9% |
7호기 | 247 | 2.4% |
8호기 | 247 | 2.4% |
Other values (12) | 2964 | 29.3% |
날짜 (date)
Date
Distinct | 247 |
---|---|
Distinct (%) | 3.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 50.3 KiB |
Minimum | 2002-01-01 00:00:00 |
---|---|
Maximum | 2022-07-01 00:00:00 |
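The 247 distinct dates correspond to one value per month from 2002-01 through 2022-07. Combined with the 26 distinct 호기 values, this accounts for every observation, assuming each unit label has exactly one row per month (the per-label count of 247 in the 호기 table is consistent with that). A small sketch with an illustrative helper:

```python
def months_inclusive(start, end):
    """Count calendar months from start to end, both given as (year, month)."""
    (start_year, start_month), (end_year, end_month) = start, end
    return (end_year - start_year) * 12 + (end_month - start_month) + 1

n_months = months_inclusive((2002, 1), (2022, 7))
n_units = 26  # distinct 호기 values reported above

# Assumption: one row per unit label per month.
n_rows = n_units * n_months
print(n_months, n_rows)
```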
발전용량(MW) (generating capacity)
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 8 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 450.50885 |
Minimum | 225 |
---|---|
Maximum | 1050 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 56.6 KiB |
Quantile statistics
Minimum | 225 |
---|---|
5-th percentile | 225 |
Q1 | 225 |
median | 415 |
Q3 | 500 |
95-th percentile | 1050 |
Maximum | 1050 |
Range | 825 |
Interquartile range (IQR) | 275 |
Descriptive statistics
Standard deviation | 235.66978 |
---|---|
Coefficient of variation (CV) | 0.5231191 |
Kurtosis | 0.93370119 |
Mean | 450.50885 |
Median Absolute Deviation (MAD) | 85 |
Skewness | 1.2559296 |
Sum | 2893167.8 |
Variance | 55540.246 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
500.0 | 1976 | 30.8% |
225.0 | 1976 | 30.8% |
350.0 | 988 | 15.4% |
1050.0 | 494 | 7.7% |
346.33 | 247 | 3.8% |
480.0 | 247 | 3.8% |
868.5 | 247 | 3.8% |
718.4 | 247 | 3.8% |
Minimum values
Value | Count | Frequency (%) |
225.0 | 1976 | 30.8% |
346.33 | 247 | 3.8% |
350.0 | 988 | 15.4% |
480.0 | 247 | 3.8% |
500.0 | 1976 | 30.8% |
718.4 | 247 | 3.8% |
868.5 | 247 | 3.8% |
1050.0 | 494 | 7.7% |
Maximum values
Value | Count | Frequency (%) |
1050.0 | 494 | 7.7% |
868.5 | 247 | 3.8% |
718.4 | 247 | 3.8% |
500.0 | 1976 | 30.8% |
480.0 | 247 | 3.8% |
350.0 | 988 | 15.4% |
346.33 | 247 | 3.8% |
225.0 | 1976 | 30.8% |
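Because 발전용량(MW) has only eight distinct values and no missing entries, its descriptive statistics can be recomputed from the frequency table alone. The sketch below assumes the reported variance uses the sample (n − 1) denominator; that convention is an assumption, chosen because it matches the reported figures.

```python
import math

# (value, count) pairs copied from the 발전용량(MW) frequency table.
freq = [
    (500.0, 1976), (225.0, 1976), (350.0, 988), (1050.0, 494),
    (346.33, 247), (480.0, 247), (868.5, 247), (718.4, 247),
]

n = sum(count for _, count in freq)
total = sum(value * count for value, count in freq)
mean = total / n

# Assumption: sample variance with an (n - 1) denominator.
variance = sum(count * (value - mean) ** 2 for value, count in freq) / (n - 1)
std = math.sqrt(variance)
cv = std / mean  # coefficient of variation

print(f"n={n} sum={total:.1f} mean={mean:.5f}")
print(f"variance={variance:.3f} std={std:.5f} cv={cv:.7f}")
```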
발전량(MWh) (power generation)
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 4919 |
---|---|
Distinct (%) | 99.0% |
Missing | 1451 |
Missing (%) | 22.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 187533.35 |
Minimum | 0 |
---|---|
Maximum | 749933 |
Zeros | 1 |
Zeros (%) | < 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 56.6 KiB |
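For the numeric columns with missing values, the reported Distinct (%) appears to be relative to the non-missing rows rather than to all 6422 observations, while Missing (%) is relative to all rows. A short check under that assumption, using the distinct and missing counts reported for the four affected columns:

```python
n_rows = 6422

# column name -> (distinct count, missing count), copied from the report.
columns = {
    "발전량(MWh)": (4919, 1451),
    "SOx": (1197, 4282),
    "NOx": (1309, 4357),
    "먼지(TSP)": (588, 2255),
}

distinct_pct = {}
missing_pct = {}
for name, (distinct, missing) in columns.items():
    present = n_rows - missing  # rows that actually carry a value
    distinct_pct[name] = f"{100 * distinct / present:.1f}"
    missing_pct[name] = f"{100 * missing / n_rows:.1f}"
    print(name, present, distinct_pct[name], missing_pct[name])
```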
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 5786 |
Q1 | 64856.5 |
median | 134443 |
Q3 | 340267.5 |
95-th percentile | 388889.5 |
Maximum | 749933 |
Range | 749933 |
Interquartile range (IQR) | 275411 |
Descriptive statistics
Standard deviation | 149243.57 |
---|---|
Coefficient of variation (CV) | 0.79582415 |
Kurtosis | -0.38194161 |
Mean | 187533.35 |
Median Absolute Deviation (MAD) | 106729 |
Skewness | 0.65176916 |
Sum | 9.3222829 × 10^8 |
Variance | 2.2273644 × 10^10 |
Monotonicity | Not monotonic |
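Several of the statistics above are simple functions of the others. The following sketch recomputes the range, interquartile range, coefficient of variation, and variance for 발전량(MWh) from the reported values, reading the variance as 2.2273644e10; small differences are expected because the inputs are already rounded.

```python
# Values copied from the 발전량(MWh) tables above.
minimum, q1, q3, maximum = 0, 64856.5, 340267.5, 749933
mean, std = 187533.35, 149243.57
reported_variance = 2.2273644e10  # read as 2.2273644 x 10^10

value_range = maximum - minimum
iqr = q3 - q1            # interquartile range
cv = std / mean          # coefficient of variation
variance = std ** 2      # should be close to the reported variance

print(value_range, iqr, f"{cv:.8f}", f"{variance:.7e}")
```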
Common Values
Value | Count | Frequency (%) |
103453 | 2 | < 0.1% |
75762 | 2 | < 0.1% |
27562 | 2 | < 0.1% |
338846 | 2 | < 0.1% |
76900 | 2 | < 0.1% |
353544 | 2 | < 0.1% |
140432 | 2 | < 0.1% |
7235 | 2 | < 0.1% |
31921 | 2 | < 0.1% |
380320 | 2 | < 0.1% |
Other values (4909) | 4951 | 77.1% |
(Missing) | 1451 | 22.6% |
Minimum values
Value | Count | Frequency (%) |
0 | 1 | < 0.1% |
3 | 2 | < 0.1% |
4 | 1 | < 0.1% |
33 | 1 | < 0.1% |
97 | 1 | < 0.1% |
116 | 1 | < 0.1% |
120 | 1 | < 0.1% |
134 | 1 | < 0.1% |
180 | 1 | < 0.1% |
189 | 1 | < 0.1% |
Maximum values
Value | Count | Frequency (%) |
749933 | 1 | < 0.1% |
747394 | 1 | < 0.1% |
743351 | 1 | < 0.1% |
729921 | 1 | < 0.1% |
725428 | 1 | < 0.1% |
722789 | 1 | < 0.1% |
699180 | 1 | < 0.1% |
694927 | 1 | < 0.1% |
690001 | 1 | < 0.1% |
684722 | 1 | < 0.1% |
SOx
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 1197 |
---|---|
Distinct (%) | 55.9% |
Missing | 4282 |
Missing (%) | 66.7% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 73.110607 |
Minimum | 0 |
---|---|
Maximum | 306.2 |
Zeros | 6 |
Zeros (%) | 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 56.6 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 2.4 |
Q1 | 26 |
median | 60 |
Q3 | 113.45 |
95-th percentile | 178.905 |
Maximum | 306.2 |
Range | 306.2 |
Interquartile range (IQR) | 87.45 |
Descriptive statistics
Standard deviation | 57.053789 |
---|---|
Coefficient of variation (CV) | 0.78037636 |
Kurtosis | -0.14614913 |
Mean | 73.110607 |
Median Absolute Deviation (MAD) | 40.45 |
Skewness | 0.74112593 |
Sum | 156456.7 |
Variance | 3255.1349 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
1.1 | 10 | 0.2% |
4.7 | 8 | 0.1% |
0.3 | 8 | 0.1% |
31.0 | 8 | 0.1% |
60.0 | 8 | 0.1% |
3.0 | 7 | 0.1% |
20.0 | 7 | 0.1% |
44.0 | 7 | 0.1% |
40.0 | 7 | 0.1% |
34.0 | 6 | 0.1% |
Other values (1187) | 2064 | 32.1% |
(Missing) | 4282 | 66.7% |
Minimum values
Value | Count | Frequency (%) |
0.0 | 6 | 0.1% |
0.1 | 6 | 0.1% |
0.2 | 3 | < 0.1% |
0.3 | 8 | 0.1% |
0.4 | 3 | < 0.1% |
0.5 | 5 | 0.1% |
0.6 | 4 | 0.1% |
0.7 | 3 | < 0.1% |
0.8 | 4 | 0.1% |
0.9 | 2 | < 0.1% |
Maximum values
Value | Count | Frequency (%) |
306.2 | 1 | < 0.1% |
297.1 | 1 | < 0.1% |
294.5 | 1 | < 0.1% |
264.2 | 1 | < 0.1% |
252.5 | 1 | < 0.1% |
249.6 | 1 | < 0.1% |
247.0 | 1 | < 0.1% |
246.8 | 1 | < 0.1% |
243.1 | 1 | < 0.1% |
241.3 | 1 | < 0.1% |
NOx
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 1309 |
---|---|
Distinct (%) | 63.4% |
Missing | 4357 |
Missing (%) | 67.8% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 110.8431 |
Minimum | 0 |
---|---|
Maximum | 346 |
Zeros | 1 |
Zeros (%) | < 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 56.6 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 5.4 |
Q1 | 33.8 |
median | 75 |
Q3 | 194.9 |
95-th percentile | 258.18 |
Maximum | 346 |
Range | 346 |
Interquartile range (IQR) | 161.1 |
Descriptive statistics
Standard deviation | 88.734012 |
---|---|
Coefficient of variation (CV) | 0.80053709 |
Kurtosis | -1.256433 |
Mean | 110.8431 |
Median Absolute Deviation (MAD) | 60.2 |
Skewness | 0.46005842 |
Sum | 228891 |
Variance | 7873.7249 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
31.0 | 9 | 0.1% |
46.0 | 7 | 0.1% |
42.0 | 7 | 0.1% |
50.0 | 6 | 0.1% |
28.6 | 6 | 0.1% |
33.6 | 6 | 0.1% |
35.7 | 6 | 0.1% |
47.7 | 6 | 0.1% |
47.0 | 6 | 0.1% |
38.7 | 6 | 0.1% |
Other values (1299) | 2000 | 31.1% |
(Missing) | 4357 | 67.8% |
Minimum values
Value | Count | Frequency (%) |
0.0 | 1 | < 0.1% |
0.1 | 4 | 0.1% |
0.2 | 2 | < 0.1% |
0.3 | 3 | < 0.1% |
0.5 | 1 | < 0.1% |
0.6 | 3 | < 0.1% |
0.7 | 2 | < 0.1% |
0.8 | 2 | < 0.1% |
0.9 | 2 | < 0.1% |
1.0 | 3 | < 0.1% |
Maximum values
Value | Count | Frequency (%) |
346.0 | 1 | < 0.1% |
324.9 | 1 | < 0.1% |
318.1 | 1 | < 0.1% |
317.7 | 1 | < 0.1% |
316.7 | 1 | < 0.1% |
310.0 | 1 | < 0.1% |
308.1 | 1 | < 0.1% |
306.9 | 1 | < 0.1% |
306.7 | 1 | < 0.1% |
306.4 | 1 | < 0.1% |
먼지(TSP) (dust, total suspended particulates)
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 588 |
---|---|
Distinct (%) | 14.1% |
Missing | 2255 |
Missing (%) | 35.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 13.30144 |
Minimum | 0 |
---|---|
Maximum | 214.8 |
Zeros | 29 |
Zeros (%) | 0.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 56.6 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0.2 |
Q1 | 2.1 |
median | 8 |
Q3 | 13.85 |
95-th percentile | 50.34 |
Maximum | 214.8 |
Range | 214.8 |
Interquartile range (IQR) | 11.75 |
Descriptive statistics
Standard deviation | 21.512152 |
---|---|
Coefficient of variation (CV) | 1.61728 |
Kurtosis | 23.728474 |
Mean | 13.30144 |
Median Absolute Deviation (MAD) | 5.9 |
Skewness | 4.2277072 |
Sum | 55427.1 |
Variance | 462.7727 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
0.3 | 138 | 2.1% |
0.1 | 135 | 2.1% |
0.2 | 129 | 2.0% |
0.4 | 84 | 1.3% |
0.5 | 64 | 1.0% |
1.0 | 46 | 0.7% |
3.0 | 45 | 0.7% |
0.7 | 43 | 0.7% |
0.6 | 37 | 0.6% |
2.0 | 37 | 0.6% |
Other values (578) | 3409 | 53.1% |
(Missing) | 2255 | 35.1% |
Minimum values
Value | Count | Frequency (%) |
0.0 | 29 | 0.5% |
0.1 | 135 | 2.1% |
0.2 | 129 | 2.0% |
0.3 | 138 | 2.1% |
0.4 | 84 | 1.3% |
0.5 | 64 | 1.0% |
0.6 | 37 | 0.6% |
0.7 | 43 | 0.7% |
0.8 | 26 | 0.4% |
0.9 | 28 | 0.4% |
Maximum values
Value | Count | Frequency (%) |
214.8 | 1 | < 0.1% |
205.0 | 1 | < 0.1% |
204.2 | 1 | < 0.1% |
203.5 | 1 | < 0.1% |
200.0 | 1 | < 0.1% |
199.5 | 1 | < 0.1% |
192.7 | 1 | < 0.1% |
191.5 | 1 | < 0.1% |
186.0 | 1 | < 0.1% |
175.3 | 1 | < 0.1% |
비고 (remarks)
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 50.3 KiB |
<NA> | 6367 |
---|---|
2018-01-01 부 폐지 | 55 |
Length
Max length | 15 |
---|---|
Median length | 4 |
Mean length | 4.0942074 |
Min length | 4 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 6367 | 99.1% |
2018-01-01 부 폐지 | 55 | 0.9% |
Words
Value | Count | Frequency (%) |
na | 6367 | 97.5% |
2018-01-01 | 55 | 0.8% |
부 | 55 | 0.8% |
폐지 | 55 | 0.8% |
Correlations
| | 발전소 | 호기 | 발전용량(MW) | 발전량(MWh) | SOx | NOx | 먼지(TSP) |
---|---|---|---|---|---|---|---|
발전소 | 1.000 | 1.000 | 0.847 | 0.724 | 0.528 | 0.621 | 0.526 |
호기 | 1.000 | 1.000 | 1.000 | 0.794 | 0.514 | 0.528 | 0.570 |
발전용량(MW) | 0.847 | 1.000 | 1.000 | 0.886 | 0.524 | 0.591 | 0.636 |
발전량(MWh) | 0.724 | 0.794 | 0.886 | 1.000 | 0.766 | 0.762 | 0.561 |
SOx | 0.528 | 0.514 | 0.524 | 0.766 | 1.000 | 0.809 | 0.565 |
NOx | 0.621 | 0.528 | 0.591 | 0.762 | 0.809 | 1.000 | 0.140 |
먼지(TSP) | 0.526 | 0.570 | 0.636 | 0.561 | 0.565 | 0.140 | 1.000 |
| | 발전소 | 호기 | 비고 |
---|---|---|---|
발전소 | 1.000 | 0.998 | 1.000 |
호기 | 0.998 | 1.000 | 1.000 |
비고 | 1.000 | 1.000 | 1.000 |
| | 발전용량(MW) | 발전량(MWh) | SOx | NOx | 먼지(TSP) | 발전소 | 호기 | 비고 |
---|---|---|---|---|---|---|---|---|
발전용량(MW) | 1.000 | 0.699 | 0.452 | 0.479 | -0.252 | 0.924 | 0.998 | 1.000 |
발전량(MWh) | 0.699 | 1.000 | 0.724 | 0.766 | 0.211 | 0.529 | 0.435 | 0.000 |
SOx | 0.452 | 0.724 | 1.000 | 0.903 | 0.777 | 0.406 | 0.219 | 0.000 |
NOx | 0.479 | 0.766 | 0.903 | 1.000 | 0.814 | 0.482 | 0.226 | 0.000 |
먼지(TSP) | -0.252 | 0.211 | 0.777 | 0.814 | 1.000 | 0.342 | 0.245 | 0.000 |
발전소 | 0.924 | 0.529 | 0.406 | 0.482 | 0.342 | 1.000 | 0.998 | 1.000 |
호기 | 0.998 | 0.435 | 0.219 | 0.226 | 0.245 | 0.998 | 1.000 | 1.000 |
비고 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 | 1.000 |
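The "High correlation" alerts near the top of the report can be related to the eight-variable matrix above. The sketch below flags every off-diagonal pair whose absolute coefficient is at least 0.5. The 0.5 cutoff and the helper name `highly_correlated` are assumptions made for illustration; the report itself does not state the threshold.

```python
names = ["발전용량(MW)", "발전량(MWh)", "SOx", "NOx", "먼지(TSP)", "발전소", "호기", "비고"]

# Rows copied from the eight-variable correlation matrix, in the same order as `names`.
matrix = [
    [1.000, 0.699, 0.452, 0.479, -0.252, 0.924, 0.998, 1.000],
    [0.699, 1.000, 0.724, 0.766, 0.211, 0.529, 0.435, 0.000],
    [0.452, 0.724, 1.000, 0.903, 0.777, 0.406, 0.219, 0.000],
    [0.479, 0.766, 0.903, 1.000, 0.814, 0.482, 0.226, 0.000],
    [-0.252, 0.211, 0.777, 0.814, 1.000, 0.342, 0.245, 0.000],
    [0.924, 0.529, 0.406, 0.482, 0.342, 1.000, 0.998, 1.000],
    [0.998, 0.435, 0.219, 0.226, 0.245, 0.998, 1.000, 1.000],
    [1.000, 0.000, 0.000, 0.000, 0.000, 1.000, 1.000, 1.000],
]

THRESHOLD = 0.5  # assumed cutoff, not stated in the report

def highly_correlated(names, matrix, threshold):
    """Map each variable to the other variables with |coefficient| >= threshold."""
    return {
        name: [
            names[j]
            for j, value in enumerate(matrix[i])
            if j != i and abs(value) >= threshold
        ]
        for i, name in enumerate(names)
    }

flagged = highly_correlated(names, matrix, THRESHOLD)
for name, partners in flagged.items():
    print(f"{name}: {len(partners)} highly correlated field(s): {', '.join(partners)}")
```

With this cutoff, the number of flagged partners per variable matches the "and N other fields" counts in the alerts, for example two partners (SOx and NOx) for 먼지(TSP).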
First rows
| | 발전소 | 호기 | 날짜 | 발전용량(MW) | 발전량(MWh) | SOx | NOx | 먼지(TSP) | 비고 |
---|---|---|---|---|---|---|---|---|---|
0 | 태안 | 1호기 | 2002-01 | 500.0 | 360555 | <NA> | <NA> | <NA> | <NA> |
1 | 태안 | 1호기 | 2002-02 | 500.0 | 106134 | <NA> | <NA> | <NA> | <NA> |
2 | 태안 | 1호기 | 2002-03 | 500.0 | 73675 | <NA> | <NA> | <NA> | <NA> |
3 | 태안 | 1호기 | 2002-04 | 500.0 | 339544 | <NA> | <NA> | <NA> | <NA> |
4 | 태안 | 1호기 | 2002-05 | 500.0 | 337666 | <NA> | <NA> | <NA> | <NA> |
5 | 태안 | 1호기 | 2002-06 | 500.0 | 341539 | <NA> | <NA> | <NA> | <NA> |
6 | 태안 | 1호기 | 2002-07 | 500.0 | 331669 | <NA> | <NA> | <NA> | <NA> |
7 | 태안 | 1호기 | 2002-08 | 500.0 | 313827 | <NA> | <NA> | <NA> | <NA> |
8 | 태안 | 1호기 | 2002-09 | 500.0 | 223793 | <NA> | <NA> | <NA> | <NA> |
9 | 태안 | 1호기 | 2002-10 | 500.0 | 354079 | <NA> | <NA> | <NA> | <NA> |
Last rows
| | 발전소 | 호기 | 날짜 | 발전용량(MW) | 발전량(MWh) | SOx | NOx | 먼지(TSP) | 비고 |
---|---|---|---|---|---|---|---|---|---|
6412 | 군산 | 복합 CC | 2021-10 | 718.4 | 157349 | <NA> | <NA> | 11.8 | <NA> |
6413 | 군산 | 복합 CC | 2021-11 | 718.4 | 147689 | <NA> | <NA> | 11.1 | <NA> |
6414 | 군산 | 복합 CC | 2021-12 | 718.4 | 128665 | <NA> | <NA> | 9.6 | <NA> |
6415 | 군산 | 복합 CC | 2022-01 | 718.4 | 102682 | <NA> | <NA> | 8.1 | <NA> |
6416 | 군산 | 복합 CC | 2022-02 | 718.4 | 59888 | <NA> | <NA> | 5.0 | <NA> |
6417 | 군산 | 복합 CC | 2022-03 | 718.4 | <NA> | <NA> | <NA> | <NA> | <NA> |
6418 | 군산 | 복합 CC | 2022-04 | 718.4 | <NA> | <NA> | <NA> | <NA> | <NA> |
6419 | 군산 | 복합 CC | 2022-05 | 718.4 | <NA> | <NA> | <NA> | <NA> | <NA> |
6420 | 군산 | 복합 CC | 2022-06 | 718.4 | <NA> | <NA> | <NA> | <NA> | <NA> |
6421 | 군산 | 복합 CC | 2022-07 | 718.4 | 61954 | <NA> | <NA> | 5.7 | <NA> |