Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 683.6 KiB |
Average record size in memory | 70.0 B |
Variable types
Numeric | 5 |
---|---|
Categorical | 1 |
Text | 1 |
Dataset
Description | 한국지역난방공사 기후환경시스템의 배출량 산정결과(온실가스) 자료입니다. 기준연월별 파라미터, 버전 등의 정보를 제공합니다. |
---|---|
Author | 한국지역난방공사 |
URL | https://www.data.go.kr/data/15124177/fileData.do |
배출활동순번 is highly imbalanced (80.0%) | Imbalance |
Reproduction
Analysis started | 2023-12-11 23:34:05.693134 |
---|---|
Analysis finished | 2023-12-11 23:34:09.356584 |
Duration | 3.66 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
기준연월
Real number (ℝ)
Distinct | 40 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 202128.27 |
Minimum | 202001 |
---|---|
Maximum | 202304 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 202001 |
---|---|
5-th percentile | 202003 |
Q1 | 202011 |
median | 202109 |
Q3 | 202207 |
95-th percentile | 202302 |
Maximum | 202304 |
Range | 303 |
Interquartile range (IQR) | 196 |
Descriptive statistics
Standard deviation | 97.25612 |
---|---|
Coefficient of variation (CV) | 0.00048116039 |
Kurtosis | -1.0878852 |
Mean | 202128.27 |
Median Absolute Deviation (MAD) | 98 |
Skewness | 0.17958462 |
Sum | 2.0212828 × 109 |
Variance | 9458.7529 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
202302 | 311 | 3.1% |
202112 | 286 | 2.9% |
202101 | 278 | 2.8% |
202206 | 277 | 2.8% |
202212 | 273 | 2.7% |
202201 | 268 | 2.7% |
202111 | 266 | 2.7% |
202106 | 266 | 2.7% |
202003 | 264 | 2.6% |
202104 | 263 | 2.6% |
Other values (30) | 7248 |
Value | Count | Frequency (%) |
202001 | 254 | |
202002 | 228 | |
202003 | 264 | |
202004 | 251 | |
202005 | 250 | |
202006 | 220 | |
202007 | 241 | |
202008 | 246 | |
202009 | 223 | |
202010 | 249 |
Value | Count | Frequency (%) |
202304 | 210 | |
202303 | 262 | |
202302 | 311 | |
202301 | 253 | |
202212 | 273 | |
202211 | 241 | |
202210 | 228 | |
202209 | 259 | |
202208 | 253 | |
202207 | 255 |
사업장순번
Real number (ℝ)
Distinct | 20 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 13.2973 |
Minimum | 4 |
---|---|
Maximum | 26 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 4 |
---|---|
5-th percentile | 4 |
Q1 | 9 |
median | 13 |
Q3 | 19 |
95-th percentile | 24 |
Maximum | 26 |
Range | 22 |
Interquartile range (IQR) | 10 |
Descriptive statistics
Standard deviation | 5.9533736 |
---|---|
Coefficient of variation (CV) | 0.44771296 |
Kurtosis | -0.94461362 |
Mean | 13.2973 |
Median Absolute Deviation (MAD) | 5 |
Skewness | 0.29026194 |
Sum | 132973 |
Variance | 35.442657 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
13 | 1272 | 12.7% |
14 | 783 | 7.8% |
9 | 703 | 7.0% |
5 | 655 | 6.6% |
19 | 604 | 6.0% |
22 | 574 | 5.7% |
11 | 544 | 5.4% |
7 | 503 | 5.0% |
4 | 501 | 5.0% |
8 | 474 | 4.7% |
Other values (10) | 3387 |
Value | Count | Frequency (%) |
4 | 501 | 5.0% |
5 | 655 | |
6 | 255 | 2.5% |
7 | 503 | 5.0% |
8 | 474 | 4.7% |
9 | 703 | |
10 | 471 | 4.7% |
11 | 544 | |
12 | 467 | 4.7% |
13 | 1272 |
Value | Count | Frequency (%) |
26 | 150 | 1.5% |
24 | 356 | |
23 | 192 | 1.9% |
22 | 574 | |
21 | 454 | |
20 | 466 | |
19 | 604 | |
16 | 270 | 2.7% |
15 | 306 | 3.1% |
14 | 783 |
배출시설순번
Real number (ℝ)
Distinct | 58 |
---|---|
Distinct (%) | 0.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 16.2661 |
Minimum | 1 |
---|---|
Maximum | 80 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 5 |
median | 11 |
Q3 | 22 |
95-th percentile | 44.05 |
Maximum | 80 |
Range | 79 |
Interquartile range (IQR) | 17 |
Descriptive statistics
Standard deviation | 15.226722 |
---|---|
Coefficient of variation (CV) | 0.93610161 |
Kurtosis | 2.7931792 |
Mean | 16.2661 |
Median Absolute Deviation (MAD) | 7 |
Skewness | 1.6274485 |
Sum | 162661 |
Variance | 231.85308 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
3 | 692 | 6.9% |
1 | 569 | 5.7% |
4 | 540 | 5.4% |
8 | 540 | 5.4% |
7 | 487 | 4.9% |
5 | 457 | 4.6% |
2 | 453 | 4.5% |
10 | 394 | 3.9% |
9 | 356 | 3.6% |
6 | 340 | 3.4% |
Other values (48) | 5172 |
Value | Count | Frequency (%) |
1 | 569 | |
2 | 453 | |
3 | 692 | |
4 | 540 | |
5 | 457 | |
6 | 340 | |
7 | 487 | |
8 | 540 | |
9 | 356 | |
10 | 394 |
Value | Count | Frequency (%) |
80 | 27 | |
77 | 44 | |
70 | 54 | |
66 | 18 | 0.2% |
64 | 34 | |
63 | 52 | |
62 | 48 | |
61 | 42 | |
60 | 32 | |
59 | 40 |
배출활동순번
Categorical
IMBALANCE
 
Distinct | 5 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
1 | |
---|---|
2 | 586 |
3 | 102 |
4 | 33 |
5 | 32 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 9247 | |
2 | 586 | 5.9% |
3 | 102 | 1.0% |
4 | 33 | 0.3% |
5 | 32 | 0.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1 | 9247 | |
2 | 586 | 5.9% |
3 | 102 | 1.0% |
4 | 33 | 0.3% |
5 | 32 | 0.3% |
파라미터ID
Real number (ℝ)
Distinct | 259 |
---|---|
Distinct (%) | 2.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 54989.485 |
Minimum | 1 |
---|---|
Maximum | 200233 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 132 |
Q1 | 202 |
median | 353 |
Q3 | 200002 |
95-th percentile | 200029 |
Maximum | 200233 |
Range | 200232 |
Interquartile range (IQR) | 199800 |
Descriptive statistics
Standard deviation | 87324.84 |
---|---|
Coefficient of variation (CV) | 1.588028 |
Kurtosis | -0.87802233 |
Mean | 54989.485 |
Median Absolute Deviation (MAD) | 222 |
Skewness | 1.0555387 |
Sum | 5.4989485 × 108 |
Variance | 7.6256277 × 109 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
200029 | 260 | 2.6% |
200002 | 255 | 2.5% |
211 | 253 | 2.5% |
200011 | 249 | 2.5% |
190 | 249 | 2.5% |
200018 | 248 | 2.5% |
198 | 248 | 2.5% |
200014 | 246 | 2.5% |
200022 | 244 | 2.4% |
209 | 237 | 2.4% |
Other values (249) | 7511 |
Value | Count | Frequency (%) |
1 | 7 | |
5 | 8 | |
9 | 6 | |
13 | 2 | < 0.1% |
17 | 5 | |
21 | 3 | < 0.1% |
24 | 4 | |
26 | 7 | |
29 | 6 | |
32 | 3 | < 0.1% |
Value | Count | Frequency (%) |
200233 | 2 | < 0.1% |
200232 | 2 | < 0.1% |
200231 | 6 | |
200230 | 2 | < 0.1% |
200229 | 5 | |
200228 | 3 | |
200227 | 5 | |
200217 | 3 | |
200216 | 3 | |
200215 | 5 |
파라미터값
Text
Distinct | 1039 |
---|---|
Distinct (%) | 10.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
0 | 4353 | |
1 | 1122 | 11.2% |
56100 | 297 | 3.0% |
39 | 292 | 2.9% |
44 | 265 | 2.6% |
10 | 239 | 2.4% |
3 | 230 | 2.3% |
4 | 198 | 2.0% |
38 | 162 | 1.6% |
74100 | 162 | 1.6% |
Other values (1029) | 2680 |
Most occurring characters
Value | Count | Frequency (%) |
9999 | ||
0 | 6549 | |
1 | 2838 | 9.9% |
3 | 1869 | 6.5% |
4 | 1634 | 5.7% |
5 | 1274 | 4.4% |
6 | 1044 | 3.6% |
9 | 964 | 3.4% |
2 | 945 | 3.3% |
7 | 824 | 2.9% |
Other values (3) | 719 | 2.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 18658 | |
Space Separator | 9999 | |
Open Punctuation | 1 | < 0.1% |
Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 6549 | |
1 | 2838 | |
3 | 1869 | 10.0% |
4 | 1634 | 8.8% |
5 | 1274 | 6.8% |
6 | 1044 | 5.6% |
9 | 964 | 5.2% |
2 | 945 | 5.1% |
7 | 824 | 4.4% |
8 | 717 | 3.8% |
Space Separator
Value | Count | Frequency (%) |
9999 |
Open Punctuation
Value | Count | Frequency (%) |
( | 1 |
Close Punctuation
Value | Count | Frequency (%) |
) | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 28659 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
9999 | ||
0 | 6549 | |
1 | 2838 | 9.9% |
3 | 1869 | 6.5% |
4 | 1634 | 5.7% |
5 | 1274 | 4.4% |
6 | 1044 | 3.6% |
9 | 964 | 3.4% |
2 | 945 | 3.3% |
7 | 824 | 2.9% |
Other values (3) | 719 | 2.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 28659 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
9999 | ||
0 | 6549 | |
1 | 2838 | 9.9% |
3 | 1869 | 6.5% |
4 | 1634 | 5.7% |
5 | 1274 | 4.4% |
6 | 1044 | 3.6% |
9 | 964 | 3.4% |
2 | 945 | 3.3% |
7 | 824 | 2.9% |
Other values (3) | 719 | 2.5% |
파라미터버전순번
Real number (ℝ)
Distinct | 11 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.3409 |
Minimum | 1 |
---|---|
Maximum | 11 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 1 |
Q3 | 1 |
95-th percentile | 3 |
Maximum | 11 |
Range | 10 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 0.98973994 |
---|---|
Coefficient of variation (CV) | 0.73811615 |
Kurtosis | 41.877675 |
Mean | 1.3409 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 5.6305359 |
Sum | 13409 |
Variance | 0.97958515 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 7999 | |
2 | 1344 | 13.4% |
3 | 440 | 4.4% |
4 | 69 | 0.7% |
5 | 46 | 0.5% |
9 | 25 | 0.2% |
11 | 24 | 0.2% |
6 | 18 | 0.2% |
10 | 14 | 0.1% |
8 | 12 | 0.1% |
Value | Count | Frequency (%) |
1 | 7999 | |
2 | 1344 | 13.4% |
3 | 440 | 4.4% |
4 | 69 | 0.7% |
5 | 46 | 0.5% |
6 | 18 | 0.2% |
7 | 9 | 0.1% |
8 | 12 | 0.1% |
9 | 25 | 0.2% |
10 | 14 | 0.1% |
Value | Count | Frequency (%) |
11 | 24 | 0.2% |
10 | 14 | 0.1% |
9 | 25 | 0.2% |
8 | 12 | 0.1% |
7 | 9 | 0.1% |
6 | 18 | 0.2% |
5 | 46 | 0.5% |
4 | 69 | 0.7% |
3 | 440 | 4.4% |
2 | 1344 |
기준연월 | 사업장순번 | 배출시설순번 | 배출활동순번 | 파라미터ID | 파라미터버전순번 | |
---|---|---|---|---|---|---|
기준연월 | 1.000 | 0.098 | 0.040 | 0.042 | 0.000 | 0.049 |
사업장순번 | 0.098 | 1.000 | 0.709 | 0.462 | 0.330 | 0.237 |
배출시설순번 | 0.040 | 0.709 | 1.000 | 0.296 | 0.261 | 0.144 |
배출활동순번 | 0.042 | 0.462 | 0.296 | 1.000 | 0.108 | 0.164 |
파라미터ID | 0.000 | 0.330 | 0.261 | 0.108 | 1.000 | 0.190 |
파라미터버전순번 | 0.049 | 0.237 | 0.144 | 0.164 | 0.190 | 1.000 |
기준연월 | 사업장순번 | 배출시설순번 | 파라미터ID | 파라미터버전순번 | 배출활동순번 | |
---|---|---|---|---|---|---|
기준연월 | 1.000 | 0.042 | -0.000 | -0.005 | -0.006 | 0.035 |
사업장순번 | 0.042 | 1.000 | -0.125 | 0.086 | 0.021 | 0.209 |
배출시설순번 | -0.000 | -0.125 | 1.000 | -0.131 | 0.029 | 0.127 |
파라미터ID | -0.005 | 0.086 | -0.131 | 1.000 | -0.121 | 0.132 |
파라미터버전순번 | -0.006 | 0.021 | 0.029 | -0.121 | 1.000 | 0.062 |
배출활동순번 | 0.035 | 0.209 | 0.127 | 0.132 | 0.062 | 1.000 |
기준연월 | 사업장순번 | 배출시설순번 | 배출활동순번 | 파라미터ID | 파라미터값 | 파라미터버전순번 | |
---|---|---|---|---|---|---|---|
13677 | 202006 | 13 | 34 | 1 | 211 | 56100 | 1 |
98004 | 202304 | 4 | 18 | 1 | 200005 | 0 | 1 |
57723 | 202112 | 7 | 22 | 1 | 112 | 0 | 1 |
79505 | 202208 | 21 | 6 | 1 | 200005 | 0 | 1 |
41532 | 202105 | 15 | 4 | 1 | 200011 | 377952 | 1 |
16604 | 202007 | 19 | 1 | 1 | 10093 | 74100 | 1 |
86340 | 202211 | 14 | 27 | 1 | 202 | 0 | 1 |
10039 | 202005 | 4 | 19 | 1 | 200018 | 0 | 1 |
1123 | 202001 | 11 | 21 | 1 | 209 | 39 | 2 |
83777 | 202210 | 14 | 10 | 1 | 223 | 0 | 1 |
기준연월 | 사업장순번 | 배출시설순번 | 배출활동순번 | 파라미터ID | 파라미터값 | 파라미터버전순번 | |
---|---|---|---|---|---|---|---|
30392 | 202101 | 10 | 20 | 1 | 202 | 721 | 5 |
4921 | 202002 | 24 | 7 | 1 | 10001 | 128 | 1 |
72888 | 202206 | 6 | 11 | 1 | 128 | 0 | 1 |
90069 | 202301 | 5 | 23 | 1 | 132 | 38 | 2 |
89476 | 202212 | 21 | 8 | 1 | 513 | 0 | 1 |
99575 | 202304 | 15 | 3 | 1 | 10083 | 0 | 1 |
63989 | 202202 | 14 | 3 | 1 | 10170 | 62 | 1 |
90964 | 202301 | 11 | 18 | 1 | 214 | 1 | 1 |
19581 | 202008 | 23 | 6 | 1 | 200008 | 0 | 1 |
73776 | 202206 | 13 | 33 | 1 | 209 | 39 | 2 |