Overview

Dataset statistics

Number of variables4
Number of observations239
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.1 KiB
Average record size in memory34.6 B

Variable types

Numeric2
Categorical2

Dataset

Description서부발전 발전소별 발전설비 열효율 정보입니다. 제공데이터는 연도,발전기명,발전원,발전열효율(%) 입니다. - 데이터 예) 2014,태안#1,유연탄,38.66
URLhttps://www.data.go.kr/data/15083347/fileData.do

Alerts

발전열효율(퍼센트) is highly overall correlated with 발전원High correlation
발전기명 is highly overall correlated with 발전원High correlation
발전원 is highly overall correlated with 발전열효율(퍼센트) and 1 other fieldsHigh correlation

Reproduction

Analysis started2023-12-12 20:58:25.912779
Analysis finished2023-12-12 20:58:26.538462
Duration0.63 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연도
Real number (ℝ)

Distinct10
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2018.5272
Minimum2014
Maximum2023
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.2 KiB
2023-12-13T05:58:26.605929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2014
5-th percentile2014
Q12016
median2019
Q32021
95-th percentile2023
Maximum2023
Range9
Interquartile range (IQR)5

Descriptive statistics

Standard deviation2.8620769
Coefficient of variation (CV)0.0014179036
Kurtosis-1.2114622
Mean2018.5272
Median Absolute Deviation (MAD)2
Skewness-0.0051409199
Sum482428
Variance8.1914841
MonotonicityNot monotonic
2023-12-13T05:58:26.735349image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
2017 25
10.5%
2016 24
10.0%
2018 24
10.0%
2019 24
10.0%
2020 24
10.0%
2021 24
10.0%
2022 24
10.0%
2023 24
10.0%
2014 23
9.6%
2015 23
9.6%
ValueCountFrequency (%)
2014 23
9.6%
2015 23
9.6%
2016 24
10.0%
2017 25
10.5%
2018 24
10.0%
2019 24
10.0%
2020 24
10.0%
2021 24
10.0%
2022 24
10.0%
2023 24
10.0%
ValueCountFrequency (%)
2023 24
10.0%
2022 24
10.0%
2021 24
10.0%
2020 24
10.0%
2019 24
10.0%
2018 24
10.0%
2017 25
10.5%
2016 24
10.0%
2015 23
9.6%
2014 23
9.6%

발전기명
Categorical

HIGH CORRELATION 

Distinct25
Distinct (%)10.5%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
태안#1
 
10
태안#2
 
10
군산복합
 
10
태안#3
 
10
태안#4
 
10
Other values (20)
189 

Length

Max length9
Median length7
Mean length6.2133891
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row태안#1
2nd row태안#2
3rd row태안#3
4th row태안#4
5th row태안#5

Common Values

ValueCountFrequency (%)
태안#1 10
 
4.2%
태안#2 10
 
4.2%
군산복합 10
 
4.2%
태안#3 10
 
4.2%
태안#4 10
 
4.2%
태안#5 10
 
4.2%
태안#6 10
 
4.2%
태안#7 10
 
4.2%
태안#8 10
 
4.2%
평택기력#1 10
 
4.2%
Other values (15) 139
58.2%

Length

2023-12-13T05:58:26.868396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
태안#1 10
 
4.2%
태안#2 10
 
4.2%
서인천복합cc#8 10
 
4.2%
서인천복합cc#7 10
 
4.2%
서인천복합cc#6 10
 
4.2%
서인천복합cc#5 10
 
4.2%
서인천복합cc#4 10
 
4.2%
서인천복합cc#3 10
 
4.2%
서인천복합cc#2 10
 
4.2%
서인천복합cc#1 10
 
4.2%
Other values (15) 139
58.2%

발전원
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
LNG
120 
유연탄
95 
중유
24 

Length

Max length3
Median length3
Mean length2.8995816
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row유연탄
2nd row유연탄
3rd row유연탄
4th row유연탄
5th row유연탄

Common Values

ValueCountFrequency (%)
LNG 120
50.2%
유연탄 95
39.7%
중유 24
 
10.0%

Length

2023-12-13T05:58:27.008776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:58:27.127368image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
lng 120
50.2%
유연탄 95
39.7%
중유 24
 
10.0%

발전열효율(퍼센트)
Real number (ℝ)

HIGH CORRELATION 

Distinct215
Distinct (%)90.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean40.471506
Minimum6.06
Maximum56.67
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.2 KiB
2023-12-13T05:58:27.236470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum6.06
5-th percentile34.305
Q138.075
median39.01
Q345.16
95-th percentile47.989
Maximum56.67
Range50.61
Interquartile range (IQR)7.085

Descriptive statistics

Standard deviation6.0417025
Coefficient of variation (CV)0.14928287
Kurtosis6.0629187
Mean40.471506
Median Absolute Deviation (MAD)2.77
Skewness-1.3737659
Sum9672.69
Variance36.502169
MonotonicityNot monotonic
2023-12-13T05:58:27.394733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
38.73 3
 
1.3%
38.66 2
 
0.8%
38.8 2
 
0.8%
38.64 2
 
0.8%
45.53 2
 
0.8%
37.24 2
 
0.8%
38.4 2
 
0.8%
38.57 2
 
0.8%
38.34 2
 
0.8%
44.68 2
 
0.8%
Other values (205) 218
91.2%
ValueCountFrequency (%)
6.06 1
0.4%
15.8 1
0.4%
18.7 1
0.4%
20.45 1
0.4%
20.61 1
0.4%
24.28 1
0.4%
24.39 1
0.4%
25.39 1
0.4%
29.63 1
0.4%
32.16 1
0.4%
ValueCountFrequency (%)
56.67 1
0.4%
52.68 1
0.4%
51.99 1
0.4%
51.54 1
0.4%
51.26 1
0.4%
51.16 1
0.4%
50.44 1
0.4%
50.4 1
0.4%
50.25 1
0.4%
50.06 1
0.4%

Interactions

2023-12-13T05:58:26.232325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:58:26.068887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:58:26.309102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:58:26.140039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T05:58:27.478007image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도발전기명발전원발전열효율(퍼센트)
연도1.0000.0000.1770.321
발전기명0.0001.0000.9480.768
발전원0.1770.9481.0000.776
발전열효율(퍼센트)0.3210.7680.7761.000
2023-12-13T05:58:27.579691image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
발전기명발전원
발전기명1.0000.815
발전원0.8151.000
2023-12-13T05:58:27.662172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도발전열효율(퍼센트)발전기명발전원
연도1.000-0.2790.0000.041
발전열효율(퍼센트)-0.2791.0000.3810.646
발전기명0.0000.3811.0000.815
발전원0.0410.6460.8151.000

Missing values

2023-12-13T05:58:26.409642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T05:58:26.499557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연도발전기명발전원발전열효율(퍼센트)
02014태안#1유연탄38.66
12014태안#2유연탄38.7
22014태안#3유연탄38.45
32014태안#4유연탄38.67
42014태안#5유연탄38.84
52014태안#6유연탄38.96
62014태안#7유연탄38.87
72014태안#8유연탄39.06
82015태안#1유연탄38.68
92015태안#2유연탄38.38
연도발전기명발전원발전열효율(퍼센트)
2292014군산복합LNG49.42
2302015군산복합LNG47.69
2312016군산복합LNG46.02
2322017군산복합LNG43.41
2332018군산복합LNG45.33
2342019군산복합LNG45.06
2352020군산복합LNG44.86
2362021군산복합LNG42.38
2372022군산복합LNG6.06
2382023군산복합LNG38.53