Overview

Dataset statistics

Number of variables5
Number of observations26
Missing cells4
Missing cells (%)3.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.2 KiB
Average record size in memory48.1 B

Variable types

Categorical2
Numeric3

Dataset

Description2020년 국내원전(고리, 신고리, 월성, 신월성, 한빛, 한울) 호기별 발전량(2019년 발전량(MWh),2020년 발전량(MWh),계통연결 이후 누계 발전량(MWh))
Author원자력안전위원회
URLhttps://www.data.go.kr/data/15046077/fileData.do

Alerts

2019년 발전량(MWh) has 2 (7.7%) missing valuesMissing
2020년 발전량(MWh) has 2 (7.7%) missing valuesMissing
계통연결 이후 누계 발전량(MWh) has unique valuesUnique
2019년 발전량(MWh) has 2 (7.7%) zerosZeros
2020년 발전량(MWh) has 1 (3.8%) zerosZeros

Reproduction

Analysis started2023-12-12 23:08:34.240787
Analysis finished2023-12-12 23:08:35.798775
Duration1.56 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

Distinct6
Distinct (%)23.1%
Missing0
Missing (%)0.0%
Memory size340.0 B
한빛
한울
고리
신고리
월성

Length

Max length3
Median length2
Mean length2.2307692
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row고리
2nd row고리
3rd row고리
4th row고리
5th row신고리

Common Values

ValueCountFrequency (%)
한빛 6
23.1%
한울 6
23.1%
고리 4
15.4%
신고리 4
15.4%
월성 4
15.4%
신월성 2
 
7.7%

Length

2023-12-13T08:08:35.863158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:08:35.967715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
한빛 6
23.1%
한울 6
23.1%
고리 4
15.4%
신고리 4
15.4%
월성 4
15.4%
신월성 2
 
7.7%

호기
Categorical

Distinct6
Distinct (%)23.1%
Missing0
Missing (%)0.0%
Memory size340.0 B
#1
#2
#3
#4
#5

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row#1
2nd row#2
3rd row#3
4th row#4
5th row#1

Common Values

ValueCountFrequency (%)
#1 6
23.1%
#2 6
23.1%
#3 5
19.2%
#4 5
19.2%
#5 2
 
7.7%
#6 2
 
7.7%

Length

2023-12-13T08:08:36.121609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:08:36.226531image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 6
23.1%
2 6
23.1%
3 5
19.2%
4 5
19.2%
5 2
 
7.7%
6 2
 
7.7%

2019년 발전량(MWh)
Real number (ℝ)

MISSING  ZEROS 

Distinct23
Distinct (%)95.8%
Missing2
Missing (%)7.7%
Infinite0
Infinite (%)0.0%
Mean6079569.5
Minimum0
Maximum11496255
Zeros2
Zeros (%)7.7%
Negative0
Negative (%)0.0%
Memory size366.0 B
2023-12-13T08:08:36.351327image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile224194.65
Q14956199.5
median6839773.5
Q37294901.2
95-th percentile9086530.6
Maximum11496255
Range11496255
Interquartile range (IQR)2338701.8

Descriptive statistics

Standard deviation2698260.4
Coefficient of variation (CV)0.44382425
Kurtosis1.107062
Mean6079569.5
Median Absolute Deviation (MAD)911593
Skewness-0.83494528
Sum1.4590967 × 108
Variance7.280609 × 1012
MonotonicityNot monotonic
2023-12-13T08:08:36.495516image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=23)
ValueCountFrequency (%)
0 2
 
7.7%
6217804 1
 
3.8%
7113005 1
 
3.8%
6829722 1
 
3.8%
8808352 1
 
3.8%
6976184 1
 
3.8%
6931558 1
 
3.8%
7384602 1
 
3.8%
6751494 1
 
3.8%
9135621 1
 
3.8%
Other values (13) 13
50.0%
(Missing) 2
 
7.7%
ValueCountFrequency (%)
0 2
7.7%
1494631 1
3.8%
3412033 1
3.8%
4603070 1
3.8%
4677717 1
3.8%
5049027 1
3.8%
5965175 1
3.8%
6217804 1
3.8%
6444361 1
3.8%
6751494 1
3.8%
ValueCountFrequency (%)
11496255 1
3.8%
9135621 1
3.8%
8808352 1
3.8%
7801495 1
3.8%
7788361 1
3.8%
7384602 1
3.8%
7265001 1
3.8%
7113005 1
3.8%
6976184 1
3.8%
6931558 1
3.8%

2020년 발전량(MWh)
Real number (ℝ)

MISSING  ZEROS 

Distinct24
Distinct (%)100.0%
Missing2
Missing (%)7.7%
Infinite0
Infinite (%)0.0%
Mean6674321.6
Minimum0
Maximum10033099
Zeros1
Zeros (%)3.8%
Negative0
Negative (%)0.0%
Memory size366.0 B
2023-12-13T08:08:36.622900image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1032002.1
Q14241490.5
median7692360
Q39043808.8
95-th percentile9254393.3
Maximum10033099
Range10033099
Interquartile range (IQR)4802318.2

Descriptive statistics

Standard deviation2948279.2
Coefficient of variation (CV)0.44173466
Kurtosis-0.18164817
Mean6674321.6
Median Absolute Deviation (MAD)1524237
Skewness-1.0008376
Sum1.6018372 × 108
Variance8.6923501 × 1012
MonotonicityNot monotonic
2023-12-13T08:08:36.744654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
6873748 1
 
3.8%
6916618 1
 
3.8%
9230505 1
 
3.8%
7918580 1
 
3.8%
9245935 1
 
3.8%
7885757 1
 
3.8%
6988234 1
 
3.8%
9255886 1
 
3.8%
2856219 1
 
3.8%
0 1
 
3.8%
Other values (14) 14
53.8%
(Missing) 2
 
7.7%
ValueCountFrequency (%)
0 1
3.8%
827142 1
3.8%
2192876 1
3.8%
2856219 1
3.8%
3635332 1
3.8%
4077437 1
3.8%
4296175 1
3.8%
6873748 1
3.8%
6916618 1
3.8%
6988234 1
3.8%
ValueCountFrequency (%)
10033099 1
3.8%
9255886 1
3.8%
9245935 1
3.8%
9230505 1
3.8%
9222088 1
3.8%
9211106 1
3.8%
8988043 1
3.8%
8963880 1
3.8%
8501084 1
3.8%
8326081 1
3.8%
Distinct26
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.5011574 × 108
Minimum16477460
Maximum2.577347 × 108
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size366.0 B
2023-12-13T08:08:36.856158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum16477460
5-th percentile42874102
Q11.2079803 × 108
median1.4722615 × 108
Q31.8423684 × 108
95-th percentile2.5610235 × 108
Maximum2.577347 × 108
Range2.4125724 × 108
Interquartile range (IQR)63438805

Descriptive statistics

Standard deviation71021404
Coefficient of variation (CV)0.47311099
Kurtosis-0.78940404
Mean1.5011574 × 108
Median Absolute Deviation (MAD)36946324
Skewness-0.11178022
Sum3.9030091 × 109
Variance5.0440399 × 1015
MonotonicityNot monotonic
2023-12-13T08:08:36.973774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%)
156022432 1
 
3.8%
253169140 1
 
3.8%
128852550 1
 
3.8%
135898027 1
 
3.8%
164048906 1
 
3.8%
178484240 1
 
3.8%
236060814 1
 
3.8%
238347012 1
 
3.8%
145404848 1
 
3.8%
145969241 1
 
3.8%
Other values (16) 16
61.5%
ValueCountFrequency (%)
16477460 1
3.8%
42855083 1
3.8%
42931160 1
3.8%
65089942 1
3.8%
65585878 1
3.8%
67275321 1
3.8%
120697372 1
3.8%
121100003 1
3.8%
128852550 1
3.8%
130604029 1
3.8%
ValueCountFrequency (%)
257734698 1
3.8%
257080085 1
3.8%
253169140 1
3.8%
242160989 1
3.8%
238347012 1
3.8%
236060814 1
3.8%
184301200 1
3.8%
184043740 1
3.8%
178484240 1
3.8%
174331898 1
3.8%

Interactions

2023-12-13T08:08:34.967695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:08:34.413121image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:08:34.678907image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:08:35.082194image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:08:34.507544image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:08:34.766527image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:08:35.474331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:08:34.587630image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:08:34.851942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T08:08:37.050494image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분호기2019년 발전량(MWh)2020년 발전량(MWh)계통연결 이후 누계 발전량(MWh)
구분1.0000.0000.5530.7450.782
호기0.0001.0000.0000.0000.000
2019년 발전량(MWh)0.5530.0001.0000.9230.372
2020년 발전량(MWh)0.7450.0000.9231.0000.537
계통연결 이후 누계 발전량(MWh)0.7820.0000.3720.5371.000
2023-12-13T08:08:37.148762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분호기
구분1.0000.000
호기0.0001.000
2023-12-13T08:08:37.230329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2019년 발전량(MWh)2020년 발전량(MWh)계통연결 이후 누계 발전량(MWh)구분호기
2019년 발전량(MWh)1.0000.318-0.2460.3050.000
2020년 발전량(MWh)0.3181.000-0.3380.4950.000
계통연결 이후 누계 발전량(MWh)-0.246-0.3381.0000.3560.000
구분0.3050.4950.3561.0000.000
호기0.0000.0000.0000.0001.000

Missing values

2023-12-13T08:08:35.572840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:08:35.662107image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T08:08:35.749824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

구분호기2019년 발전량(MWh)2020년 발전량(MWh)계통연결 이후 누계 발전량(MWh)
0고리#1<NA><NA>156022432
1고리#259651752192876184301200
2고리#362178047238931257080085
3고리#446777177498963257734698
4신고리#16849825832608167275321
5신고리#27265001850108465089942
6신고리#311496255896388042855083
7신고리#464443611003309916477460
8월성#1<NA><NA>148483052
9월성#250490274077437130604029
구분호기2019년 발전량(MWh)2020년 발전량(MWh)계통연결 이후 누계 발전량(MWh)
16한빛#30827142184043740
17한빛#400174331898
18한빛#591356212856219145969241
19한빛#667514949255886145404848
20한울#173846026988234238347012
21한울#269315587885757236060814
22한울#369761849245935178484240
23한울#488083527918580164048906
24한울#568297229230505135898027
25한울#671130056916618128852550