Overview

Dataset statistics

Number of variables13
Number of observations30
Missing cells12
Missing cells (%)3.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.3 KiB
Average record size in memory111.4 B

Variable types

Numeric3
Categorical5
DateTime5

Dataset

Description2020년 국내 원자력발전소 현황(노형, 원자로공급사, 용량, 착공일, 건설허가일, 착공일(기초굴착일), 운영 허가일, 상업운전 개시일, 설계수명 만료일) - 2020.12월 기준
URLhttps://www.data.go.kr/data/15046076/fileData.do

Alerts

번호 is highly overall correlated with 용량2(MWe) and 2 other fieldsHigh correlation
용량1(MWe) is highly overall correlated with 구분 and 2 other fieldsHigh correlation
용량2(MWe) is highly overall correlated with 번호 and 4 other fieldsHigh correlation
구분 is highly overall correlated with 용량1(MWe) and 2 other fieldsHigh correlation
발전소명 is highly overall correlated with 번호 and 3 other fieldsHigh correlation
노형 is highly overall correlated with 용량2(MWe) and 1 other fieldsHigh correlation
원자로공급사 is highly overall correlated with 번호 and 3 other fieldsHigh correlation
노형 is highly imbalanced (78.9%)Imbalance
운영허가일 has 4 (13.3%) missing valuesMissing
상업운전개시일 has 4 (13.3%) missing valuesMissing
설계수명만료일 has 4 (13.3%) missing valuesMissing

Reproduction

Analysis started2023-12-12 19:43:21.704801
Analysis finished2023-12-12 19:43:23.638762
Duration1.93 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION 

Distinct24
Distinct (%)80.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10.433333
Minimum1
Maximum24
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-13T04:43:23.725803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13.25
median9.5
Q316.75
95-th percentile22.55
Maximum24
Range23
Interquartile range (IQR)13.5

Descriptive statistics

Standard deviation7.5871185
Coefficient of variation (CV)0.72719986
Kurtosis-1.294385
Mean10.433333
Median Absolute Deviation (MAD)6.5
Skewness0.32030157
Sum313
Variance57.564368
MonotonicityNot monotonic
2023-12-13T04:43:23.871888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
1 3
 
10.0%
2 3
 
10.0%
3 2
 
6.7%
4 2
 
6.7%
15 1
 
3.3%
24 1
 
3.3%
23 1
 
3.3%
22 1
 
3.3%
21 1
 
3.3%
20 1
 
3.3%
Other values (14) 14
46.7%
ValueCountFrequency (%)
1 3
10.0%
2 3
10.0%
3 2
6.7%
4 2
6.7%
5 1
 
3.3%
6 1
 
3.3%
7 1
 
3.3%
8 1
 
3.3%
9 1
 
3.3%
10 1
 
3.3%
ValueCountFrequency (%)
24 1
3.3%
23 1
3.3%
22 1
3.3%
21 1
3.3%
20 1
3.3%
19 1
3.3%
18 1
3.3%
17 1
3.3%
16 1
3.3%
15 1
3.3%

구분
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)10.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
운영원전
24 
건설원전
정지원전
 
2

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row정지원전
2nd row정지원전
3rd row운영원전
4th row운영원전
5th row운영원전

Common Values

ValueCountFrequency (%)
운영원전 24
80.0%
건설원전 4
 
13.3%
정지원전 2
 
6.7%

Length

2023-12-13T04:43:24.041051image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:43:24.171939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
운영원전 24
80.0%
건설원전 4
 
13.3%
정지원전 2
 
6.7%

발전소명
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)23.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
신고리
한빛
한울
고리
월성
Other values (2)

Length

Max length3
Median length2
Mean length2.3333333
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row고리
2nd row월성
3rd row고리
4th row고리
5th row고리

Common Values

ValueCountFrequency (%)
신고리 6
20.0%
한빛 6
20.0%
한울 6
20.0%
고리 4
13.3%
월성 4
13.3%
신월성 2
 
6.7%
신한울 2
 
6.7%

Length

2023-12-13T04:43:24.312680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:43:24.446271image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
신고리 6
20.0%
한빛 6
20.0%
한울 6
20.0%
고리 4
13.3%
월성 4
13.3%
신월성 2
 
6.7%
신한울 2
 
6.7%

호기
Categorical

Distinct6
Distinct (%)20.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
#1
#2
#3
#4
#5

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row#1
2nd row#1
3rd row#2
4th row#3
5th row#4

Common Values

ValueCountFrequency (%)
#1 7
23.3%
#2 7
23.3%
#3 5
16.7%
#4 5
16.7%
#5 3
10.0%
#6 3
10.0%

Length

2023-12-13T04:43:24.630062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:43:24.810112image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 7
23.3%
2 7
23.3%
3 5
16.7%
4 5
16.7%
5 3
10.0%
6 3
10.0%

노형
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size372.0 B
PWR
29 
PHWR
 
1

Length

Max length4
Median length3
Mean length3.0333333
Min length3

Unique

Unique1 ?
Unique (%)3.3%

Sample

1st rowPWR
2nd rowPHWR
3rd rowPWR
4th rowPWR
5th rowPWR

Common Values

ValueCountFrequency (%)
PWR 29
96.7%
PHWR 1
 
3.3%

Length

2023-12-13T04:43:24.985242image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:43:25.128796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
pwr 29
96.7%
phwr 1
 
3.3%

원자로공급사
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)23.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
W/H
두중(APR1400)
두중
AECL(CANDU)
두중(OPR1000)
Other values (2)

Length

Max length11
Median length3
Mean length6.5333333
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowW/H
2nd rowAECL(CANDU)
3rd rowW/H
4th rowW/H
5th rowW/H

Common Values

ValueCountFrequency (%)
W/H 6
20.0%
두중(APR1400) 6
20.0%
두중 6
20.0%
AECL(CANDU) 4
13.3%
두중(OPR1000) 4
13.3%
C/E 2
 
6.7%
FRA 2
 
6.7%

Length

2023-12-13T04:43:25.298614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:43:25.463517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
w/h 6
20.0%
두중(apr1400 6
20.0%
두중 6
20.0%
aecl(candu 4
13.3%
두중(opr1000 4
13.3%
c/e 2
 
6.7%
fra 2
 
6.7%

용량1(MWe)
Real number (ℝ)

HIGH CORRELATION 

Distinct7
Distinct (%)23.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1003.8667
Minimum587
Maximum1400
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-13T04:43:25.643362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum587
5-th percentile663.05
Q1950
median1000
Q31000
95-th percentile1400
Maximum1400
Range813
Interquartile range (IQR)50

Descriptive statistics

Standard deviation237.85402
Coefficient of variation (CV)0.23693786
Kurtosis-0.28387199
Mean1003.8667
Median Absolute Deviation (MAD)50
Skewness0.37565115
Sum30116
Variance56574.533
MonotonicityNot monotonic
2023-12-13T04:43:25.828559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
1000 12
40.0%
950 6
20.0%
1400 6
20.0%
700 3
 
10.0%
587 1
 
3.3%
679 1
 
3.3%
650 1
 
3.3%
ValueCountFrequency (%)
587 1
 
3.3%
650 1
 
3.3%
679 1
 
3.3%
700 3
 
10.0%
950 6
20.0%
1000 12
40.0%
1400 6
20.0%
ValueCountFrequency (%)
1400 6
20.0%
1000 12
40.0%
950 6
20.0%
700 3
 
10.0%
679 1
 
3.3%
650 1
 
3.3%
587 1
 
3.3%

용량2(MWe)
Real number (ℝ)

HIGH CORRELATION 

Distinct6
Distinct (%)20.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4403.8667
Minimum587
Maximum5900
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-13T04:43:26.030771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum587
5-th percentile1633.45
Q13125
median4550
Q35900
95-th percentile5900
Maximum5900
Range5313
Interquartile range (IQR)2775

Descriptive statistics

Standard deviation1568.0744
Coefficient of variation (CV)0.35606764
Kurtosis0.2031672
Mean4403.8667
Median Absolute Deviation (MAD)1350
Skewness-0.88135879
Sum132116
Variance2458857.3
MonotonicityNot monotonic
2023-12-13T04:43:26.193639image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
5900 12
40.0%
2800 6
20.0%
4550 5
16.7%
4100 5
16.7%
587 1
 
3.3%
679 1
 
3.3%
ValueCountFrequency (%)
587 1
 
3.3%
679 1
 
3.3%
2800 6
20.0%
4100 5
16.7%
4550 5
16.7%
5900 12
40.0%
ValueCountFrequency (%)
5900 12
40.0%
4550 5
16.7%
4100 5
16.7%
2800 6
20.0%
679 1
 
3.3%
587 1
 
3.3%
Distinct17
Distinct (%)56.7%
Missing0
Missing (%)0.0%
Memory size372.0 B
Minimum1972-05-31 00:00:00
Maximum2016-06-27 00:00:00
2023-12-13T04:43:26.350239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:43:26.506040image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=17)
Distinct17
Distinct (%)56.7%
Missing0
Missing (%)0.0%
Memory size372.0 B
Minimum1905-10-14 00:00:00
Maximum2016-06-28 00:00:00
2023-12-13T04:43:26.640928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:43:26.796051image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=17)

운영허가일
Date

MISSING 

Distinct25
Distinct (%)96.2%
Missing4
Missing (%)13.3%
Memory size372.0 B
Minimum1972-05-31 00:00:00
Maximum2095-06-02 00:00:00
2023-12-13T04:43:26.996878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:43:27.184509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)

상업운전개시일
Date

MISSING 

Distinct26
Distinct (%)100.0%
Missing4
Missing (%)13.3%
Memory size372.0 B
Minimum1978-04-29 00:00:00
Maximum2019-08-29 00:00:00
2023-12-13T04:43:27.390229image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:43:27.594437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=26)

설계수명만료일
Date

MISSING 

Distinct25
Distinct (%)96.2%
Missing4
Missing (%)13.3%
Memory size372.0 B
Minimum2017-06-18 00:00:00
Maximum2079-01-31 00:00:00
2023-12-13T04:43:27.747313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:43:27.920121image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)

Interactions

2023-12-13T04:43:22.858826image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:43:22.215587image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:43:22.480739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:43:22.960782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:43:22.298747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:43:22.603065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:43:23.071871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:43:22.389860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:43:22.764129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T04:43:28.066329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호구분발전소명호기노형원자로공급사용량1(MWe)용량2(MWe)건설허가일착공일(기초굴착일)운영허가일상업운전개시일설계수명만료일
번호1.0000.0000.8860.0000.0000.8650.8540.9120.9520.9520.8651.0000.865
구분0.0001.0000.6050.2980.4270.5790.6110.8441.0001.0001.0001.0001.000
발전소명0.8860.6051.0000.0000.1620.9580.7880.8441.0001.0000.3371.0000.337
호기0.0000.2980.0001.0000.0000.0000.0000.0000.0000.0000.9321.0000.932
노형0.0000.4270.1620.0001.0000.1620.2571.0001.0001.0001.0001.0001.000
원자로공급사0.8650.5790.9580.0000.1621.0000.8970.8091.0001.0001.0001.0001.000
용량1(MWe)0.8540.6110.7880.0000.2570.8971.0000.9201.0001.0001.0001.0001.000
용량2(MWe)0.9120.8440.8440.0001.0000.8090.9201.0001.0001.0000.8491.0000.849
건설허가일0.9521.0001.0000.0001.0001.0001.0001.0001.0001.0000.9631.0000.963
착공일(기초굴착일)0.9521.0001.0000.0001.0001.0001.0001.0001.0001.0000.9631.0000.963
운영허가일0.8651.0000.3370.9321.0001.0001.0000.8490.9630.9631.0001.0001.000
상업운전개시일1.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
설계수명만료일0.8651.0000.3370.9321.0001.0001.0000.8490.9630.9631.0001.0001.000
2023-12-13T04:43:28.290130image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분발전소명노형원자로공급사호기
구분1.0000.4550.6550.4270.094
발전소명0.4551.0000.1340.6800.000
노형0.6550.1341.0000.1340.000
원자로공급사0.4270.6800.1341.0000.000
호기0.0940.0000.0000.0001.000
2023-12-13T04:43:28.770149image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호용량1(MWe)용량2(MWe)구분발전소명호기노형원자로공급사
번호1.0000.0750.7980.0000.6680.0000.0000.629
용량1(MWe)0.0751.000-0.1140.5950.6610.0000.2990.814
용량2(MWe)0.798-0.1141.0000.8470.6880.0000.5980.642
구분0.0000.5950.8471.0000.4550.0940.6550.427
발전소명0.6680.6610.6880.4551.0000.0000.1340.680
호기0.0000.0000.0000.0940.0001.0000.0000.000
노형0.0000.2990.5980.6550.1340.0001.0000.134
원자로공급사0.6290.8140.6420.4270.6800.0000.1341.000

Missing values

2023-12-13T04:43:23.213841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T04:43:23.425429image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T04:43:23.552174image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

번호구분발전소명호기노형원자로공급사용량1(MWe)용량2(MWe)건설허가일착공일(기초굴착일)운영허가일상업운전개시일설계수명만료일
01정지원전고리#1PWRW/H5875871972-05-311971-11-151972-05-311978-04-292017-06-18
12정지원전월성#1PHWRAECL(CANDU)6796791978-02-151977-05-031978-02-151983-04-222019-12-24
21운영원전고리#2PWRW/H65045501978-11-181977-03-011983-08-101983-07-252023-04-08
32운영원전고리#3PWRW/H95045501979-12-241979-04-091984-09-291985-09-302024-09-28
43운영원전고리#4PWRW/H95045501979-12-241979-04-091985-08-071986-04-292025-08-06
54운영원전신고리#1PWR두중(OPR1000)100045502005-07-011905-10-142010-05-192011-02-282050-05-18
65운영원전신고리#2PWR두중(OPR1000)100045502005-07-011905-10-142011-12-022012-07-202051-12-01
76운영원전신고리#3PWR두중(APR1400)140028002008-04-152008-04-152015-10-302016-12-202075-10-29
87운영원전신고리#4PWR두중(APR1400)140028002008-04-152008-04-152019-02-012019-08-292079-01-31
98운영원전월성#2PWRAECL(CANDU)70041001992-08-281991-10-091996-11-021997-07-012026-11-01
번호구분발전소명호기노형원자로공급사용량1(MWe)용량2(MWe)건설허가일착공일(기초굴착일)운영허가일상업운전개시일설계수명만료일
2019운영원전한울#1PWRFRA95059001983-01-251982-03-051987-12-231988-09-102027-12-22
2120운영원전한울#2PWRFRA95059001983-01-251982-03-051988-12-291989-09-302028-12-28
2221운영원전한울#3PWR두중100059001993-07-161992-05-271997-11-081998-08-112037-11-07
2322운영원전한울#4PWR두중100059001993-07-161992-05-271998-10-291999-12-312038-10-28
2423운영원전한울#5PWR두중100059001999-05-171999-01-042003-10-202004-07-292043-10-19
2524운영원전한울#6PWR두중100059001999-05-171999-01-042004-11-122005-04-222044-11-11
261건설원전신고리#5PWR두중(APR1400)140028002016-06-272016-06-28<NA><NA><NA>
272건설원전신고리#6PWR두중(APR1400)140028002016-06-272016-06-28<NA><NA><NA>
283건설원전신한울#1PWR두중(APR1400)140028002011-12-022011-12-03<NA><NA><NA>
294건설원전신한울#2PWR두중(APR1400)140028002011-12-022011-12-03<NA><NA><NA>