Overview

Dataset statistics

Number of variables6
Number of observations26
Missing cells24
Missing cells (%)15.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.4 KiB
Average record size in memory54.1 B

Variable types

Text2
Categorical2
Numeric1
DateTime1

Dataset

Description한수원 원자력발전소 호기별 일반현황입니다. 호기명, 원자로형, 설비용량, 상업운전일, 비고(영구정지) 항목으로 되어 있습니다.
Author한국수력원자력(주)
URLhttps://www.data.go.kr/data/3047879/fileData.do

Alerts

설비용량(MW) is highly overall correlated with 위치 and 1 other fieldsHigh correlation
위치 is highly overall correlated with 설비용량(MW) and 1 other fieldsHigh correlation
원자로형 is highly overall correlated with 설비용량(MW) and 1 other fieldsHigh correlation
비고 has 24 (92.3%) missing valuesMissing
호기명 has unique valuesUnique
상업운전일 has unique valuesUnique

Reproduction

Analysis started2023-12-12 08:52:46.630518
Analysis finished2023-12-12 08:52:47.200179
Duration0.57 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

호기명
Text

UNIQUE 

Distinct26
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size340.0 B
2023-12-12T17:52:47.368170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length5
Mean length5.2307692
Min length5

Characters and Unicode

Total characters136
Distinct characters16
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique26 ?
Unique (%)100.0%

Sample

1st row고리1호기
2nd row고리2호기
3rd row고리3호기
4th row고리4호기
5th row신고리1호기
ValueCountFrequency (%)
고리1호기 1
 
3.8%
고리2호기 1
 
3.8%
한울5호기 1
 
3.8%
한울4호기 1
 
3.8%
한울3호기 1
 
3.8%
한울2호기 1
 
3.8%
한울1호기 1
 
3.8%
한빛6호기 1
 
3.8%
한빛5호기 1
 
3.8%
한빛4호기 1
 
3.8%
Other values (16) 16
61.5%
2023-12-12T17:52:47.734095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
26
19.1%
26
19.1%
12
8.8%
8
 
5.9%
8
 
5.9%
1 6
 
4.4%
2 6
 
4.4%
6
 
4.4%
6
 
4.4%
6
 
4.4%
Other values (6) 26
19.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 110
80.9%
Decimal Number 26
 
19.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
26
23.6%
26
23.6%
12
10.9%
8
 
7.3%
8
 
7.3%
6
 
5.5%
6
 
5.5%
6
 
5.5%
6
 
5.5%
6
 
5.5%
Decimal Number
ValueCountFrequency (%)
1 6
23.1%
2 6
23.1%
3 5
19.2%
4 5
19.2%
5 2
 
7.7%
6 2
 
7.7%

Most occurring scripts

ValueCountFrequency (%)
Hangul 110
80.9%
Common 26
 
19.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
26
23.6%
26
23.6%
12
10.9%
8
 
7.3%
8
 
7.3%
6
 
5.5%
6
 
5.5%
6
 
5.5%
6
 
5.5%
6
 
5.5%
Common
ValueCountFrequency (%)
1 6
23.1%
2 6
23.1%
3 5
19.2%
4 5
19.2%
5 2
 
7.7%
6 2
 
7.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 110
80.9%
ASCII 26
 
19.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
26
23.6%
26
23.6%
12
10.9%
8
 
7.3%
8
 
7.3%
6
 
5.5%
6
 
5.5%
6
 
5.5%
6
 
5.5%
6
 
5.5%
ASCII
ValueCountFrequency (%)
1 6
23.1%
2 6
23.1%
3 5
19.2%
4 5
19.2%
5 2
 
7.7%
6 2
 
7.7%

위치
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)19.2%
Missing0
Missing (%)0.0%
Memory size340.0 B
부산광역시 기장군 장안읍
경상북도 경주시 양남면
전라남도 영광군 홍농읍
경상북도 울진군 북면
울산광역시 울주군 서생면

Length

Max length13
Median length12
Mean length12.076923
Min length11

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부산광역시 기장군 장안읍
2nd row부산광역시 기장군 장안읍
3rd row부산광역시 기장군 장안읍
4th row부산광역시 기장군 장안읍
5th row부산광역시 기장군 장안읍

Common Values

ValueCountFrequency (%)
부산광역시 기장군 장안읍 6
23.1%
경상북도 경주시 양남면 6
23.1%
전라남도 영광군 홍농읍 6
23.1%
경상북도 울진군 북면 6
23.1%
울산광역시 울주군 서생면 2
 
7.7%

Length

2023-12-12T17:52:47.888272image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:52:48.053171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경상북도 12
15.4%
부산광역시 6
7.7%
기장군 6
7.7%
장안읍 6
7.7%
경주시 6
7.7%
양남면 6
7.7%
전라남도 6
7.7%
영광군 6
7.7%
홍농읍 6
7.7%
울진군 6
7.7%
Other values (4) 12
15.4%

원자로형
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)7.7%
Missing0
Missing (%)0.0%
Memory size340.0 B
가압경수로형
22 
가압중수로형

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row가압경수로형
2nd row가압경수로형
3rd row가압경수로형
4th row가압경수로형
5th row가압경수로형

Common Values

ValueCountFrequency (%)
가압경수로형 22
84.6%
가압중수로형 4
 
15.4%

Length

2023-12-12T17:52:48.247195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:52:48.380168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
가압경수로형 22
84.6%
가압중수로형 4
 
15.4%

설비용량(MW)
Real number (ℝ)

HIGH CORRELATION 

Distinct7
Distinct (%)26.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean942.92308
Minimum587
Maximum1400
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size366.0 B
2023-12-12T17:52:48.521112image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum587
5-th percentile657.25
Q1950
median1000
Q31000
95-th percentile1300
Maximum1400
Range813
Interquartile range (IQR)50

Descriptive statistics

Standard deviation191.45839
Coefficient of variation (CV)0.20304773
Kurtosis1.3783728
Mean942.92308
Median Absolute Deviation (MAD)50
Skewness0.38038126
Sum24516
Variance36656.314
MonotonicityNot monotonic
2023-12-12T17:52:48.712067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
1000 12
46.2%
950 6
23.1%
700 3
 
11.5%
1400 2
 
7.7%
587 1
 
3.8%
650 1
 
3.8%
679 1
 
3.8%
ValueCountFrequency (%)
587 1
 
3.8%
650 1
 
3.8%
679 1
 
3.8%
700 3
 
11.5%
950 6
23.1%
1000 12
46.2%
1400 2
 
7.7%
ValueCountFrequency (%)
1400 2
 
7.7%
1000 12
46.2%
950 6
23.1%
700 3
 
11.5%
679 1
 
3.8%
650 1
 
3.8%
587 1
 
3.8%

상업운전일
Date

UNIQUE 

Distinct26
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size340.0 B
Minimum1978-04-29 00:00:00
Maximum2019-08-29 00:00:00
2023-12-12T17:52:48.895297image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:52:49.060436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=26)

비고
Text

MISSING 

Distinct2
Distinct (%)100.0%
Missing24
Missing (%)92.3%
Memory size340.0 B
2023-12-12T17:52:49.247530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length16
Mean length16
Min length16

Characters and Unicode

Total characters32
Distinct characters15
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)100.0%

Sample

1st row영구정지(2017-06-18)
2nd row영구정지(2019-12-24)
ValueCountFrequency (%)
영구정지(2017-06-18 1
50.0%
영구정지(2019-12-24 1
50.0%
2023-12-12T17:52:49.620403image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 4
12.5%
1 4
12.5%
- 4
12.5%
0 3
9.4%
2
 
6.2%
2
 
6.2%
2
 
6.2%
2
 
6.2%
( 2
 
6.2%
) 2
 
6.2%
Other values (5) 5
15.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 16
50.0%
Other Letter 8
25.0%
Dash Punctuation 4
 
12.5%
Open Punctuation 2
 
6.2%
Close Punctuation 2
 
6.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 4
25.0%
1 4
25.0%
0 3
18.8%
7 1
 
6.2%
6 1
 
6.2%
8 1
 
6.2%
9 1
 
6.2%
4 1
 
6.2%
Other Letter
ValueCountFrequency (%)
2
25.0%
2
25.0%
2
25.0%
2
25.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 24
75.0%
Hangul 8
 
25.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 4
16.7%
1 4
16.7%
- 4
16.7%
0 3
12.5%
( 2
8.3%
) 2
8.3%
7 1
 
4.2%
6 1
 
4.2%
8 1
 
4.2%
9 1
 
4.2%
Hangul
ValueCountFrequency (%)
2
25.0%
2
25.0%
2
25.0%
2
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 24
75.0%
Hangul 8
 
25.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 4
16.7%
1 4
16.7%
- 4
16.7%
0 3
12.5%
( 2
8.3%
) 2
8.3%
7 1
 
4.2%
6 1
 
4.2%
8 1
 
4.2%
9 1
 
4.2%
Hangul
ValueCountFrequency (%)
2
25.0%
2
25.0%
2
25.0%
2
25.0%

Interactions

2023-12-12T17:52:46.856908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T17:52:49.751562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
호기명위치원자로형설비용량(MW)상업운전일비고
호기명1.0001.0001.0001.0001.0000.000
위치1.0001.0000.6070.9091.0000.000
원자로형1.0000.6071.0001.0001.0000.000
설비용량(MW)1.0000.9091.0001.0001.000NaN
상업운전일1.0001.0001.0001.0001.0000.000
비고0.0000.0000.000NaN0.0001.000
2023-12-12T17:52:49.941245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
원자로형위치
원자로형1.0000.682
위치0.6821.000
2023-12-12T17:52:50.073156image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
설비용량(MW)위치원자로형
설비용량(MW)1.0000.6100.935
위치0.6101.0000.682
원자로형0.9350.6821.000

Missing values

2023-12-12T17:52:47.013831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:52:47.135892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

호기명위치원자로형설비용량(MW)상업운전일비고
0고리1호기부산광역시 기장군 장안읍가압경수로형5871978-04-29영구정지(2017-06-18)
1고리2호기부산광역시 기장군 장안읍가압경수로형6501983-07-25<NA>
2고리3호기부산광역시 기장군 장안읍가압경수로형9501985-09-30<NA>
3고리4호기부산광역시 기장군 장안읍가압경수로형9501986-04-29<NA>
4신고리1호기부산광역시 기장군 장안읍가압경수로형10002011-02-28<NA>
5신고리2호기부산광역시 기장군 장안읍가압경수로형10002012-07-20<NA>
6신고리3호기울산광역시 울주군 서생면가압경수로형14002016-12-20<NA>
7신고리4호기울산광역시 울주군 서생면가압경수로형14002019-08-29<NA>
8월성1호기경상북도 경주시 양남면가압중수로형6791983-04-22영구정지(2019-12-24)
9월성2호기경상북도 경주시 양남면가압중수로형7001997-07-01<NA>
호기명위치원자로형설비용량(MW)상업운전일비고
16한빛3호기전라남도 영광군 홍농읍가압경수로형10001995-03-31<NA>
17한빛4호기전라남도 영광군 홍농읍가압경수로형10001996-01-01<NA>
18한빛5호기전라남도 영광군 홍농읍가압경수로형10002002-05-21<NA>
19한빛6호기전라남도 영광군 홍농읍가압경수로형10002002-12-24<NA>
20한울1호기경상북도 울진군 북면가압경수로형9501988-09-10<NA>
21한울2호기경상북도 울진군 북면가압경수로형9501989-09-30<NA>
22한울3호기경상북도 울진군 북면가압경수로형10001998-08-11<NA>
23한울4호기경상북도 울진군 북면가압경수로형10001999-12-31<NA>
24한울5호기경상북도 울진군 북면가압경수로형10002004-07-29<NA>
25한울6호기경상북도 울진군 북면가압경수로형10002005-04-22<NA>