Overview

Dataset statistics

Number of variables7
Number of observations57
Missing cells27
Missing cells (%)6.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.2 KiB
Average record size in memory58.3 B

Variable types

Categorical4
Text2
DateTime1

Dataset

Description주요 원전공급국가의 해외 원전건설 현황에 대한 데이터로, 공급국가명, 대상국가명, 원전명, 체결시점, 진행상태 등의 항목을 포함하여 제공합니다.
URLhttps://www.data.go.kr/data/15100959/fileData.do

Alerts

공급국가명 is highly overall correlated with 노형High correlation
대상국가명 is highly overall correlated with 노형 and 1 other fieldsHigh correlation
노형 is highly overall correlated with 공급국가명 and 1 other fieldsHigh correlation
진행상태 is highly overall correlated with 대상국가명High correlation
체결시점 has 27 (47.4%) missing valuesMissing

Reproduction

Analysis started2023-12-12 13:16:30.909009
Analysis finished2023-12-12 13:16:31.514230
Duration0.61 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

공급국가명
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)10.5%
Missing0
Missing (%)0.0%
Memory size588.0 B
러시아
29 
미국
12 
중국
프랑스
러시아
 
2

Length

Max length4
Median length3
Mean length2.6666667
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row러시아
2nd row러시아
3rd row러시아
4th row러시아
5th row러시아

Common Values

ValueCountFrequency (%)
러시아 29
50.9%
미국 12
21.1%
중국 7
 
12.3%
프랑스 5
 
8.8%
러시아 2
 
3.5%
일본 2
 
3.5%

Length

2023-12-12T22:16:31.596746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:16:31.742001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
러시아 31
54.4%
미국 12
 
21.1%
중국 7
 
12.3%
프랑스 5
 
8.8%
일본 2
 
3.5%

대상국가명
Categorical

HIGH CORRELATION 

Distinct22
Distinct (%)38.6%
Missing0
Missing (%)0.0%
Memory size588.0 B
중국
12 
인도
튀르키예
이집트
영국
Other values (17)
24 

Length

Max length8
Median length6
Mean length3.122807
Min length2

Unique

Unique10 ?
Unique (%)17.5%

Sample

1st row벨라루스
2nd row인도
3rd row인도
4th row인도
5th row인도

Common Values

ValueCountFrequency (%)
중국 12
21.1%
인도 7
12.3%
튀르키예 6
10.5%
이집트 4
 
7.0%
영국 4
 
7.0%
핀란드 2
 
3.5%
베트남 2
 
3.5%
헝거리 2
 
3.5%
파키스탄 2
 
3.5%
방글라데시 2
 
3.5%
Other values (12) 14
24.6%

Length

2023-12-12T22:16:31.850716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
중국 12
21.1%
인도 7
12.3%
튀르키예 6
10.5%
이집트 4
 
7.0%
영국 4
 
7.0%
파키스탄 2
 
3.5%
이란 2
 
3.5%
방글라데시 2
 
3.5%
루마니아 2
 
3.5%
헝거리 2
 
3.5%
Other values (12) 14
24.6%
Distinct56
Distinct (%)98.2%
Missing0
Missing (%)0.0%
Memory size588.0 B
2023-12-12T22:16:32.040686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length15
Mean length11.403509
Min length2

Characters and Unicode

Total characters650
Distinct characters59
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique55 ?
Unique (%)96.5%

Sample

1st rowOstrovets 2호기
2nd rowKudankulam 3호기
3rd rowKudankulam 4호기
4th rowKudankulam 5호기
5th rowKudankulam 6호기
ValueCountFrequency (%)
3호기 11
 
9.3%
2호기 9
 
7.6%
1호기 8
 
6.8%
4호기 6
 
5.1%
dabaa 4
 
3.4%
kudankulam 4
 
3.4%
tianwan 4
 
3.4%
el 4
 
3.4%
akkuyu 4
 
3.4%
ninh 2
 
1.7%
Other values (47) 62
52.5%
2023-12-12T22:16:32.366016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 78
 
12.0%
62
 
9.5%
46
 
7.1%
40
 
6.2%
n 38
 
5.8%
u 29
 
4.5%
i 27
 
4.2%
k 19
 
2.9%
h 18
 
2.8%
l 18
 
2.8%
Other values (49) 275
42.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 370
56.9%
Other Letter 90
 
13.8%
Uppercase Letter 72
 
11.1%
Space Separator 62
 
9.5%
Decimal Number 53
 
8.2%
Other Punctuation 2
 
0.3%
Dash Punctuation 1
 
0.2%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 78
21.1%
n 38
10.3%
u 29
 
7.8%
i 27
 
7.3%
k 19
 
5.1%
h 18
 
4.9%
l 18
 
4.9%
o 18
 
4.9%
e 18
 
4.9%
d 15
 
4.1%
Other values (13) 92
24.9%
Uppercase Letter
ValueCountFrequency (%)
T 9
12.5%
K 7
 
9.7%
C 7
 
9.7%
B 6
 
8.3%
A 5
 
6.9%
S 4
 
5.6%
D 4
 
5.6%
E 4
 
5.6%
H 4
 
5.6%
V 3
 
4.2%
Other values (9) 19
26.4%
Decimal Number
ValueCountFrequency (%)
2 13
24.5%
3 13
24.5%
1 11
20.8%
4 8
15.1%
6 3
 
5.7%
8 2
 
3.8%
5 2
 
3.8%
7 1
 
1.9%
Other Letter
ValueCountFrequency (%)
46
51.1%
40
44.4%
1
 
1.1%
1
 
1.1%
1
 
1.1%
1
 
1.1%
Space Separator
ValueCountFrequency (%)
62
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 442
68.0%
Common 118
 
18.2%
Hangul 90
 
13.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 78
17.6%
n 38
 
8.6%
u 29
 
6.6%
i 27
 
6.1%
k 19
 
4.3%
h 18
 
4.1%
l 18
 
4.1%
o 18
 
4.1%
e 18
 
4.1%
d 15
 
3.4%
Other values (32) 164
37.1%
Common
ValueCountFrequency (%)
62
52.5%
2 13
 
11.0%
3 13
 
11.0%
1 11
 
9.3%
4 8
 
6.8%
6 3
 
2.5%
, 2
 
1.7%
8 2
 
1.7%
5 2
 
1.7%
7 1
 
0.8%
Hangul
ValueCountFrequency (%)
46
51.1%
40
44.4%
1
 
1.1%
1
 
1.1%
1
 
1.1%
1
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 560
86.2%
Hangul 90
 
13.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 78
 
13.9%
62
 
11.1%
n 38
 
6.8%
u 29
 
5.2%
i 27
 
4.8%
k 19
 
3.4%
h 18
 
3.2%
l 18
 
3.2%
o 18
 
3.2%
e 18
 
3.2%
Other values (43) 235
42.0%
Hangul
ValueCountFrequency (%)
46
51.1%
40
44.4%
1
 
1.1%
1
 
1.1%
1
 
1.1%
1
 
1.1%

노형
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)22.8%
Missing0
Missing (%)0.0%
Memory size588.0 B
VVER-1200
23 
AP1000
VVER-1100
EPR1600
Hualong1
Other values (8)
10 

Length

Max length10
Median length9
Mean length7.6842105
Min length3

Unique

Unique6 ?
Unique (%)10.5%

Sample

1st rowVVER-1200
2nd rowVVER V-491
3rd rowVVER-1100
4th rowVVER-1100
5th rowVVER-1100

Common Values

ValueCountFrequency (%)
VVER-1200 23
40.4%
AP1000 9
 
15.8%
VVER-1100 6
 
10.5%
EPR1600 5
 
8.8%
Hualong1 4
 
7.0%
ABWR 2
 
3.5%
Candu6 2
 
3.5%
VVER V-491 1
 
1.8%
VVER 1
 
1.8%
ATMEA1 1
 
1.8%
Other values (3) 3
 
5.3%

Length

2023-12-12T22:16:32.498266image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
vver-1200 23
39.7%
ap1000 9
 
15.5%
vver-1100 6
 
10.3%
epr1600 5
 
8.6%
hualong1 4
 
6.9%
abwr 2
 
3.4%
candu6 2
 
3.4%
vver 2
 
3.4%
v-491 1
 
1.7%
atmea1 1
 
1.7%
Other values (3) 3
 
5.2%

체결시점
Date

MISSING 

Distinct16
Distinct (%)53.3%
Missing27
Missing (%)47.4%
Memory size588.0 B
Minimum1976-02-01 00:00:00
Maximum2022-10-28 00:00:00
2023-12-12T22:16:32.604654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:16:32.713836image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=16)

진행상태
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)12.3%
Missing0
Missing (%)0.0%
Memory size588.0 B
건설중
19 
건설착수전
13 
건설완료
11 
사업중단
계획중단
Other values (2)

Length

Max length5
Median length4
Mean length3.8947368
Min length3

Unique

Unique1 ?
Unique (%)1.8%

Sample

1st row건설중
2nd row건설중
3rd row건설중
4th row건설중
5th row건설중

Common Values

ValueCountFrequency (%)
건설중 19
33.3%
건설착수전 13
22.8%
건설완료 11
19.3%
사업중단 8
14.0%
계획중단 3
 
5.3%
협상중단 2
 
3.5%
건설취소 1
 
1.8%

Length

2023-12-12T22:16:32.889660image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:16:33.011219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
건설중 19
33.3%
건설착수전 13
22.8%
건설완료 11
19.3%
사업중단 8
14.0%
계획중단 3
 
5.3%
협상중단 2
 
3.5%
건설취소 1
 
1.8%
Distinct41
Distinct (%)71.9%
Missing0
Missing (%)0.0%
Memory size588.0 B
2023-12-12T22:16:33.258336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length71
Median length53
Mean length19.666667
Min length6

Characters and Unicode

Total characters1121
Distinct characters167
Distinct categories8 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique31 ?
Unique (%)54.4%

Sample

1st row2023년내 상업운전 예상
2nd row2026년 상업운전 예정
3rd row2027년 상업운전 예정
4th row2021년 6월 건설 시작
5th row2021년 12월 건설 시작
ValueCountFrequency (%)
상업운전 28
 
11.5%
예정 13
 
5.3%
2018년 7
 
2.9%
목표 6
 
2.5%
2027년 5
 
2.0%
건설계획 5
 
2.0%
2021년 4
 
1.6%
2022년 4
 
1.6%
가동목표 4
 
1.6%
2028년 4
 
1.6%
Other values (119) 164
67.2%
2023-12-12T22:16:33.712244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
187
 
16.7%
2 102
 
9.1%
0 58
 
5.2%
55
 
4.9%
44
 
3.9%
38
 
3.4%
31
 
2.8%
29
 
2.6%
1 25
 
2.2%
19
 
1.7%
Other values (157) 533
47.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 654
58.3%
Decimal Number 244
 
21.8%
Space Separator 187
 
16.7%
Other Punctuation 16
 
1.4%
Uppercase Letter 9
 
0.8%
Close Punctuation 4
 
0.4%
Open Punctuation 4
 
0.4%
Math Symbol 3
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
55
 
8.4%
44
 
6.7%
38
 
5.8%
31
 
4.7%
29
 
4.4%
19
 
2.9%
18
 
2.8%
17
 
2.6%
15
 
2.3%
14
 
2.1%
Other values (136) 374
57.2%
Decimal Number
ValueCountFrequency (%)
2 102
41.8%
0 58
23.8%
1 25
 
10.2%
8 15
 
6.1%
3 12
 
4.9%
7 9
 
3.7%
6 9
 
3.7%
4 6
 
2.5%
9 5
 
2.0%
5 3
 
1.2%
Uppercase Letter
ValueCountFrequency (%)
C 3
33.3%
W 2
22.2%
E 2
22.2%
N 1
 
11.1%
G 1
 
11.1%
Other Punctuation
ValueCountFrequency (%)
, 14
87.5%
% 2
 
12.5%
Space Separator
ValueCountFrequency (%)
187
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Math Symbol
ValueCountFrequency (%)
~ 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 652
58.2%
Common 458
40.9%
Latin 9
 
0.8%
Han 2
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
55
 
8.4%
44
 
6.7%
38
 
5.8%
31
 
4.8%
29
 
4.4%
19
 
2.9%
18
 
2.8%
17
 
2.6%
15
 
2.3%
14
 
2.1%
Other values (135) 372
57.1%
Common
ValueCountFrequency (%)
187
40.8%
2 102
22.3%
0 58
 
12.7%
1 25
 
5.5%
8 15
 
3.3%
, 14
 
3.1%
3 12
 
2.6%
7 9
 
2.0%
6 9
 
2.0%
4 6
 
1.3%
Other values (6) 21
 
4.6%
Latin
ValueCountFrequency (%)
C 3
33.3%
W 2
22.2%
E 2
22.2%
N 1
 
11.1%
G 1
 
11.1%
Han
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 652
58.2%
ASCII 467
41.7%
CJK 2
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
187
40.0%
2 102
21.8%
0 58
 
12.4%
1 25
 
5.4%
8 15
 
3.2%
, 14
 
3.0%
3 12
 
2.6%
7 9
 
1.9%
6 9
 
1.9%
4 6
 
1.3%
Other values (11) 30
 
6.4%
Hangul
ValueCountFrequency (%)
55
 
8.4%
44
 
6.7%
38
 
5.8%
31
 
4.8%
29
 
4.4%
19
 
2.9%
18
 
2.8%
17
 
2.6%
15
 
2.3%
14
 
2.1%
Other values (135) 372
57.1%
CJK
ValueCountFrequency (%)
2
100.0%

Correlations

2023-12-12T22:16:33.881284image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
공급국가명대상국가명원전명노형체결시점진행상태세부현황
공급국가명1.0000.6980.8710.9650.9300.5820.878
대상국가명0.6981.0000.9950.9160.9800.8740.993
원전명0.8710.9951.0000.0001.0000.3950.988
노형0.9650.9160.0001.0000.9550.7840.944
체결시점0.9300.9801.0000.9551.0000.7940.948
진행상태0.5820.8740.3950.7840.7941.0000.996
세부현황0.8780.9930.9880.9440.9480.9961.000
2023-12-12T22:16:34.012633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
대상국가명진행상태노형공급국가명
대상국가명1.0000.5120.5470.331
진행상태0.5121.0000.4800.389
노형0.5470.4801.0000.829
공급국가명0.3310.3890.8291.000
2023-12-12T22:16:34.145856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
공급국가명대상국가명노형진행상태
공급국가명1.0000.3310.8290.389
대상국가명0.3311.0000.5470.512
노형0.8290.5471.0000.480
진행상태0.3890.5120.4801.000

Missing values

2023-12-12T22:16:31.366043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T22:16:31.475314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

공급국가명대상국가명원전명노형체결시점진행상태세부현황
0러시아벨라루스Ostrovets 2호기VVER-12002012-07-01건설중2023년내 상업운전 예상
1러시아인도Kudankulam 3호기VVER V-4912016-02-01건설중2026년 상업운전 예정
2러시아인도Kudankulam 4호기VVER-11002016-02-01건설중2027년 상업운전 예정
3러시아인도Kudankulam 5호기VVER-11002017-06-01건설중2021년 6월 건설 시작
4러시아인도Kudankulam 6호기VVER-11002017-06-01건설중2021년 12월 건설 시작
5러시아이란Bushehr 2호기VVER-11002014-11-01건설중2024년 상업운전 예정
6러시아이란Bushehr 3호기VVER-11002014-11-01건설착수전2021년 부지작업 착수
7러시아방글라데시Rooppur 1호기VVER-12002011-02-01건설중2024년 하반기 상업운전 예정
8러시아방글라데시Rooppur 2호기VVER-12002011-02-01건설중2024년~2025년 상업운전 예정
9러시아튀르키예Akkuyu 1호기VVER-12002010-05-01건설중2024년 상업운전 예정
공급국가명대상국가명원전명노형체결시점진행상태세부현황
47미국중국Haiyang 1호기AP1000<NA>건설완료2018년 상업운전
48미국중국Haiyang 2호기AP1000<NA>건설완료2019년 상업운전
49미국인도Kovvada 6기AP1000<NA>계획중단2017년 WEC社의 파산으로 무기한 연기, 2023년 웨스팅하우스와 인도정부 원전도입 재논의중
50미국인도Chhaya-Mithi VirdiAP1000<NA>계획중단2017년 WEC社의 파산으로 무기한 연기, 재개 가능성 있음
51미국인도IgneadaAP1000<NA>계획중단추후계획논의
52미국영국Wylfa NewyddAP1000<NA>사업중단2019년 민간투자자 유치의 어려움을 사유로 사업중단 하였으나, 재개 가능성 있음(웨스팅하우스와 영국정부 원전건설 계속 논의중)
53미국리투아니아VisaginasABWR2011-05-01사업중단2016년 국민반대(원전반대 62%)로 신규원전 건설계획 취소됨에 따라 사업중단
54미국필리핀BataanWEC6211976-02-01사업중단1984년 고온기능 시험 완료후, 체르노빌 사고 및 일부 안전성 논란으로 운영 불허, 2022년 12월 건설재개 검토중
55미국대만Lungmen 1호기BWR<NA>사업중단2014년 가동전 안전검사 완료, 시운전 직전 국민반대로 건설 및 운영 중단
56미국폴란드3기AP10002022-10-28건설착수전사전설계 계약, 2033년 가동목표