Overview

Dataset statistics

Number of variables8
Number of observations462
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory29.5 KiB
Average record size in memory65.3 B

Variable types

Categorical5
Numeric1
Text2

Dataset

Description한국수력원자력 발전소별 작업장소 분류코드, 작업장소명 현황
Author한국수력원자력(주)
URLhttps://www.data.go.kr/data/15070280/fileData.do

Alerts

그룹코드 has constant value ""Constant
원자로구분 is highly overall correlated with 발전소코드 and 3 other fieldsHigh correlation
코드내용 is highly overall correlated with 발전소코드 and 1 other fieldsHigh correlation
작업장소 분류 코드 is highly overall correlated with 원자로구분 and 1 other fieldsHigh correlation
작업장소 분류 상세명 is highly overall correlated with 원자로구분 and 1 other fieldsHigh correlation
발전소코드 is highly overall correlated with 원자로구분 and 1 other fieldsHigh correlation

Reproduction

Analysis started2023-12-12 06:12:02.600463
Analysis finished2023-12-12 06:12:03.260244
Duration0.66 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

그룹코드
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size3.7 KiB
A
462 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowA
2nd rowA
3rd rowA
4th rowA
5th rowA

Common Values

ValueCountFrequency (%)
A 462
100.0%

Length

2023-12-12T15:12:03.325505image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:12:03.410361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
a 462
100.0%

원자로구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size3.7 KiB
경수로
392 
중수로
70 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경수로
2nd row경수로
3rd row경수로
4th row경수로
5th row경수로

Common Values

ValueCountFrequency (%)
경수로 392
84.8%
중수로 70
 
15.2%

Length

2023-12-12T15:12:03.502415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:12:03.603873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경수로 392
84.8%
중수로 70
 
15.2%

발전소코드
Real number (ℝ)

HIGH CORRELATION 

Distinct11
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2346.5368
Minimum2110
Maximum2810
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.2 KiB
2023-12-12T15:12:03.705032image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2110
5-th percentile2110
Q12220
median2320
Q32420
95-th percentile2810
Maximum2810
Range700
Interquartile range (IQR)200

Descriptive statistics

Standard deviation188.11001
Coefficient of variation (CV)0.080164954
Kurtosis1.2258406
Mean2346.5368
Median Absolute Deviation (MAD)100
Skewness1.1789665
Sum1084100
Variance35385.377
MonotonicityIncreasing
2023-12-12T15:12:03.810017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
2320 54
11.7%
2330 54
11.7%
2420 54
11.7%
2430 54
11.7%
2810 47
10.2%
2110 39
8.4%
2120 39
8.4%
2310 39
8.4%
2210 35
7.6%
2220 35
7.6%
ValueCountFrequency (%)
2110 39
8.4%
2120 39
8.4%
2210 35
7.6%
2220 35
7.6%
2310 39
8.4%
2320 54
11.7%
2330 54
11.7%
2410 12
 
2.6%
2420 54
11.7%
2430 54
11.7%
ValueCountFrequency (%)
2810 47
10.2%
2430 54
11.7%
2420 54
11.7%
2410 12
 
2.6%
2330 54
11.7%
2320 54
11.7%
2310 39
8.4%
2220 35
7.6%
2210 35
7.6%
2120 39
8.4%

코드내용
Categorical

HIGH CORRELATION 

Distinct11
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Memory size3.7 KiB
한빛2발
54 
한빛3발
54 
한울2발
54 
한울3발
54 
새울1발
47 
Other values (6)
199 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row고리1발
2nd row고리1발
3rd row고리1발
4th row고리1발
5th row고리1발

Common Values

ValueCountFrequency (%)
한빛2발 54
11.7%
한빛3발 54
11.7%
한울2발 54
11.7%
한울3발 54
11.7%
새울1발 47
10.2%
고리1발 39
8.4%
고리2발 39
8.4%
한빛1발 39
8.4%
월성1발 35
7.6%
월성2발 35
7.6%

Length

2023-12-12T15:12:03.953631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
한빛2발 54
11.7%
한빛3발 54
11.7%
한울2발 54
11.7%
한울3발 54
11.7%
새울1발 47
10.2%
고리1발 39
8.4%
고리2발 39
8.4%
한빛1발 39
8.4%
월성1발 35
7.6%
월성2발 35
7.6%
Distinct187
Distinct (%)40.5%
Missing0
Missing (%)0.0%
Memory size3.7 KiB
2023-12-12T15:12:04.324585image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length4
Mean length4
Min length4

Characters and Unicode

Total characters1848
Distinct characters15
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique59 ?
Unique (%)12.8%

Sample

1st rowFW01
2nd rowFW02
3rd rowFW03
4th rowFW04
5th rowFW05
ValueCountFrequency (%)
fc32 4
 
0.9%
fc41 4
 
0.9%
fc45 4
 
0.9%
fc44 4
 
0.9%
fc43 4
 
0.9%
fc42 4
 
0.9%
fc46 4
 
0.9%
fc34 4
 
0.9%
fc31 4
 
0.9%
fc24 4
 
0.9%
Other values (177) 422
91.3%
2023-12-12T15:12:05.056330image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
F 474
25.6%
C 216
11.7%
1 148
 
8.0%
4 141
 
7.6%
2 123
 
6.7%
W 117
 
6.3%
5 105
 
5.7%
3 99
 
5.4%
6 96
 
5.2%
0 90
 
4.9%
Other values (5) 239
12.9%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 924
50.0%
Decimal Number 924
50.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 148
16.0%
4 141
15.3%
2 123
13.3%
5 105
11.4%
3 99
10.7%
6 96
10.4%
0 90
9.7%
7 56
 
6.1%
8 36
 
3.9%
9 30
 
3.2%
Uppercase Letter
ValueCountFrequency (%)
F 474
51.3%
C 216
23.4%
W 117
 
12.7%
H 70
 
7.6%
O 47
 
5.1%

Most occurring scripts

ValueCountFrequency (%)
Latin 924
50.0%
Common 924
50.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 148
16.0%
4 141
15.3%
2 123
13.3%
5 105
11.4%
3 99
10.7%
6 96
10.4%
0 90
9.7%
7 56
 
6.1%
8 36
 
3.9%
9 30
 
3.2%
Latin
ValueCountFrequency (%)
F 474
51.3%
C 216
23.4%
W 117
 
12.7%
H 70
 
7.6%
O 47
 
5.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1848
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
F 474
25.6%
C 216
11.7%
1 148
 
8.0%
4 141
 
7.6%
2 123
 
6.7%
W 117
 
6.3%
5 105
 
5.7%
3 99
 
5.4%
6 96
 
5.2%
0 90
 
4.9%
Other values (5) 239
12.9%
Distinct168
Distinct (%)36.4%
Missing0
Missing (%)0.0%
Memory size3.7 KiB
2023-12-12T15:12:05.317412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length24
Mean length8.7662338
Min length4

Characters and Unicode

Total characters4050
Distinct characters138
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique42 ?
Unique (%)9.1%

Sample

1st row보조건물 74
2nd row보조건물 88
3rd row보조건물 100
4th row보조건물 126
5th row보조건물 148
ValueCountFrequency (%)
c/v 74
 
7.6%
100 51
 
5.2%
rwb 40
 
4.1%
원자로건물 38
 
3.9%
pab 32
 
3.3%
보조건물 31
 
3.2%
공통 28
 
2.9%
fhb 25
 
2.6%
sab 24
 
2.5%
건물 21
 
2.1%
Other values (133) 615
62.8%
2023-12-12T15:12:05.663506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
517
 
12.8%
1 207
 
5.1%
B 175
 
4.3%
175
 
4.3%
147
 
3.6%
C 122
 
3.0%
0 116
 
2.9%
A 109
 
2.7%
/ 100
 
2.5%
V 93
 
2.3%
Other values (128) 2289
56.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1555
38.4%
Uppercase Letter 1058
26.1%
Decimal Number 725
17.9%
Space Separator 517
 
12.8%
Other Punctuation 111
 
2.7%
Close Punctuation 41
 
1.0%
Open Punctuation 41
 
1.0%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
175
 
11.3%
147
 
9.5%
71
 
4.6%
68
 
4.4%
49
 
3.2%
46
 
3.0%
43
 
2.8%
43
 
2.8%
43
 
2.8%
38
 
2.4%
Other values (87) 832
53.5%
Uppercase Letter
ValueCountFrequency (%)
B 175
16.5%
C 122
11.5%
A 109
10.3%
V 93
8.8%
R 84
7.9%
P 68
 
6.4%
L 58
 
5.5%
W 56
 
5.3%
S 56
 
5.3%
F 37
 
3.5%
Other values (14) 200
18.9%
Decimal Number
ValueCountFrequency (%)
1 207
28.6%
0 116
16.0%
2 92
12.7%
5 75
 
10.3%
8 57
 
7.9%
6 56
 
7.7%
7 54
 
7.4%
4 48
 
6.6%
3 20
 
2.8%
Other Punctuation
ValueCountFrequency (%)
/ 100
90.1%
, 6
 
5.4%
. 4
 
3.6%
& 1
 
0.9%
Space Separator
ValueCountFrequency (%)
517
100.0%
Close Punctuation
ValueCountFrequency (%)
) 41
100.0%
Open Punctuation
ValueCountFrequency (%)
( 41
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1555
38.4%
Common 1437
35.5%
Latin 1058
26.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
175
 
11.3%
147
 
9.5%
71
 
4.6%
68
 
4.4%
49
 
3.2%
46
 
3.0%
43
 
2.8%
43
 
2.8%
43
 
2.8%
38
 
2.4%
Other values (87) 832
53.5%
Latin
ValueCountFrequency (%)
B 175
16.5%
C 122
11.5%
A 109
10.3%
V 93
8.8%
R 84
7.9%
P 68
 
6.4%
L 58
 
5.5%
W 56
 
5.3%
S 56
 
5.3%
F 37
 
3.5%
Other values (14) 200
18.9%
Common
ValueCountFrequency (%)
517
36.0%
1 207
14.4%
0 116
 
8.1%
/ 100
 
7.0%
2 92
 
6.4%
5 75
 
5.2%
8 57
 
4.0%
6 56
 
3.9%
7 54
 
3.8%
4 48
 
3.3%
Other values (7) 115
 
8.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2495
61.6%
Hangul 1555
38.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
517
20.7%
1 207
 
8.3%
B 175
 
7.0%
C 122
 
4.9%
0 116
 
4.6%
A 109
 
4.4%
/ 100
 
4.0%
V 93
 
3.7%
2 92
 
3.7%
R 84
 
3.4%
Other values (31) 880
35.3%
Hangul
ValueCountFrequency (%)
175
 
11.3%
147
 
9.5%
71
 
4.6%
68
 
4.4%
49
 
3.2%
46
 
3.0%
43
 
2.8%
43
 
2.8%
43
 
2.8%
38
 
2.4%
Other values (87) 832
53.5%

작업장소 분류 코드
Categorical

HIGH CORRELATION 

Distinct16
Distinct (%)3.5%
Missing0
Missing (%)0.0%
Memory size3.7 KiB
ETC
110 
CV
103 
AB
71 
RW
59 
FB
24 
Other values (11)
95 

Length

Max length8
Median length2
Mean length2.4329004
Min length2

Unique

Unique3 ?
Unique (%)0.6%

Sample

1st rowAC
2nd rowAC
3rd rowAC
4th rowAC
5th rowAC

Common Values

ValueCountFrequency (%)
ETC 110
23.8%
CV 103
22.3%
AB 71
15.4%
RW 59
12.8%
FB 24
 
5.2%
AC 19
 
4.1%
RB 16
 
3.5%
CPB 13
 
2.8%
TRF 12
 
2.6%
ACB 12
 
2.6%
Other values (6) 23
 
5.0%

Length

2023-12-12T15:12:05.790134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
etc 110
23.8%
cv 103
22.3%
ab 71
15.4%
rw 59
12.8%
fb 24
 
5.2%
ac 19
 
4.1%
rb 16
 
3.5%
cpb 13
 
2.8%
trf 12
 
2.6%
acb 12
 
2.6%
Other values (6) 23
 
5.0%

작업장소 분류 상세명
Categorical

HIGH CORRELATION 

Distinct16
Distinct (%)3.5%
Missing0
Missing (%)0.0%
Memory size3.7 KiB
기타
110 
CV
103 
AB
71 
RW
59 
FB
24 
Other values (11)
95 

Length

Max length8
Median length2
Mean length2.1948052
Min length2

Unique

Unique3 ?
Unique (%)0.6%

Sample

1st rowAC
2nd rowAC
3rd rowAC
4th rowAC
5th rowAC

Common Values

ValueCountFrequency (%)
기타 110
23.8%
CV 103
22.3%
AB 71
15.4%
RW 59
12.8%
FB 24
 
5.2%
AC 19
 
4.1%
RB 16
 
3.5%
CPB 13
 
2.8%
TRF 12
 
2.6%
ACB 12
 
2.6%
Other values (6) 23
 
5.0%

Length

2023-12-12T15:12:05.930805image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
기타 110
23.8%
cv 103
22.3%
ab 71
15.4%
rw 59
12.8%
fb 24
 
5.2%
ac 19
 
4.1%
rb 16
 
3.5%
cpb 13
 
2.8%
trf 12
 
2.6%
acb 12
 
2.6%
Other values (6) 23
 
5.0%

Interactions

2023-12-12T15:12:02.952307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T15:12:05.999323image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
원자로구분발전소코드코드내용작업장소 분류 코드작업장소 분류 상세명
원자로구분1.0001.0001.0000.8620.862
발전소코드1.0001.0001.0000.7080.708
코드내용1.0001.0001.0000.8000.800
작업장소 분류 코드0.8620.7080.8001.0001.000
작업장소 분류 상세명0.8620.7080.8001.0001.000
2023-12-12T15:12:06.103357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
원자로구분코드내용작업장소 분류 코드작업장소 분류 상세명
원자로구분1.0000.9900.7040.704
코드내용0.9901.0000.4580.458
작업장소 분류 코드0.7040.4581.0001.000
작업장소 분류 상세명0.7040.4581.0001.000
2023-12-12T15:12:06.199050image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
발전소코드원자로구분코드내용작업장소 분류 코드작업장소 분류 상세명
발전소코드1.0000.9960.9950.4410.441
원자로구분0.9961.0000.9900.7040.704
코드내용0.9950.9901.0000.4580.458
작업장소 분류 코드0.4410.7040.4581.0001.000
작업장소 분류 상세명0.4410.7040.4581.0001.000

Missing values

2023-12-12T15:12:03.082005image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:12:03.211497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

그룹코드원자로구분발전소코드코드내용작업장소 코드작업장소작업장소 분류 코드작업장소 분류 상세명
0A경수로2110고리1발FW01보조건물 74ACAC
1A경수로2110고리1발FW02보조건물 88ACAC
2A경수로2110고리1발FW03보조건물 100ACAC
3A경수로2110고리1발FW04보조건물 126ACAC
4A경수로2110고리1발FW05보조건물 148ACAC
5A경수로2110고리1발FW06보조건물 전지역ACAC
6A경수로2110고리1발FW11출입통제건물 74ETC기타
7A경수로2110고리1발FW12출입통제건물 100ETC기타
8A경수로2110고리1발FW13출입통제건물 전지역ETC기타
9A경수로2110고리1발FW21핵연료건물 100FBFB
그룹코드원자로구분발전소코드코드내용작업장소 코드작업장소작업장소 분류 코드작업장소 분류 상세명
452A경수로2810새울1발FO38C/V 165CVCV
453A경수로2810새울1발FO39C/V 공통CVCV
454A경수로2810새울1발FO40ALL BLDG(C/V 및 고방사선구역제외)ETC기타
455A경수로2810새울1발FO41ALL BLDG(O/H용)ETC기타
456A경수로2810새울1발FO42MST ROOMETC기타
457A경수로2810새울1발FO43방사성폐기물저장고ETC기타
458A경수로2810새울1발FO44ALL BLDG(C/V제외)ETC기타
459A경수로2810새울1발FO45기타(터빈건물 등)ETC기타
460A경수로2810새울1발FO46CPB 161CPBCPB
461A경수로2810새울1발FO47CPB 171CPBCPB