Overview

Dataset statistics

Number of variables7
Number of observations3015
Missing cells1
Missing cells (%)< 0.1%
Duplicate rows11
Duplicate rows (%)0.4%
Total size in memory170.9 KiB
Average record size in memory58.0 B

Variable types

DateTime2
Text2
Categorical1
Numeric2

Dataset

Description2015년 ~2021년 연간 예방정비계획 및 예방정비일수 자료입니다.
Author한국전력거래소
URLhttps://www.data.go.kr/data/15065270/fileData.do

Alerts

Dataset has 11 (0.4%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 22:59:05.646554
Analysis finished2023-12-12 22:59:06.732899
Duration1.09 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct1103
Distinct (%)36.6%
Missing0
Missing (%)0.0%
Memory size23.7 KiB
Minimum2012-10-29 00:00:00
Maximum2021-12-31 00:00:00
2023-12-13T07:59:06.808728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:59:06.979824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct1222
Distinct (%)40.5%
Missing0
Missing (%)0.0%
Memory size23.7 KiB
Minimum2015-01-01 00:00:00
Maximum2022-02-04 00:00:00
2023-12-13T07:59:07.145653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:59:07.278778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct60
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size23.7 KiB
2023-12-13T07:59:07.520826image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length8
Mean length3.2368159
Min length2

Characters and Unicode

Total characters9759
Distinct characters89
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4 ?
Unique (%)0.1%

Sample

1st row한수원
2nd row한수원
3rd row한수원
4th row한수원
5th row한수원
ValueCountFrequency (%)
남부 514
16.5%
한수원 404
13.0%
중부 319
10.2%
동서 266
 
8.5%
수자원 222
 
7.1%
남동 198
 
6.4%
서부 190
 
6.1%
gseps 121
 
3.9%
난방공사 101
 
3.2%
포스코파워 98
 
3.1%
Other values (50) 681
21.9%
2023-12-13T07:59:07.869905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1035
 
10.6%
716
 
7.3%
644
 
6.6%
626
 
6.4%
501
 
5.1%
491
 
5.0%
460
 
4.7%
S 417
 
4.3%
319
 
3.3%
228
 
2.3%
Other values (79) 4322
44.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8094
82.9%
Uppercase Letter 1225
 
12.6%
Lowercase Letter 228
 
2.3%
Space Separator 137
 
1.4%
Other Punctuation 42
 
0.4%
Other Symbol 33
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1035
 
12.8%
716
 
8.8%
644
 
8.0%
626
 
7.7%
501
 
6.2%
491
 
6.1%
460
 
5.7%
319
 
3.9%
228
 
2.8%
213
 
2.6%
Other values (61) 2861
35.3%
Uppercase Letter
ValueCountFrequency (%)
S 417
34.0%
G 221
18.0%
P 208
17.0%
E 163
 
13.3%
C 63
 
5.1%
K 42
 
3.4%
N 33
 
2.7%
M 30
 
2.4%
H 24
 
2.0%
L 18
 
1.5%
Lowercase Letter
ValueCountFrequency (%)
e 57
25.0%
r 57
25.0%
w 57
25.0%
o 57
25.0%
Space Separator
ValueCountFrequency (%)
137
100.0%
Other Punctuation
ValueCountFrequency (%)
& 42
100.0%
Other Symbol
ValueCountFrequency (%)
33
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8127
83.3%
Latin 1453
 
14.9%
Common 179
 
1.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1035
 
12.7%
716
 
8.8%
644
 
7.9%
626
 
7.7%
501
 
6.2%
491
 
6.0%
460
 
5.7%
319
 
3.9%
228
 
2.8%
213
 
2.6%
Other values (62) 2894
35.6%
Latin
ValueCountFrequency (%)
S 417
28.7%
G 221
15.2%
P 208
14.3%
E 163
 
11.2%
C 63
 
4.3%
e 57
 
3.9%
r 57
 
3.9%
w 57
 
3.9%
o 57
 
3.9%
K 42
 
2.9%
Other values (5) 111
 
7.6%
Common
ValueCountFrequency (%)
137
76.5%
& 42
 
23.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8094
82.9%
ASCII 1632
 
16.7%
None 33
 
0.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1035
 
12.8%
716
 
8.8%
644
 
8.0%
626
 
7.7%
501
 
6.2%
491
 
6.1%
460
 
5.7%
319
 
3.9%
228
 
2.8%
213
 
2.6%
Other values (61) 2861
35.3%
ASCII
ValueCountFrequency (%)
S 417
25.6%
G 221
13.5%
P 208
12.7%
E 163
 
10.0%
137
 
8.4%
C 63
 
3.9%
e 57
 
3.5%
r 57
 
3.5%
w 57
 
3.5%
o 57
 
3.5%
Other values (7) 195
11.9%
None
ValueCountFrequency (%)
33
100.0%

구분
Categorical

Distinct23
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size23.7 KiB
복합
1544 
기력(석탄)
414 
수력
408 
집단에너지
166 
원자력
 
135
Other values (18)
348 

Length

Max length12
Median length2
Mean length3.1502488
Min length2

Unique

Unique4 ?
Unique (%)0.1%

Sample

1st row원자력
2nd row양수
3rd row양수
4th row원자력
5th row원자력

Common Values

ValueCountFrequency (%)
복합 1544
51.2%
기력(석탄) 414
 
13.7%
수력 408
 
13.5%
집단에너지 166
 
5.5%
원자력 135
 
4.5%
양수 83
 
2.8%
기력(제주) 53
 
1.8%
기력(국내탄) 44
 
1.5%
내연(제주) 43
 
1.4%
기타(ESS) 43
 
1.4%
Other values (13) 82
 
2.7%

Length

2023-12-13T07:59:07.995764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
복합 1544
51.2%
기력(석탄 414
 
13.7%
수력 408
 
13.5%
집단에너지 166
 
5.5%
원자력 135
 
4.5%
양수 83
 
2.8%
기력(제주 53
 
1.8%
기력(국내탄 44
 
1.5%
내연(제주 43
 
1.4%
기타(ess 43
 
1.4%
Other values (13) 83
 
2.8%
Distinct515
Distinct (%)17.1%
Missing0
Missing (%)0.0%
Memory size23.7 KiB
2023-12-13T07:59:08.227230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length7.6215589
Min length5

Characters and Unicode

Total characters22979
Distinct characters143
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique39 ?
Unique (%)1.3%

Sample

1st row월성원자력#1
2nd row예천양수#1
3rd row예천양수#2
4th row한빛원자력#3
5th row한울원자력#5
ValueCountFrequency (%)
ess 41
 
1.3%
제주내연#2 24
 
0.8%
제주내연#1 23
 
0.7%
안동복합gt#1 19
 
0.6%
충주수력#1 19
 
0.6%
안동복합st#1 19
 
0.6%
남제주화력#2 18
 
0.6%
남제주화력#1 18
 
0.6%
한림복합gt#1 17
 
0.6%
충주수력#2 17
 
0.6%
Other values (507) 2856
93.0%
2023-12-13T07:59:08.603017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
# 2900
 
12.6%
1708
 
7.4%
T 1680
 
7.3%
1454
 
6.3%
G 1245
 
5.4%
1086
 
4.7%
1 1022
 
4.4%
S 722
 
3.1%
2 714
 
3.1%
608
 
2.6%
Other values (133) 9840
42.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 13275
57.8%
Uppercase Letter 3739
 
16.3%
Decimal Number 3002
 
13.1%
Other Punctuation 2903
 
12.6%
Space Separator 56
 
0.2%
Dash Punctuation 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1708
 
12.9%
1454
 
11.0%
1086
 
8.2%
608
 
4.6%
553
 
4.2%
534
 
4.0%
424
 
3.2%
309
 
2.3%
300
 
2.3%
298
 
2.2%
Other values (113) 6001
45.2%
Decimal Number
ValueCountFrequency (%)
1 1022
34.0%
2 714
23.8%
3 359
 
12.0%
4 304
 
10.1%
6 175
 
5.8%
5 172
 
5.7%
7 112
 
3.7%
8 97
 
3.2%
9 30
 
1.0%
0 17
 
0.6%
Uppercase Letter
ValueCountFrequency (%)
T 1680
44.9%
G 1245
33.3%
S 722
19.3%
C 44
 
1.2%
E 43
 
1.2%
I 5
 
0.1%
Other Punctuation
ValueCountFrequency (%)
# 2900
99.9%
, 3
 
0.1%
Space Separator
ValueCountFrequency (%)
56
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 13275
57.8%
Common 5965
26.0%
Latin 3739
 
16.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1708
 
12.9%
1454
 
11.0%
1086
 
8.2%
608
 
4.6%
553
 
4.2%
534
 
4.0%
424
 
3.2%
309
 
2.3%
300
 
2.3%
298
 
2.2%
Other values (113) 6001
45.2%
Common
ValueCountFrequency (%)
# 2900
48.6%
1 1022
 
17.1%
2 714
 
12.0%
3 359
 
6.0%
4 304
 
5.1%
6 175
 
2.9%
5 172
 
2.9%
7 112
 
1.9%
8 97
 
1.6%
56
 
0.9%
Other values (4) 54
 
0.9%
Latin
ValueCountFrequency (%)
T 1680
44.9%
G 1245
33.3%
S 722
19.3%
C 44
 
1.2%
E 43
 
1.2%
I 5
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 13275
57.8%
ASCII 9704
42.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
# 2900
29.9%
T 1680
17.3%
G 1245
12.8%
1 1022
 
10.5%
S 722
 
7.4%
2 714
 
7.4%
3 359
 
3.7%
4 304
 
3.1%
6 175
 
1.8%
5 172
 
1.8%
Other values (10) 411
 
4.2%
Hangul
ValueCountFrequency (%)
1708
 
12.9%
1454
 
11.0%
1086
 
8.2%
608
 
4.6%
553
 
4.2%
534
 
4.0%
424
 
3.2%
309
 
2.3%
300
 
2.3%
298
 
2.2%
Other values (113) 6001
45.2%

설비용량
Real number (ℝ)

Distinct210
Distinct (%)7.0%
Missing1
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean233.4517
Minimum6
Maximum1800
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size26.6 KiB
2023-12-13T07:59:08.747903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum6
5-th percentile24
Q178
median150
Q3277
95-th percentile870
Maximum1800
Range1794
Interquartile range (IQR)199

Descriptive statistics

Standard deviation244.12181
Coefficient of variation (CV)1.0457059
Kurtosis4.1896833
Mean233.4517
Median Absolute Deviation (MAD)92.35
Skewness2.0324432
Sum703623.41
Variance59595.458
MonotonicityNot monotonic
2023-12-13T07:59:08.878371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
150.0 465
 
15.4%
500.0 250
 
8.3%
100.0 247
 
8.2%
75.0 90
 
3.0%
40.0 76
 
2.5%
1000.0 73
 
2.4%
78.0 59
 
2.0%
35.0 58
 
1.9%
200.0 56
 
1.9%
161.0 49
 
1.6%
Other values (200) 1591
52.8%
ValueCountFrequency (%)
6.0 30
1.0%
11.0 10
 
0.3%
11.05 9
 
0.3%
11.25 16
0.5%
12.25 1
 
< 0.1%
14.0 10
 
0.3%
14.4 7
 
0.2%
15.0 4
 
0.1%
16.0 3
 
0.1%
19.19 6
 
0.2%
ValueCountFrequency (%)
1800.0 1
 
< 0.1%
1400.0 7
 
0.2%
1050.0 9
 
0.3%
1022.0 12
 
0.4%
1020.0 11
 
0.4%
1019.0 4
 
0.1%
1000.0 73
2.4%
950.0 30
1.0%
870.0 28
 
0.9%
800.0 14
 
0.5%

일수
Real number (ℝ)

Distinct148
Distinct (%)4.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean27.554892
Minimum1
Maximum1525
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size26.6 KiB
2023-12-13T07:59:09.017468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q15
median13
Q335
95-th percentile82
Maximum1525
Range1524
Interquartile range (IQR)30

Descriptive statistics

Standard deviation55.058419
Coefficient of variation (CV)1.9981359
Kurtosis252.66274
Mean27.554892
Median Absolute Deviation (MAD)10
Skewness12.562356
Sum83078
Variance3031.4296
MonotonicityNot monotonic
2023-12-13T07:59:09.134411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2 260
 
8.6%
7 204
 
6.8%
3 191
 
6.3%
4 146
 
4.8%
10 140
 
4.6%
12 140
 
4.6%
5 132
 
4.4%
15 95
 
3.2%
1 88
 
2.9%
30 83
 
2.8%
Other values (138) 1536
50.9%
ValueCountFrequency (%)
1 88
 
2.9%
2 260
8.6%
3 191
6.3%
4 146
4.8%
5 132
4.4%
6 44
 
1.5%
7 204
6.8%
8 40
 
1.3%
9 51
 
1.7%
10 140
4.6%
ValueCountFrequency (%)
1525 1
< 0.1%
1025 1
< 0.1%
698 2
0.1%
650 1
< 0.1%
635 1
< 0.1%
609 1
< 0.1%
579 1
< 0.1%
456 1
< 0.1%
376 2
0.1%
365 2
0.1%

Interactions

2023-12-13T07:59:06.277774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:59:06.009671image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:59:06.414540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:59:06.164249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T07:59:09.228468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
회원사구분설비용량일수
회원사1.0000.9430.6960.000
구분0.9431.0000.7930.274
설비용량0.6960.7931.0000.134
일수0.0000.2740.1341.000
2023-12-13T07:59:09.297200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
설비용량일수구분
설비용량1.0000.2750.469
일수0.2751.0000.121
구분0.4690.1211.000

Missing values

2023-12-13T07:59:06.551585image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T07:59:06.679213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시작일종료일회원사구분발전기명설비용량일수
02012-10-292016-12-31한수원원자력월성원자력#1679.01525
12014-10-192015-02-20한수원양수예천양수#1400.0125
22014-10-192015-02-20한수원양수예천양수#2400.0125
32014-10-232015-01-25한수원원자력한빛원자력#31000.095
42014-11-142015-01-01한수원원자력한울원자력#51000.049
52015-01-022015-03-31한수원원자력고리원자력#2650.089
62015-01-192015-01-21중부내연(제주)제주내연#240.03
72015-01-222015-02-15한수원수력섬진강수력#36.025
82015-01-262015-01-28중부내연(제주)제주내연#140.03
92015-01-272015-03-08한수원원자력한울원자력#41000.041
시작일종료일회원사구분발전기명설비용량일수
30052021-11-292021-11-29남부복합한림복합ST#135.01
30062021-12-012021-12-09동서기력(석탄)당진화력#3500.09
30072021-12-072022-01-10한수원원자력월성원자력#2700.035
30082021-12-082021-12-10남부기력(제주)남제주화력#1100.03
30092021-12-132021-12-16중부내연(제주)제주내연#240.04
30102021-12-152022-01-25남부복합영월복합GT#1183.042
30112021-12-152021-12-17남부기력(제주)남제주화력#2100.03
30122021-12-242022-02-04남부복합영월복합GT#2183.043
30132021-12-302021-12-31남부복합영월복합ST#1299.02
30142021-12-312022-01-20남부복합영월복합GT#3183.021

Duplicate rows

Most frequently occurring

시작일종료일회원사구분발전기명설비용량일수# duplicates
02016-09-242017-01-26한수원수력팔당수력#230.01252
12018-09-102020-08-07한수원양수삼랑진양수#2300.06982
22018-09-272019-02-03한수원수력팔당수력#430.01302
32018-11-062018-11-08수자원수력임하수력#125.032
42020-05-022020-05-11포스코에너지복합포스코복합GT#8100.0102
52020-09-212021-07-20한수원수력화천수력#427.03032
62020-10-022021-02-14한수원원자력한빛원자력#31000.01362
72020-12-092021-01-27한수원원자력신월성원자력#21000.0502
82020-12-142021-01-25한수원원자력월성원자력#3700.0432
92020-12-222021-04-11한수원원자력한빛원자력#61000.01112