Overview

Dataset statistics

Number of variables: 6
Number of observations: 93
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 4.8 KiB
Average record size in memory: 52.4 B

Variable types

Categorical: 2
Text: 1
Numeric: 3

Dataset

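Description: Annual power transmission data by plant site for Korea East-West Power (한국동서발전). Each record consists of the following fields: plant site (사업소), sub-category (중분류), unit number (호기), capacity (용량, kW), generation hours (발전시간, HH), and transmitted energy (송전량, kWh).
URL: https://www.data.go.kr/data/15064442/fileData.do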

Alerts

용량(kW) is highly overall correlated with 송전량(kWh) and 2 other fields (High correlation)
발전시간(HH) is highly overall correlated with 사업소 (High correlation)
송전량(kWh) is highly overall correlated with 용량(kW) and 1 other field (High correlation)
사업소 is highly overall correlated with 용량(kW) and 1 other field (High correlation)
호기 is highly overall correlated with 용량(kW) and 1 other field (High correlation)
발전시간(HH) has 2 (2.2%) zeros (Zeros)
송전량(kWh) has 2 (2.2%) zeros (Zeros)
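
The percentages quoted in the alerts and in the per-variable summaries are plain shares of the 93 observations. A minimal Python sketch of that arithmetic follows; the helper name `share_pct` is made up for illustration, and the one-decimal rounding is an assumption chosen to mirror how the report displays values.

```python
N_OBSERVATIONS = 93  # "Number of observations" from the dataset statistics


def share_pct(count: int, total: int = N_OBSERVATIONS) -> float:
    """Return count as a percentage of total, rounded to one decimal place."""
    return round(100 * count / total, 1)


# Two zero values each in 발전시간(HH) and 송전량(kWh)
zeros_pct = share_pct(2)
# Nine distinct values in 사업소
distinct_pct = share_pct(9)

print(zeros_pct, distinct_pct)
```

The same helper can be applied to any count in the report, for example the distinct counts of the other variables.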

Reproduction

Analysis started: 2023-12-12 13:23:16.505364
Analysis finished: 2023-12-12 13:23:17.929000
Duration: 1.42 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
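
The reported duration can be recomputed from the two timestamps. The sketch below uses only the Python standard library; the format string is an assumption about how the timestamps are laid out, based on how they appear in this report.

```python
from datetime import datetime

# Assumed layout of the timestamps as printed in the report
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-12-12 13:23:16.505364", FMT)
finished = datetime.strptime("2023-12-12 13:23:17.929000", FMT)

# Elapsed wall-clock time between start and finish, in seconds
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```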

Variables

사업소
Categorical

HIGH CORRELATION 

Distinct: 9
Distinct (%): 9.7%
Missing: 0
Missing (%): 0.0%
Memory size: 876.0 B

Length

Max length: 5
Median length: 3
Mean length: 3.2043011
Min length: 2

Unique

Unique: 2
Unique (%): 2.2%

Sample

1st row: 당진
2nd row: 당진
3rd row: 당진
4th row: 당진
5th row: 당진

Common Values

Value | Count | Frequency (%)
태양광 | 49 | 52.7%
울산복합 | 12 | 12.9%
당진 | 10 | 10.8%
일산복합 | 8 | 8.6%
연료전지 | 7 | 7.5%
울산기력 | 3 | 3.2%
동해 | 2 | 2.2%
풍력 | 1 | 1.1%
바이오매스 | 1 | 1.1%

Length

Histogram of lengths of the category

Common Values (Plot)

중분류
Text

Distinct: 67
Distinct (%): 72.0%
Missing: 0
Missing (%): 0.0%
Memory size: 876.0 B

Length

Max length: 17
Median length: 13
Mean length: 7.3870968
Min length: 2

Characters and Unicode

Total characters: 687
Distinct characters: 134
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 58
Unique (%): 62.4%

Sample

1st row: 당진
2nd row: 당진
3rd row: 당진
4th row: 당진
5th row: 당진

Value | Count | Frequency (%)
태양광 | 24 | 18.6%
당진 | 10 | 7.8%
일산cc1 | 5 | 3.9%
동해 | 4 | 3.1%
발전설비 | 4 | 3.1%
일산cc2 | 3 | 2.3%
울산cc1 | 3 | 2.3%
울산cc2 | 3 | 2.3%
울산cc3 | 3 | 2.3%
울산cc4 | 3 | 2.3%
Other values (63) | 67 | 51.9%

Most occurring characters

Value | Count | Frequency (%)
 | 55 | 8.0%
 | 55 | 8.0%
 | 49 | 7.1%
C | 42 | 6.1%
(space) | 37 | 5.4%
 | 34 | 4.9%
 | 23 | 3.3%
 | 18 | 2.6%
 | 18 | 2.6%
1 | 13 | 1.9%
Other values (124) | 343 | 49.9%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 559 | 81.4%
Uppercase Letter | 48 | 7.0%
Space Separator | 37 | 5.4%
Decimal Number | 33 | 4.8%
Close Punctuation | 4 | 0.6%
Open Punctuation | 4 | 0.6%
Other Punctuation | 1 | 0.1%
Connector Punctuation | 1 | 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 55 | 9.8%
 | 55 | 9.8%
 | 49 | 8.8%
 | 34 | 6.1%
 | 23 | 4.1%
 | 18 | 3.2%
 | 18 | 3.2%
 | 12 | 2.1%
 | 12 | 2.1%
 | 11 | 2.0%
Other values (111) | 272 | 48.7%

Uppercase Letter
Value | Count | Frequency (%)
C | 42 | 87.5%
S | 4 | 8.3%
E | 1 | 2.1%
M | 1 | 2.1%

Decimal Number
Value | Count | Frequency (%)
1 | 13 | 39.4%
2 | 11 | 33.3%
4 | 5 | 15.2%
3 | 4 | 12.1%

Space Separator
Value | Count | Frequency (%)
(space) | 37 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 4 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 4 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
# | 1 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 559 | 81.4%
Common | 80 | 11.6%
Latin | 48 | 7.0%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 55 | 9.8%
 | 55 | 9.8%
 | 49 | 8.8%
 | 34 | 6.1%
 | 23 | 4.1%
 | 18 | 3.2%
 | 18 | 3.2%
 | 12 | 2.1%
 | 12 | 2.1%
 | 11 | 2.0%
Other values (111) | 272 | 48.7%

Common
Value | Count | Frequency (%)
(space) | 37 | 46.2%
1 | 13 | 16.2%
2 | 11 | 13.8%
4 | 5 | 6.2%
) | 4 | 5.0%
( | 4 | 5.0%
3 | 4 | 5.0%
# | 1 | 1.2%
_ | 1 | 1.2%

Latin
Value | Count | Frequency (%)
C | 42 | 87.5%
S | 4 | 8.3%
E | 1 | 2.1%
M | 1 | 2.1%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 559 | 81.4%
ASCII | 128 | 18.6%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
 | 55 | 9.8%
 | 55 | 9.8%
 | 49 | 8.8%
 | 34 | 6.1%
 | 23 | 4.1%
 | 18 | 3.2%
 | 18 | 3.2%
 | 12 | 2.1%
 | 12 | 2.1%
 | 11 | 2.0%
Other values (111) | 272 | 48.7%

ASCII
Value | Count | Frequency (%)
C | 42 | 32.8%
(space) | 37 | 28.9%
1 | 13 | 10.2%
2 | 11 | 8.6%
4 | 5 | 3.9%
) | 4 | 3.1%
( | 4 | 3.1%
S | 4 | 3.1%
3 | 4 | 3.1%
# | 1 | 0.8%
Other values (3) | 3 | 2.3%

호기
Categorical

HIGH CORRELATION 

Distinct: 31
Distinct (%): 33.3%
Missing: 0
Missing (%): 0.0%
Memory size: 876.0 B

Length

Max length: 3
Median length: 1
Mean length: 1.5591398
Min length: 1

Unique

Unique: 16
Unique (%): 17.2%

Sample

1st row: 1
2nd row: 2
3rd row: 3
4th row: 4
5th row: 5

Common Values

Value | Count | Frequency (%)
1 | 49 | 52.7%
CG3 | 2 | 2.2%
CG2 | 2 | 2.2%
4 | 2 | 2.2%
5 | 2 | 2.2%
6 | 2 | 2.2%
CS2 | 2 | 2.2%
CS1 | 2 | 2.2%
CG1 | 2 | 2.2%
CG4 | 2 | 2.2%
Other values (21) | 26 | 28.0%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
1 | 49 | 52.7%
cg1 | 2 | 2.2%
cg3 | 2 | 2.2%
f4 | 2 | 2.2%
s1 | 2 | 2.2%
cg6 | 2 | 2.2%
2 | 2 | 2.2%
cg4 | 2 | 2.2%
cg5 | 2 | 2.2%
cs1 | 2 | 2.2%
Other values (21) | 26 | 28.0%

용량(kW)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 61
Distinct (%): 65.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 115776.34
Minimum: 88
Maximum: 1020000
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 969.0 B

Quantile statistics

Minimum: 88
5-th percentile: 198.4
Q1: 693
median: 2800
Q3: 150000
95-th percentile: 500000
Maximum: 1020000
Range: 1019912
Interquartile range (IQR): 149307

Descriptive statistics

Standard deviation: 206546.59
Coefficient of variation (CV): 1.7840138
Kurtosis: 6.7393116
Mean: 115776.34
Median Absolute Deviation (MAD): 2510.45
Skewness: 2.4392689
Sum: 10767199
Variance: 4.2661493 × 10^10
Monotonicity: Not monotonic
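
Several of the 용량(kW) figures are derived from others in the same summary: the range is maximum minus minimum, the IQR is Q3 minus Q1, the CV is the standard deviation divided by the mean, and the variance is the standard deviation squared. The Python sketch below recomputes them from the reported values as a consistency check. The variable names are illustrative, and small differences in the last digits are expected because the inputs are already rounded.

```python
import math

# Values copied from the 용량(kW) summary
minimum, maximum = 88, 1_020_000
q1, q3 = 693, 150_000
mean, std = 115_776.34, 206_546.59

value_range = maximum - minimum  # report lists Range as 1019912
iqr = q3 - q1                    # report lists IQR as 149307
cv = std / mean                  # report lists CV as 1.7840138
variance = std ** 2              # report lists Variance as 4.2661493 × 10^10

print(value_range, iqr)
print(f"{cv:.7f}")
print(f"{variance:.7e}")

# Tolerant comparison, since the reported inputs are rounded
assert math.isclose(cv, 1.7840138, rel_tol=1e-6)
```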
Histogram with fixed size bins (bins=50)

Common values
Value | Count | Frequency (%)
100000.0 | 10 | 10.8%
500000.0 | 8 | 8.6%
150000.0 | 6 | 6.5%
200000.0 | 3 | 3.2%
400000.0 | 3 | 3.2%
15000.0 | 2 | 2.2%
999.0 | 2 | 2.2%
1020000.0 | 2 | 2.2%
4200.0 | 2 | 2.2%
998.0 | 2 | 2.2%
Other values (51) | 53 | 57.0%

Minimum 10 values
Value | Count | Frequency (%)
88.0 | 1 | 1.1%
95.0 | 1 | 1.1%
102.0 | 1 | 1.1%
109.0 | 1 | 1.1%
190.0 | 1 | 1.1%
204.0 | 1 | 1.1%
218.0 | 1 | 1.1%
289.55 | 1 | 1.1%
299.0 | 1 | 1.1%
326.0 | 1 | 1.1%

Maximum 10 values
Value | Count | Frequency (%)
1020000.0 | 2 | 2.2%
500000.0 | 8 | 8.6%
400000.0 | 3 | 3.2%
298700.0 | 1 | 1.1%
286600.0 | 2 | 2.2%
200000.0 | 3 | 3.2%
150000.0 | 6 | 6.5%
100000.0 | 10 | 10.8%
30000.0 | 1 | 1.1%
24952.0 | 1 | 1.1%

발전시간(HH)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 91
Distinct (%): 97.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4308.7634
Minimum: 0
Maximum: 8760
Zeros: 2
Zeros (%): 2.2%
Negative: 0
Negative (%): 0.0%
Memory size: 969.0 B

Quantile statistics

Minimum: 0
5-th percentile: 371.8
Q1: 4017
median: 4387
Q3: 4857
95-th percentile: 7982
Maximum: 8760
Range: 8760
Interquartile range (IQR): 840

Descriptive statistics

Standard deviation: 2104.6308
Coefficient of variation (CV): 0.48845356
Kurtosis: 0.2504147
Mean: 4308.7634
Median Absolute Deviation (MAD): 470
Skewness: -0.12580461
Sum: 400715
Variance: 4429471
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Common values
Value | Count | Frequency (%)
0 | 2 | 2.2%
8760 | 2 | 2.2%
174 | 1 | 1.1%
4884 | 1 | 1.1%
4516 | 1 | 1.1%
4164 | 1 | 1.1%
4317 | 1 | 1.1%
4136 | 1 | 1.1%
4405 | 1 | 1.1%
4315 | 1 | 1.1%
Other values (81) | 81 | 87.1%

Minimum 10 values
Value | Count | Frequency (%)
0 | 2 | 2.2%
174 | 1 | 1.1%
176 | 1 | 1.1%
346 | 1 | 1.1%
389 | 1 | 1.1%
394 | 1 | 1.1%
460 | 1 | 1.1%
477 | 1 | 1.1%
615 | 1 | 1.1%
633 | 1 | 1.1%

Maximum 10 values
Value | Count | Frequency (%)
8760 | 2 | 2.2%
8725 | 1 | 1.1%
8675 | 1 | 1.1%
8657 | 1 | 1.1%
7532 | 1 | 1.1%
7460 | 1 | 1.1%
7447 | 1 | 1.1%
7406 | 1 | 1.1%
7366 | 1 | 1.1%
7321 | 1 | 1.1%

송전량(kWh)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 92
Distinct (%): 98.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.0325066 × 10^8
Minimum: 0
Maximum: 5.7355052 × 10^9
Zeros: 2
Zeros (%): 2.2%
Negative: 0
Negative (%): 0.0%
Memory size: 969.0 B

Quantile statistics

Minimum: 0
5-th percentile: 120035.4
Q1: 825040
median: 3061491
Q3: 2.2531967 × 10^8
95-th percentile: 2.5450575 × 10^9
Maximum: 5.7355052 × 10^9
Range: 5.7355052 × 10^9
Interquartile range (IQR): 2.2449463 × 10^8

Descriptive statistics

Standard deviation: 9.9969282 × 10^8
Coefficient of variation (CV): 2.4790854
Kurtosis: 12.494208
Mean: 4.0325066 × 10^8
Median Absolute Deviation (MAD): 2933286
Skewness: 3.4015919
Sum: 3.7502311 × 10^10
Variance: 9.9938574 × 10^17
Monotonicity: Not monotonic
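
The exponents in the 송전량(kWh) summary can be sanity-checked against each other: the sum should equal the mean times the 93 observations, and the variance should equal the standard deviation squared. The Python sketch below runs that check on the reported values. Variable names are illustrative, and agreement is only expected to roughly the precision shown in the report because the inputs are rounded.

```python
import math

n = 93              # number of observations
mean = 4.0325066e8  # reported mean, kWh
std = 9.9969282e8   # reported standard deviation, kWh

total = mean * n     # compare with the reported Sum, 3.7502311 × 10^10
variance = std ** 2  # compare with the reported Variance, 9.9938574 × 10^17

print(f"{total:.7e}")
print(f"{variance:.7e}")

# Tolerant comparisons against the figures shown in the report
sum_matches = math.isclose(total, 3.7502311e10, rel_tol=1e-6)
variance_matches = math.isclose(variance, 9.9938574e17, rel_tol=1e-6)
print(sum_matches, variance_matches)
```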
Histogram with fixed size bins (bins=50)

Common values
Value | Count | Frequency (%)
0 | 2 | 2.2%
902071 | 1 | 1.1%
469100 | 1 | 1.1%
1260042 | 1 | 1.1%
1277508 | 1 | 1.1%
4855090 | 1 | 1.1%
146946 | 1 | 1.1%
249312 | 1 | 1.1%
137442 | 1 | 1.1%
2067930 | 1 | 1.1%
Other values (82) | 82 | 88.2%

Minimum 10 values
Value | Count | Frequency (%)
0 | 2 | 2.2%
19512 | 1 | 1.1%
65100 | 1 | 1.1%
107781 | 1 | 1.1%
128205 | 1 | 1.1%
137442 | 1 | 1.1%
146946 | 1 | 1.1%
249312 | 1 | 1.1%
257386 | 1 | 1.1%
281835 | 1 | 1.1%

Maximum 10 values
Value | Count | Frequency (%)
5735505155 | 1 | 1.1%
4670222491 | 1 | 1.1%
3559989098 | 1 | 1.1%
2752170642 | 1 | 1.1%
2706422887 | 1 | 1.1%
2437480595 | 1 | 1.1%
2427285461 | 1 | 1.1%
1909725438 | 1 | 1.1%
1621732320 | 1 | 1.1%
1378377363 | 1 | 1.1%

Interactions


Correlations

Variable | 사업소 | 중분류 | 호기 | 용량(kW) | 발전시간(HH) | 송전량(kWh)
사업소 | 1.000 | 1.000 | 0.334 | 0.904 | 0.936 | 0.727
중분류 | 1.000 | 1.000 | 0.000 | 0.804 | 0.866 | 0.000
호기 | 0.334 | 0.000 | 1.000 | 0.939 | 0.634 | 0.968
용량(kW) | 0.904 | 0.804 | 0.939 | 1.000 | 0.709 | 0.883
발전시간(HH) | 0.936 | 0.866 | 0.634 | 0.709 | 1.000 | 0.657
송전량(kWh) | 0.727 | 0.000 | 0.968 | 0.883 | 0.657 | 1.000

Variable | 사업소 | 호기
사업소 | 1.000 | 0.094
호기 | 0.094 | 1.000

Variable | 용량(kW) | 발전시간(HH) | 송전량(kWh) | 사업소 | 호기
용량(kW) | 1.000 | 0.111 | 0.888 | 0.703 | 0.638
발전시간(HH) | 0.111 | 1.000 | 0.331 | 0.593 | 0.246
송전량(kWh) | 0.888 | 0.331 | 1.000 | 0.464 | 0.704
사업소 | 0.703 | 0.593 | 0.464 | 1.000 | 0.094
호기 | 0.638 | 0.246 | 0.704 | 0.094 | 1.000
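
The High correlation alerts listed near the top of the report line up with the five-variable matrix directly above when a cut-off of 0.5 is applied. The Python sketch below shows that reading. The threshold is an assumption chosen because it reproduces the alert list, not a setting confirmed by this report, and `highly_correlated` is a hypothetical helper rather than part of ydata-profiling.

```python
# Correlation values copied from the five-variable matrix
FIELDS = ["용량(kW)", "발전시간(HH)", "송전량(kWh)", "사업소", "호기"]
MATRIX = [
    [1.000, 0.111, 0.888, 0.703, 0.638],
    [0.111, 1.000, 0.331, 0.593, 0.246],
    [0.888, 0.331, 1.000, 0.464, 0.704],
    [0.703, 0.593, 0.464, 1.000, 0.094],
    [0.638, 0.246, 0.704, 0.094, 1.000],
]
THRESHOLD = 0.5  # assumption: picked because it reproduces the listed alerts


def highly_correlated(fields, matrix, threshold):
    """Map each field to the other fields whose |correlation| meets the threshold."""
    result = {}
    for i, name in enumerate(fields):
        partners = [
            fields[j]
            for j, value in enumerate(matrix[i])
            if j != i and abs(value) >= threshold
        ]
        if partners:
            result[name] = partners
    return result


alerts = highly_correlated(FIELDS, MATRIX, THRESHOLD)
for name, partners in alerts.items():
    print(name, "->", ", ".join(partners))
```

Under this assumption 용량(kW) pairs with three fields, 발전시간(HH) with 사업소 only, and each of the remaining three variables with two fields, which matches the counts stated in the alerts.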

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

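Row | 사업소 | 중분류 | 호기 | 용량(kW) | 발전시간(HH) | 송전량(kWh)
0 | 당진 | 당진 | 1 | 500000.0 | 0 | 0
1 | 당진 | 당진 | 2 | 500000.0 | 8675 | 3559989098
2 | 당진 | 당진 | 3 | 500000.0 | 7532 | 2752170642
3 | 당진 | 당진 | 4 | 500000.0 | 477 | 183647644
4 | 당진 | 당진 | 5 | 500000.0 | 7366 | 2706422887
5 | 당진 | 당진 | 6 | 500000.0 | 6814 | 2427285461
6 | 당진 | 당진 | 7 | 500000.0 | 5183 | 1909725438
7 | 당진 | 당진 | 8 | 500000.0 | 6605 | 2437480595
8 | 당진 | 당진 | 9 | 1020000.0 | 6235 | 4670222491
9 | 당진 | 당진 | 10 | 1020000.0 | 7460 | 5735505155
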
사업소중분류호기용량(kW)발전시간(HH)송전량(kWh)
83태양광황금물류센터태양광11100.044481533761
84풍력영광지산풍력13000.058583499328
85바이오매스동해바이오매스130000.06867162612278
86연료전지동해 북평레포츠 연료전지14200.0621826310934
87연료전지동해연료전지115000.08725121736426
88연료전지울산수소연료전지11000.062923061491
89연료전지울산연료전지12800.000
90연료전지울산연료전지214200.0876034972542
91연료전지일산연료전지4F45280.0865738734415
92연료전지호남연료전지115000.08760129883958
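
One way to read the three numeric fields together is the ratio of transmitted energy to the energy a unit would deliver at rated capacity over its recorded generation hours. The Python sketch below computes that ratio for three 당진 rows from the first-rows sample. It rests on assumptions: the split of each flattened row into 호기, 용량(kW), 발전시간(HH) and 송전량(kWh) is an interpretation of the extracted text, and `utilization` is a hypothetical helper, not a quantity the report itself computes.

```python
from typing import Optional


def utilization(capacity_kw: float, hours: float, energy_kwh: float) -> Optional[float]:
    """Energy sent out divided by capacity × hours; None when the unit did not run."""
    potential_kwh = capacity_kw * hours
    if potential_kwh == 0:
        return None
    return energy_kwh / potential_kwh


# (호기, 용량(kW), 발전시간(HH), 송전량(kWh)) for 당진 sample rows 0, 1 and 9
rows = [
    ("1", 500_000.0, 0, 0),
    ("2", 500_000.0, 8675, 3_559_989_098),
    ("10", 1_020_000.0, 7460, 5_735_505_155),
]

ratios = {unit: utilization(cap, hrs, kwh) for unit, cap, hrs, kwh in rows}
for unit, ratio in ratios.items():
    print(unit, "n/a" if ratio is None else f"{ratio:.3f}")
```

Returning `None` for a unit with zero generation hours avoids a division by zero and keeps the two zero-valued records flagged in the alerts distinguishable from low-output units.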