Overview

Dataset statistics

Number of variables3
Number of observations4958
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory126.0 KiB
Average record size in memory26.0 B

Variable types

Numeric2
Text1

Dataset

Description연도별 연말 기준 전력시장에 참여하는 발전설비 용량에 대한 데이터로, 각 전력시장 참여자별로 합산하여 전력시장 참여자, 설비용량 합계 항목을 제공합니다. - 단위 : MW
URLhttps://www.data.go.kr/data/15069393/fileData.do

Alerts

설비용량(MW) is highly skewed (γ1 = 38.98786314)Skewed
번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 01:55:50.568885
Analysis finished2023-12-12 01:55:51.771190
Duration1.2 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct4958
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2479.5
Minimum1
Maximum4958
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size43.7 KiB
2023-12-12T10:55:51.880219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile248.85
Q11240.25
median2479.5
Q33718.75
95-th percentile4710.15
Maximum4958
Range4957
Interquartile range (IQR)2478.5

Descriptive statistics

Standard deviation1431.3956
Coefficient of variation (CV)0.57729205
Kurtosis-1.2
Mean2479.5
Median Absolute Deviation (MAD)1239.5
Skewness0
Sum12293361
Variance2048893.5
MonotonicityStrictly increasing
2023-12-12T10:55:52.080014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
< 0.1%
3305 1
 
< 0.1%
3312 1
 
< 0.1%
3311 1
 
< 0.1%
3310 1
 
< 0.1%
3309 1
 
< 0.1%
3308 1
 
< 0.1%
3307 1
 
< 0.1%
3306 1
 
< 0.1%
3304 1
 
< 0.1%
Other values (4948) 4948
99.8%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
4958 1
< 0.1%
4957 1
< 0.1%
4956 1
< 0.1%
4955 1
< 0.1%
4954 1
< 0.1%
4953 1
< 0.1%
4952 1
< 0.1%
4951 1
< 0.1%
4950 1
< 0.1%
4949 1
< 0.1%
Distinct4874
Distinct (%)98.3%
Missing0
Missing (%)0.0%
Memory size38.9 KiB
2023-12-12T10:55:52.349465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length23
Mean length9.9171037
Min length2

Characters and Unicode

Total characters49169
Distinct characters617
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4798 ?
Unique (%)96.8%

Sample

1st row한국전력공사
2nd row한국수력원자력(주)
3rd row한국남동발전(주)
4th row한국중부발전(주)
5th row한국서부발전(주)
ValueCountFrequency (%)
주식회사 1472
 
19.3%
태양광발전소 560
 
7.3%
유한회사 316
 
4.1%
46
 
0.6%
발전소 40
 
0.5%
태양광 33
 
0.4%
농업회사법인 22
 
0.3%
2호 15
 
0.2%
1호 14
 
0.2%
9
 
0.1%
Other values (4899) 5104
66.9%
2023-12-12T10:55:52.800754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2916
 
5.9%
2706
 
5.5%
2237
 
4.5%
2216
 
4.5%
2207
 
4.5%
2193
 
4.5%
2139
 
4.4%
2076
 
4.2%
2025
 
4.1%
1952
 
4.0%
Other values (607) 26502
53.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 42593
86.6%
Space Separator 2706
 
5.5%
Close Punctuation 1271
 
2.6%
Open Punctuation 1269
 
2.6%
Decimal Number 1120
 
2.3%
Uppercase Letter 161
 
0.3%
Lowercase Letter 21
 
< 0.1%
Other Punctuation 18
 
< 0.1%
Dash Punctuation 5
 
< 0.1%
Other Symbol 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2916
 
6.8%
2237
 
5.3%
2216
 
5.2%
2207
 
5.2%
2193
 
5.1%
2139
 
5.0%
2076
 
4.9%
2025
 
4.8%
1952
 
4.6%
1639
 
3.8%
Other values (551) 20993
49.3%
Uppercase Letter
ValueCountFrequency (%)
S 26
16.1%
E 15
 
9.3%
K 13
 
8.1%
H 13
 
8.1%
G 12
 
7.5%
J 11
 
6.8%
C 9
 
5.6%
Y 7
 
4.3%
L 6
 
3.7%
T 6
 
3.7%
Other values (12) 43
26.7%
Lowercase Letter
ValueCountFrequency (%)
o 3
14.3%
d 2
 
9.5%
y 2
 
9.5%
s 2
 
9.5%
k 2
 
9.5%
u 1
 
4.8%
i 1
 
4.8%
n 1
 
4.8%
g 1
 
4.8%
c 1
 
4.8%
Other values (5) 5
23.8%
Decimal Number
ValueCountFrequency (%)
1 368
32.9%
2 277
24.7%
3 134
 
12.0%
4 75
 
6.7%
0 74
 
6.6%
5 67
 
6.0%
6 41
 
3.7%
7 38
 
3.4%
8 25
 
2.2%
9 21
 
1.9%
Other Punctuation
ValueCountFrequency (%)
. 13
72.2%
& 5
 
27.8%
Math Symbol
ValueCountFrequency (%)
< 1
50.0%
> 1
50.0%
Space Separator
ValueCountFrequency (%)
2706
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1271
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1269
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 42596
86.6%
Common 6391
 
13.0%
Latin 182
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2916
 
6.8%
2237
 
5.3%
2216
 
5.2%
2207
 
5.2%
2193
 
5.1%
2139
 
5.0%
2076
 
4.9%
2025
 
4.8%
1952
 
4.6%
1639
 
3.8%
Other values (552) 20996
49.3%
Latin
ValueCountFrequency (%)
S 26
14.3%
E 15
 
8.2%
K 13
 
7.1%
H 13
 
7.1%
G 12
 
6.6%
J 11
 
6.0%
C 9
 
4.9%
Y 7
 
3.8%
L 6
 
3.3%
T 6
 
3.3%
Other values (27) 64
35.2%
Common
ValueCountFrequency (%)
2706
42.3%
) 1271
19.9%
( 1269
19.9%
1 368
 
5.8%
2 277
 
4.3%
3 134
 
2.1%
4 75
 
1.2%
0 74
 
1.2%
5 67
 
1.0%
6 41
 
0.6%
Other values (8) 109
 
1.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 42593
86.6%
ASCII 6573
 
13.4%
None 3
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2916
 
6.8%
2237
 
5.3%
2216
 
5.2%
2207
 
5.2%
2193
 
5.1%
2139
 
5.0%
2076
 
4.9%
2025
 
4.8%
1952
 
4.6%
1639
 
3.8%
Other values (551) 20993
49.3%
ASCII
ValueCountFrequency (%)
2706
41.2%
) 1271
19.3%
( 1269
19.3%
1 368
 
5.6%
2 277
 
4.2%
3 134
 
2.0%
4 75
 
1.1%
0 74
 
1.1%
5 67
 
1.0%
6 41
 
0.6%
Other values (45) 291
 
4.4%
None
ValueCountFrequency (%)
3
100.0%

설비용량(MW)
Real number (ℝ)

SKEWED 

Distinct2519
Distinct (%)50.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean26.974925
Minimum0
Maximum30032.492
Zeros1
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size43.7 KiB
2023-12-12T10:55:53.000621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0.09855
Q10.381275
median0.948
Q31.49235
95-th percentile6.9689272
Maximum30032.492
Range30032.492
Interquartile range (IQR)1.111075

Descriptive statistics

Standard deviation556.23609
Coefficient of variation (CV)20.620487
Kurtosis1847.5665
Mean26.974925
Median Absolute Deviation (MAD)0.55066
Skewness38.987863
Sum133741.68
Variance309398.59
MonotonicityNot monotonic
2023-12-12T10:55:53.196321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.099 137
 
2.8%
0.99792 91
 
1.8%
0.999 82
 
1.7%
0.49896 47
 
0.9%
0.09912 46
 
0.9%
1.0 40
 
0.8%
0.9936 37
 
0.7%
0.4995 34
 
0.7%
0.09936 33
 
0.7%
0.99864 33
 
0.7%
Other values (2509) 4378
88.3%
ValueCountFrequency (%)
0.0 1
 
< 0.1%
0.012 1
 
< 0.1%
0.0162 1
 
< 0.1%
0.018 1
 
< 0.1%
0.021 1
 
< 0.1%
0.02409 1
 
< 0.1%
0.025125 1
 
< 0.1%
0.02988 1
 
< 0.1%
0.03 3
0.1%
0.03108 1
 
< 0.1%
ValueCountFrequency (%)
30032.49181 1
< 0.1%
11852.59415 1
< 0.1%
11534.26758 1
< 0.1%
10774.7048 1
< 0.1%
9574.050045 1
< 0.1%
9278.01063 1
< 0.1%
3603.16 1
< 0.1%
3210.869 1
< 0.1%
2477.0768 1
< 0.1%
2468.72685 1
< 0.1%

Interactions

2023-12-12T10:55:51.311842image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:55:51.064472image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:55:51.434822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:55:51.173984image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T10:55:53.328791image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호설비용량(MW)
번호1.0000.081
설비용량(MW)0.0811.000
2023-12-12T10:55:53.436477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호설비용량(MW)
번호1.000-0.144
설비용량(MW)-0.1441.000

Missing values

2023-12-12T10:55:51.617813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T10:55:51.724176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호회원사명설비용량(MW)
01한국전력공사0.0
12한국수력원자력(주)30032.49181
23한국남동발전(주)9278.01063
34한국중부발전(주)10774.7048
45한국서부발전(주)11852.59415
56한국남부발전(주)11534.26758
67한국동서발전(주)9574.050045
78에스디엔(주)2.9958
89(주)서대구에너지1.5
910(주)에스피에너지2.208
번호회원사명설비용량(MW)
49484949블루스카이 태양광발전소0.4914
49494950미래 태양광발전소0.29808
49504951이엘티에너지(주)0.999
49514952승영태양광발전소0.056
49524953영훈이엔지0.29988
49534954주식회사 썬로드0.994875
49544955(주)전한전력0.87588
49554956한미글로벌이앤씨10호 주식회사0.82309
49564957주식회사 이제이태양광0.9984
49574958태승 태양광발전소0.14904