Overview

Dataset statistics

Number of variables4
Number of observations31
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.2 KiB
Average record size in memory38.3 B

Variable types

Numeric2
Text1
DateTime1

Dataset

Description한국동서발전의 태양광발전 설비용량 정보입니다. 태양광발전 설비용량 정보는 번호, 구분, 설비용량, 준공일의 항목으로 구성됩니다.
Author한국동서발전(주)
URLhttps://www.data.go.kr/data/15103821/fileData.do

Alerts

번호 has unique valuesUnique
구분 has unique valuesUnique

Reproduction

Analysis started2023-12-12 17:53:11.031700
Analysis finished2023-12-12 17:53:11.861789
Duration0.83 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct31
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean16
Minimum1
Maximum31
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size411.0 B
2023-12-13T02:53:11.948790image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.5
Q18.5
median16
Q323.5
95-th percentile29.5
Maximum31
Range30
Interquartile range (IQR)15

Descriptive statistics

Standard deviation9.0921211
Coefficient of variation (CV)0.56825757
Kurtosis-1.2
Mean16
Median Absolute Deviation (MAD)8
Skewness0
Sum496
Variance82.666667
MonotonicityStrictly increasing
2023-12-13T02:53:12.116222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=31)
ValueCountFrequency (%)
1 1
 
3.2%
2 1
 
3.2%
31 1
 
3.2%
30 1
 
3.2%
29 1
 
3.2%
28 1
 
3.2%
27 1
 
3.2%
26 1
 
3.2%
25 1
 
3.2%
24 1
 
3.2%
Other values (21) 21
67.7%
ValueCountFrequency (%)
1 1
3.2%
2 1
3.2%
3 1
3.2%
4 1
3.2%
5 1
3.2%
6 1
3.2%
7 1
3.2%
8 1
3.2%
9 1
3.2%
10 1
3.2%
ValueCountFrequency (%)
31 1
3.2%
30 1
3.2%
29 1
3.2%
28 1
3.2%
27 1
3.2%
26 1
3.2%
25 1
3.2%
24 1
3.2%
23 1
3.2%
22 1
3.2%

구분
Text

UNIQUE 

Distinct31
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size380.0 B
2023-12-13T02:53:12.377728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length9
Mean length7.2580645
Min length3

Characters and Unicode

Total characters225
Distinct characters86
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique31 ?
Unique (%)100.0%

Sample

1st row당진
2nd row호남
3rd row울산
4th row당진폐기물
5th row광양항 태양광
ValueCountFrequency (%)
당진 5
 
10.0%
광양항 4
 
8.0%
울산산단 2
 
4.0%
태양광 2
 
4.0%
동해 2
 
4.0%
울산 2
 
4.0%
2단계 1
 
2.0%
1단계 1
 
2.0%
지붕 1
 
2.0%
부산산단 1
 
2.0%
Other values (29) 29
58.0%
2023-12-13T02:53:12.880451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
34
 
15.1%
10
 
4.4%
10
 
4.4%
10
 
4.4%
8
 
3.6%
7
 
3.1%
7
 
3.1%
7
 
3.1%
6
 
2.7%
5
 
2.2%
Other values (76) 121
53.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 177
78.7%
Space Separator 34
 
15.1%
Decimal Number 10
 
4.4%
Dash Punctuation 2
 
0.9%
Uppercase Letter 2
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10
 
5.6%
10
 
5.6%
10
 
5.6%
8
 
4.5%
7
 
4.0%
7
 
4.0%
7
 
4.0%
6
 
3.4%
5
 
2.8%
5
 
2.8%
Other values (68) 102
57.6%
Decimal Number
ValueCountFrequency (%)
2 4
40.0%
1 3
30.0%
3 2
20.0%
4 1
 
10.0%
Uppercase Letter
ValueCountFrequency (%)
P 1
50.0%
G 1
50.0%
Space Separator
ValueCountFrequency (%)
34
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 177
78.7%
Common 46
 
20.4%
Latin 2
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10
 
5.6%
10
 
5.6%
10
 
5.6%
8
 
4.5%
7
 
4.0%
7
 
4.0%
7
 
4.0%
6
 
3.4%
5
 
2.8%
5
 
2.8%
Other values (68) 102
57.6%
Common
ValueCountFrequency (%)
34
73.9%
2 4
 
8.7%
1 3
 
6.5%
- 2
 
4.3%
3 2
 
4.3%
4 1
 
2.2%
Latin
ValueCountFrequency (%)
P 1
50.0%
G 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 177
78.7%
ASCII 48
 
21.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
34
70.8%
2 4
 
8.3%
1 3
 
6.2%
- 2
 
4.2%
3 2
 
4.2%
P 1
 
2.1%
4 1
 
2.1%
G 1
 
2.1%
Hangul
ValueCountFrequency (%)
10
 
5.6%
10
 
5.6%
10
 
5.6%
8
 
4.5%
7
 
4.0%
7
 
4.0%
7
 
4.0%
6
 
3.4%
5
 
2.8%
5
 
2.8%
Other values (68) 102
57.6%

설비용량(MW)
Real number (ℝ)

Distinct26
Distinct (%)83.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13.458065
Minimum0.1
Maximum274
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size411.0 B
2023-12-13T02:53:13.060266image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.1
5-th percentile0.15
Q10.8
median2
Q36.5
95-th percentile24.6
Maximum274
Range273.9
Interquartile range (IQR)5.7

Descriptive statistics

Standard deviation48.822268
Coefficient of variation (CV)3.6277332
Kurtosis29.705892
Mean13.458065
Median Absolute Deviation (MAD)1.5
Skewness5.404033
Sum417.2
Variance2383.6138
MonotonicityNot monotonic
2023-12-13T02:53:13.224576image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%)
1.0 3
 
9.7%
6.5 2
 
6.5%
0.6 2
 
6.5%
0.1 2
 
6.5%
2.0 1
 
3.2%
3.2 1
 
3.2%
6.0 1
 
3.2%
10.8 1
 
3.2%
6.6 1
 
3.2%
24.2 1
 
3.2%
Other values (16) 16
51.6%
ValueCountFrequency (%)
0.1 2
6.5%
0.2 1
 
3.2%
0.4 1
 
3.2%
0.5 1
 
3.2%
0.6 2
6.5%
0.7 1
 
3.2%
0.9 1
 
3.2%
1.0 3
9.7%
1.1 1
 
3.2%
1.3 1
 
3.2%
ValueCountFrequency (%)
274.0 1
3.2%
25.0 1
3.2%
24.2 1
3.2%
20.0 1
3.2%
10.8 1
3.2%
9.8 1
3.2%
6.6 1
3.2%
6.5 2
6.5%
6.0 1
3.2%
3.5 1
3.2%
Distinct20
Distinct (%)64.5%
Missing0
Missing (%)0.0%
Memory size380.0 B
Minimum2010-09-01 00:00:00
Maximum2021-12-01 00:00:00
2023-12-13T02:53:13.388940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:53:13.557972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=20)

Interactions

2023-12-13T02:53:11.424755image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:53:11.225523image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:53:11.537237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:53:11.323277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T02:53:13.686133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호구분설비용량(MW)준공일
번호1.0001.0000.2230.966
구분1.0001.0001.0001.000
설비용량(MW)0.2231.0001.0000.000
준공일0.9661.0000.0001.000
2023-12-13T02:53:13.780584image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호설비용량(MW)
번호1.0000.387
설비용량(MW)0.3871.000

Missing values

2023-12-13T02:53:11.709102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T02:53:11.818506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호구분설비용량(MW)준공일
01당진1.02010-09-01
12호남0.12011-01-01
23울산0.52011-03-01
34당진폐기물1.32011-12-01
45광양항 태양광2.32011-12-01
56르노삼성자동차20.02012-12-01
67당진자재창고0.72012-12-01
78당진수상태양광1.02013-06-01
89수원시 하수처리장1.52014-01-01
910광양항 황금물류센터1.12014-06-01
번호구분설비용량(MW)준공일
2122동서햇빛드림펀드274.02020-12-01
2223신안자라리24.22020-12-01
2324광양항 3단계-1차0.62020-12-01
2425햇빛새싹학교6.62020-12-01
2526옥산오창6.52021-04-01
2627광양항 3단계-2차0.62021-04-01
2728부산산단 지붕10.82021-04-01
2829울산산단 1단계6.52021-04-01
2930울산산단 2단계6.02021-01-01
3031동해 P2G 태양광3.22021-12-01