Overview

Dataset statistics

Number of variables: 3
Number of observations: 104
Missing cells: 69
Missing cells (%): 22.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.8 KiB
Average record size in memory: 27.3 B
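The missing-cell percentage is the number of missing cells divided by the total number of cells (variables × observations). The sketch below rechecks the reported figures in plain Python; the helper name `missing_pct` is illustrative and is not part of ydata-profiling, and the per-column counts (28 and 41) are the ones listed under Alerts.

```python
def missing_pct(missing: int, total: int) -> float:
    """Return the share of missing entries as a percentage, rounded to 0.1."""
    return round(100 * missing / total, 1)


n_variables = 3
n_observations = 104
total_cells = n_variables * n_observations  # 3 * 104 = 312 cells

# Dataset-level figure: 69 missing cells out of 312.
overall = missing_pct(69, total_cells)

# Column-level figures: each column holds 104 observations.
col_1 = missing_pct(28, n_observations)
col_2 = missing_pct(41, n_observations)

print(overall, col_1, col_2)
```

The two column-level counts add up to the dataset-level count (28 + 41 = 69), since the date column has no missing values.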

Variable types

DateTime: 1
Numeric: 2

Dataset

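Description: Solar power generation records for the Andong on-site (in-plant) solar installation of Korea Southern Power Co., Ltd. (한국남부발전(주)). The data provide the date (년월일, year-month-day) and the generation of each solar unit.
Author: Korea Southern Power Co., Ltd. (한국남부발전(주))
URL: https://www.data.go.kr/data/15043397/fileData.do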

Alerts

[High correlation] 안동소내 태양광_1(kwh) is highly overall correlated with 안동소내 태양광_2(kwh)
[High correlation] 안동소내 태양광_2(kwh) is highly overall correlated with 안동소내 태양광_1(kwh)
[Missing] 안동소내 태양광_1(kwh) has 28 (26.9%) missing values
[Missing] 안동소내 태양광_2(kwh) has 41 (39.4%) missing values
[Unique] 년월일 has unique values

Reproduction

Analysis started: 2024-04-29 22:30:20.790203
Analysis finished: 2024-04-29 22:30:22.185685
Duration: 1.4 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

년월일
Date

UNIQUE 

Distinct: 104
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 964.0 B
Minimum: 2015-08-01 00:00:00
Maximum: 2024-03-01 00:00:00
Histogram with fixed size bins (bins=50)

안동소내 태양광_1(kwh)
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct: 75
Distinct (%): 98.7%
Missing: 28
Missing (%): 26.9%
Infinite: 0
Infinite (%): 0.0%
Mean: 11602.237
Minimum: 1797
Maximum: 17071
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 1797
5th percentile: 8461
Q1: 10062
Median: 11439.5
Q3: 13209.25
95th percentile: 14874.75
Maximum: 17071
Range: 15274
Interquartile range (IQR): 3147.25
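The range and IQR follow directly from the quantiles listed above. The sketch below recomputes them for 안동소내 태양광_1(kwh) and, as an added illustration, applies the 1.5 × IQR fence rule. The fence rule is an assumption introduced here, a common rule of thumb that the report itself does not apply.

```python
# Quantile statistics for 안동소내 태양광_1(kwh), copied from the report.
minimum, q1, q3, maximum = 1797, 10062, 13209.25, 17071

value_range = maximum - minimum  # Range = Maximum - Minimum
iqr = q3 - q1                    # IQR = Q3 - Q1

# Assumed rule of thumb (not part of the report): values beyond
# 1.5 * IQR from the quartiles are candidates for closer inspection.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

print(value_range, iqr)
print(minimum < lower_fence, maximum > upper_fence)
```

Under that assumed rule the minimum of 1797 kWh falls below the lower fence while the maximum stays inside the upper fence, which is consistent with the gap between the minimum and the 5th percentile (8461).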

Descriptive statistics

Standard deviation: 2343.2861
Coefficient of variation (CV): 0.20196848
Kurtosis: 2.9842352
Mean: 11602.237
Median Absolute Deviation (MAD): 1579.5
Skewness: -0.67088812
Sum: 881770
Variance: 5490989.8
Monotonicity: Not monotonic
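Several of these descriptive statistics are tied together by simple identities: the mean is the sum divided by the number of non-missing observations, the coefficient of variation is the standard deviation divided by the mean, and the variance is the square of the standard deviation. A minimal sketch that rechecks the figures reported for 안동소내 태양광_1(kwh); the variable names are illustrative.

```python
# Figures copied from the report for 안동소내 태양광_1(kwh).
n_observations = 104
n_missing = 28
total = 881770            # Sum
std = 2343.2861           # Standard deviation

count = n_observations - n_missing  # 76 non-missing values
mean = total / count                # Mean = Sum / count
cv = std / mean                     # Coefficient of variation
variance = std ** 2                 # Variance = std squared

print(count, round(mean, 3), round(cv, 8), round(variance, 1))
```

Small differences in the last digit are expected because the inputs are themselves rounded in the report.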
Histogram with fixed size bins (bins=50)
Common values

Value                Count   Frequency (%)
13019                2       1.9%
10538                1       1.0%
9912                 1       1.0%
12176                1       1.0%
17071                1       1.0%
15042                1       1.0%
12038                1       1.0%
13085                1       1.0%
11710                1       1.0%
10066                1       1.0%
Other values (65)    65      62.5%
(Missing)            28      26.9%

Minimum 10 values

Value    Count   Frequency (%)
1797     1       1.0%
7560     1       1.0%
8174     1       1.0%
8284     1       1.0%
8520     1       1.0%
8888     1       1.0%
8904     1       1.0%
8930     1       1.0%
9166     1       1.0%
9314     1       1.0%

Maximum 10 values

Value    Count   Frequency (%)
17071    1       1.0%
16402    1       1.0%
15874    1       1.0%
15042    1       1.0%
14819    1       1.0%
14810    1       1.0%
14509    1       1.0%
14387    1       1.0%
14288    1       1.0%
14160    1       1.0%

안동소내 태양광_2(kwh)
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct: 63
Distinct (%): 100.0%
Missing: 41
Missing (%): 39.4%
Infinite: 0
Infinite (%): 0.0%
Mean: 31555.603
Minimum: 19791
Maximum: 46829
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 19791
5th percentile: 22954.3
Q1: 26988
Median: 30284
Q3: 36412
95th percentile: 41171
Maximum: 46829
Range: 27038
Interquartile range (IQR): 9424

Descriptive statistics

Standard deviation: 6116.5944
Coefficient of variation (CV): 0.19383545
Kurtosis: -0.38997116
Mean: 31555.603
Median Absolute Deviation (MAD): 4084
Skewness: 0.43820456
Sum: 1988003
Variance: 37412728
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value                Count   Frequency (%)
30214                1       1.0%
25457                1       1.0%
25245                1       1.0%
28460                1       1.0%
33299                1       1.0%
32249                1       1.0%
40928                1       1.0%
46829                1       1.0%
35944                1       1.0%
33454                1       1.0%
Other values (53)    53      51.0%
(Missing)            41      39.4%

Minimum 10 values

Value    Count   Frequency (%)
19791    1       1.0%
21971    1       1.0%
21984    1       1.0%
22870    1       1.0%
23713    1       1.0%
24226    1       1.0%
24958    1       1.0%
25245    1       1.0%
25299    1       1.0%
25319    1       1.0%

Maximum 10 values

Value    Count   Frequency (%)
46829    1       1.0%
45403    1       1.0%
43652    1       1.0%
41198    1       1.0%
40928    1       1.0%
40205    1       1.0%
39846    1       1.0%
39450    1       1.0%
38842    1       1.0%
38797    1       1.0%

Interactions

[Four interaction plots for the two numeric variables; images not reproduced.]

Correlations

                          안동소내 태양광_1(kwh)    안동소내 태양광_2(kwh)
안동소내 태양광_1(kwh)    1.000                     0.890
안동소내 태양광_2(kwh)    0.890                     1.000

                          안동소내 태양광_1(kwh)    안동소내 태양광_2(kwh)
안동소내 태양광_1(kwh)    1.000                     0.876
안동소내 태양광_2(kwh)    0.876                     1.000
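With this many missing values, a correlation between the two generation columns can only use rows where both are present. The sketch below shows one way to do that: a Pearson coefficient with pairwise deletion, run on two all-missing rows and the ten most recent rows from the Sample section. Both the choice of Pearson and the function `pearson_pairwise` are assumptions made for illustration, since the report does not state which coefficient each matrix contains. Because the input is a twelve-row excerpt, the result is not expected to match 0.890 or 0.876.

```python
from math import sqrt
from typing import List, Optional, Tuple

Pair = Tuple[Optional[float], Optional[float]]


def pearson_pairwise(rows: List[Pair]) -> float:
    """Pearson correlation over rows where both values are present."""
    pairs = [(x, y) for x, y in rows if x is not None and y is not None]
    n = len(pairs)
    if n < 2:
        raise ValueError("need at least two complete rows")
    mean_x = sum(x for x, _ in pairs) / n
    mean_y = sum(y for _, y in pairs) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in pairs)
    var_x = sum((x - mean_x) ** 2 for x, _ in pairs)
    var_y = sum((y - mean_y) ** 2 for _, y in pairs)
    return cov / sqrt(var_x * var_y)


# (태양광_1, 태양광_2) in kWh: two all-missing rows from the start of the
# sample, then the ten most recent rows (2023-06 to 2024-03).
sample_rows: List[Pair] = [
    (None, None), (None, None),
    (11595, 37800), (8520, 29300), (9801, 32446), (8284, 28135),
    (9404, 32144), (8174, 26217), (9166, 21971), (10296, 24958),
    (7560, 19791), (12551, 33421),
]

r = pearson_pairwise(sample_rows)
print(f"Pearson r on the complete sample rows: {r:.3f}")
```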

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

First rows

       년월일        안동소내 태양광_1(kwh)    안동소내 태양광_2(kwh)
0      2015-08-01    <NA>                      <NA>
1      2015-09-01    <NA>                      <NA>
2      2015-10-01    <NA>                      <NA>
3      2015-11-01    <NA>                      <NA>
4      2015-12-01    <NA>                      <NA>
5      2016-01-01    <NA>                      <NA>
6      2016-02-01    <NA>                      <NA>
7      2016-03-01    <NA>                      <NA>
8      2016-04-01    <NA>                      <NA>
9      2016-05-01    <NA>                      <NA>

Last rows

       년월일        안동소내 태양광_1(kwh)    안동소내 태양광_2(kwh)
94     2023-06-01    11595                     37800
95     2023-07-01    8520                      29300
96     2023-08-01    9801                      32446
97     2023-09-01    8284                      28135
98     2023-10-01    9404                      32144
99     2023-11-01    8174                      26217
100    2023-12-01    9166                      21971
101    2024-01-01    10296                     24958
102    2024-02-01    7560                      19791
103    2024-03-01    12551                     33421