Overview

Dataset statistics

Number of variables2
Number of observations339
Missing cells350
Missing cells (%)51.6%
Duplicate rows1
Duplicate rows (%)0.3%
Total size in memory5.8 KiB
Average record size in memory17.4 B

Variable types

DateTime1
Numeric1

Dataset

Description기온효과 산출 과정은 기온반응함수와 기온효과의 곱을 기온에 대하여 적분하여 생성한다. 이러한 산출과정의 의미는 기온반응함수는 각 온도가 발생시키는 수도권 도시가스수요를 축정하여, 기온 분포함수는 각 온도의 발생횟수를 측정함으로써 최종적으로 해당 기온에서 발생하는 도시가스 수요를 측정한다. 따라서 기온효과는 월 단위 대전 도시가스 수요의 계절성을 나타낸다.
Author한국가스공사
URLhttps://www.data.go.kr/data/15088528/fileData.do

Alerts

Dataset has 1 (0.3%) duplicate rowsDuplicates
연월일 has 175 (51.6%) missing valuesMissing
기온효과 has 175 (51.6%) missing valuesMissing

Reproduction

Analysis started2023-12-12 15:30:32.412479
Analysis finished2023-12-12 15:30:32.862387
Duration0.45 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연월일
Date

MISSING 

Distinct164
Distinct (%)100.0%
Missing175
Missing (%)51.6%
Memory size2.8 KiB
Minimum2005-01-31 00:00:00
Maximum2018-08-31 00:00:00
2023-12-13T00:30:32.965708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:30:33.153774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

기온효과
Real number (ℝ)

MISSING 

Distinct164
Distinct (%)100.0%
Missing175
Missing (%)51.6%
Infinite0
Infinite (%)0.0%
Mean3.9560653
Minimum3.75929
Maximum4.26137
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.1 KiB
2023-12-13T00:30:33.327291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3.75929
5-th percentile3.7788315
Q13.8240475
median3.9235
Q34.0803575
95-th percentile4.1844515
Maximum4.26137
Range0.50208
Interquartile range (IQR)0.25631

Descriptive statistics

Standard deviation0.14341612
Coefficient of variation (CV)0.036252212
Kurtosis-1.2870631
Mean3.9560653
Median Absolute Deviation (MAD)0.11498
Skewness0.3574624
Sum648.79471
Variance0.020568183
MonotonicityNot monotonic
2023-12-13T00:30:33.495395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3.83179 1
 
0.3%
4.04174 1
 
0.3%
4.13611 1
 
0.3%
4.15847 1
 
0.3%
4.10982 1
 
0.3%
4.01642 1
 
0.3%
3.92406 1
 
0.3%
3.85813 1
 
0.3%
3.81549 1
 
0.3%
3.78643 1
 
0.3%
Other values (154) 154
45.4%
(Missing) 175
51.6%
ValueCountFrequency (%)
3.75929 1
0.3%
3.76875 1
0.3%
3.76914 1
0.3%
3.76915 1
0.3%
3.77495 1
0.3%
3.77528 1
0.3%
3.77605 1
0.3%
3.77789 1
0.3%
3.77859 1
0.3%
3.7802 1
0.3%
ValueCountFrequency (%)
4.26137 1
0.3%
4.21141 1
0.3%
4.21067 1
0.3%
4.20757 1
0.3%
4.20652 1
0.3%
4.20267 1
0.3%
4.18975 1
0.3%
4.18716 1
0.3%
4.18487 1
0.3%
4.18208 1
0.3%

Interactions

2023-12-13T00:30:32.451181image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-13T00:30:32.578654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T00:30:32.681821image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T00:30:32.788313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연월일기온효과
02005-01-314.18487
12005-02-284.17665
22005-03-314.07548
32005-04-303.93435
42005-05-313.881
52005-06-303.81882
62005-07-313.79072
72005-08-313.79395
82005-09-303.82462
92005-10-313.92333
연월일기온효과
329<NA><NA>
330<NA><NA>
331<NA><NA>
332<NA><NA>
333<NA><NA>
334<NA><NA>
335<NA><NA>
336<NA><NA>
337<NA><NA>
338<NA><NA>

Duplicate rows

Most frequently occurring

연월일기온효과# duplicates
0<NA><NA>175