Overview

Dataset statistics

Number of variables2
Number of observations339
Missing cells350
Missing cells (%)51.6%
Duplicate rows1
Duplicate rows (%)0.3%
Total size in memory5.8 KiB
Average record size in memory17.4 B

Variable types

DateTime1
Numeric1

Dataset

Description기온효과 산출 과정은 기온반응함수와 기온효과의 곱을 기온에 대하여 적분하여 생성한다. 이러한 산출과정의 의미는 기온반응함수는 각 온도가 발생시키는 수도권 도시가스수요를 축정하여, 기온 분포함수는 각 온도의 발생횟수를 측정함으로써 최종적으로 해당 기온에서 발생하는 도시가스 수요를 측정한다. 따라서 기온효과는 월 단위 부산 도시가스 수요의 계절성을 나타낸다.
Author한국가스공사
URLhttps://www.data.go.kr/data/15088538/fileData.do

Alerts

Dataset has 1 (0.3%) duplicate rowsDuplicates
연월일 has 175 (51.6%) missing valuesMissing
기온효과 has 175 (51.6%) missing valuesMissing

Reproduction

Analysis started2023-12-12 06:47:24.136439
Analysis finished2023-12-12 06:47:24.559811
Duration0.42 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연월일
Date

MISSING 

Distinct164
Distinct (%)100.0%
Missing175
Missing (%)51.6%
Memory size2.8 KiB
Minimum2005-01-31 00:00:00
Maximum2018-08-31 00:00:00
2023-12-12T15:47:24.650731image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:47:24.819237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

기온효과
Real number (ℝ)

MISSING 

Distinct164
Distinct (%)100.0%
Missing175
Missing (%)51.6%
Infinite0
Infinite (%)0.0%
Mean5.6369282
Minimum5.46602
Maximum5.90313
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.1 KiB
2023-12-12T15:47:24.990847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum5.46602
5-th percentile5.483716
Q15.54057
median5.613885
Q35.73053
95-th percentile5.827739
Maximum5.90313
Range0.43711
Interquartile range (IQR)0.18996

Descriptive statistics

Standard deviation0.1130284
Coefficient of variation (CV)0.020051417
Kurtosis-1.0845079
Mean5.6369282
Median Absolute Deviation (MAD)0.084895
Skewness0.35466465
Sum924.45622
Variance0.012775419
MonotonicityNot monotonic
2023-12-12T15:47:25.388870image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5.52016 1
 
0.3%
5.68612 1
 
0.3%
5.77506 1
 
0.3%
5.78573 1
 
0.3%
5.76711 1
 
0.3%
5.69975 1
 
0.3%
5.62934 1
 
0.3%
5.57845 1
 
0.3%
5.5471 1
 
0.3%
5.50733 1
 
0.3%
Other values (154) 154
45.4%
(Missing) 175
51.6%
ValueCountFrequency (%)
5.46602 1
0.3%
5.46739 1
0.3%
5.46792 1
0.3%
5.46854 1
0.3%
5.47202 1
0.3%
5.47274 1
0.3%
5.47896 1
0.3%
5.48054 1
0.3%
5.48329 1
0.3%
5.48613 1
0.3%
ValueCountFrequency (%)
5.90313 1
0.3%
5.86093 1
0.3%
5.84851 1
0.3%
5.84242 1
0.3%
5.83439 1
0.3%
5.83331 1
0.3%
5.82845 1
0.3%
5.82843 1
0.3%
5.82809 1
0.3%
5.82575 1
0.3%

Interactions

2023-12-12T15:47:24.188280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-12T15:47:24.332274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:47:24.418129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T15:47:24.510530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연월일기온효과
02005-01-315.84242
12005-02-285.83331
22005-03-315.74575
32005-04-305.63755
42005-05-315.59753
52005-06-305.55137
62005-07-315.51796
72005-08-315.49532
82005-09-305.5311
92005-10-315.5976
연월일기온효과
329<NA><NA>
330<NA><NA>
331<NA><NA>
332<NA><NA>
333<NA><NA>
334<NA><NA>
335<NA><NA>
336<NA><NA>
337<NA><NA>
338<NA><NA>

Duplicate rows

Most frequently occurring

연월일기온효과# duplicates
0<NA><NA>175