Overview

Dataset statistics

Number of variables2
Number of observations32
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory676.0 B
Average record size in memory21.1 B

Variable types

Text1
Numeric1

Dataset

Description특수일은 국경일, 공휴일 등을 의미한다. 일반적으로 공휴일에는 근무일에 비하여 도시가스 수요가 감소하는 패턴을 보이기에, 특수일 효과는 그러한 감소 패턴을 특수일 별로 추정하는 것이다. 고려하는 공휴일은 월요일, 토요일, 일요일, 국경일, 공휴일, 명절, 임시휴일, 샌드위치데이, 연말, 기타 수요에 영향을 미치는 날을 고려한다. 구체적으로 광주 도시가스 수요가 해당 특수일에 근무일 대비 감소하는 비율을 추정한다.
Author한국가스공사
URLhttps://www.data.go.kr/data/15088557/fileData.do

Alerts

특수일 has unique valuesUnique

Reproduction

Analysis started2023-12-12 18:47:50.041303
Analysis finished2023-12-12 18:47:50.601211
Duration0.56 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

특수일
Text

UNIQUE 

Distinct32
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size388.0 B
2023-12-13T03:47:50.810118image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length6
Mean length4.40625
Min length3

Characters and Unicode

Total characters141
Distinct characters54
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique32 ?
Unique (%)100.0%

Sample

1st row근무일
2nd row월요일
3rd row토요일
4th row일요일
5th row1월 1일
ValueCountFrequency (%)
1일 4
 
9.5%
추석 4
 
9.5%
설날 3
 
7.1%
당일 2
 
4.8%
2일 2
 
4.8%
근무일 1
 
2.4%
석가탄신일 1
 
2.4%
하계휴가 1
 
2.4%
샌드위치데이 1
 
2.4%
선거일 1
 
2.4%
Other values (22) 22
52.4%
2023-12-13T03:47:51.441161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
24
17.0%
10
 
7.1%
1 8
 
5.7%
7
 
5.0%
6
 
4.3%
6
 
4.3%
+ 6
 
4.3%
6
 
4.3%
2 5
 
3.5%
5
 
3.5%
Other values (44) 58
41.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 105
74.5%
Decimal Number 16
 
11.3%
Space Separator 10
 
7.1%
Math Symbol 6
 
4.3%
Dash Punctuation 4
 
2.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
24
22.9%
7
 
6.7%
6
 
5.7%
6
 
5.7%
6
 
5.7%
5
 
4.8%
3
 
2.9%
3
 
2.9%
2
 
1.9%
2
 
1.9%
Other values (38) 41
39.0%
Decimal Number
ValueCountFrequency (%)
1 8
50.0%
2 5
31.2%
3 3
 
18.8%
Space Separator
ValueCountFrequency (%)
10
100.0%
Math Symbol
ValueCountFrequency (%)
+ 6
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 105
74.5%
Common 36
 
25.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
24
22.9%
7
 
6.7%
6
 
5.7%
6
 
5.7%
6
 
5.7%
5
 
4.8%
3
 
2.9%
3
 
2.9%
2
 
1.9%
2
 
1.9%
Other values (38) 41
39.0%
Common
ValueCountFrequency (%)
10
27.8%
1 8
22.2%
+ 6
16.7%
2 5
13.9%
- 4
 
11.1%
3 3
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 105
74.5%
ASCII 36
 
25.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
24
22.9%
7
 
6.7%
6
 
5.7%
6
 
5.7%
6
 
5.7%
5
 
4.8%
3
 
2.9%
3
 
2.9%
2
 
1.9%
2
 
1.9%
Other values (38) 41
39.0%
ASCII
ValueCountFrequency (%)
10
27.8%
1 8
22.2%
+ 6
16.7%
2 5
13.9%
- 4
 
11.1%
3 3
 
8.3%

추정치
Real number (ℝ)

Distinct29
Distinct (%)90.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-0.10890625
Minimum-0.479
Maximum1.001
Zeros0
Zeros (%)0.0%
Negative31
Negative (%)96.9%
Memory size420.0 B
2023-12-13T03:47:51.698880image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-0.479
5-th percentile-0.3696
Q1-0.14875
median-0.1095
Q3-0.07475
95-th percentile-0.0327
Maximum1.001
Range1.48
Interquartile range (IQR)0.074

Descriptive statistics

Standard deviation0.22911236
Coefficient of variation (CV)-2.1037577
Kurtosis18.700199
Mean-0.10890625
Median Absolute Deviation (MAD)0.04
Skewness3.6260227
Sum-3.485
Variance0.052492475
MonotonicityNot monotonic
2023-12-13T03:47:51.942909image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=29)
ValueCountFrequency (%)
-0.095 3
 
9.4%
-0.103 2
 
6.2%
1.001 1
 
3.1%
-0.289 1
 
3.1%
-0.079 1
 
3.1%
-0.04 1
 
3.1%
-0.051 1
 
3.1%
-0.145 1
 
3.1%
-0.28 1
 
3.1%
-0.407 1
 
3.1%
Other values (19) 19
59.4%
ValueCountFrequency (%)
-0.479 1
3.1%
-0.407 1
3.1%
-0.339 1
3.1%
-0.289 1
3.1%
-0.28 1
3.1%
-0.207 1
3.1%
-0.189 1
3.1%
-0.151 1
3.1%
-0.148 1
3.1%
-0.145 1
3.1%
ValueCountFrequency (%)
1.001 1
3.1%
-0.025 1
3.1%
-0.039 1
3.1%
-0.04 1
3.1%
-0.042 1
3.1%
-0.051 1
3.1%
-0.059 1
3.1%
-0.062 1
3.1%
-0.079 1
3.1%
-0.085 1
3.1%

Interactions

2023-12-13T03:47:50.150762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T03:47:52.124954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
특수일추정치
특수일1.0001.000
추정치1.0001.000

Missing values

2023-12-13T03:47:50.434453image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T03:47:50.553259image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

특수일추정치
0근무일1.001
1월요일-0.025
2토요일-0.085
3일요일-0.148
41월 1일-0.131
5삼일절-0.095
6노동절-0.12
7식목일-0.039
8어린이날-0.151
9현충일-0.105
특수일추정치
22추석-2일-0.133
23추석-1일-0.339
24추석 당일-0.479
25추석 +1일-0.407
26추석 +2일-0.28
27추석 +3일-0.145
28선거일-0.051
29샌드위치데이-0.04
30하계휴가-0.079
31임시공휴일-0.095