Overview

Dataset statistics

Number of variables: 6
Number of observations: 166
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 8.2 KiB
Average record size in memory: 50.8 B

Variable types

DateTime: 2
Categorical: 3
Numeric: 1

Dataset

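Description: 년월 (year-month), 지사코드 (branch code), 지사명 (branch name), 전기생산량 (electricity generation), 단위 (unit), 작업일시 (processing timestamp)
Author: 서울에너지공사 (Seoul Energy Corporation)
URL: https://data.seoul.go.kr/dataList/OA-20445/S/1/datasetView.do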

Alerts

단위 has constant value "kWh" (Constant)
작업일시 has constant value "2024-05-10 21:06:50" (Constant)
지사코드 is highly overall correlated with 지사명 (High correlation)
지사명 is highly overall correlated with 지사코드 (High correlation)
전기생산량 has 50 (30.1%) zeros (Zeros)
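
The Constant and Zeros alerts above correspond to simple column-level checks. The following is a minimal sketch in plain Python, assuming a small subset of rows copied from the Sample section of this report. The helper names `is_constant` and `zero_share` are hypothetical and are not part of ydata-profiling, and this is not a description of how the library computes its alerts internally.

```python
# Rows copied from the Sample section of this report:
# (년월, 지사코드, 지사명, 전기생산량, 단위)
rows = [
    ("2023-10", "2", "동부지사", 0.0, "kWh"),
    ("2023-10", "1", "서부지사", 0.0, "kWh"),
    ("2023-08", "1", "서부지사", 169420.0, "kWh"),
    ("2023-06", "2", "동부지사", 33700.0, "kWh"),
    ("2016-12", "2", "동부지사", 19349800.0, "kWh"),
    ("2016-12", "1", "서부지사", 13057554.0, "kWh"),
]


def is_constant(values):
    """Return True when a column holds exactly one distinct value."""
    return len(set(values)) == 1


def zero_share(values):
    """Return the fraction of entries that are equal to zero."""
    values = list(values)
    return sum(1 for v in values if v == 0) / len(values)


units = [r[4] for r in rows]    # 단위 column of the subset
output = [r[3] for r in rows]   # 전기생산량 column of the subset

print("단위 constant in subset:", is_constant(units))
print(f"zero share in subset: {zero_share(output):.1%}")

# Full-dataset figure from the alert: 50 zeros out of 166 observations.
print(f"zero share reported for the full dataset: {50 / 166:.1%}")
```

The subset is only illustrative; the 30.1% in the alert refers to all 166 observations.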

Reproduction

Analysis started: 2024-05-11 04:44:35.540919
Analysis finished: 2024-05-11 04:44:36.612858
Duration: 1.07 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

년월
Date

Distinct: 83
Distinct (%): 50.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB
Minimum: 2016-12-01 00:00:00
Maximum: 2023-10-01 00:00:00
Histogram with fixed size bins (bins=50)
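
The distinct count of 83 is what a gapless monthly series between the reported minimum and maximum would produce. A minimal sketch of that arithmetic follows; the function name `months_inclusive` is hypothetical, and the "one row per branch per month" reading is an assumption suggested by the two branch codes having 83 rows each.

```python
from datetime import date


def months_inclusive(start: date, end: date) -> int:
    """Count calendar months from start to end, both endpoints included."""
    return (end.year - start.year) * 12 + (end.month - start.month) + 1


first = date(2016, 12, 1)   # Minimum reported for 년월
last = date(2023, 10, 1)    # Maximum reported for 년월

n_months = months_inclusive(first, last)
print(n_months)       # 83, matching the Distinct count

# Assumption: one row per branch per month, with two branches.
print(n_months * 2)   # 166, matching the number of observations
```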

지사코드
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 1.2%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB

Value  Count
2      83
1      83

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2
2nd row: 1
3rd row: 2
4th row: 1
5th row: 2

Common Values

Value  Count  Frequency (%)
2      83     50.0%
1      83     50.0%

Length

Histogram of lengths of the category

지사명
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 1.2%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB

Value                     Count
동부지사 (East Branch)    83
서부지사 (West Branch)    83

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 동부지사
2nd row: 서부지사
3rd row: 동부지사
4th row: 서부지사
5th row: 동부지사

Common Values

Value      Count  Frequency (%)
동부지사   83     50.0%
서부지사   83     50.0%

Length

Histogram of lengths of the category

전기생산량
Real number (ℝ)

ZEROS 

Distinct: 117
Distinct (%): 70.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4959578.6
Minimum: 0
Maximum: 21902300
Zeros: 50
Zeros (%): 30.1%
Negative: 0
Negative (%): 0.0%
Memory size: 1.6 KiB

Quantile statistics

Minimum: 0
5th percentile: 0
Q1: 0
Median: 2070550
Q3: 8903627.9
95th percentile: 19314700
Maximum: 21902300
Range: 21902300
Interquartile range (IQR): 8903627.9

Descriptive statistics

Standard deviation: 6396501.4
Coefficient of variation (CV): 1.2897268
Kurtosis: 0.19878541
Mean: 4959578.6
Median Absolute Deviation (MAD): 2070550
Skewness: 1.18786
Sum: 8.2329005 × 10^8
Variance: 4.0915231 × 10^13
Monotonicity: Not monotonic
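
The figures reported for 전기생산량 are mutually consistent under the standard definitions: CV = standard deviation / mean, sum = mean × n, variance = standard deviation squared, IQR = Q3 − Q1, and range = maximum − minimum. A minimal sketch of that cross-check follows, using only the rounded values printed in this report, so agreement is approximate rather than exact.

```python
import math

# Rounded values as printed in this report for 전기생산량.
n = 166                         # number of observations
mean = 4959578.6
std = 6396501.4
q1, q3 = 0.0, 8903627.9
minimum, maximum = 0.0, 21902300.0

cv = std / mean                 # coefficient of variation
total = mean * n                # sum implied by the mean
variance = std ** 2             # variance implied by the standard deviation
iqr = q3 - q1                   # interquartile range
value_range = maximum - minimum

print(f"CV       = {cv:.7f}")
print(f"Sum      = {total:.7e}")
print(f"Variance = {variance:.7e}")
print(f"IQR      = {iqr}")
print(f"Range    = {value_range}")

# Compare against the reported figures, allowing for rounding.
print(math.isclose(cv, 1.2897268, rel_tol=1e-5))
print(math.isclose(total, 8.2329005e8, rel_tol=1e-5))
print(math.isclose(variance, 4.0915231e13, rel_tol=1e-5))
```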
Histogram with fixed size bins (bins=50)

Common Values

Value               Count  Frequency (%)
0.0                 50     30.1%
17780100.0          1      0.6%
3766200.0           1      0.6%
195209.0            1      0.6%
3623000.0           1      0.6%
2880041.5           1      0.6%
2149800.0           1      0.6%
8797000.0           1      0.6%
15725900.0          1      0.6%
1664650.0           1      0.6%
Other values (107)  107    64.5%

Minimum 10 values

Value       Count  Frequency (%)
0.0         50     30.1%
100.0       1      0.6%
569.0       1      0.6%
3152.0      1      0.6%
13853.0     1      0.6%
16227.0     1      0.6%
28347.0     1      0.6%
33700.0     1      0.6%
63686.0     1      0.6%
65361.0     1      0.6%

Maximum 10 values

Value       Count  Frequency (%)
21902300.0  1      0.6%
21736300.0  1      0.6%
21150600.0  1      0.6%
20606200.0  1      0.6%
20489200.0  1      0.6%
20231600.0  1      0.6%
19578700.0  1      0.6%
19542800.0  1      0.6%
19349800.0  1      0.6%
19209400.0  1      0.6%

단위
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB

Value  Count
kWh    166

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: kWh
2nd row: kWh
3rd row: kWh
4th row: kWh
5th row: kWh

Common Values

Value  Count  Frequency (%)
kWh    166    100.0%

Length

Histogram of lengths of the category

작업일시
Date

CONSTANT 

Distinct: 1
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB
Minimum: 2024-05-10 21:06:50
Maximum: 2024-05-10 21:06:50
Histogram with fixed size bins (bins=1)

Correlations

              년월    지사코드  지사명  전기생산량
년월          1.000   0.000     0.000   0.681
지사코드      0.000   1.000     1.000   0.526
지사명        0.000   1.000     1.000   0.526
전기생산량    0.681   0.526     0.526   1.000

              지사코드  지사명
지사코드      1.000     0.988
지사명        0.988     1.000

              전기생산량  지사코드  지사명
전기생산량    1.000       0.395     0.395
지사코드      0.395       1.000     0.988
지사명        0.395       0.988     1.000
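
Coefficients of 1.000 and 0.988 between 지사코드 and 지사명 are what one would expect if each code corresponds to exactly one name. A minimal sketch of a one-to-one check follows, assuming a few (지사코드, 지사명) pairs copied from the Sample section of this report; the function name `is_one_to_one` is hypothetical.

```python
from collections import defaultdict

# (지사코드, 지사명) pairs copied from the Sample section of this report.
pairs = [
    ("2", "동부지사"), ("1", "서부지사"),
    ("2", "동부지사"), ("1", "서부지사"),
    ("1", "서부지사"), ("2", "동부지사"),
]


def is_one_to_one(pairs):
    """Return True when each left value maps to one right value and vice versa."""
    forward = defaultdict(set)    # code -> names seen with it
    backward = defaultdict(set)   # name -> codes seen with it
    for left, right in pairs:
        forward[left].add(right)
        backward[right].add(left)
    return (all(len(v) == 1 for v in forward.values())
            and all(len(v) == 1 for v in backward.values()))


print(is_one_to_one(pairs))
```

If the mapping holds for the full dataset, the two columns carry the same information, and keeping only one of them in a downstream analysis would lose nothing.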

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index  년월     지사코드  지사명    전기생산량  단위  작업일시
0      2023-10  2         동부지사  0.0         kWh   2024-05-10 21:06:50.0
1      2023-10  1         서부지사  0.0         kWh   2024-05-10 21:06:50.0
2      2023-09  2         동부지사  0.0         kWh   2024-05-10 21:06:50.0
3      2023-09  1         서부지사  0.0         kWh   2024-05-10 21:06:50.0
4      2023-08  2         동부지사  0.0         kWh   2024-05-10 21:06:50.0
5      2023-08  1         서부지사  169420.0    kWh   2024-05-10 21:06:50.0
6      2023-07  2         동부지사  0.0         kWh   2024-05-10 21:06:50.0
7      2023-07  1         서부지사  0.0         kWh   2024-05-10 21:06:50.0
8      2023-06  2         동부지사  33700.0     kWh   2024-05-10 21:06:50.0
9      2023-06  1         서부지사  0.0         kWh   2024-05-10 21:06:50.0

Last rows

Index  년월     지사코드  지사명    전기생산량  단위  작업일시
156    2017-04  1         서부지사  0.0         kWh   2024-05-10 21:06:50.0
157    2017-04  2         동부지사  10209700.0  kWh   2024-05-10 21:06:50.0
158    2017-03  2         동부지사  16356300.0  kWh   2024-05-10 21:06:50.0
159    2017-03  1         서부지사  9776968.0   kWh   2024-05-10 21:06:50.0
160    2017-02  1         서부지사  11866179.0  kWh   2024-05-10 21:06:50.0
161    2017-02  2         동부지사  12882200.0  kWh   2024-05-10 21:06:50.0
162    2017-01  2         동부지사  13780600.0  kWh   2024-05-10 21:06:50.0
163    2017-01  1         서부지사  13700956.0  kWh   2024-05-10 21:06:50.0
164    2016-12  2         동부지사  19349800.0  kWh   2024-05-10 21:06:50.0
165    2016-12  1         서부지사  13057554.0  kWh   2024-05-10 21:06:50.0