Overview

Dataset statistics

Number of variables6
Number of observations166
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.2 KiB
Average record size in memory50.8 B

Variable types

DateTime1
Categorical4
Numeric1

Dataset

Description년월,지사코드,지사명,전기판매량,단위,작업일시
Author서울에너지공사
URLhttps://data.seoul.go.kr/dataList/OA-20446/S/1/datasetView.do

Alerts

단위 has constant value ""Constant
작업일시 has constant value ""Constant
지사코드 is highly overall correlated with 지사명High correlation
지사명 is highly overall correlated with 지사코드High correlation
전기판매량 has 38 (22.9%) zerosZeros

Reproduction

Analysis started2024-05-11 02:09:17.539006
Analysis finished2024-05-11 02:09:18.638003
Duration1.1 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

년월
Date

Distinct83
Distinct (%)50.0%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
Minimum2016-12-01 00:00:00
Maximum2023-10-01 00:00:00
2024-05-11T02:09:18.875523image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T02:09:19.309421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

지사코드
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2
83 
1
83 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row1
3rd row2
4th row1
5th row2

Common Values

ValueCountFrequency (%)
2 83
50.0%
1 83
50.0%

Length

2024-05-11T02:09:19.721212image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T02:09:20.069004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2 83
50.0%
1 83
50.0%

지사명
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
동부지사
83 
서부지사
83 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row동부지사
2nd row서부지사
3rd row동부지사
4th row서부지사
5th row동부지사

Common Values

ValueCountFrequency (%)
동부지사 83
50.0%
서부지사 83
50.0%

Length

2024-05-11T02:09:20.401139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T02:09:20.714586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
동부지사 83
50.0%
서부지사 83
50.0%

전기판매량
Real number (ℝ)

ZEROS 

Distinct126
Distinct (%)75.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4105964.1
Minimum0
Maximum18783760
Zeros38
Zeros (%)22.9%
Negative0
Negative (%)0.0%
Memory size1.6 KiB
2024-05-11T02:09:21.081260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q172
median1616238.6
Q36818077.3
95-th percentile16597834
Maximum18783760
Range18783760
Interquartile range (IQR)6818005.3

Descriptive statistics

Standard deviation5424175
Coefficient of variation (CV)1.3210479
Kurtosis0.45627896
Mean4105964.1
Median Absolute Deviation (MAD)1616238.6
Skewness1.2770715
Sum6.8159003 × 108
Variance2.9421675 × 1013
MonotonicityNot monotonic
2024-05-11T02:09:21.567908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 38
 
22.9%
72.0 4
 
2.4%
17599681.71 1
 
0.6%
2994266.52 1
 
0.6%
7272.0 1
 
0.6%
2879326.26 1
 
0.6%
2254968.0 1
 
0.6%
1697672.49 1
 
0.6%
288.0 1
 
0.6%
2448.0 1
 
0.6%
Other values (116) 116
69.9%
ValueCountFrequency (%)
0.0 38
22.9%
8.0 1
 
0.6%
72.0 4
 
2.4%
133.0 1
 
0.6%
288.0 1
 
0.6%
504.0 1
 
0.6%
648.0 1
 
0.6%
1008.0 1
 
0.6%
1152.0 1
 
0.6%
1656.0 1
 
0.6%
ValueCountFrequency (%)
18783760.38 1
0.6%
18373709.79 1
0.6%
18268954.11 1
0.6%
18011672.22 1
0.6%
17599681.71 1
0.6%
17322758.13 1
0.6%
16950051.0 1
0.6%
16916829.87 1
0.6%
16610322.51 1
0.6%
16560369.57 1
0.6%

단위
Categorical

CONSTANT 

Distinct1
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
kWh
166 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowkWh
2nd rowkWh
3rd rowkWh
4th rowkWh
5th rowkWh

Common Values

ValueCountFrequency (%)
kWh 166
100.0%

Length

2024-05-11T02:09:22.071531image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T02:09:22.480198image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
kwh 166
100.0%

작업일시
Categorical

CONSTANT 

Distinct1
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2024-05-10 20:56:50.0
166 

Length

Max length21
Median length21
Mean length21
Min length21

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2024-05-10 20:56:50.0
2nd row2024-05-10 20:56:50.0
3rd row2024-05-10 20:56:50.0
4th row2024-05-10 20:56:50.0
5th row2024-05-10 20:56:50.0

Common Values

ValueCountFrequency (%)
2024-05-10 20:56:50.0 166
100.0%

Length

2024-05-11T02:09:22.828264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T02:09:23.189755image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2024-05-10 166
50.0%
20:56:50.0 166
50.0%

Interactions

2024-05-11T02:09:17.801952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-11T02:09:23.406231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
년월지사코드지사명전기판매량
년월1.0000.0000.0000.548
지사코드0.0001.0001.0000.515
지사명0.0001.0001.0000.515
전기판매량0.5480.5150.5151.000
2024-05-11T02:09:23.728337image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지사코드지사명
지사코드1.0000.988
지사명0.9881.000
2024-05-11T02:09:24.001247image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
전기판매량지사코드지사명
전기판매량1.0000.3860.386
지사코드0.3861.0000.988
지사명0.3860.9881.000

Missing values

2024-05-11T02:09:18.234523image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-11T02:09:18.546370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

년월지사코드지사명전기판매량단위작업일시
02023-102동부지사0.0kWh2024-05-10 20:56:50.0
12023-101서부지사0.0kWh2024-05-10 20:56:50.0
22023-092동부지사30071.0kWh2024-05-10 20:56:50.0
32023-091서부지사72.0kWh2024-05-10 20:56:50.0
42023-082동부지사21016.0kWh2024-05-10 20:56:50.0
52023-081서부지사122616.0kWh2024-05-10 20:56:50.0
62023-072동부지사12781.0kWh2024-05-10 20:56:50.0
72023-071서부지사0.0kWh2024-05-10 20:56:50.0
82023-062동부지사1041033.43kWh2024-05-10 20:56:50.0
92023-061서부지사0.0kWh2024-05-10 20:56:50.0
년월지사코드지사명전기판매량단위작업일시
1562017-041서부지사0.0kWh2024-05-10 20:56:50.0
1572017-042동부지사8920722.12kWh2024-05-10 20:56:50.0
1582017-032동부지사14314669.68kWh2024-05-10 20:56:50.0
1592017-031서부지사8505360.0kWh2024-05-10 20:56:50.0
1602017-021서부지사9748656.0kWh2024-05-10 20:56:50.0
1612017-022동부지사10861612.08kWh2024-05-10 20:56:50.0
1622017-012동부지사11486266.32kWh2024-05-10 20:56:50.0
1632017-011서부지사11129040.0kWh2024-05-10 20:56:50.0
1642016-122동부지사16610322.51kWh2024-05-10 20:56:50.0
1652016-121서부지사11125512.0kWh2024-05-10 20:56:50.0