Overview

Dataset statistics

Number of variables7
Number of observations426
Missing cells32
Missing cells (%)1.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory23.8 KiB
Average record size in memory57.3 B

Variable types

DateTime1
Categorical5
Numeric1

Dataset

Description지역별 전기사용자 수용호수, 계약전력 등 전기수용호수 현황 자료를 제공합니다. (공급지역, 지역구분, 계정명, 용도명, 호수용량, 단위)
Author한국지역난방공사
URLhttps://www.data.go.kr/data/15003053/fileData.do

Alerts

지역구분 has constant value ""Constant
단위 is highly overall correlated with 계정명High correlation
계정명 is highly overall correlated with 단위High correlation
호수및용량 has 32 (7.5%) missing valuesMissing
호수및용량 has 74 (17.4%) zerosZeros

Reproduction

Analysis started2023-12-12 19:56:18.579525
Analysis finished2023-12-12 19:56:19.398742
Duration0.82 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct7
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Memory size3.5 KiB
Minimum2014-12-31 00:00:00
Maximum2021-12-31 00:00:00
2023-12-13T04:56:19.480058image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:56:19.625201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)

공급지역
Categorical

Distinct4
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size3.5 KiB
삼송지사
108 
동남권
108 
가락한라
106 
상암2지구
104 

Length

Max length5
Median length4
Mean length3.9906103
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row삼송지사
2nd row삼송지사
3rd row삼송지사
4th row삼송지사
5th row삼송지사

Common Values

ValueCountFrequency (%)
삼송지사 108
25.4%
동남권 108
25.4%
가락한라 106
24.9%
상암2지구 104
24.4%

Length

2023-12-13T04:56:19.811405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:56:19.968848image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
삼송지사 108
25.4%
동남권 108
25.4%
가락한라 106
24.9%
상암2지구 104
24.4%

지역구분
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size3.5 KiB
구역전기
426 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row구역전기
2nd row구역전기
3rd row구역전기
4th row구역전기
5th row구역전기

Common Values

ValueCountFrequency (%)
구역전기 426
100.0%

Length

2023-12-13T04:56:20.118396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:56:20.243393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
구역전기 426
100.0%

계정명
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size3.5 KiB
사용자수
213 
계약전력
213 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row사용자수
2nd row사용자수
3rd row사용자수
4th row사용자수
5th row사용자수

Common Values

ValueCountFrequency (%)
사용자수 213
50.0%
계약전력 213
50.0%

Length

2023-12-13T04:56:20.373385image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:56:20.506679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사용자수 213
50.0%
계약전력 213
50.0%

용도명
Categorical

Distinct9
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size3.5 KiB
주택용
56 
일반용
56 
산업용
56 
가로등
56 
교육용
56 
Other values (4)
146 

Length

Max length4
Median length3
Mean length3.0469484
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row주택용
2nd row일반용
3rd row산업용
4th row가로등
5th row교육용

Common Values

ValueCountFrequency (%)
주택용 56
13.1%
일반용 56
13.1%
산업용 56
13.1%
가로등 56
13.1%
교육용 56
13.1%
농사용 54
12.7%
임시전력 36
8.5%
기타 36
8.5%
임시동력 20
 
4.7%

Length

2023-12-13T04:56:20.662842image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:56:20.824442image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
주택용 56
13.1%
일반용 56
13.1%
산업용 56
13.1%
가로등 56
13.1%
교육용 56
13.1%
농사용 54
12.7%
임시전력 36
8.5%
기타 36
8.5%
임시동력 20
 
4.7%

호수및용량
Real number (ℝ)

MISSING  ZEROS 

Distinct184
Distinct (%)46.7%
Missing32
Missing (%)7.5%
Infinite0
Infinite (%)0.0%
Mean7026.3122
Minimum0
Maximum147333
Zeros74
Zeros (%)17.4%
Negative0
Negative (%)0.0%
Memory size3.9 KiB
2023-12-13T04:56:21.039159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11
median37
Q31108.25
95-th percentile40792.25
Maximum147333
Range147333
Interquartile range (IQR)1107.25

Descriptive statistics

Standard deviation22131.177
Coefficient of variation (CV)3.1497572
Kurtosis18.85274
Mean7026.3122
Median Absolute Deviation (MAD)37
Skewness4.2435856
Sum2768367
Variance4.8978901 × 108
MonotonicityNot monotonic
2023-12-13T04:56:21.255670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 74
 
17.4%
1 34
 
8.0%
7 9
 
2.1%
8 9
 
2.1%
23 7
 
1.6%
89 7
 
1.6%
950 7
 
1.6%
130 7
 
1.6%
611 7
 
1.6%
11 6
 
1.4%
Other values (174) 227
53.3%
(Missing) 32
 
7.5%
ValueCountFrequency (%)
0 74
17.4%
1 34
8.0%
2 2
 
0.5%
3 5
 
1.2%
4 1
 
0.2%
5 2
 
0.5%
6 6
 
1.4%
7 9
 
2.1%
8 9
 
2.1%
9 1
 
0.2%
ValueCountFrequency (%)
147333 1
0.2%
146291 1
0.2%
131756 1
0.2%
128273 1
0.2%
124598 1
0.2%
116064 1
0.2%
110052 1
0.2%
101457 1
0.2%
91284 1
0.2%
84261 2
0.5%

단위
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size3.5 KiB
개소
213 
kW
213 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row개소
2nd row개소
3rd row개소
4th row개소
5th row개소

Common Values

ValueCountFrequency (%)
개소 213
50.0%
kW 213
50.0%

Length

2023-12-13T04:56:21.459909image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:56:21.599538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
개소 213
50.0%
kw 213
50.0%

Interactions

2023-12-13T04:56:18.941682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T04:56:21.694899image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준일자공급지역계정명용도명호수및용량단위
기준일자1.0000.0000.0000.1570.0000.000
공급지역0.0001.0000.0000.0000.3490.000
계정명0.0000.0001.0000.0000.3481.000
용도명0.1570.0000.0001.0000.2710.000
호수및용량0.0000.3490.3480.2711.0000.348
단위0.0000.0001.0000.0000.3481.000
2023-12-13T04:56:21.868025image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
용도명단위계정명공급지역
용도명1.0000.0000.0000.000
단위0.0001.0000.9950.000
계정명0.0000.9951.0000.000
공급지역0.0000.0000.0001.000
2023-12-13T04:56:22.023937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
호수및용량공급지역계정명용도명단위
호수및용량1.0000.2140.2640.1260.264
공급지역0.2141.0000.0000.0000.000
계정명0.2640.0001.0000.0000.995
용도명0.1260.0000.0001.0000.000
단위0.2640.0000.9950.0001.000

Missing values

2023-12-13T04:56:19.126098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T04:56:19.330032image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기준일자공급지역지역구분계정명용도명호수및용량단위
02021-12-31삼송지사구역전기사용자수주택용4036개소
12021-12-31삼송지사구역전기사용자수일반용2296개소
22021-12-31삼송지사구역전기사용자수산업용128개소
32021-12-31삼송지사구역전기사용자수가로등551개소
42021-12-31삼송지사구역전기사용자수교육용19개소
52021-12-31삼송지사구역전기사용자수농사용68개소
62021-12-31삼송지사구역전기사용자수임시전력36개소
72021-12-31삼송지사구역전기사용자수기타1049개소
82021-12-31삼송지사구역전기계약전력주택용124598kW
92021-12-31삼송지사구역전기계약전력일반용146291kW
기준일자공급지역지역구분계정명용도명호수및용량단위
4162014-12-31가락한라구역전기사용자수농사용<NA>개소
4172014-12-31가락한라구역전기사용자수가로등1개소
4182014-12-31가락한라구역전기사용자수임시동력<NA>개소
4192014-12-31가락한라구역전기계약전력주택용3500kW
4202014-12-31가락한라구역전기계약전력산업용130kW
4212014-12-31가락한라구역전기계약전력일반용330kW
4222014-12-31가락한라구역전기계약전력교육용<NA>kW
4232014-12-31가락한라구역전기계약전력농사용<NA>kW
4242014-12-31가락한라구역전기계약전력가로등8kW
4252014-12-31가락한라구역전기계약전력임시동력<NA>kW