Overview

Dataset statistics

Number of variables4
Number of observations1150
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory38.3 KiB
Average record size in memory34.1 B

Variable types

Categorical2
Text1
Numeric1

Dataset

Description김해시에서 통계기반 도시현황 파악을 위해 개발한 통계지수 중 하나로서, 통계연도, 시도명, 시군구명, 전력사용량(킬로와트시)으로 구성되어 있습니다. 김해시 중심의 통계지수로서, 데이터 수집, 가공 등의 어려움으로 김해시 외 지역의 정보는 누락될 수 있습니다.
Author경상남도 김해시
URLhttps://www.data.go.kr/data/15110110/fileData.do

Alerts

전력사용량(킬로와트시) has unique valuesUnique

Reproduction

Analysis started2023-12-12 14:05:58.304916
Analysis finished2023-12-12 14:05:58.761407
Duration0.46 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

통계연도
Categorical

Distinct5
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size9.1 KiB
2020
230 
2019
230 
2017
230 
2018
230 
2021
230 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020
2nd row2019
3rd row2017
4th row2019
5th row2018

Common Values

ValueCountFrequency (%)
2020 230
20.0%
2019 230
20.0%
2017 230
20.0%
2018 230
20.0%
2021 230
20.0%

Length

2023-12-12T23:05:58.828891image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:05:58.927492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020 230
20.0%
2019 230
20.0%
2017 230
20.0%
2018 230
20.0%
2021 230
20.0%

시도명
Categorical

Distinct18
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Memory size9.1 KiB
경기도
155 
서울특별시
125 
경상북도
115 
전라남도
110 
강원도
90 
Other values (13)
555 

Length

Max length7
Median length5
Mean length4.1478261
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row강원도
2nd row서울특별시
3rd row대구광역시
4th row경기도
5th row전라남도

Common Values

ValueCountFrequency (%)
경기도 155
13.5%
서울특별시 125
10.9%
경상북도 115
10.0%
전라남도 110
9.6%
강원도 90
7.8%
경상남도 90
7.8%
부산광역시 80
7.0%
충청남도 77
6.7%
전라북도 70
 
6.1%
충청북도 55
 
4.8%
Other values (8) 183
15.9%

Length

2023-12-12T23:05:59.047606image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기도 155
13.5%
서울특별시 125
10.9%
경상북도 115
10.0%
전라남도 110
9.6%
강원도 90
7.8%
경상남도 90
7.8%
부산광역시 80
7.0%
충청남도 77
6.7%
전라북도 70
 
6.1%
충청북도 55
 
4.8%
Other values (8) 183
15.9%
Distinct209
Distinct (%)18.2%
Missing0
Missing (%)0.0%
Memory size9.1 KiB
2023-12-12T23:05:59.396102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length3
Mean length2.9313043
Min length2

Characters and Unicode

Total characters3371
Distinct characters134
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row횡성군
2nd row구로구
3rd row수성구
4th row평택시
5th row영광군
ValueCountFrequency (%)
동구 30
 
2.6%
중구 30
 
2.6%
서구 25
 
2.2%
남구 22
 
1.9%
북구 20
 
1.7%
고성군 10
 
0.9%
강서구 10
 
0.9%
무안군 5
 
0.4%
화천군 5
 
0.4%
예산군 5
 
0.4%
Other values (199) 988
85.9%
2023-12-12T23:05:59.877184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
427
 
12.7%
398
 
11.8%
370
 
11.0%
110
 
3.3%
100
 
3.0%
93
 
2.8%
90
 
2.7%
85
 
2.5%
80
 
2.4%
65
 
1.9%
Other values (124) 1553
46.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3371
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
427
 
12.7%
398
 
11.8%
370
 
11.0%
110
 
3.3%
100
 
3.0%
93
 
2.8%
90
 
2.7%
85
 
2.5%
80
 
2.4%
65
 
1.9%
Other values (124) 1553
46.1%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3371
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
427
 
12.7%
398
 
11.8%
370
 
11.0%
110
 
3.3%
100
 
3.0%
93
 
2.8%
90
 
2.7%
85
 
2.5%
80
 
2.4%
65
 
1.9%
Other values (124) 1553
46.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3371
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
427
 
12.7%
398
 
11.8%
370
 
11.0%
110
 
3.3%
100
 
3.0%
93
 
2.8%
90
 
2.7%
85
 
2.5%
80
 
2.4%
65
 
1.9%
Other values (124) 1553
46.1%

전력사용량(킬로와트시)
Real number (ℝ)

UNIQUE 

Distinct1150
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0633568 × 108
Minimum0
Maximum1.8479076 × 109
Zeros1
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size10.2 KiB
2023-12-12T23:06:00.055498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile25596072
Q158460548
median1.2295455 × 108
Q32.3400899 × 108
95-th percentile8.2680173 × 108
Maximum1.8479076 × 109
Range1.8479076 × 109
Interquartile range (IQR)1.7554844 × 108

Descriptive statistics

Standard deviation2.6081753 × 108
Coefficient of variation (CV)1.2640447
Kurtosis9.140848
Mean2.0633568 × 108
Median Absolute Deviation (MAD)73905751
Skewness2.8470496
Sum2.3728603 × 1011
Variance6.8025783 × 1016
MonotonicityNot monotonic
2023-12-12T23:06:00.560274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
64562810 1
 
0.1%
195783104 1
 
0.1%
158140948 1
 
0.1%
867608757 1
 
0.1%
249179012 1
 
0.1%
531051348 1
 
0.1%
81345252 1
 
0.1%
375370680 1
 
0.1%
380956401 1
 
0.1%
60491498 1
 
0.1%
Other values (1140) 1140
99.1%
ValueCountFrequency (%)
0 1
0.1%
1359 1
0.1%
1389 1
0.1%
10598 1
0.1%
525879 1
0.1%
4838514 1
0.1%
5267364 1
0.1%
5411095 1
0.1%
5506069 1
0.1%
5839109 1
0.1%
ValueCountFrequency (%)
1847907632 1
0.1%
1703596951 1
0.1%
1684425441 1
0.1%
1671362250 1
0.1%
1519379348 1
0.1%
1417190743 1
0.1%
1399860492 1
0.1%
1366643638 1
0.1%
1356803673 1
0.1%
1351999215 1
0.1%

Interactions

2023-12-12T23:05:58.494699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T23:06:00.667435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
통계연도시도명전력사용량(킬로와트시)
통계연도1.0000.0000.000
시도명0.0001.0000.521
전력사용량(킬로와트시)0.0000.5211.000
2023-12-12T23:06:00.763600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시도명통계연도
시도명1.0000.000
통계연도0.0001.000
2023-12-12T23:06:00.848282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
전력사용량(킬로와트시)통계연도시도명
전력사용량(킬로와트시)1.0000.0000.228
통계연도0.0001.0000.000
시도명0.2280.0001.000

Missing values

2023-12-12T23:05:58.631455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T23:05:58.724913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

통계연도시도명시군구명전력사용량(킬로와트시)
02020강원도횡성군64562810
12019서울특별시구로구180921979
22017대구광역시수성구149080941
32019경기도평택시1006969096
42018전라남도영광군92117882
52018전라남도보성군29509996
62020경상북도영주시139244845
72020경상남도통영시88806624
82019전라남도진도군31280274
92018경기도안산시790597234
통계연도시도명시군구명전력사용량(킬로와트시)
11402021대구광역시달서구357677357
11412019경상북도경산시261468218
11422020강원도영월군86823398
11432020충청남도보령시117807466
11442019인천광역시부평구223237317
11452018전라남도곡성군39036761
11462018대전광역시대덕구249443166
11472021서울특별시은평구135204631
11482017대구광역시북구215996742
11492017경기도양주시212834174