Overview

Dataset statistics

Number of variables14
Number of observations26
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.0 KiB
Average record size in memory119.1 B

Variable types

Numeric2
Categorical4
Text8

Dataset

Description에너지다소비사업자의 에너지사용량신고에 대한 데이터로 지역(권역)별, 연도별, 부문별 다소비사업자의 ‘요’설비 에너지사용량에 대한 데이터 개방
Author한국에너지공단
URLhttps://www.data.go.kr/data/15086733/fileData.do

Alerts

총합계 is highly overall correlated with 업종 and 2 other fieldsHigh correlation
업종 is highly overall correlated with 총합계 and 1 other fieldsHigh correlation
강원 is highly overall correlated with 총합계 and 1 other fieldsHigh correlation
대전 is highly overall correlated with 업종High correlation
세종 is highly overall correlated with 총합계 and 1 other fieldsHigh correlation
세종 is highly imbalanced (76.5%)Imbalance
총합계 has unique valuesUnique

Reproduction

Analysis started2023-12-12 15:01:53.652365
Analysis finished2023-12-12 15:01:54.868798
Duration1.22 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연도
Real number (ℝ)

Distinct7
Distinct (%)26.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2019.1923
Minimum2016
Maximum2022
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size366.0 B
2023-12-13T00:01:54.918294image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2016
5-th percentile2016
Q12018
median2019
Q32021
95-th percentile2022
Maximum2022
Range6
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.9802875
Coefficient of variation (CV)0.00098073247
Kurtosis-1.1409954
Mean2019.1923
Median Absolute Deviation (MAD)2
Skewness-0.12134113
Sum52499
Variance3.9215385
MonotonicityIncreasing
2023-12-13T00:01:55.032572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
2018 4
15.4%
2019 4
15.4%
2020 4
15.4%
2021 4
15.4%
2022 4
15.4%
2016 3
11.5%
2017 3
11.5%
ValueCountFrequency (%)
2016 3
11.5%
2017 3
11.5%
2018 4
15.4%
2019 4
15.4%
2020 4
15.4%
2021 4
15.4%
2022 4
15.4%
ValueCountFrequency (%)
2022 4
15.4%
2021 4
15.4%
2020 4
15.4%
2019 4
15.4%
2018 4
15.4%
2017 3
11.5%
2016 3
11.5%

업종
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)15.4%
Missing0
Missing (%)0.0%
Memory size340.0 B
금속
요업
화공
산업기타

Length

Max length4
Median length2
Mean length2.3846154
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row금속
2nd row요업
3rd row화공
4th row금속
5th row요업

Common Values

ValueCountFrequency (%)
금속 7
26.9%
요업 7
26.9%
화공 7
26.9%
산업기타 5
19.2%

Length

2023-12-13T00:01:55.174210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:01:55.299769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
금속 7
26.9%
요업 7
26.9%
화공 7
26.9%
산업기타 5
19.2%

강원
Categorical

HIGH CORRELATION 

Distinct8
Distinct (%)30.8%
Missing0
Missing (%)0.0%
Memory size340.0 B
해당없음
19 
2052010.994
 
1
1,839,150
 
1
1680424.7
 
1
2005603.446
 
1
Other values (3)

Length

Max length11
Median length4
Mean length5.6538462
Min length4

Unique

Unique7 ?
Unique (%)26.9%

Sample

1st row해당없음
2nd row2052010.994
3rd row해당없음
4th row해당없음
5th row1,839,150

Common Values

ValueCountFrequency (%)
해당없음 19
73.1%
2052010.994 1
 
3.8%
1,839,150 1
 
3.8%
1680424.7 1
 
3.8%
2005603.446 1
 
3.8%
1,732,353 1
 
3.8%
1589984.279 1
 
3.8%
1538788.784 1
 
3.8%

Length

2023-12-13T00:01:55.480677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:01:55.636992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
해당없음 19
73.1%
2052010.994 1
 
3.8%
1,839,150 1
 
3.8%
1680424.7 1
 
3.8%
2005603.446 1
 
3.8%
1,732,353 1
 
3.8%
1589984.279 1
 
3.8%
1538788.784 1
 
3.8%

경기
Text

Distinct15
Distinct (%)57.7%
Missing0
Missing (%)0.0%
Memory size340.0 B
2023-12-13T00:01:55.806401image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length8
Mean length5.5
Min length2

Characters and Unicode

Total characters143
Distinct characters16
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique14 ?
Unique (%)53.8%

Sample

1st row해당없음
2nd row13141.769
3rd row21.903
4th row해당없음
5th row14,573
ValueCountFrequency (%)
해당없음 12
46.2%
13141.769 1
 
3.8%
21.903 1
 
3.8%
14,573 1
 
3.8%
27.887804 1
 
3.8%
13399.598 1
 
3.8%
29.0962 1
 
3.8%
12307.097 1
 
3.8%
20 1
 
3.8%
9,703 1
 
3.8%
Other values (5) 5
19.2%
2023-12-13T00:01:56.136724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9 15
10.5%
12
 
8.4%
12
 
8.4%
12
 
8.4%
12
 
8.4%
1 11
 
7.7%
3 11
 
7.7%
. 11
 
7.7%
7 8
 
5.6%
2 8
 
5.6%
Other values (6) 31
21.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 82
57.3%
Other Letter 48
33.6%
Other Punctuation 13
 
9.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
9 15
18.3%
1 11
13.4%
3 11
13.4%
7 8
9.8%
2 8
9.8%
0 8
9.8%
8 7
8.5%
4 6
 
7.3%
5 6
 
7.3%
6 2
 
2.4%
Other Letter
ValueCountFrequency (%)
12
25.0%
12
25.0%
12
25.0%
12
25.0%
Other Punctuation
ValueCountFrequency (%)
. 11
84.6%
, 2
 
15.4%

Most occurring scripts

ValueCountFrequency (%)
Common 95
66.4%
Hangul 48
33.6%

Most frequent character per script

Common
ValueCountFrequency (%)
9 15
15.8%
1 11
11.6%
3 11
11.6%
. 11
11.6%
7 8
8.4%
2 8
8.4%
0 8
8.4%
8 7
7.4%
4 6
 
6.3%
5 6
 
6.3%
Other values (2) 4
 
4.2%
Hangul
ValueCountFrequency (%)
12
25.0%
12
25.0%
12
25.0%
12
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 95
66.4%
Hangul 48
33.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9 15
15.8%
1 11
11.6%
3 11
11.6%
. 11
11.6%
7 8
8.4%
2 8
8.4%
0 8
8.4%
8 7
7.4%
4 6
 
6.3%
5 6
 
6.3%
Other values (2) 4
 
4.2%
Hangul
ValueCountFrequency (%)
12
25.0%
12
25.0%
12
25.0%
12
25.0%

경남
Text

Distinct18
Distinct (%)69.2%
Missing0
Missing (%)0.0%
Memory size340.0 B
2023-12-13T00:01:56.310404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length8
Mean length5.7307692
Min length3

Characters and Unicode

Total characters149
Distinct characters16
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique17 ?
Unique (%)65.4%

Sample

1st row1607.5
2nd row11475.018
3rd row해당없음
4th row1,646
5th row11,535
ValueCountFrequency (%)
해당없음 9
34.6%
1607.5 1
 
3.8%
1,152 1
 
3.8%
1416.41 1
 
3.8%
815.2 1
 
3.8%
16048.309 1
 
3.8%
860.162 1
 
3.8%
906.2674 1
 
3.8%
11,436 1
 
3.8%
913 1
 
3.8%
Other values (8) 8
30.8%
2023-12-13T00:01:56.683348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 24
16.1%
6 12
 
8.1%
. 11
 
7.4%
2 11
 
7.4%
4 10
 
6.7%
9
 
6.0%
9
 
6.0%
9
 
6.0%
9
 
6.0%
0 9
 
6.0%
Other values (6) 36
24.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 98
65.8%
Other Letter 36
 
24.2%
Other Punctuation 15
 
10.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 24
24.5%
6 12
12.2%
2 11
11.2%
4 10
10.2%
0 9
 
9.2%
3 8
 
8.2%
5 7
 
7.1%
7 6
 
6.1%
8 6
 
6.1%
9 5
 
5.1%
Other Letter
ValueCountFrequency (%)
9
25.0%
9
25.0%
9
25.0%
9
25.0%
Other Punctuation
ValueCountFrequency (%)
. 11
73.3%
, 4
 
26.7%

Most occurring scripts

ValueCountFrequency (%)
Common 113
75.8%
Hangul 36
 
24.2%

Most frequent character per script

Common
ValueCountFrequency (%)
1 24
21.2%
6 12
10.6%
. 11
9.7%
2 11
9.7%
4 10
8.8%
0 9
 
8.0%
3 8
 
7.1%
5 7
 
6.2%
7 6
 
5.3%
8 6
 
5.3%
Other values (2) 9
 
8.0%
Hangul
ValueCountFrequency (%)
9
25.0%
9
25.0%
9
25.0%
9
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 113
75.8%
Hangul 36
 
24.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 24
21.2%
6 12
10.6%
. 11
9.7%
2 11
9.7%
4 10
8.8%
0 9
 
8.0%
3 8
 
7.1%
5 7
 
6.2%
7 6
 
5.3%
8 6
 
5.3%
Other values (2) 9
 
8.0%
Hangul
ValueCountFrequency (%)
9
25.0%
9
25.0%
9
25.0%
9
25.0%

경북
Text

Distinct22
Distinct (%)84.6%
Missing0
Missing (%)0.0%
Memory size340.0 B
2023-12-13T00:01:56.860931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length9
Mean length7.4615385
Min length3

Characters and Unicode

Total characters194
Distinct characters16
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique21 ?
Unique (%)80.8%

Sample

1st row9594.01
2nd row52330.295
3rd row6817.164836
4th row10,059
5th row51,054
ValueCountFrequency (%)
해당없음 5
19.2%
9594.01 1
 
3.8%
7,161 1
 
3.8%
50596.07745 1
 
3.8%
827.479 1
 
3.8%
8072.854724 1
 
3.8%
49620.322 1
 
3.8%
801.283 1
 
3.8%
5183.707646 1
 
3.8%
45,202 1
 
3.8%
Other values (12) 12
46.2%
2023-12-13T00:01:57.229268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 20
10.3%
2 18
9.3%
1 17
8.8%
7 17
8.8%
4 16
8.2%
. 16
8.2%
6 16
8.2%
5 15
7.7%
8 15
7.7%
3 11
 
5.7%
Other values (6) 33
17.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 154
79.4%
Other Punctuation 20
 
10.3%
Other Letter 20
 
10.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 20
13.0%
2 18
11.7%
1 17
11.0%
7 17
11.0%
4 16
10.4%
6 16
10.4%
5 15
9.7%
8 15
9.7%
3 11
7.1%
9 9
5.8%
Other Letter
ValueCountFrequency (%)
5
25.0%
5
25.0%
5
25.0%
5
25.0%
Other Punctuation
ValueCountFrequency (%)
. 16
80.0%
, 4
 
20.0%

Most occurring scripts

ValueCountFrequency (%)
Common 174
89.7%
Hangul 20
 
10.3%

Most frequent character per script

Common
ValueCountFrequency (%)
0 20
11.5%
2 18
10.3%
1 17
9.8%
7 17
9.8%
4 16
9.2%
. 16
9.2%
6 16
9.2%
5 15
8.6%
8 15
8.6%
3 11
6.3%
Other values (2) 13
7.5%
Hangul
ValueCountFrequency (%)
5
25.0%
5
25.0%
5
25.0%
5
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 174
89.7%
Hangul 20
 
10.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 20
11.5%
2 18
10.3%
1 17
9.8%
7 17
9.8%
4 16
9.2%
. 16
9.2%
6 16
9.2%
5 15
8.6%
8 15
8.6%
3 11
6.3%
Other values (2) 13
7.5%
Hangul
ValueCountFrequency (%)
5
25.0%
5
25.0%
5
25.0%
5
25.0%

대전
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)38.5%
Missing0
Missing (%)0.0%
Memory size340.0 B
해당없음
12 
30.1
6.897
 
1
5
 
1
1.625
 
1
Other values (5)

Length

Max length6
Median length4
Mean length3.8461538
Min length1

Unique

Unique8 ?
Unique (%)30.8%

Sample

1st row6.897
2nd row해당없음
3rd row30.1
4th row5
5th row해당없음

Common Values

ValueCountFrequency (%)
해당없음 12
46.2%
30.1 6
23.1%
6.897 1
 
3.8%
5 1
 
3.8%
1.625 1
 
3.8%
9.865 1
 
3.8%
30 1
 
3.8%
6 1
 
3.8%
4.5 1
 
3.8%
19.212 1
 
3.8%

Length

2023-12-13T00:01:57.381478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:01:57.542717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
해당없음 12
46.2%
30.1 6
23.1%
6.897 1
 
3.8%
5 1
 
3.8%
1.625 1
 
3.8%
9.865 1
 
3.8%
30 1
 
3.8%
6 1
 
3.8%
4.5 1
 
3.8%
19.212 1
 
3.8%

세종
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)7.7%
Missing0
Missing (%)0.0%
Memory size340.0 B
해당없음
25 
2838.7936
 
1

Length

Max length9
Median length4
Mean length4.1923077
Min length4

Unique

Unique1 ?
Unique (%)3.8%

Sample

1st row해당없음
2nd row해당없음
3rd row해당없음
4th row해당없음
5th row해당없음

Common Values

ValueCountFrequency (%)
해당없음 25
96.2%
2838.7936 1
 
3.8%

Length

2023-12-13T00:01:57.724101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:01:57.845729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
해당없음 25
96.2%
2838.7936 1
 
3.8%

울산
Text

Distinct21
Distinct (%)80.8%
Missing0
Missing (%)0.0%
Memory size340.0 B
2023-12-13T00:01:58.015006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length10
Mean length6.8461538
Min length4

Characters and Unicode

Total characters178
Distinct characters15
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique20 ?
Unique (%)76.9%

Sample

1st row4075.28542
2nd row5984
3rd row7127.503
4th row3391.103536
5th row4581
ValueCountFrequency (%)
해당없음 6
23.1%
4075.28542 1
 
3.8%
5002 1
 
3.8%
11361.84296 1
 
3.8%
25037.853 1
 
3.8%
4376 1
 
3.8%
10483.9053 1
 
3.8%
18553.965 1
 
3.8%
4372 1
 
3.8%
6005.63813 1
 
3.8%
Other values (11) 11
42.3%
2023-12-13T00:01:58.361658image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 22
12.4%
1 16
9.0%
3 16
9.0%
8 15
8.4%
. 14
7.9%
2 14
7.9%
6 13
 
7.3%
9 12
 
6.7%
4 11
 
6.2%
0 11
 
6.2%
Other values (5) 34
19.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 140
78.7%
Other Letter 24
 
13.5%
Other Punctuation 14
 
7.9%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 22
15.7%
1 16
11.4%
3 16
11.4%
8 15
10.7%
2 14
10.0%
6 13
9.3%
9 12
8.6%
4 11
7.9%
0 11
7.9%
7 10
7.1%
Other Letter
ValueCountFrequency (%)
6
25.0%
6
25.0%
6
25.0%
6
25.0%
Other Punctuation
ValueCountFrequency (%)
. 14
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 154
86.5%
Hangul 24
 
13.5%

Most frequent character per script

Common
ValueCountFrequency (%)
5 22
14.3%
1 16
10.4%
3 16
10.4%
8 15
9.7%
. 14
9.1%
2 14
9.1%
6 13
8.4%
9 12
7.8%
4 11
7.1%
0 11
7.1%
Hangul
ValueCountFrequency (%)
6
25.0%
6
25.0%
6
25.0%
6
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 154
86.5%
Hangul 24
 
13.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 22
14.3%
1 16
10.4%
3 16
10.4%
8 15
9.7%
. 14
9.1%
2 14
9.1%
6 13
8.4%
9 12
7.8%
4 11
7.1%
0 11
7.1%
Hangul
ValueCountFrequency (%)
6
25.0%
6
25.0%
6
25.0%
6
25.0%

전남
Text

Distinct23
Distinct (%)88.5%
Missing0
Missing (%)0.0%
Memory size340.0 B
2023-12-13T00:01:58.538582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length9
Mean length7.4615385
Min length4

Characters and Unicode

Total characters194
Distinct characters15
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique22 ?
Unique (%)84.6%

Sample

1st row5233.98783
2nd row559.543
3rd row4522.752
4th row5137.728
5th row650.266
ValueCountFrequency (%)
해당없음 4
 
15.4%
772.059 1
 
3.8%
793.822 1
 
3.8%
6165.833048 1
 
3.8%
6862.967 1
 
3.8%
782.127 1
 
3.8%
5640.58567 1
 
3.8%
8772.461 1
 
3.8%
833.49 1
 
3.8%
5441.428552 1
 
3.8%
Other values (13) 13
50.0%
2023-12-13T00:01:58.964607image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 26
13.4%
2 22
11.3%
. 21
10.8%
7 21
10.8%
8 20
10.3%
3 17
8.8%
4 12
6.2%
6 12
6.2%
0 10
 
5.2%
9 9
 
4.6%
Other values (5) 24
12.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 157
80.9%
Other Punctuation 21
 
10.8%
Other Letter 16
 
8.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 26
16.6%
2 22
14.0%
7 21
13.4%
8 20
12.7%
3 17
10.8%
4 12
7.6%
6 12
7.6%
0 10
 
6.4%
9 9
 
5.7%
1 8
 
5.1%
Other Letter
ValueCountFrequency (%)
4
25.0%
4
25.0%
4
25.0%
4
25.0%
Other Punctuation
ValueCountFrequency (%)
. 21
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 178
91.8%
Hangul 16
 
8.2%

Most frequent character per script

Common
ValueCountFrequency (%)
5 26
14.6%
2 22
12.4%
. 21
11.8%
7 21
11.8%
8 20
11.2%
3 17
9.6%
4 12
6.7%
6 12
6.7%
0 10
 
5.6%
9 9
 
5.1%
Hangul
ValueCountFrequency (%)
4
25.0%
4
25.0%
4
25.0%
4
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 178
91.8%
Hangul 16
 
8.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 26
14.6%
2 22
12.4%
. 21
11.8%
7 21
11.8%
8 20
11.2%
3 17
9.6%
4 12
6.7%
6 12
6.7%
0 10
 
5.6%
9 9
 
5.1%
Hangul
ValueCountFrequency (%)
4
25.0%
4
25.0%
4
25.0%
4
25.0%

전북
Text

Distinct15
Distinct (%)57.7%
Missing0
Missing (%)0.0%
Memory size340.0 B
2023-12-13T00:01:59.146775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length7
Mean length6.0769231
Min length4

Characters and Unicode

Total characters158
Distinct characters15
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique14 ?
Unique (%)53.8%

Sample

1st row2658.932
2nd row3054.947
3rd row해당없음
4th row3028.51
5th row2680.545
ValueCountFrequency (%)
해당없음 12
46.2%
2658.932 1
 
3.8%
3054.947 1
 
3.8%
3028.51 1
 
3.8%
2680.545 1
 
3.8%
2502.041 1
 
3.8%
2725.821 1
 
3.8%
2521.464 1
 
3.8%
1980.825 1
 
3.8%
1871.779 1
 
3.8%
Other values (5) 5
19.2%
2023-12-13T00:01:59.503924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 19
12.0%
. 14
 
8.9%
12
 
7.6%
12
 
7.6%
12
 
7.6%
12
 
7.6%
6 10
 
6.3%
5 10
 
6.3%
9 10
 
6.3%
8 9
 
5.7%
Other values (5) 38
24.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 96
60.8%
Other Letter 48
30.4%
Other Punctuation 14
 
8.9%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 19
19.8%
6 10
10.4%
5 10
10.4%
9 10
10.4%
8 9
9.4%
7 9
9.4%
1 9
9.4%
3 8
8.3%
0 6
 
6.2%
4 6
 
6.2%
Other Letter
ValueCountFrequency (%)
12
25.0%
12
25.0%
12
25.0%
12
25.0%
Other Punctuation
ValueCountFrequency (%)
. 14
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 110
69.6%
Hangul 48
30.4%

Most frequent character per script

Common
ValueCountFrequency (%)
2 19
17.3%
. 14
12.7%
6 10
9.1%
5 10
9.1%
9 10
9.1%
8 9
8.2%
7 9
8.2%
1 9
8.2%
3 8
7.3%
0 6
 
5.5%
Hangul
ValueCountFrequency (%)
12
25.0%
12
25.0%
12
25.0%
12
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 110
69.6%
Hangul 48
30.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 19
17.3%
. 14
12.7%
6 10
9.1%
5 10
9.1%
9 10
9.1%
8 9
8.2%
7 9
8.2%
1 9
8.2%
3 8
7.3%
0 6
 
5.5%
Hangul
ValueCountFrequency (%)
12
25.0%
12
25.0%
12
25.0%
12
25.0%

충남
Text

Distinct15
Distinct (%)57.7%
Missing0
Missing (%)0.0%
Memory size340.0 B
2023-12-13T00:01:59.678897image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length10
Mean length6.2307692
Min length4

Characters and Unicode

Total characters162
Distinct characters15
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique14 ?
Unique (%)53.8%

Sample

1st row76.97
2nd row13770.79436
3rd row해당없음
4th row72.366248
5th row7796.791
ValueCountFrequency (%)
해당없음 12
46.2%
76.97 1
 
3.8%
13770.79436 1
 
3.8%
72.366248 1
 
3.8%
7796.791 1
 
3.8%
64.633128 1
 
3.8%
8200.56 1
 
3.8%
69.173326 1
 
3.8%
6969.131 1
 
3.8%
76.9012 1
 
3.8%
Other values (5) 5
19.2%
2023-12-13T00:02:00.066602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6 17
10.5%
7 15
9.3%
. 14
 
8.6%
12
 
7.4%
12
 
7.4%
12
 
7.4%
12
 
7.4%
3 12
 
7.4%
1 11
 
6.8%
9 10
 
6.2%
Other values (5) 35
21.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 100
61.7%
Other Letter 48
29.6%
Other Punctuation 14
 
8.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
6 17
17.0%
7 15
15.0%
3 12
12.0%
1 11
11.0%
9 10
10.0%
4 10
10.0%
2 8
8.0%
8 6
 
6.0%
5 6
 
6.0%
0 5
 
5.0%
Other Letter
ValueCountFrequency (%)
12
25.0%
12
25.0%
12
25.0%
12
25.0%
Other Punctuation
ValueCountFrequency (%)
. 14
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 114
70.4%
Hangul 48
29.6%

Most frequent character per script

Common
ValueCountFrequency (%)
6 17
14.9%
7 15
13.2%
. 14
12.3%
3 12
10.5%
1 11
9.6%
9 10
8.8%
4 10
8.8%
2 8
7.0%
8 6
 
5.3%
5 6
 
5.3%
Hangul
ValueCountFrequency (%)
12
25.0%
12
25.0%
12
25.0%
12
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 114
70.4%
Hangul 48
29.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6 17
14.9%
7 15
13.2%
. 14
12.3%
3 12
10.5%
1 11
9.6%
9 10
8.8%
4 10
8.8%
2 8
7.0%
8 6
 
5.3%
5 6
 
5.3%
Hangul
ValueCountFrequency (%)
12
25.0%
12
25.0%
12
25.0%
12
25.0%

충북
Text

Distinct21
Distinct (%)80.8%
Missing0
Missing (%)0.0%
Memory size340.0 B
2023-12-13T00:02:00.268883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length7
Mean length7.2307692
Min length4

Characters and Unicode

Total characters188
Distinct characters15
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique20 ?
Unique (%)76.9%

Sample

1st row해당없음
2nd row1086930.539
3rd row18.741
4th row해당없음
5th row961956.9394
ValueCountFrequency (%)
해당없음 6
23.1%
685.763 1
 
3.8%
1052046.007 1
 
3.8%
610.76 1
 
3.8%
1403.211518 1
 
3.8%
15.608 1
 
3.8%
980367.7517 1
 
3.8%
682.044 1
 
3.8%
10.249 1
 
3.8%
833852.0403 1
 
3.8%
Other values (11) 11
42.3%
2023-12-13T00:02:00.637210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
. 20
10.6%
1 19
10.1%
8 16
8.5%
6 16
8.5%
9 16
8.5%
3 16
8.5%
0 15
8.0%
4 14
7.4%
5 13
6.9%
7 11
 
5.9%
Other values (5) 32
17.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 144
76.6%
Other Letter 24
 
12.8%
Other Punctuation 20
 
10.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 19
13.2%
8 16
11.1%
6 16
11.1%
9 16
11.1%
3 16
11.1%
0 15
10.4%
4 14
9.7%
5 13
9.0%
7 11
7.6%
2 8
5.6%
Other Letter
ValueCountFrequency (%)
6
25.0%
6
25.0%
6
25.0%
6
25.0%
Other Punctuation
ValueCountFrequency (%)
. 20
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 164
87.2%
Hangul 24
 
12.8%

Most frequent character per script

Common
ValueCountFrequency (%)
. 20
12.2%
1 19
11.6%
8 16
9.8%
6 16
9.8%
9 16
9.8%
3 16
9.8%
0 15
9.1%
4 14
8.5%
5 13
7.9%
7 11
6.7%
Hangul
ValueCountFrequency (%)
6
25.0%
6
25.0%
6
25.0%
6
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 164
87.2%
Hangul 24
 
12.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
. 20
12.2%
1 19
11.6%
8 16
9.8%
6 16
9.8%
9 16
9.8%
3 16
9.8%
0 15
9.1%
4 14
8.5%
5 13
7.9%
7 11
6.7%
Hangul
ValueCountFrequency (%)
6
25.0%
6
25.0%
6
25.0%
6
25.0%

총합계
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct26
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean774660.31
Minimum714.385
Maximum3239257.9
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size366.0 B
2023-12-13T00:02:00.788279image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum714.385
5-th percentile1616.2152
Q115919.84
median23688.546
Q31993580.9
95-th percentile2991927
Maximum3239257.9
Range3238543.5
Interquartile range (IQR)1977661.1

Descriptive statistics

Standard deviation1276773.5
Coefficient of variation (CV)1.6481721
Kurtosis-0.72376425
Mean774660.31
Median Absolute Deviation (MAD)12632.236
Skewness1.1347494
Sum20141168
Variance1.6301507 × 1012
MonotonicityNot monotonic
2023-12-13T00:02:00.905592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%)
23253.58225 1
 
3.8%
15047.06528 1
 
3.8%
24757.30128 1
 
3.8%
2675165.304 1
 
3.8%
2027.17 1
 
3.8%
24036.542 1
 
3.8%
40028.75572 1
 
3.8%
2658318.463 1
 
3.8%
1542.206 1
 
3.8%
20890.38381 1
 
3.8%
Other values (16) 16
61.5%
ValueCountFrequency (%)
714.385 1
3.8%
1542.206 1
3.8%
1838.243 1
3.8%
2027.17 1
3.8%
4529.166 1
3.8%
14764.28376 1
3.8%
15047.06528 1
3.8%
18538.16384 1
3.8%
19632.05691 1
3.8%
20890.38381 1
3.8%
ValueCountFrequency (%)
3239257.9 1
3.8%
3024576.756 1
3.8%
2893977.58 1
3.8%
2675165.304 1
3.8%
2667242.467 1
3.8%
2658318.463 1
3.8%
2644764.986 1
3.8%
40028.75572 1
3.8%
32566.46465 1
3.8%
24757.30128 1
3.8%

Interactions

2023-12-13T00:01:54.369734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:01:54.186141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:01:54.455208image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:01:54.271528image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T00:02:01.003840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도업종강원경기경남경북대전세종울산전남전북충남충북총합계
연도1.0000.0000.1210.2640.3100.5360.0000.0000.6580.6500.2640.2640.2860.000
업종0.0001.0000.5900.7410.8771.0000.8520.0000.9161.0000.7410.7411.0000.636
강원0.1210.5901.0001.0001.0001.0000.0001.0000.9061.0001.0001.0001.0001.000
경기0.2640.7411.0001.0000.0001.0000.0001.0000.9811.0000.0000.0001.0001.000
경남0.3100.8771.0000.0001.0000.8740.9431.0000.7640.7611.0001.0000.0001.000
경북0.5361.0001.0001.0000.8741.0001.0001.0001.0001.0001.0001.0000.0001.000
대전0.0000.8520.0000.0000.9431.0001.0000.0001.0001.0000.9590.9590.0000.000
세종0.0000.0001.0001.0001.0001.0000.0001.0001.0001.0001.0001.0001.0000.421
울산0.6580.9160.9060.9810.7641.0001.0001.0001.0001.0000.9810.9810.0000.914
전남0.6501.0001.0001.0000.7611.0001.0001.0001.0001.0001.0001.0000.5001.000
전북0.2640.7411.0000.0001.0001.0000.9591.0000.9811.0001.0001.0000.0001.000
충남0.2640.7411.0000.0001.0001.0000.9591.0000.9811.0001.0001.0000.0001.000
충북0.2861.0001.0001.0000.0000.0000.0001.0000.0000.5000.0000.0001.0001.000
총합계0.0000.6361.0001.0001.0001.0000.0000.4210.9141.0001.0001.0001.0001.000
2023-12-13T00:02:01.161764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
세종대전강원업종
세종1.0000.0000.8660.000
대전0.0001.0000.0000.590
강원0.8660.0001.0000.246
업종0.0000.5900.2461.000
2023-12-13T00:02:01.280835image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도총합계업종강원대전세종
연도1.000-0.0600.0000.0000.0000.000
총합계-0.0601.0000.6430.8850.0000.645
업종0.0000.6431.0000.2460.5900.000
강원0.0000.8850.2461.0000.0000.866
대전0.0000.0000.5900.0001.0000.000
세종0.0000.6450.0000.8660.0001.000

Missing values

2023-12-13T00:01:54.593643image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T00:01:54.799864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연도업종강원경기경남경북대전세종울산전남전북충남충북총합계
02016금속해당없음해당없음1607.59594.016.897해당없음4075.285425233.987832658.93276.97해당없음23253.58225
12016요업2052010.99413141.76911475.01852330.295해당없음해당없음5984559.5433054.94713770.794361086930.5393239257.9
22016화공해당없음21.903해당없음6817.16483630.1해당없음7127.5034522.752해당없음해당없음18.74118538.16384
32017금속해당없음해당없음1,64610,0595해당없음3391.1035365137.7283028.5172.366248해당없음23340.55078
42017요업1,839,15014,57311,53551,054해당없음해당없음4581650.2662680.5457796.791961956.93942893977.58
52017화공해당없음27.887804해당없음5746.1011130.1해당없음9526.5964288.049해당없음해당없음13.32319632.05691
62018금속해당없음해당없음1852.28907.181.625해당없음5399.850825778.553272502.04164.633128해당없음24506.08322
72018산업기타해당없음해당없음해당없음해당없음해당없음해당없음해당없음3802해당없음해당없음727.1664529.166
82018요업1680424.713399.59812762.93750026.981해당없음해당없음4525731.32725.8218200.56894445.56982667242.467
92018화공해당없음29.0962해당없음8030.42667430.1해당없음8182.6114875.205해당없음해당없음60.99321208.43187
연도업종강원경기경남경북대전세종울산전남전북충남충북총합계
162020요업1,732,3539,70311,43645,202해당없음해당없음4372833.491623.7625388.35833852.04032644764.986
172020화공해당없음15.982해당없음5183.70764630.1해당없음18553.9658772.461해당없음해당없음10.24932566.46465
182021금속해당없음해당없음906.2674801.2834.5해당없음10483.90535640.585672386.627667.215444해당없음20890.38381
192021산업기타해당없음해당없음860.162해당없음해당없음해당없음해당없음해당없음해당없음해당없음682.0441542.206
202021요업1589984.27910452.88316048.30949620.322해당없음해당없음4376782.1271979.7964706.996980367.75172658318.463
212021화공해당없음9.373해당없음8072.85472430.1해당없음25037.8536862.967해당없음해당없음15.60840028.75572
222022금속해당없음해당없음815.2827.47919.212해당없음11361.842966165.8330483369.9273.843471403.21151824036.542
232022산업기타해당없음해당없음1416.41해당없음해당없음해당없음해당없음해당없음해당없음해당없음610.762027.17
242022요업1538788.7849954.39114942.46350596.07745해당없음해당없음해당없음793.8222892.6355151.1241052046.0072675165.304
252022화공해당없음9.415해당없음8201.46027630.1해당없음6286.72510190.763해당없음해당없음38.83824757.30128