Overview

Dataset statistics

Number of variables12
Number of observations1810
Missing cells2189
Missing cells (%)10.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory173.4 KiB
Average record size in memory98.1 B

Variable types

Numeric2
Categorical5
Text3
DateTime2

Dataset

Description광주광역시 태양광발전소 현황 데이터로 발전소명, 발전소위치, 허가일자, 사업개시일자, 변경일자, 용량 등의 정보를 제공합니다.
URLhttps://www.data.go.kr/data/15065713/fileData.do

Alerts

시도 has constant value ""Constant
변경사유 is highly overall correlated with 시군구 and 1 other fieldsHigh correlation
시군구 is highly overall correlated with 연번 and 2 other fieldsHigh correlation
연번 is highly overall correlated with 시군구 and 1 other fieldsHigh correlation
원동력의 종류 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
사업개시여부 is highly overall correlated with 변경사유High correlation
변경사유 is highly imbalanced (94.1%)Imbalance
사업개시여부 is highly imbalanced (58.0%)Imbalance
용량(kW) has 23 (1.3%) missing valuesMissing
변경일 has 1777 (98.2%) missing valuesMissing
사업개시일 has 389 (21.5%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 01:36:07.193684
Analysis finished2023-12-12 01:36:09.494684
Duration2.3 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct1810
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean905.5
Minimum1
Maximum1810
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size16.0 KiB
2023-12-12T10:36:09.603296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile91.45
Q1453.25
median905.5
Q31357.75
95-th percentile1719.55
Maximum1810
Range1809
Interquartile range (IQR)904.5

Descriptive statistics

Standard deviation522.64631
Coefficient of variation (CV)0.57719085
Kurtosis-1.2
Mean905.5
Median Absolute Deviation (MAD)452.5
Skewness0
Sum1638955
Variance273159.17
MonotonicityStrictly increasing
2023-12-12T10:36:09.796029image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
1190 1
 
0.1%
1216 1
 
0.1%
1215 1
 
0.1%
1214 1
 
0.1%
1213 1
 
0.1%
1212 1
 
0.1%
1211 1
 
0.1%
1210 1
 
0.1%
1209 1
 
0.1%
Other values (1800) 1800
99.4%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1810 1
0.1%
1809 1
0.1%
1808 1
0.1%
1807 1
0.1%
1806 1
0.1%
1805 1
0.1%
1804 1
0.1%
1803 1
0.1%
1802 1
0.1%
1801 1
0.1%

시도
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size14.3 KiB
광주광역시
1810 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row광주광역시
2nd row광주광역시
3rd row광주광역시
4th row광주광역시
5th row광주광역시

Common Values

ValueCountFrequency (%)
광주광역시 1810
100.0%

Length

2023-12-12T10:36:09.974887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:36:10.108796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
광주광역시 1810
100.0%

시군구
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size14.3 KiB
광산구
1126 
북구
437 
서구
166 
남구
 
55
동구
 
26

Length

Max length3
Median length3
Mean length2.6220994
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row광산구
2nd row광산구
3rd row광산구
4th row광산구
5th row광산구

Common Values

ValueCountFrequency (%)
광산구 1126
62.2%
북구 437
 
24.1%
서구 166
 
9.2%
남구 55
 
3.0%
동구 26
 
1.4%

Length

2023-12-12T10:36:10.261673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:36:10.437702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
광산구 1126
62.2%
북구 437
 
24.1%
서구 166
 
9.2%
남구 55
 
3.0%
동구 26
 
1.4%

원동력의 종류
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size14.3 KiB
태양광
1373 
태양광발전
435 
연료전지
 
2

Length

Max length5
Median length3
Mean length3.481768
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row태양광
2nd row태양광
3rd row태양광
4th row태양광
5th row태양광

Common Values

ValueCountFrequency (%)
태양광 1373
75.9%
태양광발전 435
 
24.0%
연료전지 2
 
0.1%

Length

2023-12-12T10:36:10.607627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:36:10.766149image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
태양광 1373
75.9%
태양광발전 435
 
24.0%
연료전지 2
 
0.1%
Distinct1753
Distinct (%)96.9%
Missing0
Missing (%)0.0%
Memory size14.3 KiB
2023-12-12T10:36:11.143494image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length24
Mean length10.133149
Min length2

Characters and Unicode

Total characters18341
Distinct characters485
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1699 ?
Unique (%)93.9%

Sample

1st row빛모리태양광발전소
2nd row장수태양광발전소
3rd row금빛산이엔지태양광발전소
4th row㈜하이코리아
5th row동곡동주민복지회태양광발전소
ValueCountFrequency (%)
태양광발전소 536
 
21.7%
발전소 13
 
0.5%
주식회사 8
 
0.3%
태양광 7
 
0.3%
햇빛발전소 7
 
0.3%
진곡산단 7
 
0.3%
빛고을햇빛 6
 
0.2%
주차장 6
 
0.2%
전자공고 6
 
0.2%
태양광발전 5
 
0.2%
Other values (1769) 1873
75.7%
2023-12-12T10:36:11.694393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1805
 
9.8%
1736
 
9.5%
1717
 
9.4%
1711
 
9.3%
1706
 
9.3%
1702
 
9.3%
668
 
3.6%
348
 
1.9%
240
 
1.3%
213
 
1.2%
Other values (475) 6495
35.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 16713
91.1%
Space Separator 668
 
3.6%
Decimal Number 483
 
2.6%
Uppercase Letter 177
 
1.0%
Other Symbol 142
 
0.8%
Close Punctuation 46
 
0.3%
Open Punctuation 46
 
0.3%
Lowercase Letter 32
 
0.2%
Dash Punctuation 26
 
0.1%
Other Punctuation 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1805
 
10.8%
1736
 
10.4%
1717
 
10.3%
1711
 
10.2%
1706
 
10.2%
1702
 
10.2%
348
 
2.1%
240
 
1.4%
213
 
1.3%
177
 
1.1%
Other values (426) 5358
32.1%
Uppercase Letter
ValueCountFrequency (%)
S 22
12.4%
E 16
9.0%
N 16
9.0%
J 15
8.5%
C 15
8.5%
G 14
7.9%
M 13
 
7.3%
H 12
 
6.8%
K 10
 
5.6%
T 10
 
5.6%
Other values (11) 34
19.2%
Decimal Number
ValueCountFrequency (%)
2 176
36.4%
1 161
33.3%
3 61
 
12.6%
4 35
 
7.2%
5 23
 
4.8%
6 8
 
1.7%
0 8
 
1.7%
7 6
 
1.2%
9 4
 
0.8%
8 1
 
0.2%
Lowercase Letter
ValueCountFrequency (%)
o 7
21.9%
c 4
12.5%
e 4
12.5%
p 4
12.5%
k 4
12.5%
r 3
9.4%
a 3
9.4%
l 3
9.4%
Other Punctuation
ValueCountFrequency (%)
. 3
50.0%
& 2
33.3%
, 1
 
16.7%
Math Symbol
ValueCountFrequency (%)
= 1
50.0%
> 1
50.0%
Space Separator
ValueCountFrequency (%)
668
100.0%
Other Symbol
ValueCountFrequency (%)
142
100.0%
Close Punctuation
ValueCountFrequency (%)
) 46
100.0%
Open Punctuation
ValueCountFrequency (%)
( 46
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 26
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 16855
91.9%
Common 1277
 
7.0%
Latin 209
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1805
 
10.7%
1736
 
10.3%
1717
 
10.2%
1711
 
10.2%
1706
 
10.1%
1702
 
10.1%
348
 
2.1%
240
 
1.4%
213
 
1.3%
177
 
1.1%
Other values (427) 5500
32.6%
Latin
ValueCountFrequency (%)
S 22
 
10.5%
E 16
 
7.7%
N 16
 
7.7%
J 15
 
7.2%
C 15
 
7.2%
G 14
 
6.7%
M 13
 
6.2%
H 12
 
5.7%
K 10
 
4.8%
T 10
 
4.8%
Other values (19) 66
31.6%
Common
ValueCountFrequency (%)
668
52.3%
2 176
 
13.8%
1 161
 
12.6%
3 61
 
4.8%
) 46
 
3.6%
( 46
 
3.6%
4 35
 
2.7%
- 26
 
2.0%
5 23
 
1.8%
6 8
 
0.6%
Other values (9) 27
 
2.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 16713
91.1%
ASCII 1486
 
8.1%
None 142
 
0.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1805
 
10.8%
1736
 
10.4%
1717
 
10.3%
1711
 
10.2%
1706
 
10.2%
1702
 
10.2%
348
 
2.1%
240
 
1.4%
213
 
1.3%
177
 
1.1%
Other values (426) 5358
32.1%
ASCII
ValueCountFrequency (%)
668
45.0%
2 176
 
11.8%
1 161
 
10.8%
3 61
 
4.1%
) 46
 
3.1%
( 46
 
3.1%
4 35
 
2.4%
- 26
 
1.7%
5 23
 
1.5%
S 22
 
1.5%
Other values (38) 222
 
14.9%
None
ValueCountFrequency (%)
142
100.0%
Distinct1668
Distinct (%)92.2%
Missing0
Missing (%)0.0%
Memory size14.3 KiB
2023-12-12T10:36:12.078515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length188
Median length64
Mean length22.412155
Min length9

Characters and Unicode

Total characters40566
Distinct characters238
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1568 ?
Unique (%)86.6%

Sample

1st row광주광역시 광산구 대산동 172
2nd row광주광역시 광산구 수남안길 20
3rd row광주광역시 광산구 사암로106번길 77
4th row광주광역시 광산구 평동산단1번로 121
5th row광주광역시 광산구 동곡로 161-1
ValueCountFrequency (%)
광주광역시 1644
21.4%
광산구 1127
 
14.7%
북구 439
 
5.7%
서구 168
 
2.2%
남구 55
 
0.7%
상부 47
 
0.6%
평동산단7번로 41
 
0.5%
평동산단로 36
 
0.5%
33
 
0.4%
평동로803번길 30
 
0.4%
Other values (2111) 4069
52.9%
2023-12-12T10:36:12.796563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5909
 
14.6%
4420
 
10.9%
1819
 
4.5%
1747
 
4.3%
1727
 
4.3%
1 1691
 
4.2%
1645
 
4.1%
1644
 
4.1%
1429
 
3.5%
1261
 
3.1%
Other values (228) 17274
42.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 24073
59.3%
Decimal Number 8126
 
20.0%
Space Separator 5909
 
14.6%
Dash Punctuation 914
 
2.3%
Open Punctuation 527
 
1.3%
Close Punctuation 526
 
1.3%
Other Punctuation 421
 
1.0%
Uppercase Letter 66
 
0.2%
Math Symbol 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4420
18.4%
1819
 
7.6%
1747
 
7.3%
1727
 
7.2%
1645
 
6.8%
1644
 
6.8%
1429
 
5.9%
1261
 
5.2%
897
 
3.7%
656
 
2.7%
Other values (200) 6828
28.4%
Decimal Number
ValueCountFrequency (%)
1 1691
20.8%
2 1119
13.8%
3 945
11.6%
5 711
8.7%
4 650
 
8.0%
6 642
 
7.9%
7 625
 
7.7%
0 607
 
7.5%
8 606
 
7.5%
9 530
 
6.5%
Uppercase Letter
ValueCountFrequency (%)
A 26
39.4%
B 21
31.8%
C 11
16.7%
D 4
 
6.1%
E 2
 
3.0%
K 1
 
1.5%
H 1
 
1.5%
Other Punctuation
ValueCountFrequency (%)
, 339
80.5%
/ 81
 
19.2%
: 1
 
0.2%
Open Punctuation
ValueCountFrequency (%)
( 525
99.6%
[ 2
 
0.4%
Close Punctuation
ValueCountFrequency (%)
) 524
99.6%
] 2
 
0.4%
Math Symbol
ValueCountFrequency (%)
~ 2
50.0%
+ 2
50.0%
Space Separator
ValueCountFrequency (%)
5909
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 914
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 24073
59.3%
Common 16427
40.5%
Latin 66
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4420
18.4%
1819
 
7.6%
1747
 
7.3%
1727
 
7.2%
1645
 
6.8%
1644
 
6.8%
1429
 
5.9%
1261
 
5.2%
897
 
3.7%
656
 
2.7%
Other values (200) 6828
28.4%
Common
ValueCountFrequency (%)
5909
36.0%
1 1691
 
10.3%
2 1119
 
6.8%
3 945
 
5.8%
- 914
 
5.6%
5 711
 
4.3%
4 650
 
4.0%
6 642
 
3.9%
7 625
 
3.8%
0 607
 
3.7%
Other values (11) 2614
15.9%
Latin
ValueCountFrequency (%)
A 26
39.4%
B 21
31.8%
C 11
16.7%
D 4
 
6.1%
E 2
 
3.0%
K 1
 
1.5%
H 1
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 24073
59.3%
ASCII 16493
40.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5909
35.8%
1 1691
 
10.3%
2 1119
 
6.8%
3 945
 
5.7%
- 914
 
5.5%
5 711
 
4.3%
4 650
 
3.9%
6 642
 
3.9%
7 625
 
3.8%
0 607
 
3.7%
Other values (18) 2680
16.2%
Hangul
ValueCountFrequency (%)
4420
18.4%
1819
 
7.6%
1747
 
7.3%
1727
 
7.2%
1645
 
6.8%
1644
 
6.8%
1429
 
5.9%
1261
 
5.2%
897
 
3.7%
656
 
2.7%
Other values (200) 6828
28.4%

용량(kW)
Real number (ℝ)

MISSING 

Distinct1027
Distinct (%)57.5%
Missing23
Missing (%)1.3%
Infinite0
Infinite (%)0.0%
Mean168.87921
Minimum5.04
Maximum3000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size16.0 KiB
2023-12-12T10:36:13.048386image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum5.04
5-th percentile18
Q140
median98.28
Q3196.02
95-th percentile499.8
Maximum3000
Range2994.96
Interquartile range (IQR)156.02

Descriptive statistics

Standard deviation259.37326
Coefficient of variation (CV)1.5358508
Kurtosis38.790466
Mean168.87921
Median Absolute Deviation (MAD)65.72
Skewness5.0716079
Sum301787.15
Variance67274.489
MonotonicityNot monotonic
2023-12-12T10:36:13.236284image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
99.0 80
 
4.4%
99.84 25
 
1.4%
97.2 25
 
1.4%
97.92 22
 
1.2%
99.18 20
 
1.1%
50.0 17
 
0.9%
99.9 16
 
0.9%
99.2 16
 
0.9%
19.8 12
 
0.7%
19.5 12
 
0.7%
Other values (1017) 1542
85.2%
(Missing) 23
 
1.3%
ValueCountFrequency (%)
5.04 1
 
0.1%
5.67 1
 
0.1%
6.0 1
 
0.1%
8.28 1
 
0.1%
9.0 3
0.2%
9.05 1
 
0.1%
9.5 1
 
0.1%
9.94 1
 
0.1%
10.0 3
0.2%
10.2 1
 
0.1%
ValueCountFrequency (%)
3000.0 1
0.1%
2991.6 1
0.1%
2895.17 1
0.1%
2587.5 1
0.1%
2499.83 1
0.1%
2499.64 1
0.1%
2000.16 1
0.1%
1999.58 1
0.1%
1573.83 1
0.1%
1500.0 1
0.1%
Distinct794
Distinct (%)43.9%
Missing0
Missing (%)0.0%
Memory size14.3 KiB
Minimum2006-06-01 00:00:00
Maximum2023-02-27 00:00:00
2023-12-12T10:36:13.445513image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:36:13.656934image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

변경일
Date

MISSING 

Distinct29
Distinct (%)87.9%
Missing1777
Missing (%)98.2%
Memory size14.3 KiB
Minimum2014-07-16 00:00:00
Maximum2022-08-01 00:00:00
2023-12-12T10:36:13.830897image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:36:14.024438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=29)

변경사유
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct8
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size14.3 KiB
<NA>
1777 
양도양수
 
11
재교부
 
7
허가용량 변경
 
6
상호명 변경
 
3
Other values (3)
 
6

Length

Max length10
Median length4
Mean length4.0209945
Min length3

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 1777
98.2%
양도양수 11
 
0.6%
재교부 7
 
0.4%
허가용량 변경 6
 
0.3%
상호명 변경 3
 
0.2%
준비기간 연장 3
 
0.2%
사업기간 연장 2
 
0.1%
양도양수(상호변경) 1
 
0.1%

Length

2023-12-12T10:36:14.222545image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:36:14.421536image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 1777
97.4%
양도양수 11
 
0.6%
변경 9
 
0.5%
재교부 7
 
0.4%
허가용량 6
 
0.3%
연장 5
 
0.3%
상호명 3
 
0.2%
준비기간 3
 
0.2%
사업기간 2
 
0.1%
양도양수(상호변경 1
 
0.1%

사업개시일
Text

MISSING 

Distinct914
Distinct (%)64.3%
Missing389
Missing (%)21.5%
Memory size14.3 KiB
2023-12-12T10:36:14.847111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length10
Mean length10.000704
Min length10

Characters and Unicode

Total characters14211
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique599 ?
Unique (%)42.2%

Sample

1st row2008-05-20
2nd row2009-05-28
3rd row2012-01-06
4th row2012-02-15
5th row2010-09-14
ValueCountFrequency (%)
2020-01-15 10
 
0.7%
2021-12-30 8
 
0.6%
2021-10-27 7
 
0.5%
2022-09-30 7
 
0.5%
2021-01-17 7
 
0.5%
2021-01-26 6
 
0.4%
2022-11-01 6
 
0.4%
2014-10-30 6
 
0.4%
2022-01-06 5
 
0.4%
2018-12-04 5
 
0.4%
Other values (903) 1354
95.3%
2023-12-12T10:36:15.482754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 3241
22.8%
2 3034
21.3%
- 2842
20.0%
1 2414
17.0%
9 489
 
3.4%
4 431
 
3.0%
8 415
 
2.9%
3 369
 
2.6%
7 361
 
2.5%
5 341
 
2.4%
Other values (2) 274
 
1.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 11368
80.0%
Dash Punctuation 2842
 
20.0%
Space Separator 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 3241
28.5%
2 3034
26.7%
1 2414
21.2%
9 489
 
4.3%
4 431
 
3.8%
8 415
 
3.7%
3 369
 
3.2%
7 361
 
3.2%
5 341
 
3.0%
6 273
 
2.4%
Dash Punctuation
ValueCountFrequency (%)
- 2842
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 14211
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 3241
22.8%
2 3034
21.3%
- 2842
20.0%
1 2414
17.0%
9 489
 
3.4%
4 431
 
3.0%
8 415
 
2.9%
3 369
 
2.6%
7 361
 
2.5%
5 341
 
2.4%
Other values (2) 274
 
1.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 14211
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 3241
22.8%
2 3034
21.3%
- 2842
20.0%
1 2414
17.0%
9 489
 
3.4%
4 431
 
3.0%
8 415
 
2.9%
3 369
 
2.6%
7 361
 
2.5%
5 341
 
2.4%
Other values (2) 274
 
1.9%

사업개시여부
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct5
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size14.3 KiB
Y
1417 
준비
260 
N
 
128
×
 
3
<NA>
 
2

Length

Max length4
Median length1
Mean length1.1469613
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowY
2nd rowY
3rd rowY
4th rowY
5th rowY

Common Values

ValueCountFrequency (%)
Y 1417
78.3%
준비 260
 
14.4%
N 128
 
7.1%
× 3
 
0.2%
<NA> 2
 
0.1%

Length

2023-12-12T10:36:15.675871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:36:15.832139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
y 1417
78.3%
준비 260
 
14.4%
n 128
 
7.1%
× 3
 
0.2%
na 2
 
0.1%

Interactions

2023-12-12T10:36:08.587120image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:36:08.239319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:36:08.769121image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:36:08.415624image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T10:36:15.949616image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번시군구원동력의 종류용량(kW)변경일변경사유사업개시여부
연번1.0000.9430.7760.1591.0000.4930.631
시군구0.9431.0000.7170.142NaNNaN0.385
원동력의 종류0.7760.7171.0000.2411.0000.3730.337
용량(kW)0.1590.1420.2411.0001.0000.4890.074
변경일1.000NaN1.0001.0001.0001.0001.000
변경사유0.493NaN0.3730.4891.0001.0000.560
사업개시여부0.6310.3850.3370.0741.0000.5601.000
2023-12-12T10:36:16.394110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
변경사유원동력의 종류사업개시여부시군구
변경사유1.0000.3590.5481.000
원동력의 종류0.3591.0000.3260.706
사업개시여부0.5480.3261.0000.323
시군구1.0000.7060.3231.000
2023-12-12T10:36:16.518348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번용량(kW)시군구원동력의 종류변경사유사업개시여부
연번1.000-0.2610.6760.6550.3470.432
용량(kW)-0.2611.0000.0820.1090.3410.047
시군구0.6760.0821.0000.7061.0000.323
원동력의 종류0.6550.1090.7061.0000.3590.326
변경사유0.3470.3411.0000.3591.0000.548
사업개시여부0.4320.0470.3230.3260.5481.000

Missing values

2023-12-12T10:36:08.966694image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T10:36:09.226059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T10:36:09.410096image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연번시도시군구원동력의 종류사업의내용(발전소명)사업장소(발전소 위치)용량(kW)허가일변경일변경사유사업개시일사업개시여부
01광주광역시광산구태양광빛모리태양광발전소광주광역시 광산구 대산동 17228.02007-07-01<NA><NA>2008-05-20Y
12광주광역시광산구태양광장수태양광발전소광주광역시 광산구 수남안길 205.672008-12-01<NA><NA>2009-05-28Y
23광주광역시광산구태양광금빛산이엔지태양광발전소광주광역시 광산구 사암로106번길 7729.9252010-01-01<NA><NA>2012-01-06Y
34광주광역시광산구태양광㈜하이코리아광주광역시 광산구 평동산단1번로 12129.9252010-10-01<NA><NA>2012-02-15Y
45광주광역시광산구태양광동곡동주민복지회태양광발전소광주광역시 광산구 동곡로 161-147.62010-04-01<NA><NA>2010-09-14Y
56광주광역시광산구태양광이글스에너지㈜광주광역시 광산구 송치동 180-23외34필지1999.582011-04-22<NA><NA>2012-05-25Y
67광주광역시광산구태양광오선동태양광발전소광주광역시 광산구 하남산단6번로 8059.82011-09-14<NA><NA>2012-08-28Y
78광주광역시광산구태양광디케이솔라파워태양광발전소광주광역시 광산구 평동산단로 3601000.02011-12-07<NA><NA>2014-09-25Y
89광주광역시광산구태양광㈜지용금속태양광발전소2호광주광역시 광산구 하남산단천변우로126-899.752012-01-10<NA><NA>2012-04-04Y
910광주광역시광산구태양광㈜대성포장태양광발전소광주광역시 광산구 평동산단로 140200.02012-03-13<NA><NA>2013-08-05Y
연번시도시군구원동력의 종류사업의내용(발전소명)사업장소(발전소 위치)용량(kW)허가일변경일변경사유사업개시일사업개시여부
18001801광주광역시북구태양광발전천우2호 태양광발전소광주광역시 북구 서림로51번길 519.042022-10-17<NA><NA>2022-12-29Y
18011802광주광역시북구태양광발전유한회사 지아이전기 태양광발전소광주광역시 북구 연제동 1055-3, 1055-7260.192022-10-20<NA><NA><NA>N
18021803광주광역시북구태양광발전유한회사 지아이에너지 태양광발전소광주광역시 북구 연제동 1055-1428.752022-10-20<NA><NA><NA>N
18031804광주광역시북구태양광발전동우 태양광발전소광주광역시 북구 첨단벤처소로38번길 5-464.262022-11-15<NA><NA><NA>N
18041805광주광역시북구태양광발전기현 태양광발전소광주광역시 북구 서방로181번길 7013.572022-11-21<NA><NA><NA>N
18051806광주광역시북구태양광발전임금 태양광발전소광주광역시 북구 민주로 154180.882022-11-23<NA><NA><NA>N
18061807광주광역시북구태양광발전더함상사 태양광발전소광주광역시 북구 용전마을길 23-1160.182022-11-28<NA><NA><NA>N
18071808광주광역시북구태양광발전엠에스케이 태양광발전소광주광역시 북구 첨단연신로288번길 63298.542023-02-02<NA><NA><NA>N
18081809광주광역시북구태양광발전삼신기업 태양광발전소광주광역시 북구 첨단연신로 330320.112023-02-02<NA><NA><NA>N
18091810광주광역시북구태양광발전에비뉴 태양광발전소광주광역시 북구 월출동 970-9399.122023-02-02<NA><NA><NA>N