Overview

Dataset statistics

Number of variables: 12
Number of observations: 10000
Missing cells: 6784
Missing cells (%): 5.7%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.0 MiB
Average record size in memory: 105.0 B
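
The missing-cell totals can be cross-checked against the per-variable missing counts that appear further down in this report (522, 19, and 6243). The following is a minimal sketch of that arithmetic, not the profiler's own code. It assumes every other variable has no missing cells, as their sections state, and it assumes the date variable with 19 missing values is 준비기간, inferred from the column order of the sample table.

```python
# Per-variable missing counts as reported in the variable sections of this report.
missing_by_variable = {
    "허가면적(제곱미터)": 522,
    "준비기간": 19,  # assumption: the unlabeled date variable with 19 missing values
    "사업개시신고일": 6243,
}

n_variables = 12
n_observations = 10_000

missing_cells = sum(missing_by_variable.values())
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(missing_cells)          # 6784
print(f"{missing_pct:.1f}%")  # 5.7%
```

Both results agree with the "Missing cells" and "Missing cells (%)" figures in the dataset statistics.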

Variable types

DateTime: 3
Categorical: 5
Text: 4

Dataset

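Description: Data on the status of solar power generation businesses licensed in 전북특별자치도 (Jeonbuk State). It provides the power plant name, business location, installation type, permitted area, installed capacity, and related fields.
Author: 전북특별자치도
URL: https://www.data.go.kr/data/15043292/fileData.do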

Alerts

에너지원 has constant value "태양광" [Constant]
설치구분 is highly overall correlated with 비고 [High correlation]
비고 is highly overall correlated with 설치구분 [High correlation]
설치구분 is highly imbalanced (72.7%) [Imbalance]
비고 is highly imbalanced (57.2%) [Imbalance]
허가면적(제곱미터) has 522 (5.2%) missing values [Missing]
사업개시신고일 has 6243 (62.4%) missing values [Missing]
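
The two imbalance percentages are consistent with a score of one minus the normalized Shannon entropy of the category counts. The sketch below is an assumption about how such a figure can be derived, not a statement of the profiler's implementation. `imbalance_score` is an illustrative name, and the counts are copied from the Common Values tables of 설치구분 and 비고 in this report.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (in bits) of the
    category counts and k is the number of categories.
    0 means perfectly balanced; values near 1 mean one category dominates."""
    k = len(counts)
    if k < 2:
        return 0.0  # convention assumed here for a single category
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)

# Counts taken from the report: 12 categories for 설치구분, 3 for 비고.
installation_counts = [6442, 3514, 33, 2, 2, 1, 1, 1, 1, 1, 1, 1]
remarks_counts = [8215, 1784, 1]

print(f"{imbalance_score(installation_counts):.1%}")  # 72.7%
print(f"{imbalance_score(remarks_counts):.1%}")       # 57.2%
```

Under this definition a variable split evenly across its categories scores 0, so a score above 50% indicates that most rows fall into very few categories.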

Reproduction

Analysis started: 2024-03-14 15:33:42.700853
Analysis finished: 2024-03-14 15:33:45.806669
Duration: 3.11 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

허가일 (permit date)
Date

Distinct: 998
Distinct (%): 10.0%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2005-06-13 00:00:00
Maximum: 2019-11-01 00:00:00
Histogram with fixed size bins (bins=50)

에너지원 (energy source)
Categorical

CONSTANT

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
태양광 | 10000

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 태양광
2nd row: 태양광
3rd row: 태양광
4th row: 태양광
5th row: 태양광

Common Values

Value | Count | Frequency (%)
태양광 | 10000 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

[Plot: common values; 태양광 accounts for 10000 rows (100.0%)]

발전소명칭 (power plant name)
Text

Distinct: 8808
Distinct (%): 88.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 33
Median length: 27
Mean length: 9.2834
Min length: 2

Characters and Unicode

Total characters: 92834
Distinct characters: 665
Distinct categories: 11
Distinct scripts: 4
Distinct blocks: 5
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
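
As a small illustration of those character properties, the sketch below counts the Unicode general categories in one plant name from this dataset using Python's standard `unicodedata` module. The helper name `category_counts` is illustrative. The two-letter codes map to the category names used in this report: `Lo` is Other Letter, `Nd` is Decimal Number, and `Zs` is Space Separator. The standard library does not expose the script or block properties, so those are not shown.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count the characters of text by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)

sample = "마이2호 태양광발전소"  # a plant name that appears in this dataset
counts = category_counts(sample)
print(counts["Lo"], counts["Nd"], counts["Zs"])  # 9 1 1
```

Summing such counts over every value of a text variable yields a per-category table like the ones in the character sections of this report.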

Unique

Unique: 8101
Unique (%): 81.0%

Sample

1st row: 선운에너지농장태양광발전소
2nd row: 운봉태양광발전소
3rd row: 세광태양광발전소1호
4th row: 마이2호 태양광발전소
5th row: 연정에너지

Words

Value | Count | Frequency (%)
태양광발전소 | 196 | 1.9%
상월에너지스테이션 | 31 | 0.3%
유한회사 | 23 | 0.2%
대성태양광발전소 | 16 | 0.2%
태양광 | 13 | 0.1%
1호 | 13 | 0.1%
에너지 | 13 | 0.1%
한빛태양광발전소 | 12 | 0.1%
2호 | 10 | 0.1%
하늘태양광발전소 | 10 | 0.1%
Other values (8831) | 10070 | 96.8%

Most occurring characters

Value | Count | Frequency (%)
(not shown) | 9347 | 10.1%
(not shown) | 9314 | 10.0%
(not shown) | 9229 | 9.9%
(not shown) | 9056 | 9.8%
(not shown) | 8975 | 9.7%
(not shown) | 8944 | 9.6%
(not shown) | 3505 | 3.8%
1 | 1525 | 1.6%
2 | 1322 | 1.4%
(not shown) | 1005 | 1.1%
Other values (655) | 30612 | 33.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 86413 | 93.1%
Decimal Number | 4245 | 4.6%
Uppercase Letter | 757 | 0.8%
Space Separator | 450 | 0.5%
Close Punctuation | 296 | 0.3%
Open Punctuation | 275 | 0.3%
Other Symbol | 173 | 0.2%
Dash Punctuation | 120 | 0.1%
Lowercase Letter | 71 | 0.1%
Other Punctuation | 31 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not shown) | 9347 | 10.8%
(not shown) | 9314 | 10.8%
(not shown) | 9229 | 10.7%
(not shown) | 9056 | 10.5%
(not shown) | 8975 | 10.4%
(not shown) | 8944 | 10.4%
(not shown) | 3505 | 4.1%
(not shown) | 1005 | 1.2%
(not shown) | 934 | 1.1%
(not shown) | 747 | 0.9%
Other values (598) | 25357 | 29.3%

Uppercase Letter
Value | Count | Frequency (%)
S | 129 | 17.0%
J | 81 | 10.7%
K | 60 | 7.9%
Y | 53 | 7.0%
B | 53 | 7.0%
H | 44 | 5.8%
C | 43 | 5.7%
E | 41 | 5.4%
M | 39 | 5.2%
N | 35 | 4.6%
Other values (12) | 179 | 23.6%

Lowercase Letter
Value | Count | Frequency (%)
r | 13 | 18.3%
l | 11 | 15.5%
a | 11 | 15.5%
o | 11 | 15.5%
e | 5 | 7.0%
s | 5 | 7.0%
n | 4 | 5.6%
w | 2 | 2.8%
t | 2 | 2.8%
u | 2 | 2.8%
Other values (5) | 5 | 7.0%

Decimal Number
Value | Count | Frequency (%)
1 | 1525 | 35.9%
2 | 1322 | 31.1%
3 | 572 | 13.5%
4 | 249 | 5.9%
5 | 188 | 4.4%
6 | 121 | 2.9%
0 | 90 | 2.1%
7 | 86 | 2.0%
8 | 53 | 1.2%
9 | 39 | 0.9%

Other Punctuation
Value | Count | Frequency (%)
& | 20 | 64.5%
. | 9 | 29.0%
· | 1 | 3.2%
: | 1 | 3.2%

Space Separator
Value | Count | Frequency (%)
(space) | 450 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 296 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 275 | 100.0%

Other Symbol
Value | Count | Frequency (%)
(not shown) | 173 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 120 | 100.0%

Letter Number
Value | Count | Frequency (%)
(not shown) | 3 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 86585 | 93.3%
Common | 5417 | 5.8%
Latin | 831 | 0.9%
Han | 1 | < 0.1%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(not shown) | 9347 | 10.8%
(not shown) | 9314 | 10.8%
(not shown) | 9229 | 10.7%
(not shown) | 9056 | 10.5%
(not shown) | 8975 | 10.4%
(not shown) | 8944 | 10.3%
(not shown) | 3505 | 4.0%
(not shown) | 1005 | 1.2%
(not shown) | 934 | 1.1%
(not shown) | 747 | 0.9%
Other values (598) | 25529 | 29.5%

Latin
Value | Count | Frequency (%)
S | 129 | 15.5%
J | 81 | 9.7%
K | 60 | 7.2%
Y | 53 | 6.4%
B | 53 | 6.4%
H | 44 | 5.3%
C | 43 | 5.2%
E | 41 | 4.9%
M | 39 | 4.7%
N | 35 | 4.2%
Other values (28) | 253 | 30.4%

Common
Value | Count | Frequency (%)
1 | 1525 | 28.2%
2 | 1322 | 24.4%
3 | 572 | 10.6%
(space) | 450 | 8.3%
) | 296 | 5.5%
( | 275 | 5.1%
4 | 249 | 4.6%
5 | 188 | 3.5%
6 | 121 | 2.2%
- | 120 | 2.2%
Other values (8) | 299 | 5.5%

Han
Value | Count | Frequency (%)
(not shown) | 1 | 100.0%

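Most occurring blocks

Value | Count | Frequency (%)
Hangul | 86412 | 93.1%
ASCII | 6244 | 6.7%
None | 174 | 0.2%
Number Forms | 3 | < 0.1%
CJK | 1 | < 0.1%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
(not shown) | 9347 | 10.8%
(not shown) | 9314 | 10.8%
(not shown) | 9229 | 10.7%
(not shown) | 9056 | 10.5%
(not shown) | 8975 | 10.4%
(not shown) | 8944 | 10.4%
(not shown) | 3505 | 4.1%
(not shown) | 1005 | 1.2%
(not shown) | 934 | 1.1%
(not shown) | 747 | 0.9%
Other values (597) | 25356 | 29.3%

ASCII
Value | Count | Frequency (%)
1 | 1525 | 24.4%
2 | 1322 | 21.2%
3 | 572 | 9.2%
(space) | 450 | 7.2%
) | 296 | 4.7%
( | 275 | 4.4%
4 | 249 | 4.0%
5 | 188 | 3.0%
S | 129 | 2.1%
6 | 121 | 1.9%
Other values (44) | 1117 | 17.9%

None
Value | Count | Frequency (%)
(not shown) | 173 | 99.4%
· | 1 | 0.6%

Number Forms
Value | Count | Frequency (%)
(not shown) | 3 | 100.0%

CJK
Value | Count | Frequency (%)
(not shown) | 1 | 100.0%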

시군 (city/county)
Categorical

Distinct: 14
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
정읍시 | 1770
남원시 | 1234
김제시 | 1188
익산시 | 1089
고창군 | 822
Other values (9) | 3897

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 고창군
2nd row: 장수군
3rd row: 김제시
4th row: 김제시
5th row: 익산시

Common Values

Value | Count | Frequency (%)
정읍시 | 1770 | 17.7%
남원시 | 1234 | 12.3%
김제시 | 1188 | 11.9%
익산시 | 1089 | 10.9%
고창군 | 822 | 8.2%
임실군 | 656 | 6.6%
부안군 | 646 | 6.5%
완주군 | 568 | 5.7%
군산시 | 505 | 5.1%
장수군 | 375 | 3.8%
Other values (4) | 1147 | 11.5%

Length

Histogram of lengths of the category

사업장소 (business location)
Text

Distinct: 9319
Distinct (%): 93.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 357
Median length: 174
Mean length: 23.1972
Min length: 6

Characters and Unicode

Total characters: 231972
Distinct characters: 384
Distinct categories: 14
Distinct scripts: 3
Distinct blocks: 8
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 8929
Unique (%): 89.3%

Sample

1st row: 아산면 계산리 444-2
2nd row: 번암면 유정리 254-1, 산221-2
3rd row: 요촌동 354-8 건물상부
4th row: 황산면 봉월리 51-1, 건축물위
5th row: 왕궁면 연정길 13-20 건물상부

Words

Value | Count | Frequency (%)
건물상부 | 1373 | 3.2%
건물상부(주1 | 649 | 1.5%
주2 | 329 | 0.8%
건물상부(주1,주2 | 314 | 0.7%
북면 | 273 | 0.6%
덕진구 | 272 | 0.6%
대산면 | 226 | 0.5%
용지면 | 167 | 0.4%
황등면 | 165 | 0.4%
보안면 | 165 | 0.4%
Other values (13691) | 38709 | 90.8%

Most occurring characters

Value | Count | Frequency (%)
(space) | 32778 | 14.1%
1 | 17652 | 7.6%
- | 14540 | 6.3%
, | 12272 | 5.3%
2 | 11612 | 5.0%
3 | 8934 | 3.9%
(not shown) | 8648 | 3.7%
4 | 8087 | 3.5%
(not shown) | 7815 | 3.4%
5 | 7365 | 3.2%
Other values (374) | 102269 | 44.1%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 83029 | 35.8%
Other Letter | 82641 | 35.6%
Space Separator | 32778 | 14.1%
Dash Punctuation | 14540 | 6.3%
Other Punctuation | 12429 | 5.4%
Close Punctuation | 3016 | 1.3%
Open Punctuation | 3012 | 1.3%
Lowercase Letter | 225 | 0.1%
Uppercase Letter | 211 | 0.1%
Math Symbol | 69 | < 0.1%
Other values (4) | 22 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not shown) | 8648 | 10.5%
(not shown) | 7815 | 9.5%
(not shown) | 4656 | 5.6%
(not shown) | 4182 | 5.1%
(not shown) | 4125 | 5.0%
(not shown) | 4038 | 4.9%
(not shown) | 3582 | 4.3%
(not shown) | 3520 | 4.3%
(not shown) | 2642 | 3.2%
(not shown) | 1195 | 1.4%
Other values (326) | 38238 | 46.3%

Decimal Number
Value | Count | Frequency (%)
1 | 17652 | 21.3%
2 | 11612 | 14.0%
3 | 8934 | 10.8%
4 | 8087 | 9.7%
5 | 7365 | 8.9%
6 | 7132 | 8.6%
7 | 6311 | 7.6%
8 | 5601 | 6.7%
0 | 5214 | 6.3%
9 | 5121 | 6.2%

Uppercase Letter
Value | Count | Frequency (%)
W | 171 | 81.0%
K | 22 | 10.4%
A | 6 | 2.8%
B | 4 | 1.9%
C | 3 | 1.4%
G | 1 | 0.5%
D | 1 | 0.5%
I | 1 | 0.5%
L | 1 | 0.5%
H | 1 | 0.5%

Other Number
Value | Count | Frequency (%)
(not shown) | 1 | 12.5%
(not shown) | 1 | 12.5%
(not shown) | 1 | 12.5%
(not shown) | 1 | 12.5%
(not shown) | 1 | 12.5%
(not shown) | 1 | 12.5%
(not shown) | 1 | 12.5%
(not shown) | 1 | 12.5%

Other Punctuation
Value | Count | Frequency (%)
, | 12272 | 98.7%
. | 145 | 1.2%
: | 6 | < 0.1%
(not shown) | 2 | < 0.1%
/ | 2 | < 0.1%
(not shown) | 2 | < 0.1%

Math Symbol
Value | Count | Frequency (%)
~ | 66 | 95.7%
(not shown) | 2 | 2.9%
+ | 1 | 1.4%

Lowercase Letter
Value | Count | Frequency (%)
k | 187 | 83.1%
w | 38 | 16.9%

Modifier Symbol
Value | Count | Frequency (%)
` | 4 | 66.7%
˚ | 2 | 33.3%

Other Symbol
Value | Count | Frequency (%)
(not shown) | 2 | 66.7%
(not shown) | 1 | 33.3%

Space Separator
Value | Count | Frequency (%)
(space) | 32778 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 14540 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 3016 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 3012 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 5 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 148895 | 64.2%
Hangul | 82641 | 35.6%
Latin | 436 | 0.2%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(not shown) | 8648 | 10.5%
(not shown) | 7815 | 9.5%
(not shown) | 4656 | 5.6%
(not shown) | 4182 | 5.1%
(not shown) | 4125 | 5.0%
(not shown) | 4038 | 4.9%
(not shown) | 3582 | 4.3%
(not shown) | 3520 | 4.3%
(not shown) | 2642 | 3.2%
(not shown) | 1195 | 1.4%
Other values (326) | 38238 | 46.3%

Common
Value | Count | Frequency (%)
(space) | 32778 | 22.0%
1 | 17652 | 11.9%
- | 14540 | 9.8%
, | 12272 | 8.2%
2 | 11612 | 7.8%
3 | 8934 | 6.0%
4 | 8087 | 5.4%
5 | 7365 | 4.9%
6 | 7132 | 4.8%
7 | 6311 | 4.2%
Other values (26) | 22212 | 14.9%

Latin
Value | Count | Frequency (%)
k | 187 | 42.9%
W | 171 | 39.2%
w | 38 | 8.7%
K | 22 | 5.0%
A | 6 | 1.4%
B | 4 | 0.9%
C | 3 | 0.7%
G | 1 | 0.2%
D | 1 | 0.2%
I | 1 | 0.2%
Other values (2) | 2 | 0.5%

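Most occurring blocks

Value | Count | Frequency (%)
ASCII | 149312 | 64.4%
Hangul | 82641 | 35.6%
Enclosed Alphanum | 8 | < 0.1%
Punctuation | 4 | < 0.1%
Arrows | 2 | < 0.1%
CJK Compat | 2 | < 0.1%
Modifier Letters | 2 | < 0.1%
Misc Symbols | 1 | < 0.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 32778 | 22.0%
1 | 17652 | 11.8%
- | 14540 | 9.7%
, | 12272 | 8.2%
2 | 11612 | 7.8%
3 | 8934 | 6.0%
4 | 8087 | 5.4%
5 | 7365 | 4.9%
6 | 7132 | 4.8%
7 | 6311 | 4.2%
Other values (24) | 22629 | 15.2%

Hangul
Value | Count | Frequency (%)
(not shown) | 8648 | 10.5%
(not shown) | 7815 | 9.5%
(not shown) | 4656 | 5.6%
(not shown) | 4182 | 5.1%
(not shown) | 4125 | 5.0%
(not shown) | 4038 | 4.9%
(not shown) | 3582 | 4.3%
(not shown) | 3520 | 4.3%
(not shown) | 2642 | 3.2%
(not shown) | 1195 | 1.4%
Other values (326) | 38238 | 46.3%

Arrows
Value | Count | Frequency (%)
(not shown) | 2 | 100.0%

Punctuation
Value | Count | Frequency (%)
(not shown) | 2 | 50.0%
(not shown) | 2 | 50.0%

CJK Compat
Value | Count | Frequency (%)
(not shown) | 2 | 100.0%

Modifier Letters
Value | Count | Frequency (%)
˚ | 2 | 100.0%

Enclosed Alphanum
Value | Count | Frequency (%)
(not shown) | 1 | 12.5%
(not shown) | 1 | 12.5%
(not shown) | 1 | 12.5%
(not shown) | 1 | 12.5%
(not shown) | 1 | 12.5%
(not shown) | 1 | 12.5%
(not shown) | 1 | 12.5%
(not shown) | 1 | 12.5%

Misc Symbols
Value | Count | Frequency (%)
(not shown) | 1 | 100.0%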

설치구분 (installation type)
Categorical

HIGH CORRELATION  IMBALANCE

Distinct: 12
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
<NA> | 6442
건물상부 | 3514
(not shown) | 33
건물상부, 토지 | 2
건물상부, 토지위 | 2
Other values (7) | 7

Length

Max length: 10
Median length: 4
Mean length: 3.9921
Min length: 1

Unique

Unique: 7
Unique (%): 0.1%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: 건물상부
4th row: 건물상부
5th row: 건물상부

Common Values

Value | Count | Frequency (%)
<NA> | 6442 | 64.4%
건물상부 | 3514 | 35.1%
(not shown) | 33 | 0.3%
건물상부, 토지 | 2 | < 0.1%
건물상부, 토지위 | 2 | < 0.1%
외벽 | 1 | < 0.1%
(not shown) | 1 | < 0.1%
건물상부, 토지 위 | 1 | < 0.1%
수상 | 1 | < 0.1%
토지 및 건물 | 1 | < 0.1%
Other values (2) | 2 | < 0.1%

Length

Histogram of lengths of the category

Words

Value | Count | Frequency (%)
na | 6442 | 64.6%
건물상부 | 3519 | 35.3%
토지 | 4 | < 0.1%
토지위 | 3 | < 0.1%
외벽 | 1 | < 0.1%
(not shown) | 1 | < 0.1%
(not shown) | 1 | < 0.1%
수상 | 1 | < 0.1%
(not shown) | 1 | < 0.1%
건물 | 1 | < 0.1%

허가면적(제곱미터) (permitted area, square meters)
Text

Distinct: 5228
Distinct (%): 55.2%
Missing: 522
Missing (%): 5.2%
Memory size: 156.2 KiB

Length

Max length: 6
Median length: 4
Mean length: 3.8454315
Min length: 1

Characters and Unicode

Total characters: 36447
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 3373
Unique (%): 35.6%

Sample

1st row: 2254
2nd row: 11536
3rd row: 133
4th row: 1408
5th row: 196

Words

Value | Count | Frequency (%)
712 | 49 | 0.5%
420 | 42 | 0.4%
10000 | 33 | 0.3%
392 | 31 | 0.3%
388 | 28 | 0.3%
600 | 26 | 0.3%
6287 | 26 | 0.3%
504 | 24 | 0.3%
3900 | 23 | 0.2%
633 | 22 | 0.2%
Other values (5217) | 9173 | 96.8%

Most occurring characters

Value | Count | Frequency (%)
1 | 5204 | 14.3%
0 | 4387 | 12.0%
2 | 4162 | 11.4%
3 | 3645 | 10.0%
4 | 3513 | 9.6%
5 | 3459 | 9.5%
6 | 3352 | 9.2%
8 | 2943 | 8.1%
7 | 2928 | 8.0%
9 | 2853 | 7.8%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 36446 | > 99.9%
Space Separator | 1 | < 0.1%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
1 | 5204 | 14.3%
0 | 4387 | 12.0%
2 | 4162 | 11.4%
3 | 3645 | 10.0%
4 | 3513 | 9.6%
5 | 3459 | 9.5%
6 | 3352 | 9.2%
8 | 2943 | 8.1%
7 | 2928 | 8.0%
9 | 2853 | 7.8%

Space Separator
Value | Count | Frequency (%)
(space) | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 36447 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
1 | 5204 | 14.3%
0 | 4387 | 12.0%
2 | 4162 | 11.4%
3 | 3645 | 10.0%
4 | 3513 | 9.6%
5 | 3459 | 9.5%
6 | 3352 | 9.2%
8 | 2943 | 8.1%
7 | 2928 | 8.0%
9 | 2853 | 7.8%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 36447 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
1 | 5204 | 14.3%
0 | 4387 | 12.0%
2 | 4162 | 11.4%
3 | 3645 | 10.0%
4 | 3513 | 9.6%
5 | 3459 | 9.5%
6 | 3352 | 9.2%
8 | 2943 | 8.1%
7 | 2928 | 8.0%
9 | 2853 | 7.8%

설비용량 (installed capacity)
Text

Distinct: 697
Distinct (%): 7.0%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 4
Median length: 3
Mean length: 2.6213
Min length: 1

Characters and Unicode

Total characters: 26213
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 266
Unique (%): 2.7%

Sample

1st row: 99
2nd row: 198
3rd row: 20
4th row: 95
5th row: 19

Words

Value | Count | Frequency (%)
99 | 1730 | 17.3%
499 | 468 | 4.7%
30 | 408 | 4.1%
200 | 323 | 3.2%
300 | 275 | 2.8%
199 | 253 | 2.5%
999 | 242 | 2.4%
500 | 237 | 2.4%
497 | 217 | 2.2%
20 | 211 | 2.1%
Other values (686) | 5635 | 56.4%

Most occurring characters

Value | Count | Frequency (%)
9 | 9560 | 36.5%
0 | 4567 | 17.4%
4 | 2179 | 8.3%
2 | 1917 | 7.3%
1 | 1785 | 6.8%
3 | 1507 | 5.7%
8 | 1430 | 5.5%
5 | 1416 | 5.4%
7 | 1078 | 4.1%
6 | 771 | 2.9%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 26210 | > 99.9%
Space Separator | 3 | < 0.1%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
9 | 9560 | 36.5%
0 | 4567 | 17.4%
4 | 2179 | 8.3%
2 | 1917 | 7.3%
1 | 1785 | 6.8%
3 | 1507 | 5.7%
8 | 1430 | 5.5%
5 | 1416 | 5.4%
7 | 1078 | 4.1%
6 | 771 | 2.9%

Space Separator
Value | Count | Frequency (%)
(space) | 3 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 26213 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
9 | 9560 | 36.5%
0 | 4567 | 17.4%
4 | 2179 | 8.3%
2 | 1917 | 7.3%
1 | 1785 | 6.8%
3 | 1507 | 5.7%
8 | 1430 | 5.5%
5 | 1416 | 5.4%
7 | 1078 | 4.1%
6 | 771 | 2.9%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 26213 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
9 | 9560 | 36.5%
0 | 4567 | 17.4%
4 | 2179 | 8.3%
2 | 1917 | 7.3%
1 | 1785 | 6.8%
3 | 1507 | 5.7%
8 | 1430 | 5.5%
5 | 1416 | 5.4%
7 | 1078 | 4.1%
6 | 771 | 2.9%

공급전압 (supply voltage)
Categorical

Distinct: 4
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
380 | 5841
220 | 1855
22900 | 1602
<NA> | 702

Length

Max length: 5
Median length: 3
Mean length: 3.3906
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 380
2nd row: 380
3rd row: 220
4th row: <NA>
5th row: 380

Common Values

Value | Count | Frequency (%)
380 | 5841 | 58.4%
220 | 1855 | 18.6%
22900 | 1602 | 16.0%
<NA> | 702 | 7.0%

Length

Histogram of lengths of the category

Common Values (Plot)

[Plot: common values]

Words

Value | Count | Frequency (%)
380 | 5841 | 58.4%
220 | 1855 | 18.6%
22900 | 1602 | 16.0%
na | 702 | 7.0%

준비기간 (preparation period)
Date

Distinct: 1148
Distinct (%): 11.5%
Missing: 19
Missing (%): 0.2%
Memory size: 156.2 KiB
Minimum: 2005-10-01 00:00:00
Maximum: 2022-10-31 00:00:00
Histogram with fixed size bins (bins=50)

사업개시신고일 (business start report date)
Date

MISSING

Distinct: 1137
Distinct (%): 30.3%
Missing: 6243
Missing (%): 62.4%
Memory size: 156.2 KiB
Minimum: 1902-09-22 00:00:00
Maximum: 2020-01-15 00:00:00
Histogram with fixed size bins (bins=50)

비고 (remarks)
Categorical

HIGH CORRELATION  IMBALANCE

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
<NA> | 8215
공급전압(220~380) | 1784
허가면적 (토지)7650, (건물)3733 | 1

Length

Max length: 23
Median length: 4
Mean length: 5.6075
Min length: 4

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: 공급전압(220~380)
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
<NA> | 8215 | 82.2%
공급전압(220~380) | 1784 | 17.8%
허가면적 (토지)7650, (건물)3733 | 1 | < 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

[Plot: common values]

Words

Value | Count | Frequency (%)
na | 8215 | 82.1%
공급전압(220~380 | 1784 | 17.8%
허가면적 | 1 | < 0.1%
토지)7650 | 1 | < 0.1%
건물)3733 | 1 | < 0.1%

Correlations

Variable | 시군 | 설치구분 | 공급전압 | 비고
시군 | 1.000 | 0.000 | 0.288 | 0.000
설치구분 | 0.000 | 1.000 | 0.000 | NaN
공급전압 | 0.288 | 0.000 | 1.000 | 0.706
비고 | 0.000 | NaN | 0.706 | 1.000

Variable | 시군 | 설치구분 | 공급전압 | 비고
시군 | 1.000 | 0.000 | 0.166 | 0.000
설치구분 | 0.000 | 1.000 | 0.000 | 1.000
공급전압 | 0.166 | 0.000 | 1.000 | 0.499
비고 | 0.000 | 1.000 | 0.499 | 1.000

Variable | 시군 | 설치구분 | 공급전압 | 비고
시군 | 1.000 | 0.000 | 0.166 | 0.000
설치구분 | 0.000 | 1.000 | 0.000 | 1.000
공급전압 | 0.166 | 0.000 | 1.000 | 0.499
비고 | 0.000 | 1.000 | 0.499 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
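
Nullity correlation can be read as the Pearson correlation between the missing-value indicators of two columns. The sketch below illustrates that idea with the standard library only. The function name and the toy columns are hypothetical, and it is not the code that produced the heatmap in this report.

```python
import math

def nullity_correlation(col_a, col_b):
    """Pearson correlation between the missing-value indicators of two columns.

    Returns None when either column is entirely present or entirely missing,
    because the correlation is undefined for a constant indicator."""
    a = [1.0 if v is None else 0.0 for v in col_a]
    b = [1.0 if v is None else 0.0 for v in col_b]
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return None
    return cov / math.sqrt(var_a * var_b)

# Hypothetical toy columns: the two are always missing together.
start_dates = ["2012-07-12", None, "2012-04-03", None]
areas = [2254, None, 133, None]
print(nullity_correlation(start_dates, areas))  # 1.0
```

A value of 1 means the two columns are missing in exactly the same rows, -1 means one is present exactly when the other is missing, and values near 0 mean the missingness patterns are unrelated.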

Sample

(index) | 허가일 | 에너지원 | 발전소명칭 | 시군 | 사업장소 | 설치구분 | 허가면적(제곱미터) | 설비용량 | 공급전압 | 준비기간 | 사업개시신고일 | 비고
1115 | 2012-05-08 | 태양광 | 선운에너지농장태양광발전소 | 고창군 | 아산면 계산리 444-2 | <NA> | 2254 | 99 | 380 | 2015-05-06 | 2012-07-12 | <NA>
5892 | 2017-06-22 | 태양광 | 운봉태양광발전소 | 장수군 | 번암면 유정리 254-1, 산221-2 | <NA> | 11536 | 198 | 380 | 2020-06-21 | <NA> | <NA>
933 | 2012-01-10 | 태양광 | 세광태양광발전소1호 | 김제시 | 요촌동 354-8 건물상부 | 건물상부 | 133 | 20 | 220 | 2015-01-09 | 2012-04-03 | 공급전압(220~380)
888 | 2011-12-07 | 태양광 | 마이2호 태양광발전소 | 김제시 | 황산면 봉월리 51-1, 건축물위 | 건물상부 | 1408 | 95 | <NA> | 2014-12-06 | 2012-10-09 | <NA>
1064 | 2012-04-02 | 태양광 | 연정에너지 | 익산시 | 왕궁면 연정길 13-20 건물상부 | 건물상부 | 196 | 19 | 380 | 2015-03-30 | 2012-06-11 | <NA>
6747 | 2017-11-14 | 태양광 | 정월3호태양광발전소 | 임실군 | 임실읍 정월리 산9-1 | <NA> | 6183 | 500 | 380 | 2020-11-13 | <NA> | <NA>
7887 | 2018-04-04 | 태양광 | 솔라비젼스타태양광발전소 | 남원시 | 대강면 월탄리 64, 85, 86, 66-1, 68, 84, 산26, 66-2, 67, 83 | <NA> | 13984 | 995 | 22900 | 2021-04-03 | <NA> | <NA>
6777 | 2017-11-17 | 태양광 | 해풍3호태양광발전소 | 김제시 | 청하면 관상리 770-1, 770-2, 771, 769-3, 769-4,769-5,769-6 | <NA> | 4122 | 250 | 380 | 2020-11-16 | <NA> | <NA>
1238 | 2012-07-24 | 태양광 | 강명태양광발전소 | 완주군 | 소양면 명덕리 14-15 건물상부 | 건물상부 | 132 | 20 | 380 | 2015-07-23 | 2012-10-11 | <NA>
3547 | 2014-01-03 | 태양광 | 은숙태양광발전소2호 | 정읍시 | 북면 화평길 93-98(신평리 140-2)건물상부(주1) | 건물상부 | 307 | 50 | 220 | 2017-01-02 | 2014-04-16 | 공급전압(220~380)

(index) | 허가일 | 에너지원 | 발전소명칭 | 시군 | 사업장소 | 설치구분 | 허가면적(제곱미터) | 설비용량 | 공급전압 | 준비기간 | 사업개시신고일 | 비고
9951 | 2019-06-13 | 태양광 | 관호농장3호 | 김제시 | 공덕면 황산리 34-1 건물상부(주1, 주2, 주3) | 건물상부 | 2555 | 300 | 380 | 2022-06-12 | <NA> | <NA>
9332 | 2018-10-30 | 태양광 | 찬송태양광발전소 | 장수군 | 산서면 봉서리 168-1(건물 위) | 건물상부 | 2898 | 500 | 380 | 2021-10-29 | <NA> | <NA>
9127 | 2018-09-27 | 태양광 | 수양촌태양광발전소 | 남원시 | 덕과면 덕촌리 412-1, 412-2, 412-3, 412-4, 412-5 | <NA> | 4661 | 419 | 380 | 2021-09-26 | <NA> | <NA>
5744 | 2017-05-26 | 태양광 | 화촌11태양광발전소 | 임실군 | 삼계면 덕계리 641 | <NA> | 2298 | 150 | 380 | 2020-05-25 | <NA> | <NA>
2203 | 2013-06-19 | 태양광 | 온누리태양광발전소 | 익산시 | 신흥동 903-4 | <NA> | 2000 | 98 | 380 | 2016-06-18 | <NA> | <NA>
3460 | 2013-12-23 | 태양광 | JH태양광발전소 | 남원시 | 대산면 대곡리 산50-6 | <NA> | 1057 | 99 | 220 | 2016-12-22 | 2015-07-22 | 공급전압(220~380)
2303 | 2013-07-08 | 태양광 | 사랑태양광발전소 | 완주군 | 소양면 명덕리 238,239-1,997-1,997-2 | <NA> | 2575 | 99 | 380 | 2016-07-07 | 2014-06-13 | <NA>
77 | 2007-11-30 | 태양광 | 여간태양광2호기 | 정읍시 | 덕천면 도계리 85-6 | <NA> | 3000 | 99 | <NA> | 2008-11-29 | 2008-07-28 | <NA>
2682 | 2013-09-11 | 태양광 | 쏠라리남원3태양광발전소 | 남원시 | 대산면 대곡리 638-17 건물상부(주1,주2) | 건물상부 | 392 | 100 | 380 | 2016-09-10 | 2014-04-25 | <NA>
4212 | 2014-03-31 | 태양광 | 석전태양광발전소1호 | 완주군 | 삼례읍 석전리 247-14 건물상부 | 건물상부 | 397 | 99 | 220 | 2017-03-30 | 2014-09-22 | 공급전압(220~380)