Overview

Dataset statistics

Number of variables7
Number of observations369
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory20.3 KiB
Average record size in memory56.3 B

Variable types

Text4
Categorical1
DateTime2

Dataset

Description전라남도내 신재생에너지 발전 상업운전 현황태양광, 풍력, 소수력, 바이오에너지 등발전소명, 발전소위치, 사업허가일, 사업개시일, 발전용량 등
Author전라남도
URLhttps://www.data.go.kr/data/15029803/fileData.do

Alerts

종류 is highly imbalanced (87.3%)Imbalance
허가번호 has unique valuesUnique

Reproduction

Analysis started2024-03-14 12:18:13.310739
Analysis finished2024-03-14 12:18:14.479767
Duration1.17 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

허가번호
Text

UNIQUE 

Distinct369
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
2024-03-14T21:18:15.378232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length8
Mean length7.7804878
Min length5

Characters and Unicode

Total characters2871
Distinct characters14
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique369 ?
Unique (%)100.0%

Sample

1st row전남제3호
2nd row전남제13호
3rd row전남제15호
4th row전남제16호
5th row전남제20호
ValueCountFrequency (%)
전남제3호 1
 
0.3%
전남제4367호 1
 
0.3%
전남제4377호 1
 
0.3%
전남제4376호 1
 
0.3%
전남제4375호 1
 
0.3%
전남제4374호 1
 
0.3%
전남제4373호 1
 
0.3%
전남제4372호 1
 
0.3%
전남제4371호 1
 
0.3%
전남제4370호 1
 
0.3%
Other values (359) 359
97.3%
2024-03-14T21:18:16.898318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
369
12.9%
369
12.9%
369
12.9%
369
12.9%
4 364
12.7%
3 210
7.3%
1 156
5.4%
2 116
 
4.0%
5 104
 
3.6%
7 102
 
3.6%
Other values (4) 343
11.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1476
51.4%
Decimal Number 1395
48.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 364
26.1%
3 210
15.1%
1 156
11.2%
2 116
 
8.3%
5 104
 
7.5%
7 102
 
7.3%
6 95
 
6.8%
8 94
 
6.7%
0 85
 
6.1%
9 69
 
4.9%
Other Letter
ValueCountFrequency (%)
369
25.0%
369
25.0%
369
25.0%
369
25.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1476
51.4%
Common 1395
48.6%

Most frequent character per script

Common
ValueCountFrequency (%)
4 364
26.1%
3 210
15.1%
1 156
11.2%
2 116
 
8.3%
5 104
 
7.5%
7 102
 
7.3%
6 95
 
6.8%
8 94
 
6.7%
0 85
 
6.1%
9 69
 
4.9%
Hangul
ValueCountFrequency (%)
369
25.0%
369
25.0%
369
25.0%
369
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1476
51.4%
ASCII 1395
48.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
369
25.0%
369
25.0%
369
25.0%
369
25.0%
ASCII
ValueCountFrequency (%)
4 364
26.1%
3 210
15.1%
1 156
11.2%
2 116
 
8.3%
5 104
 
7.5%
7 102
 
7.3%
6 95
 
6.8%
8 94
 
6.7%
0 85
 
6.1%
9 69
 
4.9%
Distinct368
Distinct (%)99.7%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
2024-03-14T21:18:18.028945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length28
Mean length22.216802
Min length6

Characters and Unicode

Total characters8198
Distinct characters304
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique367 ?
Unique (%)99.5%

Sample

1st row대한소수력발전㈜(보성 소수력발전소)
2nd row솔라플러스(주)
3rd row에스디엔 주식회사(에스디엔 순천 태양광발전소)
4th row한국농어촌공사 담양지사(담양호 소수력발전소)
5th row㈜지엔씨에너지(순천생활폐기물매립장 매립가스발전소)
ValueCountFrequency (%)
태양광발전소 309
38.9%
고흥솔라파크 11
 
1.4%
주식회사(고흥만 11
 
1.4%
소수력발전소 4
 
0.5%
㈜포스코건설(전남 4
 
0.5%
발전소 4
 
0.5%
주식회사 4
 
0.5%
유)라미솔라(라미솔라 4
 
0.5%
영농조합법인 3
 
0.4%
강진군 3
 
0.4%
Other values (428) 437
55.0%
2024-03-14T21:18:19.272274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
438
 
5.3%
434
 
5.3%
) 433
 
5.3%
( 431
 
5.3%
430
 
5.2%
426
 
5.2%
420
 
5.1%
410
 
5.0%
399
 
4.9%
358
 
4.4%
Other values (294) 4019
49.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6335
77.3%
Close Punctuation 433
 
5.3%
Open Punctuation 431
 
5.3%
Space Separator 430
 
5.2%
Decimal Number 269
 
3.3%
Other Symbol 246
 
3.0%
Uppercase Letter 49
 
0.6%
Dash Punctuation 3
 
< 0.1%
Other Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
438
 
6.9%
434
 
6.9%
426
 
6.7%
420
 
6.6%
410
 
6.5%
399
 
6.3%
358
 
5.7%
277
 
4.4%
247
 
3.9%
224
 
3.5%
Other values (258) 2702
42.7%
Uppercase Letter
ValueCountFrequency (%)
P 9
18.4%
S 6
12.2%
O 4
 
8.2%
E 4
 
8.2%
K 3
 
6.1%
C 3
 
6.1%
L 3
 
6.1%
F 2
 
4.1%
G 2
 
4.1%
I 2
 
4.1%
Other values (9) 11
22.4%
Decimal Number
ValueCountFrequency (%)
2 108
40.1%
5 60
22.3%
1 53
19.7%
3 20
 
7.4%
8 7
 
2.6%
4 7
 
2.6%
7 6
 
2.2%
9 4
 
1.5%
6 2
 
0.7%
0 2
 
0.7%
Other Punctuation
ValueCountFrequency (%)
# 1
50.0%
& 1
50.0%
Close Punctuation
ValueCountFrequency (%)
) 433
100.0%
Open Punctuation
ValueCountFrequency (%)
( 431
100.0%
Space Separator
ValueCountFrequency (%)
430
100.0%
Other Symbol
ValueCountFrequency (%)
246
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6581
80.3%
Common 1568
 
19.1%
Latin 49
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
438
 
6.7%
434
 
6.6%
426
 
6.5%
420
 
6.4%
410
 
6.2%
399
 
6.1%
358
 
5.4%
277
 
4.2%
247
 
3.8%
246
 
3.7%
Other values (259) 2926
44.5%
Latin
ValueCountFrequency (%)
P 9
18.4%
S 6
12.2%
O 4
 
8.2%
E 4
 
8.2%
K 3
 
6.1%
C 3
 
6.1%
L 3
 
6.1%
F 2
 
4.1%
G 2
 
4.1%
I 2
 
4.1%
Other values (9) 11
22.4%
Common
ValueCountFrequency (%)
) 433
27.6%
( 431
27.5%
430
27.4%
2 108
 
6.9%
5 60
 
3.8%
1 53
 
3.4%
3 20
 
1.3%
8 7
 
0.4%
4 7
 
0.4%
7 6
 
0.4%
Other values (6) 13
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6335
77.3%
ASCII 1617
 
19.7%
None 246
 
3.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
438
 
6.9%
434
 
6.9%
426
 
6.7%
420
 
6.6%
410
 
6.5%
399
 
6.3%
358
 
5.7%
277
 
4.4%
247
 
3.9%
224
 
3.5%
Other values (258) 2702
42.7%
ASCII
ValueCountFrequency (%)
) 433
26.8%
( 431
26.7%
430
26.6%
2 108
 
6.7%
5 60
 
3.7%
1 53
 
3.3%
3 20
 
1.2%
P 9
 
0.6%
8 7
 
0.4%
4 7
 
0.4%
Other values (25) 59
 
3.6%
None
ValueCountFrequency (%)
246
100.0%

종류
Categorical

IMBALANCE 

Distinct4
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
태양광
357 
풍력
 
6
소수력
 
4
바이오에너지
 
2

Length

Max length6
Median length3
Mean length3
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row소수력
2nd row태양광
3rd row태양광
4th row소수력
5th row바이오에너지

Common Values

ValueCountFrequency (%)
태양광 357
96.7%
풍력 6
 
1.6%
소수력 4
 
1.1%
바이오에너지 2
 
0.5%

Length

2024-03-14T21:18:19.731872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T21:18:20.089445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
태양광 357
96.7%
풍력 6
 
1.6%
소수력 4
 
1.1%
바이오에너지 2
 
0.5%
Distinct362
Distinct (%)98.1%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
2024-03-14T21:18:21.038965image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length250
Median length89
Mean length34.149051
Min length12

Characters and Unicode

Total characters12601
Distinct characters218
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique355 ?
Unique (%)96.2%

Sample

1st row보성군 득량면 송곡리 1097-6
2nd row함평군 손불면 산남리 89-8,9,산111-1
3rd row순천시 별량면 학산리 127-48외 9필지
4th row담양군 금성면 대성리 산5-3 일원
5th row순천시 왕지동 483-3(생활폐기물매립장)
ValueCountFrequency (%)
신안군 72
 
3.5%
영광군 69
 
3.3%
지도읍 62
 
3.0%
하사리 52
 
2.5%
백수읍 48
 
2.3%
해남군 47
 
2.3%
황산면 28
 
1.3%
광양시 27
 
1.3%
염산면 18
 
0.9%
감정리 17
 
0.8%
Other values (1172) 1636
78.8%
2024-03-14T21:18:22.514796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1760
 
14.0%
1 887
 
7.0%
- 700
 
5.6%
, 691
 
5.5%
8 476
 
3.8%
3 469
 
3.7%
2 448
 
3.6%
0 421
 
3.3%
5 419
 
3.3%
( 404
 
3.2%
Other values (208) 5926
47.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4499
35.7%
Other Letter 4128
32.8%
Space Separator 1760
 
14.0%
Dash Punctuation 700
 
5.6%
Other Punctuation 695
 
5.5%
Open Punctuation 405
 
3.2%
Close Punctuation 405
 
3.2%
Uppercase Letter 5
 
< 0.1%
Math Symbol 3
 
< 0.1%
Other Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
363
 
8.8%
311
 
7.5%
217
 
5.3%
178
 
4.3%
166
 
4.0%
127
 
3.1%
113
 
2.7%
113
 
2.7%
109
 
2.6%
107
 
2.6%
Other values (180) 2324
56.3%
Decimal Number
ValueCountFrequency (%)
1 887
19.7%
8 476
10.6%
3 469
10.4%
2 448
10.0%
0 421
9.4%
5 419
9.3%
4 396
8.8%
7 360
8.0%
6 321
 
7.1%
9 302
 
6.7%
Uppercase Letter
ValueCountFrequency (%)
R 1
20.0%
S 1
20.0%
D 1
20.0%
I 1
20.0%
X 1
20.0%
Other Punctuation
ValueCountFrequency (%)
, 691
99.4%
/ 2
 
0.3%
: 1
 
0.1%
. 1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 404
99.8%
[ 1
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 404
99.8%
] 1
 
0.2%
Math Symbol
ValueCountFrequency (%)
~ 2
66.7%
+ 1
33.3%
Space Separator
ValueCountFrequency (%)
1760
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 700
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 8467
67.2%
Hangul 4129
32.8%
Latin 5
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
363
 
8.8%
311
 
7.5%
217
 
5.3%
178
 
4.3%
166
 
4.0%
127
 
3.1%
113
 
2.7%
113
 
2.7%
109
 
2.6%
107
 
2.6%
Other values (181) 2325
56.3%
Common
ValueCountFrequency (%)
1760
20.8%
1 887
10.5%
- 700
 
8.3%
, 691
 
8.2%
8 476
 
5.6%
3 469
 
5.5%
2 448
 
5.3%
0 421
 
5.0%
5 419
 
4.9%
( 404
 
4.8%
Other values (12) 1792
21.2%
Latin
ValueCountFrequency (%)
R 1
20.0%
S 1
20.0%
D 1
20.0%
I 1
20.0%
X 1
20.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 8472
67.2%
Hangul 4128
32.8%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1760
20.8%
1 887
10.5%
- 700
 
8.3%
, 691
 
8.2%
8 476
 
5.6%
3 469
 
5.5%
2 448
 
5.3%
0 421
 
5.0%
5 419
 
4.9%
( 404
 
4.8%
Other values (17) 1797
21.2%
Hangul
ValueCountFrequency (%)
363
 
8.8%
311
 
7.5%
217
 
5.3%
178
 
4.3%
166
 
4.0%
127
 
3.1%
113
 
2.7%
113
 
2.7%
109
 
2.6%
107
 
2.6%
Other values (180) 2324
56.3%
None
ValueCountFrequency (%)
1
100.0%
Distinct155
Distinct (%)42.0%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
2024-03-14T21:18:23.854333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length4
Mean length5.1056911
Min length4

Characters and Unicode

Total characters1884
Distinct characters12
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique129 ?
Unique (%)35.0%

Sample

1st row1200
2nd row1500
3rd row1650
4th row1274
5th row1850
ValueCountFrequency (%)
1000 60
 
16.3%
2999.36 32
 
8.7%
2999 26
 
7.0%
2000 24
 
6.5%
3000 21
 
5.7%
2997 12
 
3.3%
1500 8
 
2.2%
2995.2 6
 
1.6%
2910.6 5
 
1.4%
2016 5
 
1.4%
Other values (145) 170
46.1%
2024-03-14T21:18:25.636083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 455
24.2%
9 309
16.4%
2 254
13.5%
1 207
11.0%
. 151
 
8.0%
6 116
 
6.2%
3 94
 
5.0%
8 84
 
4.5%
5 81
 
4.3%
4 68
 
3.6%
Other values (2) 65
 
3.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1732
91.9%
Other Punctuation 152
 
8.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 455
26.3%
9 309
17.8%
2 254
14.7%
1 207
12.0%
6 116
 
6.7%
3 94
 
5.4%
8 84
 
4.8%
5 81
 
4.7%
4 68
 
3.9%
7 64
 
3.7%
Other Punctuation
ValueCountFrequency (%)
. 151
99.3%
, 1
 
0.7%

Most occurring scripts

ValueCountFrequency (%)
Common 1884
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 455
24.2%
9 309
16.4%
2 254
13.5%
1 207
11.0%
. 151
 
8.0%
6 116
 
6.2%
3 94
 
5.0%
8 84
 
4.5%
5 81
 
4.3%
4 68
 
3.6%
Other values (2) 65
 
3.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1884
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 455
24.2%
9 309
16.4%
2 254
13.5%
1 207
11.0%
. 151
 
8.0%
6 116
 
6.2%
3 94
 
5.0%
8 84
 
4.5%
5 81
 
4.3%
4 68
 
3.6%
Other values (2) 65
 
3.5%
Distinct186
Distinct (%)50.4%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
Minimum2002-07-02 00:00:00
Maximum2022-07-27 00:00:00
2024-03-14T21:18:26.042196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T21:18:26.474466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct210
Distinct (%)56.9%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
Minimum2005-07-29 00:00:00
Maximum2023-07-05 00:00:00
2024-03-14T21:18:26.861794image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T21:18:27.283182image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Missing values

2024-03-14T21:18:13.938744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T21:18:14.327990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

허가번호상호 (발전소명)종류발전소 주소설비용량 (킬로와트)허가일개시일
0전남제3호대한소수력발전㈜(보성 소수력발전소)소수력보성군 득량면 송곡리 1097-612002002-07-022009-03-03
1전남제13호솔라플러스(주)태양광함평군 손불면 산남리 89-8,9,산111-115002004-04-022008-04-11
2전남제15호에스디엔 주식회사(에스디엔 순천 태양광발전소)태양광순천시 별량면 학산리 127-48외 9필지16502004-05-182005-12-26
3전남제16호한국농어촌공사 담양지사(담양호 소수력발전소)소수력담양군 금성면 대성리 산5-3 일원12742004-06-182006-07-01
4전남제20호㈜지엔씨에너지(순천생활폐기물매립장 매립가스발전소)바이오에너지순천시 왕지동 483-3(생활폐기물매립장)18502005-02-112005-07-29
5전남제55호에스디엔㈜(에스디엔 순천 태양광발전소)태양광순천시 별량면 두고리 301-210002005-12-132007-01-15
6전남제61호한국수력원자력㈜(영광 솔라파크)태양광영광군 홍농읍 성산리 743,계마리96630002005-12-132007-05-23
7전남제70호남해에너지개발㈜(남해태양광발전소)태양광강진군 신전면 용월리산255-15,26, 130002005-12-272007-11-13
8전남제71호신안풍력발전㈜(신안 풍력발전소)풍력신안군 비금면 구림리 산1-130002006-01-022008-12-28
9전남제85호썬엔정에너지(주)(썬엔정에너지 태양광발전소)태양광영광군 법성면 용덕리 산23-1, 용성리 산11610002006-04-262007-11-06
허가번호상호 (발전소명)종류발전소 주소설비용량 (킬로와트)허가일개시일
359전남제4719호대불제이씨2호㈜(대불제이씨이호2 태양광발전소)태양광영암군 삼호읍 나불리 611-26(7-소조립장)(건물위)1139.22021-06-252022-02-04
360전남제4720호대불제이씨2호㈜(대불제이씨이호1 태양광발전소)태양광영암군 삼호읍 난전리 1710-6(1동)(건물위)2079.042021-06-252022-01-13
361전남제4721호대불제이씨1호㈜(대불제이씨일호2 태양광발전소)태양광영암군 삼호읍 나불리 341-13(건물위)2157.362021-06-252022-01-24
362전남제4771호㈜무주에너지(무주에너지 태양광발전소)태양광장성군 장성읍 유탕리 1545외 8필지 (건물 주2동, 14동, 16동, 17동, 18동, 21동, 23동/90번지 주1동, 부1~7동, 주차장 상부)2995.22021-08-132022-03-23
363전남제4770호㈜무주(무주 태양광발전소)태양광장성군 장성읍 유탕리 1545, 영천리 90, 100 건물위[유탕리 1545, 영천리 100(주1, 주3, 주4, 주10, 주20) 영천리 90 (주1)]2700.722021-08-172022-03-23
364전남제4779호디에스알 주식회사(DSR(주)광양 태양광발전소)태양광광양시 광양읍 초남리 759 DSR(주)공장 건물 위1373.42021-09-012021-12-27
365전남제4774호현대스틸산업(주)(현대스틸산업 율촌 태양광발전소)태양광광양시 광양읍 세풍리 2202 현대스틸산업 공장 지붕위2891.492021-09-062022-12-19
366전남제4818호썬스타트 주식회사(썬스타트 대평2호 태양광발전소)태양광광양시 도이동 825(건물 위)25002022-02-232022-12-15
367전남제4819호와이제이솔라 주식회사(와이제이솔라 대평1호 태양광발전소)태양광광양시 도이동 825(건물 위)25002022-02-232022-12-15
368전남제4828호(재)광양시사랑나눔복지재단(광양 복지형태양광발전소)태양광광양시 광양읍 세풍리 22221265.552022-07-272022-12-29