Overview

Dataset statistics

Number of variables7
Number of observations1554
Missing cells622
Missing cells (%)5.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory85.1 KiB
Average record size in memory56.1 B

Variable types

Text4
Categorical2
DateTime1

Dataset

Description경상북도 예천군 태양광 발전 허가현황(상호, 소재지, 발전원, 발전용량, 허가일, 사업개시, 비고)입니다. 데이터 기준일까지 허가 신청된 정보를 제공하고 있습니다.
Author경상북도 예천군
URLhttps://www.data.go.kr/data/15099400/fileData.do

Alerts

발전원 is highly imbalanced (99.2%)Imbalance
사업개시 has 622 (40.0%) missing valuesMissing

Reproduction

Analysis started2023-12-12 16:44:56.546779
Analysis finished2023-12-12 16:44:57.183274
Duration0.64 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

상호
Text

Distinct1520
Distinct (%)97.8%
Missing0
Missing (%)0.0%
Memory size12.3 KiB
2023-12-13T01:44:57.356847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length24
Mean length8.8841699
Min length2

Characters and Unicode

Total characters13806
Distinct characters407
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1488 ?
Unique (%)95.8%

Sample

1st row예천SP태양광제2발전소
2nd row예천솔라팜
3rd row㈜솔라테크에너지소화리태양광발전소
4th row영남햇빛 발전소
5th row솔라태양광발전소
ValueCountFrequency (%)
태양광발전소 470
 
20.8%
발전소 97
 
4.3%
태양광 32
 
1.4%
에너지 15
 
0.7%
주식회사 11
 
0.5%
롯데 6
 
0.3%
태양광㈜ 6
 
0.3%
석명 5
 
0.2%
쏠라포스 4
 
0.2%
도화 4
 
0.2%
Other values (1545) 1612
71.3%
2023-12-13T01:44:57.730574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1355
 
9.8%
1340
 
9.7%
1332
 
9.6%
1243
 
9.0%
1242
 
9.0%
1216
 
8.8%
718
 
5.2%
409
 
3.0%
2 202
 
1.5%
1 166
 
1.2%
Other values (397) 4583
33.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 12273
88.9%
Space Separator 718
 
5.2%
Decimal Number 577
 
4.2%
Uppercase Letter 143
 
1.0%
Other Symbol 53
 
0.4%
Open Punctuation 12
 
0.1%
Close Punctuation 12
 
0.1%
Lowercase Letter 10
 
0.1%
Dash Punctuation 6
 
< 0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1355
 
11.0%
1340
 
10.9%
1332
 
10.9%
1243
 
10.1%
1242
 
10.1%
1216
 
9.9%
409
 
3.3%
145
 
1.2%
125
 
1.0%
110
 
0.9%
Other values (353) 3756
30.6%
Uppercase Letter
ValueCountFrequency (%)
S 29
20.3%
H 20
14.0%
J 14
9.8%
M 10
 
7.0%
U 10
 
7.0%
K 7
 
4.9%
O 7
 
4.9%
E 7
 
4.9%
N 6
 
4.2%
P 6
 
4.2%
Other values (8) 27
18.9%
Decimal Number
ValueCountFrequency (%)
2 202
35.0%
1 166
28.8%
3 88
15.3%
4 36
 
6.2%
5 27
 
4.7%
6 21
 
3.6%
7 17
 
2.9%
9 8
 
1.4%
8 7
 
1.2%
0 5
 
0.9%
Lowercase Letter
ValueCountFrequency (%)
e 2
20.0%
n 1
10.0%
l 1
10.0%
y 1
10.0%
o 1
10.0%
w 1
10.0%
r 1
10.0%
t 1
10.0%
a 1
10.0%
Space Separator
ValueCountFrequency (%)
718
100.0%
Other Symbol
ValueCountFrequency (%)
53
100.0%
Open Punctuation
ValueCountFrequency (%)
( 12
100.0%
Close Punctuation
ValueCountFrequency (%)
) 12
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%
Math Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 12326
89.3%
Common 1327
 
9.6%
Latin 153
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1355
 
11.0%
1340
 
10.9%
1332
 
10.8%
1243
 
10.1%
1242
 
10.1%
1216
 
9.9%
409
 
3.3%
145
 
1.2%
125
 
1.0%
110
 
0.9%
Other values (354) 3809
30.9%
Latin
ValueCountFrequency (%)
S 29
19.0%
H 20
13.1%
J 14
 
9.2%
M 10
 
6.5%
U 10
 
6.5%
K 7
 
4.6%
O 7
 
4.6%
E 7
 
4.6%
N 6
 
3.9%
P 6
 
3.9%
Other values (17) 37
24.2%
Common
ValueCountFrequency (%)
718
54.1%
2 202
 
15.2%
1 166
 
12.5%
3 88
 
6.6%
4 36
 
2.7%
5 27
 
2.0%
6 21
 
1.6%
7 17
 
1.3%
( 12
 
0.9%
) 12
 
0.9%
Other values (6) 28
 
2.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 12273
88.9%
ASCII 1479
 
10.7%
None 53
 
0.4%
Arrows 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1355
 
11.0%
1340
 
10.9%
1332
 
10.9%
1243
 
10.1%
1242
 
10.1%
1216
 
9.9%
409
 
3.3%
145
 
1.2%
125
 
1.0%
110
 
0.9%
Other values (353) 3756
30.6%
ASCII
ValueCountFrequency (%)
718
48.5%
2 202
 
13.7%
1 166
 
11.2%
3 88
 
5.9%
4 36
 
2.4%
S 29
 
2.0%
5 27
 
1.8%
6 21
 
1.4%
H 20
 
1.4%
7 17
 
1.1%
Other values (32) 155
 
10.5%
None
ValueCountFrequency (%)
53
100.0%
Arrows
ValueCountFrequency (%)
1
100.0%
Distinct1029
Distinct (%)66.2%
Missing0
Missing (%)0.0%
Memory size12.3 KiB
2023-12-13T01:44:58.043694image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length73
Median length62
Mean length19.944015
Min length11

Characters and Unicode

Total characters30993
Distinct characters162
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique770 ?
Unique (%)49.5%

Sample

1st row풍양면 흔효리 168-1
2nd row지보면 어신리 산46-7외8필지
3rd row지보면 소화리 산77
4th row지보면 도화리 483-1
5th row감천면 돈산리 산60
ValueCountFrequency (%)
308
 
5.5%
유천면 300
 
5.3%
지보면 284
 
5.0%
감천면 193
 
3.4%
풍양면 179
 
3.2%
예천읍 144
 
2.6%
보문면 115
 
2.0%
호명면 81
 
1.4%
개포면 70
 
1.2%
고림리 68
 
1.2%
Other values (1469) 3902
69.1%
2023-12-13T01:44:58.483906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4095
 
13.2%
1684
 
5.4%
1553
 
5.0%
( 1528
 
4.9%
) 1527
 
4.9%
1 1449
 
4.7%
1418
 
4.6%
1409
 
4.5%
3 1004
 
3.2%
2 966
 
3.1%
Other values (152) 14360
46.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 14778
47.7%
Decimal Number 7363
23.8%
Space Separator 4095
 
13.2%
Open Punctuation 1528
 
4.9%
Close Punctuation 1527
 
4.9%
Dash Punctuation 929
 
3.0%
Other Punctuation 773
 
2.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1684
 
11.4%
1553
 
10.5%
1418
 
9.6%
1409
 
9.5%
920
 
6.2%
728
 
4.9%
560
 
3.8%
560
 
3.8%
420
 
2.8%
344
 
2.3%
Other values (137) 5182
35.1%
Decimal Number
ValueCountFrequency (%)
1 1449
19.7%
3 1004
13.6%
2 966
13.1%
5 784
10.6%
4 669
9.1%
7 648
8.8%
6 567
 
7.7%
9 458
 
6.2%
8 431
 
5.9%
0 387
 
5.3%
Space Separator
ValueCountFrequency (%)
4095
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1528
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1527
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 929
100.0%
Other Punctuation
ValueCountFrequency (%)
, 773
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 16215
52.3%
Hangul 14778
47.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1684
 
11.4%
1553
 
10.5%
1418
 
9.6%
1409
 
9.5%
920
 
6.2%
728
 
4.9%
560
 
3.8%
560
 
3.8%
420
 
2.8%
344
 
2.3%
Other values (137) 5182
35.1%
Common
ValueCountFrequency (%)
4095
25.3%
( 1528
 
9.4%
) 1527
 
9.4%
1 1449
 
8.9%
3 1004
 
6.2%
2 966
 
6.0%
- 929
 
5.7%
5 784
 
4.8%
, 773
 
4.8%
4 669
 
4.1%
Other values (5) 2491
15.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 16215
52.3%
Hangul 14778
47.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4095
25.3%
( 1528
 
9.4%
) 1527
 
9.4%
1 1449
 
8.9%
3 1004
 
6.2%
2 966
 
6.0%
- 929
 
5.7%
5 784
 
4.8%
, 773
 
4.8%
4 669
 
4.1%
Other values (5) 2491
15.4%
Hangul
ValueCountFrequency (%)
1684
 
11.4%
1553
 
10.5%
1418
 
9.6%
1409
 
9.5%
920
 
6.2%
728
 
4.9%
560
 
3.8%
560
 
3.8%
420
 
2.8%
344
 
2.3%
Other values (137) 5182
35.1%

발전원
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size12.3 KiB
태양광
1553 
99.2
 
1

Length

Max length4
Median length3
Mean length3.0006435
Min length3

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row태양광
2nd row태양광
3rd row태양광
4th row태양광
5th row태양광

Common Values

ValueCountFrequency (%)
태양광 1553
99.9%
99.2 1
 
0.1%

Length

2023-12-13T01:44:58.610261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:44:58.695957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
태양광 1553
99.9%
99.2 1
 
0.1%
Distinct410
Distinct (%)26.4%
Missing0
Missing (%)0.0%
Memory size12.3 KiB
2023-12-13T01:44:58.895746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length5
Mean length4.6235521
Min length2

Characters and Unicode

Total characters7185
Distinct characters12
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique258 ?
Unique (%)16.6%

Sample

1st row348.3
2nd row996
3rd row378
4th row838.2
5th row998.8
ValueCountFrequency (%)
99.6 115
 
7.4%
99.9 88
 
5.7%
99.36 74
 
4.8%
99 65
 
4.2%
99.45 64
 
4.1%
99.68 55
 
3.5%
97.2 41
 
2.6%
99.19 40
 
2.6%
99.84 39
 
2.5%
99.75 30
 
1.9%
Other values (400) 943
60.7%
2023-12-13T01:44:59.223168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9 2561
35.6%
. 1421
19.8%
6 581
 
8.1%
8 527
 
7.3%
4 459
 
6.4%
2 390
 
5.4%
5 344
 
4.8%
1 302
 
4.2%
7 287
 
4.0%
3 216
 
3.0%
Other values (2) 97
 
1.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 5763
80.2%
Other Punctuation 1422
 
19.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
9 2561
44.4%
6 581
 
10.1%
8 527
 
9.1%
4 459
 
8.0%
2 390
 
6.8%
5 344
 
6.0%
1 302
 
5.2%
7 287
 
5.0%
3 216
 
3.7%
0 96
 
1.7%
Other Punctuation
ValueCountFrequency (%)
. 1421
99.9%
, 1
 
0.1%

Most occurring scripts

ValueCountFrequency (%)
Common 7185
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
9 2561
35.6%
. 1421
19.8%
6 581
 
8.1%
8 527
 
7.3%
4 459
 
6.4%
2 390
 
5.4%
5 344
 
4.8%
1 302
 
4.2%
7 287
 
4.0%
3 216
 
3.0%
Other values (2) 97
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7185
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9 2561
35.6%
. 1421
19.8%
6 581
 
8.1%
8 527
 
7.3%
4 459
 
6.4%
2 390
 
5.4%
5 344
 
4.8%
1 302
 
4.2%
7 287
 
4.0%
3 216
 
3.0%
Other values (2) 97
 
1.4%
Distinct208
Distinct (%)13.4%
Missing0
Missing (%)0.0%
Memory size12.3 KiB
2023-12-13T01:44:59.488651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters15540
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique47 ?
Unique (%)3.0%

Sample

1st row2007-11-05
2nd row2007-12-26
3rd row2008-01-15
4th row2008-02-20
5th row2008-07-03
ValueCountFrequency (%)
2022-01-13 42
 
2.7%
2022-03-16 39
 
2.5%
2021-03-16 35
 
2.3%
2017-12-26 31
 
2.0%
2021-05-21 30
 
1.9%
2021-12-03 28
 
1.8%
2019-07-22 28
 
1.8%
2021-07-08 28
 
1.8%
2019-08-20 28
 
1.8%
2020-04-24 27
 
1.7%
Other values (198) 1238
79.7%
2023-12-13T01:44:59.863246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 3658
23.5%
2 3223
20.7%
- 3108
20.0%
1 2523
16.2%
9 621
 
4.0%
3 506
 
3.3%
8 458
 
2.9%
7 429
 
2.8%
6 383
 
2.5%
4 368
 
2.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 12432
80.0%
Dash Punctuation 3108
 
20.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 3658
29.4%
2 3223
25.9%
1 2523
20.3%
9 621
 
5.0%
3 506
 
4.1%
8 458
 
3.7%
7 429
 
3.5%
6 383
 
3.1%
4 368
 
3.0%
5 263
 
2.1%
Dash Punctuation
ValueCountFrequency (%)
- 3108
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 15540
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 3658
23.5%
2 3223
20.7%
- 3108
20.0%
1 2523
16.2%
9 621
 
4.0%
3 506
 
3.3%
8 458
 
2.9%
7 429
 
2.8%
6 383
 
2.5%
4 368
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 15540
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 3658
23.5%
2 3223
20.7%
- 3108
20.0%
1 2523
16.2%
9 621
 
4.0%
3 506
 
3.3%
8 458
 
2.9%
7 429
 
2.8%
6 383
 
2.5%
4 368
 
2.4%

사업개시
Date

MISSING 

Distinct382
Distinct (%)41.0%
Missing622
Missing (%)40.0%
Memory size12.3 KiB
Minimum2008-05-09 00:00:00
Maximum2022-02-11 00:00:00
2023-12-13T01:45:00.018499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:45:00.213150image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

비고
Categorical

Distinct3
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size12.3 KiB
사업개시
932 
사업준비중
621 
허가완료
 
1

Length

Max length5
Median length4
Mean length4.3996139
Min length4

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row사업개시
2nd row사업개시
3rd row사업개시
4th row사업개시
5th row사업개시

Common Values

ValueCountFrequency (%)
사업개시 932
60.0%
사업준비중 621
40.0%
허가완료 1
 
0.1%

Length

2023-12-13T01:45:00.336869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:45:00.440016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사업개시 932
60.0%
사업준비중 621
40.0%
허가완료 1
 
0.1%

Correlations

2023-12-13T01:45:00.518951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
발전원비고
발전원1.0000.000
비고0.0001.000
2023-12-13T01:45:00.626571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
발전원비고
발전원1.0000.000
비고0.0001.000
2023-12-13T01:45:00.712094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
발전원비고
발전원1.0000.000
비고0.0001.000

Missing values

2023-12-13T01:44:57.013236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T01:44:57.126578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

상호소재지발전원발전용량허가일사업개시비고
0예천SP태양광제2발전소풍양면 흔효리 168-1태양광348.32007-11-052009-09-08사업개시
1예천솔라팜지보면 어신리 산46-7외8필지태양광9962007-12-262008-05-09사업개시
2㈜솔라테크에너지소화리태양광발전소지보면 소화리 산77태양광3782008-01-152009-11-11사업개시
3영남햇빛 발전소지보면 도화리 483-1태양광838.22008-02-202010-05-26사업개시
4솔라태양광발전소감천면 돈산리 산60태양광998.82008-07-032009-04-01사업개시
5㈜청운전력6호태양광발전소개포면 황산리 351태양광28.82009-02-162009-08-31사업개시
6수진솔라개포면 황산리 352-2, 356-1, 356-2태양광28.82009-02-162009-08-31사업개시
7㈜에스아이솔라개포면 황산리 351태양광28.82009-02-162009-08-27사업개시
8㈜해든나라태양광발전소개포면 황산리 352-1, 352-2, 355태양광86.42009-02-162009-08-31사업개시
9대일솔라개포면 황산리 352-2,355태양광86.42009-02-162009-08-31사업개시
상호소재지발전원발전용량허가일사업개시비고
1544에이치엘예천태양광발전소지보면 마산리 22(토지 위)태양광99.332022-03-16<NA>사업준비중
1545케이에이치태양광발전소지보면 마산리 22(토지 위)태양광99.332022-03-16<NA>사업준비중
1546엔닉스예천태양광발전소지보면 마산리 22(토지 위)태양광99.332022-03-16<NA>사업준비중
1547대경5호태양광발전소개포면 가곡리 818(토지 위)태양광94.162022-03-16<NA>사업준비중
1548대경6호태양광발전소개포면 가곡리 818(토지 위)태양광94.162022-03-16<NA>사업준비중
1549대경7호태양광발전소개포면 가곡리 818(토지 위)태양광94.162022-03-16<NA>사업준비중
1550어신 태양광발전소지보면 어신리 223, 225, 225-1(건물 위)태양광60.482022-03-16<NA>사업준비중
1551화에너지예천읍 갈구리 179(건물 위)태양광99.92022-03-16<NA>사업준비중
1552월에너지예천읍 갈구리 179(건물 위)태양광99.92022-03-16<NA>사업준비중
1553무오에너지예천읍 갈구리 179(건물 위)태양광49.52022-03-16<NA>사업준비중