Overview

Dataset statistics

Number of variables: 6
Number of observations: 1960
Missing cells: 560
Missing cells (%): 4.8%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 93.9 KiB
Average record size in memory: 49.1 B
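
The missing-value percentages reported here follow directly from the counts. The sketch below reproduces them in Python, assuming (as the Alerts section indicates) that all 560 missing cells belong to the single column 사업개시일자; the variable names are illustrative only.

```python
# Counts copied from the dataset statistics in this report.
n_variables = 6
n_observations = 1960
missing_cells = 560

# Share of all cells in the table that are missing.
total_cells = n_variables * n_observations
missing_cells_pct = 100 * missing_cells / total_cells

# Share of rows missing in the one affected column (assumption: all
# missing cells sit in a single column, as the alert suggests).
missing_column_pct = 100 * missing_cells / n_observations

print(f"{missing_cells_pct:.1f}%")   # 4.8%
print(f"{missing_column_pct:.1f}%")  # 28.6%
```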

Variable types

Text: 2
Numeric: 1
Categorical: 1
DateTime: 2

Dataset

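Description: Status of solar power plants installed in Gumi City, Gyeongsangbuk-do, providing each plant's name, installation site, capacity, permitting authority, permit date, and business start date.
Author: Gumi City, Gyeongsangbuk-do (경상북도 구미시)
URL: https://www.data.go.kr/data/3071362/fileData.do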

Alerts

허가기관 has constant value "경상북도 구미시" (Constant)
사업개시일자 has 560 (28.6%) missing values (Missing)
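
The two alert types above can be illustrated with a small check. This is only a sketch under stated assumptions, not ydata-profiling's actual implementation: `column_alerts` is a hypothetical helper, and `None` is assumed to mark a missing cell.

```python
def column_alerts(values):
    """Return alert labels for one column; None marks a missing cell."""
    present = [v for v in values if v is not None]
    alerts = []
    # "Missing": at least one cell in the column has no value.
    if len(present) < len(values):
        alerts.append("Missing")
    # "Constant": every present cell holds the same value.
    if len(set(present)) == 1:
        alerts.append("Constant")
    return alerts

print(column_alerts(["경상북도 구미시"] * 3))              # ['Constant']
print(column_alerts(["2008-05-15", None, "2008-08-21"]))  # ['Missing']
```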

Reproduction

Analysis started: 2023-12-12 11:51:48.240153
Analysis finished: 2023-12-12 11:51:49.419773
Duration: 1.18 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

발전소명
Text

Distinct: 1875
Distinct (%): 95.7%
Missing: 0
Missing (%): 0.0%
Memory size: 15.4 KiB

Length

Max length: 30
Median length: 22
Mean length: 9.2469388
Min length: 2

Characters and Unicode

Total characters: 18124
Distinct characters: 425
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
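
As an illustration of these character properties, the sketch below uses Python's standard `unicodedata` module to count general categories in one of the sample values. `category_counts` is a hypothetical helper written for this example, not part of the profiling tool.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count Unicode general categories (e.g. 'Lo', 'Nd') in a string."""
    return Counter(unicodedata.category(ch) for ch in text)

# A sample value from this variable: Hangul syllables plus one digit.
print(category_counts("회명워터젠태양광발전소1"))

# The enclosed symbol used in company names is an Other Symbol.
print(unicodedata.category("㈜"))  # So
```

Here `Lo` corresponds to "Other Letter", `Nd` to "Decimal Number", and `So` to "Other Symbol" in the tables of this report.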

Unique

Unique: 1799
Unique (%): 91.8%

Sample

1st row: 영남에너지서비스태양광발전소
2nd row: 금오솔라팜
3rd row: 도레이새한구미1공장태양광발전소
4th row: ㈜지에스이앤알
5th row: 회명워터젠태양광발전소1
Value | Count | Frequency (%)
동산태양광발전소 | 4 | 0.2%
태양광발전소 | 4 | 0.2%
대경2호태양광발전소 | 3 | 0.2%
대경1호태양광발전소 | 3 | 0.2%
미소태양광발전소 | 3 | 0.2%
미숙태양광발전소 | 3 | 0.2%
현대태양광발전소 | 3 | 0.2%
코오롱글로텍㈜태양광발전소 | 3 | 0.2%
성수2태양광발전소 | 3 | 0.2%
성수1태양광발전소 | 2 | 0.1%
Other values (1875) | 1943 | 98.4%

Most occurring characters

ValueCountFrequency (%)
1762
 
9.7%
1750
 
9.7%
1741
 
9.6%
1731
 
9.6%
1725
 
9.5%
1700
 
9.4%
662
 
3.7%
2 386
 
2.1%
1 383
 
2.1%
3 194
 
1.1%
Other values (415) 6090
33.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 16596 | 91.6%
Decimal Number | 1255 | 6.9%
Uppercase Letter | 174 | 1.0%
Other Symbol | 28 | 0.2%
Lowercase Letter | 28 | 0.2%
Space Separator | 15 | 0.1%
Close Punctuation | 9 | < 0.1%
Open Punctuation | 9 | < 0.1%
Dash Punctuation | 5 | < 0.1%
Other Punctuation | 5 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1762
 
10.6%
1750
 
10.5%
1741
 
10.5%
1731
 
10.4%
1725
 
10.4%
1700
 
10.2%
662
 
4.0%
160
 
1.0%
160
 
1.0%
153
 
0.9%
Other values (361) 5052
30.4%
Uppercase Letter
ValueCountFrequency (%)
M 22
12.6%
S 19
10.9%
J 18
10.3%
K 14
 
8.0%
Y 13
 
7.5%
H 11
 
6.3%
G 11
 
6.3%
P 9
 
5.2%
T 8
 
4.6%
O 7
 
4.0%
Other values (12) 42
24.1%
Lowercase Letter
ValueCountFrequency (%)
g 4
14.3%
k 3
10.7%
s 3
10.7%
e 3
10.7%
a 3
10.7%
o 2
 
7.1%
r 2
 
7.1%
c 1
 
3.6%
p 1
 
3.6%
n 1
 
3.6%
Other values (5) 5
17.9%
Decimal Number
ValueCountFrequency (%)
2 386
30.8%
1 383
30.5%
3 194
15.5%
4 111
 
8.8%
5 62
 
4.9%
6 41
 
3.3%
7 25
 
2.0%
8 19
 
1.5%
0 18
 
1.4%
9 16
 
1.3%
Other Punctuation
ValueCountFrequency (%)
. 4
80.0%
, 1
 
20.0%
Other Symbol
ValueCountFrequency (%)
28
100.0%
Space Separator
ValueCountFrequency (%)
15
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 16624 | 91.7%
Common | 1298 | 7.2%
Latin | 202 | 1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1762
 
10.6%
1750
 
10.5%
1741
 
10.5%
1731
 
10.4%
1725
 
10.4%
1700
 
10.2%
662
 
4.0%
160
 
1.0%
160
 
1.0%
153
 
0.9%
Other values (362) 5080
30.6%
Latin
ValueCountFrequency (%)
M 22
 
10.9%
S 19
 
9.4%
J 18
 
8.9%
K 14
 
6.9%
Y 13
 
6.4%
H 11
 
5.4%
G 11
 
5.4%
P 9
 
4.5%
T 8
 
4.0%
O 7
 
3.5%
Other values (27) 70
34.7%
Common
ValueCountFrequency (%)
2 386
29.7%
1 383
29.5%
3 194
14.9%
4 111
 
8.6%
5 62
 
4.8%
6 41
 
3.2%
7 25
 
1.9%
8 19
 
1.5%
0 18
 
1.4%
9 16
 
1.2%
Other values (6) 43
 
3.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 16596 | 91.6%
ASCII | 1500 | 8.3%
None | 28 | 0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1762
 
10.6%
1750
 
10.5%
1741
 
10.5%
1731
 
10.4%
1725
 
10.4%
1700
 
10.2%
662
 
4.0%
160
 
1.0%
160
 
1.0%
153
 
0.9%
Other values (361) 5052
30.4%
ASCII
ValueCountFrequency (%)
2 386
25.7%
1 383
25.5%
3 194
12.9%
4 111
 
7.4%
5 62
 
4.1%
6 41
 
2.7%
7 25
 
1.7%
M 22
 
1.5%
S 19
 
1.3%
8 19
 
1.3%
Other values (43) 238
15.9%
None
ValueCountFrequency (%)
28
100.0%

설치장소
Text

Distinct: 1569
Distinct (%): 80.1%
Missing: 0
Missing (%): 0.0%
Memory size: 15.4 KiB

Length

Max length: 101
Median length: 68
Mean length: 39.420408
Min length: 18

Characters and Unicode

Total characters: 77264
Distinct characters: 221
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1342
Unique (%): 68.5%

Sample

1st row: 경상북도 구미시 공단동 303(1공단로6길 94)(토지)
2nd row: 경상북도 구미시 고아읍 파산리 196, 195-1
3rd row: 경상북도 구미시 임수동 93-1 구미1공장
4th row: 경상북도 구미시 공단2동 291(열병합발전소 내 보수동 옥상)
5th row: 경상북도 구미시 구포동 639 A동 공장지붕
Value | Count | Frequency (%)
경상북도 | 1969 | 14.4%
구미시 | 1960 | 14.4%
지붕 | 1343 | 9.9%
선산읍 | 319 | 2.3%
고아읍 | 233 | 1.7%
도개면 | 216 | 1.6%
해평면 | 212 | 1.6%
옥성면 | 204 | 1.5%
주2동 | 194 | 1.4%
공단동 | 169 | 1.2%
Other values (2701) | 6808 | 50.0%

Most occurring characters

ValueCountFrequency (%)
11674
 
15.1%
1 3800
 
4.9%
( 3359
 
4.3%
) 3355
 
4.3%
2840
 
3.7%
2 2555
 
3.3%
2250
 
2.9%
- 2217
 
2.9%
2195
 
2.8%
2052
 
2.7%
Other values (211) 40967
53.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 37797 | 48.9%
Decimal Number | 17459 | 22.6%
Space Separator | 11674 | 15.1%
Open Punctuation | 3377 | 4.4%
Close Punctuation | 3373 | 4.4%
Dash Punctuation | 2217 | 2.9%
Other Punctuation | 1292 | 1.7%
Uppercase Letter | 74 | 0.1%
Math Symbol | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2840
 
7.5%
2250
 
6.0%
2195
 
5.8%
2052
 
5.4%
2027
 
5.4%
1982
 
5.2%
1973
 
5.2%
1969
 
5.2%
1945
 
5.1%
1606
 
4.2%
Other values (184) 16958
44.9%
Decimal Number
ValueCountFrequency (%)
1 3800
21.8%
2 2555
14.6%
3 1797
10.3%
4 1724
9.9%
5 1507
 
8.6%
6 1344
 
7.7%
7 1307
 
7.5%
8 1233
 
7.1%
0 1196
 
6.9%
9 996
 
5.7%
Uppercase Letter
ValueCountFrequency (%)
A 29
39.2%
B 16
21.6%
C 11
 
14.9%
D 8
 
10.8%
E 5
 
6.8%
F 4
 
5.4%
N 1
 
1.4%
Other Punctuation
ValueCountFrequency (%)
, 1290
99.8%
. 1
 
0.1%
/ 1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 3359
99.5%
[ 18
 
0.5%
Close Punctuation
ValueCountFrequency (%)
) 3355
99.5%
] 18
 
0.5%
Space Separator
ValueCountFrequency (%)
11674
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2217
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 39393 | 51.0%
Hangul | 37797 | 48.9%
Latin | 74 | 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2840
 
7.5%
2250
 
6.0%
2195
 
5.8%
2052
 
5.4%
2027
 
5.4%
1982
 
5.2%
1973
 
5.2%
1969
 
5.2%
1945
 
5.1%
1606
 
4.2%
Other values (184) 16958
44.9%
Common
ValueCountFrequency (%)
11674
29.6%
1 3800
 
9.6%
( 3359
 
8.5%
) 3355
 
8.5%
2 2555
 
6.5%
- 2217
 
5.6%
3 1797
 
4.6%
4 1724
 
4.4%
5 1507
 
3.8%
6 1344
 
3.4%
Other values (10) 6061
15.4%
Latin
ValueCountFrequency (%)
A 29
39.2%
B 16
21.6%
C 11
 
14.9%
D 8
 
10.8%
E 5
 
6.8%
F 4
 
5.4%
N 1
 
1.4%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 39467 | 51.1%
Hangul | 37797 | 48.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
11674
29.6%
1 3800
 
9.6%
( 3359
 
8.5%
) 3355
 
8.5%
2 2555
 
6.5%
- 2217
 
5.6%
3 1797
 
4.6%
4 1724
 
4.4%
5 1507
 
3.8%
6 1344
 
3.4%
Other values (17) 6135
15.5%
Hangul
ValueCountFrequency (%)
2840
 
7.5%
2250
 
6.0%
2195
 
5.8%
2052
 
5.4%
2027
 
5.4%
1982
 
5.2%
1973
 
5.2%
1969
 
5.2%
1945
 
5.1%
1606
 
4.2%
Other values (184) 16958
44.9%

용량(kw)
Real number (ℝ)

Distinct: 233
Distinct (%): 11.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 137.30969
Minimum: 9
Maximum: 3000
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 17.4 KiB

Quantile statistics

Minimum: 9
5th percentile: 20
Q1: 90
Median: 99
Q3: 100
95th percentile: 495
Maximum: 3000
Range: 2991
Interquartile range (IQR): 10
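
The range and interquartile range are derived from the other quantiles listed here. A minimal Python sketch of that arithmetic, with the inputs copied from this report and illustrative variable names:

```python
# Quantile values copied from the report for 용량(kw).
minimum, q1, q3, maximum = 9, 90, 100, 3000

# Range is the spread between the extremes.
value_range = maximum - minimum

# IQR is the spread of the middle half of the data.
iqr = q3 - q1

print(value_range, iqr)  # 2991 10
```

The narrow IQR next to a wide range reflects that most plants cluster just under 100 kW while a few reach into the thousands.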

Descriptive statistics

Standard deviation: 202.27302
Coefficient of variation (CV): 1.4731153
Kurtosis: 51.174708
Mean: 137.30969
Median Absolute Deviation (MAD): 1
Skewness: 5.7994306
Sum: 269127
Variance: 40914.373
Monotonicity: Not monotonic
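
Several of these descriptive statistics are linked by simple identities. The sketch below checks them in Python using the figures from this report; it assumes the mean is the sum over the 1960 observations, the CV is the standard deviation divided by the mean, and the variance is the squared standard deviation. Small differences are expected because the reported standard deviation is rounded.

```python
import math

# Figures copied from the report for 용량(kw).
n_observations = 1960
total = 269127
std = 202.27302

mean = total / n_observations  # arithmetic mean
cv = std / mean                # coefficient of variation
variance = std ** 2            # variance from the rounded std

print(round(mean, 5), round(cv, 5), round(variance, 2))
```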
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
100 | 561 | 28.6%
99 | 377 | 19.2%
98 | 96 | 4.9%
97 | 48 | 2.4%
30 | 46 | 2.3%
20 | 41 | 2.1%
96 | 34 | 1.7%
19 | 28 | 1.4%
29 | 27 | 1.4%
92 | 22 | 1.1%
Other values (223) | 680 | 34.7%
Value | Count | Frequency (%)
9 | 1 | 0.1%
10 | 1 | 0.1%
11 | 3 | 0.2%
12 | 1 | 0.1%
13 | 2 | 0.1%
14 | 1 | 0.1%
15 | 9 | 0.5%
16 | 3 | 0.2%
17 | 4 | 0.2%
18 | 9 | 0.5%
Value | Count | Frequency (%)
3000 | 1 | 0.1%
2995 | 1 | 0.1%
1498 | 2 | 0.1%
1393 | 1 | 0.1%
1264 | 1 | 0.1%
1217 | 1 | 0.1%
1100 | 1 | 0.1%
1098 | 1 | 0.1%
1011 | 1 | 0.1%
1000 | 9 | 0.5%

허가기관
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 15.4 KiB
경상북도 구미시: 1960

Length

Max length: 8
Median length: 8
Mean length: 8
Min length: 8

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 경상북도 구미시
2nd row: 경상북도 구미시
3rd row: 경상북도 구미시
4th row: 경상북도 구미시
5th row: 경상북도 구미시

Common Values

Value | Count | Frequency (%)
경상북도 구미시 | 1960 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
경상북도 | 1960 | 50.0%
구미시 | 1960 | 50.0%

허가일자
Date

Distinct: 623
Distinct (%): 31.8%
Missing: 0
Missing (%): 0.0%
Memory size: 15.4 KiB
Minimum: 2007-12-31 00:00:00
Maximum: 2023-09-15 00:00:00
Histogram with fixed size bins (bins=50)

사업개시일자
Date

MISSING 

Distinct: 577
Distinct (%): 41.2%
Missing: 560
Missing (%): 28.6%
Memory size: 15.4 KiB
Minimum: 2008-05-15 00:00:00
Maximum: 2023-08-23 00:00:00
Histogram with fixed size bins (bins=50)

Interactions

[Figure: interactions plot]

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

발전소명설치장소용량(kw)허가기관허가일자사업개시일자
0영남에너지서비스태양광발전소경상북도 구미시 공단동 303(1공단로6길 94)(토지)200경상북도 구미시2007-12-312008-05-15
1금오솔라팜경상북도 구미시 고아읍 파산리 196, 195-117경상북도 구미시2008-03-282008-08-21
2도레이새한구미1공장태양광발전소경상북도 구미시 임수동 93-1 구미1공장46경상북도 구미시2008-04-172008-06-05
3㈜지에스이앤알경상북도 구미시 공단2동 291(열병합발전소 내 보수동 옥상)49경상북도 구미시2008-04-172008-06-13
4회명워터젠태양광발전소1경상북도 구미시 구포동 639 A동 공장지붕45경상북도 구미시2008-11-072008-12-23
5초곡 태양광발전소경상북도 구미시 옥성면 초곡리 38429경상북도 구미시2009-01-232009-09-08
6상지태양광발전소경상북도 구미시 선산읍 포상리 762, 76329경상북도 구미시2009-03-022011-07-25
7롯데마트구미태양광발전소경상북도 구미시 신평동 465(옥상)100경상북도 구미시2009-08-102009-12-01
8웅진에너지(주)구미지점경상북도 구미시 산동면 신당리 국가4단지6블럭 2-1로트(공장지붕)166경상북도 구미시2009-08-172009-11-18
9구포매립장매립가스발전소경상북도 구미시 구포동 497-1(매립지내)450경상북도 구미시2009-09-032010-05-01
Index | 발전소명 | 설치장소 | 용량(kw) | 허가기관 | 허가일자 | 사업개시일자
1950 | 주아령제2호태양광발전소 | 경상북도 구미시 옥성면 덕촌리 38-5(주아령로 125)(주2동, 주5동, 주6동, 주7동 지붕) | 100 | 경상북도 구미시 | 2023-09-07 | <NA>
1951 | 주아령제3호태양광발전소 | 경상북도 구미시 옥성면 덕촌리 38-5(주아령로 125)(주3동, 주4동, 주6동 지붕) | 36 | 경상북도 구미시 | 2023-09-07 | <NA>
1952 | 오상5호태양광발전소 | 경상북도 구미시 해평면 문량리 704-10, 704-50(성수문량길 408)(주2동 지붕) | 100 | 경상북도 구미시 | 2023-09-15 | <NA>
1953 | 오상6호태양광발전소 | 경상북도 구미시 해평면 문량리 704-10, 704-50(성수문량길 408)(주2동 지붕) | 100 | 경상북도 구미시 | 2023-09-15 | <NA>
1954 | 오상7호태양광발전소 | 경상북도 구미시 해평면 문량리 704-10, 704-50(성수문량길 408)(주2동 지붕) | 100 | 경상북도 구미시 | 2023-09-15 | <NA>
1955 | 오상8호태양광발전소 | 경상북도 구미시 해평면 문량리 704-10, 704-50(성수문량길 408)(주1동 지붕) | 100 | 경상북도 구미시 | 2023-09-15 | <NA>
1956 | 오상9호태양광발전소 | 경상북도 구미시 해평면 문량리 704-10, 704-50(성수문량길 408)(주1동, 주2동 지붕) | 100 | 경상북도 구미시 | 2023-09-15 | <NA>
1957 | 오상10호태양광발전소 | 경상북도 구미시 해평면 문량리 704-10, 704-50(성수문량길 408)(주1동 지붕) | 100 | 경상북도 구미시 | 2023-09-15 | <NA>
1958 | 오상11호태양광발전소 | 경상북도 구미시 해평면 문량리 704-10, 704-50(성수문량길 408)(주1동 지붕) | 100 | 경상북도 구미시 | 2023-09-15 | <NA>
1959 | 오상12호태양광발전소 | 경상북도 구미시 해평면 문량리 704-10, 704-50(성수문량길 408)(주2동 지붕) | 52 | 경상북도 구미시 | 2023-09-15 | <NA>
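
The sample rows pair each permit date (허가일자) with a business start date (사업개시일자), and `<NA>` marks plants with no start date recorded. As a hedged illustration of working with these two columns, the sketch below computes the lag in days between them using only the standard library. `lag_days` is a hypothetical helper, `None` stands in for `<NA>`, and the dates are taken from rows 0 and 1950 of the sample.

```python
from datetime import date

def lag_days(permit, start):
    """Days from permit to business start; None when the start date is missing."""
    if start is None:
        return None
    return (date.fromisoformat(start) - date.fromisoformat(permit)).days

print(lag_days("2007-12-31", "2008-05-15"))  # 136
print(lag_days("2023-09-07", None))          # None
```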