Overview

Dataset statistics

Number of variables: 5
Number of observations: 1363
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 1
Duplicate rows (%): 0.1%
Total size in memory: 54.7 KiB
Average record size in memory: 41.1 B
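
The two derived figures in this block can be rechecked from the raw counts. The following is a minimal sketch, not the profiler's own code; the variable names are hypothetical, and it assumes the report uses 1 KiB = 1024 bytes.

```python
# Figures copied from the dataset statistics in this report.
n_rows = 1363
duplicate_rows = 1
total_kib = 54.7

# Share of duplicate rows as a percentage of all observations.
duplicate_pct = 100 * duplicate_rows / n_rows

# Average record size: total bytes (assuming 1 KiB = 1024 B) per row.
avg_record_bytes = total_kib * 1024 / n_rows

print(f"{duplicate_pct:.1f}%")       # 0.1%
print(f"{avg_record_bytes:.1f} B")   # 41.1 B
```

Both values round to the figures shown in the report, which suggests the percentage and the per-record size are simple ratios over the 1363 observations.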

Variable types

Text: 2
Numeric: 1
DateTime: 1
Categorical: 1

Dataset

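Description: Data on the status of solar power installations in Hongseong-gun, Chungcheongnam-do. It provides the power plant name (발전소명), installed capacity (설비용량), plant address (발전소주소), first permit date (최초허가일), and data reference date (데이터기준일자).
Author: Chungcheongnam-do (충청남도)
URL: https://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=431&beforeMenuCd=DOM_000000201001001000&publicdatapk=15034093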

Alerts

데이터기준일자 has constant value "2023-04-17" (Constant)
Dataset has 1 (0.1%) duplicate rows (Duplicates)

Reproduction

Analysis started: 2024-01-09 21:49:08.108147
Analysis finished: 2024-01-09 21:49:08.587528
Duration: 0.48 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
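
The reported duration follows from the two timestamps. As a small sketch (the format string and variable names are assumptions made for this example), the difference can be computed with the standard library:

```python
from datetime import datetime

# Timestamp layout assumed from the values shown in the report.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2024-01-09 21:49:08.108147", FMT)
finished = datetime.strptime("2024-01-09 21:49:08.587528", FMT)

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")  # 0.48 seconds
```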

Variables

발전소명
Text

Distinct: 1300
Distinct (%): 95.4%
Missing: 0
Missing (%): 0.0%
Memory size: 10.8 KiB

Length

Max length: 29
Median length: 23
Mean length: 9.9376376
Min length: 2

Characters and Unicode

Total characters: 13545
Distinct characters: 352
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
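
As a rough illustration of reading such properties per code point, the sketch below uses Python's standard `unicodedata` module. This is a stand-in chosen for the example and not necessarily what the profiling tool uses internally; the helper name `category_counts` is hypothetical.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count the characters of `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)


# One of the sample values of the 발전소명 column.
name = "연아8호 태양광발전소"
counts = category_counts(name)

# Lo = Other Letter, Nd = Decimal Number, Zs = Space Separator
print(dict(counts))
```

Summing such counters over every value in a column would give a per-category breakdown of the kind shown in the category tables of this report.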

Unique

Unique: 1250
Unique (%): 91.7%

Sample

1st row: 사천마을 태양광발전소
2nd row: 연아8호 태양광발전소
3rd row: 연아7호 태양광발전소
4th row: 연아6호 태양광발전소
5th row: 연아5호 태양광발전소

Value | Count | Frequency (%)
태양광발전소 | 508 | 25.5%
발전소 | 18 | 0.9%
태양광 | 12 | 0.6%
수상태양광발전소 | 11 | 0.6%
홍성2차 | 11 | 0.6%
홍성 | 8 | 0.4%
무량 | 8 | 0.4%
2호 | 7 | 0.4%
1호 | 6 | 0.3%
4호 | 5 | 0.3%
Other values (1295) | 1400 | 70.2%
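
The word table appears to count whitespace-separated tokens across all values of the column. A minimal sketch of that kind of count follows, run only on the five sample rows listed in this section, so its totals will not match the full table. Plain splitting on whitespace is an assumption here.

```python
from collections import Counter

# The five sample values of the 발전소명 column shown in this report.
names = [
    "사천마을 태양광발전소",
    "연아8호 태양광발전소",
    "연아7호 태양광발전소",
    "연아6호 태양광발전소",
    "연아5호 태양광발전소",
]

# Split each value on whitespace and tally the resulting words.
words = Counter(word for name in names for word in name.split())

print(words.most_common(1))  # [('태양광발전소', 5)]
```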

Most occurring characters

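Value | Count | Frequency (%)
(not recovered) | 1299 | 9.6%
(not recovered) | 1277 | 9.4%
(not recovered) | 1268 | 9.4%
(not recovered) | 1263 | 9.3%
(not recovered) | 1255 | 9.3%
(not recovered) | 1246 | 9.2%
(space) | 633 | 4.7%
(not recovered) | 579 | 4.3%
1 | 216 | 1.6%
2 | 212 | 1.6%
Other values (342) | 4297 | 31.7%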

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 11947 | 88.2%
Decimal Number | 747 | 5.5%
Space Separator | 633 | 4.7%
Uppercase Letter | 97 | 0.7%
Close Punctuation | 48 | 0.4%
Open Punctuation | 48 | 0.4%
Other Symbol | 18 | 0.1%
Dash Punctuation | 4 | < 0.1%
Other Punctuation | 2 | < 0.1%
Letter Number | 1 | < 0.1%

Most frequent character per category

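Other Letter
Value | Count | Frequency (%)
(not recovered) | 1299 | 10.9%
(not recovered) | 1277 | 10.7%
(not recovered) | 1268 | 10.6%
(not recovered) | 1263 | 10.6%
(not recovered) | 1255 | 10.5%
(not recovered) | 1246 | 10.4%
(not recovered) | 579 | 4.8%
(not recovered) | 163 | 1.4%
(not recovered) | 110 | 0.9%
(not recovered) | 98 | 0.8%
Other values (307) | 3389 | 28.4%

Uppercase Letter
Value | Count | Frequency (%)
S | 24 | 24.7%
M | 12 | 12.4%
Y | 10 | 10.3%
H | 8 | 8.2%
A | 7 | 7.2%
B | 7 | 7.2%
D | 6 | 6.2%
E | 5 | 5.2%
N | 4 | 4.1%
G | 4 | 4.1%
Other values (7) | 10 | 10.3%

Decimal Number
Value | Count | Frequency (%)
1 | 216 | 28.9%
2 | 212 | 28.4%
3 | 104 | 13.9%
4 | 58 | 7.8%
5 | 44 | 5.9%
6 | 28 | 3.7%
7 | 27 | 3.6%
0 | 23 | 3.1%
8 | 20 | 2.7%
9 | 15 | 2.0%

Other Punctuation
Value | Count | Frequency (%)
. | 1 | 50.0%
(not recovered) | 1 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 633 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 48 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 48 | 100.0%

Other Symbol
Value | Count | Frequency (%)
(not recovered) | 18 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 4 | 100.0%

Letter Number
Value | Count | Frequency (%)
(not recovered) | 1 | 100.0%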

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 11965 | 88.3%
Common | 1482 | 10.9%
Latin | 98 | 0.7%

Most frequent character per script

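Hangul
Value | Count | Frequency (%)
(not recovered) | 1299 | 10.9%
(not recovered) | 1277 | 10.7%
(not recovered) | 1268 | 10.6%
(not recovered) | 1263 | 10.6%
(not recovered) | 1255 | 10.5%
(not recovered) | 1246 | 10.4%
(not recovered) | 579 | 4.8%
(not recovered) | 163 | 1.4%
(not recovered) | 110 | 0.9%
(not recovered) | 98 | 0.8%
Other values (308) | 3407 | 28.5%

Latin
Value | Count | Frequency (%)
S | 24 | 24.5%
M | 12 | 12.2%
Y | 10 | 10.2%
H | 8 | 8.2%
A | 7 | 7.1%
B | 7 | 7.1%
D | 6 | 6.1%
E | 5 | 5.1%
N | 4 | 4.1%
G | 4 | 4.1%
Other values (8) | 11 | 11.2%

Common
Value | Count | Frequency (%)
(space) | 633 | 42.7%
1 | 216 | 14.6%
2 | 212 | 14.3%
3 | 104 | 7.0%
4 | 58 | 3.9%
) | 48 | 3.2%
( | 48 | 3.2%
5 | 44 | 3.0%
6 | 28 | 1.9%
7 | 27 | 1.8%
Other values (6) | 64 | 4.3%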

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 11947 | 88.2%
ASCII | 1578 | 11.7%
None | 19 | 0.1%
Number Forms | 1 | < 0.1%

Most frequent character per block

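Hangul
Value | Count | Frequency (%)
(not recovered) | 1299 | 10.9%
(not recovered) | 1277 | 10.7%
(not recovered) | 1268 | 10.6%
(not recovered) | 1263 | 10.6%
(not recovered) | 1255 | 10.5%
(not recovered) | 1246 | 10.4%
(not recovered) | 579 | 4.8%
(not recovered) | 163 | 1.4%
(not recovered) | 110 | 0.9%
(not recovered) | 98 | 0.8%
Other values (307) | 3389 | 28.4%

ASCII
Value | Count | Frequency (%)
(space) | 633 | 40.1%
1 | 216 | 13.7%
2 | 212 | 13.4%
3 | 104 | 6.6%
4 | 58 | 3.7%
) | 48 | 3.0%
( | 48 | 3.0%
5 | 44 | 2.8%
6 | 28 | 1.8%
7 | 27 | 1.7%
Other values (22) | 160 | 10.1%

None
Value | Count | Frequency (%)
(not recovered) | 18 | 94.7%
(not recovered) | 1 | 5.3%

Number Forms
Value | Count | Frequency (%)
(not recovered) | 1 | 100.0%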

설비용량
Real number (ℝ)

Distinct: 437
Distinct (%): 32.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 188.04539
Minimum: 5.04
Maximum: 1495.44
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 12.1 KiB

Quantile statistics

Minimum: 5.04
5-th percentile: 29.403
Q1: 97.2
Median: 99.23
Q3: 99.9
95-th percentile: 993.28
Maximum: 1495.44
Range: 1490.4
Interquartile range (IQR): 2.7
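
The range and IQR are differences of the quantiles listed in this block. A brief sketch of that arithmetic, with hypothetical variable names and the values copied from the report:

```python
# Quantiles of 설비용량 as listed in the report.
minimum, q1, q3, maximum = 5.04, 97.2, 99.9, 1495.44

# Range spans the full data; the IQR spans only the middle 50%.
value_range = maximum - minimum
iqr = q3 - q1

print(round(value_range, 2), round(iqr, 2))  # 1490.4 2.7
```

The IQR is tiny relative to the range: half of the installations have a capacity between 97.2 and 99.9, while the largest values reach roughly 1000 and above.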

Descriptive statistics

Standard deviation: 240.25413
Coefficient of variation (CV): 1.2776391
Kurtosis: 5.8280303
Mean: 188.04539
Median Absolute Deviation (MAD): 1.95
Skewness: 2.5794519
Sum: 256305.86
Variance: 57722.046
Monotonicity: Not monotonic
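
Several of these statistics are tied together by simple identities, which makes for a quick consistency check. The sketch below is illustrative only (variable names are hypothetical) and works from the rounded figures in the report, so it compares with a small tolerance.

```python
import math

# Rounded figures copied from the report.
mean, std, n = 188.04539, 240.25413, 1363

cv = std / mean        # coefficient of variation
variance = std ** 2    # variance is the squared standard deviation
total = mean * n       # sum equals mean times number of observations

# Compare against the reported values, allowing for rounding.
print(math.isclose(cv, 1.2776391, rel_tol=1e-5))
print(math.isclose(variance, 57722.046, rel_tol=1e-5))
print(math.isclose(total, 256305.86, rel_tol=1e-5))
```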
Histogram with fixed size bins (bins=50)

Common values
Value | Count | Frequency (%)
99.0 | 86 | 6.3%
99.9 | 85 | 6.2%
97.92 | 61 | 4.5%
99.28 | 54 | 4.0%
99.36 | 40 | 2.9%
99.2 | 35 | 2.6%
99.71 | 34 | 2.5%
997.92 | 30 | 2.2%
99.84 | 28 | 2.1%
97.2 | 27 | 2.0%
Other values (427) | 883 | 64.8%

Minimum 10 values
Value | Count | Frequency (%)
5.04 | 1 | 0.1%
9.6 | 1 | 0.1%
10.0 | 2 | 0.1%
14.4 | 1 | 0.1%
14.53 | 1 | 0.1%
15.0 | 3 | 0.2%
15.12 | 1 | 0.1%
15.3 | 1 | 0.1%
16.06 | 1 | 0.1%
16.2 | 1 | 0.1%

Maximum 10 values
Value | Count | Frequency (%)
1495.44 | 1 | 0.1%
1000.0 | 1 | 0.1%
999.99 | 1 | 0.1%
999.6 | 3 | 0.2%
999.0 | 7 | 0.5%
998.4 | 2 | 0.1%
997.92 | 30 | 2.2%
997.56 | 6 | 0.4%
996.96 | 1 | 0.1%
996.84 | 1 | 0.1%

발전소주소
Text

Distinct: 1103
Distinct (%): 80.9%
Missing: 0
Missing (%): 0.0%
Memory size: 10.8 KiB

Length

Max length: 91
Median length: 64
Mean length: 29.322817
Min length: 18

Characters and Unicode

Total characters: 39967
Distinct characters: 170
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 971
Unique (%): 71.2%

Sample

1st row: 충청남도 홍성군 갈산면 기산리 197-1, 196-5, 196-6 주1동(건물위)
2nd row: 충청남도 홍성군 장곡면 가송리 451 외19필지
3rd row: 충청남도 홍성군 장곡면 가송리 451 외 19필지
4th row: 충청남도 홍성군 장곡면 가송리 451 외 19필지
5th row: 충청남도 홍성군 장곡면 가송리 451 외 19필지

Value | Count | Frequency (%)
충청남도 | 1360 | 15.8%
홍성군 | 1360 | 15.8%
결성면 | 157 | 1.8%
구항면 | 157 | 1.8%
갈산면 | 151 | 1.8%
홍동면 | 150 | 1.7%
은하면 | 150 | 1.7%
(not recovered) | 148 | 1.7%
광천읍 | 130 | 1.5%
장곡면 | 125 | 1.5%
Other values (1616) | 4707 | 54.8%

Most occurring characters

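Value | Count | Frequency (%)
(space) | 7233 | 18.1%
1 | 1726 | 4.3%
(not recovered) | 1693 | 4.2%
(not recovered) | 1647 | 4.1%
(not recovered) | 1437 | 3.6%
(not recovered) | 1400 | 3.5%
(not recovered) | 1374 | 3.4%
(not recovered) | 1366 | 3.4%
(not recovered) | 1360 | 3.4%
(not recovered) | 1193 | 3.0%
Other values (160) | 19538 | 48.9%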

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 21463 | 53.7%
Decimal Number | 8433 | 21.1%
Space Separator | 7233 | 18.1%
Dash Punctuation | 1105 | 2.8%
Other Punctuation | 905 | 2.3%
Open Punctuation | 409 | 1.0%
Close Punctuation | 407 | 1.0%
Math Symbol | 11 | < 0.1%
Uppercase Letter | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not recovered) | 1693 | 7.9%
(not recovered) | 1647 | 7.7%
(not recovered) | 1437 | 6.7%
(not recovered) | 1400 | 6.5%
(not recovered) | 1374 | 6.4%
(not recovered) | 1366 | 6.4%
(not recovered) | 1360 | 6.3%
(not recovered) | 1193 | 5.6%
(not recovered) | 1105 | 5.1%
(not recovered) | 632 | 2.9%
Other values (141) | 8256 | 38.5%

Decimal Number
Value | Count | Frequency (%)
1 | 1726 | 20.5%
2 | 1154 | 13.7%
3 | 1030 | 12.2%
4 | 897 | 10.6%
5 | 749 | 8.9%
6 | 683 | 8.1%
8 | 642 | 7.6%
7 | 636 | 7.5%
9 | 474 | 5.6%
0 | 442 | 5.2%

Other Punctuation
Value | Count | Frequency (%)
, | 901 | 99.6%
/ | 3 | 0.3%
. | 1 | 0.1%

Space Separator
Value | Count | Frequency (%)
(space) | 7233 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 1105 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 409 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 407 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 11 | 100.0%

Uppercase Letter
Value | Count | Frequency (%)
A | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 21463 | 53.7%
Common | 18503 | 46.3%
Latin | 1 | < 0.1%

Most frequent character per script

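Hangul
Value | Count | Frequency (%)
(not recovered) | 1693 | 7.9%
(not recovered) | 1647 | 7.7%
(not recovered) | 1437 | 6.7%
(not recovered) | 1400 | 6.5%
(not recovered) | 1374 | 6.4%
(not recovered) | 1366 | 6.4%
(not recovered) | 1360 | 6.3%
(not recovered) | 1193 | 5.6%
(not recovered) | 1105 | 5.1%
(not recovered) | 632 | 2.9%
Other values (141) | 8256 | 38.5%

Common
Value | Count | Frequency (%)
(space) | 7233 | 39.1%
1 | 1726 | 9.3%
2 | 1154 | 6.2%
- | 1105 | 6.0%
3 | 1030 | 5.6%
, | 901 | 4.9%
4 | 897 | 4.8%
5 | 749 | 4.0%
6 | 683 | 3.7%
8 | 642 | 3.5%
Other values (8) | 2383 | 12.9%

Latin
Value | Count | Frequency (%)
A | 1 | 100.0%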

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 21463 | 53.7%
ASCII | 18504 | 46.3%

Most frequent character per block

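ASCII
Value | Count | Frequency (%)
(space) | 7233 | 39.1%
1 | 1726 | 9.3%
2 | 1154 | 6.2%
- | 1105 | 6.0%
3 | 1030 | 5.6%
, | 901 | 4.9%
4 | 897 | 4.8%
5 | 749 | 4.0%
6 | 683 | 3.7%
8 | 642 | 3.5%
Other values (9) | 2384 | 12.9%

Hangul
Value | Count | Frequency (%)
(not recovered) | 1693 | 7.9%
(not recovered) | 1647 | 7.7%
(not recovered) | 1437 | 6.7%
(not recovered) | 1400 | 6.5%
(not recovered) | 1374 | 6.4%
(not recovered) | 1366 | 6.4%
(not recovered) | 1360 | 6.3%
(not recovered) | 1193 | 5.6%
(not recovered) | 1105 | 5.1%
(not recovered) | 632 | 2.9%
Other values (141) | 8256 | 38.5%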

최초허가일
DateTime

Distinct: 440
Distinct (%): 32.3%
Missing: 0
Missing (%): 0.0%
Memory size: 10.8 KiB
Minimum: 2007-05-18 00:00:00
Maximum: 2022-11-16 00:00:00
Histogram with fixed size bins (bins=50)
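
For a sense of scale, the time span covered by the permit dates and the width of each of the 50 histogram bins can be worked out from the minimum and maximum. This is a small sketch under the assumption that the bins divide the full min-to-max span evenly; the variable names are hypothetical.

```python
from datetime import date

# Earliest and latest 최초허가일 values from the report.
first = date(2007, 5, 18)
last = date(2022, 11, 16)

# Total span in days, and the width of one of 50 equal bins.
span_days = (last - first).days
bin_width_days = span_days / 50

print(span_days)                   # 5661
print(round(bin_width_days, 1))    # 113.2
```

Under that assumption each bar of the histogram covers a little under four months of permits.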

데이터기준일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 10.8 KiB

Value | Count
2023-04-17 | 1363

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023-04-17
2nd row: 2023-04-17
3rd row: 2023-04-17
4th row: 2023-04-17
5th row: 2023-04-17

Common Values

Value | Count | Frequency (%)
2023-04-17 | 1363 | 100.0%
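
A column like this one, where a single value accounts for every row, is what the Constant alert refers to. A minimal sketch of such a check, assuming "constant" simply means exactly one distinct value (the helper name `is_constant` is hypothetical and this is not the profiler's own implementation):

```python
def is_constant(values) -> bool:
    """Return True when `values` holds exactly one distinct value."""
    return len(set(values)) == 1


# The five sample rows of 데이터기준일자 shown in this report.
sample = ["2023-04-17"] * 5

print(is_constant(sample))                          # True
print(is_constant(["2023-04-17", "2023-04-18"]))    # False
```

A constant column carries no information for distinguishing rows, so it can usually be recorded once as dataset metadata instead.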

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2023-04-17 | 1363 | 100.0%

Interactions


Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 발전소명 | 설비용량 | 발전소주소 | 최초허가일 | 데이터기준일자
0 | 사천마을 태양광발전소 | 19.0 | 충청남도 홍성군 갈산면 기산리 197-1, 196-5, 196-6 주1동(건물위) | 2022-11-16 | 2023-04-17
1 | 연아8호 태양광발전소 | 99.71 | 충청남도 홍성군 장곡면 가송리 451 외19필지 | 2022-10-20 | 2023-04-17
2 | 연아7호 태양광발전소 | 99.71 | 충청남도 홍성군 장곡면 가송리 451 외 19필지 | 2022-10-20 | 2023-04-17
3 | 연아6호 태양광발전소 | 99.71 | 충청남도 홍성군 장곡면 가송리 451 외 19필지 | 2022-10-20 | 2023-04-17
4 | 연아5호 태양광발전소 | 99.71 | 충청남도 홍성군 장곡면 가송리 451 외 19필지 | 2022-10-20 | 2023-04-17
5 | 연아4호 태양광발전소 | 99.71 | 충청남도 홍성군 장곡면 가송리 451 외 19필지 | 2022-10-20 | 2023-04-17
6 | 연아3호 태양광발전소 | 99.71 | 충청남도 홍성군 장곡면 가송리 451 외 19필지 | 2022-10-20 | 2023-04-17
7 | 연아2호 태양광발전소 | 99.71 | 충청남도 홍성군 장곡면 가송리 451 외 19필지 | 2022-10-20 | 2023-04-17
8 | 연아1호 태양광발전소 | 99.71 | 충청남도 홍성군 장곡면 가송리 460-1, 460-2 주1동(건물위) | 2022-10-20 | 2023-04-17
9 | 유니트 태양광발전소 | 605.09 | 충청남도 홍성군 서부면 광리 4-1외 1필지 주1동, 4-4 주1동, 4-6 주1동(건물위) | 2022-10-18 | 2023-04-17

Last rows

Index | 발전소명 | 설비용량 | 발전소주소 | 최초허가일 | 데이터기준일자
1353 | 대율 태양광발전소 | 999.0 | 충청남도 홍성군 은하면 대율리 398 | 2016-06-09 | 2023-04-17
1354 | 월림산 태양광발전소 | 999.0 | 충청남도 홍성군 광천읍 광금남로63번길 109-14 | 2016-03-14 | 2023-04-17
1355 | ㈜우리파워15호 태양광발전소 | 603.84 | 충청남도 홍성군 장곡면 옥계리 산 90 | 2016-03-14 | 2023-04-17
1356 | 대송 태양광발전소 | 999.0 | 충청남도 홍성군 결성면 형산리 산 168-12 | 2016-02-22 | 2023-04-17
1357 | 백현 태양광발전소 | 870.0 | 충청남도 홍성군 은하면 장곡리 462-1 대율리 51-7, 51-8 | 2015-11-30 | 2023-04-17
1358 | 성남리2호 태양광발전소 | 990.0 | 충청남도 홍성군 결성면 성남리 1106-2 | 2015-09-07 | 2023-04-17
1359 | 성남리1호 태양광발전소 | 990.0 | 충청남도 홍성군 결성면 성남리 1106-2 | 2015-09-07 | 2023-04-17
1360 | 삼산에너지발전소 | 700.0 | 충청남도 홍성군 장곡면 신풍리 769 | 2015-07-08 | 2023-04-17
1361 | 단비덕실 태양광발전소 | 934.2 | 충청남도 홍성군 은하면 홍남로22번길 149 | 2015-04-29 | 2023-04-17
1362 | (주)미르에너지 태양광발전소 | 993.24 | 충청남도 홍성군 구항면 신곡리 산 56-3 | 2015-03-05 | 2023-04-17

Duplicate rows

Most frequently occurring

Index | 발전소명 | 설비용량 | 발전소주소 | 최초허가일 | 데이터기준일자 | # duplicates
0 | 유니쏠라테크 | 29.76 | 충청남도 홍성군 광천읍 가정리 257번지 12호 | 2015-10-06 | 2023-04-17 | 2
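
Exact duplicates of this kind can be found by treating each row as a tuple and counting repeats. The sketch below is illustrative only: it builds a three-row toy list from rows shown in this report (the full dataset is not reproduced here), and the variable names are hypothetical.

```python
from collections import Counter

# Toy input: the duplicated row listed above (twice) plus one sample row.
dup = ("유니쏠라테크", 29.76, "충청남도 홍성군 광천읍 가정리 257번지 12호",
       "2015-10-06", "2023-04-17")
other = ("삼산에너지발전소", 700.0, "충청남도 홍성군 장곡면 신풍리 769",
         "2015-07-08", "2023-04-17")
rows = [dup, other, dup]

# Count identical rows; keep those occurring more than once.
counts = Counter(rows)
duplicates = {row: n for row, n in counts.items() if n > 1}

for row, n in duplicates.items():
    print(row[0], n)  # 유니쏠라테크 2
```

Note that this only catches byte-for-byte matches. The sample addresses "451 외19필지" and "451 외 19필지" differ by a single space, so rows that vary only in such spacing would not be flagged without normalising whitespace first.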