Overview

Dataset statistics

Number of variables6
Number of observations655
Missing cells51
Missing cells (%)1.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory31.5 KiB
Average record size in memory49.2 B

Variable types

Text2
Numeric1
DateTime3

Dataset

Description충청남도 홍성군 태양광발전설치현황에 대한 데이터로 발전소명, 설비용량, 발전소주소, 최초허가일, 데이터기준일자에 대한 정보를 제공합니다.     
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=431&beforeMenuCd=DOM_000000201001001000&publicdatapk=15034093

Alerts

데이터기준일 has constant value ""Constant
사업개시일 has 51 (7.8%) missing valuesMissing

Reproduction

Analysis started2024-01-09 21:49:04.374955
Analysis finished2024-01-09 21:49:04.832536
Duration0.46 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct639
Distinct (%)97.6%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
2024-01-10T06:49:05.019972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length18
Mean length9.4137405
Min length2

Characters and Unicode

Total characters6166
Distinct characters291
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique625 ?
Unique (%)95.4%

Sample

1st row일흥목장3호 태양광발전소
2nd row(주)에스에프씨 홍성 태양광 발전소 2호기
3rd row썬플라워 태양광발전소
4th row홍동면 수란마을회태양광발전소
5th row조양8호태양광발전소
ValueCountFrequency (%)
태양광발전소 164
 
18.8%
발전소 12
 
1.4%
태양광 10
 
1.1%
무량 8
 
0.9%
솔라앤팜 5
 
0.6%
내포태양광발전소 3
 
0.3%
대성태양광발전소 3
 
0.3%
현대 3
 
0.3%
이호 3
 
0.3%
기산태양광발전소 2
 
0.2%
Other values (645) 659
75.6%
2024-01-10T06:49:05.366957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
614
 
10.0%
606
 
9.8%
606
 
9.8%
590
 
9.6%
590
 
9.6%
586
 
9.5%
245
 
4.0%
219
 
3.6%
1 84
 
1.4%
2 81
 
1.3%
Other values (281) 1945
31.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5560
90.2%
Decimal Number 294
 
4.8%
Space Separator 219
 
3.6%
Uppercase Letter 49
 
0.8%
Open Punctuation 19
 
0.3%
Close Punctuation 19
 
0.3%
Other Symbol 2
 
< 0.1%
Other Punctuation 2
 
< 0.1%
Dash Punctuation 1
 
< 0.1%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
614
 
11.0%
606
 
10.9%
606
 
10.9%
590
 
10.6%
590
 
10.6%
586
 
10.5%
245
 
4.4%
68
 
1.2%
49
 
0.9%
49
 
0.9%
Other values (247) 1557
28.0%
Uppercase Letter
ValueCountFrequency (%)
Y 9
18.4%
S 8
16.3%
D 6
12.2%
A 5
10.2%
M 3
 
6.1%
N 3
 
6.1%
K 3
 
6.1%
T 2
 
4.1%
U 2
 
4.1%
G 2
 
4.1%
Other values (6) 6
12.2%
Decimal Number
ValueCountFrequency (%)
1 84
28.6%
2 81
27.6%
3 38
12.9%
4 23
 
7.8%
5 16
 
5.4%
0 16
 
5.4%
6 11
 
3.7%
7 10
 
3.4%
8 10
 
3.4%
9 5
 
1.7%
Other Punctuation
ValueCountFrequency (%)
. 1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
219
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5562
90.2%
Common 554
 
9.0%
Latin 50
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
614
 
11.0%
606
 
10.9%
606
 
10.9%
590
 
10.6%
590
 
10.6%
586
 
10.5%
245
 
4.4%
68
 
1.2%
49
 
0.9%
49
 
0.9%
Other values (248) 1559
28.0%
Latin
ValueCountFrequency (%)
Y 9
18.0%
S 8
16.0%
D 6
12.0%
A 5
10.0%
M 3
 
6.0%
N 3
 
6.0%
K 3
 
6.0%
T 2
 
4.0%
U 2
 
4.0%
G 2
 
4.0%
Other values (7) 7
14.0%
Common
ValueCountFrequency (%)
219
39.5%
1 84
 
15.2%
2 81
 
14.6%
3 38
 
6.9%
4 23
 
4.2%
( 19
 
3.4%
) 19
 
3.4%
5 16
 
2.9%
0 16
 
2.9%
6 11
 
2.0%
Other values (6) 28
 
5.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5560
90.2%
ASCII 602
 
9.8%
None 3
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
614
 
11.0%
606
 
10.9%
606
 
10.9%
590
 
10.6%
590
 
10.6%
586
 
10.5%
245
 
4.4%
68
 
1.2%
49
 
0.9%
49
 
0.9%
Other values (247) 1557
28.0%
ASCII
ValueCountFrequency (%)
219
36.4%
1 84
 
14.0%
2 81
 
13.5%
3 38
 
6.3%
4 23
 
3.8%
( 19
 
3.2%
) 19
 
3.2%
5 16
 
2.7%
0 16
 
2.7%
6 11
 
1.8%
Other values (21) 76
 
12.6%
None
ValueCountFrequency (%)
2
66.7%
1
33.3%
Number Forms
ValueCountFrequency (%)
1
100.0%

설비용량
Real number (ℝ)

Distinct219
Distinct (%)33.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean143.3994
Minimum9.6
Maximum999
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.9 KiB
2024-01-10T06:49:05.486330image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum9.6
5-th percentile29.991
Q197.5
median99.23
Q399.9
95-th percentile496.4
Maximum999
Range989.4
Interquartile range (IQR)2.4

Descriptive statistics

Standard deviation139.22069
Coefficient of variation (CV)0.97085961
Kurtosis11.863467
Mean143.3994
Median Absolute Deviation (MAD)0.83
Skewness3.104916
Sum93926.61
Variance19382.401
MonotonicityNot monotonic
2024-01-10T06:49:05.590296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
99.9 73
 
11.1%
99.0 67
 
10.2%
99.28 28
 
4.3%
99.36 28
 
4.3%
99.2 23
 
3.5%
97.92 23
 
3.5%
99.23 20
 
3.1%
95.76 18
 
2.7%
97.2 17
 
2.6%
99.45 17
 
2.6%
Other values (209) 341
52.1%
ValueCountFrequency (%)
9.6 1
0.2%
10.0 2
0.3%
14.4 1
0.2%
14.53 1
0.2%
15.0 2
0.3%
15.12 1
0.2%
16.2 1
0.2%
16.43 1
0.2%
16.5 1
0.2%
17.28 1
0.2%
ValueCountFrequency (%)
999.0 2
0.3%
996.0 2
0.3%
994.0 1
0.2%
843.66 1
0.2%
600.0 1
0.2%
499.8 2
0.3%
499.77 1
0.2%
499.59 2
0.3%
499.38 2
0.3%
499.28 1
0.2%
Distinct584
Distinct (%)89.2%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
2024-01-10T06:49:05.808670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length70
Median length59
Mean length25.215267
Min length15

Characters and Unicode

Total characters16516
Distinct characters128
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique545 ?
Unique (%)83.2%

Sample

1st row충남 홍성군 홍동면 구정리 431-1
2nd row충남 홍성군 구항면 공리 456-3외 2필지
3rd row홍성군 홍동면 구정리 704, 705-7(건물위)
4th row홍성군 홍동면 광금북로 114번길 13
5th row충남 홍성군 광첩은 가정리 263, 264, 264-1, 15-2
ValueCountFrequency (%)
홍성군 654
 
18.0%
충남 549
 
15.1%
갈산면 92
 
2.5%
결성면 86
 
2.4%
광천읍 85
 
2.3%
은하면 78
 
2.2%
구항면 72
 
2.0%
홍동면 71
 
2.0%
충청남도 49
 
1.4%
서부면 48
 
1.3%
Other values (946) 1841
50.8%
2024-01-10T06:49:06.215520image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2972
18.0%
796
 
4.8%
785
 
4.8%
- 770
 
4.7%
1 731
 
4.4%
655
 
4.0%
644
 
3.9%
613
 
3.7%
601
 
3.6%
2 593
 
3.6%
Other values (118) 7356
44.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8059
48.8%
Decimal Number 3919
23.7%
Space Separator 2972
 
18.0%
Dash Punctuation 770
 
4.7%
Other Punctuation 364
 
2.2%
Open Punctuation 214
 
1.3%
Close Punctuation 214
 
1.3%
Math Symbol 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
796
 
9.9%
785
 
9.7%
655
 
8.1%
644
 
8.0%
613
 
7.6%
601
 
7.5%
524
 
6.5%
232
 
2.9%
203
 
2.5%
194
 
2.4%
Other values (101) 2812
34.9%
Decimal Number
ValueCountFrequency (%)
1 731
18.7%
2 593
15.1%
3 496
12.7%
4 433
11.0%
5 337
8.6%
6 311
7.9%
7 304
7.8%
8 290
 
7.4%
9 214
 
5.5%
0 210
 
5.4%
Other Punctuation
ValueCountFrequency (%)
, 361
99.2%
/ 3
 
0.8%
Space Separator
ValueCountFrequency (%)
2972
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 770
100.0%
Open Punctuation
ValueCountFrequency (%)
( 214
100.0%
Close Punctuation
ValueCountFrequency (%)
) 214
100.0%
Math Symbol
ValueCountFrequency (%)
~ 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 8457
51.2%
Hangul 8059
48.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
796
 
9.9%
785
 
9.7%
655
 
8.1%
644
 
8.0%
613
 
7.6%
601
 
7.5%
524
 
6.5%
232
 
2.9%
203
 
2.5%
194
 
2.4%
Other values (101) 2812
34.9%
Common
ValueCountFrequency (%)
2972
35.1%
- 770
 
9.1%
1 731
 
8.6%
2 593
 
7.0%
3 496
 
5.9%
4 433
 
5.1%
, 361
 
4.3%
5 337
 
4.0%
6 311
 
3.7%
7 304
 
3.6%
Other values (7) 1149
 
13.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 8457
51.2%
Hangul 8059
48.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2972
35.1%
- 770
 
9.1%
1 731
 
8.6%
2 593
 
7.0%
3 496
 
5.9%
4 433
 
5.1%
, 361
 
4.3%
5 337
 
4.0%
6 311
 
3.7%
7 304
 
3.6%
Other values (7) 1149
 
13.6%
Hangul
ValueCountFrequency (%)
796
 
9.9%
785
 
9.7%
655
 
8.1%
644
 
8.0%
613
 
7.6%
601
 
7.5%
524
 
6.5%
232
 
2.9%
203
 
2.5%
194
 
2.4%
Other values (101) 2812
34.9%
Distinct249
Distinct (%)38.0%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
Minimum2014-02-18 00:00:00
Maximum2021-03-30 00:00:00
2024-01-10T06:49:06.327897image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:49:06.426937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

사업개시일
Date

MISSING 

Distinct233
Distinct (%)38.6%
Missing51
Missing (%)7.8%
Memory size5.2 KiB
Minimum2014-04-07 00:00:00
Maximum2021-04-27 00:00:00
2024-01-10T06:49:06.744205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:49:06.842560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

데이터기준일
Date

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
Minimum2021-05-06 00:00:00
Maximum2021-05-06 00:00:00
2024-01-10T06:49:06.918568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:49:06.987517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2024-01-10T06:49:04.634010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2024-01-10T06:49:04.721248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T06:49:04.800360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

발전소명설비용량발전소주소최초허가일사업개시일데이터기준일
0일흥목장3호 태양광발전소30.0충남 홍성군 홍동면 구정리 431-12014-02-182014-04-072021-05-06
1(주)에스에프씨 홍성 태양광 발전소 2호기302.59충남 홍성군 구항면 공리 456-3외 2필지2014-02-182014-06-052021-05-06
2썬플라워 태양광발전소98.28홍성군 홍동면 구정리 704, 705-7(건물위)2014-03-252014-05-152021-05-06
3홍동면 수란마을회태양광발전소10.0홍성군 홍동면 광금북로 114번길 132014-04-102014-04-102021-05-06
4조양8호태양광발전소497.61충남 홍성군 광첩은 가정리 263, 264, 264-1, 15-22014-04-23<NA>2021-05-06
5속동태양광발전소16.2충남 홍성군 서부면 상황리 685-1(건물위)2014-06-112014-09-112021-05-06
6경우 태양광발전소97.5충남 홍성군 구항면 남산리 4-1(건물위)2014-06-112014-07-102021-05-06
7이호태양광발전소99.0충남 홍성군 서부면 이호리 394-1, 2(건물위)2014-08-062014-09-262021-05-06
8보영1호 태양광발전소99.0충남 홍성군 결성면 성남리 산68-22014-08-072016-02-012021-05-06
9조양7호태양광발전소497.61충남 홍성군 광천읍 가정리 2642014-08-11<NA>2021-05-06
발전소명설비용량발전소주소최초허가일사업개시일데이터기준일
645꿈의1호 태양광발전소99.96홍성군 구항면 장양리 209-1외 3필지 주1동(건물위)2021-01-21<NA>2021-05-06
646석순2태양광발전소86.32홍성군 갈산면 동성리 395-2 주1~3동(건물위)2021-01-27<NA>2021-05-06
647거산1 태양광발전소67.34홍성군 은하면 학산리 204-2(건물위)2021-02-16<NA>2021-05-06
648거산 태양광발전소21.84홍성군 은하면 학산리 204(건물위)2021-02-16<NA>2021-05-06
649미어실2호태양광발전소43.68홍성군 은하면 덕실리 90-3, 232-5, 232-6, 232-7, 246-2, 246-5, 246-7, 246-8(건물위)2021-03-082021-04-142021-05-06
650미어실1호태양광발전소98.28홍성군 은하면 덕실리 90-3, 232-5, 232-6, 232-7, 246-2, 246-5, 246-7, 246-8(건물위)2021-03-082021-04-142021-05-06
651봉연발전소99.36홍성군 구항면 마온1길 38-1 주1~3동(건물위)2021-03-10<NA>2021-05-06
652황금3호 태양광 발전소99.0홍성군 결성면 무량리 67-8 주1~2동(건물위)2021-03-30<NA>2021-05-06
653황금2호 태양광 발전소99.0홍성군 결성면 무량리 67-8 주1동(건물위)2021-03-30<NA>2021-05-06
654황금1호 태양광 발전소99.0홍성군 결성면 무량리 67-8 주1동(건물위)2021-03-30<NA>2021-05-06