Overview

Dataset statistics

Number of variables: 5
Number of observations: 1017
Missing cells: 282
Missing cells (%): 5.5%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 40.8 KiB
Average record size in memory: 41.1 B
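The two derived figures above can be reproduced from the raw counts. The sketch below assumes that "Missing cells (%)" is the share of empty cells among all rows × columns cells and that the average record size is total memory divided by the number of rows; the function names are illustrative and not part of ydata-profiling.

```python
def missing_cell_pct(missing_cells: int, n_rows: int, n_cols: int) -> float:
    """Percentage of empty cells among all rows x columns cells."""
    return 100.0 * missing_cells / (n_rows * n_cols)


def avg_record_size_bytes(total_kib: float, n_rows: int) -> float:
    """Average bytes per row, given the total size in KiB."""
    return total_kib * 1024 / n_rows


# Values taken from the dataset statistics above.
pct = missing_cell_pct(missing_cells=282, n_rows=1017, n_cols=5)
avg = avg_record_size_bytes(total_kib=40.8, n_rows=1017)
print(f"{pct:.1f}%  {avg:.1f} B")
```

Both results round to the reported 5.5% and 41.1 B.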

Variable types

Text: 2
Numeric: 1
DateTime: 2

Dataset

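Description: Data on the status of solar power installations in Hongseong-gun, Chungcheongnam-do. It provides the plant name, installed capacity, plant address, initial permit date, and data reference date.
Author: Chungcheongnam-do (충청남도)
URL: https://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=431&beforeMenuCd=DOM_000000201001001000&publicdatapk=15034093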

Alerts

사업개시일 has 282 (27.7%) missing values [Missing]

Reproduction

Analysis started: 2024-01-09 21:49:17.700117
Analysis finished: 2024-01-09 21:49:18.145472
Duration: 0.45 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
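A report like this one is typically produced by loading the source file into pandas and passing the DataFrame to `ProfileReport`. The sketch below is a minimal example under stated assumptions: the CSV path, the output file name, the `cp949` encoding (common for Korean public data, but not confirmed for this file), and the report title are all hypothetical. The column names are taken from the sample table in this report.

```python
from pathlib import Path


def build_report(csv_path: str, out_html: str = "report.html",
                 encoding: str = "cp949") -> Path:
    """Profile a CSV file and write the HTML report to out_html."""
    # Third-party imports live inside the function so that this module still
    # loads on machines where pandas / ydata-profiling are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the two date columns are stored as parseable date strings.
    df = pd.read_csv(csv_path, encoding=encoding,
                     parse_dates=["최초허가일", "사업개시일"])
    profile = ProfileReport(df, title="Hongseong-gun solar installations")
    profile.to_file(out_html)
    return Path(out_html)
```

Calling `build_report("solar.csv")` with a hypothetical local file would write `report.html` next to the script.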

Variables

발전소명
Text

Distinct: 982
Distinct (%): 96.6%
Missing: 0
Missing (%): 0.0%
Memory size: 8.1 KiB

Length

Max length: 29
Median length: 23
Mean length: 9.8230088
Min length: 2

Characters and Unicode

Total characters: 9990
Distinct characters: 340
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
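As an illustration of the idea, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories for a list of strings; the `CATEGORY_NAMES` mapping and the function name are illustrative choices, not the profiler's internals, and the standard library does not cover scripts or blocks.

```python
import unicodedata
from collections import Counter

# Two-letter Unicode general-category codes mapped to their long names.
CATEGORY_NAMES = {
    "Lo": "Other Letter", "Lu": "Uppercase Letter", "Nd": "Decimal Number",
    "Nl": "Letter Number", "Zs": "Space Separator", "Ps": "Open Punctuation",
    "Pe": "Close Punctuation", "Pd": "Dash Punctuation",
    "Po": "Other Punctuation", "So": "Other Symbol", "Sm": "Math Symbol",
}


def category_counts(texts):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for text in texts:
        for ch in text:
            code = unicodedata.category(ch)
            # Fall back to the raw code for categories not in the mapping.
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


# One plant name taken from the sample rows of this report.
counts = category_counts(["(주)미르에너지 태양광발전소"])
print(counts.most_common())
```

Hangul syllables fall under "Other Letter", which is why that category dominates the tables in this report.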

Unique

Unique: 950
Unique (%): 93.4%

Sample

1st row: 한가람 태양광발전소
2nd row: 두영1호 태양광발전소
3rd row: 봉서 태양광발전소
4th row: 두영2호 태양광발전소
5th row: 두영3호 태양광발전소

Value | Count | Frequency (%)
태양광발전소 | 328 | 22.9%
발전소 | 14 | 1.0%
태양광 | 12 | 0.8%
수상태양광발전소 | 11 | 0.8%
무량 | 8 | 0.6%
그린 | 5 | 0.3%
솔라앤팜 | 5 | 0.3%
에너지 | 5 | 0.3%
홍성 | 4 | 0.3%
기산태양광발전소 | 3 | 0.2%
Other values (989) | 1038 | 72.4%
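The word table above appears to count whitespace-separated tokens, with each frequency expressed relative to the total token count (328 of 1433 tokens is 22.9%). A minimal sketch of that counting, assuming plain whitespace splitting and using the five sample rows as input:

```python
from collections import Counter


def word_counts(values):
    """Count whitespace-separated tokens across all strings."""
    counts = Counter()
    for value in values:
        counts.update(value.split())
    return counts


# The five sample rows shown in this report.
sample = [
    "한가람 태양광발전소",
    "두영1호 태양광발전소",
    "봉서 태양광발전소",
    "두영2호 태양광발전소",
    "두영3호 태양광발전소",
]
counts = word_counts(sample)
total = sum(counts.values())
for word, n in counts.most_common(3):
    print(f"{word} | {n} | {100 * n / total:.1f}%")
```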

Most occurring characters

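Value | Count | Frequency (%)
(not preserved) | 963 | 9.6%
(not preserved) | 946 | 9.5%
(not preserved) | 945 | 9.5%
(not preserved) | 936 | 9.4%
(not preserved) | 930 | 9.3%
(not preserved) | 925 | 9.3%
(space) | 418 | 4.2%
(not preserved) | 394 | 3.9%
2 | 141 | 1.4%
1 | 139 | 1.4%
Other values (330) | 3253 | 32.6%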

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 8911 | 89.2%
Decimal Number | 492 | 4.9%
Space Separator | 418 | 4.2%
Uppercase Letter | 69 | 0.7%
Close Punctuation | 39 | 0.4%
Open Punctuation | 39 | 0.4%
Other Symbol | 18 | 0.2%
Other Punctuation | 2 | < 0.1%
Letter Number | 1 | < 0.1%
Dash Punctuation | 1 | < 0.1%

Most frequent character per category

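Other Letter
Value | Count | Frequency (%)
(not preserved) | 963 | 10.8%
(not preserved) | 946 | 10.6%
(not preserved) | 945 | 10.6%
(not preserved) | 936 | 10.5%
(not preserved) | 930 | 10.4%
(not preserved) | 925 | 10.4%
(not preserved) | 394 | 4.4%
(not preserved) | 107 | 1.2%
(not preserved) | 85 | 1.0%
(not preserved) | 80 | 0.9%
Other values (295) | 2600 | 29.2%

Uppercase Letter
Value | Count | Frequency (%)
S | 12 | 17.4%
Y | 10 | 14.5%
A | 7 | 10.1%
D | 6 | 8.7%
B | 6 | 8.7%
H | 5 | 7.2%
M | 4 | 5.8%
K | 4 | 5.8%
G | 3 | 4.3%
N | 3 | 4.3%
Other values (7) | 9 | 13.0%

Decimal Number
Value | Count | Frequency (%)
2 | 141 | 28.7%
1 | 139 | 28.3%
3 | 68 | 13.8%
4 | 36 | 7.3%
5 | 29 | 5.9%
0 | 20 | 4.1%
7 | 18 | 3.7%
6 | 17 | 3.5%
8 | 15 | 3.0%
9 | 9 | 1.8%

Other Punctuation
Value | Count | Frequency (%)
. | 1 | 50.0%
(not preserved) | 1 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 418 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 39 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 39 | 100.0%

Other Symbol
Value | Count | Frequency (%)
(not preserved) | 18 | 100.0%

Letter Number
Value | Count | Frequency (%)
(not preserved) | 1 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 1 | 100.0%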

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 8929 | 89.4%
Common | 991 | 9.9%
Latin | 70 | 0.7%

Most frequent character per script

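Hangul
Value | Count | Frequency (%)
(not preserved) | 963 | 10.8%
(not preserved) | 946 | 10.6%
(not preserved) | 945 | 10.6%
(not preserved) | 936 | 10.5%
(not preserved) | 930 | 10.4%
(not preserved) | 925 | 10.4%
(not preserved) | 394 | 4.4%
(not preserved) | 107 | 1.2%
(not preserved) | 85 | 1.0%
(not preserved) | 80 | 0.9%
Other values (296) | 2618 | 29.3%

Latin
Value | Count | Frequency (%)
S | 12 | 17.1%
Y | 10 | 14.3%
A | 7 | 10.0%
D | 6 | 8.6%
B | 6 | 8.6%
H | 5 | 7.1%
M | 4 | 5.7%
K | 4 | 5.7%
G | 3 | 4.3%
N | 3 | 4.3%
Other values (8) | 10 | 14.3%

Common
Value | Count | Frequency (%)
(space) | 418 | 42.2%
2 | 141 | 14.2%
1 | 139 | 14.0%
3 | 68 | 6.9%
) | 39 | 3.9%
( | 39 | 3.9%
4 | 36 | 3.6%
5 | 29 | 2.9%
0 | 20 | 2.0%
7 | 18 | 1.8%
Other values (6) | 44 | 4.4%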

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 8911 | 89.2%
ASCII | 1059 | 10.6%
None | 19 | 0.2%
Number Forms | 1 | < 0.1%

Most frequent character per block

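Hangul
Value | Count | Frequency (%)
(not preserved) | 963 | 10.8%
(not preserved) | 946 | 10.6%
(not preserved) | 945 | 10.6%
(not preserved) | 936 | 10.5%
(not preserved) | 930 | 10.4%
(not preserved) | 925 | 10.4%
(not preserved) | 394 | 4.4%
(not preserved) | 107 | 1.2%
(not preserved) | 85 | 1.0%
(not preserved) | 80 | 0.9%
Other values (295) | 2600 | 29.2%

ASCII
Value | Count | Frequency (%)
(space) | 418 | 39.5%
2 | 141 | 13.3%
1 | 139 | 13.1%
3 | 68 | 6.4%
) | 39 | 3.7%
( | 39 | 3.7%
4 | 36 | 3.4%
5 | 29 | 2.7%
0 | 20 | 1.9%
7 | 18 | 1.7%
Other values (22) | 112 | 10.6%

None
Value | Count | Frequency (%)
(not preserved) | 18 | 94.7%
(not preserved) | 1 | 5.3%

Number Forms
Value | Count | Frequency (%)
(not preserved) | 1 | 100.0%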

설비용량
Real number (ℝ)

Distinct: 355
Distinct (%): 34.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 203.6111
Minimum: 5.04
Maximum: 1000
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 9.1 KiB

Quantile statistics

Minimum: 5.04
5-th percentile: 26.68
Q1: 97.2
Median: 99.23
Q3: 99.96
95-th percentile: 996
Maximum: 1000
Range: 994.96
Interquartile range (IQR): 2.76

Descriptive statistics

Standard deviation: 259.38065
Coefficient of variation (CV): 1.2739023
Kurtosis: 3.9206936
Mean: 203.6111
Median Absolute Deviation (MAD): 2.03
Skewness: 2.2580325
Sum: 207072.49
Variance: 67278.324
Monotonicity: Not monotonic
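Several of the quantile and descriptive statistics above are related by simple identities, which gives a quick consistency check on the report. The sketch below uses only the reported figures; the tolerances are an assumption chosen to absorb the rounding in the printed values.

```python
import math

# Figures reported for 설비용량 in this report.
n = 1017
total = 207072.49
mean = 203.6111
std = 259.38065
variance = 67278.324
cv = 1.2739023
q1, q3, iqr = 97.2, 99.96, 2.76
minimum, maximum, value_range = 5.04, 1000.0, 994.96

checks = {
    "mean = sum / n": math.isclose(total / n, mean, rel_tol=1e-6),
    "variance = std ** 2": math.isclose(std ** 2, variance, rel_tol=1e-6),
    "cv = std / mean": math.isclose(std / mean, cv, rel_tol=1e-6),
    "iqr = q3 - q1": math.isclose(q3 - q1, iqr, abs_tol=1e-9),
    "range = max - min": math.isclose(maximum - minimum, value_range,
                                      abs_tol=1e-9),
}
for name, ok in checks.items():
    print(f"{name}: {'ok' if ok else 'MISMATCH'}")
```

The very small IQR (2.76) next to a range of 994.96 reflects how tightly the values cluster just below 100, with a second, smaller cluster just below 1000.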
Histogram with fixed size bins (bins=50)
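Fixed-size binning splits the interval between the minimum and the maximum into equally wide bins and counts the values that fall in each. The sketch below is a plain-Python stand-in for that idea, not the plotting code used by the report; the function name and the handling of the top edge are illustrative choices.

```python
def fixed_bin_counts(values, bins=50):
    """Count values per equal-width bin spanning [min(values), max(values)]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # Degenerate case: every value is identical, so use the first bin.
        return [len(values)] + [0] * (bins - 1)
    width = (hi - lo) / bins
    counts = [0] * bins
    for v in values:
        # The maximum would land on index `bins`; clamp it into the last bin.
        idx = min(int((v - lo) / width), bins - 1)
        counts[idx] += 1
    return counts


# A few 설비용량 values that appear in this report.
hist = fixed_bin_counts([5.04, 99.9, 99.0, 997.92, 1000.0], bins=50)
print([(i, c) for i, c in enumerate(hist) if c])
```

With 50 bins over a range of 994.96, each bin is about 19.9 wide, so all values between roughly 84.6 and 104.5 share a single bin.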
Common values
Value | Count | Frequency (%)
99.9 | 81 | 8.0%
99.0 | 72 | 7.1%
99.28 | 40 | 3.9%
97.92 | 40 | 3.9%
99.36 | 35 | 3.4%
99.2 | 31 | 3.0%
997.92 | 29 | 2.9%
99.84 | 25 | 2.5%
97.2 | 24 | 2.4%
99.23 | 21 | 2.1%
Other values (345) | 619 | 60.9%

Minimum 10 values
Value | Count | Frequency (%)
5.04 | 1 | 0.1%
9.6 | 1 | 0.1%
10.0 | 2 | 0.2%
14.4 | 1 | 0.1%
14.53 | 1 | 0.1%
15.0 | 3 | 0.3%
15.12 | 1 | 0.1%
15.3 | 1 | 0.1%
16.06 | 1 | 0.1%
16.2 | 1 | 0.1%

Maximum 10 values
Value | Count | Frequency (%)
1000.0 | 1 | 0.1%
999.99 | 1 | 0.1%
999.6 | 2 | 0.2%
999.0 | 7 | 0.7%
998.4 | 2 | 0.2%
997.92 | 29 | 2.9%
997.56 | 4 | 0.4%
996.96 | 1 | 0.1%
996.84 | 1 | 0.1%
996.0 | 4 | 0.4%

발전소주소
Text

Distinct: 856
Distinct (%): 84.2%
Missing: 0
Missing (%): 0.0%
Memory size: 8.1 KiB

Length

Max length: 91
Median length: 59
Mean length: 28.726647
Min length: 18

Characters and Unicode

Total characters: 29215
Distinct characters: 168
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 768
Unique (%): 75.5%

Sample

1st row: 충청남도 홍성군 결성면 성곡리 482-7 주1동(건물위)
2nd row: 충청남도 홍성군 금마면 죽림리 874-2, 875 주1동(건물위)
3rd row: 충청남도 홍성군 금마면 봉서리 442, 441, 443, 443-1, 443-3, 470 주3동(건물위)
4th row: 충청남도 홍성군 금마면 죽림리 874-2, 875 주2동(건물위)
5th row: 충청남도 홍성군 금마면 죽림리 874-2, 875 주1동, 주2동(건물위)

Value | Count | Frequency (%)
충청남도 | 1015 | 16.1%
홍성군 | 1014 | 16.1%
은하면 | 129 | 2.0%
결성면 | 129 | 2.0%
갈산면 | 121 | 1.9%
광천읍 | 112 | 1.8%
구항면 | 109 | 1.7%
(not preserved) | 96 | 1.5%
홍동면 | 91 | 1.4%
(not preserved) | 83 | 1.3%
Other values (1338) | 3403 | 54.0%

Most occurring characters

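Value | Count | Frequency (%)
(space) | 5286 | 18.1%
1 | 1247 | 4.3%
(not preserved) | 1244 | 4.3%
(not preserved) | 1232 | 4.2%
(not preserved) | 1077 | 3.7%
(not preserved) | 1031 | 3.5%
(not preserved) | 1029 | 3.5%
(not preserved) | 1017 | 3.5%
(not preserved) | 1014 | 3.5%
(not preserved) | 875 | 3.0%
Other values (158) | 14163 | 48.5%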

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 15764 | 54.0%
Decimal Number | 6167 | 21.1%
Space Separator | 5286 | 18.1%
Dash Punctuation | 796 | 2.7%
Other Punctuation | 642 | 2.2%
Open Punctuation | 275 | 0.9%
Close Punctuation | 275 | 0.9%
Math Symbol | 9 | < 0.1%
Uppercase Letter | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not preserved) | 1244 | 7.9%
(not preserved) | 1232 | 7.8%
(not preserved) | 1077 | 6.8%
(not preserved) | 1031 | 6.5%
(not preserved) | 1029 | 6.5%
(not preserved) | 1017 | 6.5%
(not preserved) | 1014 | 6.4%
(not preserved) | 875 | 5.6%
(not preserved) | 810 | 5.1%
(not preserved) | 475 | 3.0%
Other values (139) | 5960 | 37.8%

Decimal Number
Value | Count | Frequency (%)
1 | 1247 | 20.2%
2 | 840 | 13.6%
3 | 761 | 12.3%
4 | 673 | 10.9%
5 | 544 | 8.8%
7 | 489 | 7.9%
6 | 478 | 7.8%
8 | 466 | 7.6%
9 | 349 | 5.7%
0 | 320 | 5.2%

Other Punctuation
Value | Count | Frequency (%)
, | 638 | 99.4%
/ | 3 | 0.5%
. | 1 | 0.2%

Space Separator
Value | Count | Frequency (%)
(space) | 5286 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 796 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 275 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 275 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 9 | 100.0%

Uppercase Letter
Value | Count | Frequency (%)
A | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 15764 | 54.0%
Common | 13450 | 46.0%
Latin | 1 | < 0.1%

Most frequent character per script

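Hangul
Value | Count | Frequency (%)
(not preserved) | 1244 | 7.9%
(not preserved) | 1232 | 7.8%
(not preserved) | 1077 | 6.8%
(not preserved) | 1031 | 6.5%
(not preserved) | 1029 | 6.5%
(not preserved) | 1017 | 6.5%
(not preserved) | 1014 | 6.4%
(not preserved) | 875 | 5.6%
(not preserved) | 810 | 5.1%
(not preserved) | 475 | 3.0%
Other values (139) | 5960 | 37.8%

Common
Value | Count | Frequency (%)
(space) | 5286 | 39.3%
1 | 1247 | 9.3%
2 | 840 | 6.2%
- | 796 | 5.9%
3 | 761 | 5.7%
4 | 673 | 5.0%
, | 638 | 4.7%
5 | 544 | 4.0%
7 | 489 | 3.6%
6 | 478 | 3.6%
Other values (8) | 1698 | 12.6%

Latin
Value | Count | Frequency (%)
A | 1 | 100.0%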

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 15764 | 54.0%
ASCII | 13451 | 46.0%

Most frequent character per block

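ASCII
Value | Count | Frequency (%)
(space) | 5286 | 39.3%
1 | 1247 | 9.3%
2 | 840 | 6.2%
- | 796 | 5.9%
3 | 761 | 5.7%
4 | 673 | 5.0%
, | 638 | 4.7%
5 | 544 | 4.0%
7 | 489 | 3.6%
6 | 478 | 3.6%
Other values (9) | 1699 | 12.6%

Hangul
Value | Count | Frequency (%)
(not preserved) | 1244 | 7.9%
(not preserved) | 1232 | 7.8%
(not preserved) | 1077 | 6.8%
(not preserved) | 1031 | 6.5%
(not preserved) | 1029 | 6.5%
(not preserved) | 1017 | 6.5%
(not preserved) | 1014 | 6.4%
(not preserved) | 875 | 5.6%
(not preserved) | 810 | 5.1%
(not preserved) | 475 | 3.0%
Other values (139) | 5960 | 37.8%
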
최초허가일
Date

Distinct: 366
Distinct (%): 36.0%
Missing: 0
Missing (%): 0.0%
Memory size: 8.1 KiB
Minimum: 2007-05-18 00:00:00
Maximum: 2022-02-17 00:00:00
Histogram with fixed size bins (bins=50)

사업개시일
Date

MISSING 

Distinct: 302
Distinct (%): 41.1%
Missing: 282
Missing (%): 27.7%
Memory size: 8.1 KiB
Minimum: 2007-10-16 00:00:00
Maximum: 2022-04-08 00:00:00
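The two percentages for 사업개시일 use different denominators. The reported figures are consistent with the missing share being taken over all 1017 rows and the distinct share being taken over the 735 non-missing rows only. The sketch below reproduces that arithmetic; it is an inference from the numbers in this report, and the function names are illustrative.

```python
def missing_pct(n_missing: int, n_rows: int) -> float:
    """Missing values as a percentage of all rows."""
    return 100.0 * n_missing / n_rows


def distinct_pct(n_distinct: int, n_rows: int, n_missing: int) -> float:
    """Distinct values as a percentage of the non-missing rows."""
    return 100.0 * n_distinct / (n_rows - n_missing)


# Figures reported for 사업개시일.
miss = missing_pct(n_missing=282, n_rows=1017)
dist = distinct_pct(n_distinct=302, n_rows=1017, n_missing=282)
print(f"missing {miss:.1f}%  distinct {dist:.1f}%")
```

Dividing 302 by all 1017 rows would instead give about 29.7%, which does not match the reported 41.1%.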
Histogram with fixed size bins (bins=50)

Interactions


Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
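Both views are derived from the same information: whether each cell is empty. The sketch below computes a per-column null count (the bar view) and a per-row boolean matrix (the matrix view) in plain Python. The three rows are taken from the sample table in this report, with `None` standing in for `<NA>`; the function names are illustrative.

```python
def nullity_by_column(rows):
    """Number of missing (None) cells in each column."""
    columns = list(rows[0].keys())
    return {c: sum(1 for r in rows if r[c] is None) for c in columns}


def nullity_matrix(rows):
    """One list per row; True where the cell is present, False where missing."""
    columns = list(rows[0].keys())
    return [[r[c] is not None for c in columns] for r in rows]


rows = [
    {"발전소명": "한가람 태양광발전소", "설비용량": 19.35,
     "최초허가일": "2022-02-17", "사업개시일": None},
    {"발전소명": "두영1호 태양광발전소", "설비용량": 99.76,
     "최초허가일": "2022-02-17", "사업개시일": None},
    {"발전소명": "대율 태양광발전소", "설비용량": 999.0,
     "최초허가일": "2016-06-09", "사업개시일": "2017-06-05"},
]
null_counts = nullity_by_column(rows)
matrix = nullity_matrix(rows)
print(null_counts)
```

In the sample rows of this report, the missing 사업개시일 values sit on the most recently permitted plants, which is the kind of pattern a nullity matrix makes visible.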

Sample

First rows
Index | 발전소명 | 설비용량 | 발전소주소 | 최초허가일 | 사업개시일
0 | 한가람 태양광발전소 | 19.35 | 충청남도 홍성군 결성면 성곡리 482-7 주1동(건물위) | 2022-02-17 | <NA>
1 | 두영1호 태양광발전소 | 99.76 | 충청남도 홍성군 금마면 죽림리 874-2, 875 주1동(건물위) | 2022-02-17 | <NA>
2 | 봉서 태양광발전소 | 57.6 | 충청남도 홍성군 금마면 봉서리 442, 441, 443, 443-1, 443-3, 470 주3동(건물위) | 2022-02-17 | <NA>
3 | 두영2호 태양광발전소 | 99.76 | 충청남도 홍성군 금마면 죽림리 874-2, 875 주2동(건물위) | 2022-02-17 | <NA>
4 | 두영3호 태양광발전소 | 99.76 | 충청남도 홍성군 금마면 죽림리 874-2, 875 주1동, 주2동(건물위) | 2022-02-17 | <NA>
5 | 황규흥2호 태양광발전소 | 53.94 | 충청남도 홍성군 결성면 형산리 509-7 주1동(건물위) | 2022-02-11 | <NA>
6 | 황규흥1호 태양광발전소 | 89.9 | 충청남도 홍성군 결성면 형산리 509-7 주1동(건물위) | 2022-02-10 | <NA>
7 | 청운브레이크 태양광발전소 | 294.98 | 충청남도 홍성군 갈산면 취생리 605 주1동, 주2동(건물위) | 2022-02-01 | <NA>
8 | 산수1호 태양광발전소 | 99.84 | 충청남도 홍성군 서부면 광리 770-4 주1동(건물위) | 2022-02-10 | <NA>
9 | 가정길2 태양광발전소 | 94.08 | 충청남도 홍성군 광천읍 가정리 432 부2동(건물위) | 2022-02-01 | <NA>

Last rows
Index | 발전소명 | 설비용량 | 발전소주소 | 최초허가일 | 사업개시일
1007 | 대율 태양광발전소 | 999.0 | 충청남도 홍성군 은하면 대율리 398 | 2016-06-09 | 2017-06-05
1008 | 월림산 태양광발전소 | 999.0 | 충청남도 홍성군 광천읍 광금남로63번길 109-14 | 2016-03-14 | 2017-09-29
1009 | ㈜우리파워15호 태양광발전소 | 603.84 | 충청남도 홍성군 장곡면 옥계리 산 90 | 2016-03-14 | 2017-12-29
1010 | 대송 태양광발전소 | 999.0 | 충청남도 홍성군 결성면 형산리 산 168-12 | 2016-02-22 | 2017-02-28
1011 | 백현 태양광발전소 | 870.0 | 충청남도 홍성군 은하면 장곡리 462-1 대율리 51-7, 51-8 | 2015-11-30 | 2017-03-09
1012 | 성남리2호 태양광발전소 | 990.0 | 충청남도 홍성군 결성면 성남리 1106-2 | 2015-09-07 | 2016-10-25
1013 | 성남리1호 태양광발전소 | 990.0 | 충청남도 홍성군 결성면 성남리 1106-2 | 2015-09-07 | 2016-10-25
1014 | 삼산에너지발전소 | 700.0 | 충청남도 홍성군 장곡면 신풍리 769 | 2015-07-08 | 2016-03-28
1015 | 단비덕실 태양광발전소 | 934.2 | 충청남도 홍성군 은하면 홍남로22번길 149 | 2015-04-29 | 2016-10-30
1016 | (주)미르에너지 태양광발전소 | 993.24 | 충청남도 홍성군 구항면 신곡리 산 56-3 | 2015-03-05 | 2018-04-25