Overview

Dataset statistics

Number of variables: 6
Number of observations: 895
Missing cells: 230
Missing cells (%): 4.3%
Duplicate rows: 1
Duplicate rows (%): 0.1%
Total size in memory: 43.0 KiB
Average record size in memory: 49.1 B
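
The two percentages above can be re-derived from the raw counts. The sketch below assumes that the missing-cell percentage is taken over all cells (variables × observations) and the duplicate percentage over all rows; both assumptions are consistent with the reported figures.

```python
# Figures copied from the dataset statistics above.
n_variables = 6
n_observations = 895
missing_cells = 230
duplicate_rows = 1

# Assumption: percentages are relative to all cells / all rows.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations

print(f"Total cells: {total_cells}")
print(f"Missing cells: {missing_pct:.1f}%")
print(f"Duplicate rows: {duplicate_pct:.1f}%")
```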

Variable types

Text: 2
Numeric: 1
DateTime: 2
Categorical: 1

Dataset

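Description: File data on the status of solar power plant installations in Boryeong-si, with the columns 법인발전소명 (corporation/plant name), 설비허가용량 (permitted facility capacity), 발전소주소 (plant address), 최초허가일 (initial permit date), 사업개시일 (business start date), and 데이터 기준일자 (data reference date).
Author: Chungcheongnam-do (충청남도)
URL: https://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=385&beforeMenuCd=DOM_000000201001001000&publicdatapk=15036099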

Alerts

데이터 기준일자 has constant value "2021-04-13" (Constant)
Dataset has 1 (0.1%) duplicate rows (Duplicates)
사업개시일 has 227 (25.4%) missing values (Missing)

Reproduction

Analysis started: 2024-01-09 22:06:30.772975
Analysis finished: 2024-01-09 22:06:31.342247
Duration: 0.57 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
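
As a quick check, the reported duration follows from the two timestamps above. This is a minimal sketch using only the standard library.

```python
from datetime import datetime

# Timestamps copied from the reproduction details above.
started = datetime.fromisoformat("2024-01-09 22:06:30.772975")
finished = datetime.fromisoformat("2024-01-09 22:06:31.342247")

duration = (finished - started).total_seconds()
print(f"Duration: {duration:.2f} seconds")
```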

Variables

법인발전소명
Text

Distinct: 866
Distinct (%): 96.8%
Missing: 0
Missing (%): 0.0%
Memory size: 7.1 KiB

Length

Max length: 24
Median length: 22
Mean length: 11.146369
Min length: 1

Characters and Unicode

Total characters: 9976
Distinct characters: 300
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
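
To illustrate what a character category is, the sketch below counts Unicode general categories for one sample value using Python's standard `unicodedata` module. It illustrates the idea only and is not necessarily how the report computes its tables; scripts and blocks are not exposed by `unicodedata` and are left out.

```python
import unicodedata
from collections import Counter

# One sample value taken from this report.
sample = "관표 태양광"

# unicodedata.category returns two-letter codes, for example:
# Lo = Other Letter, Zs = Space Separator, Nd = Decimal Number.
categories = Counter(unicodedata.category(ch) for ch in sample)

for code, count in categories.most_common():
    print(code, count)
```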

Unique

Unique: 839
Unique (%): 93.7%

Sample

1st row: 관표 태양광
2nd row: 철수 태양광발전소
3rd row: 신구 행복1호 태양광 발전소
4th row: 청라 태양광 발전소
5th row: 오성 태양광발전소

Value | Count | Frequency (%)
태양광발전소 | 812 | 42.2%
ds하만 | 34 | 1.8%
발전소 | 24 | 1.2%
태양광 | 21 | 1.1%
은포리 | 20 | 1.0%
ds궁포 | 18 | 0.9%
제석 | 18 | 0.9%
요암동 | 17 | 0.9%
신흑동 | 15 | 0.8%
2호 | 8 | 0.4%
Other values (798) | 937 | 48.7%

Most occurring characters

ValueCountFrequency (%)
1029
 
10.3%
867
 
8.7%
860
 
8.6%
860
 
8.6%
856
 
8.6%
850
 
8.5%
850
 
8.5%
451
 
4.5%
1 232
 
2.3%
2 151
 
1.5%
Other values (290) 2970
29.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 8004 | 80.2%
Space Separator | 1029 | 10.3%
Decimal Number | 741 | 7.4%
Uppercase Letter | 155 | 1.6%
Open Punctuation | 13 | 0.1%
Close Punctuation | 13 | 0.1%
Lowercase Letter | 10 | 0.1%
Dash Punctuation | 9 | 0.1%
Other Punctuation | 2 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
867
10.8%
860
 
10.7%
860
 
10.7%
856
 
10.7%
850
 
10.6%
850
 
10.6%
451
 
5.6%
89
 
1.1%
85
 
1.1%
82
 
1.0%
Other values (254) 2154
26.9%
Uppercase Letter
Value | Count | Frequency (%)
S | 63 | 40.6%
D | 57 | 36.8%
J | 7 | 4.5%
H | 5 | 3.2%
K | 5 | 3.2%
Y | 4 | 2.6%
W | 2 | 1.3%
M | 2 | 1.3%
N | 2 | 1.3%
L | 2 | 1.3%
Other values (6) | 6 | 3.9%

Decimal Number
Value | Count | Frequency (%)
1 | 232 | 31.3%
2 | 151 | 20.4%
3 | 85 | 11.5%
0 | 56 | 7.6%
4 | 56 | 7.6%
5 | 42 | 5.7%
6 | 37 | 5.0%
7 | 32 | 4.3%
8 | 26 | 3.5%
9 | 24 | 3.2%

Lowercase Letter
Value | Count | Frequency (%)
p | 2 | 20.0%
e | 2 | 20.0%
o | 2 | 20.0%
c | 2 | 20.0%
k | 2 | 20.0%

Space Separator
Value | Count | Frequency (%)
(space) | 1029 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 13 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 13 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 9 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
& | 2 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 8004 | 80.2%
Common | 1807 | 18.1%
Latin | 165 | 1.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
867
10.8%
860
 
10.7%
860
 
10.7%
856
 
10.7%
850
 
10.6%
850
 
10.6%
451
 
5.6%
89
 
1.1%
85
 
1.1%
82
 
1.0%
Other values (254) 2154
26.9%
Latin
Value | Count | Frequency (%)
S | 63 | 38.2%
D | 57 | 34.5%
J | 7 | 4.2%
H | 5 | 3.0%
K | 5 | 3.0%
Y | 4 | 2.4%
p | 2 | 1.2%
W | 2 | 1.2%
e | 2 | 1.2%
o | 2 | 1.2%
Other values (11) | 16 | 9.7%

Common
Value | Count | Frequency (%)
(space) | 1029 | 56.9%
1 | 232 | 12.8%
2 | 151 | 8.4%
3 | 85 | 4.7%
0 | 56 | 3.1%
4 | 56 | 3.1%
5 | 42 | 2.3%
6 | 37 | 2.0%
7 | 32 | 1.8%
8 | 26 | 1.4%
Other values (5) | 61 | 3.4%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 8004 | 80.2%
ASCII | 1972 | 19.8%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 1029 | 52.2%
1 | 232 | 11.8%
2 | 151 | 7.7%
3 | 85 | 4.3%
S | 63 | 3.2%
D | 57 | 2.9%
0 | 56 | 2.8%
4 | 56 | 2.8%
5 | 42 | 2.1%
6 | 37 | 1.9%
Other values (26) | 164 | 8.3%
Hangul
ValueCountFrequency (%)
867
10.8%
860
 
10.7%
860
 
10.7%
856
 
10.7%
850
 
10.6%
850
 
10.6%
451
 
5.6%
89
 
1.1%
85
 
1.1%
82
 
1.0%
Other values (254) 2154
26.9%

설비허가 용량(kW)
Real number (ℝ)

Distinct: 208
Distinct (%): 23.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 141.53238
Minimum: 0
Maximum: 999
Zeros: 2
Zeros (%): 0.2%
Negative: 0
Negative (%): 0.0%
Memory size: 8.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 30
Q1: 98.28
median: 99.2
Q3: 99.72
95-th percentile: 496.8
Maximum: 999
Range: 999
Interquartile range (IQR): 1.44
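
The range and IQR are simple differences of the quantiles listed above. A minimal sketch, using the values as reported:

```python
# Quantiles of 설비허가 용량(kW) as reported above.
minimum, maximum = 0, 999
q1, q3 = 98.28, 99.72

value_range = maximum - minimum   # spread of the full data
iqr = q3 - q1                     # spread of the middle 50%

print(f"Range: {value_range}")
print(f"IQR: {iqr:.2f}")
```

The IQR is tiny compared with the range because half of the permitted capacities fall between 98.28 and 99.72 kW, while a few plants reach almost 1000 kW.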

Descriptive statistics

Standard deviation: 138.90022
Coefficient of variation (CV): 0.98140243
Kurtosis: 14.373635
Mean: 141.53238
Median Absolute Deviation (MAD): 0.7
Skewness: 3.4211073
Sum: 126671.48
Variance: 19293.272
Monotonicity: Not monotonic
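
Several of these statistics are tied together by standard definitions: CV = standard deviation / mean, variance = standard deviation squared, and mean = sum / count. The sketch below re-derives them from the reported, already rounded values, so small rounding differences are expected.

```python
# Values copied from the descriptive statistics and the dataset overview.
mean = 141.53238
std = 138.90022
total = 126671.48
n_observations = 895

cv = std / mean                       # coefficient of variation
variance = std ** 2                   # variance from the standard deviation
mean_from_sum = total / n_observations

print(f"CV: {cv:.6f}")
print(f"Variance: {variance:.2f}")
print(f"Mean from sum: {mean_from_sum:.4f}")
```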
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
99.0 | 109 | 12.2%
99.2 | 72 | 8.0%
99.96 | 54 | 6.0%
99.4 | 52 | 5.8%
97.2 | 37 | 4.1%
99.36 | 36 | 4.0%
99.28 | 31 | 3.5%
98.8 | 29 | 3.2%
99.45 | 29 | 3.2%
97.31 | 29 | 3.2%
Other values (198) | 417 | 46.6%

Minimum 10 values
Value | Count | Frequency (%)
0.0 | 2 | 0.2%
0.01 | 1 | 0.1%
10.0 | 1 | 0.1%
15.08 | 1 | 0.1%
15.6 | 1 | 0.1%
18.0 | 2 | 0.2%
18.68 | 1 | 0.1%
18.9 | 1 | 0.1%
19.0 | 2 | 0.2%
19.5 | 1 | 0.1%

Maximum 10 values
Value | Count | Frequency (%)
999.0 | 1 | 0.1%
998.4 | 2 | 0.2%
997.92 | 4 | 0.4%
996.8 | 1 | 0.1%
990.0 | 1 | 0.1%
500.0 | 1 | 0.1%
499.84 | 1 | 0.1%
499.8 | 1 | 0.1%
499.72 | 4 | 0.4%
499.2 | 3 | 0.3%

발전소주소(설치장소)
Text

Distinct: 699
Distinct (%): 78.4%
Missing: 3
Missing (%): 0.3%
Memory size: 7.1 KiB

Length

Max length: 74
Median length: 50
Mean length: 22.109865
Min length: 10

Characters and Unicode

Total characters: 19722
Distinct characters: 155
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 631
Unique (%): 70.7%

Sample

1st row: 충남 보령시 청라면 음현리 28 / 건물 위
2nd row: 충남 보령시 청라면 의평리 332-4
3rd row: 보령시 주산면 신구리 414번지
4th row: 보령시 청라면 나원리 994-2
5th row: 보령시 주교면 외평길 10 / 건물 위

Value | Count | Frequency (%)
보령시 | 879 | 20.1%
충남 | 283 | 6.5%
천북면 | 192 | 4.4%
남포면 | 122 | 2.8%
웅천읍 | 100 | 2.3%
주산면 | 96 | 2.2%
주교면 | 95 | 2.2%
청소면 | 76 | 1.7%
청라면 | 59 | 1.4%
 | 59 | 1.4%
Other values (1093) | 2408 | 55.1%

Most occurring characters

ValueCountFrequency (%)
3479
17.6%
- 1075
 
5.5%
1 1053
 
5.3%
887
 
4.5%
884
 
4.5%
882
 
4.5%
802
 
4.1%
723
 
3.7%
2 702
 
3.6%
3 585
 
3.0%
Other values (145) 8650
43.9%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 9202 | 46.7%
Decimal Number | 5265 | 26.7%
Space Separator | 3479 | 17.6%
Dash Punctuation | 1075 | 5.5%
Other Punctuation | 546 | 2.8%
Open Punctuation | 73 | 0.4%
Close Punctuation | 73 | 0.4%
Uppercase Letter | 6 | < 0.1%
Other Symbol | 2 | < 0.1%
Math Symbol | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
887
 
9.6%
884
 
9.6%
882
 
9.6%
802
 
8.7%
723
 
7.9%
464
 
5.0%
420
 
4.6%
359
 
3.9%
297
 
3.2%
247
 
2.7%
Other values (122) 3237
35.2%
Decimal Number
Value | Count | Frequency (%)
1 | 1053 | 20.0%
2 | 702 | 13.3%
3 | 585 | 11.1%
5 | 496 | 9.4%
6 | 492 | 9.3%
4 | 468 | 8.9%
9 | 451 | 8.6%
7 | 392 | 7.4%
8 | 360 | 6.8%
0 | 266 | 5.1%

Uppercase Letter
Value | Count | Frequency (%)
A | 2 | 33.3%
B | 2 | 33.3%
D | 1 | 16.7%
C | 1 | 16.7%

Other Punctuation
Value | Count | Frequency (%)
, | 447 | 81.9%
/ | 97 | 17.8%
. | 2 | 0.4%

Space Separator
Value | Count | Frequency (%)
(space) | 3479 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 1075 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 73 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 73 | 100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 10514 | 53.3%
Hangul | 9202 | 46.7%
Latin | 6 | < 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
887
 
9.6%
884
 
9.6%
882
 
9.6%
802
 
8.7%
723
 
7.9%
464
 
5.0%
420
 
4.6%
359
 
3.9%
297
 
3.2%
247
 
2.7%
Other values (122) 3237
35.2%
Common
Value | Count | Frequency (%)
(space) | 3479 | 33.1%
- | 1075 | 10.2%
1 | 1053 | 10.0%
2 | 702 | 6.7%
3 | 585 | 5.6%
5 | 496 | 4.7%
6 | 492 | 4.7%
4 | 468 | 4.5%
9 | 451 | 4.3%
, | 447 | 4.3%
Other values (9) | 1266 | 12.0%

Latin
Value | Count | Frequency (%)
A | 2 | 33.3%
B | 2 | 33.3%
D | 1 | 16.7%
C | 1 | 16.7%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 10518 | 53.3%
Hangul | 9202 | 46.7%
CJK Compat | 2 | < 0.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 3479 | 33.1%
- | 1075 | 10.2%
1 | 1053 | 10.0%
2 | 702 | 6.7%
3 | 585 | 5.6%
5 | 496 | 4.7%
6 | 492 | 4.7%
4 | 468 | 4.4%
9 | 451 | 4.3%
, | 447 | 4.2%
Other values (12) | 1270 | 12.1%
Hangul
ValueCountFrequency (%)
887
 
9.6%
884
 
9.6%
882
 
9.6%
802
 
8.7%
723
 
7.9%
464
 
5.0%
420
 
4.6%
359
 
3.9%
297
 
3.2%
247
 
2.7%
Other values (122) 3237
35.2%
CJK Compat
ValueCountFrequency (%)
2
100.0%

최초허가일
Date

Distinct: 212
Distinct (%): 23.7%
Missing: 0
Missing (%): 0.0%
Memory size: 7.1 KiB
Minimum: 2009-06-09 00:00:00
Maximum: 2021-03-24 00:00:00
Histogram with fixed size bins (bins=50)

사업개시일
Date

MISSING 

Distinct: 165
Distinct (%): 24.7%
Missing: 227
Missing (%): 25.4%
Memory size: 7.1 KiB
Minimum: 2013-06-14 00:00:00
Maximum: 2021-03-31 00:00:00
Histogram with fixed size bins (bins=50)
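
The percentages for 사업개시일 use different denominators. The reported figures are consistent with the missing percentage being taken over all 895 rows and the distinct percentage over the non-missing rows only. The sketch below checks that reading; it is inferred from the numbers, not from the tool's documentation.

```python
# Counts for 사업개시일 as reported above.
n_rows = 895
missing = 227
distinct = 165

present = n_rows - missing                   # rows with a value

missing_pct = 100 * missing / n_rows         # share of all rows
distinct_pct = 100 * distinct / present      # assumption: non-missing rows only
distinct_pct_all = 100 * distinct / n_rows   # alternative reading, for comparison

print(f"Missing: {missing_pct:.1f}%")
print(f"Distinct (non-missing rows): {distinct_pct:.1f}%")
print(f"Distinct (all rows): {distinct_pct_all:.1f}%")
```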

데이터 기준일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 7.1 KiB
2021-04-13: 895

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2021-04-13
2nd row: 2021-04-13
3rd row: 2021-04-13
4th row: 2021-04-13
5th row: 2021-04-13

Common Values

Value | Count | Frequency (%)
2021-04-13 | 895 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2021-04-13 | 895 | 100.0%

Interactions


Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
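
One common way to quantify nullity correlation is the Pearson correlation between per-column presence indicators (1 if a value is present, 0 if it is missing). The sketch below implements that idea in plain Python on a small hypothetical sample; the function name `nullity_correlation` and the toy rows are illustrative assumptions, and this is not claimed to be the exact computation behind the heatmap.

```python
from math import sqrt, isnan

def nullity_correlation(col_a, col_b):
    """Pearson correlation between the presence indicators of two columns."""
    x = [0 if v is None else 1 for v in col_a]
    y = [0 if v is None else 1 for v in col_b]
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    spread_x = sqrt(sum((xi - mean_x) ** 2 for xi in x))
    spread_y = sqrt(sum((yi - mean_y) ** 2 for yi in y))
    if spread_x == 0 or spread_y == 0:
        # A column that is fully present (or fully missing) has no variation.
        return float("nan")
    return cov / (spread_x * spread_y)

# Hypothetical toy rows, not taken from the dataset.
address = ["a", None, "c", "d"]
start_date = ["2020-12-10", None, None, "2015-03-31"]
complete = ["x", "x", "x", "x"]

r = nullity_correlation(address, start_date)
print(f"address vs start_date: {r:.3f}")
print(isnan(nullity_correlation(address, complete)))
```

A value near 1 means the two columns tend to be missing in the same rows, a value near -1 means one tends to be present when the other is missing, and fully populated columns yield no defined value.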

Sample

Index | 법인발전소명 | 설비허가 용량(kW) | 발전소주소(설치장소) | 최초허가일 | 사업개시일 | 데이터 기준일자
0 | 관표 태양광 | 19.92 | 충남 보령시 청라면 음현리 28 / 건물 위 | 2020-11-04 | 2020-12-10 | 2021-04-13
1 | 철수 태양광발전소 | 92.7 | 충남 보령시 청라면 의평리 332-4 | 2020-10-26 | <NA> | 2021-04-13
2 | 신구 행복1호 태양광 발전소 | 29.58 | 보령시 주산면 신구리 414번지 | 2021-03-24 | <NA> | 2021-04-13
3 | 청라 태양광 발전소 | 29.75 | 보령시 청라면 나원리 994-2 | 2021-03-04 | <NA> | 2021-04-13
4 | 오성 태양광발전소 | 29.6 | 보령시 주교면 외평길 10 / 건물 위 | 2021-03-04 | <NA> | 2021-04-13
5 | 남옥 태양광 발전소 | 49.73 | 보령시 남포면 양기리 226 / 건물 위 | 2021-02-15 | <NA> | 2021-04-13
6 | 연희 태양광 발전소 | 99.9 | 보령시 청라면 향천리 38-1, 38-2, 38-3, 38-7, 40 / 건물위 | 2021-02-09 | <NA> | 2021-04-13
7 | 윤서에너지 태양광발전소 | 997.92 | 보령시 청소면 장곡리 209-1 | 2018-09-21 | <NA> | 2021-04-13
8 | 대경천북 태양광발전소 | 997.92 | 보령시 천북면 궁포리 360-6, 360-7 | 2018-09-27 | <NA> | 2021-04-13
9 | (주)수림일호 태양광발전소 | 997.92 | 보령시 천북면 궁포리 360-6, 360-7 | 2018-09-27 | <NA> | 2021-04-13

Index | 법인발전소명 | 설비허가 용량(kW) | 발전소주소(설치장소) | 최초허가일 | 사업개시일 | 데이터 기준일자
885 | 덕광전자(주) | 99.11 | 충남 보령시 주교면 주교리 605-1 / 건물위 | 2014-04-09 | 2015-03-31 | 2021-04-13
886 | 루스터3호 태양광발전소 | 52.47 | 충남 보령시 주교면 주교리 605/건물위 | 2014-04-09 | 2015-03-31 | 2021-04-13
887 | 루스터2호 태양광발전소 | 69.96 | 충남 보령시 주교면 주교리 606, 607/건물위 | 2014-04-09 | 2015-03-31 | 2021-04-13
888 | 조양6호 태양광발전소 | 99.36 | 보령시 청라면 장현리 961-76 | 2014-04-03 | 2015-06-12 | 2021-04-13
889 | 조양5호 태양광발전소 | 99.36 | 보령시 청라면 장현리 961-47, 961-75(2필지) | 2014-04-03 | 2015-06-12 | 2021-04-13
890 | 엔에스산업 태양광발전소 | 99.75 | 충남 보령시 주포면 배지길 6-77(1동공장)/건물위 | 2014-03-05 | 2015-06-01 | 2021-04-13
891 | 화산제1호 태양광발전소 | 96.46 | 보령시 대청로 397(화산동) /건물 위 | 2014-02-25 | 2014-07-10 | 2021-04-13
892 | 원산미영 태양광발전소 | 25.52 | 충남 보령시 오천면 원산도리 205, 207(2필지) | 2013-03-14 | 2013-06-14 | 2021-04-13
893 | 보령그린환경(주)보령그린 발전소 | 440.0 | 보령시 남곡동 1140-6 외 5필지 | 2012-04-09 | 2016-05-20 | 2021-04-13
894 | 청천소수력 발전소 | 490.0 | 충남 보령시 죽정동 12-3 | 2009-06-09 | <NA> | 2021-04-13

Duplicate rows

Most frequently occurring

Index | 법인발전소명 | 설비허가 용량(kW) | 발전소주소(설치장소) | 최초허가일 | 사업개시일 | 데이터 기준일자 | # duplicates
0 | 1 | 0.0 | <NA> | 2013-01-01 | <NA> | 2021-04-13 | 2