Overview

Dataset statistics

Number of variables7
Number of observations928
Missing cells231
Missing cells (%)3.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory52.7 KiB
Average record size in memory58.1 B

Variable types

Numeric2
Text2
DateTime3

Dataset

Description보령시 태양광발전설치현황(법인발전소명, 설비허가용량, 발전소주소, 최초허가일, 사업개시일, 데이터 기준일자)에 대한 데이터를 제공하는 파일데이터입니다.
URLhttps://www.data.go.kr/data/15036099/fileData.do

Alerts

데이터 기준일자 has constant value ""Constant
사업개시일 has 231 (24.9%) missing valuesMissing
일련번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 05:07:56.047001
Analysis finished2023-12-12 05:07:57.318742
Duration1.27 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

일련번호
Real number (ℝ)

UNIQUE 

Distinct928
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean464.5
Minimum1
Maximum928
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.3 KiB
2023-12-12T14:07:57.423723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile47.35
Q1232.75
median464.5
Q3696.25
95-th percentile881.65
Maximum928
Range927
Interquartile range (IQR)463.5

Descriptive statistics

Standard deviation268.03482
Coefficient of variation (CV)0.57703945
Kurtosis-1.2
Mean464.5
Median Absolute Deviation (MAD)232
Skewness0
Sum431056
Variance71842.667
MonotonicityStrictly increasing
2023-12-12T14:07:57.605348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
625 1
 
0.1%
613 1
 
0.1%
614 1
 
0.1%
615 1
 
0.1%
616 1
 
0.1%
617 1
 
0.1%
618 1
 
0.1%
619 1
 
0.1%
620 1
 
0.1%
Other values (918) 918
98.9%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
928 1
0.1%
927 1
0.1%
926 1
0.1%
925 1
0.1%
924 1
0.1%
923 1
0.1%
922 1
0.1%
921 1
0.1%
920 1
0.1%
919 1
0.1%
Distinct900
Distinct (%)97.0%
Missing0
Missing (%)0.0%
Memory size7.4 KiB
2023-12-12T14:07:57.858325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length22
Mean length11.252155
Min length3

Characters and Unicode

Total characters10442
Distinct characters306
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique873 ?
Unique (%)94.1%

Sample

1st row관표 태양광
2nd row철수 태양광발전소
3rd row코미포6호 태양광 발전소
4th row코미포4호 태양광 발전소
5th row코미포3호 태양광 발전소
ValueCountFrequency (%)
태양광발전소 812
39.9%
발전소 63
 
3.1%
태양광 59
 
2.9%
ds하만 34
 
1.7%
은포리 20
 
1.0%
제석 18
 
0.9%
ds궁포 18
 
0.9%
요암동 17
 
0.8%
신흑동 15
 
0.7%
2호 7
 
0.3%
Other values (835) 973
47.8%
2023-12-12T14:07:58.274854image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1108
 
10.6%
906
 
8.7%
902
 
8.6%
898
 
8.6%
898
 
8.6%
889
 
8.5%
888
 
8.5%
475
 
4.5%
1 233
 
2.2%
2 159
 
1.5%
Other values (296) 3086
29.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8373
80.2%
Space Separator 1108
 
10.6%
Decimal Number 761
 
7.3%
Uppercase Letter 155
 
1.5%
Open Punctuation 12
 
0.1%
Close Punctuation 12
 
0.1%
Lowercase Letter 10
 
0.1%
Dash Punctuation 9
 
0.1%
Other Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
906
10.8%
902
 
10.8%
898
 
10.7%
898
 
10.7%
889
 
10.6%
888
 
10.6%
475
 
5.7%
93
 
1.1%
87
 
1.0%
82
 
1.0%
Other values (260) 2255
26.9%
Uppercase Letter
ValueCountFrequency (%)
S 63
40.6%
D 57
36.8%
J 7
 
4.5%
H 5
 
3.2%
K 5
 
3.2%
Y 4
 
2.6%
W 2
 
1.3%
M 2
 
1.3%
L 2
 
1.3%
N 2
 
1.3%
Other values (6) 6
 
3.9%
Decimal Number
ValueCountFrequency (%)
1 233
30.6%
2 159
20.9%
3 91
 
12.0%
4 58
 
7.6%
0 56
 
7.4%
5 42
 
5.5%
6 38
 
5.0%
7 33
 
4.3%
8 27
 
3.5%
9 24
 
3.2%
Lowercase Letter
ValueCountFrequency (%)
o 2
20.0%
c 2
20.0%
p 2
20.0%
e 2
20.0%
k 2
20.0%
Space Separator
ValueCountFrequency (%)
1108
100.0%
Open Punctuation
ValueCountFrequency (%)
( 12
100.0%
Close Punctuation
ValueCountFrequency (%)
) 12
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%
Other Punctuation
ValueCountFrequency (%)
& 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8373
80.2%
Common 1904
 
18.2%
Latin 165
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
906
10.8%
902
 
10.8%
898
 
10.7%
898
 
10.7%
889
 
10.6%
888
 
10.6%
475
 
5.7%
93
 
1.1%
87
 
1.0%
82
 
1.0%
Other values (260) 2255
26.9%
Latin
ValueCountFrequency (%)
S 63
38.2%
D 57
34.5%
J 7
 
4.2%
H 5
 
3.0%
K 5
 
3.0%
Y 4
 
2.4%
W 2
 
1.2%
o 2
 
1.2%
c 2
 
1.2%
p 2
 
1.2%
Other values (11) 16
 
9.7%
Common
ValueCountFrequency (%)
1108
58.2%
1 233
 
12.2%
2 159
 
8.4%
3 91
 
4.8%
4 58
 
3.0%
0 56
 
2.9%
5 42
 
2.2%
6 38
 
2.0%
7 33
 
1.7%
8 27
 
1.4%
Other values (5) 59
 
3.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8373
80.2%
ASCII 2069
 
19.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1108
53.6%
1 233
 
11.3%
2 159
 
7.7%
3 91
 
4.4%
S 63
 
3.0%
4 58
 
2.8%
D 57
 
2.8%
0 56
 
2.7%
5 42
 
2.0%
6 38
 
1.8%
Other values (26) 164
 
7.9%
Hangul
ValueCountFrequency (%)
906
10.8%
902
 
10.8%
898
 
10.7%
898
 
10.7%
889
 
10.6%
888
 
10.6%
475
 
5.7%
93
 
1.1%
87
 
1.0%
82
 
1.0%
Other values (260) 2255
26.9%

설비허가 용량(kW)
Real number (ℝ)

Distinct219
Distinct (%)23.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean146.56947
Minimum10
Maximum999
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.3 KiB
2023-12-12T14:07:58.770749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10
5-th percentile29.635
Q198.28
median99.2
Q399.84
95-th percentile497.28
Maximum999
Range989
Interquartile range (IQR)1.56

Descriptive statistics

Standard deviation153.67731
Coefficient of variation (CV)1.0484947
Kurtosis13.294699
Mean146.56947
Median Absolute Deviation (MAD)0.7
Skewness3.3956137
Sum136016.47
Variance23616.716
MonotonicityNot monotonic
2023-12-12T14:07:58.978908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
99.0 109
 
11.7%
99.2 72
 
7.8%
99.96 54
 
5.8%
99.4 52
 
5.6%
97.2 37
 
4.0%
99.36 37
 
4.0%
99.28 31
 
3.3%
98.8 29
 
3.1%
97.31 29
 
3.1%
99.45 29
 
3.1%
Other values (209) 449
48.4%
ValueCountFrequency (%)
10.0 1
0.1%
15.08 1
0.1%
15.6 1
0.1%
18.0 2
0.2%
18.68 1
0.1%
18.9 1
0.1%
19.0 2
0.2%
19.5 1
0.1%
19.58 2
0.2%
19.6 1
0.1%
ValueCountFrequency (%)
999.0 1
 
0.1%
998.4 2
 
0.2%
997.92 6
0.6%
996.8 1
 
0.1%
996.48 1
 
0.1%
994.95 1
 
0.1%
990.0 1
 
0.1%
909.12 1
 
0.1%
864.0 1
 
0.1%
612.72 1
 
0.1%
Distinct626
Distinct (%)67.5%
Missing0
Missing (%)0.0%
Memory size7.4 KiB
2023-12-12T14:07:59.369947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length57
Median length49
Mean length25.30819
Min length16

Characters and Unicode

Total characters23486
Distinct characters180
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique526 ?
Unique (%)56.7%

Sample

1st row충청남도 보령시 청라면 은고개길 243-16
2nd row충청남도 보령시 청라면 냉풍욕장길 75-133
3rd row충청남도 보령시 웅천읍 산업단지길 24
4th row충청남도 보령시 주교면 관창공단길 225
5th row충청남도 보령시 주교면 관창공단길 165, 코리아휠주식회사
ValueCountFrequency (%)
충청남도 926
 
16.6%
보령시 926
 
16.6%
245
 
4.4%
천북면 198
 
3.6%
남포면 131
 
2.4%
1호 130
 
2.3%
주교면 105
 
1.9%
주산면 101
 
1.8%
웅천읍 101
 
1.8%
청소면 76
 
1.4%
Other values (666) 2625
47.2%
2023-12-12T14:08:00.007587image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4636
19.7%
1074
 
4.6%
1072
 
4.6%
937
 
4.0%
936
 
4.0%
934
 
4.0%
928
 
4.0%
926
 
3.9%
1 846
 
3.6%
756
 
3.2%
Other values (170) 10441
44.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 14345
61.1%
Space Separator 4636
 
19.7%
Decimal Number 4088
 
17.4%
Dash Punctuation 267
 
1.1%
Other Punctuation 94
 
0.4%
Close Punctuation 28
 
0.1%
Open Punctuation 28
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1074
 
7.5%
1072
 
7.5%
937
 
6.5%
936
 
6.5%
934
 
6.5%
928
 
6.5%
926
 
6.5%
756
 
5.3%
755
 
5.3%
687
 
4.8%
Other values (154) 5340
37.2%
Decimal Number
ValueCountFrequency (%)
1 846
20.7%
2 538
13.2%
3 474
11.6%
6 385
9.4%
7 344
8.4%
5 342
8.4%
4 339
8.3%
9 327
 
8.0%
8 267
 
6.5%
0 226
 
5.5%
Other Punctuation
ValueCountFrequency (%)
, 90
95.7%
/ 4
 
4.3%
Space Separator
ValueCountFrequency (%)
4636
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 267
100.0%
Close Punctuation
ValueCountFrequency (%)
) 28
100.0%
Open Punctuation
ValueCountFrequency (%)
( 28
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 14345
61.1%
Common 9141
38.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1074
 
7.5%
1072
 
7.5%
937
 
6.5%
936
 
6.5%
934
 
6.5%
928
 
6.5%
926
 
6.5%
756
 
5.3%
755
 
5.3%
687
 
4.8%
Other values (154) 5340
37.2%
Common
ValueCountFrequency (%)
4636
50.7%
1 846
 
9.3%
2 538
 
5.9%
3 474
 
5.2%
6 385
 
4.2%
7 344
 
3.8%
5 342
 
3.7%
4 339
 
3.7%
9 327
 
3.6%
- 267
 
2.9%
Other values (6) 643
 
7.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 14345
61.1%
ASCII 9141
38.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4636
50.7%
1 846
 
9.3%
2 538
 
5.9%
3 474
 
5.2%
6 385
 
4.2%
7 344
 
3.8%
5 342
 
3.7%
4 339
 
3.7%
9 327
 
3.6%
- 267
 
2.9%
Other values (6) 643
 
7.0%
Hangul
ValueCountFrequency (%)
1074
 
7.5%
1072
 
7.5%
937
 
6.5%
936
 
6.5%
934
 
6.5%
928
 
6.5%
926
 
6.5%
756
 
5.3%
755
 
5.3%
687
 
4.8%
Other values (154) 5340
37.2%
Distinct224
Distinct (%)24.1%
Missing0
Missing (%)0.0%
Memory size7.4 KiB
Minimum2009-06-09 00:00:00
Maximum2021-07-27 00:00:00
2023-12-12T14:08:00.204223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:08:00.401434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

사업개시일
Date

MISSING 

Distinct175
Distinct (%)25.1%
Missing231
Missing (%)24.9%
Memory size7.4 KiB
Minimum2013-06-14 00:00:00
Maximum2021-07-22 00:00:00
2023-12-12T14:08:00.610058image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:08:00.783775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

데이터 기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size7.4 KiB
Minimum2023-08-23 00:00:00
Maximum2023-08-23 00:00:00
2023-12-12T14:08:00.916333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:08:01.033448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-12T14:07:56.777293image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:07:56.531253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:07:56.917707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:07:56.650411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T14:08:01.126970image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일련번호설비허가 용량(kW)
일련번호1.0000.331
설비허가 용량(kW)0.3311.000
2023-12-12T14:08:01.240776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일련번호설비허가 용량(kW)
일련번호1.000-0.077
설비허가 용량(kW)-0.0771.000

Missing values

2023-12-12T14:07:57.101956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T14:07:57.260166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

일련번호법인발전소명설비허가 용량(kW)발전소주소(설치장소)최초허가일사업개시일데이터 기준일자
01관표 태양광19.92충청남도 보령시 청라면 은고개길 243-162020-11-042020-12-102023-08-23
12철수 태양광발전소92.7충청남도 보령시 청라면 냉풍욕장길 75-1332020-10-26<NA>2023-08-23
23코미포6호 태양광 발전소996.48충청남도 보령시 웅천읍 산업단지길 242021-07-27<NA>2023-08-23
34코미포4호 태양광 발전소909.12충청남도 보령시 주교면 관창공단길 2252021-07-27<NA>2023-08-23
45코미포3호 태양광 발전소864.0충청남도 보령시 주교면 관창공단길 165, 코리아휠주식회사2021-07-27<NA>2023-08-23
56코미포2호 태양광 발전소448.8충청남도 보령시 주교면 관창공단길 165, 코리아휠주식회사2021-07-27<NA>2023-08-23
67김덕순3호 태양광 발전소99.84충청남도 보령시 청라면 의평리 335-22021-07-22<NA>2023-08-23
78김덕순2호 태양광 발전소99.84충청남도 보령시 청라면 의평리 355-22021-07-22<NA>2023-08-23
89김덕순1호 태양광 발전소99.84충청남도 보령시 청라면 의평리 335-22021-07-22<NA>2023-08-23
910소양2호 태양광 발전소99.65충청남도 보령시 청라면 소양리 5112021-07-21<NA>2023-08-23
일련번호법인발전소명설비허가 용량(kW)발전소주소(설치장소)최초허가일사업개시일데이터 기준일자
918919덕광전자(주)99.11충청남도 보령시 주교면 주교리 605번지 1호2014-04-092015-03-312023-08-23
919920루스터3호 태양광발전소52.47충청남도 보령시 주교면 주교리 605번지2014-04-092015-03-312023-08-23
920921루스터2호 태양광발전소69.96충청남도 보령시 주교면 주교리 606번지2014-04-092015-03-312023-08-23
921922조양6호 태양광발전소99.36충청남도 보령시 청라면 장현리 961번지 76호2014-04-032015-06-122023-08-23
922923조양5호 태양광발전소99.36충청남도 보령시 청라면 장현리 961번지 47호2014-04-032015-06-122023-08-23
923924엔에스산업 태양광발전소99.75충청남도 보령시 주포면 배재길 6-772014-03-052015-06-012023-08-23
924925화산제1호 태양광발전소96.46충청남도 보령시 대청로 397 (화산동)2014-02-252014-07-102023-08-23
925926원산미영 태양광발전소25.52충청남도 보령시 오천면 원산도3길 1572013-03-142013-06-142023-08-23
926927보령그린환경(주)보령그린 발전소440.0충청남도 보령시 해안로 543 (남곡동)2012-04-092016-05-202023-08-23
927928청천소수력 발전소490.0충청남도 보령시 죽성로 94-19 (죽정동)2009-06-09<NA>2023-08-23