Overview

Dataset statistics

Number of variables6
Number of observations349
Missing cells210
Missing cells (%)10.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory16.8 KiB
Average record size in memory49.4 B

Variable types

Categorical1
Text2
Numeric1
DateTime2

Dataset

Description태양광발전 시설 현황(발전소명, 설비용량, 발전소주소, 최초허가일, 사업개시일)에 대한 정보를 제공하고 있습니다.
Author충청남도 태안군
URLhttps://www.data.go.kr/data/15033988/fileData.do

Alerts

구분 has constant value ""Constant
사업개시일 has 210 (60.2%) missing valuesMissing

Reproduction

Analysis started2023-12-23 08:13:30.701164
Analysis finished2023-12-23 08:13:31.818230
Duration1.12 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
전기사업허가
349 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row전기사업허가
2nd row전기사업허가
3rd row전기사업허가
4th row전기사업허가
5th row전기사업허가

Common Values

ValueCountFrequency (%)
전기사업허가 349
100.0%

Length

2023-12-23T08:13:32.077589image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-23T08:13:32.390714image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전기사업허가 349
100.0%
Distinct330
Distinct (%)94.6%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
2023-12-23T08:13:33.007905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length16
Mean length11.232092
Min length2

Characters and Unicode

Total characters3920
Distinct characters235
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique317 ?
Unique (%)90.8%

Sample

1st row마정2
2nd row윤경
3rd row원북면장대1리 태양광발전소
4th row신두3리 다목적회관
5th row방갈2리 다목적회관
ValueCountFrequency (%)
태양광발전소 303
41.8%
쏠라포스 23
 
3.2%
2호 8
 
1.1%
3호 7
 
1.0%
1호 6
 
0.8%
㈜썬솔라에너지 6
 
0.8%
다도 5
 
0.7%
4호 5
 
0.7%
유한회사 5
 
0.7%
방갈1리 4
 
0.6%
Other values (317) 353
48.7%
2023-12-23T08:13:34.235714image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
382
 
9.7%
376
 
9.6%
347
 
8.9%
333
 
8.5%
330
 
8.4%
330
 
8.4%
329
 
8.4%
190
 
4.8%
1 73
 
1.9%
59
 
1.5%
Other values (225) 1171
29.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3250
82.9%
Space Separator 376
 
9.6%
Decimal Number 265
 
6.8%
Other Symbol 14
 
0.4%
Uppercase Letter 11
 
0.3%
Open Punctuation 2
 
0.1%
Close Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
382
11.8%
347
10.7%
333
 
10.2%
330
 
10.2%
330
 
10.2%
329
 
10.1%
190
 
5.8%
59
 
1.8%
49
 
1.5%
36
 
1.1%
Other values (205) 865
26.6%
Decimal Number
ValueCountFrequency (%)
1 73
27.5%
2 52
19.6%
3 35
13.2%
4 33
12.5%
5 21
 
7.9%
6 14
 
5.3%
7 10
 
3.8%
8 10
 
3.8%
9 9
 
3.4%
0 8
 
3.0%
Uppercase Letter
ValueCountFrequency (%)
S 3
27.3%
B 3
27.3%
K 2
18.2%
R 1
 
9.1%
C 1
 
9.1%
M 1
 
9.1%
Space Separator
ValueCountFrequency (%)
376
100.0%
Other Symbol
ValueCountFrequency (%)
14
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3264
83.3%
Common 645
 
16.5%
Latin 11
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
382
11.7%
347
 
10.6%
333
 
10.2%
330
 
10.1%
330
 
10.1%
329
 
10.1%
190
 
5.8%
59
 
1.8%
49
 
1.5%
36
 
1.1%
Other values (206) 879
26.9%
Common
ValueCountFrequency (%)
376
58.3%
1 73
 
11.3%
2 52
 
8.1%
3 35
 
5.4%
4 33
 
5.1%
5 21
 
3.3%
6 14
 
2.2%
7 10
 
1.6%
8 10
 
1.6%
9 9
 
1.4%
Other values (3) 12
 
1.9%
Latin
ValueCountFrequency (%)
S 3
27.3%
B 3
27.3%
K 2
18.2%
R 1
 
9.1%
C 1
 
9.1%
M 1
 
9.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3250
82.9%
ASCII 656
 
16.7%
None 14
 
0.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
382
11.8%
347
10.7%
333
 
10.2%
330
 
10.2%
330
 
10.2%
329
 
10.1%
190
 
5.8%
59
 
1.8%
49
 
1.5%
36
 
1.1%
Other values (205) 865
26.6%
ASCII
ValueCountFrequency (%)
376
57.3%
1 73
 
11.1%
2 52
 
7.9%
3 35
 
5.3%
4 33
 
5.0%
5 21
 
3.2%
6 14
 
2.1%
7 10
 
1.5%
8 10
 
1.5%
9 9
 
1.4%
Other values (9) 23
 
3.5%
None
ValueCountFrequency (%)
14
100.0%

설비용량
Real number (ℝ)

Distinct112
Distinct (%)32.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean296.12444
Minimum11.83
Maximum1951.2
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.2 KiB
2023-12-23T08:13:34.646577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum11.83
5-th percentile19.587
Q198.1
median99.68
Q3499.8
95-th percentile998.4
Maximum1951.2
Range1939.37
Interquartile range (IQR)401.7

Descriptive statistics

Standard deviation356.14882
Coefficient of variation (CV)1.2026998
Kurtosis0.93350455
Mean296.12444
Median Absolute Deviation (MAD)43.12
Skewness1.4072339
Sum103347.43
Variance126841.98
MonotonicityNot monotonic
2023-12-23T08:13:35.110135image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
99.68 42
 
12.0%
99.76 26
 
7.4%
998.4 26
 
7.4%
99.44 25
 
7.2%
499.96 21
 
6.0%
99.6 20
 
5.7%
98.1 14
 
4.0%
99.84 14
 
4.0%
96.3 7
 
2.0%
991.2 5
 
1.4%
Other values (102) 149
42.7%
ValueCountFrequency (%)
11.83 1
0.3%
12.6 1
0.3%
15.0 1
0.3%
15.3 1
0.3%
15.47 1
0.3%
16.02 1
0.3%
16.2 1
0.3%
17.85 2
0.6%
18.06 1
0.3%
19.04 1
0.3%
ValueCountFrequency (%)
1951.2 1
 
0.3%
999.9 1
 
0.3%
999.18 2
 
0.6%
999.0 5
 
1.4%
998.64 3
 
0.9%
998.4 26
7.4%
997.92 5
 
1.4%
997.5 1
 
0.3%
996.84 4
 
1.1%
991.2 5
 
1.4%
Distinct220
Distinct (%)63.0%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
2023-12-23T08:13:35.597152image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length104
Median length51
Mean length25.808023
Min length13

Characters and Unicode

Total characters9007
Distinct characters98
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique172 ?
Unique (%)49.3%

Sample

1st row태안군 안면읍 중장리 425-348, 425-349
2nd row태안군 원북면 장대리 7-2 제2동
3rd row태안군 원북면 장대리 233-2
4th row태안군 원북면 신두리 1221--12
5th row태안군 원북면 방갈리 515-131(건물위)
ValueCountFrequency (%)
태안군 343
19.2%
178
 
10.0%
안면읍 135
 
7.6%
정당리 90
 
5.0%
소원면 80
 
4.5%
원북면 48
 
2.7%
모항리 35
 
2.0%
태안읍 35
 
2.0%
중장리 34
 
1.9%
충청남도 32
 
1.8%
Other values (333) 778
43.5%
2023-12-23T08:13:36.239130image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1443
 
16.0%
1 588
 
6.5%
514
 
5.7%
- 481
 
5.3%
378
 
4.2%
2 360
 
4.0%
348
 
3.9%
343
 
3.8%
315
 
3.5%
6 298
 
3.3%
Other values (88) 3939
43.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3865
42.9%
Decimal Number 2595
28.8%
Space Separator 1443
 
16.0%
Dash Punctuation 481
 
5.3%
Open Punctuation 212
 
2.4%
Close Punctuation 212
 
2.4%
Other Punctuation 199
 
2.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
514
13.3%
378
 
9.8%
348
 
9.0%
343
 
8.9%
315
 
8.2%
188
 
4.9%
170
 
4.4%
139
 
3.6%
138
 
3.6%
107
 
2.8%
Other values (72) 1225
31.7%
Decimal Number
ValueCountFrequency (%)
1 588
22.7%
2 360
13.9%
6 298
11.5%
5 286
11.0%
7 270
10.4%
4 229
 
8.8%
3 183
 
7.1%
0 158
 
6.1%
9 116
 
4.5%
8 107
 
4.1%
Other Punctuation
ValueCountFrequency (%)
, 192
96.5%
/ 7
 
3.5%
Space Separator
ValueCountFrequency (%)
1443
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 481
100.0%
Open Punctuation
ValueCountFrequency (%)
( 212
100.0%
Close Punctuation
ValueCountFrequency (%)
) 212
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5142
57.1%
Hangul 3865
42.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
514
13.3%
378
 
9.8%
348
 
9.0%
343
 
8.9%
315
 
8.2%
188
 
4.9%
170
 
4.4%
139
 
3.6%
138
 
3.6%
107
 
2.8%
Other values (72) 1225
31.7%
Common
ValueCountFrequency (%)
1443
28.1%
1 588
11.4%
- 481
 
9.4%
2 360
 
7.0%
6 298
 
5.8%
5 286
 
5.6%
7 270
 
5.3%
4 229
 
4.5%
( 212
 
4.1%
) 212
 
4.1%
Other values (6) 763
14.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5142
57.1%
Hangul 3865
42.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1443
28.1%
1 588
11.4%
- 481
 
9.4%
2 360
 
7.0%
6 298
 
5.8%
5 286
 
5.6%
7 270
 
5.3%
4 229
 
4.5%
( 212
 
4.1%
) 212
 
4.1%
Other values (6) 763
14.8%
Hangul
ValueCountFrequency (%)
514
13.3%
378
 
9.8%
348
 
9.0%
343
 
8.9%
315
 
8.2%
188
 
4.9%
170
 
4.4%
139
 
3.6%
138
 
3.6%
107
 
2.8%
Other values (72) 1225
31.7%
Distinct112
Distinct (%)32.1%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
Minimum2014-02-04 00:00:00
Maximum2023-11-30 00:00:00
2023-12-23T08:13:36.562108image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-23T08:13:36.971661image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

사업개시일
Date

MISSING 

Distinct60
Distinct (%)43.2%
Missing210
Missing (%)60.2%
Memory size2.9 KiB
Minimum2015-03-12 00:00:00
Maximum2025-07-21 00:00:00
2023-12-23T08:13:37.355917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-23T08:13:37.773094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-23T08:13:31.203961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-23T08:13:38.033072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
설비용량사업개시일
설비용량1.0000.976
사업개시일0.9761.000

Missing values

2023-12-23T08:13:31.438181image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-23T08:13:31.637822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분발전소명설비용량발전소주소최초허가일사업개시일
0전기사업허가마정2496.47태안군 안면읍 중장리 425-348, 425-3492019-12-262021-08-25
1전기사업허가윤경99.9태안군 원북면 장대리 7-2 제2동2019-12-262020-05-11
2전기사업허가원북면장대1리 태양광발전소146.25태안군 원북면 장대리 233-22019-12-272020-07-30
3전기사업허가신두3리 다목적회관94.5태안군 원북면 신두리 1221--122020-01-032020-09-12
4전기사업허가방갈2리 다목적회관37.35태안군 원북면 방갈리 515-131(건물위)2020-01-152020-07-24
5전기사업허가에이치파워1호99.54태안군 원북면 방갈리 176-132020-04-032020-12-15
6전기사업허가제이에이치 태양광발전소99.54태안군 원북면 방갈리 176-132020-04-032020-12-15
7전기사업허가솜씨에너지498.42태안군 소원면 소근리 421-2772020-04-10<NA>
8전기사업허가위드안면도19.89태안군 안면읍 승언리 1336-25(건물위)2020-04-302020-08-24
9전기사업허가김오묵 태양광발전소19.5태안군 태안읍 동문리 900-2(건물 위)2020-05-212020-09-16
구분발전소명설비용량발전소주소최초허가일사업개시일
339전기사업허가양산12호 태양광발전소98.1태안군 원북면 양산리 635-1(토지 위)2023-09-01<NA>
340전기사업허가양산13호 태양광발전소98.1태안군 원북면 양산리 635-1(토지 위)2023-09-01<NA>
341전기사업허가양산14호 태양광발전소98.1태안군 원북면 양산리 635-1(토지 위)2023-09-01<NA>
342전기사업허가양산15호 태양광발전소98.1태안군 원북면 양산리 635, 635-1, 635-2(토지 위)2023-09-01<NA>
343전기사업허가케이엠에프 태안 태양광발전소1951.2태안군 남면 양잠리 1276-24, 1276-25, 1276-26, 1276-27(건물 위)2021-03-04<NA>
344전기사업허가명헌식 태양광발전소19.04태안군 태안읍 평천리 627-35(건물 위)2023-09-07<NA>
345전기사업허가씨엔코 태양광발전소70.85태안군 태안읍 인평리 260-1(건물 위)2023-09-14<NA>
346전기사업허가애플태안 태양광발전소499.8태안군 안면읍 중장리 425-247, 425-3442023-10-04<NA>
347전기사업허가지운례 태양광발전소49.98태안군 태안읍 도내리 93-1(건물 위)2023-11-30<NA>
348전기사업허가황촌1리목말협동조합 태양광발전소19.5태안군 원북면 옥파로 7522023-11-21<NA>