Overview

Dataset statistics

Number of variables: 4
Number of observations: 313
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 10.2 KiB
Average record size in memory: 33.4 B
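
As a quick consistency check, the average record size can be recomputed from the total memory size and the number of observations. The sketch below uses only the figures reported above; the variable names are illustrative.

```python
# Figures copied from the dataset statistics above.
total_size_kib = 10.2
n_observations = 313
n_variables = 4

# 1 KiB = 1024 bytes.
total_bytes = total_size_kib * 1024
avg_record_bytes = total_bytes / n_observations

# Every observation has one cell per variable.
total_cells = n_observations * n_variables

print(f"{avg_record_bytes:.1f} B per record")
print(f"{total_cells} cells in total")
```

The result agrees with the reported 33.4 B once rounded to one decimal place.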

Variable types

Text: 2
Numeric: 1
DateTime: 1

Dataset

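Description: Data on the status of solar power generation facilities located in Goseong-gun, Gyeongsangnam-do, providing fields such as plant name, installed capacity, and address.
Author: Goseong-gun, Gyeongsangnam-do (경상남도 고성군)
URL: https://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15033951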

Reproduction

Analysis started: 2023-12-10 23:45:32.220442
Analysis finished: 2023-12-10 23:45:32.671343
Duration: 0.45 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
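
The reported duration follows directly from the two analysis timestamps. A minimal check using Python's standard library:

```python
from datetime import datetime

# Timestamps copied from the reproduction details above.
started = datetime.fromisoformat("2023-12-10 23:45:32.220442")
finished = datetime.fromisoformat("2023-12-10 23:45:32.671343")

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```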

Variables

발전소명 (plant name)
Text

Distinct: 310
Distinct (%): 99.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.6 KiB

Length

Max length: 27
Median length: 23
Mean length: 10.584665
Min length: 4

Characters and Unicode

Total characters: 3313
Distinct characters: 240
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
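
To illustrate how such character properties can be looked up, the sketch below counts Unicode general categories with Python's standard `unicodedata` module, using the five sample values listed for this variable. It is an illustration only and not necessarily how the profiler computes its figures. The standard library reports two-letter category codes (for example `Lo` for Other Letter) and does not expose script or block properties, so those are left out.

```python
import unicodedata
from collections import Counter

# The five sample values shown for this variable in the report.
names = [
    "한국남동발전(주)",
    "(주)송산태양광발전소",
    "녹원태양광발전소",
    "덕호쏠라에너지",
    "월흥쏠라에너지",
]

# unicodedata.category returns a two-letter general category code:
# Lo = Other Letter, Ps = Open Punctuation, Pe = Close Punctuation.
category_counts = Counter(
    unicodedata.category(ch) for name in names for ch in name
)

for code, count in category_counts.most_common():
    print(code, count)
```

On these five values, Hangul syllables fall under Other Letter and the parentheses under Open and Close Punctuation, which matches the category names used in the tables of this report.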

Unique

Unique: 307
Unique (%): 98.1%
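
The report lists both a distinct count and a unique count. Assuming the usual reading of these terms (distinct = number of different values, unique = number of values that occur exactly once), the difference can be sketched as follows. The toy column and the function name are hypothetical.

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique

# Hypothetical toy column: "B" and "C" are repeated, "A" and "D" are not.
toy = ["A", "B", "B", "C", "C", "D"]
print(distinct_and_unique(toy))

# Consistency check on the figures reported for this variable.
n_rows, distinct, unique = 313, 310, 307
repeated_values = distinct - unique   # values that occur more than once
rows_in_repeats = n_rows - unique     # rows holding those repeated values
print(repeated_values, rows_in_repeats)
```

Under that reading, the reported figures imply that three names each appear on two rows.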

Sample

1st row: 한국남동발전(주)
2nd row: (주)송산태양광발전소
3rd row: 녹원태양광발전소
4th row: 덕호쏠라에너지
5th row: 월흥쏠라에너지
Value | Count | Frequency (%)
태양광발전소 | 128 | 26.5%
주식회사 | 5 | 1.0%
(not preserved) | 4 | 0.8%
고성 | 3 | 0.6%
발전소 | 3 | 0.6%
더썬 | 2 | 0.4%
주차장 | 2 | 0.4%
영농형 | 2 | 0.4%
수상태양광발전소 | 2 | 0.4%
1호 | 2 | 0.4%
Other values (328) | 330 | 68.3%
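
A word-frequency table of this kind can be approximated by splitting each value on whitespace and counting the tokens. The sketch below assumes plain whitespace tokenization, which may differ from the profiler's own tokenizer, and uses a handful of plant names taken from the sample rows of this report.

```python
from collections import Counter

# Plant names taken from the sample rows of the report.
names = [
    "한빛 태양광발전소",
    "영산 태양광발전소",
    "신화 태양광발전소",
    "신촌마을 영농형 태양광발전소",
    "녹원태양광발전소",
]

# Split on whitespace and count each token.
word_counts = Counter(word for name in names for word in name.split())
total_words = sum(word_counts.values())

for word, count in word_counts.most_common(3):
    print(f"{word} | {count} | {count / total_words:.1%}")
```

Note that a name written without a space, such as 녹원태양광발전소, counts as a single token and does not add to the count for 태양광발전소.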

Most occurring characters

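Value | Count | Frequency (%)
(not preserved) | 320 | 9.7%
(not preserved) | 307 | 9.3%
(not preserved) | 306 | 9.2%
(not preserved) | 300 | 9.1%
(not preserved) | 300 | 9.1%
(not preserved) | 299 | 9.0%
(space) | 182 | 5.5%
(not preserved) | 65 | 2.0%
(not preserved) | 53 | 1.6%
( | 39 | 1.2%
Other values (230) | 1142 | 34.5%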

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2899 | 87.5%
Space Separator | 182 | 5.5%
Decimal Number | 103 | 3.1%
Open Punctuation | 39 | 1.2%
Close Punctuation | 39 | 1.2%
Math Symbol | 20 | 0.6%
Other Symbol | 14 | 0.4%
Uppercase Letter | 14 | 0.4%
Dash Punctuation | 2 | 0.1%
Other Punctuation | 1 | < 0.1%

Most frequent character per category

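Other Letter
Value | Count | Frequency (%)
(not preserved) | 320 | 11.0%
(not preserved) | 307 | 10.6%
(not preserved) | 306 | 10.6%
(not preserved) | 300 | 10.3%
(not preserved) | 300 | 10.3%
(not preserved) | 299 | 10.3%
(not preserved) | 65 | 2.2%
(not preserved) | 53 | 1.8%
(not preserved) | 34 | 1.2%
(not preserved) | 32 | 1.1%
Other values (206) | 883 | 30.5%

Decimal Number
Value | Count | Frequency (%)
1 | 36 | 35.0%
2 | 35 | 34.0%
3 | 13 | 12.6%
5 | 6 | 5.8%
0 | 5 | 4.9%
4 | 5 | 4.9%
7 | 1 | 1.0%
6 | 1 | 1.0%
8 | 1 | 1.0%

Uppercase Letter
Value | Count | Frequency (%)
E | 4 | 28.6%
S | 2 | 14.3%
A | 2 | 14.3%
C | 2 | 14.3%
P | 1 | 7.1%
H | 1 | 7.1%
T | 1 | 7.1%
J | 1 | 7.1%

Space Separator
Value | Count | Frequency (%)
(space) | 182 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 39 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 39 | 100.0%

Math Symbol
Value | Count | Frequency (%)
(not preserved) | 20 | 100.0%

Other Symbol
Value | Count | Frequency (%)
(not preserved) | 14 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 2 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
& | 1 | 100.0%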

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2913 | 87.9%
Common | 386 | 11.7%
Latin | 14 | 0.4%

Most frequent character per script

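Hangul
Value | Count | Frequency (%)
(not preserved) | 320 | 11.0%
(not preserved) | 307 | 10.5%
(not preserved) | 306 | 10.5%
(not preserved) | 300 | 10.3%
(not preserved) | 300 | 10.3%
(not preserved) | 299 | 10.3%
(not preserved) | 65 | 2.2%
(not preserved) | 53 | 1.8%
(not preserved) | 34 | 1.2%
(not preserved) | 32 | 1.1%
Other values (207) | 897 | 30.8%

Common
Value | Count | Frequency (%)
(space) | 182 | 47.2%
( | 39 | 10.1%
) | 39 | 10.1%
1 | 36 | 9.3%
2 | 35 | 9.1%
(not preserved) | 20 | 5.2%
3 | 13 | 3.4%
5 | 6 | 1.6%
0 | 5 | 1.3%
4 | 5 | 1.3%
Other values (5) | 6 | 1.6%

Latin
Value | Count | Frequency (%)
E | 4 | 28.6%
S | 2 | 14.3%
A | 2 | 14.3%
C | 2 | 14.3%
P | 1 | 7.1%
H | 1 | 7.1%
T | 1 | 7.1%
J | 1 | 7.1%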

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2899 | 87.5%
ASCII | 380 | 11.5%
Arrows | 20 | 0.6%
None | 14 | 0.4%

Most frequent character per block

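Hangul
Value | Count | Frequency (%)
(not preserved) | 320 | 11.0%
(not preserved) | 307 | 10.6%
(not preserved) | 306 | 10.6%
(not preserved) | 300 | 10.3%
(not preserved) | 300 | 10.3%
(not preserved) | 299 | 10.3%
(not preserved) | 65 | 2.2%
(not preserved) | 53 | 1.8%
(not preserved) | 34 | 1.2%
(not preserved) | 32 | 1.1%
Other values (206) | 883 | 30.5%

ASCII
Value | Count | Frequency (%)
(space) | 182 | 47.9%
( | 39 | 10.3%
) | 39 | 10.3%
1 | 36 | 9.5%
2 | 35 | 9.2%
3 | 13 | 3.4%
5 | 6 | 1.6%
0 | 5 | 1.3%
4 | 5 | 1.3%
E | 4 | 1.1%
Other values (12) | 16 | 4.2%

Arrows
Value | Count | Frequency (%)
(not preserved) | 20 | 100.0%

None
Value | Count | Frequency (%)
(not preserved) | 14 | 100.0%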

설비용량(킬로와트) (installed capacity, kW)
Numeric

Distinct: 165
Distinct (%): 52.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 631.78498
Minimum: 9.4
Maximum: 82500
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.9 KiB

Quantile statistics

Minimum: 9.4
5th percentile: 20
Q1: 98
Median: 99.2
Q3: 390.6
95th percentile: 1991.1
Maximum: 82500
Range: 82490.6
Interquartile range (IQR): 292.6
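
The range and interquartile range follow from the quantiles listed above. The sketch below recomputes them and, as an added illustration that is not part of the report, derives the conventional upper outlier fence Q3 + 1.5 × IQR.

```python
# Quantiles copied from the table above (kW).
minimum, q1, q3, maximum = 9.4, 98.0, 390.6, 82500.0

value_range = maximum - minimum
iqr = q3 - q1

# Tukey's rule of thumb (an assumption added here, not reported by the profiler):
# values above Q3 + 1.5 * IQR are often treated as potential outliers.
upper_fence = q3 + 1.5 * iqr

print(f"Range: {value_range:.1f}")
print(f"IQR: {iqr:.1f}")
print(f"Upper fence: {upper_fence:.1f}")
```

Under that rule of thumb, both the reported 95th percentile (1991.1) and the maximum (82500) lie above the fence, which is consistent with a strongly right-skewed distribution.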

Descriptive statistics

Standard deviation: 4675.2703
Coefficient of variation (CV): 7.4000972
Kurtosis: 304.17966
Mean: 631.78498
Median Absolute Deviation (MAD): 58.4
Skewness: 17.322102
Sum: 197748.7
Variance: 21858152
Monotonicity: Not monotonic
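
Several of these statistics are tied together by simple identities: the coefficient of variation is the standard deviation divided by the mean, the variance is the squared standard deviation, and the sum is the mean times the number of observations. A minimal check against the reported (rounded) figures:

```python
# Figures copied from the report; all of them are rounded, so small
# differences in the last digits are expected.
mean = 631.78498
std = 4675.2703
n_observations = 313

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance from standard deviation
total = mean * n_observations    # sum of all values

print(f"CV: {cv:.7f}")
print(f"Variance: {variance:.0f}")
print(f"Sum: {total:.1f}")
```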
Histogram with fixed size bins (bins=50)
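
With fixed-size bins, the interval between the minimum and maximum is divided into equally wide bins. The sketch below assumes 50 equal-width bins spanning the reported minimum and maximum, with the maximum assigned to the last bin; the helper name `bin_index` is hypothetical.

```python
def bin_index(value, minimum, maximum, bins=50):
    """Return the 0-based index of the equal-width bin containing value."""
    width = (maximum - minimum) / bins
    index = int((value - minimum) // width)
    # Clamp so that the maximum itself lands in the last bin.
    return min(index, bins - 1)

# Minimum and maximum reported for this variable (kW).
minimum, maximum = 9.4, 82500.0
width = (maximum - minimum) / 50
print(f"Bin width: {width:.3f} kW")

# Q3, the second-largest reported value, and the maximum.
for value in (390.6, 2989.4, 82500.0):
    print(value, "-> bin", bin_index(value, minimum, maximum))
```

Because the single 82500 kW value stretches the axis, each bin is roughly 1650 kW wide, so every other value listed in this report falls into the first two bins.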

Common values
Value | Count | Frequency (%)
99.0 | 67 | 21.4%
997.9 | 7 | 2.2%
198.0 | 6 | 1.9%
99.4 | 6 | 1.9%
100.0 | 5 | 1.6%
499.0 | 5 | 1.6%
99.1 | 4 | 1.3%
96.9 | 4 | 1.3%
495.4 | 4 | 1.3%
99.2 | 4 | 1.3%
Other values (155) | 201 | 64.2%

Minimum 10 values
Value | Count | Frequency (%)
9.4 | 3 | 1.0%
9.8 | 1 | 0.3%
9.9 | 2 | 0.6%
10.1 | 1 | 0.3%
15.0 | 1 | 0.3%
18.2 | 1 | 0.3%
18.7 | 1 | 0.3%
19.3 | 1 | 0.3%
19.5 | 1 | 0.3%
19.8 | 2 | 0.6%

Maximum 10 values
Value | Count | Frequency (%)
82500.0 | 1 | 0.3%
2989.4 | 1 | 0.3%
2953.8 | 1 | 0.3%
2948.4 | 1 | 0.3%
2695.0 | 1 | 0.3%
2518.6 | 1 | 0.3%
2391.5 | 1 | 0.3%
2303.9 | 1 | 0.3%
2002.3 | 4 | 1.3%
2001.1 | 1 | 0.3%

주소 (address)
Text

Distinct: 270
Distinct (%): 86.3%
Missing: 0
Missing (%): 0.0%
Memory size: 2.6 KiB

Length

Max length: 28
Median length: 27
Mean length: 21.210863
Min length: 16

Characters and Unicode

Total characters: 6639
Distinct characters: 118
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 245
Unique (%): 78.3%

Sample

1st row: 경상남도 고성군 하이면 하이로 1
2nd row: 경상남도 고성군 거류면 거류로 329
3rd row: 경상남도 고성군 하이면 공룡로 36-12
4th row: 경상남도 고성군 하이면 공룡로 36-18
5th row: 경상남도 고성군 하이면 자란만로 857
Value | Count | Frequency (%)
고성군 | 308 | 19.7%
경상남도 | 292 | 18.7%
거류면 | 45 | 2.9%
삼산면 | 31 | 2.0%
하일면 | 30 | 1.9%
고성읍 | 28 | 1.8%
마암면 | 27 | 1.7%
하이면 | 27 | 1.7%
가려리 | 26 | 1.7%
상리면 | 25 | 1.6%
Other values (375) | 721 | 46.2%

Most occurring characters

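Value | Count | Frequency (%)
(space) | 1250 | 18.8%
(not preserved) | 348 | 5.2%
(not preserved) | 336 | 5.1%
(not preserved) | 319 | 4.8%
(not preserved) | 317 | 4.8%
(not preserved) | 313 | 4.7%
(not preserved) | 308 | 4.6%
(not preserved) | 293 | 4.4%
(not preserved) | 287 | 4.3%
(not preserved) | 285 | 4.3%
Other values (108) | 2583 | 38.9%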

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 4127 | 62.2%
Space Separator | 1250 | 18.8%
Decimal Number | 1092 | 16.4%
Dash Punctuation | 168 | 2.5%
Other Punctuation | 2 | < 0.1%

Most frequent character per category

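Other Letter
Value | Count | Frequency (%)
(not preserved) | 348 | 8.4%
(not preserved) | 336 | 8.1%
(not preserved) | 319 | 7.7%
(not preserved) | 317 | 7.7%
(not preserved) | 313 | 7.6%
(not preserved) | 308 | 7.5%
(not preserved) | 293 | 7.1%
(not preserved) | 287 | 7.0%
(not preserved) | 285 | 6.9%
(not preserved) | 154 | 3.7%
Other values (95) | 1167 | 28.3%

Decimal Number
Value | Count | Frequency (%)
1 | 222 | 20.3%
2 | 151 | 13.8%
3 | 144 | 13.2%
4 | 109 | 10.0%
7 | 100 | 9.2%
5 | 99 | 9.1%
6 | 77 | 7.1%
9 | 69 | 6.3%
8 | 65 | 6.0%
0 | 56 | 5.1%

Space Separator
Value | Count | Frequency (%)
(space) | 1250 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 168 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 2 | 100.0%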

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 4127 | 62.2%
Common | 2512 | 37.8%

Most frequent character per script

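Hangul
Value | Count | Frequency (%)
(not preserved) | 348 | 8.4%
(not preserved) | 336 | 8.1%
(not preserved) | 319 | 7.7%
(not preserved) | 317 | 7.7%
(not preserved) | 313 | 7.6%
(not preserved) | 308 | 7.5%
(not preserved) | 293 | 7.1%
(not preserved) | 287 | 7.0%
(not preserved) | 285 | 6.9%
(not preserved) | 154 | 3.7%
Other values (95) | 1167 | 28.3%

Common
Value | Count | Frequency (%)
(space) | 1250 | 49.8%
1 | 222 | 8.8%
- | 168 | 6.7%
2 | 151 | 6.0%
3 | 144 | 5.7%
4 | 109 | 4.3%
7 | 100 | 4.0%
5 | 99 | 3.9%
6 | 77 | 3.1%
9 | 69 | 2.7%
Other values (3) | 123 | 4.9%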

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 4127 | 62.2%
ASCII | 2512 | 37.8%

Most frequent character per block

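ASCII
Value | Count | Frequency (%)
(space) | 1250 | 49.8%
1 | 222 | 8.8%
- | 168 | 6.7%
2 | 151 | 6.0%
3 | 144 | 5.7%
4 | 109 | 4.3%
7 | 100 | 4.0%
5 | 99 | 3.9%
6 | 77 | 3.1%
9 | 69 | 2.7%
Other values (3) | 123 | 4.9%

Hangul
Value | Count | Frequency (%)
(not preserved) | 348 | 8.4%
(not preserved) | 336 | 8.1%
(not preserved) | 319 | 7.7%
(not preserved) | 317 | 7.7%
(not preserved) | 313 | 7.6%
(not preserved) | 308 | 7.5%
(not preserved) | 293 | 7.1%
(not preserved) | 287 | 7.0%
(not preserved) | 285 | 6.9%
(not preserved) | 154 | 3.7%
Other values (95) | 1167 | 28.3%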

최초허가일 (date of first permit)
DateTime

Distinct: 168
Distinct (%): 53.7%
Missing: 0
Missing (%): 0.0%
Memory size: 2.6 KiB
Minimum: 2005-04-14 00:00:00
Maximum: 2018-11-12 00:00:00
Histogram with fixed size bins (bins=50)
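
For a date variable, fixed-size bins divide the time between the earliest and latest value into equal intervals. The sketch below computes the covered span and the resulting bin width from the reported minimum and maximum, assuming 50 equal-width bins across that span.

```python
from datetime import date

# Earliest and latest permit dates reported for this variable.
first_permit = date(2005, 4, 14)
last_permit = date(2018, 11, 12)

span_days = (last_permit - first_permit).days
bin_width_days = span_days / 50  # 50 equal-width bins assumed

print(f"Span: {span_days} days")
print(f"Bin width: {bin_width_days:.1f} days")
```

Each bar of the histogram therefore covers a little over three months of permit dates.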

Interactions

[Figure: interactions plot]

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Row | 발전소명 (plant name) | 설비용량(킬로와트) (installed capacity, kW) | 주소 (address) | 최초허가일 (date of first permit)
0 | 한국남동발전(주) | 100.0 | 경상남도 고성군 하이면 하이로 1 | 2005-04-14
1 | (주)송산태양광발전소 | 250.0 | 경상남도 고성군 거류면 거류로 329 | 2007-08-22
2 | 녹원태양광발전소 | 9.4 | 경상남도 고성군 하이면 공룡로 36-12 | 2008-07-21
3 | 덕호쏠라에너지 | 9.4 | 경상남도 고성군 하이면 공룡로 36-18 | 2008-07-21
4 | 월흥쏠라에너지 | 9.4 | 경상남도 고성군 하이면 자란만로 857 | 2008-07-21
5 | 성광태양광발전소 | 29.4 | 경상남도 고성군 구만면 화림리 225 | 2009-02-23
6 | 조광태양광발전소 | 29.4 | 경상남도 고성군 구만면 화림리 225 | 2009-02-25
7 | (재)하이산업 | 171.6 | 경상남도 고성군 하이면 남일로 397 | 2010-03-02
8 | 익호태양광발전소 | 9.9 | 경상남도 고성군 동해면 봉암2길 119-10 | 2011-10-10
9 | 한빛 태양광발전소 | 61.2 | 경상남도 고성군 마암면 삼락4길 80-23 | 2011-12-23

Row | 발전소명 (plant name) | 설비용량(킬로와트) (installed capacity, kW) | 주소 (address) | 최초허가일 (date of first permit)
303 | 이당리(2) 태양광발전소 | 44.2 | 경남 고성군 고성읍 이당리 1042 | 2018-10-15
304 | 영산 태양광발전소 | 199.3 | 경남 고성군 영오면 영산리 10-2 | 2018-10-19
305 | 신화 태양광발전소 | 199.7 | 경남 고성군 고성읍 율대리 159-2 | 2018-10-25
306 | ㈜조선 태양광발전소 | 102.2 | 경남 고성군 고성읍 이당리1054-3 | 2018-10-30
307 | 신촌마을 영농형 태양광발전소 | 76.8 | 경남 고성군 하이면 석지리 733-1 | 2018-10-30
308 | 두포리1호 태양광발전소 | 99.0 | 경남 고성군 삼산면 두포리 341-4 | 2018-11-05
309 | 학림5호 태양광발전소 | 290.2 | 경남 고성군 하일면 학림리 1233 | 2018-11-12
310 | 학림6호 태양광발전소 | 473.6 | 경남 고성군 하일면 학림리 1227 | 2018-11-12
311 | 학림7호 태양광발전소 | 98.6 | 경남 고성군 하일면 학림리 1223-2 | 2018-11-12
312 | 학림8호 태양광발전소 | 98.5 | 경남 고성군 하일면 학림리 1245 | 2018-11-12