Overview

Dataset statistics

Number of variables6
Number of observations2327
Missing cells1685
Missing cells (%)12.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory111.5 KiB
Average record size in memory49.1 B

Variable types

Numeric1
Text3
DateTime2

Dataset

Description전라북도 고창군 신재생에너지 발전허가 및 상업운영 현황 용량, 발전소명, 주소
Author전라북도 고창군
URLhttps://www.data.go.kr/data/15029960/fileData.do

Alerts

사업개시일 has 1685 (72.4%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 02:52:57.021022
Analysis finished2023-12-12 02:52:58.247330
Duration1.23 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct2327
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1164
Minimum1
Maximum2327
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size20.6 KiB
2023-12-12T11:52:58.357415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile117.3
Q1582.5
median1164
Q31745.5
95-th percentile2210.7
Maximum2327
Range2326
Interquartile range (IQR)1163

Descriptive statistics

Standard deviation671.89136
Coefficient of variation (CV)0.57722625
Kurtosis-1.2
Mean1164
Median Absolute Deviation (MAD)582
Skewness0
Sum2708628
Variance451438
MonotonicityStrictly increasing
2023-12-12T11:52:58.542964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
< 0.1%
1529 1
 
< 0.1%
1549 1
 
< 0.1%
1550 1
 
< 0.1%
1551 1
 
< 0.1%
1552 1
 
< 0.1%
1553 1
 
< 0.1%
1554 1
 
< 0.1%
1555 1
 
< 0.1%
1556 1
 
< 0.1%
Other values (2317) 2317
99.6%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
2327 1
< 0.1%
2326 1
< 0.1%
2325 1
< 0.1%
2324 1
< 0.1%
2323 1
< 0.1%
2322 1
< 0.1%
2321 1
< 0.1%
2320 1
< 0.1%
2319 1
< 0.1%
2318 1
< 0.1%
Distinct2203
Distinct (%)94.7%
Missing0
Missing (%)0.0%
Memory size18.3 KiB
2023-12-12T11:52:58.987978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length41
Median length25
Mean length9.3837559
Min length2

Characters and Unicode

Total characters21836
Distinct characters422
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2104 ?
Unique (%)90.4%

Sample

1st row참빛1호태양광발전소
2nd row참빛2호태양광발전소
3rd row참빛3호태양광발전소
4th row사반3호태양광발전소
5th row싱푸에너지
ValueCountFrequency (%)
태양광발전소 226
 
8.5%
태양광 34
 
1.3%
고창1차 11
 
0.4%
우신솔라타운 11
 
0.4%
발전소 11
 
0.4%
대산태양광발전소 7
 
0.3%
1호 6
 
0.2%
에너지 6
 
0.2%
희망태양광발전소 6
 
0.2%
한국남부발전(주 5
 
0.2%
Other values (2220) 2344
87.9%
2023-12-12T11:52:59.603873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2234
 
10.2%
2209
 
10.1%
2196
 
10.1%
2080
 
9.5%
2036
 
9.3%
2036
 
9.3%
1033
 
4.7%
1 418
 
1.9%
2 351
 
1.6%
340
 
1.6%
Other values (412) 6903
31.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 19882
91.1%
Decimal Number 1290
 
5.9%
Space Separator 340
 
1.6%
Uppercase Letter 139
 
0.6%
Close Punctuation 60
 
0.3%
Open Punctuation 59
 
0.3%
Other Symbol 29
 
0.1%
Lowercase Letter 15
 
0.1%
Dash Punctuation 14
 
0.1%
Other Punctuation 8
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2234
11.2%
2209
 
11.1%
2196
 
11.0%
2080
 
10.5%
2036
 
10.2%
2036
 
10.2%
1033
 
5.2%
176
 
0.9%
168
 
0.8%
164
 
0.8%
Other values (368) 5550
27.9%
Uppercase Letter
ValueCountFrequency (%)
S 21
15.1%
J 15
10.8%
Y 12
 
8.6%
M 12
 
8.6%
E 11
 
7.9%
H 10
 
7.2%
N 7
 
5.0%
C 6
 
4.3%
K 6
 
4.3%
O 6
 
4.3%
Other values (11) 33
23.7%
Decimal Number
ValueCountFrequency (%)
1 418
32.4%
2 351
27.2%
3 174
13.5%
4 94
 
7.3%
5 77
 
6.0%
6 53
 
4.1%
7 39
 
3.0%
9 29
 
2.2%
0 28
 
2.2%
8 27
 
2.1%
Lowercase Letter
ValueCountFrequency (%)
o 3
20.0%
s 3
20.0%
a 3
20.0%
r 3
20.0%
l 3
20.0%
Other Punctuation
ValueCountFrequency (%)
, 4
50.0%
& 3
37.5%
. 1
 
12.5%
Space Separator
ValueCountFrequency (%)
340
100.0%
Close Punctuation
ValueCountFrequency (%)
) 60
100.0%
Open Punctuation
ValueCountFrequency (%)
( 59
100.0%
Other Symbol
ValueCountFrequency (%)
29
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 14
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 19911
91.2%
Common 1771
 
8.1%
Latin 154
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2234
11.2%
2209
 
11.1%
2196
 
11.0%
2080
 
10.4%
2036
 
10.2%
2036
 
10.2%
1033
 
5.2%
176
 
0.9%
168
 
0.8%
164
 
0.8%
Other values (369) 5579
28.0%
Latin
ValueCountFrequency (%)
S 21
13.6%
J 15
 
9.7%
Y 12
 
7.8%
M 12
 
7.8%
E 11
 
7.1%
H 10
 
6.5%
N 7
 
4.5%
C 6
 
3.9%
K 6
 
3.9%
O 6
 
3.9%
Other values (16) 48
31.2%
Common
ValueCountFrequency (%)
1 418
23.6%
2 351
19.8%
340
19.2%
3 174
9.8%
4 94
 
5.3%
5 77
 
4.3%
) 60
 
3.4%
( 59
 
3.3%
6 53
 
3.0%
7 39
 
2.2%
Other values (7) 106
 
6.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 19882
91.1%
ASCII 1925
 
8.8%
None 29
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2234
11.2%
2209
 
11.1%
2196
 
11.0%
2080
 
10.5%
2036
 
10.2%
2036
 
10.2%
1033
 
5.2%
176
 
0.9%
168
 
0.8%
164
 
0.8%
Other values (368) 5550
27.9%
ASCII
ValueCountFrequency (%)
1 418
21.7%
2 351
18.2%
340
17.7%
3 174
9.0%
4 94
 
4.9%
5 77
 
4.0%
) 60
 
3.1%
( 59
 
3.1%
6 53
 
2.8%
7 39
 
2.0%
Other values (33) 260
13.5%
None
ValueCountFrequency (%)
29
100.0%
Distinct1837
Distinct (%)78.9%
Missing0
Missing (%)0.0%
Memory size18.3 KiB
2023-12-12T11:53:00.064970image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length128
Median length109
Mean length20.486893
Min length9

Characters and Unicode

Total characters47673
Distinct characters172
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1594 ?
Unique (%)68.5%

Sample

1st row상하면 장호리 246 건물상부
2nd row상하면 장호리 250-2 건물상부
3rd row상하면 장호리 257-1 건물상부
4th row해리면 사반리 998, 998-3
5th row흥덕면 사포리 229-2
ValueCountFrequency (%)
공음면 339
 
3.6%
대산면 335
 
3.5%
무장면 319
 
3.4%
흥덕면 211
 
2.2%
부안면 184
 
1.9%
건물상부 184
 
1.9%
상하면 184
 
1.9%
성내면 140
 
1.5%
성송면 135
 
1.4%
고수면 126
 
1.3%
Other values (2719) 7293
77.2%
2023-12-12T11:53:00.725111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7935
16.6%
1 3340
 
7.0%
- 2986
 
6.3%
2 2693
 
5.6%
2381
 
5.0%
, 2282
 
4.8%
2279
 
4.8%
5 1664
 
3.5%
4 1648
 
3.5%
3 1579
 
3.3%
Other values (162) 18886
39.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 17327
36.3%
Other Letter 16670
35.0%
Space Separator 7935
16.6%
Dash Punctuation 2986
 
6.3%
Other Punctuation 2290
 
4.8%
Close Punctuation 218
 
0.5%
Open Punctuation 218
 
0.5%
Math Symbol 20
 
< 0.1%
Lowercase Letter 8
 
< 0.1%
Other Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2381
 
14.3%
2279
 
13.7%
1228
 
7.4%
623
 
3.7%
546
 
3.3%
461
 
2.8%
442
 
2.7%
441
 
2.6%
379
 
2.3%
356
 
2.1%
Other values (141) 7534
45.2%
Decimal Number
ValueCountFrequency (%)
1 3340
19.3%
2 2693
15.5%
5 1664
9.6%
4 1648
9.5%
3 1579
9.1%
7 1545
8.9%
9 1312
 
7.6%
6 1266
 
7.3%
0 1195
 
6.9%
8 1085
 
6.3%
Other Punctuation
ValueCountFrequency (%)
, 2282
99.7%
. 6
 
0.3%
: 2
 
0.1%
Lowercase Letter
ValueCountFrequency (%)
w 4
50.0%
k 4
50.0%
Space Separator
ValueCountFrequency (%)
7935
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2986
100.0%
Close Punctuation
ValueCountFrequency (%)
) 218
100.0%
Open Punctuation
ValueCountFrequency (%)
( 218
100.0%
Math Symbol
ValueCountFrequency (%)
~ 20
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 30995
65.0%
Hangul 16670
35.0%
Latin 8
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2381
 
14.3%
2279
 
13.7%
1228
 
7.4%
623
 
3.7%
546
 
3.3%
461
 
2.8%
442
 
2.7%
441
 
2.6%
379
 
2.3%
356
 
2.1%
Other values (141) 7534
45.2%
Common
ValueCountFrequency (%)
7935
25.6%
1 3340
10.8%
- 2986
 
9.6%
2 2693
 
8.7%
, 2282
 
7.4%
5 1664
 
5.4%
4 1648
 
5.3%
3 1579
 
5.1%
7 1545
 
5.0%
9 1312
 
4.2%
Other values (9) 4011
12.9%
Latin
ValueCountFrequency (%)
w 4
50.0%
k 4
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 31002
65.0%
Hangul 16670
35.0%
CJK Compat 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
7935
25.6%
1 3340
10.8%
- 2986
 
9.6%
2 2693
 
8.7%
, 2282
 
7.4%
5 1664
 
5.4%
4 1648
 
5.3%
3 1579
 
5.1%
7 1545
 
5.0%
9 1312
 
4.2%
Other values (10) 4018
13.0%
Hangul
ValueCountFrequency (%)
2381
 
14.3%
2279
 
13.7%
1228
 
7.4%
623
 
3.7%
546
 
3.3%
461
 
2.8%
442
 
2.7%
441
 
2.6%
379
 
2.3%
356
 
2.1%
Other values (141) 7534
45.2%
CJK Compat
ValueCountFrequency (%)
1
100.0%
Distinct489
Distinct (%)21.0%
Missing0
Missing (%)0.0%
Memory size18.3 KiB
2023-12-12T11:53:01.090904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length7
Mean length5.0893855
Min length2

Characters and Unicode

Total characters11843
Distinct characters13
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique301 ?
Unique (%)12.9%

Sample

1st row99.00
2nd row99.00
3rd row99.00
4th row30.00
5th row50.00
ValueCountFrequency (%)
99.00 187
 
8.0%
99.4 164
 
7.0%
97.92 146
 
6.3%
99 142
 
6.1%
30.00 75
 
3.2%
99.84 70
 
3.0%
99.36 68
 
2.9%
99.11 65
 
2.8%
98.28 51
 
2.2%
99.96 51
 
2.2%
Other values (463) 1308
56.2%
2023-12-12T11:53:01.699154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9 3588
30.3%
. 2036
17.2%
0 1392
 
11.8%
806
 
6.8%
4 740
 
6.2%
2 720
 
6.1%
8 645
 
5.4%
7 458
 
3.9%
6 393
 
3.3%
3 374
 
3.2%
Other values (3) 691
 
5.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 9000
76.0%
Other Punctuation 2037
 
17.2%
Space Separator 806
 
6.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
9 3588
39.9%
0 1392
 
15.5%
4 740
 
8.2%
2 720
 
8.0%
8 645
 
7.2%
7 458
 
5.1%
6 393
 
4.4%
3 374
 
4.2%
5 352
 
3.9%
1 338
 
3.8%
Other Punctuation
ValueCountFrequency (%)
. 2036
> 99.9%
/ 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
806
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 11843
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
9 3588
30.3%
. 2036
17.2%
0 1392
 
11.8%
806
 
6.8%
4 740
 
6.2%
2 720
 
6.1%
8 645
 
5.4%
7 458
 
3.9%
6 393
 
3.3%
3 374
 
3.2%
Other values (3) 691
 
5.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 11843
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9 3588
30.3%
. 2036
17.2%
0 1392
 
11.8%
806
 
6.8%
4 740
 
6.2%
2 720
 
6.1%
8 645
 
5.4%
7 458
 
3.9%
6 393
 
3.3%
3 374
 
3.2%
Other values (3) 691
 
5.8%
Distinct430
Distinct (%)18.5%
Missing0
Missing (%)0.0%
Memory size18.3 KiB
Minimum2005-11-16 00:00:00
Maximum2020-03-24 00:00:00
2023-12-12T11:53:02.039952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:53:02.292291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

사업개시일
Date

MISSING 

Distinct261
Distinct (%)40.7%
Missing1685
Missing (%)72.4%
Memory size18.3 KiB
Minimum2007-05-28 00:00:00
Maximum2020-02-21 00:00:00
2023-12-12T11:53:02.616244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:53:02.930408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-12T11:52:57.843901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-12T11:52:58.029563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T11:52:58.183683image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번발전소명발전소주소설비용량허가일사업개시일
01참빛1호태양광발전소상하면 장호리 246 건물상부99.002014-05-162015-07-01
12참빛2호태양광발전소상하면 장호리 250-2 건물상부99.002014-05-162015-07-01
23참빛3호태양광발전소상하면 장호리 257-1 건물상부99.002014-05-162015-07-01
34사반3호태양광발전소해리면 사반리 998, 998-330.002014-05-162015-11-06
45싱푸에너지흥덕면 사포리 229-250.002014-05-162017-02-24
56경자태양광발전소성내면 장수길 19(신성리 493-1) 건물상부 주1,주239.602014-05-212014-09-12
67해성태양광발전소흥덕면 치룡리 592 (토지14.945kw, 건물 15kw)29.952014-05-282014-12-29
78명성태양광발전소고창읍 화산리 359(주1동)32.502014-05-282014-08-25
89명경태양광발전소성송면 계당리 629-1외 631-4(주1,주2)30.002014-05-282014-08-25
910봉암태양광발전소부안면 봉암리 205-4,205(건물위 263㎡)40.002014-06-092015-01-29
연번발전소명발전소주소설비용량허가일사업개시일
23172318김동규태양광발전소5호공음면 예전리 12002019-07-25<NA>
23182319순덕태양광발전소아산면 주진리 120-13, 120-44, 104193.052019-07-25<NA>
23192320주현1호태양광발전소해리면 사반리 111-14982019-07-25<NA>
23202321김세란태양광발전소1호무장면 목우리 416, 413-3491.42019-07-25<NA>
23212322김세란태양광발전소무장면 목우리 414196.562019-07-25<NA>
23222323정재직태양광발전소대산면 광대로 1325-8499.52019-07-25<NA>
23232324유병준태양광발전소아산면 인천강변로 3822002019-07-25<NA>
23242325나기동태양광발전소아산면 학전리 795249.752019-07-31<NA>
23252326그린태양광발전소흥덕면 송암리 686199.82019-07-31<NA>
23262327백암태양광발전소해리면 금평리 157-2399.62019-07-31<NA>