Overview

Dataset statistics

Number of variables10
Number of observations1016
Missing cells406
Missing cells (%)4.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory81.5 KiB
Average record size in memory82.1 B

Variable types

Numeric2
Categorical4
Text2
DateTime2

Dataset

Description2023년 3월 22일 기준 괴산군 신재생에너지(태양광, 풍력 등) 발전사업허가 및 사업개시 현황 (100kW 이하) 데이터파일입니다. 자세한 사항은 괴산군으로 문의주시기 바랍니다.
URLhttps://www.data.go.kr/data/15033979/fileData.do

Alerts

시도 has constant value ""Constant
시군구 has constant value ""Constant
구분 has constant value ""Constant
데이터기준일 has constant value ""Constant
사업개시일 has 406 (40.0%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 06:44:11.949712
Analysis finished2023-12-12 06:44:12.956050
Duration1.01 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct1016
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean508.5
Minimum1
Maximum1016
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size9.1 KiB
2023-12-12T15:44:13.032239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile51.75
Q1254.75
median508.5
Q3762.25
95-th percentile965.25
Maximum1016
Range1015
Interquartile range (IQR)507.5

Descriptive statistics

Standard deviation293.43824
Coefficient of variation (CV)0.57706635
Kurtosis-1.2
Mean508.5
Median Absolute Deviation (MAD)254
Skewness0
Sum516636
Variance86106
MonotonicityStrictly increasing
2023-12-12T15:44:13.164908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
684 1
 
0.1%
671 1
 
0.1%
672 1
 
0.1%
673 1
 
0.1%
674 1
 
0.1%
675 1
 
0.1%
676 1
 
0.1%
677 1
 
0.1%
678 1
 
0.1%
Other values (1006) 1006
99.0%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1016 1
0.1%
1015 1
0.1%
1014 1
0.1%
1013 1
0.1%
1012 1
0.1%
1011 1
0.1%
1010 1
0.1%
1009 1
0.1%
1008 1
0.1%
1007 1
0.1%

시도
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size8.1 KiB
충청북도
1016 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row충청북도
2nd row충청북도
3rd row충청북도
4th row충청북도
5th row충청북도

Common Values

ValueCountFrequency (%)
충청북도 1016
100.0%

Length

2023-12-12T15:44:13.284425image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:44:13.367508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
충청북도 1016
100.0%

시군구
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size8.1 KiB
괴산군
1016 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row괴산군
2nd row괴산군
3rd row괴산군
4th row괴산군
5th row괴산군

Common Values

ValueCountFrequency (%)
괴산군 1016
100.0%

Length

2023-12-12T15:44:13.461644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:44:13.548967image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
괴산군 1016
100.0%

구분
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size8.1 KiB
태양광
1016 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row태양광
2nd row태양광
3rd row태양광
4th row태양광
5th row태양광

Common Values

ValueCountFrequency (%)
태양광 1016
100.0%

Length

2023-12-12T15:44:13.633256image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:44:13.726666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
태양광 1016
100.0%
Distinct864
Distinct (%)85.0%
Missing0
Missing (%)0.0%
Memory size8.1 KiB
2023-12-12T15:44:13.939871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length24
Mean length9.7086614
Min length3

Characters and Unicode

Total characters9864
Distinct characters363
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique726 ?
Unique (%)71.5%

Sample

1st row관동태양광발전소
2nd row명걸태양광발전소
3rd row정성2호태양광발전소
4th row정성1호태양광발전소
5th row하람1호태양광발전소
ValueCountFrequency (%)
태양광발전소 254
 
19.6%
불정태양광발전소 4
 
0.3%
괴산태양광발전소 4
 
0.3%
행복태양광발전소 4
 
0.3%
1호 4
 
0.3%
물레방아태양광발전소 4
 
0.3%
광진태양광발전소 3
 
0.2%
부흥태양광발전소 3
 
0.2%
백양태양광발전소 3
 
0.2%
하늘태양광발전소 3
 
0.2%
Other values (867) 1011
77.9%
2023-12-12T15:44:14.694996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
978
 
9.9%
976
 
9.9%
972
 
9.9%
962
 
9.8%
961
 
9.7%
946
 
9.6%
302
 
3.1%
281
 
2.8%
1 162
 
1.6%
2 128
 
1.3%
Other values (353) 3196
32.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8943
90.7%
Decimal Number 488
 
4.9%
Space Separator 281
 
2.8%
Uppercase Letter 47
 
0.5%
Open Punctuation 42
 
0.4%
Close Punctuation 42
 
0.4%
Lowercase Letter 10
 
0.1%
Dash Punctuation 9
 
0.1%
Other Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
978
 
10.9%
976
 
10.9%
972
 
10.9%
962
 
10.8%
961
 
10.7%
946
 
10.6%
302
 
3.4%
88
 
1.0%
79
 
0.9%
66
 
0.7%
Other values (320) 2613
29.2%
Uppercase Letter
ValueCountFrequency (%)
S 8
17.0%
N 6
12.8%
H 5
10.6%
E 5
10.6%
Y 4
8.5%
G 4
8.5%
O 4
8.5%
T 3
 
6.4%
A 2
 
4.3%
C 2
 
4.3%
Other values (3) 4
8.5%
Decimal Number
ValueCountFrequency (%)
1 162
33.2%
2 128
26.2%
3 51
 
10.5%
4 35
 
7.2%
5 31
 
6.4%
6 26
 
5.3%
7 19
 
3.9%
8 14
 
2.9%
9 12
 
2.5%
0 10
 
2.0%
Lowercase Letter
ValueCountFrequency (%)
c 2
20.0%
o 2
20.0%
p 2
20.0%
e 2
20.0%
k 2
20.0%
Space Separator
ValueCountFrequency (%)
281
100.0%
Open Punctuation
ValueCountFrequency (%)
( 42
100.0%
Close Punctuation
ValueCountFrequency (%)
) 42
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%
Other Punctuation
ValueCountFrequency (%)
& 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8943
90.7%
Common 864
 
8.8%
Latin 57
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
978
 
10.9%
976
 
10.9%
972
 
10.9%
962
 
10.8%
961
 
10.7%
946
 
10.6%
302
 
3.4%
88
 
1.0%
79
 
0.9%
66
 
0.7%
Other values (320) 2613
29.2%
Latin
ValueCountFrequency (%)
S 8
14.0%
N 6
10.5%
H 5
 
8.8%
E 5
 
8.8%
Y 4
 
7.0%
G 4
 
7.0%
O 4
 
7.0%
T 3
 
5.3%
c 2
 
3.5%
o 2
 
3.5%
Other values (8) 14
24.6%
Common
ValueCountFrequency (%)
281
32.5%
1 162
18.8%
2 128
14.8%
3 51
 
5.9%
( 42
 
4.9%
) 42
 
4.9%
4 35
 
4.1%
5 31
 
3.6%
6 26
 
3.0%
7 19
 
2.2%
Other values (5) 47
 
5.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8943
90.7%
ASCII 921
 
9.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
978
 
10.9%
976
 
10.9%
972
 
10.9%
962
 
10.8%
961
 
10.7%
946
 
10.6%
302
 
3.4%
88
 
1.0%
79
 
0.9%
66
 
0.7%
Other values (320) 2613
29.2%
ASCII
ValueCountFrequency (%)
281
30.5%
1 162
17.6%
2 128
13.9%
3 51
 
5.5%
( 42
 
4.6%
) 42
 
4.6%
4 35
 
3.8%
5 31
 
3.4%
6 26
 
2.8%
7 19
 
2.1%
Other values (23) 104
 
11.3%
Distinct764
Distinct (%)75.2%
Missing0
Missing (%)0.0%
Memory size8.1 KiB
2023-12-12T15:44:15.093158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length63
Median length51
Mean length25.253937
Min length11

Characters and Unicode

Total characters25658
Distinct characters204
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique597 ?
Unique (%)58.8%

Sample

1st row충북 괴산군 괴산읍 사창리 204-2(건물 위)
2nd row충북 괴산군 사리면 중흥리 120(건물위)
3rd row충북 괴산군 연풍면 유상리 358, 360-1(건물위)
4th row충북 괴산군 연풍면 유상리 358, 360-1(건물위)
5th row충북 괴산군 청천면 송면리 301, 308(부지 위)
ValueCountFrequency (%)
괴산군 981
 
16.7%
충북 602
 
10.3%
396
 
6.8%
충청북도 298
 
5.1%
사리면 145
 
2.5%
청안면 139
 
2.4%
청천면 124
 
2.1%
장연면 122
 
2.1%
불정면 117
 
2.0%
소수면 103
 
1.8%
Other values (1096) 2835
48.4%
2023-12-12T15:44:15.724087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4848
 
18.9%
1261
 
4.9%
1089
 
4.2%
990
 
3.9%
981
 
3.8%
941
 
3.7%
907
 
3.5%
900
 
3.5%
1 870
 
3.4%
- 722
 
2.8%
Other values (194) 12149
47.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 13867
54.0%
Space Separator 4848
 
18.9%
Decimal Number 4649
 
18.1%
Dash Punctuation 722
 
2.8%
Close Punctuation 566
 
2.2%
Open Punctuation 566
 
2.2%
Other Punctuation 435
 
1.7%
Uppercase Letter 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1261
 
9.1%
1089
 
7.9%
990
 
7.1%
981
 
7.1%
941
 
6.8%
907
 
6.5%
900
 
6.5%
575
 
4.1%
563
 
4.1%
385
 
2.8%
Other values (174) 5275
38.0%
Decimal Number
ValueCountFrequency (%)
1 870
18.7%
2 667
14.3%
3 583
12.5%
5 440
9.5%
6 438
9.4%
4 411
8.8%
7 348
 
7.5%
0 304
 
6.5%
9 304
 
6.5%
8 284
 
6.1%
Other Punctuation
ValueCountFrequency (%)
, 432
99.3%
. 2
 
0.5%
: 1
 
0.2%
Uppercase Letter
ValueCountFrequency (%)
E 2
40.0%
B 2
40.0%
A 1
20.0%
Space Separator
ValueCountFrequency (%)
4848
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 722
100.0%
Close Punctuation
ValueCountFrequency (%)
) 566
100.0%
Open Punctuation
ValueCountFrequency (%)
( 566
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 13867
54.0%
Common 11786
45.9%
Latin 5
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1261
 
9.1%
1089
 
7.9%
990
 
7.1%
981
 
7.1%
941
 
6.8%
907
 
6.5%
900
 
6.5%
575
 
4.1%
563
 
4.1%
385
 
2.8%
Other values (174) 5275
38.0%
Common
ValueCountFrequency (%)
4848
41.1%
1 870
 
7.4%
- 722
 
6.1%
2 667
 
5.7%
3 583
 
4.9%
) 566
 
4.8%
( 566
 
4.8%
5 440
 
3.7%
6 438
 
3.7%
, 432
 
3.7%
Other values (7) 1654
 
14.0%
Latin
ValueCountFrequency (%)
E 2
40.0%
B 2
40.0%
A 1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 13867
54.0%
ASCII 11791
46.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4848
41.1%
1 870
 
7.4%
- 722
 
6.1%
2 667
 
5.7%
3 583
 
4.9%
) 566
 
4.8%
( 566
 
4.8%
5 440
 
3.7%
6 438
 
3.7%
, 432
 
3.7%
Other values (10) 1659
 
14.1%
Hangul
ValueCountFrequency (%)
1261
 
9.1%
1089
 
7.9%
990
 
7.1%
981
 
7.1%
941
 
6.8%
907
 
6.5%
900
 
6.5%
575
 
4.1%
563
 
4.1%
385
 
2.8%
Other values (174) 5275
38.0%

허가용량(kW)
Real number (ℝ)

Distinct340
Distinct (%)33.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean179.5682
Minimum6
Maximum1000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size9.1 KiB
2023-12-12T15:44:15.899359image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum6
5-th percentile19.11
Q162.82
median99.11
Q399.9
95-th percentile996.99
Maximum1000
Range994
Interquartile range (IQR)37.08

Descriptive statistics

Standard deviation254.87749
Coefficient of variation (CV)1.419391
Kurtosis5.0108835
Mean179.5682
Median Absolute Deviation (MAD)19.11
Skewness2.4853538
Sum182441.29
Variance64962.533
MonotonicityNot monotonic
2023-12-12T15:44:16.057681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
99.0 92
 
9.1%
99.6 43
 
4.2%
99.45 34
 
3.3%
99.9 31
 
3.1%
99.36 26
 
2.6%
99.96 22
 
2.2%
99.11 21
 
2.1%
99.23 21
 
2.1%
99.2 20
 
2.0%
19.8 17
 
1.7%
Other values (330) 689
67.8%
ValueCountFrequency (%)
6.0 2
 
0.2%
6.58 2
 
0.2%
9.0 2
 
0.2%
9.45 2
 
0.2%
9.99 1
 
0.1%
10.0 7
0.7%
10.36 1
 
0.1%
11.04 2
 
0.2%
11.25 1
 
0.1%
12.0 2
 
0.2%
ValueCountFrequency (%)
1000.0 6
 
0.6%
999.32 5
 
0.5%
999.05 1
 
0.1%
999.0 8
0.8%
998.64 1
 
0.1%
998.4 4
 
0.4%
998.25 6
 
0.6%
997.92 17
1.7%
997.56 3
 
0.3%
996.8 1
 
0.1%
Distinct423
Distinct (%)41.6%
Missing0
Missing (%)0.0%
Memory size8.1 KiB
Minimum2005-07-06 00:00:00
Maximum2023-03-20 00:00:00
2023-12-12T15:44:16.207434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:44:16.376569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

사업개시일
Date

MISSING 

Distinct278
Distinct (%)45.6%
Missing406
Missing (%)40.0%
Memory size8.1 KiB
Minimum2006-09-28 00:00:00
Maximum2023-02-13 00:00:00
2023-12-12T15:44:16.565347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:44:16.719695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

데이터기준일
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size8.1 KiB
2023-03-21
1016 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-03-21
2nd row2023-03-21
3rd row2023-03-21
4th row2023-03-21
5th row2023-03-21

Common Values

ValueCountFrequency (%)
2023-03-21 1016
100.0%

Length

2023-12-12T15:44:16.867714image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:44:16.957176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-03-21 1016
100.0%

Interactions

2023-12-12T15:44:12.530619image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:44:12.357220image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:44:12.637809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:44:12.448111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T15:44:17.015386image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번허가용량(kW)
연번1.0000.398
허가용량(kW)0.3981.000
2023-12-12T15:44:17.101978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번허가용량(kW)
연번1.0000.007
허가용량(kW)0.0071.000

Missing values

2023-12-12T15:44:12.768561image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:44:12.905553image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번시도시군구구분발전소명사업장소허가용량(kW)허가일사업개시일데이터기준일
01충청북도괴산군태양광관동태양광발전소충북 괴산군 괴산읍 사창리 204-2(건물 위)99.962023-03-20<NA>2023-03-21
12충청북도괴산군태양광명걸태양광발전소충북 괴산군 사리면 중흥리 120(건물위)99.992023-03-20<NA>2023-03-21
23충청북도괴산군태양광정성2호태양광발전소충북 괴산군 연풍면 유상리 358, 360-1(건물위)38.682023-03-20<NA>2023-03-21
34충청북도괴산군태양광정성1호태양광발전소충북 괴산군 연풍면 유상리 358, 360-1(건물위)85.092023-03-20<NA>2023-03-21
45충청북도괴산군태양광하람1호태양광발전소충북 괴산군 청천면 송면리 301, 308(부지 위)98.12023-02-27<NA>2023-03-21
56충청북도괴산군태양광햇빛태양광발전소충북 괴산군 청천면 송면리 320(부지 위)98.12023-02-27<NA>2023-03-21
67충청북도괴산군태양광세여니태양광발전소충북 괴산군 청천면 송면리 30198.12023-02-27<NA>2023-03-21
78충청북도괴산군태양광햇님1호태양광발전소충북 괴산군 청천면 송면리 301(부지 위)414.22023-02-27<NA>2023-03-21
89충청북도괴산군태양광디에스태양광발전소충북 괴산군 청천면 송면리 312(부지 위)98.12023-02-27<NA>2023-03-21
910충청북도괴산군태양광드레곤태양광발전소충북 괴산군 청천면 송면리 312(부지 위)98.12023-02-27<NA>2023-03-21
연번시도시군구구분발전소명사업장소허가용량(kW)허가일사업개시일데이터기준일
10061007충청북도괴산군태양광선창에너지(주)충청북도 괴산군 장연면 장암리 산 15-11000.02008-04-162009-11-112023-03-21
10071008충청북도괴산군태양광청천태양광발전소충청북도 괴산군 청천면 금평진등2길 19288.02008-01-14<NA>2023-03-21
10081009충청북도괴산군태양광괴산태양광발전소충청북도 괴산군 청안면 조천리 303 ,299-199.02007-10-082009-03-192023-03-21
10091010충청북도괴산군태양광(주)블루썬에너지충청북도 괴산군 소수면 소암리 산 91-21000.02007-09-05<NA>2023-03-21
10101011충청북도괴산군태양광(주)선진기공(후평태양광발전소)충청북도 괴산군 청천면 후평리 14829.02007-06-292008-08-042023-03-21
10111012충청북도괴산군태양광검승태양광발전소충청북도 괴산군 괴산읍 검승리 산 12-4500.02007-01-25<NA>2023-03-21
10121013충청북도괴산군태양광에너지디자인 괴산발전소충청북도 괴산군 불정면 한불로앵천6길 11-1, 흙살림연구모임6.02006-07-262006-10-092023-03-21
10131014충청북도괴산군태양광마켓투유주식회사괴산군 불정면 앵천리 254-2(건물 위)9.02006-07-262006-09-282023-03-21
10141015충청북도괴산군태양광미래태양광시스템충청북도 괴산군 문광면 유평리 6-26.582006-02-272009-03-162023-03-21
10151016충청북도괴산군태양광태광쏠라파워(주)충청북도 괴산군 감물면 오창리 산 61000.02005-07-062009-10-222023-03-21