Overview

Dataset statistics

Number of variables9
Number of observations146
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory10.7 KiB
Average record size in memory74.9 B

Variable types

Text3
Categorical4
Numeric1
DateTime1

Dataset

Description2019년 12월9일 기준 괴산군 발전사업현황(인허가번호, 상호, 설치장소소재지, 영업구분, 원동력의종류, 설비용량KW, 공급전압V, 주파수Hz 허가일자) 데이터파일입니다. 자세한 사항은 괴산군으로 문의주시기 바랍니다.
Author충청북도 괴산군
URLhttps://www.data.go.kr/data/15041864/fileData.do

Alerts

영업구분 has constant value ""Constant
주파수 has constant value ""Constant
원동력의종류 is highly imbalanced (94.1%)Imbalance
공급전압 is highly imbalanced (55.2%)Imbalance
인허가번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 13:24:59.228148
Analysis finished2023-12-12 13:24:59.920119
Duration0.69 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

인허가번호
Text

UNIQUE 

Distinct146
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-12T22:25:00.091462image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length23
Mean length23
Min length23

Characters and Unicode

Total characters3358
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique146 ?
Unique (%)100.0%

Sample

1st row2019-4460000-38-5-00070
2nd row2019-4460000-38-5-00067
3rd row2019-4460000-38-5-00066
4th row2019-4460000-38-5-00053
5th row2019-4460000-38-5-00052
ValueCountFrequency (%)
2019-4460000-38-5-00070 1
 
0.7%
2015-4460000-38-5-00071 1
 
0.7%
2016-4460000-38-5-00063 1
 
0.7%
2016-4460000-38-5-00054 1
 
0.7%
2016-4460000-38-5-00062 1
 
0.7%
2016-4460000-38-5-00059 1
 
0.7%
2016-4460000-38-5-00058 1
 
0.7%
2016-4460000-38-5-00057 1
 
0.7%
2016-4460000-38-5-00056 1
 
0.7%
2016-4460000-38-5-00055 1
 
0.7%
Other values (136) 136
93.2%
2023-12-12T22:25:00.745734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 1179
35.1%
- 584
17.4%
4 319
 
9.5%
1 220
 
6.6%
6 217
 
6.5%
5 204
 
6.1%
8 195
 
5.8%
3 188
 
5.6%
2 179
 
5.3%
7 40
 
1.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2774
82.6%
Dash Punctuation 584
 
17.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1179
42.5%
4 319
 
11.5%
1 220
 
7.9%
6 217
 
7.8%
5 204
 
7.4%
8 195
 
7.0%
3 188
 
6.8%
2 179
 
6.5%
7 40
 
1.4%
9 33
 
1.2%
Dash Punctuation
ValueCountFrequency (%)
- 584
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3358
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 1179
35.1%
- 584
17.4%
4 319
 
9.5%
1 220
 
6.6%
6 217
 
6.5%
5 204
 
6.1%
8 195
 
5.8%
3 188
 
5.6%
2 179
 
5.3%
7 40
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3358
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 1179
35.1%
- 584
17.4%
4 319
 
9.5%
1 220
 
6.6%
6 217
 
6.5%
5 204
 
6.1%
8 195
 
5.8%
3 188
 
5.6%
2 179
 
5.3%
7 40
 
1.2%

상호
Text

Distinct144
Distinct (%)98.6%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-12T22:25:01.032166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length16
Mean length10.171233
Min length7

Characters and Unicode

Total characters1485
Distinct characters169
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique142 ?
Unique (%)97.3%

Sample

1st row배상복태양광발전소
2nd row입암리태양광발전소
3rd row삼원태양광발전소
4th row해광태양광발전소
5th row케이비태양광발전소
ValueCountFrequency (%)
태양광발전소 81
35.2%
매전 2
 
0.9%
사리 2
 
0.9%
태진 2
 
0.9%
죽림 1
 
0.4%
나린태양광발전소 1
 
0.4%
지효5호 1
 
0.4%
배상복태양광발전소 1
 
0.4%
우주 1
 
0.4%
대성 1
 
0.4%
Other values (137) 137
59.6%
2023-12-12T22:25:01.462072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
149
 
10.0%
147
 
9.9%
147
 
9.9%
146
 
9.8%
145
 
9.8%
144
 
9.7%
84
 
5.7%
45
 
3.0%
1 25
 
1.7%
17
 
1.1%
Other values (159) 436
29.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1325
89.2%
Space Separator 84
 
5.7%
Decimal Number 66
 
4.4%
Uppercase Letter 9
 
0.6%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
149
11.2%
147
11.1%
147
11.1%
146
11.0%
145
10.9%
144
10.9%
45
 
3.4%
17
 
1.3%
16
 
1.2%
16
 
1.2%
Other values (141) 353
26.6%
Decimal Number
ValueCountFrequency (%)
1 25
37.9%
2 15
22.7%
5 5
 
7.6%
7 4
 
6.1%
6 4
 
6.1%
3 4
 
6.1%
4 3
 
4.5%
8 2
 
3.0%
9 2
 
3.0%
0 2
 
3.0%
Uppercase Letter
ValueCountFrequency (%)
S 3
33.3%
H 2
22.2%
O 1
 
11.1%
N 1
 
11.1%
A 1
 
11.1%
Y 1
 
11.1%
Space Separator
ValueCountFrequency (%)
84
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1325
89.2%
Common 151
 
10.2%
Latin 9
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
149
11.2%
147
11.1%
147
11.1%
146
11.0%
145
10.9%
144
10.9%
45
 
3.4%
17
 
1.3%
16
 
1.2%
16
 
1.2%
Other values (141) 353
26.6%
Common
ValueCountFrequency (%)
84
55.6%
1 25
 
16.6%
2 15
 
9.9%
5 5
 
3.3%
7 4
 
2.6%
6 4
 
2.6%
3 4
 
2.6%
4 3
 
2.0%
8 2
 
1.3%
9 2
 
1.3%
Other values (2) 3
 
2.0%
Latin
ValueCountFrequency (%)
S 3
33.3%
H 2
22.2%
O 1
 
11.1%
N 1
 
11.1%
A 1
 
11.1%
Y 1
 
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1325
89.2%
ASCII 160
 
10.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
149
11.2%
147
11.1%
147
11.1%
146
11.0%
145
10.9%
144
10.9%
45
 
3.4%
17
 
1.3%
16
 
1.2%
16
 
1.2%
Other values (141) 353
26.6%
ASCII
ValueCountFrequency (%)
84
52.5%
1 25
 
15.6%
2 15
 
9.4%
5 5
 
3.1%
7 4
 
2.5%
6 4
 
2.5%
3 4
 
2.5%
S 3
 
1.9%
4 3
 
1.9%
8 2
 
1.2%
Other values (8) 11
 
6.9%
Distinct125
Distinct (%)85.6%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-12T22:25:01.823817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length54
Median length51
Mean length26.986301
Min length18

Characters and Unicode

Total characters3940
Distinct characters197
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique115 ?
Unique (%)78.8%

Sample

1st row충청북도 괴산군 사리면 이곡리 866번지 869-1, 877
2nd row충청북도 괴산군 소수면 입암리 383번지 1호
3rd row충청북도 괴산군 괴산읍 능촌리 700번지
4th row충청북도 괴산군 장연면 장암리 25번지
5th row충청북도 괴산군 장연면 장암리 228번지
ValueCountFrequency (%)
충청북도 132
 
14.8%
괴산군 125
 
14.0%
사리면 28
 
3.1%
장연면 22
 
2.5%
청천면 20
 
2.2%
1호 19
 
2.1%
17
 
1.9%
3호 17
 
1.9%
청안면 17
 
1.9%
고성리 15
 
1.7%
Other values (301) 481
53.9%
2023-12-12T22:25:02.333733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
747
 
19.0%
184
 
4.7%
159
 
4.0%
142
 
3.6%
137
 
3.5%
1 137
 
3.5%
136
 
3.5%
134
 
3.4%
131
 
3.3%
125
 
3.2%
Other values (187) 1908
48.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2466
62.6%
Space Separator 747
 
19.0%
Decimal Number 623
 
15.8%
Other Punctuation 43
 
1.1%
Dash Punctuation 31
 
0.8%
Open Punctuation 15
 
0.4%
Close Punctuation 15
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
184
 
7.5%
159
 
6.4%
142
 
5.8%
137
 
5.6%
136
 
5.5%
134
 
5.4%
131
 
5.3%
125
 
5.1%
119
 
4.8%
104
 
4.2%
Other values (172) 1095
44.4%
Decimal Number
ValueCountFrequency (%)
1 137
22.0%
3 99
15.9%
2 87
14.0%
0 58
9.3%
4 52
 
8.3%
5 50
 
8.0%
6 47
 
7.5%
9 34
 
5.5%
7 31
 
5.0%
8 28
 
4.5%
Space Separator
ValueCountFrequency (%)
747
100.0%
Other Punctuation
ValueCountFrequency (%)
, 43
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 31
100.0%
Open Punctuation
ValueCountFrequency (%)
( 15
100.0%
Close Punctuation
ValueCountFrequency (%)
) 15
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2466
62.6%
Common 1474
37.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
184
 
7.5%
159
 
6.4%
142
 
5.8%
137
 
5.6%
136
 
5.5%
134
 
5.4%
131
 
5.3%
125
 
5.1%
119
 
4.8%
104
 
4.2%
Other values (172) 1095
44.4%
Common
ValueCountFrequency (%)
747
50.7%
1 137
 
9.3%
3 99
 
6.7%
2 87
 
5.9%
0 58
 
3.9%
4 52
 
3.5%
5 50
 
3.4%
6 47
 
3.2%
, 43
 
2.9%
9 34
 
2.3%
Other values (5) 120
 
8.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2466
62.6%
ASCII 1474
37.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
747
50.7%
1 137
 
9.3%
3 99
 
6.7%
2 87
 
5.9%
0 58
 
3.9%
4 52
 
3.5%
5 50
 
3.4%
6 47
 
3.2%
, 43
 
2.9%
9 34
 
2.3%
Other values (5) 120
 
8.1%
Hangul
ValueCountFrequency (%)
184
 
7.5%
159
 
6.4%
142
 
5.8%
137
 
5.6%
136
 
5.5%
134
 
5.4%
131
 
5.3%
125
 
5.1%
119
 
4.8%
104
 
4.2%
Other values (172) 1095
44.4%

영업구분
Categorical

CONSTANT 

Distinct1
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
사업개시
146 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row사업개시
2nd row사업개시
3rd row사업개시
4th row사업개시
5th row사업개시

Common Values

ValueCountFrequency (%)
사업개시 146
100.0%

Length

2023-12-12T22:25:02.487200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:25:02.595657image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사업개시 146
100.0%

원동력의종류
Categorical

IMBALANCE 

Distinct2
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
태양광
145 
지열
 
1

Length

Max length3
Median length3
Mean length2.9931507
Min length2

Unique

Unique1 ?
Unique (%)0.7%

Sample

1st row태양광
2nd row태양광
3rd row태양광
4th row태양광
5th row태양광

Common Values

ValueCountFrequency (%)
태양광 145
99.3%
지열 1
 
0.7%

Length

2023-12-12T22:25:02.701314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:25:02.808627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
태양광 145
99.3%
지열 1
 
0.7%

설비용량
Real number (ℝ)

Distinct55
Distinct (%)37.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean98.565685
Minimum15
Maximum981.18
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.4 KiB
2023-12-12T22:25:02.913664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum15
5-th percentile19.8
Q174.25
median99
Q399.2
95-th percentile194.74
Maximum981.18
Range966.18
Interquartile range (IQR)24.95

Descriptive statistics

Standard deviation94.368
Coefficient of variation (CV)0.95741231
Kurtosis54.068641
Mean98.565685
Median Absolute Deviation (MAD)0.69
Skewness6.3198796
Sum14390.59
Variance8905.3194
MonotonicityNot monotonic
2023-12-12T22:25:03.056797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
99.0 28
19.2%
99.23 15
 
10.3%
99.11 15
 
10.3%
19.8 15
 
10.3%
97.28 8
 
5.5%
99.2 8
 
5.5%
96.0 3
 
2.1%
99.36 2
 
1.4%
99.96 2
 
1.4%
99.71 2
 
1.4%
Other values (45) 48
32.9%
ValueCountFrequency (%)
15.0 1
 
0.7%
19.2 1
 
0.7%
19.8 15
10.3%
24.85 1
 
0.7%
25.0 1
 
0.7%
25.35 1
 
0.7%
28.8 1
 
0.7%
29.25 1
 
0.7%
29.6 1
 
0.7%
30.0 1
 
0.7%
ValueCountFrequency (%)
981.18 1
0.7%
408.24 1
0.7%
399.6 1
0.7%
299.88 1
0.7%
299.7 1
0.7%
297.92 1
0.7%
201.28 1
0.7%
196.0 1
0.7%
190.96 1
0.7%
167.69 1
0.7%

공급전압
Categorical

IMBALANCE 

Distinct7
Distinct (%)4.8%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
380
112 
380또는220
13 
220또는380
 
11
380 또는 220
 
6
22900
 
2
Other values (2)
 
2

Length

Max length10
Median length3
Mean length4.1643836
Min length3

Unique

Unique2 ?
Unique (%)1.4%

Sample

1st row380
2nd row380
3rd row22900
4th row380
5th row380

Common Values

ValueCountFrequency (%)
380 112
76.7%
380또는220 13
 
8.9%
220또는380 11
 
7.5%
380 또는 220 6
 
4.1%
22900 2
 
1.4%
220/380 1
 
0.7%
220 1
 
0.7%

Length

2023-12-12T22:25:03.208530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:25:03.336101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
380 118
74.7%
380또는220 13
 
8.2%
220또는380 11
 
7.0%
220 7
 
4.4%
또는 6
 
3.8%
22900 2
 
1.3%
220/380 1
 
0.6%

주파수
Categorical

CONSTANT 

Distinct1
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
60
146 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row60
2nd row60
3rd row60
4th row60
5th row60

Common Values

ValueCountFrequency (%)
60 146
100.0%

Length

2023-12-12T22:25:03.460113image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:25:03.566364image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
60 146
100.0%
Distinct73
Distinct (%)50.0%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
Minimum2012-11-05 00:00:00
Maximum2019-05-31 00:00:00
2023-12-12T22:25:03.686021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:25:03.864207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-12T22:24:59.583986image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T22:25:03.964514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
원동력의종류설비용량공급전압허가일자
원동력의종류1.0000.0000.1990.000
설비용량0.0001.0000.4850.973
공급전압0.1990.4851.0000.999
허가일자0.0000.9730.9991.000
2023-12-12T22:25:04.069779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
공급전압원동력의종류
공급전압1.0000.209
원동력의종류0.2091.000
2023-12-12T22:25:04.181223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
설비용량원동력의종류공급전압
설비용량1.0000.0000.313
원동력의종류0.0001.0000.209
공급전압0.3130.2091.000

Missing values

2023-12-12T22:24:59.716357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T22:24:59.857588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

인허가번호상호설치장소소재지영업구분원동력의종류설비용량공급전압주파수허가일자
02019-4460000-38-5-00070배상복태양광발전소충청북도 괴산군 사리면 이곡리 866번지 869-1, 877사업개시태양광167.69380602019-03-11
12019-4460000-38-5-00067입암리태양광발전소충청북도 괴산군 소수면 입암리 383번지 1호사업개시태양광201.28380602019-04-02
22019-4460000-38-5-00066삼원태양광발전소충청북도 괴산군 괴산읍 능촌리 700번지사업개시태양광981.1822900602019-01-24
32019-4460000-38-5-00053해광태양광발전소충청북도 괴산군 장연면 장암리 25번지사업개시태양광299.7380602018-11-11
42019-4460000-38-5-00052케이비태양광발전소충청북도 괴산군 장연면 장암리 228번지사업개시태양광399.6380602018-11-14
52019-4460000-38-5-00037신풍태양광발전소충청북도 괴산군 연풍면 원풍리 555번지 3호사업개시태양광190.96380602019-05-31
62019-4460000-38-5-00034청안버섯태양광발전소충청북도 괴산군 청안면 읍내리 139번지사업개시태양광196.0380602018-06-07
72019-4460000-38-5-00033비손태양광발전소충청북도 괴산군 불정면 신흥리 222번지사업개시태양광125.92380602018-09-11
82019-4460000-38-5-00025선유 태양광발전소충청북도 괴산군 연풍면 적석리 458번지 3호사업개시태양광19.2380602019-05-02
92019-4460000-38-5-00024사담리태양광발전소4호충청북도 괴산군 사리면 사담리 산 2번지 40호사업개시태양광408.24380602018-04-09
인허가번호상호설치장소소재지영업구분원동력의종류설비용량공급전압주파수허가일자
1362014-4460000-38-5-00010세종2호 태양광발전소충청북도 괴산군 청안면 효근리 477번지 2호사업개시태양광99.0380602014-09-23
1372013-4460000-38-5-00179아성11호태양광발전소경기도 성남시 분당구 대왕판교로 660, 유스페이스1 비동 703호 (삼평동)사업개시태양광99.11380602018-09-27
1382013-4460000-38-5-00177아성9호태양광발전소경상남도 함안군 군북면 중암4길 65사업개시태양광99.11380602018-09-27
1392013-4460000-38-5-00176아성8호태양광발전소인천광역시 서구 청라에메랄드로 156, 206동 1604호 (청라동, 청라지구17단지웰카운티)사업개시태양광99.11380602018-09-27
1402013-4460000-38-5-00174아성6호태양광발전소서울특별시 송파구 풍성로 14, 1동 1102호 (풍납동, 미성맨션)사업개시태양광99.11380602018-09-27
1412013-4460000-38-5-00173아성5호태양광발전소서울특별시 용산구 이촌로 248, 23동 305호 (이촌동, 한강맨션)사업개시태양광99.11380602018-09-27
1422013-4460000-38-5-00169아성1호태양광발전소충청남도 금산군 부리면 선원길 6사업개시태양광99.11380602018-09-27
1432013-4460000-38-5-00032매전 제2호 태양광발전소충청북도 괴산군 감물면 매전1길 56사업개시태양광36.54380 또는 220602013-03-07
1442013-4460000-38-5-00031매전 제1호 태양광발전소충청북도 괴산군 감물면 매전리 902번지 7호사업개시태양광57.42380 또는 220602013-03-07
1452012-4460000-38-5-00095용훈태양광발전소충청북도 괴산군 감물면 광전5길 37사업개시태양광15.0380602012-11-05