Overview

Dataset statistics

Number of variables8
Number of observations740
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory47.8 KiB
Average record size in memory66.2 B

Variable types

Text2
Categorical4
Numeric1
DateTime1

Dataset

Description충청북도 제천시의 태양광 발전소설치현황입니다. 발전소명, 최초허가일, 설비용량, 공급전압, 주파수, 사업상태, 데이터기준일자로 구성되어 있습니다.
Author충청북도 제천시
URLhttps://www.data.go.kr/data/15033904/fileData.do

Alerts

주파수(hz) has constant value ""Constant
데이터기준일자 has constant value ""Constant
공급전압(V) is highly imbalanced (65.2%)Imbalance

Reproduction

Analysis started2023-12-12 23:16:05.716883
Analysis finished2023-12-12 23:16:06.416603
Duration0.7 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct709
Distinct (%)95.8%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
2023-12-13T08:16:06.589898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length27
Mean length6.2878378
Min length1

Characters and Unicode

Total characters4653
Distinct characters356
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique681 ?
Unique (%)92.0%

Sample

1st row광명
2nd row이형숙
3rd row영서두학 1호
4th row영서두학
5th row해솔태양광2
ValueCountFrequency (%)
태양광발전소 34
 
3.7%
마을회 19
 
2.1%
태양광 18
 
2.0%
충북 17
 
1.9%
협동조합 17
 
1.9%
1호 11
 
1.2%
제천행복 5
 
0.5%
2호 5
 
0.5%
발전소(덕곡리 5
 
0.5%
박달재 5
 
0.5%
Other values (725) 782
85.2%
2023-12-13T08:16:07.025560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
281
 
6.0%
185
 
4.0%
178
 
3.8%
177
 
3.8%
177
 
3.8%
177
 
3.8%
160
 
3.4%
160
 
3.4%
1 138
 
3.0%
2 130
 
2.8%
Other values (346) 2890
62.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3822
82.1%
Decimal Number 404
 
8.7%
Space Separator 178
 
3.8%
Uppercase Letter 129
 
2.8%
Close Punctuation 41
 
0.9%
Open Punctuation 41
 
0.9%
Lowercase Letter 30
 
0.6%
Dash Punctuation 8
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
281
 
7.4%
185
 
4.8%
177
 
4.6%
177
 
4.6%
177
 
4.6%
160
 
4.2%
160
 
4.2%
92
 
2.4%
88
 
2.3%
83
 
2.2%
Other values (304) 2242
58.7%
Uppercase Letter
ValueCountFrequency (%)
S 33
25.6%
M 26
20.2%
L 25
19.4%
P 8
 
6.2%
D 7
 
5.4%
Y 6
 
4.7%
V 5
 
3.9%
H 4
 
3.1%
B 4
 
3.1%
C 4
 
3.1%
Other values (4) 7
 
5.4%
Lowercase Letter
ValueCountFrequency (%)
o 5
16.7%
e 5
16.7%
r 4
13.3%
k 3
10.0%
a 2
 
6.7%
c 2
 
6.7%
p 2
 
6.7%
y 1
 
3.3%
l 1
 
3.3%
f 1
 
3.3%
Other values (4) 4
13.3%
Decimal Number
ValueCountFrequency (%)
1 138
34.2%
2 130
32.2%
3 45
 
11.1%
5 21
 
5.2%
4 20
 
5.0%
6 12
 
3.0%
7 10
 
2.5%
0 10
 
2.5%
8 10
 
2.5%
9 8
 
2.0%
Space Separator
ValueCountFrequency (%)
178
100.0%
Close Punctuation
ValueCountFrequency (%)
) 41
100.0%
Open Punctuation
ValueCountFrequency (%)
( 41
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3822
82.1%
Common 672
 
14.4%
Latin 159
 
3.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
281
 
7.4%
185
 
4.8%
177
 
4.6%
177
 
4.6%
177
 
4.6%
160
 
4.2%
160
 
4.2%
92
 
2.4%
88
 
2.3%
83
 
2.2%
Other values (304) 2242
58.7%
Latin
ValueCountFrequency (%)
S 33
20.8%
M 26
16.4%
L 25
15.7%
P 8
 
5.0%
D 7
 
4.4%
Y 6
 
3.8%
V 5
 
3.1%
o 5
 
3.1%
e 5
 
3.1%
H 4
 
2.5%
Other values (18) 35
22.0%
Common
ValueCountFrequency (%)
178
26.5%
1 138
20.5%
2 130
19.3%
3 45
 
6.7%
) 41
 
6.1%
( 41
 
6.1%
5 21
 
3.1%
4 20
 
3.0%
6 12
 
1.8%
7 10
 
1.5%
Other values (4) 36
 
5.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3822
82.1%
ASCII 831
 
17.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
281
 
7.4%
185
 
4.8%
177
 
4.6%
177
 
4.6%
177
 
4.6%
160
 
4.2%
160
 
4.2%
92
 
2.4%
88
 
2.3%
83
 
2.2%
Other values (304) 2242
58.7%
ASCII
ValueCountFrequency (%)
178
21.4%
1 138
16.6%
2 130
15.6%
3 45
 
5.4%
) 41
 
4.9%
( 41
 
4.9%
S 33
 
4.0%
M 26
 
3.1%
L 25
 
3.0%
5 21
 
2.5%
Other values (32) 153
18.4%
Distinct536
Distinct (%)72.4%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
2023-12-13T08:16:07.368786image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length75
Median length48
Mean length26.544595
Min length18

Characters and Unicode

Total characters19643
Distinct characters189
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique459 ?
Unique (%)62.0%

Sample

1st row충청북도 제천시 송학면 송학주천로11길 32
2nd row충청북도 제천시 송학면 시곡포전로 164
3rd row충청북도 제천시 의병대로42길 109(두학동)
4th row충청북도 제천시 의병대로42길 109(두학동)
5th row충청북도 제천시 의병대로42길 109(두학동)
ValueCountFrequency (%)
충청북도 740
 
16.6%
제천시 739
 
16.6%
봉양읍 179
 
4.0%
송학면 153
 
3.4%
141
 
3.2%
121
 
2.7%
1호 73
 
1.6%
공전리 66
 
1.5%
백운면 57
 
1.3%
57
 
1.3%
Other values (839) 2120
47.7%
2023-12-13T08:16:08.125920image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3706
 
18.9%
817
 
4.2%
778
 
4.0%
1 769
 
3.9%
769
 
3.9%
768
 
3.9%
755
 
3.8%
753
 
3.8%
741
 
3.8%
2 503
 
2.6%
Other values (179) 9284
47.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 11239
57.2%
Space Separator 3706
 
18.9%
Decimal Number 3474
 
17.7%
Dash Punctuation 371
 
1.9%
Other Punctuation 312
 
1.6%
Open Punctuation 265
 
1.3%
Close Punctuation 265
 
1.3%
Uppercase Letter 11
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
817
 
7.3%
778
 
6.9%
769
 
6.8%
768
 
6.8%
755
 
6.7%
753
 
6.7%
741
 
6.6%
374
 
3.3%
369
 
3.3%
352
 
3.1%
Other values (154) 4763
42.4%
Decimal Number
ValueCountFrequency (%)
1 769
22.1%
2 503
14.5%
3 366
10.5%
4 327
9.4%
6 288
 
8.3%
7 257
 
7.4%
5 253
 
7.3%
9 253
 
7.3%
0 247
 
7.1%
8 211
 
6.1%
Uppercase Letter
ValueCountFrequency (%)
E 2
18.2%
T 2
18.2%
M 1
9.1%
A 1
9.1%
R 1
9.1%
B 1
9.1%
C 1
9.1%
D 1
9.1%
K 1
9.1%
Other Punctuation
ValueCountFrequency (%)
, 306
98.1%
. 6
 
1.9%
Space Separator
ValueCountFrequency (%)
3706
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 371
100.0%
Open Punctuation
ValueCountFrequency (%)
( 265
100.0%
Close Punctuation
ValueCountFrequency (%)
) 265
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 11239
57.2%
Common 8393
42.7%
Latin 11
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
817
 
7.3%
778
 
6.9%
769
 
6.8%
768
 
6.8%
755
 
6.7%
753
 
6.7%
741
 
6.6%
374
 
3.3%
369
 
3.3%
352
 
3.1%
Other values (154) 4763
42.4%
Common
ValueCountFrequency (%)
3706
44.2%
1 769
 
9.2%
2 503
 
6.0%
- 371
 
4.4%
3 366
 
4.4%
4 327
 
3.9%
, 306
 
3.6%
6 288
 
3.4%
( 265
 
3.2%
) 265
 
3.2%
Other values (6) 1227
 
14.6%
Latin
ValueCountFrequency (%)
E 2
18.2%
T 2
18.2%
M 1
9.1%
A 1
9.1%
R 1
9.1%
B 1
9.1%
C 1
9.1%
D 1
9.1%
K 1
9.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 11239
57.2%
ASCII 8404
42.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3706
44.1%
1 769
 
9.2%
2 503
 
6.0%
- 371
 
4.4%
3 366
 
4.4%
4 327
 
3.9%
, 306
 
3.6%
6 288
 
3.4%
( 265
 
3.2%
) 265
 
3.2%
Other values (15) 1238
 
14.7%
Hangul
ValueCountFrequency (%)
817
 
7.3%
778
 
6.9%
769
 
6.8%
768
 
6.8%
755
 
6.7%
753
 
6.7%
741
 
6.6%
374
 
3.3%
369
 
3.3%
352
 
3.1%
Other values (154) 4763
42.4%

사업상태
Categorical

Distinct4
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
사업개시
451 
인허가취소
130 
인허가
126 
공사진행
 
33

Length

Max length5
Median length4
Mean length4.0054054
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row인허가
2nd row인허가
3rd row인허가
4th row인허가
5th row인허가

Common Values

ValueCountFrequency (%)
사업개시 451
60.9%
인허가취소 130
 
17.6%
인허가 126
 
17.0%
공사진행 33
 
4.5%

Length

2023-12-13T08:16:08.257174image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:16:08.358692image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사업개시 451
60.9%
인허가취소 130
 
17.6%
인허가 126
 
17.0%
공사진행 33
 
4.5%

설비용량(kw)
Real number (ℝ)

Distinct314
Distinct (%)42.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean166.53495
Minimum9
Maximum1000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size6.6 KiB
2023-12-13T08:16:08.473572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum9
5-th percentile19.14
Q149.95
median99
Q399.68
95-th percentile978.516
Maximum1000
Range991
Interquartile range (IQR)49.73

Descriptive statistics

Standard deviation244.47631
Coefficient of variation (CV)1.4680181
Kurtosis5.6728701
Mean166.53495
Median Absolute Deviation (MAD)9.12
Skewness2.6152705
Sum123235.86
Variance59768.667
MonotonicityNot monotonic
2023-12-13T08:16:08.611347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
99.0 70
 
9.5%
99.23 44
 
5.9%
99.2 31
 
4.2%
98.56 20
 
2.7%
19.8 20
 
2.7%
99.68 19
 
2.6%
99.6 19
 
2.6%
98.28 19
 
2.6%
99.28 17
 
2.3%
99.45 15
 
2.0%
Other values (304) 466
63.0%
ValueCountFrequency (%)
9.0 2
0.3%
10.8 1
0.1%
11.1 1
0.1%
12.6 1
0.1%
14.08 1
0.1%
14.18 1
0.1%
15.0 2
0.3%
15.19 1
0.1%
15.5 1
0.1%
15.75 1
0.1%
ValueCountFrequency (%)
1000.0 3
0.4%
999.92 1
 
0.1%
999.58 1
 
0.1%
999.36 1
 
0.1%
999.0 1
 
0.1%
998.87 1
 
0.1%
998.82 1
 
0.1%
998.4 7
0.9%
997.92 1
 
0.1%
996.96 1
 
0.1%

공급전압(V)
Categorical

IMBALANCE 

Distinct9
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
380
603 
22,900
 
58
22900
 
26
380/220
 
20
220/380
 
19
Other values (4)
 
14

Length

Max length10
Median length3
Mean length3.5378378
Min length3

Unique

Unique3 ?
Unique (%)0.4%

Sample

1st row380
2nd row380
3rd row380
4th row380
5th row380

Common Values

ValueCountFrequency (%)
380 603
81.5%
22,900 58
 
7.8%
22900 26
 
3.5%
380/220 20
 
2.7%
220/380 19
 
2.6%
220 11
 
1.5%
380,220 1
 
0.1%
380 또는 220 1
 
0.1%
380또는220 1
 
0.1%

Length

2023-12-13T08:16:08.760127image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:16:08.893326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
380 604
81.4%
22,900 58
 
7.8%
22900 26
 
3.5%
380/220 20
 
2.7%
220/380 19
 
2.6%
220 12
 
1.6%
380,220 1
 
0.1%
또는 1
 
0.1%
380또는220 1
 
0.1%

주파수(hz)
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
60
740 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row60
2nd row60
3rd row60
4th row60
5th row60

Common Values

ValueCountFrequency (%)
60 740
100.0%

Length

2023-12-13T08:16:09.012614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:16:09.114337image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
60 740
100.0%
Distinct321
Distinct (%)43.4%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
Minimum2006-05-15 00:00:00
Maximum2022-09-29 00:00:00
2023-12-13T08:16:09.210329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:16:09.354228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
2022-10-07
740 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022-10-07
2nd row2022-10-07
3rd row2022-10-07
4th row2022-10-07
5th row2022-10-07

Common Values

ValueCountFrequency (%)
2022-10-07 740
100.0%

Length

2023-12-13T08:16:09.488212image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:16:09.579583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022-10-07 740
100.0%

Interactions

2023-12-13T08:16:06.115927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T08:16:09.644949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업상태설비용량(kw)공급전압(V)
사업상태1.0000.2760.266
설비용량(kw)0.2761.0000.562
공급전압(V)0.2660.5621.000
2023-12-13T08:16:09.749243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
공급전압(V)사업상태
공급전압(V)1.0000.172
사업상태0.1721.000
2023-12-13T08:16:09.833512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
설비용량(kw)사업상태공급전압(V)
설비용량(kw)1.0000.1670.296
사업상태0.1671.0000.172
공급전압(V)0.2960.1721.000

Missing values

2023-12-13T08:16:06.240445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:16:06.370654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

발전소명도로명주소사업상태설비용량(kw)공급전압(V)주파수(hz)허가일자데이터기준일자
0광명충청북도 제천시 송학면 송학주천로11길 32인허가34.8380602022-09-292022-10-07
1이형숙충청북도 제천시 송학면 시곡포전로 164인허가19.7380602022-09-292022-10-07
2영서두학 1호충청북도 제천시 의병대로42길 109(두학동)인허가99.68380602022-09-192022-10-07
3영서두학충청북도 제천시 의병대로42길 109(두학동)인허가99.68380602022-09-192022-10-07
4해솔태양광2충청북도 제천시 의병대로42길 109(두학동)인허가99.68380602022-09-192022-10-07
5해솔태양광1충청북도 제천시 의병대로42길 109(두학동)인허가99.68380602022-09-192022-10-07
6한빛충청북도 제천시 의병대로42길 109(두학동)인허가99.68380602022-09-192022-10-07
7제천행복 8호충청북도 제천시 의병대로42길 109(두학동)인허가16.02380602022-09-192022-10-07
8제천행복 7호충청북도 제천시 의병대로42길 109(두학동)인허가29.82380602022-09-192022-10-07
9제천행복 4호충청북도 제천시 의병대로42길 109(두학동)인허가99.68380602022-09-192022-10-07
발전소명도로명주소사업상태설비용량(kw)공급전압(V)주파수(hz)허가일자데이터기준일자
730명문에너지(주)2호충청북도 제천시 세명로 65, 세명대학교 (신월동)사업개시417.2422900602015-01-192022-10-07
731영서충청북도 제천시 송학면 장곡리 산 20번지 1호 328-3사업개시600.022900602015-01-192022-10-07
732명문에너지(주)충청북도 제천시 대학로 316, 대원대학 (신월동)사업개시820.822900602014-09-162022-10-07
733(주)한국유통충청북도 제천시 하소로 58 (하소동)인허가취소295.222,900602012-09-282022-10-07
734원박충청북도 제천시 봉양읍 원박리 67번지 73, 74-1, 75-2인허가취소199.6422900602011-03-232022-10-07
735명도에너지충청북도 제천시 봉양읍 용두대로44길 59-9사업개시196.022,900602010-04-212022-10-07
736진성에너지충청북도 제천시 고명로5길 37 (고명동)인허가취소197.0380602008-08-062022-10-07
737미래에너지(주)충청북도 제천시 봉양읍 삼거리 산 51번지사업개시976.3222,900602008-03-052022-10-07
738시공건설(주)충청북도 제천시 송학면 시곡리 산 56번지 7호인허가취소1000.022,900602007-10-042022-10-07
739(주)지앤지컨설턴트 에프다아이충청북도 제천시 수산면 오티리 189번지인허가취소820.022,900602006-05-152022-10-07