Overview

Dataset statistics

Number of variables6
Number of observations322
Missing cells119
Missing cells (%)6.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory15.9 KiB
Average record size in memory50.4 B

Variable types

Numeric2
Text2
DateTime2

Dataset

Description신재생에너지(전기사업) 발전 허가 현황
Author경상남도 밀양시
URLhttps://www.data.go.kr/data/15034184/fileData.do

Alerts

사업개시일 has 119 (37.0%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 04:39:30.279527
Analysis finished2023-12-12 04:39:31.263842
Duration0.98 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct322
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean161.5
Minimum1
Maximum322
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.0 KiB
2023-12-12T13:39:31.354139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile17.05
Q181.25
median161.5
Q3241.75
95-th percentile305.95
Maximum322
Range321
Interquartile range (IQR)160.5

Descriptive statistics

Standard deviation93.097619
Coefficient of variation (CV)0.57645585
Kurtosis-1.2
Mean161.5
Median Absolute Deviation (MAD)80.5
Skewness0
Sum52003
Variance8667.1667
MonotonicityStrictly increasing
2023-12-12T13:39:31.532182image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.3%
243 1
 
0.3%
221 1
 
0.3%
220 1
 
0.3%
219 1
 
0.3%
218 1
 
0.3%
217 1
 
0.3%
216 1
 
0.3%
215 1
 
0.3%
214 1
 
0.3%
Other values (312) 312
96.9%
ValueCountFrequency (%)
1 1
0.3%
2 1
0.3%
3 1
0.3%
4 1
0.3%
5 1
0.3%
6 1
0.3%
7 1
0.3%
8 1
0.3%
9 1
0.3%
10 1
0.3%
ValueCountFrequency (%)
322 1
0.3%
321 1
0.3%
320 1
0.3%
319 1
0.3%
318 1
0.3%
317 1
0.3%
316 1
0.3%
315 1
0.3%
314 1
0.3%
313 1
0.3%
Distinct313
Distinct (%)97.2%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
2023-12-12T13:39:32.000958image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length16
Mean length8.4565217
Min length4

Characters and Unicode

Total characters2723
Distinct characters241
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique308 ?
Unique (%)95.7%

Sample

1st row제이에스파워1호태양광발전소
2nd row고례리태양광발전소
3rd row등자방태양광발전소
4th row덕곡 태양광발전소
5th row두산판넬건재(두산판넬태양광발전소)
ValueCountFrequency (%)
태양광 163
29.9%
태양광발전소 26
 
4.8%
희망빛 9
 
1.6%
수상태양광 7
 
1.3%
이○○ 4
 
0.7%
김○○ 3
 
0.5%
정○○ 3
 
0.5%
발전소 3
 
0.5%
제2발전소 3
 
0.5%
제1발전소 3
 
0.5%
Other values (314) 322
59.0%
2023-12-12T13:39:32.649981image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
303
 
11.1%
294
 
10.8%
292
 
10.7%
225
 
8.3%
140
 
5.1%
133
 
4.9%
133
 
4.9%
60
 
2.2%
53
 
1.9%
1 45
 
1.7%
Other values (231) 1045
38.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2311
84.9%
Space Separator 225
 
8.3%
Decimal Number 101
 
3.7%
Other Symbol 63
 
2.3%
Uppercase Letter 10
 
0.4%
Open Punctuation 5
 
0.2%
Close Punctuation 5
 
0.2%
Lowercase Letter 2
 
0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
303
 
13.1%
294
 
12.7%
292
 
12.6%
140
 
6.1%
133
 
5.8%
133
 
5.8%
53
 
2.3%
33
 
1.4%
31
 
1.3%
31
 
1.3%
Other values (206) 868
37.6%
Decimal Number
ValueCountFrequency (%)
1 45
44.6%
2 33
32.7%
3 9
 
8.9%
4 4
 
4.0%
6 3
 
3.0%
5 2
 
2.0%
7 2
 
2.0%
8 2
 
2.0%
9 1
 
1.0%
Uppercase Letter
ValueCountFrequency (%)
G 2
20.0%
N 2
20.0%
A 1
10.0%
E 1
10.0%
K 1
10.0%
P 1
10.0%
S 1
10.0%
U 1
10.0%
Other Symbol
ValueCountFrequency (%)
60
95.2%
3
 
4.8%
Lowercase Letter
ValueCountFrequency (%)
y 1
50.0%
s 1
50.0%
Space Separator
ValueCountFrequency (%)
225
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2314
85.0%
Common 397
 
14.6%
Latin 12
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
303
 
13.1%
294
 
12.7%
292
 
12.6%
140
 
6.1%
133
 
5.7%
133
 
5.7%
53
 
2.3%
33
 
1.4%
31
 
1.3%
31
 
1.3%
Other values (207) 871
37.6%
Common
ValueCountFrequency (%)
225
56.7%
60
 
15.1%
1 45
 
11.3%
2 33
 
8.3%
3 9
 
2.3%
( 5
 
1.3%
) 5
 
1.3%
4 4
 
1.0%
6 3
 
0.8%
5 2
 
0.5%
Other values (4) 6
 
1.5%
Latin
ValueCountFrequency (%)
G 2
16.7%
N 2
16.7%
A 1
8.3%
y 1
8.3%
E 1
8.3%
K 1
8.3%
s 1
8.3%
P 1
8.3%
S 1
8.3%
U 1
8.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2311
84.9%
ASCII 349
 
12.8%
Geometric Shapes 60
 
2.2%
None 3
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
303
 
13.1%
294
 
12.7%
292
 
12.6%
140
 
6.1%
133
 
5.8%
133
 
5.8%
53
 
2.3%
33
 
1.4%
31
 
1.3%
31
 
1.3%
Other values (206) 868
37.6%
ASCII
ValueCountFrequency (%)
225
64.5%
1 45
 
12.9%
2 33
 
9.5%
3 9
 
2.6%
( 5
 
1.4%
) 5
 
1.4%
4 4
 
1.1%
6 3
 
0.9%
G 2
 
0.6%
5 2
 
0.6%
Other values (13) 16
 
4.6%
Geometric Shapes
ValueCountFrequency (%)
60
100.0%
None
ValueCountFrequency (%)
3
100.0%

설비허가 용량(kW)
Real number (ℝ)

Distinct187
Distinct (%)58.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean124.00365
Minimum3
Maximum998.64
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.0 KiB
2023-12-12T13:39:32.874619image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3
5-th percentile9.924
Q129.4425
median97.5
Q399.4375
95-th percentile498.96
Maximum998.64
Range995.64
Interquartile range (IQR)69.995

Descriptive statistics

Standard deviation186.49165
Coefficient of variation (CV)1.5039207
Kurtosis12.86744
Mean124.00365
Median Absolute Deviation (MAD)47.84
Skewness3.5303665
Sum39929.175
Variance34779.137
MonotonicityNot monotonic
2023-12-12T13:39:33.086110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10.0 13
 
4.0%
99.0 13
 
4.0%
99.36 12
 
3.7%
3.0 12
 
3.7%
98.55 11
 
3.4%
97.5 8
 
2.5%
97.92 7
 
2.2%
99.28 7
 
2.2%
19.5 6
 
1.9%
99.9 4
 
1.2%
Other values (177) 229
71.1%
ValueCountFrequency (%)
3.0 12
3.7%
3.18 2
 
0.6%
9.6 1
 
0.3%
9.92 2
 
0.6%
10.0 13
4.0%
10.2 1
 
0.3%
10.5 1
 
0.3%
11.34 1
 
0.3%
11.85 1
 
0.3%
14.06 1
 
0.3%
ValueCountFrequency (%)
998.64 3
0.9%
997.92 3
0.9%
997.56 2
0.6%
992.07 1
 
0.3%
831.48 1
 
0.3%
782.56 1
 
0.3%
703.125 2
0.6%
528.9 1
 
0.3%
498.96 4
1.2%
492.75 1
 
0.3%
Distinct295
Distinct (%)91.6%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
2023-12-12T13:39:33.454357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length101
Median length93
Mean length27.173913
Min length16

Characters and Unicode

Total characters8750
Distinct characters132
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique282 ?
Unique (%)87.6%

Sample

1st row경상남도 밀양시 초동면 금포리 571번지
2nd row경상남도 밀양시 단장면 고례리 1498-1, 1498-7
3rd row경상남도 밀양시 산내면 원서리 92-1번지
4th row경상남도 밀양시 부북면 덕곡리 126
5th row경상남도 밀양시 교동 246-30
ValueCountFrequency (%)
경상남도 322
 
18.2%
밀양시 320
 
18.1%
단장면 47
 
2.7%
상남면 43
 
2.4%
무안면 42
 
2.4%
산내면 36
 
2.0%
초동면 30
 
1.7%
부북면 26
 
1.5%
상동면 23
 
1.3%
삼랑진읍 20
 
1.1%
Other values (521) 861
48.6%
2023-12-12T13:39:34.014506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1542
 
17.6%
1 457
 
5.2%
397
 
4.5%
389
 
4.4%
355
 
4.1%
332
 
3.8%
322
 
3.7%
320
 
3.7%
320
 
3.7%
309
 
3.5%
Other values (122) 4007
45.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4881
55.8%
Decimal Number 1870
 
21.4%
Space Separator 1542
 
17.6%
Dash Punctuation 281
 
3.2%
Other Punctuation 172
 
2.0%
Open Punctuation 2
 
< 0.1%
Close Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
397
 
8.1%
389
 
8.0%
355
 
7.3%
332
 
6.8%
322
 
6.6%
320
 
6.6%
320
 
6.6%
309
 
6.3%
304
 
6.2%
302
 
6.2%
Other values (106) 1531
31.4%
Decimal Number
ValueCountFrequency (%)
1 457
24.4%
2 257
13.7%
4 213
11.4%
3 180
 
9.6%
8 163
 
8.7%
9 132
 
7.1%
5 127
 
6.8%
6 125
 
6.7%
7 115
 
6.1%
0 101
 
5.4%
Other Punctuation
ValueCountFrequency (%)
, 170
98.8%
. 2
 
1.2%
Space Separator
ValueCountFrequency (%)
1542
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 281
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4881
55.8%
Common 3869
44.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
397
 
8.1%
389
 
8.0%
355
 
7.3%
332
 
6.8%
322
 
6.6%
320
 
6.6%
320
 
6.6%
309
 
6.3%
304
 
6.2%
302
 
6.2%
Other values (106) 1531
31.4%
Common
ValueCountFrequency (%)
1542
39.9%
1 457
 
11.8%
- 281
 
7.3%
2 257
 
6.6%
4 213
 
5.5%
3 180
 
4.7%
, 170
 
4.4%
8 163
 
4.2%
9 132
 
3.4%
5 127
 
3.3%
Other values (6) 347
 
9.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4881
55.8%
ASCII 3869
44.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1542
39.9%
1 457
 
11.8%
- 281
 
7.3%
2 257
 
6.6%
4 213
 
5.5%
3 180
 
4.7%
, 170
 
4.4%
8 163
 
4.2%
9 132
 
3.4%
5 127
 
3.3%
Other values (6) 347
 
9.0%
Hangul
ValueCountFrequency (%)
397
 
8.1%
389
 
8.0%
355
 
7.3%
332
 
6.8%
322
 
6.6%
320
 
6.6%
320
 
6.6%
309
 
6.3%
304
 
6.2%
302
 
6.2%
Other values (106) 1531
31.4%
Distinct158
Distinct (%)49.1%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
Minimum2012-10-09 00:00:00
Maximum2020-04-03 00:00:00
2023-12-12T13:39:34.188315image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:39:34.727648image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

사업개시일
Date

MISSING 

Distinct121
Distinct (%)59.6%
Missing119
Missing (%)37.0%
Memory size2.6 KiB
Minimum2013-02-08 00:00:00
Maximum2020-03-24 00:00:00
2023-12-12T13:39:34.925945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:39:35.081095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-12T13:39:30.726950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:39:30.527037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:39:30.882238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:39:30.625070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T13:39:35.208758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번설비허가 용량(kW)
연번1.0000.320
설비허가 용량(kW)0.3201.000
2023-12-12T13:39:35.331568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번설비허가 용량(kW)
연번1.0000.115
설비허가 용량(kW)0.1151.000

Missing values

2023-12-12T13:39:31.058081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T13:39:31.214780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번발전소명설비허가 용량(kW)발전소 주소(설치장소)최초허가일사업개시일
01제이에스파워1호태양광발전소100.0경상남도 밀양시 초동면 금포리 571번지2012-10-092013-02-08
12고례리태양광발전소19.32경상남도 밀양시 단장면 고례리 1498-1, 1498-72013-02-052013-04-30
23등자방태양광발전소29.4경상남도 밀양시 산내면 원서리 92-1번지2013-06-252016-04-14
34덕곡 태양광발전소200.0경상남도 밀양시 부북면 덕곡리 1262013-07-262013-12-27
45두산판넬건재(두산판넬태양광발전소)30.0경상남도 밀양시 교동 246-302013-09-162013-12-27
56박○○태양광발전소40.28경상남도 밀양시 초동면 신월리 413-32014-02-102014-04-30
67엠태양광발전소99.11경상남도 밀양시 부북면 위양리 276번지 15호2014-02-122014-06-10
78온누리태양광발전소99.11경상남도 밀양시 부북면 위양리 276번지 16호2014-02-122014-06-10
89큰골농원40.0경상남도 밀양시 산내면 용전리 1535-12014-02-262014-06-10
910석골농장1호태양광발전소97.2경상남도 밀양시 산내면 원서리 1230-152014-03-052014-05-14
연번발전소명설비허가 용량(kW)발전소 주소(설치장소)최초허가일사업개시일
312313미자 태양광99.9경상남도 밀양시 부북면 가산길 62-13 다동, 마동, 사동2020-01-16<NA>
313314재천 태양광141.0경상남도 밀양시 부북면 가산길 62-13 가동, 나동, 라동, 바동2020-01-16<NA>
314315활성1호 태양광99.63경상남도 밀양시 활성동 253번지2020-02-03<NA>
315316태은1호 태양광99.0경상남도 밀양시 상남면 동산리 1168-1(가동)2020-02-14<NA>
316317태은2호 태양광39.375경상남도 밀양시 상남면 동산리 1168-1(다동)2020-02-14<NA>
317318마흘 태양광73.515경상남도 밀양시 무안면 가복3길 372020-03-25<NA>
318319당두 태양광19.5경상남도 밀양시 무안면 사명로 772-8 라동2020-04-02<NA>
319320재식 태양광29.97경상남도 밀양시 상남면 마산리 753-22020-04-02<NA>
320321태승뷰티1호 태양광99.225경상남도 밀양시 부북면 전사포리 88-252020-04-03<NA>
321322태승뷰티2호 태양광29.16경상남도 밀양시 부북면 전사포리 88-252020-04-03<NA>