Overview

Dataset statistics

Number of variables6
Number of observations862
Missing cells283
Missing cells (%)5.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory41.4 KiB
Average record size in memory49.2 B

Variable types

Text2
Numeric1
DateTime3

Dataset

Description경상남도 진주시 소재 태양광발전소의 정보(태양광발전소명, 설비용량, 발전소주소, 최초허가일, 사업개시일)를 제공합니다.
Author경상남도 진주시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15033945

Alerts

기준일자 has constant value ""Constant
사업개시일 has 283 (32.8%) missing valuesMissing

Reproduction

Analysis started2023-12-29 22:16:40.656589
Analysis finished2023-12-29 22:16:42.651070
Duration1.99 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct853
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Memory size6.9 KiB
2023-12-29T22:16:43.218379image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length22
Mean length10.561485
Min length4

Characters and Unicode

Total characters9104
Distinct characters358
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique845 ?
Unique (%)98.0%

Sample

1st row햇살그린발전소 255호
2nd row햇살그린발전소 254호
3rd row씨엔태양광발전소8
4th row씨엔태양광발전소9
5th row성광태양광발전소
ValueCountFrequency (%)
태양광발전소 579
36.1%
발전소 43
 
2.7%
태양광 34
 
2.1%
1호 9
 
0.6%
2호 8
 
0.5%
모햇태양광발전소 7
 
0.4%
3호 5
 
0.3%
제2발전소 5
 
0.3%
에너지 4
 
0.2%
신성메탈 4
 
0.2%
Other values (858) 905
56.5%
2023-12-29T22:16:44.506214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
808
 
8.9%
806
 
8.9%
804
 
8.8%
794
 
8.7%
791
 
8.7%
791
 
8.7%
741
 
8.1%
286
 
3.1%
1 160
 
1.8%
2 149
 
1.6%
Other values (348) 2974
32.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7709
84.7%
Space Separator 741
 
8.1%
Decimal Number 469
 
5.2%
Uppercase Letter 66
 
0.7%
Close Punctuation 37
 
0.4%
Open Punctuation 36
 
0.4%
Lowercase Letter 20
 
0.2%
Dash Punctuation 13
 
0.1%
Other Punctuation 8
 
0.1%
Other Symbol 5
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
808
 
10.5%
806
 
10.5%
804
 
10.4%
794
 
10.3%
791
 
10.3%
791
 
10.3%
286
 
3.7%
84
 
1.1%
75
 
1.0%
74
 
1.0%
Other values (308) 2396
31.1%
Uppercase Letter
ValueCountFrequency (%)
E 10
15.2%
S 9
13.6%
G 6
 
9.1%
N 5
 
7.6%
U 4
 
6.1%
P 4
 
6.1%
X 4
 
6.1%
K 3
 
4.5%
J 3
 
4.5%
M 3
 
4.5%
Other values (8) 15
22.7%
Decimal Number
ValueCountFrequency (%)
1 160
34.1%
2 149
31.8%
3 59
 
12.6%
4 28
 
6.0%
5 17
 
3.6%
6 16
 
3.4%
8 11
 
2.3%
9 11
 
2.3%
7 11
 
2.3%
0 7
 
1.5%
Lowercase Letter
ValueCountFrequency (%)
c 4
20.0%
o 4
20.0%
p 4
20.0%
e 4
20.0%
k 4
20.0%
Other Punctuation
ValueCountFrequency (%)
. 4
50.0%
# 4
50.0%
Space Separator
ValueCountFrequency (%)
741
100.0%
Close Punctuation
ValueCountFrequency (%)
) 37
100.0%
Open Punctuation
ValueCountFrequency (%)
( 36
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%
Other Symbol
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7714
84.7%
Common 1304
 
14.3%
Latin 86
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
808
 
10.5%
806
 
10.4%
804
 
10.4%
794
 
10.3%
791
 
10.3%
791
 
10.3%
286
 
3.7%
84
 
1.1%
75
 
1.0%
74
 
1.0%
Other values (309) 2401
31.1%
Latin
ValueCountFrequency (%)
E 10
 
11.6%
S 9
 
10.5%
G 6
 
7.0%
N 5
 
5.8%
c 4
 
4.7%
o 4
 
4.7%
p 4
 
4.7%
e 4
 
4.7%
k 4
 
4.7%
U 4
 
4.7%
Other values (13) 32
37.2%
Common
ValueCountFrequency (%)
741
56.8%
1 160
 
12.3%
2 149
 
11.4%
3 59
 
4.5%
) 37
 
2.8%
( 36
 
2.8%
4 28
 
2.1%
5 17
 
1.3%
6 16
 
1.2%
- 13
 
1.0%
Other values (6) 48
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7709
84.7%
ASCII 1390
 
15.3%
None 5
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
808
 
10.5%
806
 
10.5%
804
 
10.4%
794
 
10.3%
791
 
10.3%
791
 
10.3%
286
 
3.7%
84
 
1.1%
75
 
1.0%
74
 
1.0%
Other values (308) 2396
31.1%
ASCII
ValueCountFrequency (%)
741
53.3%
1 160
 
11.5%
2 149
 
10.7%
3 59
 
4.2%
) 37
 
2.7%
( 36
 
2.6%
4 28
 
2.0%
5 17
 
1.2%
6 16
 
1.2%
- 13
 
0.9%
Other values (29) 134
 
9.6%
None
ValueCountFrequency (%)
5
100.0%

설비용량(KW)
Real number (ℝ)

Distinct367
Distinct (%)42.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean139.98255
Minimum0
Maximum5192
Zeros1
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size7.7 KiB
2023-12-29T22:16:44.781880image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile19.806
Q186.25
median99.22
Q399.96
95-th percentile473.671
Maximum5192
Range5192
Interquartile range (IQR)13.71

Descriptive statistics

Standard deviation231.70287
Coefficient of variation (CV)1.6552268
Kurtosis264.67101
Mean139.98255
Median Absolute Deviation (MAD)2.02
Skewness13.104509
Sum120664.96
Variance53686.219
MonotonicityNot monotonic
2023-12-29T22:16:45.065522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
99.96 74
 
8.6%
99.0 55
 
6.4%
99.76 27
 
3.1%
99.9 26
 
3.0%
97.2 24
 
2.8%
99.71 21
 
2.4%
99.6 18
 
2.1%
98.28 18
 
2.1%
98.77 17
 
2.0%
99.84 15
 
1.7%
Other values (357) 567
65.8%
ValueCountFrequency (%)
0.0 1
0.1%
9.75 1
0.1%
10.13 1
0.1%
10.2 1
0.1%
10.92 1
0.1%
13.88 1
0.1%
14.0 1
0.1%
14.04 2
0.2%
14.6 2
0.2%
14.72 1
0.1%
ValueCountFrequency (%)
5192.0 1
 
0.1%
1000.0 3
0.3%
999.81 1
 
0.1%
995.0 1
 
0.1%
990.0 1
 
0.1%
972.23 1
 
0.1%
960.0 1
 
0.1%
958.32 1
 
0.1%
939.58 1
 
0.1%
925.0 1
 
0.1%
Distinct699
Distinct (%)81.1%
Missing0
Missing (%)0.0%
Memory size6.9 KiB
2023-12-29T22:16:45.580148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length76
Median length59
Mean length30.952436
Min length15

Characters and Unicode

Total characters26681
Distinct characters181
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique600 ?
Unique (%)69.6%

Sample

1st row경상남도 진주시 문산읍 월아산로996번길 88(공장 건물 위)
2nd row경상남도 진주시 문산읍 월아산로996번길 88(공장 건물 위)
3rd row경상남도 진주시 진성면 동부로1602번길 48(공장 건물 위)
4th row경상남도 진주시 정촌면 산업로 119(공장 건물 위)
5th row경상남도 진주시 진성면 동부로1602번길 48(공장 건물 위)
ValueCountFrequency (%)
경상남도 862
 
14.7%
진주시 857
 
14.6%
535
 
9.1%
건물 282
 
4.8%
토지 137
 
2.3%
미천면 97
 
1.7%
정촌면 89
 
1.5%
사봉면 86
 
1.5%
수곡면 81
 
1.4%
진성면 78
 
1.3%
Other values (1132) 2747
46.9%
2023-12-29T22:16:46.477339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4995
 
18.7%
983
 
3.7%
1 974
 
3.7%
940
 
3.5%
895
 
3.4%
880
 
3.3%
868
 
3.3%
868
 
3.3%
863
 
3.2%
734
 
2.8%
Other values (171) 13681
51.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 14967
56.1%
Space Separator 4995
 
18.7%
Decimal Number 4467
 
16.7%
Open Punctuation 712
 
2.7%
Close Punctuation 712
 
2.7%
Dash Punctuation 511
 
1.9%
Other Punctuation 312
 
1.2%
Uppercase Letter 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
983
 
6.6%
940
 
6.3%
895
 
6.0%
880
 
5.9%
868
 
5.8%
868
 
5.8%
863
 
5.8%
734
 
4.9%
660
 
4.4%
567
 
3.8%
Other values (152) 6709
44.8%
Decimal Number
ValueCountFrequency (%)
1 974
21.8%
2 544
12.2%
4 502
11.2%
5 475
10.6%
3 384
 
8.6%
9 379
 
8.5%
6 315
 
7.1%
8 310
 
6.9%
7 302
 
6.8%
0 282
 
6.3%
Uppercase Letter
ValueCountFrequency (%)
B 2
40.0%
H 1
20.0%
I 1
20.0%
A 1
20.0%
Space Separator
ValueCountFrequency (%)
4995
100.0%
Open Punctuation
ValueCountFrequency (%)
( 712
100.0%
Close Punctuation
ValueCountFrequency (%)
) 712
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 511
100.0%
Other Punctuation
ValueCountFrequency (%)
, 312
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 14967
56.1%
Common 11709
43.9%
Latin 5
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
983
 
6.6%
940
 
6.3%
895
 
6.0%
880
 
5.9%
868
 
5.8%
868
 
5.8%
863
 
5.8%
734
 
4.9%
660
 
4.4%
567
 
3.8%
Other values (152) 6709
44.8%
Common
ValueCountFrequency (%)
4995
42.7%
1 974
 
8.3%
( 712
 
6.1%
) 712
 
6.1%
2 544
 
4.6%
- 511
 
4.4%
4 502
 
4.3%
5 475
 
4.1%
3 384
 
3.3%
9 379
 
3.2%
Other values (5) 1521
 
13.0%
Latin
ValueCountFrequency (%)
B 2
40.0%
H 1
20.0%
I 1
20.0%
A 1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 14967
56.1%
ASCII 11714
43.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4995
42.6%
1 974
 
8.3%
( 712
 
6.1%
) 712
 
6.1%
2 544
 
4.6%
- 511
 
4.4%
4 502
 
4.3%
5 475
 
4.1%
3 384
 
3.3%
9 379
 
3.2%
Other values (9) 1526
 
13.0%
Hangul
ValueCountFrequency (%)
983
 
6.6%
940
 
6.3%
895
 
6.0%
880
 
5.9%
868
 
5.8%
868
 
5.8%
863
 
5.8%
734
 
4.9%
660
 
4.4%
567
 
3.8%
Other values (152) 6709
44.8%
Distinct302
Distinct (%)35.0%
Missing0
Missing (%)0.0%
Memory size6.9 KiB
Minimum2007-01-10 00:00:00
Maximum2023-11-24 00:00:00
2023-12-29T22:16:46.914269image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-29T22:16:47.346572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

사업개시일
Date

MISSING 

Distinct315
Distinct (%)54.4%
Missing283
Missing (%)32.8%
Memory size6.9 KiB
Minimum2008-05-27 00:00:00
Maximum2023-10-30 00:00:00
2023-12-29T22:16:47.759503image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-29T22:16:48.366313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size6.9 KiB
Minimum2023-12-15 00:00:00
Maximum2023-12-15 00:00:00
2023-12-29T22:16:48.609457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-29T22:16:48.769935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-29T22:16:41.823898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-29T22:16:42.133423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-29T22:16:42.490307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

태양광발전소명설비용량(KW)발전소주소최초허가일사업개시일기준일자
0햇살그린발전소 255호99.56경상남도 진주시 문산읍 월아산로996번길 88(공장 건물 위)2023-11-24<NA>2023-12-15
1햇살그린발전소 254호99.56경상남도 진주시 문산읍 월아산로996번길 88(공장 건물 위)2023-11-24<NA>2023-12-15
2씨엔태양광발전소8444.84경상남도 진주시 진성면 동부로1602번길 48(공장 건물 위)2023-11-14<NA>2023-12-15
3씨엔태양광발전소9591.31경상남도 진주시 정촌면 산업로 119(공장 건물 위)2023-11-14<NA>2023-12-15
4성광태양광발전소163.4경상남도 진주시 진성면 동부로1602번길 48(공장 건물 위)2023-11-14<NA>2023-12-15
5에스지씨태양광발전소138.7경상남도 진주시 정촌면 산업로 119(공장 건물 위)2023-11-14<NA>2023-12-15
6씨엔태양광발전소7352.4경상남도 진주시 정촌면 산업로 129(공장 건물 위)2023-11-14<NA>2023-12-15
7해상2호 태양광발전소99.96경상남도 진주시 대평면 대평리 249(토지 위)2023-11-14<NA>2023-12-15
8진모태양광발전소99.96경상남도 진주시 대평면 대평리 249(토지 위)2023-11-14<NA>2023-12-15
9은평1호 태양광발전소399.85경상남도 진주시 대곡면 덕곡리 1056(동식물 관련 건물 위)2023-11-10<NA>2023-12-15
태양광발전소명설비용량(KW)발전소주소최초허가일사업개시일기준일자
852영광 태양광발전소19.6경상남도 진주시 금산면 속사길 592008-08-262008-09-252023-12-15
853부광 태양광발전소19.6경상남도 진주시 금산면 속사길 89-1 (속사리 234-1, 234-3, 234-5)2008-09-012008-09-252023-12-15
854정광 태양광발전소29.16경상남도 진주시 금산면 속사리 218-12008-08-122009-07-012023-12-15
855이대식 태양광발전소20.0경상남도 진주시 금산면 속사길57번길 82007-11-052008-05-272023-12-15
856(주)진주 태양광발전소1000.0경상남도 진주시 사봉면 봉곡리 산 222번지 8호 , 1145-22007-10-022009-08-172023-12-15
857임용택 태양광발전소15.66경상남도 진주시 진양호로97번길 19-17 (평거동)(건물 위)2007-06-132009-08-172023-12-15
858신한태양광발전소134.4경상남도 진주시 명석면 외율리 산 136번지 8호2007-04-02<NA>2023-12-15
859광발전㈜ 태양광발전소99.0경상남도 진주시 정촌면 예상리 산 13번지2007-01-102009-11-012023-12-15
860진수전력 태양광발전소1000.0경상남도 진주시 수곡면 효자리 산 26번지, 산 27번지2007-09-012013-03-062023-12-15
861(주)외율 태양광발전소134.4경상남도 진주시 명석면 진주대로 2320-362007-04-02<NA>2023-12-15