Overview

Dataset statistics

Number of variables: 6
Number of observations: 698
Missing cells: 183
Missing cells (%): 4.4%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 33.5 KiB
Average record size in memory: 49.2 B
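The percentages and averages in this block follow from the raw counts. A minimal sketch of that arithmetic, using only the figures printed above (the variable names are illustrative and are not taken from the profiling tool):

```python
# Figures copied from the "Dataset statistics" block.
n_variables = 6
n_observations = 698
missing_cells = 183
total_size_kib = 33.5

total_cells = n_variables * n_observations              # every cell in the table
missing_pct = 100 * missing_cells / total_cells         # share of empty cells
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"total cells:     {total_cells}")
print(f"missing cells:   {missing_pct:.1f}%")
print(f"avg record size: {avg_record_bytes:.1f} B")
```

The recomputed record size comes out slightly below the reported 49.2 B, presumably because the 33.5 KiB total is itself rounded.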

Variable types

Text: 2
Numeric: 1
DateTime: 2
Categorical: 1

Dataset

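Description: Provides information on solar power plants located in Jinju-si, Gyeongsangnam-do: plant name (태양광발전소명), installed capacity (설비용량), plant address (발전소주소), initial permit date (최초허가일), and business start date (사업개시일).
Author: Jinju-si, Gyeongsangnam-do (경상남도 진주시)
URL: https://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15033945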

Alerts

Constant: 기준일자 (reference date) has constant value "2023-06-13"
Missing: 사업개시일 (business start date) has 177 (25.4%) missing values
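The two alerts can be read as simple rules over per-column counts. The sketch below is an assumption about how such rules might look, not the profiling tool's actual implementation. In particular, the 5% `missing_threshold` is hypothetical, chosen only because 발전소주소 (0.9% missing) is not flagged in this report while 사업개시일 (25.4% missing) is.

```python
def column_alerts(n_rows, n_distinct, n_missing, missing_threshold=0.05):
    """Return alert labels for one column (hypothetical rules, see lead-in)."""
    alerts = []
    if n_distinct == 1:
        # A single distinct value across all rows carries no information.
        alerts.append("Constant")
    if n_rows and n_missing / n_rows > missing_threshold:
        alerts.append("Missing")
    return alerts

# Counts taken from the Variables section of this report.
print(column_alerts(698, 1, 0))      # 기준일자
print(column_alerts(698, 295, 177))  # 사업개시일
print(column_alerts(698, 601, 6))    # 발전소주소
```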

Reproduction

Analysis started: 2023-12-29 22:17:01.265790
Analysis finished: 2023-12-29 22:17:03.096778
Duration: 1.83 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
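The reported duration is the difference between the two timestamps. A small check with the standard library:

```python
from datetime import datetime

# Timestamps copied from the Reproduction block.
started = datetime.fromisoformat("2023-12-29 22:17:01.265790")
finished = datetime.fromisoformat("2023-12-29 22:17:03.096778")

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```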

Variables

태양광발전소명
Text

Distinct: 692
Distinct (%): 99.1%
Missing: 0
Missing (%): 0.0%
Memory size: 5.6 KiB

Length

Max length: 24
Median length: 22
Mean length: 10.412607
Min length: 4

Characters and Unicode

Total characters: 7268
Distinct characters: 330
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
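As an illustration of those character properties, the sketch below counts Unicode general categories for two names from this variable's sample rows, using Python's standard `unicodedata` module. The report's labels map onto the two-letter codes: "Other Letter" is `Lo`, "Space Separator" is `Zs`, and "Decimal Number" is `Nd`. The standard library does not expose script or block properties, so only categories are shown.

```python
import unicodedata
from collections import Counter

# Two names taken from the sample rows of this variable.
names = ["해상 태양광발전소", "영남캠핑카 1호"]

# Count the general category of every character across both names.
category_counts = Counter(
    unicodedata.category(ch) for name in names for ch in name
)

for category, count in category_counts.most_common():
    print(category, count)
```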

Unique

Unique: 687
Unique (%): 98.4%
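"Distinct" and "Unique" measure different things: distinct counts the different values present, while unique counts the values that occur exactly once. With 692 distinct and 687 unique names, 5 names are repeated, and together they account for the remaining 698 - 687 = 11 rows. A minimal sketch on a hypothetical toy column:

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    n_distinct = len(counts)
    n_unique = sum(1 for c in counts.values() if c == 1)
    return n_distinct, n_unique

# Hypothetical toy column: "B 발전소" appears twice.
toy = ["A 발전소", "B 발전소", "B 발전소", "C 발전소"]
print(distinct_and_unique(toy))
```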

Sample

1st row: 해상 태양광발전소
2nd row: 지 태양광발전소
3rd row: 솔 태양광발전소
4th row: 조동규에너지
5th row: 영남캠핑카 1호
Value | Count | Frequency (%)
태양광발전소 | 459 | 35.3%
발전소 | 39 | 3.0%
태양광 | 34 | 2.6%
1호 | 6 | 0.5%
2호 | 5 | 0.4%
제2발전소 | 5 | 0.4%
모햇태양광발전소 | 5 | 0.4%
청수농원태양광발전소 | 4 | 0.3%
주식회사 | 4 | 0.3%
supex | 4 | 0.3%
Other values (702) | 734 | 56.5%
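The word table counts tokens rather than whole names: the counts sum to 1299 tokens, so 태양광발전소 at 459 occurrences is 459 / 1299, or about 35.3%. The sketch below assumes tokens are produced by splitting on whitespace, which is consistent with the table but not confirmed by the report. It uses the five sample rows shown for this variable.

```python
from collections import Counter

# The five sample rows shown for this variable.
names = [
    "해상 태양광발전소",
    "지 태양광발전소",
    "솔 태양광발전소",
    "조동규에너지",
    "영남캠핑카 1호",
]

# Assumption: words are whitespace-separated tokens.
tokens = [token for name in names for token in name.split()]
word_counts = Counter(tokens)
total = len(tokens)

for word, count in word_counts.most_common(3):
    print(f"{word} | {count} | {100 * count / total:.1f}%")
```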

Most occurring characters

ValueCountFrequency (%)
651
 
9.0%
649
 
8.9%
646
 
8.9%
644
 
8.9%
634
 
8.7%
633
 
8.7%
601
 
8.3%
196
 
2.7%
1 120
 
1.7%
2 111
 
1.5%
Other values (320) 2383
32.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 6175 | 85.0%
Space Separator | 601 | 8.3%
Decimal Number | 343 | 4.7%
Uppercase Letter | 38 | 0.5%
Close Punctuation | 35 | 0.5%
Open Punctuation | 34 | 0.5%
Lowercase Letter | 20 | 0.3%
Dash Punctuation | 13 | 0.2%
Other Symbol | 5 | 0.1%
Other Punctuation | 4 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
651
 
10.5%
649
 
10.5%
646
 
10.5%
644
 
10.4%
634
 
10.3%
633
 
10.3%
196
 
3.2%
64
 
1.0%
58
 
0.9%
56
 
0.9%
Other values (286) 1944
31.5%
Uppercase Letter
ValueCountFrequency (%)
S 7
18.4%
E 6
15.8%
P 4
10.5%
U 4
10.5%
X 4
10.5%
G 2
 
5.3%
H 2
 
5.3%
W 2
 
5.3%
F 2
 
5.3%
J 2
 
5.3%
Other values (3) 3
7.9%
Decimal Number
ValueCountFrequency (%)
1 120
35.0%
2 111
32.4%
3 41
 
12.0%
4 21
 
6.1%
6 11
 
3.2%
7 9
 
2.6%
5 8
 
2.3%
9 8
 
2.3%
8 8
 
2.3%
0 6
 
1.7%
Lowercase Letter
ValueCountFrequency (%)
c 4
20.0%
o 4
20.0%
p 4
20.0%
e 4
20.0%
k 4
20.0%
Space Separator
ValueCountFrequency (%)
601
100.0%
Close Punctuation
ValueCountFrequency (%)
) 35
100.0%
Open Punctuation
ValueCountFrequency (%)
( 34
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%
Other Symbol
ValueCountFrequency (%)
5
100.0%
Other Punctuation
ValueCountFrequency (%)
# 4
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 6180 | 85.0%
Common | 1030 | 14.2%
Latin | 58 | 0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
651
 
10.5%
649
 
10.5%
646
 
10.5%
644
 
10.4%
634
 
10.3%
633
 
10.2%
196
 
3.2%
64
 
1.0%
58
 
0.9%
56
 
0.9%
Other values (287) 1949
31.5%
Latin
ValueCountFrequency (%)
S 7
12.1%
E 6
10.3%
P 4
 
6.9%
U 4
 
6.9%
X 4
 
6.9%
c 4
 
6.9%
o 4
 
6.9%
p 4
 
6.9%
e 4
 
6.9%
k 4
 
6.9%
Other values (8) 13
22.4%
Common
ValueCountFrequency (%)
601
58.3%
1 120
 
11.7%
2 111
 
10.8%
3 41
 
4.0%
) 35
 
3.4%
( 34
 
3.3%
4 21
 
2.0%
- 13
 
1.3%
6 11
 
1.1%
7 9
 
0.9%
Other values (5) 34
 
3.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 6175 | 85.0%
ASCII | 1088 | 15.0%
None | 5 | 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
651
 
10.5%
649
 
10.5%
646
 
10.5%
644
 
10.4%
634
 
10.3%
633
 
10.3%
196
 
3.2%
64
 
1.0%
58
 
0.9%
56
 
0.9%
Other values (286) 1944
31.5%
ASCII
ValueCountFrequency (%)
601
55.2%
1 120
 
11.0%
2 111
 
10.2%
3 41
 
3.8%
) 35
 
3.2%
( 34
 
3.1%
4 21
 
1.9%
- 13
 
1.2%
6 11
 
1.0%
7 9
 
0.8%
Other values (23) 92
 
8.5%
None
ValueCountFrequency (%)
5
100.0%

설비용량(KW)
Real number (ℝ)

Distinct: 330
Distinct (%): 47.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 144.63226
Minimum: 0
Maximum: 5192
Zeros: 1
Zeros (%): 0.1%
Negative: 0
Negative (%): 0.0%
Memory size: 6.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 19.551
Q1: 76
Median: 99
Q3: 99.91
95-th percentile: 491.481
Maximum: 5192
Range: 5192
Interquartile range (IQR): 23.91
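The range and IQR follow directly from the quantiles. The sketch below recomputes them and adds Tukey's 1.5 x IQR fences as an illustration. The fences are an assumption introduced here; the report itself does not flag outliers. Because the 95th percentile (491.481) lies above the upper fence, at least 5% of the plants would count as high outliers under that rule.

```python
# Quantiles copied from the "Quantile statistics" block (values in kW).
minimum, q1, q3, maximum = 0.0, 76.0, 99.91, 5192.0
p95 = 491.481

value_range = maximum - minimum
iqr = q3 - q1

# Tukey fences: an illustrative outlier rule, not part of the report.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

print(f"range={value_range}, IQR={iqr:.2f}")
print(f"fences=({lower_fence:.3f}, {upper_fence:.3f})")
print("95th percentile above upper fence:", p95 > upper_fence)
```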

Descriptive statistics

Standard deviation: 253.40998
Coefficient of variation (CV): 1.7520986
Kurtosis: 227.52315
Mean: 144.63226
Median Absolute Deviation (MAD): 3.28
Skewness: 12.296428
Sum: 100953.32
Variance: 64216.62
Monotonicity: Not monotonic
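Several of these statistics are tied together by simple identities: the mean is the sum divided by the number of observations, the coefficient of variation is the standard deviation divided by the mean, and the variance is the square of the standard deviation. A quick consistency check using the reported figures (agreement is only up to the rounding shown in the report):

```python
import math

# Figures copied from this variable's statistics.
n = 698
total = 100953.32
mean = 144.63226
std = 253.40998
variance = 64216.62

mean_from_sum = total / n                 # mean = sum / n
cv = std / mean                           # coefficient of variation
std_from_variance = math.sqrt(variance)   # std = sqrt(variance)

print(f"mean from sum:     {mean_from_sum:.5f}")
print(f"CV:                {cv:.7f}")
print(f"std from variance: {std_from_variance:.5f}")
```

A CV well above 1, together with the large skewness and kurtosis, is consistent with a distribution dominated by plants just under 100 kW plus a few very large installations.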
Histogram with fixed size bins (bins=50)
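With 50 equal-width bins, the bin width depends on the data range. The sketch below assumes the bins span the observed minimum to maximum (0 to 5192 kW) and that the last bin includes its right edge, as NumPy's histogram does. Neither detail is stated in the report. Under those assumptions each bin is about 103.84 kW wide, so the median plant (99 kW) and everything below it fall into the very first bin.

```python
def bin_index(value, minimum, maximum, bins=50):
    """Index of the equal-width bin that holds `value` (last bin is right-closed)."""
    width = (maximum - minimum) / bins
    if value == maximum:
        return bins - 1
    return int((value - minimum) // width)

# Range taken from this variable's Minimum and Maximum.
lo, hi = 0.0, 5192.0
print("bin width:", (hi - lo) / 50)
for capacity in (99.0, 491.481, 1000.0, 5192.0):
    print(capacity, "-> bin", bin_index(capacity, lo, hi))
```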
Value | Count | Frequency (%)
99.0 | 51 | 7.3%
99.96 | 37 | 5.3%
99.76 | 27 | 3.9%
97.2 | 24 | 3.4%
99.9 | 19 | 2.7%
98.28 | 18 | 2.6%
98.77 | 17 | 2.4%
99.84 | 16 | 2.3%
99.6 | 13 | 1.9%
99.91 | 13 | 1.9%
Other values (320) | 463 | 66.3%
Minimum 10 values
Value | Count | Frequency (%)
0.0 | 1 | 0.1%
9.75 | 1 | 0.1%
10.13 | 1 | 0.1%
10.2 | 1 | 0.1%
10.92 | 1 | 0.1%
13.88 | 1 | 0.1%
14.0 | 1 | 0.1%
14.04 | 2 | 0.3%
14.6 | 2 | 0.3%
14.72 | 1 | 0.1%
Maximum 10 values
Value | Count | Frequency (%)
5192.0 | 1 | 0.1%
1000.0 | 3 | 0.4%
999.81 | 1 | 0.1%
995.0 | 1 | 0.1%
990.0 | 1 | 0.1%
972.23 | 1 | 0.1%
960.0 | 1 | 0.1%
958.32 | 1 | 0.1%
939.58 | 1 | 0.1%
925.0 | 1 | 0.1%

발전소주소
Text

Distinct: 601
Distinct (%): 86.8%
Missing: 6
Missing (%): 0.9%
Memory size: 5.6 KiB

Length

Max length: 74
Median length: 53
Mean length: 28.199422
Min length: 3

Characters and Unicode

Total characters: 19514
Distinct characters: 181
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 540
Unique (%): 78.0%

Sample

1st row: 경남 진주시 대평면 대평리 252(토지 위)
2nd row: 경남 진주시 대평면 대평리 269(토지 위)
3rd row: 경남 진주시 대평면 대평리 269
4th row: 경남 진주시 내동면 순환로 426-18(건물 위)
5th row: 경남 진주시 이반성면 진마대로2498번길 9(길성리 917)(공장 건물 위)
ValueCountFrequency (%)
진주시 678
 
15.1%
경남 441
 
9.8%
385
 
8.5%
건물 213
 
4.7%
토지 138
 
3.1%
미천면 83
 
1.8%
수곡면 78
 
1.7%
64
 
1.4%
진성면 62
 
1.4%
사봉면 60
 
1.3%
Other values (1000) 2302
51.1%

Most occurring characters

ValueCountFrequency (%)
3824
 
19.6%
788
 
4.0%
1 743
 
3.8%
718
 
3.7%
690
 
3.5%
599
 
3.1%
582
 
3.0%
) 564
 
2.9%
( 564
 
2.9%
492
 
2.5%
Other values (171) 9950
51.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 10281 | 52.7%
Space Separator | 3824 | 19.6%
Decimal Number | 3588 | 18.4%
Close Punctuation | 564 | 2.9%
Open Punctuation | 564 | 2.9%
Dash Punctuation | 439 | 2.2%
Other Punctuation | 249 | 1.3%
Uppercase Letter | 5 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
788
 
7.7%
718
 
7.0%
690
 
6.7%
599
 
5.8%
582
 
5.7%
492
 
4.8%
476
 
4.6%
443
 
4.3%
423
 
4.1%
417
 
4.1%
Other values (152) 4653
45.3%
Decimal Number
ValueCountFrequency (%)
1 743
20.7%
2 450
12.5%
4 394
11.0%
5 364
10.1%
3 335
9.3%
9 312
8.7%
7 266
 
7.4%
6 266
 
7.4%
8 256
 
7.1%
0 202
 
5.6%
Uppercase Letter
ValueCountFrequency (%)
B 2
40.0%
H 1
20.0%
I 1
20.0%
A 1
20.0%
Space Separator
ValueCountFrequency (%)
3824
100.0%
Close Punctuation
ValueCountFrequency (%)
) 564
100.0%
Open Punctuation
ValueCountFrequency (%)
( 564
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 439
100.0%
Other Punctuation
ValueCountFrequency (%)
, 249
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 10281 | 52.7%
Common | 9228 | 47.3%
Latin | 5 | < 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
788
 
7.7%
718
 
7.0%
690
 
6.7%
599
 
5.8%
582
 
5.7%
492
 
4.8%
476
 
4.6%
443
 
4.3%
423
 
4.1%
417
 
4.1%
Other values (152) 4653
45.3%
Common
ValueCountFrequency (%)
3824
41.4%
1 743
 
8.1%
) 564
 
6.1%
( 564
 
6.1%
2 450
 
4.9%
- 439
 
4.8%
4 394
 
4.3%
5 364
 
3.9%
3 335
 
3.6%
9 312
 
3.4%
Other values (5) 1239
 
13.4%
Latin
ValueCountFrequency (%)
B 2
40.0%
H 1
20.0%
I 1
20.0%
A 1
20.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 10281 | 52.7%
ASCII | 9233 | 47.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3824
41.4%
1 743
 
8.0%
) 564
 
6.1%
( 564
 
6.1%
2 450
 
4.9%
- 439
 
4.8%
4 394
 
4.3%
5 364
 
3.9%
3 335
 
3.6%
9 312
 
3.4%
Other values (9) 1244
 
13.5%
Hangul
ValueCountFrequency (%)
788
 
7.7%
718
 
7.0%
690
 
6.7%
599
 
5.8%
582
 
5.7%
492
 
4.8%
476
 
4.6%
443
 
4.3%
423
 
4.1%
417
 
4.1%
Other values (152) 4653
45.3%

최초허가일
Date

Distinct: 271
Distinct (%): 38.8%
Missing: 0
Missing (%): 0.0%
Memory size: 5.6 KiB
Minimum: 2007-01-10 00:00:00
Maximum: 2023-06-13 00:00:00
Histogram with fixed size bins (bins=50)

사업개시일
Date

MISSING 

Distinct: 295
Distinct (%): 56.6%
Missing: 177
Missing (%): 25.4%
Memory size: 5.6 KiB
Minimum: 2008-05-27 00:00:00
Maximum: 2023-05-12 00:00:00
Histogram with fixed size bins (bins=50)

기준일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 5.6 KiB
2023-06-13: 698

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023-06-13
2nd row: 2023-06-13
3rd row: 2023-06-13
4th row: 2023-06-13
5th row: 2023-06-13

Common Values

Value | Count | Frequency (%)
2023-06-13 | 698 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2023-06-13 | 698 | 100.0%

Interactions

[Figure: interaction plot]

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
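Nullity correlation can be sketched as the Pearson correlation between the "is missing" indicators of two columns: +1 means the two columns are always missing together, -1 means one is missing exactly when the other is present. The helper below is an illustrative stand-alone implementation, not the code used to draw the heatmap, and the toy columns are hypothetical.

```python
import math

def nullity_correlation(col_a, col_b):
    """Pearson correlation between the missing-value indicators of two columns."""
    a = [1.0 if v is None else 0.0 for v in col_a]
    b = [1.0 if v is None else 0.0 for v in col_b]
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        # Undefined when a column is never missing or always missing.
        return float("nan")
    return cov / math.sqrt(var_a * var_b)

# Hypothetical toy columns; None marks a missing cell.
start_date = ["2008-09-25", None, None, "2009-07-01"]
address = ["a", None, "c", "d"]
r = nullity_correlation(start_date, address)
print(round(r, 3))
```

In this dataset only 사업개시일 and 발전소주소 have missing cells, so the undefined case applies to every pair involving one of the other four columns.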

Sample

First rows
 | 태양광발전소명 | 설비용량(KW) | 발전소주소 | 최초허가일 | 사업개시일 | 기준일자
0 | 해상 태양광발전소 | 99.96 | 경남 진주시 대평면 대평리 252(토지 위) | 2023-06-13 | <NA> | 2023-06-13
1 | 지 태양광발전소 | 99.96 | 경남 진주시 대평면 대평리 269(토지 위) | 2023-06-13 | <NA> | 2023-06-13
2 | 솔 태양광발전소 | 99.96 | 경남 진주시 대평면 대평리 269 | 2023-06-13 | <NA> | 2023-06-13
3 | 조동규에너지 | 17.85 | 경남 진주시 내동면 순환로 426-18(건물 위) | 2023-06-13 | <NA> | 2023-06-13
4 | 영남캠핑카 1호 | 99.5 | 경남 진주시 이반성면 진마대로2498번길 9(길성리 917)(공장 건물 위) | 2023-06-13 | <NA> | 2023-06-13
5 | 영남캠핑카 2호 | 99.5 | 경남 진주시 이반성면 진마대로2498번길 9(길성리 917)(공장 건물 위) | 2023-06-13 | <NA> | 2023-06-13
6 | 영남캠핑카 3호 | 99.5 | 경남 진주시 이반성면 진마대로2498번길 9(길성리 917)(공장 건물 위) | 2023-06-13 | <NA> | 2023-06-13
7 | 시민태양광 | 99.91 | 경남 진주시 문산읍 소문리 332, 333, 335-1 | 2023-06-13 | <NA> | 2023-06-13
8 | 마실5 태양광발전소 | 99.96 | 경남 진주시 수곡면 사곡리 981-5, 982 | 2023-06-13 | <NA> | 2023-06-13
9 | 마실4 태양광발전소 | 99.96 | 경남 진주시 수곡면 사곡리 978(토지 위) | 2023-06-13 | <NA> | 2023-06-13
Last rows
 | 태양광발전소명 | 설비용량(KW) | 발전소주소 | 최초허가일 | 사업개시일 | 기준일자
688 | 영광 태양광발전소 | 19.6 | 금산면 속사길 59 | 2008-08-26 | 2008-09-25 | 2023-06-13
689 | 부광 태양광발전소 | 19.6 | 경남 진주시 금산면 속사길 89-1 (속사리 234-1, 234-3, 234-5) | 2008-09-01 | 2008-09-25 | 2023-06-13
690 | 정광 태양광발전소 | 29.16 | 금산면 속사리 218-1 | 2008-08-12 | 2009-07-01 | 2023-06-13
691 | 이대식 태양광발전소 | 20.0 | 진주시 금산면 속사길57번길 8 | 2007-11-05 | 2008-05-27 | 2023-06-13
692 | (주)진주 태양광발전소 | 1000.0 | 진주시 사봉면 봉곡리 산 222번지 8호 , 1145-2 | 2007-10-02 | 2009-08-17 | 2023-06-13
693 | 임용택 태양광발전소 | 15.66 | 진주시 진양호로97번길 19-17 (평거동)(건물 위) | 2007-06-13 | 2009-08-17 | 2023-06-13
694 | 신한태양광발전소 | 134.4 | 경상남도 진주시 명석면 외율리 산 136번지 8호 | 2007-04-02 | <NA> | 2023-06-13
695 | 광발전㈜ 태양광발전소 | 99.0 | 진주시 정촌면 예상리 산 13번지 | 2007-01-10 | 2009-11-01 | 2023-06-13
696 | 진수전력 태양광발전소 | 1000.0 | 진주시 수곡면 효자리 산 26번지, 산 27번지 | 2007-09-01 | 2013-03-06 | 2023-06-13
697 | (주)외율 태양광발전소 | 134.4 | 경상남도 진주시 명석면 진주대로 2320-36 | 2007-04-02 | <NA> | 2023-06-13
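One use of the two date columns is to measure how long a plant took to go from initial permit (최초허가일) to business start (사업개시일). The sketch below works on three rows copied from the last rows of the sample. The `lag_days` helper is illustrative, and rows with a missing start date are simply skipped, which is an assumption about how one might handle the 25.4% of missing values.

```python
from datetime import date

# (row index, 최초허가일, 사업개시일) from the sample; None stands for <NA>.
rows = [
    (688, "2008-08-26", "2008-09-25"),
    (694, "2007-04-02", None),
    (696, "2007-09-01", "2013-03-06"),
]

def lag_days(permit, start):
    """Days between permit and business start, or None if either date is missing."""
    if permit is None or start is None:
        return None
    return (date.fromisoformat(start) - date.fromisoformat(permit)).days

lags = {idx: lag_days(permit, start) for idx, permit, start in rows}
for idx, lag in lags.items():
    print(idx, lag)
```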