Overview

Dataset statistics

Number of variables: 6
Number of observations: 638
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 30.7 KiB
Average record size in memory: 49.2 B

Variable types

Text: 2
Categorical: 1
Numeric: 1
DateTime: 2

Dataset

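Description: Data on solar power plants located in Gimpo-si, Gyeonggi-do, including the business name (상호), installation site address (설치장소소재지), business status (영업구분), installed capacity in kilowatts (설비용량(킬로와트)), business start date (사업개시일), and data reference date (데이터기준일).
Author: Gimpo-si, Gyeonggi-do (경기도 김포시)
URL: https://www.data.go.kr/data/15037678/fileData.do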

Alerts

영업구분 has constant value "사업개시" (Constant)
데이터기준일 has constant value "2023-10-23" (Constant)
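
A Constant alert means every row holds the same value in that column. The check can be sketched in plain Python as follows. The helper name `constant_columns` is hypothetical, the three rows are copied from this report's sample, and this is not ydata-profiling's own implementation.

```python
def constant_columns(rows):
    """Return {column: value} for columns that hold a single distinct value."""
    if not rows:
        return {}
    constants = {}
    for column in rows[0]:
        distinct = {row[column] for row in rows}
        if len(distinct) == 1:
            constants[column] = distinct.pop()
    return constants


# Three rows taken from the sample shown in this report.
rows = [
    {"상호": "솔로몬4호 태양광발전소", "영업구분": "사업개시",
     "설비용량(킬로와트)": 99.12, "데이터기준일": "2023-10-23"},
    {"상호": "성도옹정3호 태양광발전소", "영업구분": "사업개시",
     "설비용량(킬로와트)": 99.96, "데이터기준일": "2023-10-23"},
    {"상호": "브이아이피5호 태양광발전소", "영업구분": "사업개시",
     "설비용량(킬로와트)": 95.38, "데이터기준일": "2023-10-23"},
]

print(constant_columns(rows))
```

Constant columns carry no information for distinguishing records, so they are usually candidates for dropping before modelling.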

Reproduction

Analysis started: 2023-12-11 23:32:19.769807
Analysis finished: 2023-12-11 23:32:20.367484
Duration: 0.6 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
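
The reported duration can be checked against the two timestamps. This is a small sketch using only the standard library; the variable names are illustrative.

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2023-12-11 23:32:19.769807", FMT)
finished = datetime.strptime("2023-12-11 23:32:20.367484", FMT)

# Elapsed wall-clock time between the two timestamps, in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.1f} seconds")
```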

Variables

상호
Text

Distinct: 634
Distinct (%): 99.4%
Missing: 0
Missing (%): 0.0%
Memory size: 5.1 KiB

Length

Max length: 23
Median length: 19
Mean length: 10.382445
Min length: 4

Characters and Unicode

Total characters: 6624
Distinct characters: 342
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
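
As a rough illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point: `Lo` is Other Letter, `Nd` is Decimal Number, and `Zs` is Space Separator. The sketch below classifies one value taken from this report's sample. The helper name `category_counts` is hypothetical, and the standard module does not expose script or block properties, so only categories are shown.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count characters by Unicode general category (e.g. 'Lo', 'Nd', 'Zs')."""
    return Counter(unicodedata.category(ch) for ch in text)


name = "솔로몬4호 태양광발전소"  # first sample row of 상호
counts = category_counts(name)
print(counts)
```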

Unique

Unique: 630
Unique (%): 98.7%
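
Distinct and Unique measure different things. Assuming the usual convention that Distinct counts the different values present while Unique counts the values that occur exactly once, 634 distinct and 630 unique values imply that four names appear more than once. A minimal sketch of the two counts, using five address values from this report's sample rows (the helper name `distinct_and_unique` is hypothetical):

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique


sample = [
    "경기도 김포시 양촌읍 황금로23번길 281",
    "경기도 김포시 통진읍 김포대로2435번길 81-1",
    "경기도 김포시 통진읍 김포대로2435번길 81-1",
    "경기도 김포시 통진읍 김포대로2435번길 81-1",
    "경기도 김포시 하성면 마곡로 278",
]

distinct, unique = distinct_and_unique(sample)
print(distinct, unique)
```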

Sample

1st row: 솔로몬4호 태양광발전소
2nd row: 성도옹정3호 태양광발전소
3rd row: 성도옹정2호 태양광발전소
4th row: 성도옹정1호 태양광발전소
5th row: 브이아이피5호 태양광발전소
Value | Count | Frequency (%)
태양광발전소 | 369 | 33.7%
발전소 | 24 | 2.2%
태양광 | 7 | 0.6%
1호 | 7 | 0.6%
2호 | 7 | 0.6%
재영 | 5 | 0.5%
제1 | 4 | 0.4%
주)본실업 | 4 | 0.4%
한성교구 | 3 | 0.3%
제3 | 3 | 0.3%
Other values (638) | 662 | 60.5%
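
A word-frequency table of this kind can be approximated by splitting each value on whitespace and counting the tokens. The sketch below does that for the five sample names. Plain whitespace splitting is an assumption, and the report's own tokenizer may treat punctuation differently.

```python
from collections import Counter

names = [
    "솔로몬4호 태양광발전소",
    "성도옹정3호 태양광발전소",
    "성도옹정2호 태양광발전소",
    "성도옹정1호 태양광발전소",
    "브이아이피5호 태양광발전소",
]

# Split every name on whitespace and count each resulting token.
words = Counter(word for name in names for word in name.split())
total = sum(words.values())

for word, count in words.most_common(3):
    print(f"{word} | {count} | {count / total:.1%}")
```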

Most occurring characters

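Value | Count | Frequency (%)
(missing) | 626 | 9.5%
(missing) | 615 | 9.3%
(missing) | 612 | 9.2%
(missing) | 609 | 9.2%
(missing) | 608 | 9.2%
(missing) | 607 | 9.2%
(space) | 457 | 6.9%
(missing) | 185 | 2.8%
2 | 105 | 1.6%
1 | 94 | 1.4%
Other values (332) | 2106 | 31.8%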

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 5742 | 86.7%
Space Separator | 457 | 6.9%
Decimal Number | 291 | 4.4%
Uppercase Letter | 64 | 1.0%
Lowercase Letter | 28 | 0.4%
Open Punctuation | 17 | 0.3%
Close Punctuation | 17 | 0.3%
Dash Punctuation | 7 | 0.1%
Other Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(missing) | 626 | 10.9%
(missing) | 615 | 10.7%
(missing) | 612 | 10.7%
(missing) | 609 | 10.6%
(missing) | 608 | 10.6%
(missing) | 607 | 10.6%
(missing) | 185 | 3.2%
(missing) | 69 | 1.2%
(missing) | 66 | 1.1%
(missing) | 37 | 0.6%
Other values (290) | 1708 | 29.7%

Uppercase Letter
Value | Count | Frequency (%)
N | 9 | 14.1%
E | 9 | 14.1%
S | 6 | 9.4%
G | 6 | 9.4%
K | 5 | 7.8%
C | 4 | 6.2%
D | 4 | 6.2%
M | 4 | 6.2%
F | 3 | 4.7%
B | 3 | 4.7%
Other values (6) | 11 | 17.2%

Lowercase Letter
Value | Count | Frequency (%)
p | 5 | 17.9%
o | 4 | 14.3%
k | 3 | 10.7%
e | 3 | 10.7%
c | 3 | 10.7%
j | 3 | 10.7%
l | 2 | 7.1%
a | 2 | 7.1%
n | 1 | 3.6%
t | 1 | 3.6%

Decimal Number
Value | Count | Frequency (%)
2 | 105 | 36.1%
1 | 94 | 32.3%
3 | 46 | 15.8%
4 | 19 | 6.5%
5 | 11 | 3.8%
6 | 8 | 2.7%
8 | 3 | 1.0%
7 | 3 | 1.0%
9 | 1 | 0.3%
0 | 1 | 0.3%

Space Separator
Value | Count | Frequency (%)
(space) | 457 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 17 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 17 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 7 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
& | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 5742 | 86.7%
Common | 790 | 11.9%
Latin | 92 | 1.4%

Most frequent character per script

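Hangul
Value | Count | Frequency (%)
(missing) | 626 | 10.9%
(missing) | 615 | 10.7%
(missing) | 612 | 10.7%
(missing) | 609 | 10.6%
(missing) | 608 | 10.6%
(missing) | 607 | 10.6%
(missing) | 185 | 3.2%
(missing) | 69 | 1.2%
(missing) | 66 | 1.1%
(missing) | 37 | 0.6%
Other values (290) | 1708 | 29.7%

Latin
Value | Count | Frequency (%)
N | 9 | 9.8%
E | 9 | 9.8%
S | 6 | 6.5%
G | 6 | 6.5%
p | 5 | 5.4%
K | 5 | 5.4%
C | 4 | 4.3%
D | 4 | 4.3%
o | 4 | 4.3%
M | 4 | 4.3%
Other values (17) | 36 | 39.1%

Common
Value | Count | Frequency (%)
(space) | 457 | 57.8%
2 | 105 | 13.3%
1 | 94 | 11.9%
3 | 46 | 5.8%
4 | 19 | 2.4%
( | 17 | 2.2%
) | 17 | 2.2%
5 | 11 | 1.4%
6 | 8 | 1.0%
- | 7 | 0.9%
Other values (5) | 9 | 1.1%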

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 5742 | 86.7%
ASCII | 882 | 13.3%

Most frequent character per block

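Hangul
Value | Count | Frequency (%)
(missing) | 626 | 10.9%
(missing) | 615 | 10.7%
(missing) | 612 | 10.7%
(missing) | 609 | 10.6%
(missing) | 608 | 10.6%
(missing) | 607 | 10.6%
(missing) | 185 | 3.2%
(missing) | 69 | 1.2%
(missing) | 66 | 1.1%
(missing) | 37 | 0.6%
Other values (290) | 1708 | 29.7%

ASCII
Value | Count | Frequency (%)
(space) | 457 | 51.8%
2 | 105 | 11.9%
1 | 94 | 10.7%
3 | 46 | 5.2%
4 | 19 | 2.2%
( | 17 | 1.9%
) | 17 | 1.9%
5 | 11 | 1.2%
N | 9 | 1.0%
E | 9 | 1.0%
Other values (32) | 98 | 11.1%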

설치장소소재지
Text

Distinct: 482
Distinct (%): 75.5%
Missing: 0
Missing (%): 0.0%
Memory size: 5.1 KiB

Length

Max length: 47
Median length: 36
Mean length: 23.633229
Min length: 15

Characters and Unicode

Total characters: 15078
Distinct characters: 176
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 397
Unique (%): 62.2%

Sample

1st row: 경기도 김포시 양촌읍 황금로23번길 281
2nd row: 경기도 김포시 통진읍 김포대로2435번길 81-1
3rd row: 경기도 김포시 통진읍 김포대로2435번길 81-1
4th row: 경기도 김포시 통진읍 김포대로2435번길 81-1
5th row: 경기도 김포시 하성면 마곡로 278
Value | Count | Frequency (%)
김포시 | 638 | 19.2%
경기도 | 634 | 19.1%
대곶면 | 216 | 6.5%
하성면 | 114 | 3.4%
양촌읍 | 111 | 3.3%
통진읍 | 103 | 3.1%
월곶면 | 57 | 1.7%
건물위 | 43 | 1.3%
고촌읍 | 24 | 0.7%
대곶남로 | 20 | 0.6%
Other values (678) | 1365 | 41.1%

Most occurring characters

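Value | Count | Frequency (%)
(space) | 2688 | 17.8%
(missing) | 707 | 4.7%
(missing) | 671 | 4.5%
(missing) | 660 | 4.4%
(missing) | 647 | 4.3%
(missing) | 647 | 4.3%
1 | 635 | 4.2%
(missing) | 634 | 4.2%
(missing) | 606 | 4.0%
2 | 406 | 2.7%
Other values (166) | 6777 | 44.9%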

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 8865 | 58.8%
Decimal Number | 3028 | 20.1%
Space Separator | 2688 | 17.8%
Dash Punctuation | 273 | 1.8%
Other Punctuation | 92 | 0.6%
Open Punctuation | 64 | 0.4%
Close Punctuation | 64 | 0.4%
Math Symbol | 2 | < 0.1%
Uppercase Letter | 2 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(missing) | 707 | 8.0%
(missing) | 671 | 7.6%
(missing) | 660 | 7.4%
(missing) | 647 | 7.3%
(missing) | 647 | 7.3%
(missing) | 634 | 7.2%
(missing) | 606 | 6.8%
(missing) | 387 | 4.4%
(missing) | 365 | 4.1%
(missing) | 359 | 4.0%
Other values (147) | 3182 | 35.9%

Decimal Number
Value | Count | Frequency (%)
1 | 635 | 21.0%
2 | 406 | 13.4%
3 | 295 | 9.7%
5 | 284 | 9.4%
4 | 263 | 8.7%
6 | 260 | 8.6%
8 | 244 | 8.1%
7 | 239 | 7.9%
9 | 224 | 7.4%
0 | 178 | 5.9%

Math Symbol
Value | Count | Frequency (%)
~ | 1 | 50.0%
+ | 1 | 50.0%

Uppercase Letter
Value | Count | Frequency (%)
A | 1 | 50.0%
B | 1 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 2688 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 273 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 92 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 64 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 64 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 8865 | 58.8%
Common | 6211 | 41.2%
Latin | 2 | < 0.1%

Most frequent character per script

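Hangul
Value | Count | Frequency (%)
(missing) | 707 | 8.0%
(missing) | 671 | 7.6%
(missing) | 660 | 7.4%
(missing) | 647 | 7.3%
(missing) | 647 | 7.3%
(missing) | 634 | 7.2%
(missing) | 606 | 6.8%
(missing) | 387 | 4.4%
(missing) | 365 | 4.1%
(missing) | 359 | 4.0%
Other values (147) | 3182 | 35.9%

Common
Value | Count | Frequency (%)
(space) | 2688 | 43.3%
1 | 635 | 10.2%
2 | 406 | 6.5%
3 | 295 | 4.7%
5 | 284 | 4.6%
- | 273 | 4.4%
4 | 263 | 4.2%
6 | 260 | 4.2%
8 | 244 | 3.9%
7 | 239 | 3.8%
Other values (7) | 624 | 10.0%

Latin
Value | Count | Frequency (%)
A | 1 | 50.0%
B | 1 | 50.0%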

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 8865 | 58.8%
ASCII | 6213 | 41.2%

Most frequent character per block

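ASCII
Value | Count | Frequency (%)
(space) | 2688 | 43.3%
1 | 635 | 10.2%
2 | 406 | 6.5%
3 | 295 | 4.7%
5 | 284 | 4.6%
- | 273 | 4.4%
4 | 263 | 4.2%
6 | 260 | 4.2%
8 | 244 | 3.9%
7 | 239 | 3.8%
Other values (9) | 626 | 10.1%

Hangul
Value | Count | Frequency (%)
(missing) | 707 | 8.0%
(missing) | 671 | 7.6%
(missing) | 660 | 7.4%
(missing) | 647 | 7.3%
(missing) | 647 | 7.3%
(missing) | 634 | 7.2%
(missing) | 606 | 6.8%
(missing) | 387 | 4.4%
(missing) | 365 | 4.1%
(missing) | 359 | 4.0%
Other values (147) | 3182 | 35.9%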

영업구분
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 5.1 KiB
사업개시: 638

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 사업개시
2nd row: 사업개시
3rd row: 사업개시
4th row: 사업개시
5th row: 사업개시

Common Values

Value | Count | Frequency (%)
사업개시 | 638 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
사업개시 | 638 | 100.0%

설비용량(킬로와트)
Numeric

Distinct: 362
Distinct (%): 56.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 82.17373
Minimum: 10
Maximum: 465.6
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 5.7 KiB

Quantile statistics

Minimum: 10
5-th percentile: 19.234
Q1: 39.0225
Median: 86.4
Q3: 99.3925
95-th percentile: 194.315
Maximum: 465.6
Range: 455.6
Interquartile range (IQR): 60.37
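
Range and IQR follow directly from the other quantile statistics: the range is maximum minus minimum, and the IQR is Q3 minus Q1. A short check using the values reported above (variable names are illustrative):

```python
import math

minimum, q1, q3, maximum = 10.0, 39.0225, 99.3925, 465.6

value_range = maximum - minimum  # spread between the extremes
iqr = q3 - q1                    # spread of the middle 50% of values

print(round(value_range, 4), round(iqr, 4))
```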

Descriptive statistics

Standard deviation: 60.230841
Coefficient of variation (CV): 0.73296953
Kurtosis: 12.999396
Mean: 82.17373
Median Absolute Deviation (MAD): 13.955
Skewness: 2.9007904
Sum: 52426.84
Variance: 3627.7542
Monotonicity: Not monotonic
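
Several of these statistics are tied together by simple identities: the mean is the sum divided by the number of observations, the variance is the square of the standard deviation, and the coefficient of variation is the standard deviation divided by the mean. The sketch below recomputes them from the reported sum, standard deviation, and the 638 observations; small differences in the last digits come from rounding in the report.

```python
import math

n = 638            # number of observations
total = 52426.84   # reported Sum
std = 60.230841    # reported Standard deviation

mean = total / n
variance = std ** 2
cv = std / mean

print(f"mean={mean:.5f} variance={variance:.4f} cv={cv:.8f}")
```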
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
99.0 | 22 | 3.4%
99.96 | 20 | 3.1%
29.58 | 18 | 2.8%
99.6 | 18 | 2.8%
99.12 | 9 | 1.4%
99.84 | 8 | 1.3%
99.99 | 8 | 1.3%
99.5 | 7 | 1.1%
99.36 | 7 | 1.1%
99.65 | 7 | 1.1%
Other values (352) | 514 | 80.6%

Minimum 10 values

Value | Count | Frequency (%)
10.0 | 1 | 0.2%
10.8 | 1 | 0.2%
11.62 | 1 | 0.2%
11.7 | 1 | 0.2%
13.68 | 1 | 0.2%
14.4 | 1 | 0.2%
14.56 | 1 | 0.2%
15.0 | 5 | 0.8%
15.3 | 1 | 0.2%
15.36 | 1 | 0.2%

Maximum 10 values

Value | Count | Frequency (%)
465.6 | 1 | 0.2%
458.15 | 1 | 0.2%
449.9 | 1 | 0.2%
427.95 | 1 | 0.2%
408.0 | 1 | 0.2%
400.0 | 1 | 0.2%
399.6 | 1 | 0.2%
343.71 | 1 | 0.2%
301.92 | 1 | 0.2%
299.52 | 1 | 0.2%

사업개시일
Date

Distinct: 402
Distinct (%): 63.0%
Missing: 0
Missing (%): 0.0%
Memory size: 5.1 KiB
Minimum: 2009-09-24 00:00:00
Maximum: 2023-09-22 00:00:00
Histogram with fixed size bins (bins=50)
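
The minimum and maximum dates give the period the records cover. A small sketch of that arithmetic, using the two dates above and the constant data reference date of 2023-10-23 (variable names are illustrative):

```python
from datetime import date

first_start = date(2009, 9, 24)   # earliest 사업개시일
last_start = date(2023, 9, 22)    # latest 사업개시일
reference = date(2023, 10, 23)    # 데이터기준일 (constant in this dataset)

span_days = (last_start - first_start).days
lag_days = (reference - last_start).days

print(span_days, lag_days)
```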

데이터기준일
Date

CONSTANT 

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 5.1 KiB
Minimum: 2023-10-23 00:00:00
Maximum: 2023-10-23 00:00:00
Histogram with fixed size bins (bins=1)

Interactions


Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 상호 | 설치장소소재지 | 영업구분 | 설비용량(킬로와트) | 사업개시일 | 데이터기준일
0 | 솔로몬4호 태양광발전소 | 경기도 김포시 양촌읍 황금로23번길 281 | 사업개시 | 99.12 | 2023-09-22 | 2023-10-23
1 | 성도옹정3호 태양광발전소 | 경기도 김포시 통진읍 김포대로2435번길 81-1 | 사업개시 | 99.96 | 2023-09-04 | 2023-10-23
2 | 성도옹정2호 태양광발전소 | 경기도 김포시 통진읍 김포대로2435번길 81-1 | 사업개시 | 99.96 | 2023-09-04 | 2023-10-23
3 | 성도옹정1호 태양광발전소 | 경기도 김포시 통진읍 김포대로2435번길 81-1 | 사업개시 | 99.96 | 2023-09-04 | 2023-10-23
4 | 브이아이피5호 태양광발전소 | 경기도 김포시 하성면 마곡로 278 | 사업개시 | 95.38 | 2023-08-28 | 2023-10-23
5 | 브이아이피3호 태양광발전소 | 경기도 김포시 하성면 마곡로 278 | 사업개시 | 67.5 | 2023-08-28 | 2023-10-23
6 | 선보2 태양광발전소 | 경기도 김포시 통진읍 고정1로 57 | 사업개시 | 49.98 | 2023-08-25 | 2023-10-23
7 | 선보1 태양광발전소 | 경기도 김포시 통진읍 고정1로 57 | 사업개시 | 99.96 | 2023-08-25 | 2023-10-23
8 | 세운태양광발전소 | 경기도 김포시 통진읍 월하로 586-66 | 사업개시 | 79.86 | 2023-08-22 | 2023-10-23
9 | 동신텍 태양광발전소 | 경기도 김포시 통진읍 검암2로 101 | 사업개시 | 85.68 | 2023-08-22 | 2023-10-23

Last rows

Index | 상호 | 설치장소소재지 | 영업구분 | 설비용량(킬로와트) | 사업개시일 | 데이터기준일
628 | 가현양수장 태양광발전소 | 경기도 김포시 통진읍 가현로166번길 32-9, (건물위) | 사업개시 | 99.12 | 2013-02-20 | 2023-10-23
629 | 대성 태양광발전소 | 경기도 김포시 양촌읍 황금로128번길 108 | 사업개시 | 97.5 | 2012-10-10 | 2023-10-23
630 | 거성2 태양광발전소 | 경기도 김포시 통진읍 월하로301번길 225-1 | 사업개시 | 22.0 | 2012-08-15 | 2023-10-23
631 | 파워에이스 태양광발전소 | 경기도 김포시 대곶면 대곶북로 527-41 | 사업개시 | 27.0 | 2012-08-01 | 2023-10-23
632 | 거성 태양광발전소 | 경기도 김포시 통진읍 월하로301번길 225-1, (가동 건물위) | 사업개시 | 55.0 | 2012-03-23 | 2023-10-23
633 | 갈산 태양광발전소 | 경기도 김포시 월곶면 점동로 89-23 | 사업개시 | 60.0 | 2010-09-27 | 2023-10-23
634 | 지멘스(주) 태양광발전소 | 경기도 김포시 월곶면 김포대로2659번길 77 | 사업개시 | 19.8 | 2010-01-27 | 2023-10-23
635 | 김포하수처리장 태양광발전소 | 경기도 김포시 감암로 137 (걸포동) | 사업개시 | 465.6 | 2009-12-09 | 2023-10-23
636 | 시암 태양광발전소 | 경기도 김포시 하성면 석평로544번길 28-1 | 사업개시 | 29.4 | 2009-09-30 | 2023-10-23
637 | 정한 태양광발전소 | 경기도 김포시 대곶면 수남로22번길 154-24 | 사업개시 | 23.0 | 2009-09-24 | 2023-10-23