Overview

Dataset statistics

Number of variables18
Number of observations320
Missing cells672
Missing cells (%)11.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory46.4 KiB
Average record size in memory148.4 B

Variable types

Categorical10
Text4
Numeric2
DateTime2

Dataset

Description경기도 양주시 태양광발전소 현황에 대한 데이터로 용도, 세부용도, 사업상태, 발전소명, 주소, 설비용량 등의 내용을 포함하고 있습니다.
Author경기도 양주시
URLhttps://www.data.go.kr/data/15096302/fileData.do

Alerts

허가기관명 has constant value ""Constant
시군명 has constant value ""Constant
용도 has constant value ""Constant
세부용도 has constant value ""Constant
주파수 has constant value ""Constant
데이터기준일자 has constant value ""Constant
설비용량 is highly overall correlated with 집광판면적 and 1 other fieldsHigh correlation
집광판면적 is highly overall correlated with 설비용량 and 2 other fieldsHigh correlation
설치위치구분 is highly overall correlated with 집광판면적 and 1 other fieldsHigh correlation
지목 is highly overall correlated with 설비용량 and 2 other fieldsHigh correlation
공급전압 is highly imbalanced (90.3%)Imbalance
설치위치구분 is highly imbalanced (69.3%)Imbalance
소재지도로명주소 has 291 (90.9%) missing valuesMissing
소재지지번주소 has 29 (9.1%) missing valuesMissing
사업개시일자 has 88 (27.5%) missing valuesMissing
비고 has 262 (81.9%) missing valuesMissing

Reproduction

Analysis started2024-03-14 17:23:23.272214
Analysis finished2024-03-14 17:23:26.271760
Duration3 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

허가기관명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
양주시
320 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row양주시
2nd row양주시
3rd row양주시
4th row양주시
5th row양주시

Common Values

ValueCountFrequency (%)
양주시 320
100.0%

Length

2024-03-15T02:23:26.417386image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T02:23:26.645397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
양주시 320
100.0%

시군명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
양주시
320 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row양주시
2nd row양주시
3rd row양주시
4th row양주시
5th row양주시

Common Values

ValueCountFrequency (%)
양주시 320
100.0%

Length

2024-03-15T02:23:26.874543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T02:23:27.263103image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
양주시 320
100.0%

용도
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
전기사업용
320 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row전기사업용
2nd row전기사업용
3rd row전기사업용
4th row전기사업용
5th row전기사업용

Common Values

ValueCountFrequency (%)
전기사업용 320
100.0%

Length

2024-03-15T02:23:27.443065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T02:23:27.671676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전기사업용 320
100.0%

세부용도
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
발전사업용
320 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row발전사업용
2nd row발전사업용
3rd row발전사업용
4th row발전사업용
5th row발전사업용

Common Values

ValueCountFrequency (%)
발전사업용 320
100.0%

Length

2024-03-15T02:23:28.001740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T02:23:28.313115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
발전사업용 320
100.0%

사업상태
Categorical

Distinct2
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
운영중
232 
사업허가
88 

Length

Max length4
Median length3
Mean length3.275
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row운영중
2nd row운영중
3rd row운영중
4th row운영중
5th row운영중

Common Values

ValueCountFrequency (%)
운영중 232
72.5%
사업허가 88
 
27.5%

Length

2024-03-15T02:23:28.640931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T02:23:28.967473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
운영중 232
72.5%
사업허가 88
 
27.5%
Distinct313
Distinct (%)97.8%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
2024-03-15T02:23:29.808432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length16
Mean length9.33125
Min length3

Characters and Unicode

Total characters2986
Distinct characters255
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique307 ?
Unique (%)95.9%

Sample

1st row신** 태양광발전소
2nd row광사 태양광발전소
3rd row냅대 태양광발전소
4th row강** 태양광발전소
5th row황** 태양광발전소
ValueCountFrequency (%)
태양광발전소 28
 
7.7%
8
 
2.2%
태양광 5
 
1.4%
봉양3통마을회 2
 
0.6%
에스시엠아이 2
 
0.6%
에덴 2
 
0.6%
2
 
0.6%
2
 
0.6%
미래태양광발전소 2
 
0.6%
그린태양광발전소 2
 
0.6%
Other values (305) 307
84.8%
2024-03-15T02:23:31.167150image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
296
 
9.9%
293
 
9.8%
290
 
9.7%
288
 
9.6%
274
 
9.2%
271
 
9.1%
76
 
2.5%
2 49
 
1.6%
* 48
 
1.6%
1 45
 
1.5%
Other values (245) 1056
35.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2760
92.4%
Decimal Number 127
 
4.3%
Other Punctuation 48
 
1.6%
Space Separator 42
 
1.4%
Uppercase Letter 5
 
0.2%
Dash Punctuation 2
 
0.1%
Other Symbol 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
296
 
10.7%
293
 
10.6%
290
 
10.5%
288
 
10.4%
274
 
9.9%
271
 
9.8%
76
 
2.8%
29
 
1.1%
25
 
0.9%
25
 
0.9%
Other values (227) 893
32.4%
Decimal Number
ValueCountFrequency (%)
2 49
38.6%
1 45
35.4%
3 21
16.5%
4 3
 
2.4%
5 2
 
1.6%
0 2
 
1.6%
6 2
 
1.6%
8 1
 
0.8%
9 1
 
0.8%
7 1
 
0.8%
Uppercase Letter
ValueCountFrequency (%)
H 2
40.0%
Y 1
20.0%
K 1
20.0%
G 1
20.0%
Other Punctuation
ValueCountFrequency (%)
* 48
100.0%
Space Separator
ValueCountFrequency (%)
42
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2762
92.5%
Common 219
 
7.3%
Latin 5
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
296
 
10.7%
293
 
10.6%
290
 
10.5%
288
 
10.4%
274
 
9.9%
271
 
9.8%
76
 
2.8%
29
 
1.0%
25
 
0.9%
25
 
0.9%
Other values (228) 895
32.4%
Common
ValueCountFrequency (%)
2 49
22.4%
* 48
21.9%
1 45
20.5%
42
19.2%
3 21
9.6%
4 3
 
1.4%
5 2
 
0.9%
0 2
 
0.9%
6 2
 
0.9%
- 2
 
0.9%
Other values (3) 3
 
1.4%
Latin
ValueCountFrequency (%)
H 2
40.0%
Y 1
20.0%
K 1
20.0%
G 1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2760
92.4%
ASCII 224
 
7.5%
None 2
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
296
 
10.7%
293
 
10.6%
290
 
10.5%
288
 
10.4%
274
 
9.9%
271
 
9.8%
76
 
2.8%
29
 
1.1%
25
 
0.9%
25
 
0.9%
Other values (227) 893
32.4%
ASCII
ValueCountFrequency (%)
2 49
21.9%
* 48
21.4%
1 45
20.1%
42
18.8%
3 21
9.4%
4 3
 
1.3%
5 2
 
0.9%
0 2
 
0.9%
6 2
 
0.9%
- 2
 
0.9%
Other values (7) 8
 
3.6%
None
ValueCountFrequency (%)
2
100.0%
Distinct29
Distinct (%)100.0%
Missing291
Missing (%)90.9%
Memory size2.6 KiB
2024-03-15T02:23:32.188245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length47
Median length38
Mean length32.206897
Min length19

Characters and Unicode

Total characters934
Distinct characters85
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique29 ?
Unique (%)100.0%

Sample

1st row경기도 양주시 백석읍 월암로112번길 35-6 [건물위]
2nd row경기도 양주시 장흥면 일영로502번길 222-58 [건물위]
3rd row경기도 양주시 청담로84번길 231-298
4th row경기도 양주시 고덕로139번길 61-26, 제2호 (덕계동) [건물위]
5th row경기도 양주시 청담로42번길 42 (고읍동) [건물위]
ValueCountFrequency (%)
경기도 29
 
15.7%
양주시 29
 
15.7%
건물위 12
 
6.5%
10
 
5.4%
백석읍 6
 
3.2%
광적면 6
 
3.2%
건물 5
 
2.7%
은현면 5
 
2.7%
삼일로 3
 
1.6%
남면 3
 
1.6%
Other values (72) 77
41.6%
2024-03-15T02:23:33.564065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
156
 
16.7%
2 36
 
3.9%
31
 
3.3%
1 31
 
3.3%
29
 
3.1%
29
 
3.1%
29
 
3.1%
29
 
3.1%
29
 
3.1%
27
 
2.9%
Other values (75) 508
54.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 498
53.3%
Decimal Number 183
 
19.6%
Space Separator 156
 
16.7%
Close Punctuation 34
 
3.6%
Open Punctuation 34
 
3.6%
Dash Punctuation 17
 
1.8%
Other Punctuation 10
 
1.1%
Uppercase Letter 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
31
 
6.2%
29
 
5.8%
29
 
5.8%
29
 
5.8%
29
 
5.8%
29
 
5.8%
27
 
5.4%
27
 
5.4%
23
 
4.6%
23
 
4.6%
Other values (56) 222
44.6%
Decimal Number
ValueCountFrequency (%)
2 36
19.7%
1 31
16.9%
5 21
11.5%
3 17
9.3%
6 17
9.3%
4 15
8.2%
7 14
 
7.7%
9 13
 
7.1%
8 12
 
6.6%
0 7
 
3.8%
Close Punctuation
ValueCountFrequency (%)
] 20
58.8%
) 14
41.2%
Open Punctuation
ValueCountFrequency (%)
[ 20
58.8%
( 14
41.2%
Uppercase Letter
ValueCountFrequency (%)
B 1
50.0%
A 1
50.0%
Space Separator
ValueCountFrequency (%)
156
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 17
100.0%
Other Punctuation
ValueCountFrequency (%)
, 10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 498
53.3%
Common 434
46.5%
Latin 2
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
31
 
6.2%
29
 
5.8%
29
 
5.8%
29
 
5.8%
29
 
5.8%
29
 
5.8%
27
 
5.4%
27
 
5.4%
23
 
4.6%
23
 
4.6%
Other values (56) 222
44.6%
Common
ValueCountFrequency (%)
156
35.9%
2 36
 
8.3%
1 31
 
7.1%
5 21
 
4.8%
] 20
 
4.6%
[ 20
 
4.6%
3 17
 
3.9%
- 17
 
3.9%
6 17
 
3.9%
4 15
 
3.5%
Other values (7) 84
19.4%
Latin
ValueCountFrequency (%)
B 1
50.0%
A 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 498
53.3%
ASCII 436
46.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
156
35.8%
2 36
 
8.3%
1 31
 
7.1%
5 21
 
4.8%
] 20
 
4.6%
[ 20
 
4.6%
3 17
 
3.9%
- 17
 
3.9%
6 17
 
3.9%
4 15
 
3.4%
Other values (9) 86
19.7%
Hangul
ValueCountFrequency (%)
31
 
6.2%
29
 
5.8%
29
 
5.8%
29
 
5.8%
29
 
5.8%
29
 
5.8%
27
 
5.4%
27
 
5.4%
23
 
4.6%
23
 
4.6%
Other values (56) 222
44.6%

소재지지번주소
Text

MISSING 

Distinct279
Distinct (%)95.9%
Missing29
Missing (%)9.1%
Memory size2.6 KiB
2024-03-15T02:23:34.691158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length107
Median length38
Mean length25.955326
Min length14

Characters and Unicode

Total characters7553
Distinct characters120
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique269 ?
Unique (%)92.4%

Sample

1st row경기도 양주시 회정동 69-2
2nd row경기도 양주시 광사동 98
3rd row경기도 양주시 산북동 155-4[건물 위]
4th row경기도 양주시 옥정동 796-5
5th row경기도 양주시 은현면 운암리 498-1 [건물 위]
ValueCountFrequency (%)
양주시 291
17.4%
경기도 224
 
13.4%
165
 
9.8%
은현면 63
 
3.8%
광적면 62
 
3.7%
남면 49
 
2.9%
백석읍 42
 
2.5%
봉양동 22
 
1.3%
운암리 18
 
1.1%
가납리 17
 
1.0%
Other values (380) 724
43.2%
2024-03-15T02:23:36.071911image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1386
 
18.4%
313
 
4.1%
292
 
3.9%
291
 
3.9%
250
 
3.3%
- 249
 
3.3%
1 249
 
3.3%
234
 
3.1%
230
 
3.0%
225
 
3.0%
Other values (110) 3834
50.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3986
52.8%
Space Separator 1386
 
18.4%
Decimal Number 1258
 
16.7%
Dash Punctuation 249
 
3.3%
Open Punctuation 232
 
3.1%
Close Punctuation 231
 
3.1%
Other Punctuation 172
 
2.3%
Uppercase Letter 39
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
313
 
7.9%
292
 
7.3%
291
 
7.3%
250
 
6.3%
234
 
5.9%
230
 
5.8%
225
 
5.6%
224
 
5.6%
207
 
5.2%
207
 
5.2%
Other values (85) 1513
38.0%
Decimal Number
ValueCountFrequency (%)
1 249
19.8%
2 170
13.5%
4 155
12.3%
3 147
11.7%
5 124
9.9%
6 93
 
7.4%
7 93
 
7.4%
9 83
 
6.6%
8 78
 
6.2%
0 66
 
5.2%
Uppercase Letter
ValueCountFrequency (%)
B 13
33.3%
A 12
30.8%
C 6
15.4%
F 3
 
7.7%
D 2
 
5.1%
E 2
 
5.1%
J 1
 
2.6%
Open Punctuation
ValueCountFrequency (%)
( 205
88.4%
[ 27
 
11.6%
Close Punctuation
ValueCountFrequency (%)
) 204
88.3%
] 27
 
11.7%
Other Punctuation
ValueCountFrequency (%)
, 171
99.4%
. 1
 
0.6%
Space Separator
ValueCountFrequency (%)
1386
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 249
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3986
52.8%
Common 3528
46.7%
Latin 39
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
313
 
7.9%
292
 
7.3%
291
 
7.3%
250
 
6.3%
234
 
5.9%
230
 
5.8%
225
 
5.6%
224
 
5.6%
207
 
5.2%
207
 
5.2%
Other values (85) 1513
38.0%
Common
ValueCountFrequency (%)
1386
39.3%
- 249
 
7.1%
1 249
 
7.1%
( 205
 
5.8%
) 204
 
5.8%
, 171
 
4.8%
2 170
 
4.8%
4 155
 
4.4%
3 147
 
4.2%
5 124
 
3.5%
Other values (8) 468
 
13.3%
Latin
ValueCountFrequency (%)
B 13
33.3%
A 12
30.8%
C 6
15.4%
F 3
 
7.7%
D 2
 
5.1%
E 2
 
5.1%
J 1
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3986
52.8%
ASCII 3567
47.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1386
38.9%
- 249
 
7.0%
1 249
 
7.0%
( 205
 
5.7%
) 204
 
5.7%
, 171
 
4.8%
2 170
 
4.8%
4 155
 
4.3%
3 147
 
4.1%
5 124
 
3.5%
Other values (15) 507
 
14.2%
Hangul
ValueCountFrequency (%)
313
 
7.9%
292
 
7.3%
291
 
7.3%
250
 
6.3%
234
 
5.9%
230
 
5.8%
225
 
5.6%
224
 
5.6%
207
 
5.2%
207
 
5.2%
Other values (85) 1513
38.0%

설비용량
Real number (ℝ)

HIGH CORRELATION 

Distinct228
Distinct (%)71.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean95.165984
Minimum5.625
Maximum999.9
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.9 KiB
2024-03-15T02:23:36.489224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum5.625
5-th percentile17.95
Q129.81875
median70.56
Q399.45
95-th percentile294.748
Maximum999.9
Range994.275
Interquartile range (IQR)69.63125

Descriptive statistics

Standard deviation116.83641
Coefficient of variation (CV)1.2277119
Kurtosis25.450883
Mean95.165984
Median Absolute Deviation (MAD)31.36
Skewness4.3265251
Sum30453.115
Variance13650.748
MonotonicityNot monotonic
2024-03-15T02:23:36.935820image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
29.7 8
 
2.5%
30.0 7
 
2.2%
99.96 6
 
1.9%
20.0 5
 
1.6%
99.28 5
 
1.6%
93.075 4
 
1.2%
99.36 4
 
1.2%
99.4 4
 
1.2%
97.2 4
 
1.2%
28.8 3
 
0.9%
Other values (218) 270
84.4%
ValueCountFrequency (%)
5.625 1
 
0.3%
9.52 1
 
0.3%
9.75 1
 
0.3%
10.5 1
 
0.3%
10.8 2
0.6%
13.5 1
 
0.3%
14.4 1
 
0.3%
14.7 1
 
0.3%
15.0 3
0.9%
15.12 1
 
0.3%
ValueCountFrequency (%)
999.9 1
0.3%
999.68 1
0.3%
695.8 1
0.3%
567.45 1
0.3%
504.0 1
0.3%
496.1 1
0.3%
489.6 1
0.3%
459.9 1
0.3%
403.2 1
0.3%
395.28 1
0.3%

공급전압
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
380
316 
220
 
4

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row380
2nd row380
3rd row380
4th row380
5th row380

Common Values

ValueCountFrequency (%)
380 316
98.8%
220 4
 
1.2%

Length

2024-03-15T02:23:37.684033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T02:23:37.884908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
380 316
98.8%
220 4
 
1.2%

주파수
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
60
320 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row60
2nd row60
3rd row60
4th row60
5th row60

Common Values

ValueCountFrequency (%)
60 320
100.0%

Length

2024-03-15T02:23:38.150791image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T02:23:38.319340image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
60 320
100.0%
Distinct191
Distinct (%)59.7%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
Minimum2013-12-23 00:00:00
Maximum2022-11-29 00:00:00
2024-03-15T02:23:38.497119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T02:23:38.743490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

사업개시일자
Date

MISSING 

Distinct181
Distinct (%)78.0%
Missing88
Missing (%)27.5%
Memory size2.6 KiB
Minimum2014-05-10 00:00:00
Maximum2022-10-19 00:00:00
2024-03-15T02:23:39.006714image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T02:23:39.362766image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

집광판면적
Real number (ℝ)

HIGH CORRELATION 

Distinct288
Distinct (%)90.6%
Missing2
Missing (%)0.6%
Infinite0
Infinite (%)0.0%
Mean582.18588
Minimum28.06
Maximum10149
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.9 KiB
2024-03-15T02:23:39.616828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum28.06
5-th percentile88.91
Q1150.125
median357.13
Q3550
95-th percentile1800
Maximum10149
Range10120.94
Interquartile range (IQR)399.875

Descriptive statistics

Standard deviation976.62364
Coefficient of variation (CV)1.6775117
Kurtosis43.34856
Mean582.18588
Median Absolute Deviation (MAD)200.645
Skewness5.8502582
Sum185135.11
Variance953793.73
MonotonicityNot monotonic
2024-03-15T02:23:39.904350image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
507.25 4
 
1.2%
483.0 3
 
0.9%
143.32 3
 
0.9%
180.0 3
 
0.9%
180.47 3
 
0.9%
145.068 2
 
0.6%
160.0 2
 
0.6%
324.4 2
 
0.6%
140.7 2
 
0.6%
223.0 2
 
0.6%
Other values (278) 292
91.2%
ValueCountFrequency (%)
28.06 1
0.3%
28.494 1
0.3%
29.82 1
0.3%
43.0 1
0.3%
51.39 1
0.3%
55.8 1
0.3%
56.81 1
0.3%
65.5 1
0.3%
66.2 1
0.3%
67.0 1
0.3%
ValueCountFrequency (%)
10149.0 1
0.3%
7200.0 1
0.3%
6664.58 1
0.3%
6042.0 1
0.3%
5055.0 1
0.3%
3264.0 1
0.3%
2921.38 1
0.3%
2656.0 1
0.3%
2505.44 1
0.3%
2431.8 1
0.3%

설치위치구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
건물위
287 
토지위
 
19
토지위, 건물위
 
11
저수지 위
 
3

Length

Max length8
Median length3
Mean length3.190625
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row건물위
2nd row건물위
3rd row건물위
4th row건물위
5th row건물위

Common Values

ValueCountFrequency (%)
건물위 287
89.7%
토지위 19
 
5.9%
토지위, 건물위 11
 
3.4%
저수지 위 3
 
0.9%

Length

2024-03-15T02:23:40.140688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T02:23:40.341676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
건물위 298
89.2%
토지위 30
 
9.0%
저수지 3
 
0.9%
3
 
0.9%

지목
Categorical

HIGH CORRELATION 

Distinct26
Distinct (%)8.1%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
125 
89 
21 
20 
14 
Other values (21)
51 

Length

Max length17
Median length1
Mean length1.228125
Min length1

Unique

Unique12 ?
Unique (%)3.8%

Sample

1st row
2nd row장,대
3rd row
4th row
5th row장,창

Common Values

ValueCountFrequency (%)
125
39.1%
89
27.8%
21
 
6.6%
20
 
6.2%
14
 
4.4%
14
 
4.4%
9
 
2.8%
3
 
0.9%
장,대 3
 
0.9%
대,창 2
 
0.6%
Other values (16) 20
 
6.2%

Length

2024-03-15T02:23:40.632314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
128
38.8%
92
27.9%
22
 
6.7%
22
 
6.7%
15
 
4.5%
15
 
4.5%
9
 
2.7%
3
 
0.9%
장,대 3
 
0.9%
종교용지 2
 
0.6%
Other values (13) 19
 
5.8%

비고
Text

MISSING 

Distinct57
Distinct (%)98.3%
Missing262
Missing (%)81.9%
Memory size2.6 KiB
2024-03-15T02:23:41.516170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length19
Mean length10.672414
Min length2

Characters and Unicode

Total characters619
Distinct characters36
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique56 ?
Unique (%)96.6%

Sample

1st row70-3, 70-11 [건물위]
2nd row98-1 [건물위]
3rd row231-306 (옥정동) [건물위]
4th row796-5,21 [건물 위]
5th row634-10
ValueCountFrequency (%)
11
 
10.5%
건물위 8
 
7.6%
건물 5
 
4.8%
131 2
 
1.9%
634-10 2
 
1.9%
127-9 2
 
1.9%
127-10 2
 
1.9%
127-13 2
 
1.9%
70-3 1
 
1.0%
387-7 1
 
1.0%
Other values (69) 69
65.7%
2024-03-15T02:23:42.770441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 83
13.4%
1 64
 
10.3%
2 56
 
9.0%
49
 
7.9%
7 41
 
6.6%
, 38
 
6.1%
4 37
 
6.0%
3 34
 
5.5%
0 24
 
3.9%
6 22
 
3.6%
Other values (26) 171
27.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 337
54.4%
Dash Punctuation 83
 
13.4%
Other Letter 82
 
13.2%
Space Separator 49
 
7.9%
Other Punctuation 38
 
6.1%
Open Punctuation 14
 
2.3%
Close Punctuation 14
 
2.3%
Uppercase Letter 2
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
19
23.2%
19
23.2%
19
23.2%
9
11.0%
3
 
3.7%
2
 
2.4%
1
 
1.2%
1
 
1.2%
1
 
1.2%
1
 
1.2%
Other values (7) 7
 
8.5%
Decimal Number
ValueCountFrequency (%)
1 64
19.0%
2 56
16.6%
7 41
12.2%
4 37
11.0%
3 34
10.1%
0 24
 
7.1%
6 22
 
6.5%
5 21
 
6.2%
9 20
 
5.9%
8 18
 
5.3%
Open Punctuation
ValueCountFrequency (%)
( 9
64.3%
[ 5
35.7%
Close Punctuation
ValueCountFrequency (%)
) 9
64.3%
] 5
35.7%
Uppercase Letter
ValueCountFrequency (%)
E 1
50.0%
F 1
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 83
100.0%
Space Separator
ValueCountFrequency (%)
49
100.0%
Other Punctuation
ValueCountFrequency (%)
, 38
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 535
86.4%
Hangul 82
 
13.2%
Latin 2
 
0.3%

Most frequent character per script

Common
ValueCountFrequency (%)
- 83
15.5%
1 64
12.0%
2 56
10.5%
49
9.2%
7 41
7.7%
, 38
7.1%
4 37
6.9%
3 34
 
6.4%
0 24
 
4.5%
6 22
 
4.1%
Other values (7) 87
16.3%
Hangul
ValueCountFrequency (%)
19
23.2%
19
23.2%
19
23.2%
9
11.0%
3
 
3.7%
2
 
2.4%
1
 
1.2%
1
 
1.2%
1
 
1.2%
1
 
1.2%
Other values (7) 7
 
8.5%
Latin
ValueCountFrequency (%)
E 1
50.0%
F 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 537
86.8%
Hangul 82
 
13.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 83
15.5%
1 64
11.9%
2 56
10.4%
49
9.1%
7 41
7.6%
, 38
7.1%
4 37
6.9%
3 34
 
6.3%
0 24
 
4.5%
6 22
 
4.1%
Other values (9) 89
16.6%
Hangul
ValueCountFrequency (%)
19
23.2%
19
23.2%
19
23.2%
9
11.0%
3
 
3.7%
2
 
2.4%
1
 
1.2%
1
 
1.2%
1
 
1.2%
1
 
1.2%
Other values (7) 7
 
8.5%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
2022-12-01
320 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022-12-01
2nd row2022-12-01
3rd row2022-12-01
4th row2022-12-01
5th row2022-12-01

Common Values

ValueCountFrequency (%)
2022-12-01 320
100.0%

Length

2024-03-15T02:23:43.014894image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T02:23:43.192539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022-12-01 320
100.0%

Interactions

2024-03-15T02:23:24.821484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T02:23:24.316788image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T02:23:25.074338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T02:23:24.575258image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T02:23:43.311610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업상태소재지도로명주소설비용량공급전압집광판면적설치위치구분지목비고
사업상태1.0001.0000.2390.0000.1250.0000.3121.000
소재지도로명주소1.0001.000NaNNaNNaN1.0001.000NaN
설비용량0.239NaN1.0000.0000.8830.8150.8291.000
공급전압0.000NaN0.0001.0000.0000.0000.000NaN
집광판면적0.125NaN0.8830.0001.0000.7570.8571.000
설치위치구분0.0001.0000.8150.0000.7571.0000.8631.000
지목0.3121.0000.8290.0000.8570.8631.0001.000
비고1.000NaN1.000NaN1.0001.0001.0001.000
2024-03-15T02:23:43.531105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업상태설치위치구분공급전압지목
사업상태1.0000.0000.0000.238
설치위치구분0.0001.0000.0000.629
공급전압0.0000.0001.0000.000
지목0.2380.6290.0001.000
2024-03-15T02:23:43.701081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
설비용량집광판면적사업상태공급전압설치위치구분지목
설비용량1.0000.9300.1770.0000.4800.502
집광판면적0.9301.0000.1230.0000.5980.533
사업상태0.1770.1231.0000.0000.0000.238
공급전압0.0000.0000.0001.0000.0000.000
설치위치구분0.4800.5980.0000.0001.0000.629
지목0.5020.5330.2380.0000.6291.000

Missing values

2024-03-15T02:23:25.463063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T02:23:25.869272image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-15T02:23:26.144833image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

허가기관명시군명용도세부용도사업상태발전소명소재지도로명주소소재지지번주소설비용량공급전압주파수사업허가일자사업개시일자집광판면적설치위치구분지목비고데이터기준일자
0양주시양주시전기사업용발전사업용운영중신** 태양광발전소<NA>경기도 양주시 회정동 69-230.0380602013-12-232014-07-04159.0건물위70-3, 70-11 [건물위]2022-12-01
1양주시양주시전기사업용발전사업용운영중광사 태양광발전소<NA>경기도 양주시 광사동 9883.7380602014-01-172014-05-10394.0건물위장,대98-1 [건물위]2022-12-01
2양주시양주시전기사업용발전사업용운영중냅대 태양광발전소경기도 양주시 백석읍 월암로112번길 35-6 [건물위]<NA>15.0380602014-02-282014-09-05104.0건물위<NA>2022-12-01
3양주시양주시전기사업용발전사업용운영중강** 태양광발전소경기도 양주시 장흥면 일영로502번길 222-58 [건물위]<NA>19.25380602014-04-152014-07-1767.0건물위<NA>2022-12-01
4양주시양주시전기사업용발전사업용운영중황** 태양광발전소경기도 양주시 청담로84번길 231-298<NA>85.0380602014-04-082014-08-01446.0건물위장,창231-306 (옥정동) [건물위]2022-12-01
5양주시양주시전기사업용발전사업용운영중일오삼 태양광발전소경기도 양주시 고덕로139번길 61-26, 제2호 (덕계동) [건물위]<NA>20.0380602014-05-122014-06-25115.65건물위<NA>2022-12-01
6양주시양주시전기사업용발전사업용운영중예광 태양광발전소경기도 양주시 청담로42번길 42 (고읍동) [건물위]<NA>37.44380602014-05-202014-10-06193.0건물위<NA>2022-12-01
7양주시양주시전기사업용발전사업용운영중예스2 태양광발전소경기도 양주시 청담로84번길 231-140 (옥정동 454) [건물위]<NA>30.0380602014-05-222014-11-04197.0건물위<NA>2022-12-01
8양주시양주시전기사업용발전사업용운영중덕영 태양광발전소경기도 양주시 은현면 은현로56번길 243 [건물위]<NA>74.1380602014-05-262015-01-08518.0건물위<NA>2022-12-01
9양주시양주시전기사업용발전사업용운영중에덴 태양광발전소 1경기도 양주시 광적면 삼일로 264, B동 [건물위]<NA>20.0380602014-06-132014-09-12138.0건물위<NA>2022-12-01
허가기관명시군명용도세부용도사업상태발전소명소재지도로명주소소재지지번주소설비용량공급전압주파수사업허가일자사업개시일자집광판면적설치위치구분지목비고데이터기준일자
310양주시양주시전기사업용발전사업용사업허가양주2호태양광발전소<NA>양주시 은현면 운암리 173-1(건물 위)99.76380602022-09-13<NA>460.0건물위<NA>2022-12-01
311양주시양주시전기사업용발전사업용사업허가태양농장3호태양광발전소<NA>양주시 은현면 하패리 804-1, 841-1, B동, J동(건물 위)99.76380602022-09-29<NA>447.2건물위<NA>2022-12-01
312양주시양주시전기사업용발전사업용사업허가대동알파산업발전소<NA>양주시 봉양동 564-28(건물 위)147.32380602022-09-29<NA>678.18건물위<NA>2022-12-01
313양주시양주시전기사업용발전사업용사업허가대원태양광발전소<NA>양주시 은현면 도하리 454-1, 다동, 라동(건물 위)56.64380602022-10-17<NA>277.0건물위<NA>2022-12-01
314양주시양주시전기사업용발전사업용사업허가성수3호태양광발전소<NA>양주시 광적면 우고리 320-2, 321-1(건물 위)174.96380602022-11-02<NA>976.0건물위<NA>2022-12-01
315양주시양주시전기사업용발전사업용사업허가동보산업태양광발전소<NA>양주시 은현면 선암리 218-1(건물 위)90.86380602022-11-02<NA>410.01건물위<NA>2022-12-01
316양주시양주시전기사업용발전사업용사업허가월드글로벌2태양광발전소<NA>양주시 장흥면 부곡리 84-163, 85-15, -29(건물 위)294.64380602022-11-22<NA>1391.0건물위대, 장<NA>2022-12-01
317양주시양주시전기사업용발전사업용사업허가동남태양광발전소<NA>양주시 남면 경신리 154-5(건물 위)29.585380602022-11-22<NA>139.3건물위<NA>2022-12-01
318양주시양주시전기사업용발전사업용사업허가종혁태양광발전소<NA>양주시 남면 경신리 221-1, 240-4(건물 위)33.04380602022-11-29<NA>151.1건물위<NA>2022-12-01
319양주시양주시전기사업용발전사업용사업허가백석태양광발전소2호<NA>양주시 백석읍 오산리 704-6(건물 위)9.52380602022-11-29<NA>43.0건물위<NA>2022-12-01