Overview

Dataset statistics

Number of variables13
Number of observations1396
Missing cells1086
Missing cells (%)6.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory148.7 KiB
Average record size in memory109.1 B

Variable types

Numeric3
Text5
DateTime2
Categorical3

Dataset

Description충청남도 금산군 태양광 발전소 설치현황(발전소명, 설비용량, 주소, 최초허가일, 사업개시일) 관련하여 자료를 제공합니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=385&beforeMenuCd=DOM_000000201001001000&publicdatapk=15033866

Alerts

데이터기준일 has constant value ""Constant
사업상태 is highly overall correlated with 공급전압 and 1 other fieldsHigh correlation
주파수 is highly overall correlated with 연번 and 4 other fieldsHigh correlation
공급전압 is highly overall correlated with 연번 and 4 other fieldsHigh correlation
연번 is highly overall correlated with 공급전압 and 1 other fieldsHigh correlation
설비용량 is highly overall correlated with 공급전압 and 1 other fieldsHigh correlation
설치면적 is highly overall correlated with 공급전압 and 1 other fieldsHigh correlation
공급전압 is highly imbalanced (88.3%)Imbalance
주파수 is highly imbalanced (88.3%)Imbalance
사업상태 is highly imbalanced (83.1%)Imbalance
사업개시일 has 1028 (73.6%) missing valuesMissing
설치면적 has 29 (2.1%) missing valuesMissing
지목 has 29 (2.1%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2024-01-09 21:12:02.395924
Analysis finished2024-01-09 21:12:04.147200
Duration1.75 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct1396
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean698.5
Minimum1
Maximum1396
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size12.4 KiB
2024-01-10T06:12:04.210189image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile70.75
Q1349.75
median698.5
Q31047.25
95-th percentile1326.25
Maximum1396
Range1395
Interquartile range (IQR)697.5

Descriptive statistics

Standard deviation403.1348
Coefficient of variation (CV)0.57714359
Kurtosis-1.2
Mean698.5
Median Absolute Deviation (MAD)349
Skewness0
Sum975106
Variance162517.67
MonotonicityStrictly increasing
2024-01-10T06:12:04.335984image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
930 1
 
0.1%
938 1
 
0.1%
937 1
 
0.1%
936 1
 
0.1%
935 1
 
0.1%
934 1
 
0.1%
933 1
 
0.1%
932 1
 
0.1%
931 1
 
0.1%
Other values (1386) 1386
99.3%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1396 1
0.1%
1395 1
0.1%
1394 1
0.1%
1393 1
0.1%
1392 1
0.1%
1391 1
0.1%
1390 1
0.1%
1389 1
0.1%
1388 1
0.1%
1387 1
0.1%
Distinct1372
Distinct (%)98.3%
Missing0
Missing (%)0.0%
Memory size11.0 KiB
2024-01-10T06:12:04.521300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length18
Mean length9.8839542
Min length1

Characters and Unicode

Total characters13798
Distinct characters370
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1350 ?
Unique (%)96.7%

Sample

1st row방우리소수력
2nd row 금성
3rd row 두리
4th row 영솔라
5th row ㈜서울전업공사
ValueCountFrequency (%)
태양광발전소 741
31.8%
태양광 39
 
1.7%
외부리 19
 
0.8%
마장리 16
 
0.7%
1호 13
 
0.6%
2호 12
 
0.5%
3호 11
 
0.5%
진산태양광발전소 10
 
0.4%
금산 9
 
0.4%
창평태양광발전소 8
 
0.3%
Other values (1317) 1450
62.3%
2024-01-10T06:12:04.833367image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1677
 
12.2%
1079
 
7.8%
1075
 
7.8%
1062
 
7.7%
1018
 
7.4%
1011
 
7.3%
1000
 
7.2%
690
 
5.0%
1 336
 
2.4%
2 231
 
1.7%
Other values (360) 4619
33.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10758
78.0%
Space Separator 1677
 
12.2%
Decimal Number 1140
 
8.3%
Dash Punctuation 85
 
0.6%
Uppercase Letter 84
 
0.6%
Other Symbol 32
 
0.2%
Close Punctuation 8
 
0.1%
Open Punctuation 8
 
0.1%
Other Punctuation 4
 
< 0.1%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1079
 
10.0%
1075
 
10.0%
1062
 
9.9%
1018
 
9.5%
1011
 
9.4%
1000
 
9.3%
690
 
6.4%
218
 
2.0%
170
 
1.6%
141
 
1.3%
Other values (320) 3294
30.6%
Uppercase Letter
ValueCountFrequency (%)
S 26
31.0%
J 9
 
10.7%
N 8
 
9.5%
C 5
 
6.0%
P 4
 
4.8%
G 4
 
4.8%
H 3
 
3.6%
A 3
 
3.6%
U 3
 
3.6%
I 2
 
2.4%
Other values (11) 17
20.2%
Decimal Number
ValueCountFrequency (%)
1 336
29.5%
2 231
20.3%
3 137
12.0%
4 95
 
8.3%
5 83
 
7.3%
0 68
 
6.0%
6 58
 
5.1%
7 49
 
4.3%
8 44
 
3.9%
9 39
 
3.4%
Other Punctuation
ValueCountFrequency (%)
. 2
50.0%
# 1
25.0%
& 1
25.0%
Space Separator
ValueCountFrequency (%)
1677
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 85
100.0%
Other Symbol
ValueCountFrequency (%)
32
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%
Math Symbol
ValueCountFrequency (%)
> 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 10790
78.2%
Common 2924
 
21.2%
Latin 84
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1079
 
10.0%
1075
 
10.0%
1062
 
9.8%
1018
 
9.4%
1011
 
9.4%
1000
 
9.3%
690
 
6.4%
218
 
2.0%
170
 
1.6%
141
 
1.3%
Other values (321) 3326
30.8%
Latin
ValueCountFrequency (%)
S 26
31.0%
J 9
 
10.7%
N 8
 
9.5%
C 5
 
6.0%
P 4
 
4.8%
G 4
 
4.8%
H 3
 
3.6%
A 3
 
3.6%
U 3
 
3.6%
I 2
 
2.4%
Other values (11) 17
20.2%
Common
ValueCountFrequency (%)
1677
57.4%
1 336
 
11.5%
2 231
 
7.9%
3 137
 
4.7%
4 95
 
3.2%
- 85
 
2.9%
5 83
 
2.8%
0 68
 
2.3%
6 58
 
2.0%
7 49
 
1.7%
Other values (8) 105
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10758
78.0%
ASCII 3008
 
21.8%
None 32
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1677
55.8%
1 336
 
11.2%
2 231
 
7.7%
3 137
 
4.6%
4 95
 
3.2%
- 85
 
2.8%
5 83
 
2.8%
0 68
 
2.3%
6 58
 
1.9%
7 49
 
1.6%
Other values (29) 189
 
6.3%
Hangul
ValueCountFrequency (%)
1079
 
10.0%
1075
 
10.0%
1062
 
9.9%
1018
 
9.5%
1011
 
9.4%
1000
 
9.3%
690
 
6.4%
218
 
2.0%
170
 
1.6%
141
 
1.3%
Other values (320) 3294
30.6%
None
ValueCountFrequency (%)
32
100.0%
Distinct710
Distinct (%)50.9%
Missing0
Missing (%)0.0%
Memory size11.0 KiB
2024-01-10T06:12:05.083933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length53
Median length48
Mean length21.117479
Min length7

Characters and Unicode

Total characters29480
Distinct characters145
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique540 ?
Unique (%)38.7%

Sample

1st row 금산군 부리면 방우리 산3-2
2nd row 금산군 금성면 상가리 산67-1
3rd row 금산군 부리면 창평리 280-1
4th row 금산군 부리면 창평리 280-3
5th row 금산군 추부면 신평리 1000
ValueCountFrequency (%)
금산군 1172
 
19.4%
제원면 234
 
3.9%
부리면 183
 
3.0%
진산면 164
 
2.7%
남이면 157
 
2.6%
금성면 153
 
2.5%
남일면 132
 
2.2%
군북면 130
 
2.2%
복수면 113
 
1.9%
하금리 85
 
1.4%
Other values (953) 3523
58.3%
2024-01-10T06:12:05.498896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6961
23.6%
2050
 
7.0%
1572
 
5.3%
1473
 
5.0%
- 1366
 
4.6%
1 1343
 
4.6%
1312
 
4.5%
1305
 
4.4%
2 1136
 
3.9%
3 1025
 
3.5%
Other values (135) 9937
33.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 12705
43.1%
Decimal Number 7416
25.2%
Space Separator 6961
23.6%
Dash Punctuation 1366
 
4.6%
Other Punctuation 823
 
2.8%
Open Punctuation 104
 
0.4%
Close Punctuation 104
 
0.4%
Control 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2050
16.1%
1572
12.4%
1473
11.6%
1312
 
10.3%
1305
 
10.3%
354
 
2.8%
295
 
2.3%
287
 
2.3%
269
 
2.1%
239
 
1.9%
Other values (115) 3549
27.9%
Decimal Number
ValueCountFrequency (%)
1 1343
18.1%
2 1136
15.3%
3 1025
13.8%
4 858
11.6%
5 601
8.1%
6 595
8.0%
7 499
 
6.7%
0 470
 
6.3%
9 465
 
6.3%
8 424
 
5.7%
Other Punctuation
ValueCountFrequency (%)
, 797
96.8%
/ 23
 
2.8%
. 3
 
0.4%
Open Punctuation
ValueCountFrequency (%)
( 102
98.1%
[ 2
 
1.9%
Close Punctuation
ValueCountFrequency (%)
) 102
98.1%
] 2
 
1.9%
Space Separator
ValueCountFrequency (%)
6961
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1366
100.0%
Control
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 16775
56.9%
Hangul 12705
43.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2050
16.1%
1572
12.4%
1473
11.6%
1312
 
10.3%
1305
 
10.3%
354
 
2.8%
295
 
2.3%
287
 
2.3%
269
 
2.1%
239
 
1.9%
Other values (115) 3549
27.9%
Common
ValueCountFrequency (%)
6961
41.5%
- 1366
 
8.1%
1 1343
 
8.0%
2 1136
 
6.8%
3 1025
 
6.1%
4 858
 
5.1%
, 797
 
4.8%
5 601
 
3.6%
6 595
 
3.5%
7 499
 
3.0%
Other values (10) 1594
 
9.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 16775
56.9%
Hangul 12705
43.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6961
41.5%
- 1366
 
8.1%
1 1343
 
8.0%
2 1136
 
6.8%
3 1025
 
6.1%
4 858
 
5.1%
, 797
 
4.8%
5 601
 
3.6%
6 595
 
3.5%
7 499
 
3.0%
Other values (10) 1594
 
9.5%
Hangul
ValueCountFrequency (%)
2050
16.1%
1572
12.4%
1473
11.6%
1312
 
10.3%
1305
 
10.3%
354
 
2.8%
295
 
2.3%
287
 
2.3%
269
 
2.1%
239
 
1.9%
Other values (115) 3549
27.9%
Distinct331
Distinct (%)23.7%
Missing0
Missing (%)0.0%
Memory size11.0 KiB
Minimum1986-08-05 00:00:00
Maximum2020-04-13 00:00:00
2024-01-10T06:12:05.631472image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:12:05.770799image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

사업개시일
Text

MISSING 

Distinct181
Distinct (%)49.2%
Missing1028
Missing (%)73.6%
Memory size11.0 KiB
2024-01-10T06:12:06.056194image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length9.9755435
Min length1

Characters and Unicode

Total characters3671
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique126 ?
Unique (%)34.2%

Sample

1st row1986-08-05
2nd row2010-01-06
3rd row2010-06-11
4th row2010-06-11
5th row2011-11-25
ValueCountFrequency (%)
2015-03-16 19
 
5.2%
2018-11-28 18
 
4.9%
2018-10-30 16
 
4.4%
2018-05-09 15
 
4.1%
2017-09-21 14
 
3.8%
2018-02-07 9
 
2.5%
2018-11-14 8
 
2.2%
2018-11-02 7
 
1.9%
2018-09-10 7
 
1.9%
2017-03-06 7
 
1.9%
Other values (170) 247
67.3%
2024-01-10T06:12:06.498822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 808
22.0%
- 734
20.0%
1 723
19.7%
2 568
15.5%
8 224
 
6.1%
9 150
 
4.1%
6 115
 
3.1%
3 104
 
2.8%
5 93
 
2.5%
7 93
 
2.5%
Other values (2) 59
 
1.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2936
80.0%
Dash Punctuation 734
 
20.0%
Space Separator 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 808
27.5%
1 723
24.6%
2 568
19.3%
8 224
 
7.6%
9 150
 
5.1%
6 115
 
3.9%
3 104
 
3.5%
5 93
 
3.2%
7 93
 
3.2%
4 58
 
2.0%
Dash Punctuation
ValueCountFrequency (%)
- 734
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3671
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 808
22.0%
- 734
20.0%
1 723
19.7%
2 568
15.5%
8 224
 
6.1%
9 150
 
4.1%
6 115
 
3.1%
3 104
 
2.8%
5 93
 
2.5%
7 93
 
2.5%
Other values (2) 59
 
1.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3671
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 808
22.0%
- 734
20.0%
1 723
19.7%
2 568
15.5%
8 224
 
6.1%
9 150
 
4.1%
6 115
 
3.1%
3 104
 
2.8%
5 93
 
2.5%
7 93
 
2.5%
Other values (2) 59
 
1.6%
Distinct725
Distinct (%)51.9%
Missing0
Missing (%)0.0%
Memory size11.0 KiB
2024-01-10T06:12:06.861422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length57
Median length48
Mean length17.808739
Min length6

Characters and Unicode

Total characters24861
Distinct characters143
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique557 ?
Unique (%)39.9%

Sample

1st row 금산군 부리면 방우리 산3-2
2nd row 금산군 금성면 상가리 산67-1
3rd row 금산군 부리면 창평리 280-1
4th row 금산군 부리면 창평리 280-3
5th row 금산군 추부면 신평리 1000
ValueCountFrequency (%)
부리면 179
 
3.7%
제원면 154
 
3.2%
남이면 142
 
3.0%
진산면 138
 
2.9%
금성면 137
 
2.9%
남일면 111
 
2.3%
군북면 107
 
2.2%
복수면 104
 
2.2%
금산군 94
 
2.0%
하금리 85
 
1.8%
Other values (972) 3540
73.9%
2024-01-10T06:12:07.309880image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5997
24.1%
1582
 
6.4%
- 1389
 
5.6%
1 1389
 
5.6%
2 1157
 
4.7%
1102
 
4.4%
3 1038
 
4.2%
954
 
3.8%
4 860
 
3.5%
, 826
 
3.3%
Other values (133) 8567
34.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8905
35.8%
Decimal Number 7510
30.2%
Space Separator 5997
24.1%
Dash Punctuation 1389
 
5.6%
Other Punctuation 852
 
3.4%
Close Punctuation 104
 
0.4%
Open Punctuation 104
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1582
17.8%
1102
 
12.4%
954
 
10.7%
385
 
4.3%
273
 
3.1%
271
 
3.0%
258
 
2.9%
239
 
2.7%
204
 
2.3%
188
 
2.1%
Other values (114) 3449
38.7%
Decimal Number
ValueCountFrequency (%)
1 1389
18.5%
2 1157
15.4%
3 1038
13.8%
4 860
11.5%
5 606
8.1%
6 596
7.9%
7 500
 
6.7%
0 470
 
6.3%
9 467
 
6.2%
8 427
 
5.7%
Other Punctuation
ValueCountFrequency (%)
, 826
96.9%
/ 23
 
2.7%
. 3
 
0.4%
Close Punctuation
ValueCountFrequency (%)
) 102
98.1%
] 2
 
1.9%
Open Punctuation
ValueCountFrequency (%)
( 102
98.1%
[ 2
 
1.9%
Space Separator
ValueCountFrequency (%)
5997
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1389
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 15956
64.2%
Hangul 8905
35.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1582
17.8%
1102
 
12.4%
954
 
10.7%
385
 
4.3%
273
 
3.1%
271
 
3.0%
258
 
2.9%
239
 
2.7%
204
 
2.3%
188
 
2.1%
Other values (114) 3449
38.7%
Common
ValueCountFrequency (%)
5997
37.6%
- 1389
 
8.7%
1 1389
 
8.7%
2 1157
 
7.3%
3 1038
 
6.5%
4 860
 
5.4%
, 826
 
5.2%
5 606
 
3.8%
6 596
 
3.7%
7 500
 
3.1%
Other values (9) 1598
 
10.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 15956
64.2%
Hangul 8905
35.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5997
37.6%
- 1389
 
8.7%
1 1389
 
8.7%
2 1157
 
7.3%
3 1038
 
6.5%
4 860
 
5.4%
, 826
 
5.2%
5 606
 
3.8%
6 596
 
3.7%
7 500
 
3.1%
Other values (9) 1598
 
10.0%
Hangul
ValueCountFrequency (%)
1582
17.8%
1102
 
12.4%
954
 
10.7%
385
 
4.3%
273
 
3.1%
271
 
3.0%
258
 
2.9%
239
 
2.7%
204
 
2.3%
188
 
2.1%
Other values (114) 3449
38.7%

설비용량
Real number (ℝ)

HIGH CORRELATION 

Distinct305
Distinct (%)21.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean186.949
Minimum9
Maximum2994
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size12.4 KiB
2024-01-10T06:12:07.467281image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum9
5-th percentile45.61125
Q197.92
median99
Q399.54
95-th percentile786.7425
Maximum2994
Range2985
Interquartile range (IQR)1.62

Descriptive statistics

Standard deviation294.82806
Coefficient of variation (CV)1.5770507
Kurtosis30.046643
Mean186.949
Median Absolute Deviation (MAD)0.96
Skewness4.855762
Sum260980.81
Variance86923.586
MonotonicityNot monotonic
2024-01-10T06:12:07.650095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
99.0 176
 
12.6%
97.92 133
 
9.5%
99.2 90
 
6.4%
98.28 83
 
5.9%
99.4 79
 
5.7%
97.2 43
 
3.1%
98.0 30
 
2.1%
99.96 27
 
1.9%
99.18 27
 
1.9%
99.225 26
 
1.9%
Other values (295) 682
48.9%
ValueCountFrequency (%)
9.0 1
0.1%
9.92 1
0.1%
11.04 1
0.1%
14.625 1
0.1%
14.74 1
0.1%
14.8 1
0.1%
18.36 1
0.1%
18.675 1
0.1%
18.75 1
0.1%
19.2 1
0.1%
ValueCountFrequency (%)
2994.0 2
0.1%
2800.0 1
 
0.1%
2520.0 1
 
0.1%
2120.0 1
 
0.1%
2034.72 2
0.1%
2000.0 1
 
0.1%
1997.64 4
0.3%
1989.0 1
 
0.1%
1980.0 1
 
0.1%
1756.08 1
 
0.1%

공급전압
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size11.0 KiB
380
1374 
<NA>
 
22

Length

Max length4
Median length3
Mean length3.0157593
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
380 1374
98.4%
<NA> 22
 
1.6%

Length

2024-01-10T06:12:07.798197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T06:12:07.897284image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
380 1374
98.4%
na 22
 
1.6%

주파수
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size11.0 KiB
60
1374 
<NA>
 
22

Length

Max length4
Median length2
Mean length2.0315186
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
60 1374
98.4%
<NA> 22
 
1.6%

Length

2024-01-10T06:12:08.016054image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T06:12:08.166521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
60 1374
98.4%
na 22
 
1.6%

설치면적
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct764
Distinct (%)55.9%
Missing29
Missing (%)2.1%
Infinite0
Infinite (%)0.0%
Mean2356.2721
Minimum80
Maximum64470
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size12.4 KiB
2024-01-10T06:12:08.291463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum80
5-th percentile388
Q1605
median1280
Q31774.5
95-th percentile10021.6
Maximum64470
Range64390
Interquartile range (IQR)1169.5

Descriptive statistics

Standard deviation4183.0873
Coefficient of variation (CV)1.7752989
Kurtosis52.804091
Mean2356.2721
Median Absolute Deviation (MAD)619
Skewness5.7890385
Sum3221024
Variance17498219
MonotonicityNot monotonic
2024-01-10T06:12:08.761339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
598 64
 
4.6%
560 52
 
3.7%
1322 25
 
1.8%
568 23
 
1.6%
605 19
 
1.4%
554 18
 
1.3%
1320 18
 
1.3%
3300 17
 
1.2%
1655 17
 
1.2%
1115 17
 
1.2%
Other values (754) 1097
78.6%
(Missing) 29
 
2.1%
ValueCountFrequency (%)
80 1
0.1%
84 1
0.1%
91 1
0.1%
96 2
0.1%
99 1
0.1%
110 1
0.1%
113 2
0.1%
116 1
0.1%
120 1
0.1%
123 1
0.1%
ValueCountFrequency (%)
64470 1
0.1%
44382 1
0.1%
29747 1
0.1%
29537 1
0.1%
29227 1
0.1%
28368 1
0.1%
27760 1
0.1%
24420 1
0.1%
23923 1
0.1%
23243 1
0.1%

지목
Text

MISSING 

Distinct104
Distinct (%)7.6%
Missing29
Missing (%)2.1%
Memory size11.0 KiB
2024-01-10T06:12:08.922205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length4
Mean length4.6869056
Min length1

Characters and Unicode

Total characters6407
Distinct characters36
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique58 ?
Unique (%)4.2%

Sample

1st row 공장용지
2nd row
3rd row 잡종지
4th row 임야
5th row 공장용지
ValueCountFrequency (%)
임야 709
51.6%
128
 
9.3%
124
 
9.0%
공장용지 57
 
4.2%
임야/전 42
 
3.1%
잡종지 37
 
2.7%
30
 
2.2%
전/임야/전/전/답 18
 
1.3%
전/임야 15
 
1.1%
창고용지 13
 
0.9%
Other values (79) 200
 
14.6%
2024-01-10T06:12:09.244780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2511
39.2%
920
 
14.4%
920
 
14.4%
/ 505
 
7.9%
416
 
6.5%
221
 
3.4%
189
 
2.9%
117
 
1.8%
96
 
1.5%
73
 
1.1%
Other values (26) 439
 
6.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3356
52.4%
Space Separator 2511
39.2%
Other Punctuation 509
 
7.9%
Open Punctuation 14
 
0.2%
Close Punctuation 14
 
0.2%
Uppercase Letter 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
920
27.4%
920
27.4%
416
12.4%
221
 
6.6%
189
 
5.6%
117
 
3.5%
96
 
2.9%
73
 
2.2%
58
 
1.7%
58
 
1.7%
Other values (20) 288
 
8.6%
Other Punctuation
ValueCountFrequency (%)
/ 505
99.2%
, 4
 
0.8%
Space Separator
ValueCountFrequency (%)
2511
100.0%
Open Punctuation
ValueCountFrequency (%)
( 14
100.0%
Close Punctuation
ValueCountFrequency (%)
) 14
100.0%
Uppercase Letter
ValueCountFrequency (%)
X 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3356
52.4%
Common 3048
47.6%
Latin 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
920
27.4%
920
27.4%
416
12.4%
221
 
6.6%
189
 
5.6%
117
 
3.5%
96
 
2.9%
73
 
2.2%
58
 
1.7%
58
 
1.7%
Other values (20) 288
 
8.6%
Common
ValueCountFrequency (%)
2511
82.4%
/ 505
 
16.6%
( 14
 
0.5%
) 14
 
0.5%
, 4
 
0.1%
Latin
ValueCountFrequency (%)
X 3
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3356
52.4%
ASCII 3051
47.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2511
82.3%
/ 505
 
16.6%
( 14
 
0.5%
) 14
 
0.5%
, 4
 
0.1%
X 3
 
0.1%
Hangul
ValueCountFrequency (%)
920
27.4%
920
27.4%
416
12.4%
221
 
6.6%
189
 
5.6%
117
 
3.5%
96
 
2.9%
73
 
2.2%
58
 
1.7%
58
 
1.7%
Other values (20) 288
 
8.6%

사업상태
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size11.0 KiB
<NA>
1334 
취소
 
61
 
1

Length

Max length4
Median length4
Mean length3.9104585
Min length1

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 1334
95.6%
취소 61
 
4.4%
1
 
0.1%

Length

2024-01-10T06:12:09.397207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T06:12:09.501913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 1334
95.6%
취소 61
 
4.4%

데이터기준일
Date

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size11.0 KiB
Minimum2020-04-14 00:00:00
Maximum2020-04-14 00:00:00
2024-01-10T06:12:09.579316image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:12:09.671209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2024-01-10T06:12:03.462979image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:12:02.935511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:12:03.193883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:12:03.553890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:12:03.024040image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:12:03.281075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:12:03.639571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:12:03.110662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:12:03.368521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T06:12:09.738412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번설비용량설치면적사업상태
연번1.0000.2320.1700.498
설비용량0.2321.0000.7340.000
설치면적0.1700.7341.0000.000
사업상태0.4980.0000.0001.000
2024-01-10T06:12:09.833020image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업상태주파수공급전압
사업상태1.0001.0001.000
주파수1.0001.0001.000
공급전압1.0001.0001.000
2024-01-10T06:12:09.927165image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번설비용량설치면적공급전압주파수사업상태
연번1.0000.044-0.1981.0001.0000.354
설비용량0.0441.0000.4151.0001.0000.000
설치면적-0.1980.4151.0001.0001.0000.000
공급전압1.0001.0001.0001.0001.0001.000
주파수1.0001.0001.0001.0001.0001.000
사업상태0.3540.0000.0001.0001.0001.000

Missing values

2024-01-10T06:12:03.764191image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T06:12:03.913283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-01-10T06:12:04.079460image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연번발전소명발전소주소최초허가일사업개시일설치위치설비용량공급전압주파수설치면적지목사업상태데이터기준일
01방우리소수력금산군 부리면 방우리 산3-21986-08-051986-08-05금산군 부리면 방우리 산3-22120.0<NA><NA><NA><NA><NA>2020-04-14
12금성금산군 금성면 상가리 산67-12007-02-022010-01-06금산군 금성면 상가리 산67-11000.0<NA><NA><NA><NA><NA>2020-04-14
23두리금산군 부리면 창평리 280-12009-08-142010-06-11금산군 부리면 창평리 280-199.0<NA><NA><NA><NA><NA>2020-04-14
34영솔라금산군 부리면 창평리 280-32009-08-142010-06-11금산군 부리면 창평리 280-399.0<NA><NA><NA><NA><NA>2020-04-14
45㈜서울전업공사금산군 추부면 신평리 10002009-12-282011-11-25금산군 추부면 신평리 100021.0<NA><NA><NA><NA><NA>2020-04-14
56신정2호금산군 남일면 신정리 418-62010-03-162011-04-07금산군 남일면 신정리 418-619.35<NA><NA><NA><NA><NA>2020-04-14
67문은옥금산군 추부면 대학로 156-14(1필지)2012-03-262012-07-06금산군 추부면 대학로 156-14(1필지)24.0<NA><NA><NA><NA><NA>2020-04-14
78군북금산군 군북면 두두리 380-82012-04-03<NA>금산군 군북면 두두리 380-89.0<NA><NA><NA><NA><NA>2020-04-14
89㈜에버솔라금산군 추부면 신평리 1006-32012-04-092014-01-15금산군 추부면 신평리 1006-3250.0<NA><NA><NA><NA><NA>2020-04-14
910초희금산군 남일면 신정리 4162012-06-042012-12-26금산군 남일면 신정리 41619.74<NA><NA><NA><NA><NA>2020-04-14
연번발전소명발전소주소최초허가일사업개시일설치위치설비용량공급전압주파수설치면적지목사업상태데이터기준일
13861387송림3호 태양광발전소군북면 호티리 607-22020-03-30<NA>군북면 호티리 607-218.6753806091<NA>2020-04-14
13871388산들림에너지협동조합 태양광발전소금성면 의총리 581-1, 581-2, 5832020-03-31<NA>금성면 의총리 581-1, 581-2, 58399.06380601287<NA>2020-04-14
13881389햇살쏠라에너지협동조합 태양광발전소금성면 의총리 581-1, 5832020-03-31<NA>금성면 의총리 581-1, 58399.06380601246<NA>2020-04-14
13891390부산기업 태양광발전소복수면 구례리 4942020-04-03<NA>복수면 구례리 49493.55538060169공장용지<NA>2020-04-14
13901391대덕 태양광발전소군북면 조정리 2722020-04-06<NA>군북면 조정리 27299.7538060614공장용지<NA>2020-04-14
13911392SM2호 태양광발전소금산읍 양지리 221-52020-04-13<NA>금산읍 양지리 221-599.63380601168<NA>2020-04-14
13921393솔라원2 태양광발전소금산읍 양지리 221-52020-04-13<NA>금산읍 양지리 221-599.63380601223<NA>2020-04-14
13931394솔라원3 태양광발전소금산읍 양지리 221-52020-04-13<NA>금산읍 양지리 221-599.63380601259<NA>2020-04-14
13941395조순 태양광발전소금산읍 양지리 221-52020-04-13<NA>금산읍 양지리 221-599.6338060986<NA>2020-04-14
13951396양훈 태양광발전소금산읍 양지리 221-52020-04-13<NA>금산읍 양지리 221-599.6338060904<NA>2020-04-14