Overview

Dataset statistics

Number of variables12
Number of observations1599
Missing cells1290
Missing cells (%)6.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory157.8 KiB
Average record size in memory101.1 B

Variable types

Numeric3
Text5
Categorical4

Dataset

Description충청남도 금산군 태양광 발전소 설치현황(발전소명, 설비용량, 주소, 최초허가일, 사업개시일) 관련하여 자료를 제공합니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=385&beforeMenuCd=DOM_000000201001001000&publicdatapk=15033866

Alerts

데이터기준일 has constant value ""Constant
사업상태 is highly overall correlated with 공급전압 and 1 other fieldsHigh correlation
주파수 is highly overall correlated with 연번 and 4 other fieldsHigh correlation
공급전압 is highly overall correlated with 연번 and 4 other fieldsHigh correlation
연번 is highly overall correlated with 공급전압 and 1 other fieldsHigh correlation
설비용량 is highly overall correlated with 공급전압 and 1 other fieldsHigh correlation
설치면적 is highly overall correlated with 공급전압 and 1 other fieldsHigh correlation
공급전압 is highly imbalanced (89.5%)Imbalance
주파수 is highly imbalanced (89.5%)Imbalance
사업상태 is highly imbalanced (84.8%)Imbalance
사업개시일 has 1232 (77.0%) missing valuesMissing
설치면적 has 29 (1.8%) missing valuesMissing
지목 has 29 (1.8%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2024-01-09 21:11:47.278642
Analysis finished2024-01-09 21:11:49.137029
Duration1.86 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct1599
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean800
Minimum1
Maximum1599
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size14.2 KiB
2024-01-10T06:11:49.207189image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile80.9
Q1400.5
median800
Q31199.5
95-th percentile1519.1
Maximum1599
Range1598
Interquartile range (IQR)799

Descriptive statistics

Standard deviation461.73586
Coefficient of variation (CV)0.57716982
Kurtosis-1.2
Mean800
Median Absolute Deviation (MAD)400
Skewness0
Sum1279200
Variance213200
MonotonicityStrictly increasing
2024-01-10T06:11:49.343828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
1064 1
 
0.1%
1074 1
 
0.1%
1073 1
 
0.1%
1072 1
 
0.1%
1071 1
 
0.1%
1070 1
 
0.1%
1069 1
 
0.1%
1068 1
 
0.1%
1067 1
 
0.1%
Other values (1589) 1589
99.4%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1599 1
0.1%
1598 1
0.1%
1597 1
0.1%
1596 1
0.1%
1595 1
0.1%
1594 1
0.1%
1593 1
0.1%
1592 1
0.1%
1591 1
0.1%
1590 1
0.1%
Distinct1559
Distinct (%)97.5%
Missing0
Missing (%)0.0%
Memory size12.6 KiB
2024-01-10T06:11:49.546826image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length19
Mean length10.045654
Min length1

Characters and Unicode

Total characters16063
Distinct characters397
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1521 ?
Unique (%)95.1%

Sample

1st row방우리소수력
2nd row금성
3rd row두리
4th row영솔라
5th row㈜서울전업공사
ValueCountFrequency (%)
태양광발전소 941
34.3%
태양광 39
 
1.4%
외부리 19
 
0.7%
마장리 16
 
0.6%
1호 14
 
0.5%
2호 13
 
0.5%
금산 12
 
0.4%
3호 11
 
0.4%
진산태양광발전소 10
 
0.4%
에너지 8
 
0.3%
Other values (1503) 1663
60.6%
2024-01-10T06:11:49.872495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1884
 
11.7%
1283
 
8.0%
1278
 
8.0%
1263
 
7.9%
1223
 
7.6%
1214
 
7.6%
1203
 
7.5%
751
 
4.7%
1 373
 
2.3%
2 265
 
1.6%
Other values (387) 5326
33.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 12630
78.6%
Space Separator 1884
 
11.7%
Decimal Number 1295
 
8.1%
Uppercase Letter 104
 
0.6%
Dash Punctuation 91
 
0.6%
Other Symbol 33
 
0.2%
Open Punctuation 8
 
< 0.1%
Close Punctuation 8
 
< 0.1%
Other Punctuation 4
 
< 0.1%
Lowercase Letter 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1283
 
10.2%
1278
 
10.1%
1263
 
10.0%
1223
 
9.7%
1214
 
9.6%
1203
 
9.5%
751
 
5.9%
240
 
1.9%
183
 
1.4%
180
 
1.4%
Other values (343) 3812
30.2%
Uppercase Letter
ValueCountFrequency (%)
S 30
28.8%
J 10
 
9.6%
N 9
 
8.7%
B 9
 
8.7%
C 8
 
7.7%
G 4
 
3.8%
P 4
 
3.8%
U 3
 
2.9%
K 3
 
2.9%
A 3
 
2.9%
Other values (11) 21
20.2%
Decimal Number
ValueCountFrequency (%)
1 373
28.8%
2 265
20.5%
3 162
12.5%
4 107
 
8.3%
5 93
 
7.2%
0 74
 
5.7%
6 73
 
5.6%
7 57
 
4.4%
8 49
 
3.8%
9 42
 
3.2%
Lowercase Letter
ValueCountFrequency (%)
a 1
25.0%
o 1
25.0%
l 1
25.0%
h 1
25.0%
Other Punctuation
ValueCountFrequency (%)
. 2
50.0%
& 1
25.0%
# 1
25.0%
Space Separator
ValueCountFrequency (%)
1884
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 91
100.0%
Other Symbol
ValueCountFrequency (%)
33
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%
Math Symbol
ValueCountFrequency (%)
> 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 12663
78.8%
Common 3292
 
20.5%
Latin 108
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1283
 
10.1%
1278
 
10.1%
1263
 
10.0%
1223
 
9.7%
1214
 
9.6%
1203
 
9.5%
751
 
5.9%
240
 
1.9%
183
 
1.4%
180
 
1.4%
Other values (344) 3845
30.4%
Latin
ValueCountFrequency (%)
S 30
27.8%
J 10
 
9.3%
N 9
 
8.3%
B 9
 
8.3%
C 8
 
7.4%
G 4
 
3.7%
P 4
 
3.7%
U 3
 
2.8%
K 3
 
2.8%
A 3
 
2.8%
Other values (15) 25
23.1%
Common
ValueCountFrequency (%)
1884
57.2%
1 373
 
11.3%
2 265
 
8.0%
3 162
 
4.9%
4 107
 
3.3%
5 93
 
2.8%
- 91
 
2.8%
0 74
 
2.2%
6 73
 
2.2%
7 57
 
1.7%
Other values (8) 113
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 12630
78.6%
ASCII 3400
 
21.2%
None 33
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1884
55.4%
1 373
 
11.0%
2 265
 
7.8%
3 162
 
4.8%
4 107
 
3.1%
5 93
 
2.7%
- 91
 
2.7%
0 74
 
2.2%
6 73
 
2.1%
7 57
 
1.7%
Other values (33) 221
 
6.5%
Hangul
ValueCountFrequency (%)
1283
 
10.2%
1278
 
10.1%
1263
 
10.0%
1223
 
9.7%
1214
 
9.6%
1203
 
9.5%
751
 
5.9%
240
 
1.9%
183
 
1.4%
180
 
1.4%
Other values (343) 3812
30.2%
None
ValueCountFrequency (%)
33
100.0%
Distinct796
Distinct (%)49.8%
Missing0
Missing (%)0.0%
Memory size12.6 KiB
2024-01-10T06:11:50.182567image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length62
Median length55
Mean length25.898687
Min length18

Characters and Unicode

Total characters41412
Distinct characters146
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique590 ?
Unique (%)36.9%

Sample

1st row충청남도 금산군 부리면 방우리 산3-2
2nd row충청남도 금산군 금성면 상가리 산67-1
3rd row충청남도 금산군 부리면 창평리 280-1
4th row충청남도 금산군 부리면 창평리 280-3
5th row충청남도 금산군 추부면 신평리 1000
ValueCountFrequency (%)
금산군 1600
 
18.2%
충청남도 1599
 
18.2%
제원면 271
 
3.1%
부리면 210
 
2.4%
금성면 201
 
2.3%
남이면 184
 
2.1%
진산면 182
 
2.1%
남일면 151
 
1.7%
군북면 140
 
1.6%
복수면 128
 
1.5%
Other values (1035) 4134
47.0%
2024-01-10T06:11:50.607666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8555
20.7%
2522
 
6.1%
1979
 
4.8%
1940
 
4.7%
1802
 
4.4%
1743
 
4.2%
1719
 
4.2%
1600
 
3.9%
1600
 
3.9%
- 1560
 
3.8%
Other values (136) 16392
39.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 21704
52.4%
Space Separator 8555
 
20.7%
Decimal Number 8475
 
20.5%
Dash Punctuation 1560
 
3.8%
Other Punctuation 910
 
2.2%
Close Punctuation 104
 
0.3%
Open Punctuation 104
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2522
11.6%
1979
9.1%
1940
 
8.9%
1802
 
8.3%
1743
 
8.0%
1719
 
7.9%
1600
 
7.4%
1600
 
7.4%
1526
 
7.0%
411
 
1.9%
Other values (117) 4862
22.4%
Decimal Number
ValueCountFrequency (%)
1 1539
18.2%
2 1305
15.4%
3 1100
13.0%
4 1009
11.9%
6 703
8.3%
5 662
7.8%
9 554
 
6.5%
7 549
 
6.5%
0 546
 
6.4%
8 508
 
6.0%
Other Punctuation
ValueCountFrequency (%)
, 884
97.1%
/ 23
 
2.5%
. 3
 
0.3%
Close Punctuation
ValueCountFrequency (%)
) 102
98.1%
] 2
 
1.9%
Open Punctuation
ValueCountFrequency (%)
( 102
98.1%
[ 2
 
1.9%
Space Separator
ValueCountFrequency (%)
8555
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1560
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 21704
52.4%
Common 19708
47.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2522
11.6%
1979
9.1%
1940
 
8.9%
1802
 
8.3%
1743
 
8.0%
1719
 
7.9%
1600
 
7.4%
1600
 
7.4%
1526
 
7.0%
411
 
1.9%
Other values (117) 4862
22.4%
Common
ValueCountFrequency (%)
8555
43.4%
- 1560
 
7.9%
1 1539
 
7.8%
2 1305
 
6.6%
3 1100
 
5.6%
4 1009
 
5.1%
, 884
 
4.5%
6 703
 
3.6%
5 662
 
3.4%
9 554
 
2.8%
Other values (9) 1837
 
9.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 21704
52.4%
ASCII 19708
47.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8555
43.4%
- 1560
 
7.9%
1 1539
 
7.8%
2 1305
 
6.6%
3 1100
 
5.6%
4 1009
 
5.1%
, 884
 
4.5%
6 703
 
3.6%
5 662
 
3.4%
9 554
 
2.8%
Other values (9) 1837
 
9.3%
Hangul
ValueCountFrequency (%)
2522
11.6%
1979
9.1%
1940
 
8.9%
1802
 
8.3%
1743
 
8.0%
1719
 
7.9%
1600
 
7.4%
1600
 
7.4%
1526
 
7.0%
411
 
1.9%
Other values (117) 4862
22.4%
Distinct373
Distinct (%)23.3%
Missing0
Missing (%)0.0%
Memory size12.6 KiB
2024-01-10T06:11:50.864385image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length22
Mean length22
Min length22

Characters and Unicode

Total characters35178
Distinct characters15
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique141 ?
Unique (%)8.8%

Sample

1st row1986-08-05 오전 12:00:00
2nd row2007-02-02 오전 12:00:00
3rd row2009-08-14 오전 12:00:00
4th row2009-08-14 오전 12:00:00
5th row2009-12-28 오전 12:00:00
ValueCountFrequency (%)
12:00:00 1599
33.3%
오전 1599
33.3%
2018-05-15 57
 
1.2%
2016-01-29 48
 
1.0%
2018-09-11 39
 
0.8%
2020-06-30 37
 
0.8%
2018-01-23 35
 
0.7%
2017-09-07 33
 
0.7%
2018-09-19 28
 
0.6%
2020-09-29 28
 
0.6%
Other values (365) 1294
27.0%
2024-01-10T06:11:51.222460image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 10163
28.9%
2 4416
12.6%
1 4347
12.4%
- 3198
 
9.1%
3198
 
9.1%
: 3198
 
9.1%
1599
 
4.5%
1599
 
4.5%
8 827
 
2.4%
9 698
 
2.0%
Other values (5) 1935
 
5.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 22386
63.6%
Dash Punctuation 3198
 
9.1%
Space Separator 3198
 
9.1%
Other Punctuation 3198
 
9.1%
Other Letter 3198
 
9.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 10163
45.4%
2 4416
19.7%
1 4347
19.4%
8 827
 
3.7%
9 698
 
3.1%
7 532
 
2.4%
6 467
 
2.1%
5 345
 
1.5%
3 329
 
1.5%
4 262
 
1.2%
Other Letter
ValueCountFrequency (%)
1599
50.0%
1599
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 3198
100.0%
Space Separator
ValueCountFrequency (%)
3198
100.0%
Other Punctuation
ValueCountFrequency (%)
: 3198
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 31980
90.9%
Hangul 3198
 
9.1%

Most frequent character per script

Common
ValueCountFrequency (%)
0 10163
31.8%
2 4416
13.8%
1 4347
13.6%
- 3198
 
10.0%
3198
 
10.0%
: 3198
 
10.0%
8 827
 
2.6%
9 698
 
2.2%
7 532
 
1.7%
6 467
 
1.5%
Other values (3) 936
 
2.9%
Hangul
ValueCountFrequency (%)
1599
50.0%
1599
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 31980
90.9%
Hangul 3198
 
9.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 10163
31.8%
2 4416
13.8%
1 4347
13.6%
- 3198
 
10.0%
3198
 
10.0%
: 3198
 
10.0%
8 827
 
2.6%
9 698
 
2.2%
7 532
 
1.7%
6 467
 
1.5%
Other values (3) 936
 
2.9%
Hangul
ValueCountFrequency (%)
1599
50.0%
1599
50.0%

사업개시일
Text

MISSING 

Distinct180
Distinct (%)49.0%
Missing1232
Missing (%)77.0%
Memory size12.6 KiB
2024-01-10T06:11:51.491599image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length22
Mean length22
Min length22

Characters and Unicode

Total characters8074
Distinct characters15
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique125 ?
Unique (%)34.1%

Sample

1st row1986-08-05 오전 12:00:00
2nd row2010-01-06 오전 12:00:00
3rd row2010-06-11 오전 12:00:00
4th row2010-06-11 오전 12:00:00
5th row2011-11-25 오전 12:00:00
ValueCountFrequency (%)
12:00:00 367
33.3%
오전 367
33.3%
2015-03-16 19
 
1.7%
2018-11-28 18
 
1.6%
2018-10-30 16
 
1.5%
2018-05-09 15
 
1.4%
2017-09-21 14
 
1.3%
2018-02-07 9
 
0.8%
2018-11-14 8
 
0.7%
2017-03-06 7
 
0.6%
Other values (172) 261
23.7%
2024-01-10T06:11:51.886822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 2276
28.2%
1 1090
13.5%
2 935
11.6%
- 734
 
9.1%
734
 
9.1%
: 734
 
9.1%
367
 
4.5%
367
 
4.5%
8 224
 
2.8%
9 150
 
1.9%
Other values (5) 463
 
5.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 5138
63.6%
Dash Punctuation 734
 
9.1%
Space Separator 734
 
9.1%
Other Punctuation 734
 
9.1%
Other Letter 734
 
9.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 2276
44.3%
1 1090
21.2%
2 935
18.2%
8 224
 
4.4%
9 150
 
2.9%
6 115
 
2.2%
3 104
 
2.0%
5 93
 
1.8%
7 93
 
1.8%
4 58
 
1.1%
Other Letter
ValueCountFrequency (%)
367
50.0%
367
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 734
100.0%
Space Separator
ValueCountFrequency (%)
734
100.0%
Other Punctuation
ValueCountFrequency (%)
: 734
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 7340
90.9%
Hangul 734
 
9.1%

Most frequent character per script

Common
ValueCountFrequency (%)
0 2276
31.0%
1 1090
14.9%
2 935
12.7%
- 734
 
10.0%
734
 
10.0%
: 734
 
10.0%
8 224
 
3.1%
9 150
 
2.0%
6 115
 
1.6%
3 104
 
1.4%
Other values (3) 244
 
3.3%
Hangul
ValueCountFrequency (%)
367
50.0%
367
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7340
90.9%
Hangul 734
 
9.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 2276
31.0%
1 1090
14.9%
2 935
12.7%
- 734
 
10.0%
734
 
10.0%
: 734
 
10.0%
8 224
 
3.1%
9 150
 
2.0%
6 115
 
1.6%
3 104
 
1.4%
Other values (3) 244
 
3.3%
Hangul
ValueCountFrequency (%)
367
50.0%
367
50.0%

설비용량
Real number (ℝ)

HIGH CORRELATION 

Distinct347
Distinct (%)21.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean174.87266
Minimum9
Maximum2994
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size14.2 KiB
2024-01-10T06:11:52.019860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum9
5-th percentile38.4735
Q197.92
median99
Q399.6
95-th percentile499.384
Maximum2994
Range2985
Interquartile range (IQR)1.68

Descriptive statistics

Standard deviation277.59037
Coefficient of variation (CV)1.5873858
Kurtosis34.590502
Mean174.87266
Median Absolute Deviation (MAD)0.96
Skewness5.2018431
Sum279621.38
Variance77056.416
MonotonicityNot monotonic
2024-01-10T06:11:52.149659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
99.0 177
 
11.1%
97.92 133
 
8.3%
98.28 109
 
6.8%
99.2 94
 
5.9%
99.4 79
 
4.9%
97.2 75
 
4.7%
99.75 51
 
3.2%
99.96 34
 
2.1%
98.0 30
 
1.9%
99.18 29
 
1.8%
Other values (337) 788
49.3%
ValueCountFrequency (%)
9.0 1
0.1%
9.92 1
0.1%
11.04 1
0.1%
14.625 1
0.1%
14.74 1
0.1%
14.8 1
0.1%
18.0 1
0.1%
18.36 1
0.1%
18.675 1
0.1%
18.75 1
0.1%
ValueCountFrequency (%)
2994.0 2
0.1%
2800.0 1
 
0.1%
2520.0 1
 
0.1%
2120.0 1
 
0.1%
2034.72 2
0.1%
2000.0 1
 
0.1%
1997.64 4
0.3%
1989.0 1
 
0.1%
1980.0 1
 
0.1%
1756.08 1
 
0.1%

공급전압
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size12.6 KiB
380
1577 
<NA>
 
22

Length

Max length4
Median length3
Mean length3.0137586
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
380 1577
98.6%
<NA> 22
 
1.4%

Length

2024-01-10T06:11:52.268440image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T06:11:52.631674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
380 1577
98.6%
na 22
 
1.4%

주파수
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size12.6 KiB
60
1577 
<NA>
 
22

Length

Max length4
Median length2
Mean length2.0275172
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
60 1577
98.6%
<NA> 22
 
1.4%

Length

2024-01-10T06:11:52.732135image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T06:11:52.848159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
60 1577
98.6%
na 22
 
1.4%

설치면적
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct859
Distinct (%)54.7%
Missing29
Missing (%)1.8%
Infinite0
Infinite (%)0.0%
Mean2156.4962
Minimum80
Maximum64470
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size14.2 KiB
2024-01-10T06:11:52.957841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum80
5-th percentile291.45
Q1598
median1200
Q31666.75
95-th percentile9230.55
Maximum64470
Range64390
Interquartile range (IQR)1068.75

Descriptive statistics

Standard deviation3955.2032
Coefficient of variation (CV)1.8340877
Kurtosis58.984882
Mean2156.4962
Median Absolute Deviation (MAD)602
Skewness6.1058378
Sum3385699
Variance15643632
MonotonicityNot monotonic
2024-01-10T06:11:53.108386image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
598 64
 
4.0%
560 52
 
3.3%
498 34
 
2.1%
1322 25
 
1.6%
568 23
 
1.4%
605 19
 
1.2%
1320 18
 
1.1%
554 18
 
1.1%
1115 17
 
1.1%
3300 17
 
1.1%
Other values (849) 1283
80.2%
(Missing) 29
 
1.8%
ValueCountFrequency (%)
80 1
 
0.1%
84 1
 
0.1%
89 1
 
0.1%
91 1
 
0.1%
96 3
0.2%
97 1
 
0.1%
99 1
 
0.1%
108 1
 
0.1%
110 1
 
0.1%
113 2
0.1%
ValueCountFrequency (%)
64470 1
0.1%
44382 1
0.1%
29747 1
0.1%
29537 1
0.1%
29227 1
0.1%
28368 1
0.1%
27760 1
0.1%
24420 1
0.1%
23923 1
0.1%
23243 1
0.1%

지목
Text

MISSING 

Distinct96
Distinct (%)6.1%
Missing29
Missing (%)1.8%
Memory size12.6 KiB
2024-01-10T06:11:53.263161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length16
Mean length2.6961783
Min length1

Characters and Unicode

Total characters4233
Distinct characters32
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique49 ?
Unique (%)3.1%

Sample

1st row공장용지
2nd row
3rd row잡종지
4th row임야
5th row공장용지
ValueCountFrequency (%)
임야 737
46.9%
169
 
10.8%
165
 
10.5%
공장용지 63
 
4.0%
임야+전 43
 
2.7%
잡종지 39
 
2.5%
37
 
2.4%
32
 
2.0%
전+임야+전+전+답 18
 
1.1%
전+임야 16
 
1.0%
Other values (86) 251
 
16.0%
2024-01-10T06:11:53.562296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1002
23.7%
954
22.5%
+ 507
12.0%
479
11.3%
266
 
6.3%
211
 
5.0%
132
 
3.1%
111
 
2.6%
80
 
1.9%
67
 
1.6%
Other values (22) 424
10.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3697
87.3%
Math Symbol 507
 
12.0%
Other Punctuation 29
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1002
27.1%
954
25.8%
479
13.0%
266
 
7.2%
211
 
5.7%
132
 
3.6%
111
 
3.0%
80
 
2.2%
67
 
1.8%
62
 
1.7%
Other values (20) 333
 
9.0%
Math Symbol
ValueCountFrequency (%)
+ 507
100.0%
Other Punctuation
ValueCountFrequency (%)
, 29
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3697
87.3%
Common 536
 
12.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1002
27.1%
954
25.8%
479
13.0%
266
 
7.2%
211
 
5.7%
132
 
3.6%
111
 
3.0%
80
 
2.2%
67
 
1.8%
62
 
1.7%
Other values (20) 333
 
9.0%
Common
ValueCountFrequency (%)
+ 507
94.6%
, 29
 
5.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3697
87.3%
ASCII 536
 
12.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1002
27.1%
954
25.8%
479
13.0%
266
 
7.2%
211
 
5.7%
132
 
3.6%
111
 
3.0%
80
 
2.2%
67
 
1.8%
62
 
1.7%
Other values (20) 333
 
9.0%
ASCII
ValueCountFrequency (%)
+ 507
94.6%
, 29
 
5.4%

사업상태
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size12.6 KiB
<NA>
1537 
취소
 
61
 
1

Length

Max length4
Median length4
Mean length3.9218261
Min length1

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 1537
96.1%
취소 61
 
3.8%
1
 
0.1%

Length

2024-01-10T06:11:53.690871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T06:11:53.795455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 1537
96.2%
취소 61
 
3.8%

데이터기준일
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size12.6 KiB
2021-04-14 오전 12:00:00
1599 

Length

Max length22
Median length22
Mean length22
Min length22

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021-04-14 오전 12:00:00
2nd row2021-04-14 오전 12:00:00
3rd row2021-04-14 오전 12:00:00
4th row2021-04-14 오전 12:00:00
5th row2021-04-14 오전 12:00:00

Common Values

ValueCountFrequency (%)
2021-04-14 오전 12:00:00 1599
100.0%

Length

2024-01-10T06:11:53.889352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T06:11:53.974413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021-04-14 1599
33.3%
오전 1599
33.3%
12:00:00 1599
33.3%

Interactions

2024-01-10T06:11:48.480874image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:11:47.888331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:11:48.179378image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:11:48.577802image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:11:47.989774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:11:48.288622image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:11:48.660974image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:11:48.081218image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:11:48.389643image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T06:11:54.037416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번설비용량설치면적지목사업상태
연번1.0000.3210.2050.7560.295
설비용량0.3211.0000.7310.0000.000
설치면적0.2050.7311.0000.0000.000
지목0.7560.0000.0001.0000.000
사업상태0.2950.0000.0000.0001.000
2024-01-10T06:11:54.142270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업상태주파수공급전압
사업상태1.0001.0001.000
주파수1.0001.0001.000
공급전압1.0001.0001.000
2024-01-10T06:11:54.226650image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번설비용량설치면적공급전압주파수사업상태
연번1.000-0.039-0.3431.0001.0000.300
설비용량-0.0391.0000.4241.0001.0000.000
설치면적-0.3430.4241.0001.0001.0000.000
공급전압1.0001.0001.0001.0001.0001.000
주파수1.0001.0001.0001.0001.0001.000
사업상태0.3000.0000.0001.0001.0001.000

Missing values

2024-01-10T06:11:48.784418image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T06:11:48.939462image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-01-10T06:11:49.059472image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연번발전소명발전소주소최초허가일사업개시일설비용량공급전압주파수설치면적지목사업상태데이터기준일
01방우리소수력충청남도 금산군 부리면 방우리 산3-21986-08-05 오전 12:00:001986-08-05 오전 12:00:002120.0<NA><NA><NA><NA><NA>2021-04-14 오전 12:00:00
12금성충청남도 금산군 금성면 상가리 산67-12007-02-02 오전 12:00:002010-01-06 오전 12:00:001000.0<NA><NA><NA><NA><NA>2021-04-14 오전 12:00:00
23두리충청남도 금산군 부리면 창평리 280-12009-08-14 오전 12:00:002010-06-11 오전 12:00:0099.0<NA><NA><NA><NA><NA>2021-04-14 오전 12:00:00
34영솔라충청남도 금산군 부리면 창평리 280-32009-08-14 오전 12:00:002010-06-11 오전 12:00:0099.0<NA><NA><NA><NA><NA>2021-04-14 오전 12:00:00
45㈜서울전업공사충청남도 금산군 추부면 신평리 10002009-12-28 오전 12:00:002011-11-25 오전 12:00:0021.0<NA><NA><NA><NA><NA>2021-04-14 오전 12:00:00
56신정2호충청남도 금산군 남일면 신정리 418-62010-03-16 오전 12:00:002011-04-07 오전 12:00:0019.35<NA><NA><NA><NA><NA>2021-04-14 오전 12:00:00
67문은옥충청남도 금산군 추부면 대학로 156-14(1필지)2012-03-26 오전 12:00:002012-07-06 오전 12:00:0024.0<NA><NA><NA><NA><NA>2021-04-14 오전 12:00:00
78군북충청남도 금산군 군북면 두두리 380-82012-04-03 오전 12:00:00<NA>9.0<NA><NA><NA><NA><NA>2021-04-14 오전 12:00:00
89㈜에버솔라충청남도 금산군 추부면 신평리 1006-32012-04-09 오전 12:00:002014-01-15 오전 12:00:00250.0<NA><NA><NA><NA><NA>2021-04-14 오전 12:00:00
910초희충청남도 금산군 남일면 신정리 4162012-06-04 오전 12:00:002012-12-26 오전 12:00:0019.74<NA><NA><NA><NA><NA>2021-04-14 오전 12:00:00
연번발전소명발전소주소최초허가일사업개시일설비용량공급전압주파수설치면적지목사업상태데이터기준일
15891590B5성곡리 태양광발전소충청남도 금산군 남이면 성곡리 58-122020-12-29 오전 12:00:00<NA>98.438060429<NA>2021-04-14 오전 12:00:00
15901591B6성곡리 태양광발전소충청남도 금산군 남이면 성곡리 58-122020-12-29 오전 12:00:00<NA>98.438060585<NA>2021-04-14 오전 12:00:00
15911592B9성곡리 태양광발전소충청남도 금산군 남이면 성곡리 62-172020-12-29 오전 12:00:00<NA>85.6838060383<NA>2021-04-14 오전 12:00:00
15921593한을정 태양광발전소충청남도 금산군 군북면 외부리 685-9, 6882020-12-31 오전 12:00:00<NA>98.2838060732임,과수원<NA>2021-04-14 오전 12:00:00
15931594박은영 태양광발전소충청남도 금산군 군북면 외부리 685-9, 6882020-12-31 오전 12:00:00<NA>98.2838060618임,과수원<NA>2021-04-14 오전 12:00:00
15941595김예랑 태양광발전소충청남도 금산군 군북면 외부리 685-9, 6882020-12-31 오전 12:00:00<NA>29.438060131임,과수원<NA>2021-04-14 오전 12:00:00
15951596신송남 태양광발전소충청남도 금산군 군북면 외부리 685-9, 6882020-12-31 오전 12:00:00<NA>98.2838060472임,과수원<NA>2021-04-14 오전 12:00:00
15961597황철연 태양광발전소충청남도 금산군 군북면 외부리 685-9, 6882020-12-31 오전 12:00:00<NA>98.2838060672임,과수원<NA>2021-04-14 오전 12:00:00
15971598박찬식 태양광발전소충청남도 금산군 군북면 외부리 685-9, 6882020-12-31 오전 12:00:00<NA>98.2838060704임,과수원<NA>2021-04-14 오전 12:00:00
15981599유지연 태양광발전소충청남도 금산군 군북면 외부리 685-9, 6882020-12-31 오전 12:00:00<NA>98.2838060703임,과수원<NA>2021-04-14 오전 12:00:00