Overview

Dataset statistics

Number of variables: 6
Number of observations: 1821
Missing cells: 1510
Missing cells (%): 13.8%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 89.0 KiB
Average record size in memory: 50.1 B
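
The missing-cell percentage can be checked from the raw counts. The sketch below assumes the denominator is the total number of cells (variables times observations) for the dataset figure, and the number of observations for a single column. The function name `missing_pct` is illustrative and not part of ydata-profiling.

```python
def missing_pct(n_missing: int, n_cells: int) -> float:
    """Return the share of missing cells as a percentage."""
    return 100.0 * n_missing / n_cells


n_vars, n_obs, n_missing = 6, 1821, 1510

# Dataset level: every cell in the 6 x 1821 table counts.
print(round(missing_pct(n_missing, n_vars * n_obs), 1))  # 13.8

# Column level: all 1510 missing cells sit in a single column.
print(round(missing_pct(n_missing, n_obs), 1))  # 82.9
```

Both results match the figures reported here and in the Alerts section.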

Variable types

Numeric: 2
Text: 1
Categorical: 1
DateTime: 2

Dataset

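Description: Solar power installation status for Sinan-gun, Jeollanam-do (전라남도 신안군), covering plant name, power source, permitted capacity (kilowatts), permit issue date, and business start date.
Author: Sinan-gun, Jeollanam-do (전라남도 신안군)
URL: https://www.data.go.kr/data/15033936/fileData.do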

Alerts

원동력 is highly imbalanced (99.3%) [Imbalance]
사업개시일 has 1510 (82.9%) missing values [Missing]
순번 has unique values [Unique]
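
The 99.3% imbalance figure is consistent with a score of one minus the normalized Shannon entropy of the class counts. The sketch below assumes that definition. It is inferred from the reported numbers rather than taken from this report, and `imbalance_score` is an illustrative name.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k): 0 for a uniform split, near 1 when one class dominates."""
    k = len(counts)
    if k < 2:
        return 0.0  # a single class has no split to measure
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


# 원동력 has two classes with counts 1820 and 1.
score = imbalance_score([1820, 1])
print(f"{score:.1%}")  # 99.3%
```

A perfectly balanced two-class column would score 0 under the same definition.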

Reproduction

Analysis started: 2024-03-16 04:18:02.395893
Analysis finished: 2024-03-16 04:18:04.570933
Duration: 2.18 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

순번 (sequence number)
Real number (ℝ)

UNIQUE 

Distinct: 1821
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 911
Minimum: 1
Maximum: 1821
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 16.1 KiB

Quantile statistics

Minimum: 1
5-th percentile: 92
Q1: 456
Median: 911
Q3: 1366
95-th percentile: 1730
Maximum: 1821
Range: 1820
Interquartile range (IQR): 910
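
The quantiles above can be reproduced for a column holding the integers 1 to 1821. The sketch below assumes quantiles are taken by linear interpolation between order statistics, which is the default convention in pandas. `quantile_linear` is an illustrative helper, not a library function.

```python
def quantile_linear(sorted_values, q):
    """Return the q-quantile (0 <= q <= 1) by linear interpolation."""
    pos = (len(sorted_values) - 1) * q  # fractional index into the sorted data
    lo = int(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    return sorted_values[lo] + (sorted_values[hi] - sorted_values[lo]) * (pos - lo)


values = list(range(1, 1822))  # 순번 runs from 1 to 1821
q1 = quantile_linear(values, 0.25)
q3 = quantile_linear(values, 0.75)
print(q1, q3, q3 - q1)  # 456.0 1366.0 910.0
```

The same helper gives 92 and 1730 (up to floating-point rounding) for the 5th and 95th percentiles.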

Descriptive statistics

Standard deviation: 525.82174
Coefficient of variation (CV): 0.57719181
Kurtosis: -1.2
Mean: 911
Median Absolute Deviation (MAD): 455
Skewness: 0
Sum: 1658931
Variance: 276488.5
Monotonicity: Strictly increasing
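
Because 순번 is simply 1 to 1821, these descriptive statistics can be recomputed with the standard library. The sketch below assumes the report uses the sample variance (n - 1 denominator) and defines CV as standard deviation divided by mean. Both assumptions agree with the figures above.

```python
import statistics

values = list(range(1, 1822))  # 순번 runs from 1 to 1821

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std = statistics.stdev(values)
cv = std / mean                          # coefficient of variation
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)  # median absolute deviation

print(total, mean, variance, mad)  # 1658931 911 276488.5 455
print(round(std, 5), round(cv, 8))
```

Skewness and kurtosis are left out here because they depend on which bias-corrected estimator the profiler applies.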
Histogram with fixed size bins (bins=50)
Common values
Value  Count  Frequency (%)
1      1      0.1%
1197   1      0.1%
1223   1      0.1%
1222   1      0.1%
1221   1      0.1%
1220   1      0.1%
1219   1      0.1%
1218   1      0.1%
1217   1      0.1%
1216   1      0.1%
Other values (1811)  1811  99.5%
Minimum 10 values
Value  Count  Frequency (%)
1      1      0.1%
2      1      0.1%
3      1      0.1%
4      1      0.1%
5      1      0.1%
6      1      0.1%
7      1      0.1%
8      1      0.1%
9      1      0.1%
10     1      0.1%
Maximum 10 values
Value  Count  Frequency (%)
1821   1      0.1%
1820   1      0.1%
1819   1      0.1%
1818   1      0.1%
1817   1      0.1%
1816   1      0.1%
1815   1      0.1%
1814   1      0.1%
1813   1      0.1%
1812   1      0.1%

발전소명 (plant name)
Text

Distinct: 1774
Distinct (%): 97.4%
Missing: 0
Missing (%): 0.0%
Memory size: 14.4 KiB

Length

Max length: 18
Median length: 16
Mean length: 9.7666118
Min length: 2

Characters and Unicode

Total characters: 17785
Distinct characters: 442
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1731
Unique (%): 95.1%

Sample

1st row: 한국지역난방공사
2nd row: 오엠시솔라에너지태양광발전소
3rd row: 신안쏠라파크 태양광발전소
4th row: 신안쏠라파크2호태양광발전소
5th row: 파인솔태양광발전소
Value  Count  Frequency (%)
태양광발전소  180  8.9%
복룡태양광발전소  6  0.3%
발전소  4  0.2%
미래태양광발전소  4  0.2%
유한회사  4  0.2%
한양신재생  3  0.1%
소망태양광발전소  3  0.1%
단산  3  0.1%
희망태양광발전소  3  0.1%
믿음태양광발전소  2  0.1%
Other values (1773)  1815  89.5%
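
The table above counts words rather than whole names, which is why the bare token 태양광발전소 leads it: it appears as a separate word whenever a name contains a space before it. The sketch below assumes plain whitespace splitting, and the profiler may also normalize case or punctuation. It uses names taken from the sample rows of this report.

```python
from collections import Counter

# Plant names taken from the sample rows of this report.
names = [
    "신안쏠라파크 태양광발전소",
    "충인2호 태양광발전소",
    "충인3호 태양광발전소",
    "파인솔태양광발전소",  # no space, so this stays a single token
]

# Split each name on whitespace and count the resulting tokens.
word_counts = Counter(word for name in names for word in name.split())
print(word_counts.most_common(1))  # [('태양광발전소', 3)]
```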

Most occurring characters

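Value  Count  Frequency (%)
(not preserved)  1677  9.4%
(not preserved)  1654  9.3%
(not preserved)  1653  9.3%
(not preserved)  1649  9.3%
(not preserved)  1648  9.3%
(not preserved)  1646  9.3%
(not preserved)  717  4.0%
(not preserved)  299  1.7%
1  298  1.7%
(not preserved)  289  1.6%
Other values (432)  6255  35.2%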

Most occurring categories

Value  Count  Frequency (%)
Other Letter  16207  91.1%
Decimal Number  919  5.2%
Space Separator  207  1.2%
Open Punctuation  139  0.8%
Close Punctuation  139  0.8%
Other Symbol  73  0.4%
Uppercase Letter  71  0.4%
Dash Punctuation  22  0.1%
Lowercase Letter  5  < 0.1%
Letter Number  3  < 0.1%
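
The category names in this table correspond to Unicode general categories such as Lo (Other Letter) and Nd (Decimal Number). As a rough illustration, the sketch below tallies them with Python's standard `unicodedata` module for three names from the sample rows. The helper name `category_counts` is illustrative, and the profiler's own implementation may differ.

```python
import unicodedata
from collections import Counter


def category_counts(strings):
    """Count characters by Unicode general category (e.g. 'Lo', 'Nd', 'Zs')."""
    counts = Counter()
    for s in strings:
        for ch in s:
            counts[unicodedata.category(ch)] += 1
    return counts


# Names taken from the sample rows of this report.
names = ["(유)화이트스카이솔라", "신안쏠라파크2호태양광발전소", "신안쏠라파크 태양광발전소"]
counts = category_counts(names)
print(counts["Lo"], counts["Nd"], counts["Zs"], counts["Ps"], counts["Pe"])  # 34 1 1 1 1
```

Hangul syllables fall under Lo, which is why Other Letter dominates the table.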

Most frequent character per category

Other Letter
Value  Count  Frequency (%)
(not preserved)  1677  10.3%
(not preserved)  1654  10.2%
(not preserved)  1653  10.2%
(not preserved)  1649  10.2%
(not preserved)  1648  10.2%
(not preserved)  1646  10.2%
(not preserved)  717  4.4%
(not preserved)  299  1.8%
(not preserved)  289  1.8%
(not preserved)  224  1.4%
Other values (390)  4751  29.3%

Uppercase Letter
Value  Count  Frequency (%)
S  10  14.1%
H  8  11.3%
J  8  11.3%
K  7  9.9%
Y  5  7.0%
W  4  5.6%
A  4  5.6%
B  4  5.6%
D  3  4.2%
N  3  4.2%
Other values (10)  15  21.1%

Decimal Number
Value  Count  Frequency (%)
1  298  32.4%
2  226  24.6%
3  124  13.5%
4  77  8.4%
5  62  6.7%
6  41  4.5%
7  31  3.4%
8  21  2.3%
9  20  2.2%
0  19  2.1%

Lowercase Letter
Value  Count  Frequency (%)
o  2  40.0%
x  1  20.0%
l  1  20.0%
k  1  20.0%

Letter Number
Value  Count  Frequency (%)
(not preserved)  1  33.3%
(not preserved)  1  33.3%
(not preserved)  1  33.3%

Space Separator
Value  Count  Frequency (%)
(space)  207  100.0%

Open Punctuation
Value  Count  Frequency (%)
(  139  100.0%

Close Punctuation
Value  Count  Frequency (%)
)  139  100.0%

Other Symbol
Value  Count  Frequency (%)
(not preserved)  73  100.0%

Dash Punctuation
Value  Count  Frequency (%)
-  22  100.0%

Most occurring scripts

Value  Count  Frequency (%)
Hangul  16280  91.5%
Common  1426  8.0%
Latin  79  0.4%

Most frequent character per script

Hangul
Value  Count  Frequency (%)
(not preserved)  1677  10.3%
(not preserved)  1654  10.2%
(not preserved)  1653  10.2%
(not preserved)  1649  10.1%
(not preserved)  1648  10.1%
(not preserved)  1646  10.1%
(not preserved)  717  4.4%
(not preserved)  299  1.8%
(not preserved)  289  1.8%
(not preserved)  224  1.4%
Other values (391)  4824  29.6%

Latin
Value  Count  Frequency (%)
S  10  12.7%
H  8  10.1%
J  8  10.1%
K  7  8.9%
Y  5  6.3%
W  4  5.1%
A  4  5.1%
B  4  5.1%
D  3  3.8%
N  3  3.8%
Other values (17)  23  29.1%

Common
Value  Count  Frequency (%)
1  298  20.9%
2  226  15.8%
(space)  207  14.5%
(  139  9.7%
)  139  9.7%
3  124  8.7%
4  77  5.4%
5  62  4.3%
6  41  2.9%
7  31  2.2%
Other values (4)  82  5.8%

Most occurring blocks

Value  Count  Frequency (%)
Hangul  16207  91.1%
ASCII  1502  8.4%
None  73  0.4%
Number Forms  3  < 0.1%

Most frequent character per block

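Hangul
Value  Count  Frequency (%)
(not preserved)  1677  10.3%
(not preserved)  1654  10.2%
(not preserved)  1653  10.2%
(not preserved)  1649  10.2%
(not preserved)  1648  10.2%
(not preserved)  1646  10.2%
(not preserved)  717  4.4%
(not preserved)  299  1.8%
(not preserved)  289  1.8%
(not preserved)  224  1.4%
Other values (390)  4751  29.3%

ASCII
Value  Count  Frequency (%)
1  298  19.8%
2  226  15.0%
(space)  207  13.8%
(  139  9.3%
)  139  9.3%
3  124  8.3%
4  77  5.1%
5  62  4.1%
6  41  2.7%
7  31  2.1%
Other values (28)  158  10.5%

None
Value  Count  Frequency (%)
(not preserved)  73  100.0%

Number Forms
Value  Count  Frequency (%)
(not preserved)  1  33.3%
(not preserved)  1  33.3%
(not preserved)  1  33.3%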

원동력 (power source)
Categorical

IMBALANCE 

Distinct: 2
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 14.4 KiB
태양광 (solar): 1820
조류 (tidal current): 1

Length

Max length: 3
Median length: 3
Mean length: 2.9994509
Min length: 2

Unique

Unique: 1
Unique (%): 0.1%

Sample

1st row: 태양광
2nd row: 태양광
3rd row: 태양광
4th row: 태양광
5th row: 태양광

Common Values

Value  Count  Frequency (%)
태양광  1820  99.9%
조류  1  0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
태양광  1820  99.9%
조류  1  0.1%

허가용량(킬로와트) (permitted capacity, kW)
Real number (ℝ)

Distinct: 362
Distinct (%): 19.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 421.11559
Minimum: 9.66
Maximum: 999.99
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 16.1 KiB

Quantile statistics

Minimum: 9.66
5-th percentile: 97.2
Q1: 99.16
Median: 250
Q3: 897.82
95-th percentile: 999
Maximum: 999.99
Range: 990.33
Interquartile range (IQR): 798.66

Descriptive statistics

Standard deviation: 370.5355
Coefficient of variation (CV): 0.87989025
Kurtosis: -1.2756354
Mean: 421.11559
Median Absolute Deviation (MAD): 153.44
Skewness: 0.6132335
Sum: 766851.49
Variance: 137296.56
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values
Value   Count  Frequency (%)
99.0    227    12.5%
999.0   87     4.8%
997.92  73     4.0%
498.96  70     3.8%
99.96   66     3.6%
99.28   64     3.5%
99.2    59     3.2%
99.36   59     3.2%
499.0   57     3.1%
97.92   34     1.9%
Other values (352)  1025  56.3%
Minimum 10 values
Value   Count  Frequency (%)
9.66    1      0.1%
15.0    1      0.1%
18.0    1      0.1%
18.72   1      0.1%
19.5    2      0.1%
19.565  1      0.1%
19.8    1      0.1%
19.84   1      0.1%
20.0    2      0.1%
21.0    1      0.1%
Maximum 10 values
Value   Count  Frequency (%)
999.99  17     0.9%
999.98  8      0.4%
999.92  2      0.1%
999.9   1      0.1%
999.81  5      0.3%
999.8   1      0.1%
999.79  4      0.2%
999.75  3      0.2%
999.68  1      0.1%
999.64  1      0.1%

허가증교부일 (permit issue date)
Date

Distinct: 282
Distinct (%): 15.5%
Missing: 0
Missing (%): 0.0%
Memory size: 14.4 KiB
Minimum: 2006-06-01 00:00:00
Maximum: 2024-02-23 00:00:00
Histogram with fixed size bins (bins=50)

사업개시일 (business start date)
Date

MISSING 

Distinct: 125
Distinct (%): 40.2%
Missing: 1510
Missing (%): 82.9%
Memory size: 14.4 KiB
Minimum: 2007-01-01 00:00:00
Maximum: 2024-01-25 00:00:00
Histogram with fixed size bins (bins=50)
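
The 40.2% distinct rate for 사업개시일 only works out if the denominator excludes missing rows: 125 distinct dates over 311 non-missing values. The sketch below makes that inferred convention explicit. It is deduced from the reported figures, not stated by the report, and `distinct_pct` is an illustrative name.

```python
def distinct_pct(n_distinct: int, n_obs: int, n_missing: int) -> float:
    """Return distinct values as a percentage of non-missing observations."""
    return 100.0 * n_distinct / (n_obs - n_missing)


# 사업개시일: 125 distinct dates, 1821 rows, 1510 of them missing.
print(round(distinct_pct(125, 1821, 1510), 1))  # 40.2

# 허가증교부일: 282 distinct dates, no missing rows.
print(round(distinct_pct(282, 1821, 0), 1))  # 15.5
```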


Correlations

Variable  순번  원동력  허가용량(킬로와트)
순번  1.000  0.003  0.353
원동력  0.003  1.000  0.000
허가용량(킬로와트)  0.353  0.000  1.000

Variable  순번  허가용량(킬로와트)  원동력
순번  1.000  0.052  0.002
허가용량(킬로와트)  0.052  1.000  0.000
원동력  0.002  0.000  1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
Index  순번  발전소명  원동력  허가용량(킬로와트)  허가증교부일  사업개시일
0  1  한국지역난방공사  태양광  806.4  2006-06-01  2007-01-01
1  2  오엠시솔라에너지태양광발전소  태양광  480.0  2007-12-17  2010-03-31
2  3  신안쏠라파크 태양광발전소  태양광  495.6  2009-10-19  2010-06-18
3  4  신안쏠라파크2호태양광발전소  태양광  297.6  2009-10-19  2010-06-18
4  5  파인솔태양광발전소  태양광  99.0  2010-11-25  2012-03-02
5  6  오병이어태양광발전소  태양광  99.0  2010-11-25  2012-01-01
6  7  (유)화이트스카이솔라  태양광  99.0  2010-11-25  2012-03-02
7  8  에이스태양광발전소  태양광  99.0  2010-11-25  2012-03-02
8  9  대안태양광발전소  태양광  99.0  2010-11-25  2012-01-01
9  10  일출자연솔라태양광발전소  태양광  99.0  2010-11-25  2012-03-02

Last rows
Index  순번  발전소명  원동력  허가용량(킬로와트)  허가증교부일  사업개시일
1811  1812  충인2호 태양광발전소  태양광  99.55  2024-02-19  <NA>
1812  1813  충인3호 태양광발전소  태양광  99.55  2024-02-19  <NA>
1813  1814  문이제1호 태양광발전소  태양광  299.88  2024-02-23  <NA>
1814  1815  문이제2호 태양광발전소  태양광  299.88  2024-02-23  <NA>
1815  1816  수희제1호 태양광발전소  태양광  299.88  2024-02-23  <NA>
1816  1817  수희제2호 태양광발전소  태양광  358.19  2024-02-23  <NA>
1817  1818  영철제1호 태양광발전소  태양광  199.92  2024-02-23  <NA>
1818  1819  영철제2호 태양광발전소  태양광  199.92  2024-02-23  <NA>
1819  1820  영철제3호 태양광발전소  태양광  299.88  2024-02-23  <NA>
1820  1821  영철제4호 태양광발전소  태양광  299.88  2024-02-23  <NA>