Overview

Dataset statistics

Number of variables5
Number of observations130
Missing cells130
Missing cells (%)20.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.5 KiB
Average record size in memory43.0 B

Variable types

Text2
Numeric1
DateTime1
Unsupported1

Dataset

Description청도군 관내에 있는 태양광발전소설치현황에 대한 자료로 발전소명, 용량, 주소, 사업개시일 등의 정보를 포함하고 있습니다.
Author경상북도 청도군
URLhttps://www.data.go.kr/data/15033887/fileData.do

Alerts

비고 has 130 (100.0%) missing valuesMissing
비고 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 15:57:41.580372
Analysis finished2023-12-12 15:57:42.102661
Duration0.52 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct129
Distinct (%)99.2%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-13T00:57:42.290927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length18
Mean length9.7615385
Min length4

Characters and Unicode

Total characters1269
Distinct characters174
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique128 ?
Unique (%)98.5%

Sample

1st row경북서원태양광발전소
2nd row청도태양광발전소
3rd rowHJ글로벌태양광발전소
4th row학산1호태양광발전소
5th row학산2호 태양광발전소
ValueCountFrequency (%)
태양광발전소 28
 
16.1%
주식회사 4
 
2.3%
동국태양광발전소 3
 
1.7%
금산태양광발전소 2
 
1.1%
수상 2
 
1.1%
오성앤학교가구 1
 
0.6%
청도 1
 
0.6%
팔조령 1
 
0.6%
박태천태양광발전소 1
 
0.6%
달감태양광발전소 1
 
0.6%
Other values (130) 130
74.7%
2023-12-13T00:57:42.693485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
126
 
9.9%
125
 
9.9%
125
 
9.9%
124
 
9.8%
123
 
9.7%
123
 
9.7%
46
 
3.6%
18
 
1.4%
17
 
1.3%
13
 
1.0%
Other values (164) 429
33.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1173
92.4%
Space Separator 46
 
3.6%
Decimal Number 28
 
2.2%
Uppercase Letter 16
 
1.3%
Open Punctuation 2
 
0.2%
Close Punctuation 2
 
0.2%
Other Symbol 1
 
0.1%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
126
 
10.7%
125
 
10.7%
125
 
10.7%
124
 
10.6%
123
 
10.5%
123
 
10.5%
18
 
1.5%
17
 
1.4%
13
 
1.1%
11
 
0.9%
Other values (145) 368
31.4%
Uppercase Letter
ValueCountFrequency (%)
S 3
18.8%
N 2
12.5%
J 2
12.5%
G 2
12.5%
L 2
12.5%
E 2
12.5%
K 1
 
6.2%
O 1
 
6.2%
H 1
 
6.2%
Decimal Number
ValueCountFrequency (%)
1 11
39.3%
2 9
32.1%
3 5
17.9%
4 2
 
7.1%
5 1
 
3.6%
Space Separator
ValueCountFrequency (%)
46
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1174
92.5%
Common 79
 
6.2%
Latin 16
 
1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
126
 
10.7%
125
 
10.6%
125
 
10.6%
124
 
10.6%
123
 
10.5%
123
 
10.5%
18
 
1.5%
17
 
1.4%
13
 
1.1%
11
 
0.9%
Other values (146) 369
31.4%
Common
ValueCountFrequency (%)
46
58.2%
1 11
 
13.9%
2 9
 
11.4%
3 5
 
6.3%
( 2
 
2.5%
) 2
 
2.5%
4 2
 
2.5%
- 1
 
1.3%
5 1
 
1.3%
Latin
ValueCountFrequency (%)
S 3
18.8%
N 2
12.5%
J 2
12.5%
G 2
12.5%
L 2
12.5%
E 2
12.5%
K 1
 
6.2%
O 1
 
6.2%
H 1
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1173
92.4%
ASCII 95
 
7.5%
None 1
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
126
 
10.7%
125
 
10.7%
125
 
10.7%
124
 
10.6%
123
 
10.5%
123
 
10.5%
18
 
1.5%
17
 
1.4%
13
 
1.1%
11
 
0.9%
Other values (145) 368
31.4%
ASCII
ValueCountFrequency (%)
46
48.4%
1 11
 
11.6%
2 9
 
9.5%
3 5
 
5.3%
S 3
 
3.2%
( 2
 
2.1%
N 2
 
2.1%
J 2
 
2.1%
G 2
 
2.1%
L 2
 
2.1%
Other values (8) 11
 
11.6%
None
ValueCountFrequency (%)
1
100.0%

주소
Text

Distinct116
Distinct (%)89.2%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-13T00:57:42.959076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length47
Median length30
Mean length17.892308
Min length11

Characters and Unicode

Total characters2326
Distinct characters102
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique106 ?
Unique (%)81.5%

Sample

1st row 청도군 이서면 서원리258-1외2
2nd row청도군 매전면 호화리 752-79
3rd row청도군 화양읍 진라2길 11
4th row청도군 이서면 학산1길 58-15
5th row청도군 이서면 학신1길 58-15
ValueCountFrequency (%)
청도군 124
23.0%
풍각면 28
 
5.2%
이서면 22
 
4.1%
청도읍 20
 
3.7%
금천면 19
 
3.5%
16
 
3.0%
각남면 14
 
2.6%
화양읍 13
 
2.4%
매전면 10
 
1.9%
갈지리 8
 
1.5%
Other values (199) 264
49.1%
2023-12-13T00:57:43.561123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
410
17.6%
150
 
6.4%
144
 
6.2%
124
 
5.3%
1 118
 
5.1%
98
 
4.2%
81
 
3.5%
2 76
 
3.3%
8 68
 
2.9%
- 67
 
2.9%
Other values (92) 990
42.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1221
52.5%
Decimal Number 588
25.3%
Space Separator 410
 
17.6%
Dash Punctuation 67
 
2.9%
Other Punctuation 40
 
1.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
150
 
12.3%
144
 
11.8%
124
 
10.2%
98
 
8.0%
81
 
6.6%
49
 
4.0%
48
 
3.9%
34
 
2.8%
33
 
2.7%
28
 
2.3%
Other values (78) 432
35.4%
Decimal Number
ValueCountFrequency (%)
1 118
20.1%
2 76
12.9%
8 68
11.6%
3 64
10.9%
5 51
8.7%
7 51
8.7%
0 45
 
7.7%
4 45
 
7.7%
6 42
 
7.1%
9 28
 
4.8%
Other Punctuation
ValueCountFrequency (%)
, 39
97.5%
. 1
 
2.5%
Space Separator
ValueCountFrequency (%)
410
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 67
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1221
52.5%
Common 1105
47.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
150
 
12.3%
144
 
11.8%
124
 
10.2%
98
 
8.0%
81
 
6.6%
49
 
4.0%
48
 
3.9%
34
 
2.8%
33
 
2.7%
28
 
2.3%
Other values (78) 432
35.4%
Common
ValueCountFrequency (%)
410
37.1%
1 118
 
10.7%
2 76
 
6.9%
8 68
 
6.2%
- 67
 
6.1%
3 64
 
5.8%
5 51
 
4.6%
7 51
 
4.6%
0 45
 
4.1%
4 45
 
4.1%
Other values (4) 110
 
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1221
52.5%
ASCII 1105
47.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
410
37.1%
1 118
 
10.7%
2 76
 
6.9%
8 68
 
6.2%
- 67
 
6.1%
3 64
 
5.8%
5 51
 
4.6%
7 51
 
4.6%
0 45
 
4.1%
4 45
 
4.1%
Other values (4) 110
 
10.0%
Hangul
ValueCountFrequency (%)
150
 
12.3%
144
 
11.8%
124
 
10.2%
98
 
8.0%
81
 
6.6%
49
 
4.0%
48
 
3.9%
34
 
2.8%
33
 
2.7%
28
 
2.3%
Other values (78) 432
35.4%

설비용량(kw)
Real number (ℝ)

Distinct88
Distinct (%)67.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean245.56496
Minimum9.3
Maximum1379.7
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2023-12-13T00:57:43.729290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum9.3
5-th percentile17.545
Q197.47
median99.45
Q3216.4275
95-th percentile999.72
Maximum1379.7
Range1370.4
Interquartile range (IQR)118.9575

Descriptive statistics

Standard deviation317.47883
Coefficient of variation (CV)1.2928507
Kurtosis2.8494147
Mean245.56496
Median Absolute Deviation (MAD)61.41
Skewness1.9992901
Sum31923.445
Variance100792.81
MonotonicityNot monotonic
2023-12-13T00:57:43.900702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
99.0 15
 
11.5%
999.72 5
 
3.8%
99.2 4
 
3.1%
99.28 4
 
3.1%
99.84 3
 
2.3%
99.36 3
 
2.3%
198.4 3
 
2.3%
998.0 2
 
1.5%
197.1 2
 
1.5%
99.9 2
 
1.5%
Other values (78) 87
66.9%
ValueCountFrequency (%)
9.3 1
0.8%
9.86 1
0.8%
9.92 1
0.8%
11.13 1
0.8%
15.3 1
0.8%
16.2 1
0.8%
16.6 1
0.8%
18.7 1
0.8%
18.72 1
0.8%
19.5 2
1.5%
ValueCountFrequency (%)
1379.7 1
 
0.8%
1316.25 1
 
0.8%
1018.35 1
 
0.8%
1000.0 1
 
0.8%
999.9 1
 
0.8%
999.72 5
3.8%
998.0 2
 
1.5%
997.56 1
 
0.8%
996.0 2
 
1.5%
827.82 1
 
0.8%
Distinct91
Distinct (%)70.0%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
Minimum2007-12-21 00:00:00
Maximum2020-03-17 00:00:00
2023-12-13T00:57:44.111490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:57:44.341638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

비고
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing130
Missing (%)100.0%
Memory size1.3 KiB

Interactions

2023-12-13T00:57:41.804002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T00:57:44.475351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
설비용량(kw)사업허가일
설비용량(kw)1.0000.576
사업허가일0.5761.000

Missing values

2023-12-13T00:57:41.951807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T00:57:42.064703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

발전소명주소설비용량(kw)사업허가일비고
0경북서원태양광발전소청도군 이서면 서원리258-1외230.02007-12-21<NA>
1청도태양광발전소청도군 매전면 호화리 752-79100.02008-12-22<NA>
2HJ글로벌태양광발전소청도군 화양읍 진라2길 1199.02013-11-11<NA>
3학산1호태양광발전소청도군 이서면 학산1길 58-1519.52014-06-11<NA>
4학산2호 태양광발전소청도군 이서면 학신1길 58-1519.52014-06-11<NA>
5이가네태양광발전소청도군 풍각면 송서리 610-111.132014-05-14<NA>
6나오캠태양광발전소청도군 청도읍 월곡2길4699.362014-05-26<NA>
7부야리태양광발전소청도군 청도읍 부야리 580-126.462014-06-23<NA>
8농업회사법인 대흥농산 북부주식회사청도군 각남면 화리134,137403.922014-07-28<NA>
9누리태양광발전소청도군 각남면 한재로 1095-24299.02014-08-13<NA>
발전소명주소설비용량(kw)사업허가일비고
120희순태양광발전소청도군 금천면 갈지리 1138-399.752019-12-31<NA>
121LJ태양광발전소청도읍 청도읍 운산2길21-516.62019-12-31<NA>
122김연수태양광발전소청도군 각북면 헐티로893-358.12020-01-09<NA>
123태림태양광발전소청도군 풍각면 봉기1길70270.62020-01-09<NA>
124유등리2태양광발전소청도군 화양읍 유등리 100092.132020-02-20<NA>
125오봉태양광발전소청도군 금천면 오봉길71-599.92020-02-06<NA>
126동국태양광발전소 3호금천면 박곡길 143-590.062020-02-11<NA>
127삼우연사태양광발전소청도군 이서면 연지로 318442.82020-02-20<NA>
128주식회사 백진전기태양광발전소청도군 화양읍 덕암리 55399.62020-03-17<NA>
129주식회사 제이에스 태양광발전소청도군 화양읍 덕암리 55399.62020-03-17<NA>