Overview

Dataset statistics

Number of variables4
Number of observations1199
Missing cells367
Missing cells (%)7.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory38.8 KiB
Average record size in memory33.1 B

Variable types

Text2
DateTime1
Numeric1

Dataset

Description경상북도 경산시 소재 태양광 발전 현황의 데이터로 상호명, 소재지, 허가일, 사업개시 발전용량 등 데이터를 포함하고 있습니다.
Author경상북도 경산시
URLhttps://www.data.go.kr/data/15033921/fileData.do

Alerts

사업개시발전용량 has 367 (30.6%) missing valuesMissing

Reproduction

Analysis started2023-12-12 12:10:21.012976
Analysis finished2023-12-12 12:10:21.771413
Duration0.76 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

상호
Text

Distinct1145
Distinct (%)95.5%
Missing0
Missing (%)0.0%
Memory size9.5 KiB
2023-12-12T21:10:22.008603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length25
Mean length9.470392
Min length2

Characters and Unicode

Total characters11355
Distinct characters382
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1100 ?
Unique (%)91.7%

Sample

1st row세연에너지주식회사태양광발전소
2nd row㈜창우실업태양광발전소
3rd row대구태양광발전소
4th row자인(정)태양광발전소
5th row자인(정)태양광발전소
ValueCountFrequency (%)
태양광발전소 19
 
1.5%
경북우리집re100발전소 13
 
1.0%
남신태양광발전소 4
 
0.3%
발전소 4
 
0.3%
주식회사 4
 
0.3%
햇빛가온 4
 
0.3%
신일태양광발전소 3
 
0.2%
월드태양광발전소 3
 
0.2%
금강태양광발전소 3
 
0.2%
모햇태양광발전소 3
 
0.2%
Other values (1153) 1204
95.3%
2023-12-12T21:10:22.519974image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1124
 
9.9%
1121
 
9.9%
1115
 
9.8%
1067
 
9.4%
1063
 
9.4%
1062
 
9.4%
330
 
2.9%
2 213
 
1.9%
1 205
 
1.8%
3 98
 
0.9%
Other values (372) 3957
34.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10368
91.3%
Decimal Number 642
 
5.7%
Uppercase Letter 137
 
1.2%
Space Separator 76
 
0.7%
Other Symbol 48
 
0.4%
Lowercase Letter 39
 
0.3%
Dash Punctuation 24
 
0.2%
Open Punctuation 10
 
0.1%
Close Punctuation 10
 
0.1%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1124
 
10.8%
1121
 
10.8%
1115
 
10.8%
1067
 
10.3%
1063
 
10.3%
1062
 
10.2%
330
 
3.2%
90
 
0.9%
85
 
0.8%
82
 
0.8%
Other values (328) 3229
31.1%
Uppercase Letter
ValueCountFrequency (%)
E 29
21.2%
R 16
11.7%
Y 11
 
8.0%
J 11
 
8.0%
S 10
 
7.3%
F 9
 
6.6%
K 7
 
5.1%
C 6
 
4.4%
N 6
 
4.4%
G 5
 
3.6%
Other values (11) 27
19.7%
Decimal Number
ValueCountFrequency (%)
2 213
33.2%
1 205
31.9%
3 98
15.3%
4 35
 
5.5%
0 35
 
5.5%
5 23
 
3.6%
6 10
 
1.6%
7 10
 
1.6%
8 7
 
1.1%
9 6
 
0.9%
Lowercase Letter
ValueCountFrequency (%)
e 8
20.5%
o 8
20.5%
c 7
17.9%
p 7
17.9%
k 7
17.9%
l 1
 
2.6%
n 1
 
2.6%
Space Separator
ValueCountFrequency (%)
76
100.0%
Other Symbol
ValueCountFrequency (%)
48
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 24
100.0%
Open Punctuation
ValueCountFrequency (%)
( 10
100.0%
Close Punctuation
ValueCountFrequency (%)
) 10
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 10416
91.7%
Common 762
 
6.7%
Latin 177
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1124
 
10.8%
1121
 
10.8%
1115
 
10.7%
1067
 
10.2%
1063
 
10.2%
1062
 
10.2%
330
 
3.2%
90
 
0.9%
85
 
0.8%
82
 
0.8%
Other values (329) 3277
31.5%
Latin
ValueCountFrequency (%)
E 29
16.4%
R 16
 
9.0%
Y 11
 
6.2%
J 11
 
6.2%
S 10
 
5.6%
F 9
 
5.1%
e 8
 
4.5%
o 8
 
4.5%
c 7
 
4.0%
p 7
 
4.0%
Other values (19) 61
34.5%
Common
ValueCountFrequency (%)
2 213
28.0%
1 205
26.9%
3 98
12.9%
76
 
10.0%
4 35
 
4.6%
0 35
 
4.6%
- 24
 
3.1%
5 23
 
3.0%
6 10
 
1.3%
( 10
 
1.3%
Other values (4) 33
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10368
91.3%
ASCII 938
 
8.3%
None 48
 
0.4%
Number Forms 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1124
 
10.8%
1121
 
10.8%
1115
 
10.8%
1067
 
10.3%
1063
 
10.3%
1062
 
10.2%
330
 
3.2%
90
 
0.9%
85
 
0.8%
82
 
0.8%
Other values (328) 3229
31.1%
ASCII
ValueCountFrequency (%)
2 213
22.7%
1 205
21.9%
3 98
10.4%
76
 
8.1%
4 35
 
3.7%
0 35
 
3.7%
E 29
 
3.1%
- 24
 
2.6%
5 23
 
2.5%
R 16
 
1.7%
Other values (32) 184
19.6%
None
ValueCountFrequency (%)
48
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%
Distinct908
Distinct (%)75.7%
Missing0
Missing (%)0.0%
Memory size9.5 KiB
2023-12-12T21:10:22.888414image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length51
Median length40
Mean length21.847373
Min length14

Characters and Unicode

Total characters26195
Distinct characters210
Distinct categories8 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique756 ?
Unique (%)63.1%

Sample

1st row경산시 용성면 외촌리 12-1
2nd row경산시 용성면 외촌리 산39-5
3rd row경산시 옥산동 840-5(옥상)
4th row경산시 자인면 울옥리 산14-1
5th row경산시 자인면 울옥리 산14-1, 옥천리 산49-1
ValueCountFrequency (%)
경산시 1199
21.7%
지붕 283
 
5.1%
진량읍 230
 
4.2%
용성면 226
 
4.1%
하양읍 177
 
3.2%
와촌면 176
 
3.2%
162
 
2.9%
자인면 125
 
2.3%
건물 110
 
2.0%
남산면 78
 
1.4%
Other values (1260) 2752
49.9%
2023-12-12T21:10:23.393500image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4324
 
16.5%
1467
 
5.6%
1238
 
4.7%
1204
 
4.6%
1 1158
 
4.4%
1087
 
4.1%
) 951
 
3.6%
( 951
 
3.6%
766
 
2.9%
675
 
2.6%
Other values (200) 12374
47.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 14074
53.7%
Decimal Number 4876
 
18.6%
Space Separator 4324
 
16.5%
Close Punctuation 1023
 
3.9%
Open Punctuation 1023
 
3.9%
Dash Punctuation 614
 
2.3%
Other Punctuation 239
 
0.9%
Uppercase Letter 22
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1467
 
10.4%
1238
 
8.8%
1204
 
8.6%
1087
 
7.7%
766
 
5.4%
675
 
4.8%
532
 
3.8%
484
 
3.4%
454
 
3.2%
349
 
2.5%
Other values (176) 5818
41.3%
Decimal Number
ValueCountFrequency (%)
1 1158
23.7%
2 625
12.8%
4 558
11.4%
3 467
9.6%
5 395
 
8.1%
7 362
 
7.4%
6 350
 
7.2%
0 345
 
7.1%
8 310
 
6.4%
9 306
 
6.3%
Uppercase Letter
ValueCountFrequency (%)
A 11
50.0%
G 4
 
18.2%
C 3
 
13.6%
B 2
 
9.1%
D 1
 
4.5%
H 1
 
4.5%
Close Punctuation
ValueCountFrequency (%)
) 951
93.0%
] 72
 
7.0%
Open Punctuation
ValueCountFrequency (%)
( 951
93.0%
[ 72
 
7.0%
Other Punctuation
ValueCountFrequency (%)
, 238
99.6%
/ 1
 
0.4%
Space Separator
ValueCountFrequency (%)
4324
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 614
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 14067
53.7%
Common 12099
46.2%
Latin 22
 
0.1%
Han 7
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1467
 
10.4%
1238
 
8.8%
1204
 
8.6%
1087
 
7.7%
766
 
5.4%
675
 
4.8%
532
 
3.8%
484
 
3.4%
454
 
3.2%
349
 
2.5%
Other values (174) 5811
41.3%
Common
ValueCountFrequency (%)
4324
35.7%
1 1158
 
9.6%
) 951
 
7.9%
( 951
 
7.9%
2 625
 
5.2%
- 614
 
5.1%
4 558
 
4.6%
3 467
 
3.9%
5 395
 
3.3%
7 362
 
3.0%
Other values (8) 1694
 
14.0%
Latin
ValueCountFrequency (%)
A 11
50.0%
G 4
 
18.2%
C 3
 
13.6%
B 2
 
9.1%
D 1
 
4.5%
H 1
 
4.5%
Han
ValueCountFrequency (%)
4
57.1%
3
42.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 14067
53.7%
ASCII 12121
46.3%
CJK 7
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4324
35.7%
1 1158
 
9.6%
) 951
 
7.8%
( 951
 
7.8%
2 625
 
5.2%
- 614
 
5.1%
4 558
 
4.6%
3 467
 
3.9%
5 395
 
3.3%
7 362
 
3.0%
Other values (14) 1716
 
14.2%
Hangul
ValueCountFrequency (%)
1467
 
10.4%
1238
 
8.8%
1204
 
8.6%
1087
 
7.7%
766
 
5.4%
675
 
4.8%
532
 
3.8%
484
 
3.4%
454
 
3.2%
349
 
2.5%
Other values (174) 5811
41.3%
CJK
ValueCountFrequency (%)
4
57.1%
3
42.9%
Distinct543
Distinct (%)45.3%
Missing0
Missing (%)0.0%
Memory size9.5 KiB
Minimum2008-03-28 00:00:00
Maximum2023-09-18 00:00:00
2023-12-12T21:10:23.565028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:10:23.719442image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

사업개시발전용량
Real number (ℝ)

MISSING 

Distinct439
Distinct (%)52.8%
Missing367
Missing (%)30.6%
Infinite0
Infinite (%)0.0%
Mean139.08133
Minimum3
Maximum3091.2
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size10.7 KiB
2023-12-12T21:10:23.840131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3
5-th percentile19.452
Q161.6
median98.55
Q399.82
95-th percentile457.281
Maximum3091.2
Range3088.2
Interquartile range (IQR)38.22

Descriptive statistics

Standard deviation247.56766
Coefficient of variation (CV)1.7800208
Kurtosis57.859804
Mean139.08133
Median Absolute Deviation (MAD)11.85
Skewness6.6701329
Sum115715.67
Variance61289.747
MonotonicityNot monotonic
2023-12-12T21:10:23.958929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
99.96 38
 
3.2%
99.6 26
 
2.2%
98.28 23
 
1.9%
99.65 21
 
1.8%
99.0 19
 
1.6%
99.9 18
 
1.5%
98.55 17
 
1.4%
99.36 14
 
1.2%
99.84 14
 
1.2%
99.45 12
 
1.0%
Other values (429) 630
52.5%
(Missing) 367
30.6%
ValueCountFrequency (%)
3.0 1
0.1%
9.0 1
0.1%
10.0 1
0.1%
10.01 1
0.1%
10.12 1
0.1%
12.75 1
0.1%
12.96 1
0.1%
13.28 1
0.1%
13.86 2
0.2%
14.76 1
0.1%
ValueCountFrequency (%)
3091.2 1
0.1%
2897.31 1
0.1%
2196.72 1
0.1%
1875.0 1
0.1%
1807.92 1
0.1%
1504.8 1
0.1%
1495.04 1
0.1%
1467.13 1
0.1%
1459.62 1
0.1%
1075.0 1
0.1%

Interactions

2023-12-12T21:10:21.450043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-12T21:10:21.621143image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:10:21.732477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

상호소재지허가일사업개시발전용량
0세연에너지주식회사태양광발전소경산시 용성면 외촌리 12-12008-03-28194.04
1㈜창우실업태양광발전소경산시 용성면 외촌리 산39-52008-07-03194.04
2대구태양광발전소경산시 옥산동 840-5(옥상)2009-06-249.0
3자인(정)태양광발전소경산시 자인면 울옥리 산14-12011-08-1198.4
4자인(정)태양광발전소경산시 자인면 울옥리 산14-1, 옥천리 산49-12021-11-15200.98
5자인(정)소수력발전소경산시 자인면 울옥리 산14-12011-08-1165.0
6자비정사태양광발전소경산시 백천3동 121(종교용지)2012-01-0310.12
7(주)거평그린1호태양광발전소경산시 진량읍 신상리 1186-7(지붕)2012-03-2099.0
8(주)거평그린2호태양광발전소경산시 진량읍 신상리 1185-7(지붕)2012-03-2098.8
9서광태양광발전소경산시 와촌면 대동리 289-20(지붕)2012-08-2298.8
상호소재지허가일사업개시발전용량
1189해오름태양광경산시 압량읍 진등길 88(건물 위)2023-09-04<NA>
1190오리온경산시 용성면 내촌리 2022023-09-06<NA>
1191애드준태양광발전소경산시 진량읍 일연로 590(건물 위)2023-09-06<NA>
1192경산축산농협자인사업소 태양광발전소경산시 자인면 계정길 67(건물 위)2023-09-07<NA>
1193구일1태양광발전소경산시 남천면 구일리 106, 77-10, 109-3, 109-4 (건물 위)2023-09-11<NA>
1194구일2태양광발전소경산시 남천면 구일리 106, 77-10, 109-3, 109-4 (건물 위)2023-09-11<NA>
1195구일3태양광발전소경산시 남천면 구일리 106, 77-10, 109-3, 109-4 (건물 위)2023-09-11<NA>
1196구일4태양광발전소경산시 남천면 구일리 106, 77-10, 109-3, 109-4 (건물 위)2023-09-11<NA>
1197힐링1호태양광발전소경산시 자인면 원당길 131-10(지붕)2023-09-18<NA>
1198디에스니들태양광발전소경산시 진량읍 공단4로5길 79(건물 위)2023-09-18<NA>