Overview

Dataset statistics

Number of variables11
Number of observations10000
Missing cells6792
Missing cells (%)6.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory937.5 KiB
Average record size in memory96.0 B

Variable types

DateTime2
Categorical4
Text5

Dataset

Description전라북도_태양광발전사업허가현황_20191231
Author전라북도
URLhttps://www.bigdatahub.go.kr/opendata/dataSet/detail.nm?contentId=37&rlik=49451aebf056b486&serviceId=204412

Alerts

에너지원 has constant value ""Constant
설치구분 is highly imbalanced (72.7%)Imbalance
허가면적(㎡) has 517 (5.2%) missing valuesMissing
사업개시신고일 has 6257 (62.6%) missing valuesMissing

Reproduction

Analysis started2024-03-14 00:50:08.482972
Analysis finished2024-03-14 00:50:10.198281
Duration1.72 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct994
Distinct (%)9.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2005-06-13 00:00:00
Maximum2019-11-01 00:00:00
2024-03-14T09:50:10.313027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T09:50:10.483830image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

에너지원
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
태양광
10000 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row태양광
2nd row태양광
3rd row태양광
4th row태양광
5th row태양광

Common Values

ValueCountFrequency (%)
태양광 10000
100.0%

Length

2024-03-14T09:50:10.713247image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T09:50:10.820198image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
태양광 10000
100.0%
Distinct8813
Distinct (%)88.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-14T09:50:11.074527image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length27
Mean length9.2864
Min length2

Characters and Unicode

Total characters92864
Distinct characters666
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8101 ?
Unique (%)81.0%

Sample

1st row장판6호태양광발전소
2nd row대성9태양광발전소
3rd row시열태양광발전소
4th row승용태양광발전소
5th row제이오메가태양광발전소
ValueCountFrequency (%)
태양광발전소 197
 
1.9%
상월에너지스테이션 32
 
0.3%
유한회사 22
 
0.2%
대성태양광발전소 16
 
0.2%
1호 14
 
0.1%
태양광 13
 
0.1%
에너지 12
 
0.1%
한빛태양광발전소 12
 
0.1%
하늘태양광발전소 10
 
0.1%
양지태양광발전소 9
 
0.1%
Other values (8835) 10068
96.8%
2024-03-14T09:50:11.528789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9355
 
10.1%
9328
 
10.0%
9241
 
10.0%
9053
 
9.7%
8970
 
9.7%
8942
 
9.6%
3506
 
3.8%
1 1537
 
1.7%
2 1314
 
1.4%
1010
 
1.1%
Other values (656) 30608
33.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 86434
93.1%
Decimal Number 4245
 
4.6%
Uppercase Letter 765
 
0.8%
Space Separator 448
 
0.5%
Close Punctuation 296
 
0.3%
Open Punctuation 275
 
0.3%
Other Symbol 176
 
0.2%
Dash Punctuation 122
 
0.1%
Lowercase Letter 71
 
0.1%
Other Punctuation 29
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9355
 
10.8%
9328
 
10.8%
9241
 
10.7%
9053
 
10.5%
8970
 
10.4%
8942
 
10.3%
3506
 
4.1%
1010
 
1.2%
936
 
1.1%
750
 
0.9%
Other values (599) 25343
29.3%
Uppercase Letter
ValueCountFrequency (%)
S 132
17.3%
J 82
10.7%
K 59
 
7.7%
B 53
 
6.9%
Y 51
 
6.7%
C 45
 
5.9%
H 44
 
5.8%
E 42
 
5.5%
M 41
 
5.4%
N 35
 
4.6%
Other values (12) 181
23.7%
Lowercase Letter
ValueCountFrequency (%)
r 13
18.3%
o 11
15.5%
a 11
15.5%
l 11
15.5%
e 5
 
7.0%
s 5
 
7.0%
n 4
 
5.6%
u 2
 
2.8%
t 2
 
2.8%
w 2
 
2.8%
Other values (5) 5
 
7.0%
Decimal Number
ValueCountFrequency (%)
1 1537
36.2%
2 1314
31.0%
3 578
 
13.6%
4 243
 
5.7%
5 180
 
4.2%
6 122
 
2.9%
0 92
 
2.2%
7 84
 
2.0%
8 54
 
1.3%
9 41
 
1.0%
Other Punctuation
ValueCountFrequency (%)
& 18
62.1%
. 9
31.0%
· 1
 
3.4%
: 1
 
3.4%
Space Separator
ValueCountFrequency (%)
448
100.0%
Close Punctuation
ValueCountFrequency (%)
) 296
100.0%
Open Punctuation
ValueCountFrequency (%)
( 275
100.0%
Other Symbol
ValueCountFrequency (%)
176
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 122
100.0%
Letter Number
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 86609
93.3%
Common 5415
 
5.8%
Latin 839
 
0.9%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9355
 
10.8%
9328
 
10.8%
9241
 
10.7%
9053
 
10.5%
8970
 
10.4%
8942
 
10.3%
3506
 
4.0%
1010
 
1.2%
936
 
1.1%
750
 
0.9%
Other values (599) 25518
29.5%
Latin
ValueCountFrequency (%)
S 132
15.7%
J 82
 
9.8%
K 59
 
7.0%
B 53
 
6.3%
Y 51
 
6.1%
C 45
 
5.4%
H 44
 
5.2%
E 42
 
5.0%
M 41
 
4.9%
N 35
 
4.2%
Other values (28) 255
30.4%
Common
ValueCountFrequency (%)
1 1537
28.4%
2 1314
24.3%
3 578
 
10.7%
448
 
8.3%
) 296
 
5.5%
( 275
 
5.1%
4 243
 
4.5%
5 180
 
3.3%
- 122
 
2.3%
6 122
 
2.3%
Other values (8) 300
 
5.5%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 86433
93.1%
ASCII 6250
 
6.7%
None 177
 
0.2%
Number Forms 3
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
9355
 
10.8%
9328
 
10.8%
9241
 
10.7%
9053
 
10.5%
8970
 
10.4%
8942
 
10.3%
3506
 
4.1%
1010
 
1.2%
936
 
1.1%
750
 
0.9%
Other values (598) 25342
29.3%
ASCII
ValueCountFrequency (%)
1 1537
24.6%
2 1314
21.0%
3 578
 
9.2%
448
 
7.2%
) 296
 
4.7%
( 275
 
4.4%
4 243
 
3.9%
5 180
 
2.9%
S 132
 
2.1%
- 122
 
2.0%
Other values (44) 1125
18.0%
None
ValueCountFrequency (%)
176
99.4%
· 1
 
0.6%
Number Forms
ValueCountFrequency (%)
3
100.0%
CJK
ValueCountFrequency (%)
1
100.0%

시군
Categorical

Distinct14
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
정읍시
1770 
남원시
1230 
김제시
1188 
익산시
1086 
고창군
827 
Other values (9)
3899 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row장수군
2nd row정읍시
3rd row임실군
4th row정읍시
5th row임실군

Common Values

ValueCountFrequency (%)
정읍시 1770
17.7%
남원시 1230
12.3%
김제시 1188
11.9%
익산시 1086
10.9%
고창군 827
8.3%
임실군 656
 
6.6%
부안군 650
 
6.5%
완주군 564
 
5.6%
군산시 508
 
5.1%
장수군 372
 
3.7%
Other values (4) 1149
11.5%

Length

2024-03-14T09:50:11.638524image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
정읍시 1770
17.7%
남원시 1230
12.3%
김제시 1188
11.9%
익산시 1086
10.9%
고창군 827
8.3%
임실군 656
 
6.6%
부안군 650
 
6.5%
완주군 564
 
5.6%
군산시 508
 
5.1%
장수군 372
 
3.7%
Other values (4) 1149
11.5%
Distinct9326
Distinct (%)93.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-14T09:50:11.946798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length357
Median length174
Mean length23.1634
Min length6

Characters and Unicode

Total characters231634
Distinct characters385
Distinct categories14 ?
Distinct scripts3 ?
Distinct blocks8 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8941 ?
Unique (%)89.4%

Sample

1st row천천면 장판리 산38, 산39
2nd row북면 한교리 240-2,240-5,239 건물상부(3,4동)
3rd row오수면 춘향로 1568-34(금암리 856-1)건물상부(주1,주2)
4th row구룡동 산188-2, 109-1 건물상부(주1, 주2, 주3)
5th row오수면 용두리 503-3
ValueCountFrequency (%)
건물상부 1378
 
3.2%
건물상부(주1 650
 
1.5%
주2 331
 
0.8%
건물상부(주1,주2 309
 
0.7%
북면 275
 
0.6%
덕진구 271
 
0.6%
대산면 225
 
0.5%
용지면 170
 
0.4%
보안면 167
 
0.4%
황등면 165
 
0.4%
Other values (13684) 38694
90.8%
2024-03-14T09:50:12.388019image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
32771
 
14.1%
1 17628
 
7.6%
- 14487
 
6.3%
, 12219
 
5.3%
2 11566
 
5.0%
3 8897
 
3.8%
8641
 
3.7%
4 8020
 
3.5%
7814
 
3.4%
5 7364
 
3.2%
Other values (375) 102227
44.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 82851
35.8%
Other Letter 82623
35.7%
Space Separator 32771
 
14.1%
Dash Punctuation 14487
 
6.3%
Other Punctuation 12375
 
5.3%
Close Punctuation 3004
 
1.3%
Open Punctuation 3000
 
1.3%
Lowercase Letter 223
 
0.1%
Uppercase Letter 209
 
0.1%
Math Symbol 69
 
< 0.1%
Other values (4) 22
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8641
 
10.5%
7814
 
9.5%
4656
 
5.6%
4157
 
5.0%
4127
 
5.0%
4036
 
4.9%
3585
 
4.3%
3519
 
4.3%
2652
 
3.2%
1194
 
1.4%
Other values (327) 38242
46.3%
Decimal Number
ValueCountFrequency (%)
1 17628
21.3%
2 11566
14.0%
3 8897
10.7%
4 8020
9.7%
5 7364
8.9%
6 7144
8.6%
7 6298
 
7.6%
8 5601
 
6.8%
0 5232
 
6.3%
9 5101
 
6.2%
Uppercase Letter
ValueCountFrequency (%)
W 169
80.9%
K 22
 
10.5%
A 6
 
2.9%
B 4
 
1.9%
C 3
 
1.4%
D 1
 
0.5%
G 1
 
0.5%
I 1
 
0.5%
L 1
 
0.5%
H 1
 
0.5%
Other Number
ValueCountFrequency (%)
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
Other Punctuation
ValueCountFrequency (%)
, 12219
98.7%
. 144
 
1.2%
: 6
 
< 0.1%
/ 2
 
< 0.1%
2
 
< 0.1%
2
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 66
95.7%
2
 
2.9%
+ 1
 
1.4%
Lowercase Letter
ValueCountFrequency (%)
k 185
83.0%
w 38
 
17.0%
Modifier Symbol
ValueCountFrequency (%)
` 4
66.7%
˚ 2
33.3%
Other Symbol
ValueCountFrequency (%)
2
66.7%
1
33.3%
Space Separator
ValueCountFrequency (%)
32771
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 14487
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3004
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3000
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 148579
64.1%
Hangul 82623
35.7%
Latin 432
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8641
 
10.5%
7814
 
9.5%
4656
 
5.6%
4157
 
5.0%
4127
 
5.0%
4036
 
4.9%
3585
 
4.3%
3519
 
4.3%
2652
 
3.2%
1194
 
1.4%
Other values (327) 38242
46.3%
Common
ValueCountFrequency (%)
32771
22.1%
1 17628
11.9%
- 14487
9.8%
, 12219
 
8.2%
2 11566
 
7.8%
3 8897
 
6.0%
4 8020
 
5.4%
5 7364
 
5.0%
6 7144
 
4.8%
7 6298
 
4.2%
Other values (26) 22185
14.9%
Latin
ValueCountFrequency (%)
k 185
42.8%
W 169
39.1%
w 38
 
8.8%
K 22
 
5.1%
A 6
 
1.4%
B 4
 
0.9%
C 3
 
0.7%
D 1
 
0.2%
G 1
 
0.2%
I 1
 
0.2%
Other values (2) 2
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 148992
64.3%
Hangul 82623
35.7%
Enclosed Alphanum 8
 
< 0.1%
Punctuation 4
 
< 0.1%
CJK Compat 2
 
< 0.1%
Arrows 2
 
< 0.1%
Modifier Letters 2
 
< 0.1%
Misc Symbols 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
32771
22.0%
1 17628
11.8%
- 14487
9.7%
, 12219
 
8.2%
2 11566
 
7.8%
3 8897
 
6.0%
4 8020
 
5.4%
5 7364
 
4.9%
6 7144
 
4.8%
7 6298
 
4.2%
Other values (24) 22598
15.2%
Hangul
ValueCountFrequency (%)
8641
 
10.5%
7814
 
9.5%
4656
 
5.6%
4157
 
5.0%
4127
 
5.0%
4036
 
4.9%
3585
 
4.3%
3519
 
4.3%
2652
 
3.2%
1194
 
1.4%
Other values (327) 38242
46.3%
CJK Compat
ValueCountFrequency (%)
2
100.0%
Arrows
ValueCountFrequency (%)
2
100.0%
Modifier Letters
ValueCountFrequency (%)
˚ 2
100.0%
Punctuation
ValueCountFrequency (%)
2
50.0%
2
50.0%
Misc Symbols
ValueCountFrequency (%)
1
100.0%
Enclosed Alphanum
ValueCountFrequency (%)
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%

설치구분
Categorical

IMBALANCE 

Distinct12
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
6440 
건물상부
3516 
 
33
건물상부, 토지위
 
2
건물상부, 토지
 
2
Other values (7)
 
7

Length

Max length10
Median length4
Mean length3.9921
Min length1

Unique

Unique7 ?
Unique (%)0.1%

Sample

1st row<NA>
2nd row건물상부
3rd row건물상부
4th row건물상부
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 6440
64.4%
건물상부 3516
35.2%
33
 
0.3%
건물상부, 토지위 2
 
< 0.1%
건물상부, 토지 2
 
< 0.1%
건물상부, 토지 위 1
 
< 0.1%
건물,토지 1
 
< 0.1%
수상 1
 
< 0.1%
외벽 1
 
< 0.1%
토지위 1
 
< 0.1%
Other values (2) 2
 
< 0.1%

Length

2024-03-14T09:50:12.505812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 6440
64.6%
건물상부 3521
35.3%
토지 4
 
< 0.1%
토지위 3
 
< 0.1%
1
 
< 0.1%
건물,토지 1
 
< 0.1%
수상 1
 
< 0.1%
외벽 1
 
< 0.1%
1
 
< 0.1%
1
 
< 0.1%

허가면적(㎡)
Text

MISSING 

Distinct5238
Distinct (%)55.2%
Missing517
Missing (%)5.2%
Memory size156.2 KiB
2024-03-14T09:50:12.821646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length4
Mean length3.8495202
Min length1

Characters and Unicode

Total characters36505
Distinct characters18
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3373 ?
Unique (%)35.6%

Sample

1st row11944
2nd row385
3rd row432
4th row1554
5th row5900
ValueCountFrequency (%)
712 47
 
0.5%
420 42
 
0.4%
10000 33
 
0.3%
388 29
 
0.3%
392 29
 
0.3%
6287 28
 
0.3%
600 25
 
0.3%
504 24
 
0.3%
3900 23
 
0.2%
6088 23
 
0.2%
Other values (5229) 9181
96.8%
2024-03-14T09:50:13.294214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 5203
14.3%
0 4397
12.0%
2 4173
11.4%
3 3657
10.0%
4 3514
9.6%
5 3457
9.5%
6 3357
9.2%
8 2948
8.1%
7 2931
8.0%
9 2855
7.8%
Other values (8) 13
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 36492
> 99.9%
Space Separator 5
 
< 0.1%
Other Letter 4
 
< 0.1%
Other Punctuation 3
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 5203
14.3%
0 4397
12.0%
2 4173
11.4%
3 3657
10.0%
4 3514
9.6%
5 3457
9.5%
6 3357
9.2%
8 2948
8.1%
7 2931
8.0%
9 2855
7.8%
Other Letter
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Other Punctuation
ValueCountFrequency (%)
. 2
66.7%
, 1
33.3%
Space Separator
ValueCountFrequency (%)
5
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 36501
> 99.9%
Hangul 4
 
< 0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
1 5203
14.3%
0 4397
12.0%
2 4173
11.4%
3 3657
10.0%
4 3514
9.6%
5 3457
9.5%
6 3357
9.2%
8 2948
8.1%
7 2931
8.0%
9 2855
7.8%
Other values (4) 9
 
< 0.1%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 36501
> 99.9%
Hangul 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 5203
14.3%
0 4397
12.0%
2 4173
11.4%
3 3657
10.0%
4 3514
9.6%
5 3457
9.5%
6 3357
9.2%
8 2948
8.1%
7 2931
8.0%
9 2855
7.8%
Other values (4) 9
 
< 0.1%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Distinct697
Distinct (%)7.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-14T09:50:13.603924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length3
Mean length2.6225
Min length1

Characters and Unicode

Total characters26225
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique267 ?
Unique (%)2.7%

Sample

1st row999
2nd row84
3rd row99
4th row300
5th row498
ValueCountFrequency (%)
99 1718
 
17.2%
499 471
 
4.7%
30 407
 
4.1%
200 323
 
3.2%
300 267
 
2.7%
199 257
 
2.6%
999 251
 
2.5%
500 239
 
2.4%
497 213
 
2.1%
20 209
 
2.1%
Other values (686) 5644
56.4%
2024-03-14T09:50:13.996460image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9 9583
36.5%
0 4538
17.3%
4 2181
 
8.3%
2 1925
 
7.3%
1 1793
 
6.8%
3 1498
 
5.7%
8 1437
 
5.5%
5 1424
 
5.4%
7 1073
 
4.1%
6 770
 
2.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 26222
> 99.9%
Space Separator 3
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
9 9583
36.5%
0 4538
17.3%
4 2181
 
8.3%
2 1925
 
7.3%
1 1793
 
6.8%
3 1498
 
5.7%
8 1437
 
5.5%
5 1424
 
5.4%
7 1073
 
4.1%
6 770
 
2.9%
Space Separator
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 26225
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
9 9583
36.5%
0 4538
17.3%
4 2181
 
8.3%
2 1925
 
7.3%
1 1793
 
6.8%
3 1498
 
5.7%
8 1437
 
5.5%
5 1424
 
5.4%
7 1073
 
4.1%
6 770
 
2.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 26225
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9 9583
36.5%
0 4538
17.3%
4 2181
 
8.3%
2 1925
 
7.3%
1 1793
 
6.8%
3 1498
 
5.7%
8 1437
 
5.5%
5 1424
 
5.4%
7 1073
 
4.1%
6 770
 
2.9%

공급전압
Categorical

Distinct10
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
380
5845 
220/380
1754 
22900
1600 
<NA>
699 
220
 
70
Other values (5)
 
32

Length

Max length11
Median length3
Mean length4.1103
Min length3

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row22900
2nd row380
3rd row380
4th row380
5th row380

Common Values

ValueCountFrequency (%)
380 5845
58.5%
220/380 1754
 
17.5%
22900 1600
 
16.0%
<NA> 699
 
7.0%
220 70
 
0.7%
220/380 24
 
0.2%
220 /380 3
 
< 0.1%
380/220 3
 
< 0.1%
22900/380 1
 
< 0.1%
22/380 1
 
< 0.1%

Length

2024-03-14T09:50:14.150552image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T09:50:14.282596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
380 5848
58.5%
220/380 1778
 
17.8%
22900 1600
 
16.0%
na 699
 
7.0%
220 73
 
0.7%
380/220 3
 
< 0.1%
22900/380 1
 
< 0.1%
22/380 1
 
< 0.1%
Distinct1145
Distinct (%)11.5%
Missing18
Missing (%)0.2%
Memory size156.2 KiB
2024-03-14T09:50:14.530845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length9.9990984
Min length1

Characters and Unicode

Total characters99811
Distinct characters12
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique349 ?
Unique (%)3.5%

Sample

1st row2020-09-10
2nd row2015-12-02
3rd row2016-12-17
4th row2021-01-24
5th row2022-04-07
ValueCountFrequency (%)
2016-09-02 63
 
0.6%
2022-02-06 58
 
0.6%
2017-01-26 57
 
0.6%
2016-12-22 56
 
0.6%
2021-09-17 55
 
0.6%
2020-12-20 52
 
0.5%
2016-10-06 49
 
0.5%
2020-09-28 46
 
0.5%
2020-06-18 45
 
0.5%
2016-12-05 45
 
0.5%
Other values (1135) 9456
94.7%
2024-03-14T09:50:14.885398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 24859
24.9%
2 22072
22.1%
- 19962
20.0%
1 15183
15.2%
6 3859
 
3.9%
7 2734
 
2.7%
3 2607
 
2.6%
9 2434
 
2.4%
5 2299
 
2.3%
4 2107
 
2.1%
Other values (2) 1695
 
1.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 79848
80.0%
Dash Punctuation 19962
 
20.0%
Other Letter 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 24859
31.1%
2 22072
27.6%
1 15183
19.0%
6 3859
 
4.8%
7 2734
 
3.4%
3 2607
 
3.3%
9 2434
 
3.0%
5 2299
 
2.9%
4 2107
 
2.6%
8 1694
 
2.1%
Dash Punctuation
ValueCountFrequency (%)
- 19962
100.0%
Other Letter
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 99810
> 99.9%
Hangul 1
 
< 0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
0 24859
24.9%
2 22072
22.1%
- 19962
20.0%
1 15183
15.2%
6 3859
 
3.9%
7 2734
 
2.7%
3 2607
 
2.6%
9 2434
 
2.4%
5 2299
 
2.3%
4 2107
 
2.1%
Hangul
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 99810
> 99.9%
Hangul 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 24859
24.9%
2 22072
22.1%
- 19962
20.0%
1 15183
15.2%
6 3859
 
3.9%
7 2734
 
2.7%
3 2607
 
2.6%
9 2434
 
2.4%
5 2299
 
2.3%
4 2107
 
2.1%
Hangul
ValueCountFrequency (%)
1
100.0%

사업개시신고일
Date

MISSING 

Distinct1142
Distinct (%)30.5%
Missing6257
Missing (%)62.6%
Memory size156.2 KiB
Minimum1902-09-22 00:00:00
Maximum2020-01-15 00:00:00
2024-03-14T09:50:15.032430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T09:50:15.171322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Correlations

2024-03-14T09:50:15.246822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군설치구분공급전압
시군1.0000.0000.204
설치구분0.0001.0000.000
공급전압0.2040.0001.000
2024-03-14T09:50:15.519317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
설치구분시군공급전압
설치구분1.0000.0000.000
시군0.0001.0000.087
공급전압0.0000.0871.000
2024-03-14T09:50:15.587022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군설치구분공급전압
시군1.0000.0000.087
설치구분0.0001.0000.000
공급전압0.0870.0001.000

Missing values

2024-03-14T09:50:09.725836image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T09:50:09.893384image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-14T09:50:10.065197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

허가일에너지원발전소명칭시군사업장소설치구분허가면적(㎡)설비용량공급전압준비기간사업개시신고일
62472017-09-11태양광장판6호태양광발전소장수군천천면 장판리 산38, 산39<NA>11944999229002020-09-10<NA>
15022012-12-03태양광대성9태양광발전소정읍시북면 한교리 240-2,240-5,239 건물상부(3,4동)건물상부385843802015-12-022013-02-15
34112013-12-18태양광시열태양광발전소임실군오수면 춘향로 1568-34(금암리 856-1)건물상부(주1,주2)건물상부432993802016-12-172014-03-26
73832018-01-25태양광승용태양광발전소정읍시구룡동 산188-2, 109-1 건물상부(주1, 주2, 주3)건물상부15543003802021-01-24<NA>
97462019-04-08태양광제이오메가태양광발전소임실군오수면 용두리 503-3<NA>59004983802022-04-07<NA>
552007-10-15태양광신화태양광발전소김제시요촌동 346-7 건물상부건물상부1895111<NA>2008-10-142013-03-08
37322014-02-04태양광영주1호태양광발전소부안군행안면 봉야로 312,312-6(대초리 1121-1,1121-2)건물상부(주1,주2)건물상부1990250229002017-02-032014-07-11
31162013-11-13태양광진성5호태양광발전소정읍시입암면 밤고개로 996(봉양리 115) 건물상부건물상부4139993802016-11-12<NA>
35482014-01-03태양광종문태양광발전소정읍시북면 화평길 93-90(신평리 140-1,141-2,142-7,143-4)건물상부(주1)건물상부48877220/3802017-01-022014-04-16
61572017-08-24태양광청정에너지정읍시소성면 고교리 653-2<NA>74304993802020-08-23<NA>
허가일에너지원발전소명칭시군사업장소설치구분허가면적(㎡)설비용량공급전압준비기간사업개시신고일
56072017-04-28태양광구보다태양광발전소정읍시영원면 장재리 685-2, 685-3, 720-1<NA>34343003802020-04-27<NA>
37742014-02-10태양광방채정태양광발전소고창군상하면 송라길 214-54 건물상부건물상부71299220/3802017-02-09<NA>
34772013-12-23태양광박철홍태양광발전소정읍시입암면 평암길21(하부리 613,613-1)건물상부건물상부432603802016-12-222014-02-21
94662018-12-06태양광후광1태양과발전소정읍시이평면 장내리 산20-4, 산19-1<NA>36002993802021-12-05<NA>
37432014-02-05태양광김병남1호태양광발전소정읍시이평면 오금리 857-1<NA>71299220/3802017-02-042015-05-15
11422012-05-30태양광용지태양광발전소김제시용지면 구암리 294-1건물상부건물상부592993802015-05-292013-03-11
78132018-03-27태양광중송태양광발전소남원시주천면 쑥고개로 592-21(송치리 322, 323, 471-1, 471-2, 473, 478) 건물상부(주1~주4)건물상부22803963802021-03-26<NA>
67972017-11-21태양광봉두태양광발전소정읍시태인면 박산리 702<NA>27601993802020-11-20<NA>
71722018-01-04태양광은혜태양광발전소김제시청하면 장산리 241, 395-1, 395-3, 403, 400-4<NA>66534993802021-01-03<NA>
35072013-12-27태양광함열태양광발전소익산시함열읍 석매리 1231-20 건물상부(주1,주2,주3)건물상부334783802016-12-26<NA>