Overview

Dataset statistics

Number of variables: 7
Number of observations: 623
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 36.0 KiB
Average record size in memory: 59.2 B

Variable types

Numeric: 2
Text: 2
Categorical: 3

Dataset

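Description: Licensing status of solar power generation businesses in Chilgok-gun, including the business name, installation site address, energy source, installed capacity, supply voltage, and frequency
Author: 경상북도 칠곡군 (Chilgok-gun, Gyeongsangbuk-do)
URL: https://www.data.go.kr/data/15099437/fileData.do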

Alerts

Constant: 원동력의종류 has constant value "태양광"
Constant: 주파수(Hz) has constant value "60"
Imbalance: 공급전압(V) is highly imbalanced (82.8%)
Unique: 순번 has unique values
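
The 82.8% imbalance figure for 공급전압(V) can be reproduced from the category counts listed in that variable's Common Values table. The sketch below assumes the score is one minus the normalised Shannon entropy of the class distribution; that formula reproduces the reported value, but it is an assumption about how the profiler computes it, and the function name `imbalance_score` is illustrative only.

```python
import math

# Category counts for 공급전압(V), copied from the Common Values table in this report.
voltage_counts = [575, 24, 7, 5, 4, 3, 2, 1, 1, 1]


def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (in bits) of the
    class distribution and k is the number of classes.
    0 means perfectly balanced; values near 1 mean one class dominates."""
    k = len(counts)
    if k < 2:
        return 0.0  # assumption: a single class is not scored as imbalanced here
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


score = imbalance_score(voltage_counts)
print(f"{score:.1%}")
```

With 575 of 623 rows in a single category, the entropy is small compared with the maximum possible for ten classes, which is why the score is high.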

Reproduction

Analysis started: 2023-12-12 05:38:13.267550
Analysis finished: 2023-12-12 05:38:14.415297
Duration: 1.15 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct: 623
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 312
Minimum: 1
Maximum: 623
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 5.6 KiB

Quantile statistics

Minimum: 1
5-th percentile: 32.1
Q1: 156.5
Median: 312
Q3: 467.5
95-th percentile: 591.9
Maximum: 623
Range: 622
Interquartile range (IQR): 311
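
The quantiles above can be reproduced without the original file. The sketch below assumes that 순번 is the integer sequence 1 to 623, which is consistent with the reported minimum, maximum, distinct count, sum, and strictly increasing order. The reported values match linear interpolation between neighbouring order statistics; whether the profiler uses exactly this method is an assumption.

```python
import statistics

# Assumption: 순번 is the index 1..623 (min 1, max 623, 623 distinct, strictly increasing).
seq = list(range(1, 624))

# method="inclusive" interpolates linearly, treating the data as the full population.
quartiles = statistics.quantiles(seq, n=4, method="inclusive")    # Q1, median, Q3
twentieths = statistics.quantiles(seq, n=20, method="inclusive")  # 5%, 10%, ..., 95%

q1, median, q3 = quartiles
p5, p95 = twentieths[0], twentieths[-1]

print("5-th percentile:", p5)
print("Q1:", q1, "Median:", median, "Q3:", q3)
print("95-th percentile:", p95)
print("IQR:", q3 - q1, "Range:", max(seq) - min(seq))
```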

Descriptive statistics

Standard deviation: 179.98889
Coefficient of variation (CV): 0.57688746
Kurtosis: -1.2
Mean: 312
Median Absolute Deviation (MAD): 156
Skewness: 0
Sum: 194376
Variance: 32396
Monotonicity: Strictly increasing
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
1 | 1 | 0.2%
420 | 1 | 0.2%
413 | 1 | 0.2%
414 | 1 | 0.2%
415 | 1 | 0.2%
416 | 1 | 0.2%
417 | 1 | 0.2%
418 | 1 | 0.2%
419 | 1 | 0.2%
421 | 1 | 0.2%
Other values (613) | 613 | 98.4%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 0.2%
2 | 1 | 0.2%
3 | 1 | 0.2%
4 | 1 | 0.2%
5 | 1 | 0.2%
6 | 1 | 0.2%
7 | 1 | 0.2%
8 | 1 | 0.2%
9 | 1 | 0.2%
10 | 1 | 0.2%

Maximum 10 values
Value | Count | Frequency (%)
623 | 1 | 0.2%
622 | 1 | 0.2%
621 | 1 | 0.2%
620 | 1 | 0.2%
619 | 1 | 0.2%
618 | 1 | 0.2%
617 | 1 | 0.2%
616 | 1 | 0.2%
615 | 1 | 0.2%
614 | 1 | 0.2%

상호
Text

Distinct: 613
Distinct (%): 98.4%
Missing: 0
Missing (%): 0.0%
Memory size: 5.0 KiB

Length

Max length: 21
Median length: 19
Mean length: 10.301766
Min length: 4

Characters and Unicode

Total characters: 6418
Distinct characters: 314
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
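
As a rough illustration of how characters are grouped by Unicode general category, the sketch below tallies the categories in the five sample values of 상호 shown in this report, using Python's standard `unicodedata` module. The `CATEGORY_NAMES` mapping is an illustrative helper that covers only the codes occurring in these samples; the totals are for the five samples, not for the full column.

```python
import unicodedata
from collections import Counter

# The five sample values of 상호 shown in this report.
names = [
    "JC 태양광발전소",
    "하이파크 태양광발전소",
    "대흥테크 태양광발전소",
    "장성1 태양광발전소",
    "장성2 태양광발전소",
]

# Long names for the two-letter general-category codes that occur in these samples.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
}

category_counts = Counter()
for name in names:
    for ch in name:
        code = unicodedata.category(ch)  # e.g. "Lo" for a Hangul syllable
        category_counts[CATEGORY_NAMES.get(code, code)] += 1

for label, count in category_counts.most_common():
    print(label, count)
```

Hangul syllables fall under Other Letter because Hangul has no upper or lower case, which is why that category dominates the column.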

Unique

Unique: 603
Unique (%): 96.8%
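
The Distinct and Unique figures differ because they count different things: Distinct is the number of different values, while Unique is the number of values that occur exactly once. Here 613 distinct names and 603 unique names imply that 10 names are repeated. A minimal sketch with a hypothetical four-row column (the names are made up for illustration):

```python
from collections import Counter

# Hypothetical mini-column; the real 상호 column has 623 values.
values = ["A 태양광발전소", "B 태양광발전소", "B 태양광발전소", "C 태양광발전소"]

counts = Counter(values)
n_distinct = len(counts)                               # number of different values
n_unique = sum(1 for c in counts.values() if c == 1)   # values seen exactly once

print("Distinct:", n_distinct)
print("Unique:", n_unique)
```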

Sample

1st row: JC 태양광발전소
2nd row: 하이파크 태양광발전소
3rd row: 대흥테크 태양광발전소
4th row: 장성1 태양광발전소
5th row: 장성2 태양광발전소
Value | Count | Frequency (%)
태양광발전소 | 506 | 43.2%
1호 | 5 | 0.4%
경북우리집re100발전소 | 4 | 0.3%
발전소 | 4 | 0.3%
이홍규 | 3 | 0.3%
한일 | 3 | 0.3%
태양광발전소2호 | 3 | 0.3%
대명 | 3 | 0.3%
세영 | 3 | 0.3%
2호 | 3 | 0.3%
Other values (612) | 633 | 54.1%

Most occurring characters

ValueCountFrequency (%)
604
 
9.4%
602
 
9.4%
600
 
9.3%
600
 
9.3%
599
 
9.3%
596
 
9.3%
549
 
8.6%
181
 
2.8%
2 101
 
1.6%
1 93
 
1.4%
Other values (304) 1893
29.5%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 5464 | 85.1%
Space Separator | 549 | 8.6%
Decimal Number | 299 | 4.7%
Uppercase Letter | 53 | 0.8%
Close Punctuation | 19 | 0.3%
Open Punctuation | 19 | 0.3%
Dash Punctuation | 8 | 0.1%
Lowercase Letter | 4 | 0.1%
Letter Number | 2 | < 0.1%
Other Symbol | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
604
 
11.1%
602
 
11.0%
600
 
11.0%
600
 
11.0%
599
 
11.0%
596
 
10.9%
181
 
3.3%
49
 
0.9%
43
 
0.8%
41
 
0.8%
Other values (268) 1549
28.3%
Uppercase Letter
ValueCountFrequency (%)
S 7
13.2%
K 7
13.2%
H 6
11.3%
E 6
11.3%
M 5
9.4%
R 4
7.5%
G 3
5.7%
B 3
5.7%
N 3
5.7%
J 3
5.7%
Other values (5) 6
11.3%
Decimal Number
ValueCountFrequency (%)
2 101
33.8%
1 93
31.1%
3 41
13.7%
5 17
 
5.7%
4 15
 
5.0%
0 12
 
4.0%
6 11
 
3.7%
7 6
 
2.0%
8 2
 
0.7%
9 1
 
0.3%
Lowercase Letter
ValueCountFrequency (%)
l 1
25.0%
a 1
25.0%
o 1
25.0%
r 1
25.0%
Letter Number
ValueCountFrequency (%)
1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
549
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 5465 | 85.2%
Common | 894 | 13.9%
Latin | 59 | 0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
604
 
11.1%
602
 
11.0%
600
 
11.0%
600
 
11.0%
599
 
11.0%
596
 
10.9%
181
 
3.3%
49
 
0.9%
43
 
0.8%
41
 
0.8%
Other values (269) 1550
28.4%
Latin
ValueCountFrequency (%)
S 7
11.9%
K 7
11.9%
H 6
10.2%
E 6
10.2%
M 5
8.5%
R 4
 
6.8%
G 3
 
5.1%
B 3
 
5.1%
N 3
 
5.1%
J 3
 
5.1%
Other values (11) 12
20.3%
Common
ValueCountFrequency (%)
549
61.4%
2 101
 
11.3%
1 93
 
10.4%
3 41
 
4.6%
) 19
 
2.1%
( 19
 
2.1%
5 17
 
1.9%
4 15
 
1.7%
0 12
 
1.3%
6 11
 
1.2%
Other values (4) 17
 
1.9%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 5464 | 85.1%
ASCII | 951 | 14.8%
Number Forms | 2 | < 0.1%
None | 1 | < 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
604
 
11.1%
602
 
11.0%
600
 
11.0%
600
 
11.0%
599
 
11.0%
596
 
10.9%
181
 
3.3%
49
 
0.9%
43
 
0.8%
41
 
0.8%
Other values (268) 1549
28.3%
ASCII
ValueCountFrequency (%)
549
57.7%
2 101
 
10.6%
1 93
 
9.8%
3 41
 
4.3%
) 19
 
2.0%
( 19
 
2.0%
5 17
 
1.8%
4 15
 
1.6%
0 12
 
1.3%
6 11
 
1.2%
Other values (23) 74
 
7.8%
None
ValueCountFrequency (%)
1
100.0%
Number Forms
ValueCountFrequency (%)
1
50.0%
1
50.0%

설치장소소재지
Text

Distinct: 481
Distinct (%): 77.2%
Missing: 0
Missing (%): 0.0%
Memory size: 5.0 KiB

Length

Max length: 49
Median length: 33
Mean length: 25.322632
Min length: 19

Characters and Unicode

Total characters: 15776
Distinct characters: 140
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 391
Unique (%): 62.8%

Sample

1st row: 경상북도 칠곡군 지천면 금호로 255-2, 건물 위
2nd row: 경상북도 칠곡군 석적읍 유학로 110-17, 건물 위
3rd row: 경상북도 칠곡군 왜관읍 아곡5길 224, 건물 위
4th row: 경상북도 칠곡군 지천면 칠곡대로 2233-55, 건물 위
5th row: 경상북도 칠곡군 지천면 칠곡대로 2233-55, 건물 위
ValueCountFrequency (%)
경상북도 623
16.7%
칠곡군 623
16.7%
224
 
6.0%
건물 214
 
5.7%
약목면 137
 
3.7%
지천면 134
 
3.6%
왜관읍 103
 
2.8%
가산면 68
 
1.8%
북삼읍 57
 
1.5%
동안들1길 57
 
1.5%
Other values (637) 1495
40.0%

Most occurring characters

ValueCountFrequency (%)
3113
19.7%
702
 
4.4%
668
 
4.2%
653
 
4.1%
1 650
 
4.1%
649
 
4.1%
637
 
4.0%
628
 
4.0%
623
 
3.9%
423
 
2.7%
Other values (130) 7030
44.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 9433 | 59.8%
Space Separator | 3113 | 19.7%
Decimal Number | 2633 | 16.7%
Dash Punctuation | 308 | 2.0%
Other Punctuation | 267 | 1.7%
Open Punctuation | 8 | 0.1%
Close Punctuation | 8 | 0.1%
Uppercase Letter | 6 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
702
 
7.4%
668
 
7.1%
653
 
6.9%
649
 
6.9%
637
 
6.8%
628
 
6.7%
623
 
6.6%
423
 
4.5%
339
 
3.6%
271
 
2.9%
Other values (113) 3840
40.7%
Decimal Number
ValueCountFrequency (%)
1 650
24.7%
2 400
15.2%
3 265
10.1%
6 236
 
9.0%
0 201
 
7.6%
4 200
 
7.6%
5 193
 
7.3%
9 182
 
6.9%
7 164
 
6.2%
8 142
 
5.4%
Uppercase Letter
ValueCountFrequency (%)
A 3
50.0%
B 3
50.0%
Space Separator
ValueCountFrequency (%)
3113
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 308
100.0%
Other Punctuation
ValueCountFrequency (%)
, 267
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 9433 | 59.8%
Common | 6337 | 40.2%
Latin | 6 | < 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
702
 
7.4%
668
 
7.1%
653
 
6.9%
649
 
6.9%
637
 
6.8%
628
 
6.7%
623
 
6.6%
423
 
4.5%
339
 
3.6%
271
 
2.9%
Other values (113) 3840
40.7%
Common
ValueCountFrequency (%)
3113
49.1%
1 650
 
10.3%
2 400
 
6.3%
- 308
 
4.9%
, 267
 
4.2%
3 265
 
4.2%
6 236
 
3.7%
0 201
 
3.2%
4 200
 
3.2%
5 193
 
3.0%
Other values (5) 504
 
8.0%
Latin
ValueCountFrequency (%)
A 3
50.0%
B 3
50.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 9433 | 59.8%
ASCII | 6343 | 40.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3113
49.1%
1 650
 
10.2%
2 400
 
6.3%
- 308
 
4.9%
, 267
 
4.2%
3 265
 
4.2%
6 236
 
3.7%
0 201
 
3.2%
4 200
 
3.2%
5 193
 
3.0%
Other values (7) 510
 
8.0%
Hangul
ValueCountFrequency (%)
702
 
7.4%
668
 
7.1%
653
 
6.9%
649
 
6.9%
637
 
6.8%
628
 
6.7%
623
 
6.6%
423
 
4.5%
339
 
3.6%
271
 
2.9%
Other values (113) 3840
40.7%

원동력의종류
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 5.0 KiB
태양광: 623

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 태양광
2nd row: 태양광
3rd row: 태양광
4th row: 태양광
5th row: 태양광

Common Values

Value | Count | Frequency (%)
태양광 | 623 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

[Plot of the common values; image not preserved]

설비용량(KW)
Real number (ℝ)

Distinct: 339
Distinct (%): 54.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 121.6205
Minimum: 5.52
Maximum: 1500
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 5.6 KiB

Quantile statistics

Minimum: 5.52
5-th percentile: 19.359
Q1: 47.06
Median: 96.2
Q3: 99.75
95-th percentile: 355.468
Maximum: 1500
Range: 1494.48
Interquartile range (IQR): 52.69

Descriptive statistics

Standard deviation: 187.5378
Coefficient of variation (CV): 1.5419917
Kurtosis: 27.872276
Mean: 121.6205
Median Absolute Deviation (MAD): 28.7
Skewness: 4.9138879
Sum: 75769.57
Variance: 35170.427
Monotonicity: Not monotonic
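
Several of the 설비용량(KW) statistics are simple functions of one another, so they can be cross-checked from the figures printed in this report. The sketch below uses only those printed (rounded) values, so small differences in the last digits are expected; the variable names are illustrative.

```python
# Figures copied from the 설비용량(KW) statistics in this report (already rounded).
n = 623
total = 75769.57
mean = 121.6205
std = 187.5378
q1, q3 = 47.06, 99.75
minimum, maximum = 5.52, 1500.0

derived = {
    "mean": total / n,           # sum divided by the number of observations
    "variance": std ** 2,        # square of the standard deviation
    "cv": std / mean,            # coefficient of variation
    "iqr": q3 - q1,              # interquartile range
    "range": maximum - minimum,  # maximum minus minimum
}

for name, value in derived.items():
    print(f"{name}: {value:.4f}")
```

A coefficient of variation above 1, together with a mean (121.6) well above the median (96.2), is consistent with the strong right skew reported for this variable: a few installations near 1000 to 1500 kW pull the mean upward.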
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
99.0 | 42 | 6.7%
99.19 | 26 | 4.2%
99.84 | 23 | 3.7%
99.9 | 17 | 2.7%
99.6 | 13 | 2.1%
99.75 | 11 | 1.8%
97.2 | 9 | 1.4%
99.96 | 7 | 1.1%
99.51 | 6 | 1.0%
98.28 | 6 | 1.0%
Other values (329) | 463 | 74.3%

Minimum 10 values
Value | Count | Frequency (%)
5.52 | 1 | 0.2%
8.19 | 1 | 0.2%
10.0 | 2 | 0.3%
10.2 | 1 | 0.2%
10.22 | 1 | 0.2%
11.04 | 1 | 0.2%
12.0 | 2 | 0.3%
14.85 | 1 | 0.2%
14.96 | 1 | 0.2%
15.02 | 1 | 0.2%

Maximum 10 values
Value | Count | Frequency (%)
1500.0 | 2 | 0.3%
1496.0 | 2 | 0.3%
1479.87 | 1 | 0.2%
1000.0 | 2 | 0.3%
998.4 | 1 | 0.2%
996.84 | 1 | 0.2%
995.0 | 1 | 0.2%
994.84 | 1 | 0.2%
991.2 | 1 | 0.2%
977.04 | 1 | 0.2%

공급전압(V)
Categorical

IMBALANCE 

Distinct: 10
Distinct (%): 1.6%
Missing: 0
Missing (%): 0.0%
Memory size: 5.0 KiB
380: 575
220: 24
22900: 7
220/380: 5
22,900: 4
Other values (5): 8

Length

Max length: 11
Median length: 3
Mean length: 3.141252
Min length: 3

Unique

Unique: 3
Unique (%): 0.5%

Sample

1st row: 380
2nd row: 380
3rd row: 380
4th row: 380
5th row: 380

Common Values

Value | Count | Frequency (%)
380 | 575 | 92.3%
220 | 24 | 3.9%
22900 | 7 | 1.1%
220/380 | 5 | 0.8%
22,900 | 4 | 0.6%
380/22,900 | 3 | 0.5%
22.9k | 2 | 0.3%
2200/380 | 1 | 0.2%
380/220 | 1 | 0.2%
380V/22,900 | 1 | 0.2%
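
Several of these categories look like different spellings of the same voltage (for example 22900, 22,900, and 22.9k), which is why a numeric field is profiled as Categorical with 10 distinct values. The sketch below shows one possible way to normalise the labels before analysis. It is not part of the report: the function names are hypothetical, reading "k" as a factor of 1000 is an assumption, and so is treating the order of the two voltages as irrelevant (so that 220/380 and 380/220 map to the same value). The entry 2200/380 is parsed literally and not corrected.

```python
from decimal import Decimal


def parse_voltage(token: str) -> int:
    """Parse one token such as '380', '22,900', '22.9k' or '380V' into volts."""
    text = token.strip().lower().replace(",", "")
    if text.endswith("v"):
        text = text[:-1]        # drop a trailing unit marker
    multiplier = 1
    if text.endswith("k"):
        text = text[:-1]        # assumption: 'k' means "times 1000"
        multiplier = 1000
    # Decimal avoids binary floating-point error in 22.9 * 1000.
    return int(Decimal(text) * multiplier)


def normalize_voltage(raw: str) -> tuple:
    """Split on '/' and return the voltages in ascending order."""
    return tuple(sorted(parse_voltage(part) for part in raw.split("/")))


# The ten category labels listed in the Common Values table.
labels = ["380", "220", "22900", "220/380", "22,900", "380/22,900",
          "22.9k", "2200/380", "380/220", "380V/22,900"]
for label in labels:
    print(label, "->", normalize_voltage(label))
```

Under these assumptions the ten labels collapse to six distinct values.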

Length

Histogram of lengths of the category

Common Values (Plot)

[Plot of the common values; image not preserved]

주파수(Hz)
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 5.0 KiB
60: 623

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 60
2nd row: 60
3rd row: 60
4th row: 60
5th row: 60

Common Values

Value | Count | Frequency (%)
60 | 623 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

[Plot of the common values; image not preserved]

Interactions

[Four interaction plots; images not preserved]

Correlations

[Correlation heatmap; image not preserved]
Variable | 순번 | 설비용량(KW) | 공급전압(V)
순번 | 1.000 | 0.255 | 0.425
설비용량(KW) | 0.255 | 1.000 | 0.669
공급전압(V) | 0.425 | 0.669 | 1.000

[Correlation heatmap; image not preserved]
Variable | 순번 | 설비용량(KW) | 공급전압(V)
순번 | 1.000 | 0.079 | 0.142
설비용량(KW) | 0.079 | 1.000 | 0.399
공급전압(V) | 0.142 | 0.399 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
Index | 순번 | 상호 | 설치장소소재지 | 원동력의종류 | 설비용량(KW) | 공급전압(V) | 주파수(Hz)
0 | 1 | JC 태양광발전소 | 경상북도 칠곡군 지천면 금호로 255-2, 건물 위 | 태양광 | 49.92 | 380 | 60
1 | 2 | 하이파크 태양광발전소 | 경상북도 칠곡군 석적읍 유학로 110-17, 건물 위 | 태양광 | 29.76 | 380 | 60
2 | 3 | 대흥테크 태양광발전소 | 경상북도 칠곡군 왜관읍 아곡5길 224, 건물 위 | 태양광 | 83.52 | 380 | 60
3 | 4 | 장성1 태양광발전소 | 경상북도 칠곡군 지천면 칠곡대로 2233-55, 건물 위 | 태양광 | 71.3 | 380 | 60
4 | 5 | 장성2 태양광발전소 | 경상북도 칠곡군 지천면 칠곡대로 2233-55, 건물 위 | 태양광 | 49.45 | 380 | 60
5 | 6 | 지안4 태양광발전소 | 경상북도 칠곡군 약목면 무림4길 36, 건물 위 가동 | 태양광 | 99.9 | 380 | 60
6 | 7 | 지안3 태양광발전소 | 경상북도 칠곡군 약목면 무림4길 36, 건물 위 나동 | 태양광 | 37.26 | 380 | 60
7 | 8 | 지안5 태양광발전소 | 경상북도 칠곡군 약목면 무림4길 36, 건물 위 가동 | 태양광 | 99.9 | 380 | 60
8 | 9 | 예린농장6호 태양광발전소 | 경상북도 칠곡군 약목면 동안들1길 160-47, 건물 위 | 태양광 | 99.84 | 380 | 60
9 | 10 | 예린농장5호 태양광발전소 | 경상북도 칠곡군 약목면 동안들1길 160-47, 건물 위 | 태양광 | 99.84 | 380 | 60

Last rows
Index | 순번 | 상호 | 설치장소소재지 | 원동력의종류 | 설비용량(KW) | 공급전압(V) | 주파수(Hz)
613 | 614 | 송준섭 태양광발전소 | 경상북도 칠곡군 가산면 학신로 159 | 태양광 | 73.08 | 380 | 60
614 | 615 | 남명화 태양광발전소 | 경상북도 칠곡군 가산면 학신로 157 | 태양광 | 73.08 | 380 | 60
615 | 616 | 대인 태양광발전소2호 | 경상북도 칠곡군 석적읍 망정리 624번지 | 태양광 | 29.2 | 380 | 60
616 | 617 | 연경 태양광발전소 | 경상북도 칠곡군 지천면 금호로 172-15 | 태양광 | 79.92 | 380 | 60
617 | 618 | 테크 태양광발전소 | 경상북도 칠곡군 가산면 경북대로 1772-2 | 태양광 | 99.6 | 380 | 60
618 | 619 | 신리 태양광발전소 | 경상북도 칠곡군 지천면 신동로2길 91-22 | 태양광 | 48.6 | 380 | 60
619 | 620 | (주)신광쏠라태양광발전소 | 경상북도 칠곡군 지천면 신리 산 22번지 | 태양광 | 995.0 | 380 | 60
620 | 621 | 신광태양광발전소 | 경상북도 칠곡군 지천면 신동로 181 | 태양광 | 10.0 | 380 | 60
621 | 622 | (주)디씨에너지 | 경상북도 칠곡군 가산면 학산리 산 78번지 2호 | 태양광 | 1500.0 | 380V/22,900 | 60
622 | 623 | 전원태양광발전소 | 경상북도 칠곡군 동명면 남원로3길 116 | 태양광 | 10.0 | 220/380 | 60