Overview

Dataset statistics

Number of variables8
Number of observations1313
Missing cells174
Missing cells (%)1.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory84.8 KiB
Average record size in memory66.1 B

Variable types

Numeric2
Text2
DateTime2
Categorical2

Dataset

Description충청남도 서천군 관내의 태양광 인허가 현황 데이터로 상호, 설치장소, 설비용량, 허가일, 사업개시일 등을 제공합니다.
URLhttps://www.data.go.kr/data/15113466/fileData.do

Alerts

데이터기준일 has constant value ""Constant
영업구분 is highly imbalanced (55.5%)Imbalance
사업개시일 has 174 (13.3%) missing valuesMissing
순번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 09:43:52.135006
Analysis finished2023-12-12 09:43:53.381231
Duration1.25 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct1313
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean657
Minimum1
Maximum1313
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size11.7 KiB
2023-12-12T18:43:53.482744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile66.6
Q1329
median657
Q3985
95-th percentile1247.4
Maximum1313
Range1312
Interquartile range (IQR)656

Descriptive statistics

Standard deviation379.17476
Coefficient of variation (CV)0.57713054
Kurtosis-1.2
Mean657
Median Absolute Deviation (MAD)328
Skewness0
Sum862641
Variance143773.5
MonotonicityStrictly increasing
2023-12-12T18:43:53.647338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
883 1
 
0.1%
881 1
 
0.1%
880 1
 
0.1%
879 1
 
0.1%
878 1
 
0.1%
877 1
 
0.1%
876 1
 
0.1%
875 1
 
0.1%
874 1
 
0.1%
Other values (1303) 1303
99.2%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1313 1
0.1%
1312 1
0.1%
1311 1
0.1%
1310 1
0.1%
1309 1
0.1%
1308 1
0.1%
1307 1
0.1%
1306 1
0.1%
1305 1
0.1%
1304 1
0.1%

상호
Text

Distinct1273
Distinct (%)97.0%
Missing0
Missing (%)0.0%
Memory size10.4 KiB
2023-12-12T18:43:54.142973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length16
Mean length5.345773
Min length1

Characters and Unicode

Total characters7019
Distinct characters357
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1239 ?
Unique (%)94.4%

Sample

1st row당정
2nd row해오름2호
3rd row해오름1호
4th row기업과사람들3
5th row기업과사람들2
ValueCountFrequency (%)
태양광발전소 209
 
13.2%
7
 
0.4%
1호 6
 
0.4%
월리2호 6
 
0.4%
대등 5
 
0.3%
2호 5
 
0.3%
3호 4
 
0.3%
하늘 4
 
0.3%
서천 4
 
0.3%
와니 4
 
0.3%
Other values (1251) 1333
84.0%
2023-12-12T18:43:54.784812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
524
 
7.5%
335
 
4.8%
320
 
4.6%
316
 
4.5%
303
 
4.3%
303
 
4.3%
294
 
4.2%
275
 
3.9%
1 235
 
3.3%
2 205
 
2.9%
Other values (347) 3909
55.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5782
82.4%
Decimal Number 765
 
10.9%
Space Separator 275
 
3.9%
Uppercase Letter 97
 
1.4%
Open Punctuation 37
 
0.5%
Close Punctuation 37
 
0.5%
Lowercase Letter 12
 
0.2%
Dash Punctuation 9
 
0.1%
Other Punctuation 4
 
0.1%
Other Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
524
 
9.1%
335
 
5.8%
320
 
5.5%
316
 
5.5%
303
 
5.2%
303
 
5.2%
294
 
5.1%
133
 
2.3%
105
 
1.8%
77
 
1.3%
Other values (308) 3072
53.1%
Uppercase Letter
ValueCountFrequency (%)
S 33
34.0%
G 20
20.6%
J 8
 
8.2%
K 7
 
7.2%
T 5
 
5.2%
H 5
 
5.2%
B 5
 
5.2%
M 4
 
4.1%
O 3
 
3.1%
N 2
 
2.1%
Other values (4) 5
 
5.2%
Decimal Number
ValueCountFrequency (%)
1 235
30.7%
2 205
26.8%
3 119
15.6%
4 66
 
8.6%
5 41
 
5.4%
6 31
 
4.1%
7 24
 
3.1%
8 20
 
2.6%
9 13
 
1.7%
0 11
 
1.4%
Lowercase Letter
ValueCountFrequency (%)
e 2
16.7%
o 2
16.7%
i 2
16.7%
k 2
16.7%
c 2
16.7%
g 1
8.3%
s 1
8.3%
Other Punctuation
ValueCountFrequency (%)
" 2
50.0%
& 1
25.0%
: 1
25.0%
Space Separator
ValueCountFrequency (%)
275
100.0%
Open Punctuation
ValueCountFrequency (%)
( 37
100.0%
Close Punctuation
ValueCountFrequency (%)
) 37
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5783
82.4%
Common 1127
 
16.1%
Latin 109
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
524
 
9.1%
335
 
5.8%
320
 
5.5%
316
 
5.5%
303
 
5.2%
303
 
5.2%
294
 
5.1%
133
 
2.3%
105
 
1.8%
77
 
1.3%
Other values (309) 3073
53.1%
Latin
ValueCountFrequency (%)
S 33
30.3%
G 20
18.3%
J 8
 
7.3%
K 7
 
6.4%
T 5
 
4.6%
H 5
 
4.6%
B 5
 
4.6%
M 4
 
3.7%
O 3
 
2.8%
N 2
 
1.8%
Other values (11) 17
15.6%
Common
ValueCountFrequency (%)
275
24.4%
1 235
20.9%
2 205
18.2%
3 119
10.6%
4 66
 
5.9%
5 41
 
3.6%
( 37
 
3.3%
) 37
 
3.3%
6 31
 
2.8%
7 24
 
2.1%
Other values (7) 57
 
5.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5782
82.4%
ASCII 1236
 
17.6%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
524
 
9.1%
335
 
5.8%
320
 
5.5%
316
 
5.5%
303
 
5.2%
303
 
5.2%
294
 
5.1%
133
 
2.3%
105
 
1.8%
77
 
1.3%
Other values (308) 3072
53.1%
ASCII
ValueCountFrequency (%)
275
22.2%
1 235
19.0%
2 205
16.6%
3 119
9.6%
4 66
 
5.3%
5 41
 
3.3%
( 37
 
3.0%
) 37
 
3.0%
S 33
 
2.7%
6 31
 
2.5%
Other values (28) 157
12.7%
None
ValueCountFrequency (%)
1
100.0%
Distinct1028
Distinct (%)78.3%
Missing0
Missing (%)0.0%
Memory size10.4 KiB
2023-12-12T18:43:55.118942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length78
Median length58
Mean length30.147753
Min length18

Characters and Unicode

Total characters39584
Distinct characters204
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique899 ?
Unique (%)68.5%

Sample

1st row충청남도 서천군 종천면 당정로 47-20, , 557, 7동 (건물 위)
2nd row충청남도 서천군 서천읍 두왕길1번길 68-3, 나동, 다동
3rd row충청남도 서천군 서천읍 두왕길1번길 68-3, 가동
4th row충청남도 서천군 종천면 종천공단길 47, 씨엔테크(주) 1,2동
5th row충청남도 서천군 종천면 종천공단길 47, 씨엔테크(주) 1, 5동
ValueCountFrequency (%)
충청남도 1312
 
14.4%
서천군 1312
 
14.4%
467
 
5.1%
서면 350
 
3.9%
건물 333
 
3.7%
231
 
2.5%
화양면 151
 
1.7%
마서면 150
 
1.7%
종천면 148
 
1.6%
원두리 101
 
1.1%
Other values (1381) 4525
49.8%
2023-12-12T18:43:55.658248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7788
19.7%
1954
 
4.9%
1598
 
4.0%
1 1360
 
3.4%
1344
 
3.4%
1339
 
3.4%
1337
 
3.4%
1330
 
3.4%
1312
 
3.3%
1207
 
3.0%
Other values (194) 19015
48.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 20950
52.9%
Space Separator 7788
 
19.7%
Decimal Number 7744
 
19.6%
Other Punctuation 1061
 
2.7%
Dash Punctuation 975
 
2.5%
Open Punctuation 522
 
1.3%
Close Punctuation 522
 
1.3%
Math Symbol 19
 
< 0.1%
Uppercase Letter 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1954
 
9.3%
1598
 
7.6%
1344
 
6.4%
1339
 
6.4%
1337
 
6.4%
1330
 
6.3%
1312
 
6.3%
1207
 
5.8%
739
 
3.5%
669
 
3.2%
Other values (174) 8121
38.8%
Decimal Number
ValueCountFrequency (%)
1 1360
17.6%
2 1179
15.2%
3 1018
13.1%
5 873
11.3%
4 850
11.0%
7 591
7.6%
6 556
7.2%
8 461
 
6.0%
0 458
 
5.9%
9 398
 
5.1%
Uppercase Letter
ValueCountFrequency (%)
A 1
33.3%
C 1
33.3%
B 1
33.3%
Other Punctuation
ValueCountFrequency (%)
, 1060
99.9%
/ 1
 
0.1%
Space Separator
ValueCountFrequency (%)
7788
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 975
100.0%
Open Punctuation
ValueCountFrequency (%)
( 522
100.0%
Close Punctuation
ValueCountFrequency (%)
) 522
100.0%
Math Symbol
ValueCountFrequency (%)
~ 19
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 20950
52.9%
Common 18631
47.1%
Latin 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1954
 
9.3%
1598
 
7.6%
1344
 
6.4%
1339
 
6.4%
1337
 
6.4%
1330
 
6.3%
1312
 
6.3%
1207
 
5.8%
739
 
3.5%
669
 
3.2%
Other values (174) 8121
38.8%
Common
ValueCountFrequency (%)
7788
41.8%
1 1360
 
7.3%
2 1179
 
6.3%
, 1060
 
5.7%
3 1018
 
5.5%
- 975
 
5.2%
5 873
 
4.7%
4 850
 
4.6%
7 591
 
3.2%
6 556
 
3.0%
Other values (7) 2381
 
12.8%
Latin
ValueCountFrequency (%)
A 1
33.3%
C 1
33.3%
B 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 20950
52.9%
ASCII 18634
47.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
7788
41.8%
1 1360
 
7.3%
2 1179
 
6.3%
, 1060
 
5.7%
3 1018
 
5.5%
- 975
 
5.2%
5 873
 
4.7%
4 850
 
4.6%
7 591
 
3.2%
6 556
 
3.0%
Other values (10) 2384
 
12.8%
Hangul
ValueCountFrequency (%)
1954
 
9.3%
1598
 
7.6%
1344
 
6.4%
1339
 
6.4%
1337
 
6.4%
1330
 
6.3%
1312
 
6.3%
1207
 
5.8%
739
 
3.5%
669
 
3.2%
Other values (174) 8121
38.8%

설비용량(KW)
Real number (ℝ)

Distinct318
Distinct (%)24.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean123.74097
Minimum10
Maximum999.96
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size11.7 KiB
2023-12-12T18:43:55.845617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10
5-th percentile28.794
Q197.47
median99.36
Q399.84
95-th percentile474.924
Maximum999.96
Range989.96
Interquartile range (IQR)2.37

Descriptive statistics

Standard deviation136.41658
Coefficient of variation (CV)1.1024366
Kurtosis21.879746
Mean123.74097
Median Absolute Deviation (MAD)0.54
Skewness4.4041409
Sum162471.9
Variance18609.482
MonotonicityNot monotonic
2023-12-12T18:43:56.019604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
99.9 133
 
10.1%
99.0 124
 
9.4%
99.84 59
 
4.5%
99.96 43
 
3.3%
99.36 37
 
2.8%
99.45 35
 
2.7%
99.75 34
 
2.6%
98.28 33
 
2.5%
99.12 32
 
2.4%
99.65 29
 
2.2%
Other values (308) 754
57.4%
ValueCountFrequency (%)
10.0 1
 
0.1%
10.98 1
 
0.1%
12.0 2
0.2%
15.0 3
0.2%
15.4 1
 
0.1%
17.0 1
 
0.1%
17.02 1
 
0.1%
17.4 1
 
0.1%
17.49 1
 
0.1%
18.0 3
0.2%
ValueCountFrequency (%)
999.96 1
 
0.1%
999.35 1
 
0.1%
998.96 1
 
0.1%
998.4 1
 
0.1%
997.92 5
0.4%
997.9 2
 
0.2%
995.78 1
 
0.1%
994.95 1
 
0.1%
985.5 1
 
0.1%
982.8 2
 
0.2%
Distinct405
Distinct (%)30.8%
Missing0
Missing (%)0.0%
Memory size10.4 KiB
Minimum2008-09-09 00:00:00
Maximum2023-03-14 00:00:00
2023-12-12T18:43:56.190785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:43:56.362565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

사업개시일
Date

MISSING 

Distinct335
Distinct (%)29.4%
Missing174
Missing (%)13.3%
Memory size10.4 KiB
Minimum2009-08-06 00:00:00
Maximum2023-03-03 00:00:00
2023-12-12T18:43:56.508827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:43:56.642520image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

영업구분
Categorical

IMBALANCE 

Distinct3
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size10.4 KiB
사업개시
1127 
인허가
138 
공사진행
 
48

Length

Max length4
Median length4
Mean length3.8948972
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row사업개시
2nd row인허가
3rd row인허가
4th row인허가
5th row인허가

Common Values

ValueCountFrequency (%)
사업개시 1127
85.8%
인허가 138
 
10.5%
공사진행 48
 
3.7%

Length

2023-12-12T18:43:56.779264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:43:56.905789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사업개시 1127
85.8%
인허가 138
 
10.5%
공사진행 48
 
3.7%

데이터기준일
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size10.4 KiB
2023-03-15
1313 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-03-15
2nd row2023-03-15
3rd row2023-03-15
4th row2023-03-15
5th row2023-03-15

Common Values

ValueCountFrequency (%)
2023-03-15 1313
100.0%

Length

2023-12-12T18:43:57.047601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:43:57.136285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-03-15 1313
100.0%

Interactions

2023-12-12T18:43:52.914229image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:43:52.671232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:43:53.033564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:43:52.799854image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T18:43:57.187222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번설비용량(KW)영업구분
순번1.0000.3210.486
설비용량(KW)0.3211.0000.121
영업구분0.4860.1211.000
2023-12-12T18:43:57.267260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번설비용량(KW)영업구분
순번1.000-0.0690.333
설비용량(KW)-0.0691.0000.076
영업구분0.3330.0761.000

Missing values

2023-12-12T18:43:53.168231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:43:53.324557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번상호설치장소소재지설비용량(KW)허가일자사업개시일영업구분데이터기준일
01당정충청남도 서천군 종천면 당정로 47-20, , 557, 7동 (건물 위)69.32021-06-292021-12-30사업개시2023-03-15
12해오름2호충청남도 서천군 서천읍 두왕길1번길 68-3, 나동, 다동85.682023-03-14<NA>인허가2023-03-15
23해오름1호충청남도 서천군 서천읍 두왕길1번길 68-3, 가동99.962023-03-14<NA>인허가2023-03-15
34기업과사람들3충청남도 서천군 종천면 종천공단길 47, 씨엔테크(주) 1,2동99.452023-03-08<NA>인허가2023-03-15
45기업과사람들2충청남도 서천군 종천면 종천공단길 47, 씨엔테크(주) 1, 5동99.452023-03-08<NA>인허가2023-03-15
56기업과사람들1충청남도 서천군 종천면 종천공단길 47, 씨엔테크(주)99.452023-03-08<NA>인허가2023-03-15
67삼부자4호충청남도 서천군 기산면 원길길27번길 88-23, 1동99.52023-02-27<NA>인허가2023-03-15
78삼부자3호충청남도 서천군 기산면 원길길27번길 88-23, 1동99.52023-02-27<NA>인허가2023-03-15
89삼부자2호충청남도 서천군 기산면 원길길27번길 8899.52023-02-27<NA>인허가2023-03-15
910삼부자1호충청남도 서천군 기산면 원길길27번길 8899.52023-02-27<NA>인허가2023-03-15
순번상호설치장소소재지설비용량(KW)허가일자사업개시일영업구분데이터기준일
13031304마명2호 태양광발전소충청남도 서천군 마산면 삼일로381번길 28997.922018-09-04<NA>사업개시2023-03-15
13041305마명1호 태양광발전소충청남도 서천군 마산면 삼일로381번길 28997.922018-09-042020-04-24사업개시2023-03-15
13051306(주)소망 태양광발전소충청남도 서천군 서면 부원길 12-2997.92018-09-042020-01-06사업개시2023-03-15
13061307(주)청수 태양광발전소충청남도 서천군 서면 부원길 12-2997.92018-09-042020-01-02사업개시2023-03-15
13071308봉산1호 태양광발전소충청남도 서천군 마산면 삼일로248번길 16-17985.52018-09-04<NA>인허가2023-03-15
13081309서천4호 태양광발전소충청남도 서천군 판교면 만금길54번길 126994.952018-02-132019-08-06사업개시2023-03-15
13091310㈜아침태양광2호서천 태양광발전소충청남도 서천군 비인면 선도길43번길 56-77997.922017-12-282020-05-01사업개시2023-03-15
13101311지에스피브이3호 태양광충청남도 서천군 마산면 삼일로562번길 26602.532017-11-162019-06-12사업개시2023-03-15
13111312지에스피브이2호 태양광충청남도 서천군 마산면 삼일로562번길 26999.352017-11-162019-06-12사업개시2023-03-15
13121313문산 태양광발전소충청남도 서천군 문산면 구동길 282998.962016-04-112019-09-09사업개시2023-03-15