Overview

Dataset statistics

Number of variables8
Number of observations1131
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory73.0 KiB
Average record size in memory66.1 B

Variable types

Numeric2
DateTime2
Text2
Categorical2

Dataset

Description도내 영업 중인 전기사업체에 대한 사업장명, 도로명주소, 영업상태, 설비용량, 원동력종류에 대한 데이터를 포함합니다.
Author강원특별자치도
URLhttps://www.data.go.kr/data/15029869/fileData.do

Alerts

상태 has constant value ""Constant
원동력종류 is highly imbalanced (91.1%)Imbalance
번호 has unique valuesUnique

Reproduction

Analysis started2024-03-14 23:41:16.675053
Analysis finished2024-03-14 23:41:19.172744
Duration2.5 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct1131
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean566
Minimum1
Maximum1131
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size10.1 KiB
2024-03-15T08:41:19.384285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile57.5
Q1283.5
median566
Q3848.5
95-th percentile1074.5
Maximum1131
Range1130
Interquartile range (IQR)565

Descriptive statistics

Standard deviation326.63588
Coefficient of variation (CV)0.5770952
Kurtosis-1.2
Mean566
Median Absolute Deviation (MAD)283
Skewness0
Sum640146
Variance106691
MonotonicityStrictly increasing
2024-03-15T08:41:19.791457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
753 1
 
0.1%
759 1
 
0.1%
758 1
 
0.1%
757 1
 
0.1%
756 1
 
0.1%
755 1
 
0.1%
754 1
 
0.1%
752 1
 
0.1%
761 1
 
0.1%
Other values (1121) 1121
99.1%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1131 1
0.1%
1130 1
0.1%
1129 1
0.1%
1128 1
0.1%
1127 1
0.1%
1126 1
0.1%
1125 1
0.1%
1124 1
0.1%
1123 1
0.1%
1122 1
0.1%
Distinct378
Distinct (%)33.4%
Missing0
Missing (%)0.0%
Memory size9.0 KiB
Minimum1983-12-29 00:00:00
Maximum2023-03-13 00:00:00
2024-03-15T08:41:20.199549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T08:41:20.648951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

상호
Text

Distinct1118
Distinct (%)98.9%
Missing0
Missing (%)0.0%
Memory size9.0 KiB
2024-03-15T08:41:21.557556image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length38
Median length30
Mean length14.445623
Min length2

Characters and Unicode

Total characters16338
Distinct characters427
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1107 ?
Unique (%)97.9%

Sample

1st row(주)네오플램
2nd row솔라원일호(주)
3rd row경신동해태양광발전소(주)
4th row유한회사 상인베스트먼트(손곡 태양광발전소)
5th row아이아이알 태양광발전소
ValueCountFrequency (%)
태양광발전소 629
25.3%
㈜강원학교태양광 159
 
6.4%
강릉 30
 
1.2%
원주 27
 
1.1%
춘천 21
 
0.8%
주식회사 19
 
0.8%
정중앙쏠라 18
 
0.7%
속초 13
 
0.5%
2호 12
 
0.5%
1호 11
 
0.4%
Other values (1281) 1547
62.2%
2024-03-15T08:41:22.858655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1355
 
8.3%
1156
 
7.1%
1149
 
7.0%
1148
 
7.0%
984
 
6.0%
973
 
6.0%
967
 
5.9%
( 622
 
3.8%
) 620
 
3.8%
373
 
2.3%
Other values (417) 6991
42.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 13122
80.3%
Space Separator 1355
 
8.3%
Open Punctuation 622
 
3.8%
Close Punctuation 620
 
3.8%
Decimal Number 344
 
2.1%
Other Symbol 164
 
1.0%
Uppercase Letter 86
 
0.5%
Lowercase Letter 12
 
0.1%
Dash Punctuation 8
 
< 0.1%
Other Punctuation 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1156
 
8.8%
1149
 
8.8%
1148
 
8.7%
984
 
7.5%
973
 
7.4%
967
 
7.4%
373
 
2.8%
278
 
2.1%
271
 
2.1%
269
 
2.0%
Other values (370) 5554
42.3%
Uppercase Letter
ValueCountFrequency (%)
S 19
22.1%
E 11
12.8%
Y 9
10.5%
K 8
9.3%
G 5
 
5.8%
M 5
 
5.8%
D 4
 
4.7%
A 3
 
3.5%
J 3
 
3.5%
H 3
 
3.5%
Other values (10) 16
18.6%
Decimal Number
ValueCountFrequency (%)
1 112
32.6%
2 96
27.9%
3 48
14.0%
4 22
 
6.4%
5 18
 
5.2%
7 17
 
4.9%
8 10
 
2.9%
6 9
 
2.6%
0 7
 
2.0%
9 5
 
1.5%
Lowercase Letter
ValueCountFrequency (%)
o 3
25.0%
a 2
16.7%
r 2
16.7%
e 1
 
8.3%
t 1
 
8.3%
w 1
 
8.3%
l 1
 
8.3%
j 1
 
8.3%
Other Punctuation
ValueCountFrequency (%)
, 2
40.0%
. 1
20.0%
& 1
20.0%
/ 1
20.0%
Space Separator
ValueCountFrequency (%)
1355
100.0%
Open Punctuation
ValueCountFrequency (%)
( 622
100.0%
Close Punctuation
ValueCountFrequency (%)
) 620
100.0%
Other Symbol
ValueCountFrequency (%)
164
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 13286
81.3%
Common 2954
 
18.1%
Latin 98
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1156
 
8.7%
1149
 
8.6%
1148
 
8.6%
984
 
7.4%
973
 
7.3%
967
 
7.3%
373
 
2.8%
278
 
2.1%
271
 
2.0%
269
 
2.0%
Other values (371) 5718
43.0%
Latin
ValueCountFrequency (%)
S 19
19.4%
E 11
 
11.2%
Y 9
 
9.2%
K 8
 
8.2%
G 5
 
5.1%
M 5
 
5.1%
D 4
 
4.1%
A 3
 
3.1%
J 3
 
3.1%
H 3
 
3.1%
Other values (18) 28
28.6%
Common
ValueCountFrequency (%)
1355
45.9%
( 622
21.1%
) 620
21.0%
1 112
 
3.8%
2 96
 
3.2%
3 48
 
1.6%
4 22
 
0.7%
5 18
 
0.6%
7 17
 
0.6%
8 10
 
0.3%
Other values (8) 34
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 13122
80.3%
ASCII 3052
 
18.7%
None 164
 
1.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1355
44.4%
( 622
20.4%
) 620
20.3%
1 112
 
3.7%
2 96
 
3.1%
3 48
 
1.6%
4 22
 
0.7%
S 19
 
0.6%
5 18
 
0.6%
7 17
 
0.6%
Other values (36) 123
 
4.0%
Hangul
ValueCountFrequency (%)
1156
 
8.8%
1149
 
8.8%
1148
 
8.7%
984
 
7.5%
973
 
7.4%
967
 
7.4%
373
 
2.8%
278
 
2.1%
271
 
2.1%
269
 
2.0%
Other values (370) 5554
42.3%
None
ValueCountFrequency (%)
164
100.0%

설비용량(KW)
Real number (ℝ)

Distinct436
Distinct (%)38.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean450.62216
Minimum3
Maximum6800
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size10.1 KiB
2024-03-15T08:41:23.290881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3
5-th percentile17.67
Q184
median99
Q3495.5
95-th percentile2258
Maximum6800
Range6797
Interquartile range (IQR)411.5

Descriptive statistics

Standard deviation728.61596
Coefficient of variation (CV)1.6169111
Kurtosis7.5049526
Mean450.62216
Median Absolute Deviation (MAD)47.9
Skewness2.3968239
Sum509653.66
Variance530881.21
MonotonicityNot monotonic
2024-03-15T08:41:23.695594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
99.0 269
 
23.8%
99.84 26
 
2.3%
97.2 21
 
1.9%
90.0 17
 
1.5%
99.2 17
 
1.5%
20.0 15
 
1.3%
98.8 14
 
1.2%
30.0 14
 
1.2%
78.75 14
 
1.2%
10.0 13
 
1.1%
Other values (426) 711
62.9%
ValueCountFrequency (%)
3.0 1
 
0.1%
5.0 1
 
0.1%
6.0 1
 
0.1%
8.1 1
 
0.1%
9.0 2
 
0.2%
9.57 1
 
0.1%
9.6 1
 
0.1%
9.8 1
 
0.1%
10.0 13
1.1%
10.78 1
 
0.1%
ValueCountFrequency (%)
6800.0 1
 
0.1%
3000.0 5
0.4%
2999.15 2
 
0.2%
2997.68 1
 
0.1%
2996.91 1
 
0.1%
2996.46 1
 
0.1%
2995.2 2
 
0.2%
2991.6 1
 
0.1%
2980.8 1
 
0.1%
2965.41 1
 
0.1%
Distinct991
Distinct (%)87.6%
Missing0
Missing (%)0.0%
Memory size9.0 KiB
2024-03-15T08:41:24.710240image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length95
Median length59
Mean length25.536693
Min length13

Characters and Unicode

Total characters28882
Distinct characters326
Distinct categories9 ?
Distinct scripts2 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique933 ?
Unique (%)82.5%

Sample

1st row강원특별자치도 원주시 지정면 기업도시로 1
2nd row강원특별자치도 양구군 국토정중앙면 청리 572번지
3rd row강원특별자치도 동해시 공단6로 59-7(구호동)
4th row강원특별자치도 원주시 부론면 손곡리 583번지 ,584,585,산15-1,산15-2
5th row강원도 홍천군 남면 유치리 399번지 ,424,425,426
ValueCountFrequency (%)
강원도 1127
 
17.3%
264
 
4.1%
원주시 121
 
1.9%
강릉시 119
 
1.8%
양구군 107
 
1.6%
1호 101
 
1.6%
철원군 100
 
1.5%
춘천시 95
 
1.5%
고성군 93
 
1.4%
영월군 70
 
1.1%
Other values (1877) 4310
66.2%
2024-03-15T08:41:26.337860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5376
 
18.6%
1430
 
5.0%
1307
 
4.5%
1 1229
 
4.3%
1195
 
4.1%
2 713
 
2.5%
706
 
2.4%
703
 
2.4%
689
 
2.4%
3 655
 
2.3%
Other values (316) 14879
51.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 16461
57.0%
Decimal Number 5608
 
19.4%
Space Separator 5376
 
18.6%
Other Punctuation 483
 
1.7%
Dash Punctuation 427
 
1.5%
Close Punctuation 260
 
0.9%
Open Punctuation 260
 
0.9%
Other Symbol 6
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1430
 
8.7%
1307
 
7.9%
1195
 
7.3%
706
 
4.3%
703
 
4.3%
689
 
4.2%
644
 
3.9%
583
 
3.5%
480
 
2.9%
475
 
2.9%
Other values (297) 8249
50.1%
Decimal Number
ValueCountFrequency (%)
1 1229
21.9%
2 713
12.7%
3 655
11.7%
4 573
10.2%
5 545
9.7%
6 439
 
7.8%
0 377
 
6.7%
7 374
 
6.7%
8 362
 
6.5%
9 341
 
6.1%
Other Punctuation
ValueCountFrequency (%)
, 481
99.6%
· 1
 
0.2%
. 1
 
0.2%
Space Separator
ValueCountFrequency (%)
5376
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 427
100.0%
Close Punctuation
ValueCountFrequency (%)
) 260
100.0%
Open Punctuation
ValueCountFrequency (%)
( 260
100.0%
Other Symbol
ValueCountFrequency (%)
6
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 16461
57.0%
Common 12421
43.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1430
 
8.7%
1307
 
7.9%
1195
 
7.3%
706
 
4.3%
703
 
4.3%
689
 
4.2%
644
 
3.9%
583
 
3.5%
480
 
2.9%
475
 
2.9%
Other values (297) 8249
50.1%
Common
ValueCountFrequency (%)
5376
43.3%
1 1229
 
9.9%
2 713
 
5.7%
3 655
 
5.3%
4 573
 
4.6%
5 545
 
4.4%
, 481
 
3.9%
6 439
 
3.5%
- 427
 
3.4%
0 377
 
3.0%
Other values (9) 1606
 
12.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 16461
57.0%
ASCII 12414
43.0%
CJK Compat 6
 
< 0.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5376
43.3%
1 1229
 
9.9%
2 713
 
5.7%
3 655
 
5.3%
4 573
 
4.6%
5 545
 
4.4%
, 481
 
3.9%
6 439
 
3.5%
- 427
 
3.4%
0 377
 
3.0%
Other values (7) 1599
 
12.9%
Hangul
ValueCountFrequency (%)
1430
 
8.7%
1307
 
7.9%
1195
 
7.3%
706
 
4.3%
703
 
4.3%
689
 
4.2%
644
 
3.9%
583
 
3.5%
480
 
2.9%
475
 
2.9%
Other values (297) 8249
50.1%
CJK Compat
ValueCountFrequency (%)
6
100.0%
None
ValueCountFrequency (%)
· 1
100.0%

원동력종류
Categorical

IMBALANCE 

Distinct6
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size9.0 KiB
태양광
1099 
풍력
 
17
소수력
 
11
가스엔진발전기
 
2
바이오가스
 
1

Length

Max length7
Median length3
Mean length2.9929266
Min length2

Unique

Unique2 ?
Unique (%)0.2%

Sample

1st row태양광
2nd row태양광
3rd row태양광
4th row태양광
5th row태양광

Common Values

ValueCountFrequency (%)
태양광 1099
97.2%
풍력 17
 
1.5%
소수력 11
 
1.0%
가스엔진발전기 2
 
0.2%
바이오가스 1
 
0.1%
기타 1
 
0.1%

Length

2024-03-15T08:41:26.636283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T08:41:26.970670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
태양광 1099
97.2%
풍력 17
 
1.5%
소수력 11
 
1.0%
가스엔진발전기 2
 
0.2%
바이오가스 1
 
0.1%
기타 1
 
0.1%

상태
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size9.0 KiB
사업개시
1131 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row사업개시
2nd row사업개시
3rd row사업개시
4th row사업개시
5th row사업개시

Common Values

ValueCountFrequency (%)
사업개시 1131
100.0%

Length

2024-03-15T08:41:27.334028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T08:41:27.627684image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사업개시 1131
100.0%
Distinct653
Distinct (%)57.7%
Missing0
Missing (%)0.0%
Memory size9.0 KiB
Minimum2005-08-22 00:00:00
Maximum2023-10-05 00:00:00
2024-03-15T08:41:27.942452image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T08:41:28.370216image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2024-03-15T08:41:18.064088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T08:41:17.480574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T08:41:18.332858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T08:41:17.745035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T08:41:28.634139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호설비용량(KW)원동력종류
번호1.0000.5640.187
설비용량(KW)0.5641.0000.495
원동력종류0.1870.4951.000
2024-03-15T08:41:28.873325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호설비용량(KW)원동력종류
번호1.000-0.4560.099
설비용량(KW)-0.4561.0000.198
원동력종류0.0990.1981.000

Missing values

2024-03-15T08:41:18.668920image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T08:41:19.006864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호허가일자상호설비용량(KW)설치장소원동력종류상태사업개시일
012023-03-13(주)네오플램2499.75강원특별자치도 원주시 지정면 기업도시로 1태양광사업개시2023-10-05
122022-11-08솔라원일호(주)1999.79강원특별자치도 양구군 국토정중앙면 청리 572번지태양광사업개시2023-09-18
232022-10-27경신동해태양광발전소(주)1152.8강원특별자치도 동해시 공단6로 59-7(구호동)태양광사업개시2023-05-31
342022-05-10유한회사 상인베스트먼트(손곡 태양광발전소)1499.52강원특별자치도 원주시 부론면 손곡리 583번지 ,584,585,산15-1,산15-2태양광사업개시2023-08-01
452021-08-19아이아이알 태양광발전소1457.28강원도 홍천군 남면 유치리 399번지 ,424,425,426태양광사업개시2022-01-06
562020-09-15(주)고령철석태양발전소(주연태양광발전소)2521.26강원도 화천군 사내면 용담리 596번지 597,598,599,601,산88-1태양광사업개시2021-09-13
672020-07-14태초에너지(주)(황조1호 태양광발전소)2016.9강원도 태백시 반딧불길 35 (황지동)태양광사업개시2021-11-04
782020-07-09(주)구미찬훈발전소2016.0강원도 평창군 방림면 방림리 62번지 63,65-1,99,103-1,104,102태양광사업개시2021-01-04
892020-07-09(주)평창원길리태양광3호(고령백양발전소)2520.0강원도 평창군 방림면 방림리 63번지 65,65-1,66,71,72,73,74,74-1,75,75-2,75-3,76,77,78,99태양광사업개시2021-12-30
9102020-07-03한국동서발전(주)(그린수소 생산 R&D 실증단지 태양광발전소)2995.2강원도 동해시 구호동 223-2태양광사업개시2021-12-16
번호허가일자상호설비용량(KW)설치장소원동력종류상태사업개시일
112111222006-12-20합자회사 현대발전소99.0강원도 양구군 동면 팔랑리 1445번지 8호태양광사업개시2007-12-01
112211232006-12-20속초태양광발전소 2차10.0강원도 속초시 청대마을1길 15 (조양동)태양광사업개시2007-12-01
112311242006-08-18강원대기풍력발전소2750.0강원도 강릉시 왕산면 대기리 2214번지풍력사업개시2007-09-15
112411252006-08-14춘천태양광발전45.0강원도 춘천시 신동 394번지태양광사업개시2007-04-10
112511262006-02-21진부태양광발전소30.0강원도 평창군 진부면 송정택지1길 35태양광사업개시2007-08-01
112611272006-01-26속초 태양광발전소10.0강원도 고성군 토성면 봉포2길 21태양광사업개시2006-06-20
112711282005-11-18한국중부발전(주) (매봉산풍력발전)6800.0강원도 태백시 창죽동 9번지 185호 외풍력사업개시2005-12-30
112811292005-12-05노암 태양광발전소3.0강원도 강릉시 노암동 682번지 5호태양광사업개시2006-02-28
112911302002-01-10한국수력원자력주식회사 (양양양수 소수력)1400.0강원도 양양군 서면 용소리 139-4소수력사업개시2005-08-22
113011311983-12-29애플에너지(주) 소수력발전소2750.0강원도 정선군 정선읍 덕송리 산 25번지소수력사업개시2014-11-18