Overview

Dataset statistics

Number of variables: 6
Number of observations: 592
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 29.0 KiB
Average record size in memory: 50.2 B

Variable types

Numeric: 2
Categorical: 1
Text: 2
DateTime: 1

Dataset

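Description: Installation status of electricity businesses (solar photovoltaic power generation) set up within Cheonan-si since 2014. The data provides the business name, business site, installed capacity, and permit date.
Author: 충청남도 천안시 (Cheonan-si, Chungcheongnam-do)
URL: https://www.data.go.kr/data/15034092/fileData.do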

Alerts

시군명 has constant value "천안시" (Constant)
연번 has unique values (Unique)
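
Both alerts come down to per-column distinct counts: one distinct value suggests Constant, and a distinct count equal to the row count suggests Unique. The following is a minimal sketch under that assumption; `column_alerts` and the toy data are illustrative names and values, not part of ydata-profiling.

```python
def column_alerts(columns):
    """Return {column name: alert label} for constant or all-unique columns.

    `columns` maps a column name to its list of values (hypothetical layout).
    """
    alerts = {}
    for name, values in columns.items():
        if not values:
            continue  # nothing to say about an empty column
        distinct = len(set(values))
        if distinct == 1:
            alerts[name] = "Constant"
        elif distinct == len(values):
            alerts[name] = "Unique"
    return alerts


# Toy data shaped like the first rows of this dataset.
toy = {
    "연번": [1, 2, 3],
    "시군명": ["천안시", "천안시", "천안시"],
    "설비용량(키로와트)": [99.0, 66.0, 99.0],
}
print(column_alerts(toy))
```

A constant column such as 시군명 carries no information for analysis, and a fully unique column such as 연번 behaves like a row identifier, so both are usually excluded from modelling.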

Reproduction

Analysis started: 2023-12-12 09:46:12.911271
Analysis finished: 2023-12-12 09:46:13.924514
Duration: 1.01 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번 (serial number)
Real number (ℝ)

UNIQUE 

Distinct: 592
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 296.5
Minimum: 1
Maximum: 592
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 5.3 KiB

Quantile statistics

Minimum: 1
5-th percentile: 30.55
Q1: 148.75
Median: 296.5
Q3: 444.25
95-th percentile: 562.45
Maximum: 592
Range: 591
Interquartile range (IQR): 295.5

Descriptive statistics

Standard deviation: 171.03996
Coefficient of variation (CV): 0.57686326
Kurtosis: -1.2
Mean: 296.5
Median Absolute Deviation (MAD): 148
Skewness: 0
Sum: 175528
Variance: 29254.667
Monotonicity: Strictly increasing
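
Because 연번 is simply the integers 1 through 592, the quantile and descriptive figures above can be recomputed from scratch. The sketch below assumes linear interpolation between order statistics for percentiles and the sample (n - 1) variance, which are the conventions that reproduce the reported values; `percentile` is a hypothetical helper written for this example, not a library function.

```python
import math
import statistics


def percentile(sorted_values, q):
    """Percentile by linear interpolation between neighbouring order statistics."""
    pos = (len(sorted_values) - 1) * q
    lower = math.floor(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + (sorted_values[upper] - sorted_values[lower]) * frac


serial = list(range(1, 593))  # 연번: 1, 2, ..., 592

p05 = percentile(serial, 0.05)
q1 = percentile(serial, 0.25)
q3 = percentile(serial, 0.75)
p95 = percentile(serial, 0.95)
median = statistics.median(serial)
variance = statistics.variance(serial)  # sample variance, divides by n - 1
std = math.sqrt(variance)
cv = std / statistics.mean(serial)      # coefficient of variation
mad = statistics.median(abs(v - median) for v in serial)

print(p05, q1, median, q3, p95)
print(q3 - q1, sum(serial), variance, std, cv, mad)
```

The zero skewness and the kurtosis of about -1.2 are what an evenly spaced sequence would be expected to show, which is consistent with 연번 being a row counter rather than a measurement.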
Histogram with fixed size bins (bins=50)
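
As a rough illustration of what "fixed size bins" means, the sketch below splits the range from minimum to maximum into 50 intervals of equal width and counts the values in each. The function name `fixed_width_histogram` is an assumption made for this example; the report itself does not show how the plot was computed.

```python
def fixed_width_histogram(values, bins=50):
    """Count values in `bins` equal-width intervals spanning [min, max]."""
    values = list(values)
    lo, hi = min(values), max(values)
    width = (hi - lo) / bins
    counts = [0] * bins
    for v in values:
        if width == 0:
            idx = 0  # all values identical: everything lands in the first bin
        else:
            # The maximum sits on the right edge, so clamp it into the last bin.
            idx = min(int((v - lo) / width), bins - 1)
        counts[idx] += 1
    return counts, width


# 연번 runs from 1 to 592, so each bin is (592 - 1) / 50 wide.
counts, width = fixed_width_histogram(range(1, 593), bins=50)
print(width, counts[:5])
```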
Common values

Value | Count | Frequency (%)
1 | 1 | 0.2%
391 | 1 | 0.2%
393 | 1 | 0.2%
394 | 1 | 0.2%
395 | 1 | 0.2%
396 | 1 | 0.2%
397 | 1 | 0.2%
398 | 1 | 0.2%
399 | 1 | 0.2%
400 | 1 | 0.2%
Other values (582) | 582 | 98.3%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 0.2%
2 | 1 | 0.2%
3 | 1 | 0.2%
4 | 1 | 0.2%
5 | 1 | 0.2%
6 | 1 | 0.2%
7 | 1 | 0.2%
8 | 1 | 0.2%
9 | 1 | 0.2%
10 | 1 | 0.2%

Maximum 10 values

Value | Count | Frequency (%)
592 | 1 | 0.2%
591 | 1 | 0.2%
590 | 1 | 0.2%
589 | 1 | 0.2%
588 | 1 | 0.2%
587 | 1 | 0.2%
586 | 1 | 0.2%
585 | 1 | 0.2%
584 | 1 | 0.2%
583 | 1 | 0.2%

시군명 (city/county name)
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 4.8 KiB
천안시
592 

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 천안시
2nd row: 천안시
3rd row: 천안시
4th row: 천안시
5th row: 천안시

Common Values

Value | Count | Frequency (%)
천안시 | 592 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
천안시 592
100.0%

상호 (business name)
Text

Distinct: 588
Distinct (%): 99.3%
Missing: 0
Missing (%): 0.0%
Memory size: 4.8 KiB

Length

Max length: 36
Median length: 24
Mean length: 11.180743
Min length: 3

Characters and Unicode

Total characters: 6619
Distinct characters: 316
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
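
To make the category counts concrete, the sketch below tallies Unicode general categories for one sample value using Python's standard `unicodedata` module. The mapping from two-letter codes to the long names shown in this report is written out by hand for the few categories that occur here, and `category_counts` is an illustrative helper, not the profiler's own code.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that appear in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Pd": "Dash Punctuation",
    "Pc": "Connector Punctuation",
    "Po": "Other Punctuation",
    "So": "Other Symbol",
    "Sm": "Math Symbol",
}


def category_counts(text):
    """Count characters of `text` by Unicode general category."""
    codes = (unicodedata.category(ch) for ch in text)
    return Counter(CATEGORY_NAMES.get(code, code) for code in codes)


sample = "번영3호 태양광발전소"  # 4th sample row of 상호
counts = category_counts(sample)
print(counts)
```

Hangul syllables fall under Other Letter because the script has no upper or lower case, which is why that category dominates both text variables.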

Unique

Unique: 584
Unique (%): 98.6%

Sample

1st row: 펜타1 태양광발전소
2nd row: 유경 태양광발전소
3rd row: 송연 태양광발전소
4th row: 번영3호 태양광발전소
5th row: 부림쏠라 태양광발전소

Value | Count | Frequency (%)
태양광발전소 | 568 | 45.6%
2호 | 16 | 1.3%
1호 | 13 | 1.0%
3호 | 10 | 0.8%
하나 | 9 | 0.7%
수지양령 | 5 | 0.4%
수지도하 | 5 | 0.4%
대흥 | 5 | 0.4%
5호 | 4 | 0.3%
산우물 | 4 | 0.3%
Other values (566) | 606 | 48.7%

Most occurring characters

ValueCountFrequency (%)
657
 
9.9%
609
 
9.2%
605
 
9.1%
595
 
9.0%
594
 
9.0%
592
 
8.9%
591
 
8.9%
261
 
3.9%
1 106
 
1.6%
2 104
 
1.6%
Other values (306) 1905
28.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5556
83.9%
Space Separator 657
 
9.9%
Decimal Number 320
 
4.8%
Uppercase Letter 43
 
0.6%
Open Punctuation 15
 
0.2%
Close Punctuation 15
 
0.2%
Lowercase Letter 6
 
0.1%
Dash Punctuation 5
 
0.1%
Connector Punctuation 1
 
< 0.1%
Other Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
609
 
11.0%
605
 
10.9%
595
 
10.7%
594
 
10.7%
592
 
10.7%
591
 
10.6%
261
 
4.7%
53
 
1.0%
47
 
0.8%
42
 
0.8%
Other values (269) 1567
28.2%
Uppercase Letter
ValueCountFrequency (%)
S 8
18.6%
E 6
14.0%
J 6
14.0%
H 4
9.3%
M 4
9.3%
L 2
 
4.7%
R 2
 
4.7%
T 2
 
4.7%
C 2
 
4.7%
X 1
 
2.3%
Other values (6) 6
14.0%
Decimal Number
ValueCountFrequency (%)
1 106
33.1%
2 104
32.5%
3 55
17.2%
4 26
 
8.1%
5 13
 
4.1%
6 7
 
2.2%
7 4
 
1.2%
8 3
 
0.9%
9 2
 
0.6%
Lowercase Letter
ValueCountFrequency (%)
j 1
16.7%
r 1
16.7%
a 1
16.7%
l 1
16.7%
o 1
16.7%
s 1
16.7%
Space Separator
ValueCountFrequency (%)
657
100.0%
Open Punctuation
ValueCountFrequency (%)
( 15
100.0%
Close Punctuation
ValueCountFrequency (%)
) 15
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5557
84.0%
Common 1013
 
15.3%
Latin 49
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
609
 
11.0%
605
 
10.9%
595
 
10.7%
594
 
10.7%
592
 
10.7%
591
 
10.6%
261
 
4.7%
53
 
1.0%
47
 
0.8%
42
 
0.8%
Other values (270) 1568
28.2%
Latin
ValueCountFrequency (%)
S 8
16.3%
E 6
12.2%
J 6
12.2%
H 4
 
8.2%
M 4
 
8.2%
L 2
 
4.1%
R 2
 
4.1%
T 2
 
4.1%
C 2
 
4.1%
X 1
 
2.0%
Other values (12) 12
24.5%
Common
ValueCountFrequency (%)
657
64.9%
1 106
 
10.5%
2 104
 
10.3%
3 55
 
5.4%
4 26
 
2.6%
( 15
 
1.5%
) 15
 
1.5%
5 13
 
1.3%
6 7
 
0.7%
- 5
 
0.5%
Other values (4) 10
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5556
83.9%
ASCII 1062
 
16.0%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
657
61.9%
1 106
 
10.0%
2 104
 
9.8%
3 55
 
5.2%
4 26
 
2.4%
( 15
 
1.4%
) 15
 
1.4%
5 13
 
1.2%
S 8
 
0.8%
6 7
 
0.7%
Other values (26) 56
 
5.3%
Hangul
ValueCountFrequency (%)
609
 
11.0%
605
 
10.9%
595
 
10.7%
594
 
10.7%
592
 
10.7%
591
 
10.6%
261
 
4.7%
53
 
1.0%
47
 
0.8%
42
 
0.8%
Other values (269) 1567
28.2%
None
ValueCountFrequency (%)
1
100.0%

사업장소(발전소위치) (business site / plant location)
Text

Distinct: 453
Distinct (%): 76.5%
Missing: 0
Missing (%): 0.0%
Memory size: 4.8 KiB

Length

Max length: 47
Median length: 40
Mean length: 31.25
Min length: 21

Characters and Unicode

Total characters: 18500
Distinct characters: 210
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 382
Unique (%): 64.5%

Sample

1st row: 충청남도 천안시 동남구 풍세면 광풍로 1168
2nd row: 충청남도 천안시 동남구 동면 화계리 285번지
3rd row: 충청남도 천안시 동남구 동면 송연리 42번지 3호
4th row: 충청남도 천안시 서북구 성환읍 어룡2길 157, A,B동 (건물지붕)
5th row: 충청남도 천안시 동남구 풍세면 성엽자기길 54
ValueCountFrequency (%)
천안시 585
 
13.6%
충청남도 574
 
13.3%
동남구 336
 
7.8%
329
 
7.6%
서북구 255
 
5.9%
건물 237
 
5.5%
성환읍 148
 
3.4%
동면 81
 
1.9%
풍세면 73
 
1.7%
양령리 60
 
1.4%
Other values (722) 1623
37.7%

Most occurring characters

ValueCountFrequency (%)
3716
20.1%
987
 
5.3%
655
 
3.5%
611
 
3.3%
607
 
3.3%
594
 
3.2%
589
 
3.2%
588
 
3.2%
579
 
3.1%
562
 
3.0%
Other values (200) 9012
48.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10862
58.7%
Space Separator 3716
 
20.1%
Decimal Number 2602
 
14.1%
Dash Punctuation 429
 
2.3%
Other Punctuation 375
 
2.0%
Close Punctuation 250
 
1.4%
Open Punctuation 250
 
1.4%
Uppercase Letter 15
 
0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
987
 
9.1%
655
 
6.0%
611
 
5.6%
607
 
5.6%
594
 
5.5%
589
 
5.4%
588
 
5.4%
579
 
5.3%
562
 
5.2%
399
 
3.7%
Other values (176) 4691
43.2%
Decimal Number
ValueCountFrequency (%)
1 551
21.2%
2 371
14.3%
3 289
11.1%
4 275
10.6%
5 254
9.8%
8 217
 
8.3%
6 193
 
7.4%
0 163
 
6.3%
7 160
 
6.1%
9 129
 
5.0%
Uppercase Letter
ValueCountFrequency (%)
A 6
40.0%
B 5
33.3%
N 1
 
6.7%
M 1
 
6.7%
L 1
 
6.7%
C 1
 
6.7%
Other Punctuation
ValueCountFrequency (%)
, 370
98.7%
. 3
 
0.8%
: 2
 
0.5%
Space Separator
ValueCountFrequency (%)
3716
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 429
100.0%
Close Punctuation
ValueCountFrequency (%)
) 250
100.0%
Open Punctuation
ValueCountFrequency (%)
( 250
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 10862
58.7%
Common 7623
41.2%
Latin 15
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
987
 
9.1%
655
 
6.0%
611
 
5.6%
607
 
5.6%
594
 
5.5%
589
 
5.4%
588
 
5.4%
579
 
5.3%
562
 
5.2%
399
 
3.7%
Other values (176) 4691
43.2%
Common
ValueCountFrequency (%)
3716
48.7%
1 551
 
7.2%
- 429
 
5.6%
2 371
 
4.9%
, 370
 
4.9%
3 289
 
3.8%
4 275
 
3.6%
5 254
 
3.3%
) 250
 
3.3%
( 250
 
3.3%
Other values (8) 868
 
11.4%
Latin
ValueCountFrequency (%)
A 6
40.0%
B 5
33.3%
N 1
 
6.7%
M 1
 
6.7%
L 1
 
6.7%
C 1
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10862
58.7%
ASCII 7638
41.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3716
48.7%
1 551
 
7.2%
- 429
 
5.6%
2 371
 
4.9%
, 370
 
4.8%
3 289
 
3.8%
4 275
 
3.6%
5 254
 
3.3%
) 250
 
3.3%
( 250
 
3.3%
Other values (14) 883
 
11.6%
Hangul
ValueCountFrequency (%)
987
 
9.1%
655
 
6.0%
611
 
5.6%
607
 
5.6%
594
 
5.5%
589
 
5.4%
588
 
5.4%
579
 
5.3%
562
 
5.2%
399
 
3.7%
Other values (176) 4691
43.2%

설비용량(키로와트) (installed capacity, kW)
Real number (ℝ)

Distinct: 261
Distinct (%): 44.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 102.85549
Minimum: 10
Maximum: 599.2
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 5.3 KiB

Quantile statistics

Minimum: 10
5-th percentile: 19.55
Q1: 74.8
Median: 98.44
Q3: 99.68
95-th percentile: 291.4425
Maximum: 599.2
Range: 589.2
Interquartile range (IQR): 24.88

Descriptive statistics

Standard deviation: 80.909627
Coefficient of variation (CV): 0.78663402
Kurtosis: 11.339459
Mean: 102.85549
Median Absolute Deviation (MAD): 1.46
Skewness: 3.0527229
Sum: 60890.45
Variance: 6546.3677
Monotonicity: Not monotonic
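
Several of the 설비용량(키로와트) figures above are derived from one another, which gives a quick consistency check on the report. The sketch below uses only numbers printed in this section; the dictionary `reported` and its key names are illustrative, and the relationships assumed are the standard ones (mean = sum / n, CV = standard deviation / mean, IQR = Q3 - Q1, variance = standard deviation squared).

```python
# Values copied from the statistics reported for 설비용량(키로와트).
reported = {
    "n": 592,
    "sum": 60890.45,
    "std": 80.909627,
    "q1": 74.8,
    "q3": 99.68,
    "min": 10.0,
    "max": 599.2,
}

mean = reported["sum"] / reported["n"]
cv = reported["std"] / mean                   # coefficient of variation
iqr = reported["q3"] - reported["q1"]         # interquartile range
value_range = reported["max"] - reported["min"]
variance = reported["std"] ** 2

print(round(mean, 5), round(cv, 8), round(iqr, 2), round(value_range, 1), round(variance, 4))
```

The gap between the narrow IQR and the wide range, together with the strong positive skewness, indicates that most installations cluster just under 100 kW while a few much larger plants stretch the upper tail.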
Histogram with fixed size bins (bins=50)
Common values

Value | Count | Frequency (%)
98.28 | 44 | 7.4%
99.9 | 32 | 5.4%
99.0 | 27 | 4.6%
99.45 | 18 | 3.0%
99.76 | 18 | 3.0%
98.77 | 17 | 2.9%
98.55 | 14 | 2.4%
97.92 | 13 | 2.2%
99.51 | 11 | 1.9%
99.96 | 11 | 1.9%
Other values (251) | 387 | 65.4%

Minimum 10 values

Value | Count | Frequency (%)
10.0 | 1 | 0.2%
10.24 | 1 | 0.2%
10.8 | 1 | 0.2%
11.34 | 1 | 0.2%
11.83 | 1 | 0.2%
13.2 | 1 | 0.2%
15.12 | 1 | 0.2%
15.6 | 1 | 0.2%
16.15 | 1 | 0.2%
16.56 | 1 | 0.2%

Maximum 10 values

Value | Count | Frequency (%)
599.2 | 1 | 0.2%
499.5 | 1 | 0.2%
499.2 | 1 | 0.2%
498.96 | 1 | 0.2%
497.42 | 1 | 0.2%
495.55 | 1 | 0.2%
495.36 | 1 | 0.2%
492.48 | 1 | 0.2%
480.0 | 1 | 0.2%
459.84 | 1 | 0.2%

허가일 (permit date)
DateTime

Distinct: 249
Distinct (%): 42.1%
Missing: 0
Missing (%): 0.0%
Memory size: 4.8 KiB
Minimum: 2014-03-27 00:00:00
Maximum: 2022-04-06 00:00:00
Histogram with fixed size bins (bins=50)

Interactions

[Figures: pairwise interaction plots of the numeric variables 연번 and 설비용량(키로와트)]

Correlations

 | 연번 | 설비용량(키로와트)
연번 | 1.000 | 0.376
설비용량(키로와트) | 0.376 | 1.000

 | 연번 | 설비용량(키로와트)
연번 | 1.000 | -0.003
설비용량(키로와트) | -0.003 | 1.000
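
The report does not say which coefficient each matrix holds. One common reason for two correlation figures to disagree this much is that one is rank-based and the other is linear, so the sketch below contrasts Pearson and Spearman on toy data purely as an illustration of that effect. The assignment of either coefficient to the matrices above is an assumption, and the helper names and toy values are hypothetical.

```python
import math


def pearson(x, y):
    """Pearson linear correlation of two equal-length sequences."""
    mx = sum(x) / len(x)
    my = sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=values.__getitem__)
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman rank correlation: Pearson applied to the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Toy data: perfectly monotone, but one extreme value breaks linearity.
x = [1, 2, 3, 4, 5, 6]
y = [10, 11, 12, 13, 14, 500]
print(pearson(x, y), spearman(x, y))
```

With a heavily skewed variable such as 설비용량(키로와트), a handful of very large values can dominate a linear coefficient while barely affecting a rank-based one, so the two are best read side by side.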

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
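
Both nullity views summarise the same thing: how many cells are missing in each column. A minimal sketch of that count in plain Python follows, assuming rows are dictionaries and that `None` or an empty string marks a missing cell; `missing_counts` and the toy rows are illustrative only.

```python
def missing_counts(rows, columns):
    """Count missing cells (None or empty string) per column."""
    counts = {name: 0 for name in columns}
    for row in rows:
        for name in columns:
            if row.get(name) in (None, ""):
                counts[name] += 1
    return counts


columns = ["연번", "시군명", "설비용량(키로와트)"]
toy_rows = [
    {"연번": 1, "시군명": "천안시", "설비용량(키로와트)": 99.0},
    {"연번": 2, "시군명": "천안시", "설비용량(키로와트)": None},
    {"연번": 3, "시군명": "", "설비용량(키로와트)": 96.6},
]
print(missing_counts(toy_rows, columns))
```

For this dataset every column would come out at zero, matching the "Missing cells: 0" figure in the overview.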

Sample

First rows

(index) | 연번 | 시군명 | 상호 | 사업장소(발전소위치) | 설비용량(키로와트) | 허가일
0 | 1 | 천안시 | 펜타1 태양광발전소 | 충청남도 천안시 동남구 풍세면 광풍로 1168 | 99.0 | 2014-06-13
1 | 2 | 천안시 | 유경 태양광발전소 | 충청남도 천안시 동남구 동면 화계리 285번지 | 66.0 | 2014-06-12
2 | 3 | 천안시 | 송연 태양광발전소 | 충청남도 천안시 동남구 동면 송연리 42번지 3호 | 96.6 | 2014-12-16
3 | 4 | 천안시 | 번영3호 태양광발전소 | 충청남도 천안시 서북구 성환읍 어룡2길 157, A,B동 (건물지붕) | 99.0 | 2014-09-19
4 | 5 | 천안시 | 부림쏠라 태양광발전소 | 충청남도 천안시 동남구 풍세면 성엽자기길 54 | 99.0 | 2014-11-11
5 | 6 | 천안시 | 엘리야 태양광발전소 | 충청남도 천안시 동남구 동면 화계리 47번지 1호 산38-2 | 99.0 | 2015-07-24
6 | 7 | 천안시 | 야베스 태양광발전소 | 충청남도 천안시 동남구 동면 화계리 47번지 1호 , 산38-2 | 99.0 | 2015-07-24
7 | 8 | 천안시 | 성산태양광발전소 | 충청남도 천안시 동남구 목천읍 동리 134번지 4호 (토지위) | 70.13 | 2015-07-30
8 | 9 | 천안시 | 은열 태양광발전소 | 충청남도 천안시 동남구 동면 구도3길 97-29, 4동 | 97.47 | 2015-08-06
9 | 10 | 천안시 | 한길 태양광발전소 | 충청남도 천안시 동남구 성남면 화성리 259번지 | 98.18 | 2015-04-09

Last rows

(index) | 연번 | 시군명 | 상호 | 사업장소(발전소위치) | 설비용량(키로와트) | 허가일
582 | 583 | 천안시 | 순수진1호 태양광발전소 | 충청남도 천안시 서북구 성환읍 양령리 553-2, 건물 위 | 98.28 | 2022-02-17
583 | 584 | 천안시 | 순수화2호 태양광발전소 | 충청남도 천안시 서북구 성환읍 양령리 553-2, 건물 위 | 98.28 | 2022-02-17
584 | 585 | 천안시 | 순수화1호 태양광발전소 | 충청남도 천안시 서북구 성환읍 양령리 553-2, 건물 위 | 98.28 | 2022-02-17
585 | 586 | 천안시 | 순수영2호 태양광발전소 | 충청남도 천안시 서북구 성환읍 양령리 553-2, 건물 위 | 98.28 | 2022-02-17
586 | 587 | 천안시 | 순수영1호 태양광발전소 | 충청남도 천안시 서북구 성환읍 양령리 553-2, 건물 위 | 98.28 | 2022-02-17
587 | 588 | 천안시 | 양령파워2호 태양광발전소 | 충청남도 천안시 서북구 양령리 556-3, 사동, 건물 위 | 99.51 | 2022-04-06
588 | 589 | 천안시 | 양령파워1호 태양광발전소 | 충청남도 천안시 서북구 성환읍 양령리 556-3, 마동, 건물 위 | 99.51 | 2022-04-06
589 | 590 | 천안시 | 양령드림3호 태양광발전소 | 충청남도 천안시 서북구 성환읍 양령리 556-3, 라동, 바동, 건물 위 | 98.98 | 2022-04-06
590 | 591 | 천안시 | 양령드림2호 태양광발전소 | 충청남도 천안시 서북구 성환읍 양령리 556-3, 나동, 건물 위 | 99.51 | 2022-04-06
591 | 592 | 천안시 | 양령드림1호 태양광발전소 | 충청남도 천안시 서북구 성환읍 양령리 556-3, 가동, 다동, 건물 위 | 98.44 | 2022-04-06