Overview

Dataset statistics

Number of variables4
Number of observations313
Missing cells0
Missing cells (%)0.0%
Duplicate rows28
Duplicate rows (%)8.9%
Total size in memory10.2 KiB
Average record size in memory33.4 B

Variable types

Text2
Numeric1
Categorical1

Dataset

Description태양광 발전사업 허가내역으로 발전시설의 위치, 발전시설의 허가자, 발전시설의 설비용량 및 발전시설 사업 허가 후 실제 사업 개시여부 등의 정보를 포함함
Author경상남도 함안군
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15113391

Alerts

Dataset has 28 (8.9%) duplicate rowsDuplicates

Reproduction

Analysis started2024-04-20 18:39:39.276382
Analysis finished2024-04-20 18:39:40.648542
Duration1.37 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

위치
Text

Distinct101
Distinct (%)32.3%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
2024-04-21T03:39:40.779498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length16
Mean length16.444089
Min length16

Characters and Unicode

Total characters5147
Distinct characters93
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique42 ?
Unique (%)13.4%

Sample

1st row경상남도 함안군 칠서면 대치리
2nd row경상남도 함안군 법수면 강주리
3rd row경상남도 함안군 법수면 우거리
4th row경상남도 함안군 법수면 백산리
5th row경상남도 함안군 법수면 백산리
ValueCountFrequency (%)
경상남도 313
25.0%
함안군 313
25.0%
법수면 87
 
6.9%
군북면 49
 
3.9%
칠원읍 46
 
3.7%
칠북면 36
 
2.9%
백산리 31
 
2.5%
가야읍 28
 
2.2%
대산면 25
 
2.0%
칠서면 24
 
1.9%
Other values (79) 302
24.1%
2024-04-21T03:39:41.045217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1050
20.4%
362
 
7.0%
333
 
6.5%
323
 
6.3%
322
 
6.3%
314
 
6.1%
313
 
6.1%
313
 
6.1%
274
 
5.3%
239
 
4.6%
Other values (83) 1304
25.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4090
79.5%
Space Separator 1050
 
20.4%
Decimal Number 5
 
0.1%
Other Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
362
 
8.9%
333
 
8.1%
323
 
7.9%
322
 
7.9%
314
 
7.7%
313
 
7.7%
313
 
7.7%
274
 
6.7%
239
 
5.8%
108
 
2.6%
Other values (79) 1189
29.1%
Decimal Number
ValueCountFrequency (%)
1 4
80.0%
2 1
 
20.0%
Space Separator
ValueCountFrequency (%)
1050
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4090
79.5%
Common 1057
 
20.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
362
 
8.9%
333
 
8.1%
323
 
7.9%
322
 
7.9%
314
 
7.7%
313
 
7.7%
313
 
7.7%
274
 
6.7%
239
 
5.8%
108
 
2.6%
Other values (79) 1189
29.1%
Common
ValueCountFrequency (%)
1050
99.3%
1 4
 
0.4%
, 2
 
0.2%
2 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4090
79.5%
ASCII 1057
 
20.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1050
99.3%
1 4
 
0.4%
, 2
 
0.2%
2 1
 
0.1%
Hangul
ValueCountFrequency (%)
362
 
8.9%
333
 
8.1%
323
 
7.9%
322
 
7.9%
314
 
7.7%
313
 
7.7%
313
 
7.7%
274
 
6.7%
239
 
5.8%
108
 
2.6%
Other values (79) 1189
29.1%

성명
Text

Distinct169
Distinct (%)54.0%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
2024-04-21T03:39:41.333416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length2.9968051
Min length2

Characters and Unicode

Total characters938
Distinct characters108
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique102 ?
Unique (%)32.6%

Sample

1st row김*국
2nd row정*란
3rd row김*숙
4th row박*규
5th row박*규
ValueCountFrequency (%)
장*화 15
 
4.8%
정*찬 7
 
2.2%
김*현 7
 
2.2%
김*태 6
 
1.9%
모*청 6
 
1.9%
최*훈 6
 
1.9%
유*덕 5
 
1.6%
정*란 5
 
1.6%
장*기 5
 
1.6%
박*규 5
 
1.6%
Other values (159) 246
78.6%
2024-04-21T03:39:41.731445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 311
33.2%
53
 
5.7%
34
 
3.6%
31
 
3.3%
23
 
2.5%
17
 
1.8%
16
 
1.7%
16
 
1.7%
14
 
1.5%
14
 
1.5%
Other values (98) 409
43.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 627
66.8%
Other Punctuation 311
33.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
53
 
8.5%
34
 
5.4%
31
 
4.9%
23
 
3.7%
17
 
2.7%
16
 
2.6%
16
 
2.6%
14
 
2.2%
14
 
2.2%
13
 
2.1%
Other values (97) 396
63.2%
Other Punctuation
ValueCountFrequency (%)
* 311
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 627
66.8%
Common 311
33.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
53
 
8.5%
34
 
5.4%
31
 
4.9%
23
 
3.7%
17
 
2.7%
16
 
2.6%
16
 
2.6%
14
 
2.2%
14
 
2.2%
13
 
2.1%
Other values (97) 396
63.2%
Common
ValueCountFrequency (%)
* 311
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 627
66.8%
ASCII 311
33.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 311
100.0%
Hangul
ValueCountFrequency (%)
53
 
8.5%
34
 
5.4%
31
 
4.9%
23
 
3.7%
17
 
2.7%
16
 
2.6%
16
 
2.6%
14
 
2.2%
14
 
2.2%
13
 
2.1%
Other values (97) 396
63.2%
Distinct191
Distinct (%)61.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean165.01254
Minimum9.6
Maximum999.92
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.9 KiB
2024-04-21T03:39:41.853807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum9.6
5-th percentile29.7
Q180.64
median99.6
Q3124.32
95-th percentile580.558
Maximum999.92
Range990.32
Interquartile range (IQR)43.68

Descriptive statistics

Standard deviation187.26407
Coefficient of variation (CV)1.1348475
Kurtosis6.6184444
Mean165.01254
Median Absolute Deviation (MAD)21.3
Skewness2.5525927
Sum51648.925
Variance35067.83
MonotonicityNot monotonic
2024-04-21T03:39:41.983385image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
99.96 25
 
8.0%
99.84 20
 
6.4%
99.0 12
 
3.8%
99.76 9
 
2.9%
90.0 6
 
1.9%
85.5 6
 
1.9%
98.88 6
 
1.9%
99.6 6
 
1.9%
99.71 5
 
1.6%
29.7 4
 
1.3%
Other values (181) 214
68.4%
ValueCountFrequency (%)
9.6 1
0.3%
14.88 1
0.3%
19.14 1
0.3%
19.2 1
0.3%
19.8 2
0.6%
24.99 1
0.3%
25.2 1
0.3%
27.285 2
0.6%
28.32 1
0.3%
28.56 1
0.3%
ValueCountFrequency (%)
999.92 2
0.6%
960.29 1
0.3%
940.8 1
0.3%
932.79 1
0.3%
917.49 1
0.3%
907.0 1
0.3%
806.3 1
0.3%
769.08 1
0.3%
726.0 1
0.3%
710.99 1
0.3%

비고
Categorical

Distinct3
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
진행중
199 
사업개시
87 
사업취소
27 

Length

Max length4
Median length3
Mean length3.3642173
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row진행중
2nd row진행중
3rd row사업개시
4th row사업개시
5th row사업개시

Common Values

ValueCountFrequency (%)
진행중 199
63.6%
사업개시 87
27.8%
사업취소 27
 
8.6%

Length

2024-04-21T03:39:42.088364image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T03:39:42.173962image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
진행중 199
63.6%
사업개시 87
27.8%
사업취소 27
 
8.6%

Interactions

2024-04-21T03:39:40.396004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T03:39:42.233238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
설비용량(킬로와트)비고
설비용량(킬로와트)1.0000.180
비고0.1801.000
2024-04-21T03:39:42.319452image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
설비용량(킬로와트)비고
설비용량(킬로와트)1.0000.107
비고0.1071.000

Missing values

2024-04-21T03:39:40.533702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T03:39:40.607484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

위치성명설비용량(킬로와트)비고
0경상남도 함안군 칠서면 대치리김*국97.01진행중
1경상남도 함안군 법수면 강주리정*란192.64진행중
2경상남도 함안군 법수면 우거리김*숙499.2사업개시
3경상남도 함안군 법수면 백산리박*규99.76사업개시
4경상남도 함안군 법수면 백산리박*규99.76사업개시
5경상남도 함안군 법수면 백산리박*규99.76사업개시
6경상남도 함안군 칠북면 북원로정*란234.78사업개시
7경상남도 함안군 칠원읍 오곡로동*74.24사업개시
8경상남도 함안군 법수면 장백로정*란189.66사업개시
9경상남도 함안군 군북면 석교천길이*경299.92사업개시
위치성명설비용량(킬로와트)비고
303경상남도 함안군 칠원읍 오곡리장*화99.56진행중
304경상남도 함안군 칠원읍 오곡리장*화99.56진행중
305경상남도 함안군 칠서면 계내리김*문387.75진행중
306경상남도 함안군 법수면 윤외리모*청200.2진행중
307경상남도 함안군 군북면 유현리장*화98.25진행중
308경상남도 함안군 군북면 유현리장*화98.25진행중
309경상남도 함안군 군북면 유현리장*화98.25진행중
310경상남도 함안군 군북면 유현리장*화73.36진행중
311경상남도 함안군 법수면 윤외리김*열96.28진행중
312경상남도 함안군 법수면 윤외리안*미51.04진행중

Duplicate rows

Most frequently occurring

위치성명설비용량(킬로와트)비고# duplicates
6경상남도 함안군 군북면 사도리윤*근99.96진행중4
7경상남도 함안군 군북면 유현리장*화98.25진행중3
8경상남도 함안군 군북면 하림리윤*국99.71사업개시3
12경상남도 함안군 법수면 백산리박*규99.76사업개시3
17경상남도 함안군 칠북면 검단리정*만29.75진행중3
20경상남도 함안군 칠북면 화천리김*빈99.96사업개시3
26경상남도 함안군 칠원읍 용산리옥*환99.99진행중3
0경상남도 함안군 가야읍 가야리최*훈71.4진행중2
1경상남도 함안군 가야읍 북실길강*식85.5사업취소2
2경상남도 함안군 가야읍 북실길오*재90.0사업취소2