Overview

Dataset statistics

Number of variables: 6
Number of observations: 109
Missing cells: 44
Missing cells (%): 6.7%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 5.6 KiB
Average record size in memory: 52.2 B
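
The missing-cell percentage above is consistent with dividing the missing-cell count by the total number of cells (variables times observations). The sketch below checks that arithmetic; the variable names are illustrative, and the formula is an assumption that happens to match the reported figures, not a statement of how the profiling tool computes it.

```python
# Figures copied from the "Dataset statistics" block of the report.
n_variables = 6
n_observations = 109
missing_cells = 44

# Assumed formula: missing share = missing cells / (variables * observations).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"{total_cells} cells in total, {missing_pct:.1f}% missing")
```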

Variable types

Numeric: 2
Text: 2
Categorical: 2

Dataset

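Description: Data on the implementation status of the renewable energy expansion infrastructure project (신재생에너지 확대기반 조성사업) in Michuhol-gu, Incheon Metropolitan City. It provides the building name, road-name address, energy source, generation capacity, and related fields. Note that the data does not cover every solar power installation in Michuhol-gu; it is limited to solar power installations installed under this project.
Author: Michuhol-gu, Incheon Metropolitan City (인천광역시 미추홀구)
URL: https://data.incheon.go.kr/findData/publicDataDetail?dataId=15095452&srcSe=7661IVAWM27C61E190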

Alerts

에너지원 is highly overall correlated with 발전용량(킬로와트) and 1 other field (High correlation)
발전용량(제곱미터) is highly overall correlated with 에너지원 (High correlation)
연번 is highly overall correlated with 발전용량(킬로와트) (High correlation)
발전용량(킬로와트) is highly overall correlated with 연번 and 1 other field (High correlation)
발전용량(킬로와트) has 44 (40.4%) missing values (Missing)
연번 has unique values (Unique)

Reproduction

Analysis started: 2024-01-28 08:05:45.798725
Analysis finished: 2024-01-28 08:05:46.623335
Duration: 0.82 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 109
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 55
Minimum: 1
Maximum: 109
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.1 KiB

Quantile statistics

Minimum: 1
5-th percentile: 6.4
Q1: 28
Median: 55
Q3: 82
95-th percentile: 103.6
Maximum: 109
Range: 108
Interquartile range (IQR): 54
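
The fractional percentiles (6.4 and 103.6) are what linear interpolation between neighbouring ranks would give for a column holding the integers 1 to 109. The following is a minimal sketch under that assumption; `linear_quantile` and `serial_numbers` are hypothetical names, and the report itself does not state which quantile method was used.

```python
def linear_quantile(sorted_values, q):
    """Quantile by linear interpolation between the two nearest ranks."""
    pos = q * (len(sorted_values) - 1)          # fractional 0-based rank
    lower = int(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + frac * (sorted_values[upper] - sorted_values[lower])


# Assumption: 연번 is a serial number running 1..109, which matches the
# reported minimum, maximum, distinct count and strictly increasing order.
serial_numbers = list(range(1, 110))

quantiles = {q: linear_quantile(serial_numbers, q) for q in (0.05, 0.25, 0.5, 0.75, 0.95)}
iqr = quantiles[0.75] - quantiles[0.25]

for q, value in quantiles.items():
    print(f"{q:.2f} quantile: {value:.1f}")
print(f"IQR: {iqr:.1f}")
```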

Descriptive statistics

Standard deviation: 31.609598
Coefficient of variation (CV): 0.57471996
Kurtosis: -1.2
Mean: 55
Median Absolute Deviation (MAD): 27
Skewness: 0
Sum: 5995
Variance: 999.16667
Monotonicity: Strictly increasing
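
These figures can be reproduced with the Python standard library if 연번 is taken to be the integers 1 to 109. That is an assumption, though one consistent with the reported sum, range and distinct count. The sketch suggests that the reported variance and standard deviation are the sample (n - 1) versions; `serial_numbers` is an illustrative name.

```python
import statistics

# Assumption: the serial-number column holds exactly 1, 2, ..., 109.
serial_numbers = list(range(1, 110))

total = sum(serial_numbers)
mean = statistics.mean(serial_numbers)
variance = statistics.variance(serial_numbers)   # sample variance (divides by n - 1)
std_dev = statistics.stdev(serial_numbers)       # sample standard deviation
cv = std_dev / mean                              # coefficient of variation

median = statistics.median(serial_numbers)
# Median absolute deviation: median of the absolute distances from the median.
mad = statistics.median(abs(x - median) for x in serial_numbers)

print(total, mean, round(variance, 5), round(std_dev, 6), round(cv, 8), mad)
```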
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 1 | 0.9%
70 | 1 | 0.9%
81 | 1 | 0.9%
80 | 1 | 0.9%
79 | 1 | 0.9%
78 | 1 | 0.9%
77 | 1 | 0.9%
76 | 1 | 0.9%
75 | 1 | 0.9%
74 | 1 | 0.9%
Other values (99) | 99 | 90.8%

Value | Count | Frequency (%)
1 | 1 | 0.9%
2 | 1 | 0.9%
3 | 1 | 0.9%
4 | 1 | 0.9%
5 | 1 | 0.9%
6 | 1 | 0.9%
7 | 1 | 0.9%
8 | 1 | 0.9%
9 | 1 | 0.9%
10 | 1 | 0.9%

Value | Count | Frequency (%)
109 | 1 | 0.9%
108 | 1 | 0.9%
107 | 1 | 0.9%
106 | 1 | 0.9%
105 | 1 | 0.9%
104 | 1 | 0.9%
103 | 1 | 0.9%
102 | 1 | 0.9%
101 | 1 | 0.9%
100 | 1 | 0.9%

건물명
Text

Distinct: 66
Distinct (%): 60.6%
Missing: 0
Missing (%): 0.0%
Memory size: 1004.0 B

Length

Max length: 11
Median length: 10
Mean length: 7.7889908
Min length: 4

Characters and Unicode

Total characters: 849
Distinct characters: 103
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 23
Unique (%): 21.1%

Sample

1st row: 도화1동행정복지센터
2nd row: 쑥골 어린이 도서관
3rd row: 이랑어린이도서관
4th row: 장사래 어린이 도서관
5th row: 한우리 어린이 도서관
Value | Count | Frequency (%)
분회경로당 | 32 | 17.3%
경로당 | 16 | 8.6%
주민센터 | 5 | 2.7%
도서관 | 4 | 2.2%
어린이 | 4 | 2.2%
어린이집 | 3 | 1.6%
주안6동 | 3 | 1.6%
구립 | 3 | 1.6%
문학동 | 3 | 1.6%
주안7동 | 3 | 1.6%
Other values (65) | 109 | 58.9%

Most occurring characters

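Value | Count | Frequency (%)
(not preserved) | 86 | 10.1%
(not preserved) | 86 | 10.1%
(not preserved) | 86 | 10.1%
(space) | 76 | 9.0%
(not preserved) | 46 | 5.4%
(not preserved) | 35 | 4.1%
(not preserved) | 34 | 4.0%
(not preserved) | 21 | 2.5%
(not preserved) | 15 | 1.8%
(not preserved) | 14 | 1.6%
Other values (93) | 350 | 41.2%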

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 732 | 86.2%
Space Separator | 76 | 9.0%
Decimal Number | 40 | 4.7%
Other Punctuation | 1 | 0.1%

Most frequent character per category

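Other Letter
Value | Count | Frequency (%)
(not preserved) | 86 | 11.7%
(not preserved) | 86 | 11.7%
(not preserved) | 86 | 11.7%
(not preserved) | 46 | 6.3%
(not preserved) | 35 | 4.8%
(not preserved) | 34 | 4.6%
(not preserved) | 21 | 2.9%
(not preserved) | 15 | 2.0%
(not preserved) | 14 | 1.9%
(not preserved) | 13 | 1.8%
Other values (83) | 296 | 40.4%

Decimal Number
Value | Count | Frequency (%)
1 | 12 | 30.0%
4 | 9 | 22.5%
2 | 4 | 10.0%
5 | 4 | 10.0%
3 | 3 | 7.5%
7 | 3 | 7.5%
6 | 3 | 7.5%
8 | 2 | 5.0%

Space Separator
Value | Count | Frequency (%)
(space) | 76 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 1 | 100.0%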

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 732 | 86.2%
Common | 117 | 13.8%

Most frequent character per script

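Hangul
Value | Count | Frequency (%)
(not preserved) | 86 | 11.7%
(not preserved) | 86 | 11.7%
(not preserved) | 86 | 11.7%
(not preserved) | 46 | 6.3%
(not preserved) | 35 | 4.8%
(not preserved) | 34 | 4.6%
(not preserved) | 21 | 2.9%
(not preserved) | 15 | 2.0%
(not preserved) | 14 | 1.9%
(not preserved) | 13 | 1.8%
Other values (83) | 296 | 40.4%

Common
Value | Count | Frequency (%)
(space) | 76 | 65.0%
1 | 12 | 10.3%
4 | 9 | 7.7%
2 | 4 | 3.4%
5 | 4 | 3.4%
3 | 3 | 2.6%
7 | 3 | 2.6%
6 | 3 | 2.6%
8 | 2 | 1.7%
, | 1 | 0.9%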

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 732 | 86.2%
ASCII | 117 | 13.8%

Most frequent character per block

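Hangul
Value | Count | Frequency (%)
(not preserved) | 86 | 11.7%
(not preserved) | 86 | 11.7%
(not preserved) | 86 | 11.7%
(not preserved) | 46 | 6.3%
(not preserved) | 35 | 4.8%
(not preserved) | 34 | 4.6%
(not preserved) | 21 | 2.9%
(not preserved) | 15 | 2.0%
(not preserved) | 14 | 1.9%
(not preserved) | 13 | 1.8%
Other values (83) | 296 | 40.4%

ASCII
Value | Count | Frequency (%)
(space) | 76 | 65.0%
1 | 12 | 10.3%
4 | 9 | 7.7%
2 | 4 | 3.4%
5 | 4 | 3.4%
3 | 3 | 2.6%
7 | 3 | 2.6%
6 | 3 | 2.6%
8 | 2 | 1.7%
, | 1 | 0.9%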

도로명주소
Text

Distinct: 79
Distinct (%): 72.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1004.0 B

Length

Max length: 26
Median length: 24
Mean length: 22.477064
Min length: 17

Characters and Unicode

Total characters: 2450
Distinct characters: 62
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 49
Unique (%): 45.0%

Sample

1st row: 인천광역시 미추홀구 경인로 295
2nd row: 인천광역시 미추홀구 염전로202번길 49
3rd row: 인천광역시 미추홀구 미추홀대로578번길 7
4th row: 인천광역시 미추홀구 경인로34번길 20
5th row: 인천광역시 미추홀구 주안서로53번길 22
Value | Count | Frequency (%)
인천광역시 | 109 | 24.9%
미추홀구 | 109 | 24.9%
7 | 9 | 2.1%
25 | 5 | 1.1%
35 | 4 | 0.9%
소성로 | 4 | 0.9%
경인로34번길 | 3 | 0.7%
22 | 3 | 0.7%
32 | 3 | 0.7%
13 | 3 | 0.7%
Other values (107) | 185 | 42.3%

Most occurring characters

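Value | Count | Frequency (%)
(space) | 379 | 15.5%
(not preserved) | 137 | 5.6%
(not preserved) | 121 | 4.9%
(not preserved) | 112 | 4.6%
(not preserved) | 112 | 4.6%
(not preserved) | 111 | 4.5%
(not preserved) | 111 | 4.5%
(not preserved) | 109 | 4.4%
(not preserved) | 109 | 4.4%
(not preserved) | 109 | 4.4%
Other values (52) | 1040 | 42.4%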

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1545 | 63.1%
Decimal Number | 496 | 20.2%
Space Separator | 379 | 15.5%
Dash Punctuation | 30 | 1.2%

Most frequent character per category

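Other Letter
Value | Count | Frequency (%)
(not preserved) | 137 | 8.9%
(not preserved) | 121 | 7.8%
(not preserved) | 112 | 7.2%
(not preserved) | 112 | 7.2%
(not preserved) | 111 | 7.2%
(not preserved) | 111 | 7.2%
(not preserved) | 109 | 7.1%
(not preserved) | 109 | 7.1%
(not preserved) | 109 | 7.1%
(not preserved) | 103 | 6.7%
Other values (40) | 411 | 26.6%

Decimal Number
Value | Count | Frequency (%)
1 | 80 | 16.1%
3 | 75 | 15.1%
2 | 71 | 14.3%
4 | 54 | 10.9%
5 | 49 | 9.9%
7 | 47 | 9.5%
6 | 37 | 7.5%
9 | 35 | 7.1%
8 | 29 | 5.8%
0 | 19 | 3.8%

Space Separator
Value | Count | Frequency (%)
(space) | 379 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 30 | 100.0%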

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1545 | 63.1%
Common | 905 | 36.9%

Most frequent character per script

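Hangul
Value | Count | Frequency (%)
(not preserved) | 137 | 8.9%
(not preserved) | 121 | 7.8%
(not preserved) | 112 | 7.2%
(not preserved) | 112 | 7.2%
(not preserved) | 111 | 7.2%
(not preserved) | 111 | 7.2%
(not preserved) | 109 | 7.1%
(not preserved) | 109 | 7.1%
(not preserved) | 109 | 7.1%
(not preserved) | 103 | 6.7%
Other values (40) | 411 | 26.6%

Common
Value | Count | Frequency (%)
(space) | 379 | 41.9%
1 | 80 | 8.8%
3 | 75 | 8.3%
2 | 71 | 7.8%
4 | 54 | 6.0%
5 | 49 | 5.4%
7 | 47 | 5.2%
6 | 37 | 4.1%
9 | 35 | 3.9%
- | 30 | 3.3%
Other values (2) | 48 | 5.3%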

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1545 | 63.1%
ASCII | 905 | 36.9%

Most frequent character per block

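ASCII
Value | Count | Frequency (%)
(space) | 379 | 41.9%
1 | 80 | 8.8%
3 | 75 | 8.3%
2 | 71 | 7.8%
4 | 54 | 6.0%
5 | 49 | 5.4%
7 | 47 | 5.2%
6 | 37 | 4.1%
9 | 35 | 3.9%
- | 30 | 3.3%
Other values (2) | 48 | 5.3%

Hangul
Value | Count | Frequency (%)
(not preserved) | 137 | 8.9%
(not preserved) | 121 | 7.8%
(not preserved) | 112 | 7.2%
(not preserved) | 112 | 7.2%
(not preserved) | 111 | 7.2%
(not preserved) | 111 | 7.2%
(not preserved) | 109 | 7.1%
(not preserved) | 109 | 7.1%
(not preserved) | 109 | 7.1%
(not preserved) | 103 | 6.7%
Other values (40) | 411 | 26.6%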

에너지원
Categorical

HIGH CORRELATION 

Distinct: 4
Distinct (%): 3.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1004.0 B

태양광: 63
태양열: 44
연료전지: 1
지열: 1

Length

Max length: 4
Median length: 3
Mean length: 3
Min length: 2

Unique

Unique: 2
Unique (%): 1.8%

Sample

1st row: 태양광
2nd row: 태양광
3rd row: 연료전지
4th row: 태양광
5th row: 태양광

Common Values

Value | Count | Frequency (%)
태양광 | 63 | 57.8%
태양열 | 44 | 40.4%
연료전지 | 1 | 0.9%
지열 | 1 | 0.9%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
태양광 | 63 | 57.8%
태양열 | 44 | 40.4%
연료전지 | 1 | 0.9%
지열 | 1 | 0.9%

발전용량(킬로와트)
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct: 12
Distinct (%): 18.5%
Missing: 44
Missing (%): 40.4%
Infinite: 0
Infinite (%): 0.0%
Mean: 8.4769231
Minimum: 3
Maximum: 70
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.1 KiB

Quantile statistics

Minimum: 3
5-th percentile: 3
Q1: 3
Median: 3
Q3: 10
95-th percentile: 27
Maximum: 70
Range: 67
Interquartile range (IQR): 7

Descriptive statistics

Standard deviation: 10.926041
Coefficient of variation (CV): 1.2889159
Kurtosis: 15.101109
Mean: 8.4769231
Median Absolute Deviation (MAD): 0
Skewness: 3.3278291
Sum: 551
Variance: 119.37837
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=12)
Value | Count | Frequency (%)
3 | 43 | 39.4%
10 | 4 | 3.7%
15 | 4 | 3.7%
12 | 3 | 2.8%
25 | 2 | 1.8%
20 | 2 | 1.8%
27 | 2 | 1.8%
30 | 1 | 0.9%
70 | 1 | 0.9%
6 | 1 | 0.9%
Other values (2) | 2 | 1.8%
(Missing) | 44 | 40.4%

Value | Count | Frequency (%)
3 | 43 | 39.4%
6 | 1 | 0.9%
8 | 1 | 0.9%
10 | 4 | 3.7%
12 | 3 | 2.8%
15 | 4 | 3.7%
20 | 2 | 1.8%
25 | 2 | 1.8%
27 | 2 | 1.8%
28 | 1 | 0.9%

Value | Count | Frequency (%)
70 | 1 | 0.9%
30 | 1 | 0.9%
28 | 1 | 0.9%
27 | 2 | 1.8%
25 | 2 | 1.8%
20 | 2 | 1.8%
15 | 4 | 3.7%
12 | 3 | 2.8%
10 | 4 | 3.7%
8 | 1 | 0.9%
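
The twelve distinct 발전용량(킬로와트) values and their counts can all be read off the tables above, so the summary statistics can be rebuilt from the frequency table alone. The sketch below does this with the standard library. `kw_counts` is an illustrative name, and treating the reported variance as the sample (n - 1) variance is an assumption that matches the printed figure.

```python
import statistics

# value (kW) -> number of rows, taken from the value tables above;
# the 44 missing rows are excluded.
kw_counts = {3: 43, 6: 1, 8: 1, 10: 4, 12: 3, 15: 4,
             20: 2, 25: 2, 27: 2, 28: 1, 30: 1, 70: 1}

# Expand the frequency table back into one value per non-missing row.
values = [value for value, count in kw_counts.items() for _ in range(count)]

n = len(values)
total = sum(values)
mean = total / n
median = statistics.median(values)
variance = statistics.variance(values)           # sample variance (n - 1)
mad = statistics.median(abs(v - median) for v in values)
distinct_pct = 100 * len(kw_counts) / n

print(n, total, round(mean, 7), median, round(variance, 5), mad, round(distinct_pct, 1))
```

Because 43 of the 65 non-missing rows hold the value 3, the median is 3 and the median absolute deviation is 0, while the single 70 kW row drives the large skewness and kurtosis.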

발전용량(제곱미터)
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 2.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1004.0 B

<NA>: 65
10: 34
6: 10

Length

Max length: 4
Median length: 4
Mean length: 3.1009174
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
<NA> | 65 | 59.6%
10 | 34 | 31.2%
6 | 10 | 9.2%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
na | 65 | 59.6%
10 | 34 | 31.2%
6 | 10 | 9.2%

Interactions

[Four interaction plots; images not preserved]

Correlations

 | 연번 | 건물명 | 도로명주소 | 에너지원 | 발전용량(킬로와트) | 발전용량(제곱미터)
연번 | 1.000 | 0.997 | 0.998 | 0.277 | 0.383 | 0.520
건물명 | 0.997 | 1.000 | 1.000 | 0.000 | 0.000 | 1.000
도로명주소 | 0.998 | 1.000 | 1.000 | 0.000 | 1.000 | 1.000
에너지원 | 0.277 | 0.000 | 0.000 | 1.000 | 0.941 | NaN
발전용량(킬로와트) | 0.383 | 0.000 | 1.000 | 0.941 | 1.000 | NaN
발전용량(제곱미터) | 0.520 | 1.000 | 1.000 | NaN | NaN | 1.000

 | 에너지원 | 발전용량(제곱미터)
에너지원 | 1.000 | 1.000
발전용량(제곱미터) | 1.000 | 1.000

 | 연번 | 발전용량(킬로와트) | 에너지원 | 발전용량(제곱미터)
연번 | 1.000 | -0.793 | 0.158 | 0.358
발전용량(킬로와트) | -0.793 | 1.000 | 0.689 | 0.000
에너지원 | 0.158 | 0.689 | 1.000 | 1.000
발전용량(제곱미터) | 0.358 | 0.000 | 1.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Index | 연번 | 건물명 | 도로명주소 | 에너지원 | 발전용량(킬로와트) | 발전용량(제곱미터)
0 | 1 | 도화1동행정복지센터 | 인천광역시 미추홀구 경인로 295 | 태양광 | 10 | <NA>
1 | 2 | 쑥골 어린이 도서관 | 인천광역시 미추홀구 염전로202번길 49 | 태양광 | 15 | <NA>
2 | 3 | 이랑어린이도서관 | 인천광역시 미추홀구 미추홀대로578번길 7 | 연료전지 | 10 | <NA>
3 | 4 | 장사래 어린이 도서관 | 인천광역시 미추홀구 경인로34번길 20 | 태양광 | 15 | <NA>
4 | 5 | 한우리 어린이 도서관 | 인천광역시 미추홀구 주안서로53번길 22 | 태양광 | 15 | <NA>
5 | 6 | 숭의1,3동 주민센터 | 인천광역시 미추홀구 석정로92번길 21-17 | 태양광 | 25 | <NA>
6 | 7 | 이랑 어린이 도서관 | 인천광역시 미추홀구 미추홀대로578번길 7 | 태양광 | 10 | <NA>
7 | 8 | 학익1동 주민센터 | 인천광역시 미추홀구 매소홀로 380 | 태양광 | 20 | <NA>
8 | 9 | 에코센터 | 인천광역시 미추홀구 매소홀로290번길 7 | 태양광 | 30 | <NA>
9 | 10 | 에코센터 | 인천광역시 미추홀구 매소홀로 290번길 7 | 지열 | 70 | <NA>

Index | 연번 | 건물명 | 도로명주소 | 에너지원 | 발전용량(킬로와트) | 발전용량(제곱미터)
99 | 100 | 주안6동 분회경로당 | 인천광역시 미추홀구 주안동로26번길 35 | 태양열 | <NA> | 10
100 | 101 | 쑥골경로당 | 인천광역시 미추홀구 주안로7번길 14-35 | 태양광 | 3 | <NA>
101 | 102 | 쑥골경로당 | 인천광역시 미추홀구 주안로7번길 14-35 | 태양열 | <NA> | 6
102 | 103 | 주안1동 분회경로당 | 인천광역시 미추홀구 주안서로43번길 20 | 태양열 | <NA> | 6
103 | 104 | 인정경로당 | 인천광역시 미추홀구 주안중로38번길 13 | 태양광 | 3 | <NA>
104 | 105 | 인정경로당 | 인천광역시 미추홀구 주안중로38번길 13 | 태양열 | <NA> | 10
105 | 106 | 사미경로당 | 인천광역시 미추홀구 한나루로490번길 96 | 태양광 | 3 | <NA>
106 | 107 | 사미경로당 | 인천광역시 미추홀구 한나루로490번길 96 | 태양열 | <NA> | 10
107 | 108 | 장수경로당 | 인천광역시 미추홀구 한나루로550번길 25 | 태양광 | 3 | <NA>
108 | 109 | 장수경로당 | 인천광역시 미추홀구 한나루로550번길 25 | 태양열 | <NA> | 10
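
In the first and last rows shown in this sample, every 태양열 (solar thermal) row reports its capacity in 발전용량(제곱미터) and leaves 발전용량(킬로와트) empty, while the other energy sources do the opposite. That pattern would explain the high-correlation alerts between 에너지원 and the two capacity columns. The sketch below checks it on the 20 sample rows only, not the full dataset; `sample_rows` and `units_by_source` are illustrative names.

```python
from collections import defaultdict

# (연번, 에너지원, kW capacity, m2 capacity) for the 20 rows in the sample;
# None stands for <NA>. Glosses: 태양광 solar PV, 태양열 solar thermal,
# 연료전지 fuel cell, 지열 geothermal.
sample_rows = [
    (1, "태양광", 10, None), (2, "태양광", 15, None), (3, "연료전지", 10, None),
    (4, "태양광", 15, None), (5, "태양광", 15, None), (6, "태양광", 25, None),
    (7, "태양광", 10, None), (8, "태양광", 20, None), (9, "태양광", 30, None),
    (10, "지열", 70, None),
    (100, "태양열", None, 10), (101, "태양광", 3, None), (102, "태양열", None, 6),
    (103, "태양열", None, 6), (104, "태양광", 3, None), (105, "태양열", None, 10),
    (106, "태양광", 3, None), (107, "태양열", None, 10), (108, "태양광", 3, None),
    (109, "태양열", None, 10),
]

# Record which capacity column is populated for each energy source.
units_by_source = defaultdict(set)
for _, source, kw, m2 in sample_rows:
    if kw is not None:
        units_by_source[source].add("kW")
    if m2 is not None:
        units_by_source[source].add("m2")

for source, units in units_by_source.items():
    print(source, sorted(units))
```

The full-dataset counts in the report are consistent with this reading: there are 44 태양열 rows, 44 missing 발전용량(킬로와트) values, and 34 + 10 = 44 non-`<NA>` values in 발전용량(제곱미터).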