Overview

Dataset statistics

Number of variables6
Number of observations49
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.6 KiB
Average record size in memory53.7 B

Variable types

Numeric3
Text2
Categorical1

Dataset

Description서울특별시 양천구 공공청사의 태양광발전설비 설치현황(위치, 주소, 용량, 설치일, 데이터기준일자)입니다.
URLhttps://www.data.go.kr/data/15048675/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
연번 is highly overall correlated with 설치년도High correlation
설치년도 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique
위치 has unique valuesUnique

Reproduction

Analysis started2023-12-12 05:19:32.160656
Analysis finished2023-12-12 05:19:33.473041
Duration1.31 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct49
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean25
Minimum1
Maximum49
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size573.0 B
2023-12-12T14:19:33.562334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.4
Q113
median25
Q337
95-th percentile46.6
Maximum49
Range48
Interquartile range (IQR)24

Descriptive statistics

Standard deviation14.28869
Coefficient of variation (CV)0.57154761
Kurtosis-1.2
Mean25
Median Absolute Deviation (MAD)12
Skewness0
Sum1225
Variance204.16667
MonotonicityStrictly increasing
2023-12-12T14:19:33.719412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=49)
ValueCountFrequency (%)
1 1
 
2.0%
38 1
 
2.0%
28 1
 
2.0%
29 1
 
2.0%
30 1
 
2.0%
31 1
 
2.0%
32 1
 
2.0%
33 1
 
2.0%
34 1
 
2.0%
35 1
 
2.0%
Other values (39) 39
79.6%
ValueCountFrequency (%)
1 1
2.0%
2 1
2.0%
3 1
2.0%
4 1
2.0%
5 1
2.0%
6 1
2.0%
7 1
2.0%
8 1
2.0%
9 1
2.0%
10 1
2.0%
ValueCountFrequency (%)
49 1
2.0%
48 1
2.0%
47 1
2.0%
46 1
2.0%
45 1
2.0%
44 1
2.0%
43 1
2.0%
42 1
2.0%
41 1
2.0%
40 1
2.0%

위치
Text

UNIQUE 

Distinct49
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size524.0 B
2023-12-12T14:19:33.969440image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length16
Mean length8.6530612
Min length5

Characters and Unicode

Total characters424
Distinct characters120
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique49 ?
Unique (%)100.0%

Sample

1st row양천해누리타운
2nd row가로공원주차장
3rd row신월4동주민센터
4th row어린이교통공원
5th row신월5동주민센터
ValueCountFrequency (%)
목사랑시장 2
 
3.3%
신월6동 2
 
3.3%
계남다목적체육관 2
 
3.3%
구청사 2
 
3.3%
고객지원센터 2
 
3.3%
양천해누리타운 1
 
1.7%
1
 
1.7%
양천어르신종합복지관 1
 
1.7%
마을사랑주차장 1
 
1.7%
자전거보관소 1
 
1.7%
Other values (45) 45
75.0%
2023-12-12T14:19:34.395741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
22
 
5.2%
18
 
4.2%
16
 
3.8%
13
 
3.1%
13
 
3.1%
12
 
2.8%
12
 
2.8%
11
 
2.6%
11
 
2.6%
10
 
2.4%
Other values (110) 286
67.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 404
95.3%
Space Separator 13
 
3.1%
Decimal Number 6
 
1.4%
Other Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
22
 
5.4%
18
 
4.5%
16
 
4.0%
13
 
3.2%
12
 
3.0%
12
 
3.0%
11
 
2.7%
11
 
2.7%
10
 
2.5%
10
 
2.5%
Other values (103) 269
66.6%
Decimal Number
ValueCountFrequency (%)
6 2
33.3%
5 1
16.7%
1 1
16.7%
3 1
16.7%
4 1
16.7%
Space Separator
ValueCountFrequency (%)
13
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 404
95.3%
Common 20
 
4.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
22
 
5.4%
18
 
4.5%
16
 
4.0%
13
 
3.2%
12
 
3.0%
12
 
3.0%
11
 
2.7%
11
 
2.7%
10
 
2.5%
10
 
2.5%
Other values (103) 269
66.6%
Common
ValueCountFrequency (%)
13
65.0%
6 2
 
10.0%
& 1
 
5.0%
5 1
 
5.0%
1 1
 
5.0%
3 1
 
5.0%
4 1
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 404
95.3%
ASCII 20
 
4.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
22
 
5.4%
18
 
4.5%
16
 
4.0%
13
 
3.2%
12
 
3.0%
12
 
3.0%
11
 
2.7%
11
 
2.7%
10
 
2.5%
10
 
2.5%
Other values (103) 269
66.6%
ASCII
ValueCountFrequency (%)
13
65.0%
6 2
 
10.0%
& 1
 
5.0%
5 1
 
5.0%
1 1
 
5.0%
3 1
 
5.0%
4 1
 
5.0%

주소
Text

Distinct48
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Memory size524.0 B
2023-12-12T14:19:34.686936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length20
Mean length17.673469
Min length14

Characters and Unicode

Total characters866
Distinct characters43
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique47 ?
Unique (%)95.9%

Sample

1st row서울시 양천구 목동동로 81
2nd row서울시 양천구 가로공원로 133
3rd row서울시 양천구 오목로5길 34
4th row서울시 양천구 목동남로 106-23
5th row서울시 양천구 신월5동 52-2
ValueCountFrequency (%)
서울시 49
25.4%
양천구 49
25.4%
신정동 2
 
1.0%
목동서로 2
 
1.0%
목동중앙남로 2
 
1.0%
월정로 2
 
1.0%
81 2
 
1.0%
목동동로 2
 
1.0%
신정동321-8 2
 
1.0%
16 1
 
0.5%
Other values (80) 80
41.5%
2023-12-12T14:19:35.093916image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
145
16.7%
51
 
5.9%
50
 
5.8%
49
 
5.7%
49
 
5.7%
49
 
5.7%
49
 
5.7%
1 40
 
4.6%
37
 
4.3%
35
 
4.0%
Other values (33) 312
36.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 506
58.4%
Decimal Number 193
 
22.3%
Space Separator 145
 
16.7%
Dash Punctuation 20
 
2.3%
Close Punctuation 1
 
0.1%
Open Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
51
10.1%
50
9.9%
49
9.7%
49
9.7%
49
9.7%
49
9.7%
37
 
7.3%
35
 
6.9%
20
 
4.0%
18
 
3.6%
Other values (19) 99
19.6%
Decimal Number
ValueCountFrequency (%)
1 40
20.7%
3 34
17.6%
2 23
11.9%
5 19
9.8%
8 16
 
8.3%
6 16
 
8.3%
4 15
 
7.8%
7 15
 
7.8%
0 10
 
5.2%
9 5
 
2.6%
Space Separator
ValueCountFrequency (%)
145
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 20
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 506
58.4%
Common 360
41.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
51
10.1%
50
9.9%
49
9.7%
49
9.7%
49
9.7%
49
9.7%
37
 
7.3%
35
 
6.9%
20
 
4.0%
18
 
3.6%
Other values (19) 99
19.6%
Common
ValueCountFrequency (%)
145
40.3%
1 40
 
11.1%
3 34
 
9.4%
2 23
 
6.4%
- 20
 
5.6%
5 19
 
5.3%
8 16
 
4.4%
6 16
 
4.4%
4 15
 
4.2%
7 15
 
4.2%
Other values (4) 17
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 506
58.4%
ASCII 360
41.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
145
40.3%
1 40
 
11.1%
3 34
 
9.4%
2 23
 
6.4%
- 20
 
5.6%
5 19
 
5.3%
8 16
 
4.4%
6 16
 
4.4%
4 15
 
4.2%
7 15
 
4.2%
Other values (4) 17
 
4.7%
Hangul
ValueCountFrequency (%)
51
10.1%
50
9.9%
49
9.7%
49
9.7%
49
9.7%
49
9.7%
37
 
7.3%
35
 
6.9%
20
 
4.0%
18
 
3.6%
Other values (19) 99
19.6%

용량(Kw)
Real number (ℝ)

Distinct31
Distinct (%)63.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean17.977041
Minimum2.52
Maximum95.94
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size573.0 B
2023-12-12T14:19:35.596032image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2.52
5-th percentile3
Q18.69
median10.14
Q320.52
95-th percentile55.56
Maximum95.94
Range93.42
Interquartile range (IQR)11.83

Descriptive statistics

Standard deviation18.867788
Coefficient of variation (CV)1.0495491
Kurtosis6.708219
Mean17.977041
Median Absolute Deviation (MAD)7.14
Skewness2.4370636
Sum880.875
Variance355.99342
MonotonicityNot monotonic
2023-12-12T14:19:35.809195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=31)
ValueCountFrequency (%)
10.0 10
20.4%
3.0 8
 
16.3%
15.0 2
 
4.1%
12.5 2
 
4.1%
50.4 1
 
2.0%
75.6 1
 
2.0%
18.375 1
 
2.0%
20.52 1
 
2.0%
95.94 1
 
2.0%
8.0 1
 
2.0%
Other values (21) 21
42.9%
ValueCountFrequency (%)
2.52 1
 
2.0%
3.0 8
16.3%
3.2 1
 
2.0%
6.16 1
 
2.0%
8.0 1
 
2.0%
8.69 1
 
2.0%
10.0 10
20.4%
10.05 1
 
2.0%
10.14 1
 
2.0%
10.15 1
 
2.0%
ValueCountFrequency (%)
95.94 1
2.0%
75.6 1
2.0%
59.0 1
2.0%
50.4 1
2.0%
43.16 1
2.0%
40.0 1
2.0%
30.0 1
2.0%
25.65 1
2.0%
25.0 1
2.0%
24.0 1
2.0%

설치년도
Real number (ℝ)

HIGH CORRELATION 

Distinct12
Distinct (%)24.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2016.9796
Minimum2010
Maximum2022
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size573.0 B
2023-12-12T14:19:35.959140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2010
5-th percentile2012
Q12015
median2016
Q32020
95-th percentile2021.6
Maximum2022
Range12
Interquartile range (IQR)5

Descriptive statistics

Standard deviation3.3633428
Coefficient of variation (CV)0.0016675145
Kurtosis-1.2955609
Mean2016.9796
Median Absolute Deviation (MAD)3
Skewness-0.022695803
Sum98832
Variance11.312075
MonotonicityIncreasing
2023-12-12T14:19:36.133696image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
2015 10
20.4%
2021 8
16.3%
2020 7
14.3%
2013 4
 
8.2%
2014 4
 
8.2%
2016 4
 
8.2%
2012 3
 
6.1%
2022 3
 
6.1%
2017 2
 
4.1%
2019 2
 
4.1%
Other values (2) 2
 
4.1%
ValueCountFrequency (%)
2010 1
 
2.0%
2012 3
 
6.1%
2013 4
 
8.2%
2014 4
 
8.2%
2015 10
20.4%
2016 4
 
8.2%
2017 2
 
4.1%
2018 1
 
2.0%
2019 2
 
4.1%
2020 7
14.3%
ValueCountFrequency (%)
2022 3
 
6.1%
2021 8
16.3%
2020 7
14.3%
2019 2
 
4.1%
2018 1
 
2.0%
2017 2
 
4.1%
2016 4
 
8.2%
2015 10
20.4%
2014 4
 
8.2%
2013 4
 
8.2%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size524.0 B
2023-05-30
49 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-05-30
2nd row2023-05-30
3rd row2023-05-30
4th row2023-05-30
5th row2023-05-30

Common Values

ValueCountFrequency (%)
2023-05-30 49
100.0%

Length

2023-12-12T14:19:36.289929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:19:36.414213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-05-30 49
100.0%

Interactions

2023-12-12T14:19:32.994007image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:19:32.389139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:19:32.683204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:19:33.077013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:19:32.485450image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:19:32.763609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:19:33.164868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:19:32.568464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:19:32.881579image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T14:19:36.486043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번위치주소용량(Kw)설치년도
연번1.0001.0001.0000.4090.868
위치1.0001.0001.0001.0001.000
주소1.0001.0001.0001.0001.000
용량(Kw)0.4091.0001.0001.0000.285
설치년도0.8681.0001.0000.2851.000
2023-12-12T14:19:36.636864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번용량(Kw)설치년도
연번1.000-0.1200.991
용량(Kw)-0.1201.000-0.110
설치년도0.991-0.1101.000

Missing values

2023-12-12T14:19:33.291097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T14:19:33.417014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번위치주소용량(Kw)설치년도데이터기준일자
01양천해누리타운서울시 양천구 목동동로 8150.420102023-05-30
12가로공원주차장서울시 양천구 가로공원로 13315.020122023-05-30
23신월4동주민센터서울시 양천구 오목로5길 3410.020122023-05-30
34어린이교통공원서울시 양천구 목동남로 106-233.020122023-05-30
45신월5동주민센터서울시 양천구 신월5동 52-210.020132023-05-30
56신정1동주민센터서울시 양천구 신정1동 1051-1110.020132023-05-30
67신정3동현장민원실서울시 양천구 신정3동 1287-510.020132023-05-30
78오금빗물펌프장서울시 양천구 신정7동 330-1410.1520132023-05-30
89신월종합사회복지관서울시 양천구 신월로24길 1910.020142023-05-30
910신월청소년문화센터서울시 양천구 가로공원로8620.0220142023-05-30
연번위치주소용량(Kw)설치년도데이터기준일자
3940지양어르신사랑방서울시 양천구 신월7동 331-1043.020212023-05-30
4041서부어르신사랑방서울시 양천구 신정4동 882-113.020212023-05-30
4142신남어르신사랑방서울시 양천구 신월3동 198-223.020212023-05-30
4243신월6동 복합청사서울시 양천구 남부순환로83길 5359.020212023-05-30
4344목사랑시장 고객지원센터서울시 양천구 목동중앙남로 7길 88.6920212023-05-30
4445다솜어린이집서울시 양천구 목동중앙로13길 226.1620212023-05-30
4546은혜어린이집서울시 양천구 목동로15길 242.5220212023-05-30
4647계남다목적체육관 주차장서울시 양천구 남부순환로56가길 1324.020222023-05-30
4748연의생태학습관서울시 양천구 신정동 1320-93.020222023-05-30
4849신원어르신사랑방서울시 양천구 남부순환로46길 1213.520222023-05-30