Overview

Dataset statistics

Number of variables: 11
Number of observations: 338
Missing cells: 52
Missing cells (%): 1.4%
Duplicate rows: 1
Duplicate rows (%): 0.3%
Total size in memory: 29.5 KiB
Average record size in memory: 89.4 B
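
The percentages in this table can be re-derived from the raw counts. The sketch below is a cross-check only: it assumes that the missing-cell percentage is taken over all cells (variables × observations) and that the average record size is the total memory divided by the number of observations. Both assumptions are consistent with the figures shown, but the report itself does not state them.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 11
n_observations = 338
missing_cells = 52
duplicate_rows = 1
total_size_kib = 29.5

# Assumption: percentages are relative to all cells / all rows.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations

# Assumption: average record size = total memory / number of rows.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
print(f"Average record size in memory: {avg_record_bytes:.1f} B")
```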

Variable types

Text: 5
Categorical: 5
Numeric: 1

Dataset

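Description: Solar power generation business status for Goyang-si, Gyeonggi-do (fields include 시군명 [city/county name], 용도 [use], 세부용도 [detailed use], 사업상태 [business status], 발전소명 [plant name], 설치장소 [installation site], 설비용량 [installed capacity], etc.)
Author: Goyang-si, Gyeonggi-do
URL: https://www.data.go.kr/data/15067630/fileData.do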

Alerts

시군명 has constant value "고양시" (Constant)
용도 has constant value "전기사업용" (Constant)
세부용도 has constant value "발전사업용" (Constant)
데이터기준일 has constant value "2023-02-02" (Constant)
Dataset has 1 (0.3%) duplicate rows (Duplicates)
설비용량(KW) is highly overall correlated with 지목 (High correlation)
지목 is highly overall correlated with 설비용량(KW) (High correlation)
사업개시일 has 52 (15.4%) missing values (Missing)
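
As a rough illustration of how alerts like these follow from per-column counts, the sketch below flags a column as constant when it has exactly one distinct value and as missing when any value is absent. The function `simple_alerts` and its rules are hypothetical simplifications for this example; the thresholds ydata-profiling actually applies are configurable and are not shown in this report.

```python
def simple_alerts(n_rows, distinct, missing):
    """Return a list of alert labels for one column (illustrative rules only)."""
    alerts = []
    if distinct == 1:
        alerts.append("Constant")
    if missing > 0:
        alerts.append(f"Missing: {missing} ({100 * missing / n_rows:.1f}%)")
    return alerts

# Distinct and missing counts copied from the variable panels of this report.
N_ROWS = 338
columns = {
    "시군명": {"distinct": 1, "missing": 0},
    "사업개시일": {"distinct": 216, "missing": 52},
    "설비용량(KW)": {"distinct": 239, "missing": 0},
}

report = {name: simple_alerts(N_ROWS, **stats) for name, stats in columns.items()}
for name, alerts in report.items():
    print(name, alerts)
```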

Reproduction

Analysis started: 2023-12-12 15:41:28.094252
Analysis finished: 2023-12-12 15:41:28.881001
Duration: 0.79 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

허가번호
Text

Distinct: 337
Distinct (%): 99.7%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB

Length

Max length: 30
Median length: 13
Mean length: 13.068047
Min length: 5

Characters and Unicode

Total characters: 4417
Distinct characters: 21
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
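
A minimal sketch of how the category counts in this section can be obtained with Python's standard `unicodedata` module. The helper `category_counts` is a name chosen for this example, and the input is one of the sample values listed for this variable. Script and block statistics are not covered, since the standard library does not expose them.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count characters by Unicode general category (two-letter code)."""
    return Counter(unicodedata.category(ch) for ch in text)

# One of the sample values listed for this variable.
sample = "제2012-111호(경기)"
counts = category_counts(sample)

for code, count in counts.most_common():
    print(code, count)
```

The two-letter codes correspond to the category names used in the report: `Nd` is Decimal Number, `Lo` is Other Letter, `Pd` is Dash Punctuation, `Ps` is Open Punctuation, `Pe` is Close Punctuation, `Zs` is Space Separator and `Po` is Other Punctuation.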

Unique

Unique: 336
Unique (%): 99.4%

Sample

1st row: 경기2-3
2nd row: 경기2-4
3rd row: 경기2-19
4th row: 제2012-111호(경기)
5th row: 제2012-120호(경기)
Value    Count    Frequency (%)
제2015-01호(고양    2    0.6%
제2020-51호(고양    1    0.3%
제2020-50호(고양    1    0.3%
제2021-01호(고양    1    0.3%
제2020-57호(고양    1    0.3%
제2022-13호(고양    1    0.3%
제2022-14호(고양    1    0.3%
제2020-54호(고양    1    0.3%
제2020-53호(고양    1    0.3%
제2020-52호(고양    1    0.3%
Other values (331)    331    96.8%

Most occurring characters

ValueCountFrequency (%)
2 684
15.5%
0 480
10.9%
1 349
7.9%
- 345
7.8%
339
7.7%
339
7.7%
( 336
7.6%
) 336
7.6%
318
7.2%
318
7.2%
Other values (11) 573
13.0%

Most occurring categories

Value    Count    Frequency (%)
Decimal Number    2039    46.2%
Other Letter    1356    30.7%
Dash Punctuation    345    7.8%
Open Punctuation    336    7.6%
Close Punctuation    336    7.6%
Space Separator    4    0.1%
Other Punctuation    1    < 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 684
33.5%
0 480
23.5%
1 349
17.1%
3 105
 
5.1%
4 89
 
4.4%
9 77
 
3.8%
5 76
 
3.7%
8 68
 
3.3%
7 56
 
2.7%
6 55
 
2.7%
Other Letter
ValueCountFrequency (%)
339
25.0%
339
25.0%
318
23.5%
318
23.5%
21
 
1.5%
21
 
1.5%
Dash Punctuation
ValueCountFrequency (%)
- 345
100.0%
Open Punctuation
ValueCountFrequency (%)
( 336
100.0%
Close Punctuation
ValueCountFrequency (%)
) 336
100.0%
Space Separator
ValueCountFrequency (%)
4
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3061
69.3%
Hangul 1356
30.7%

Most frequent character per script

Common
ValueCountFrequency (%)
2 684
22.3%
0 480
15.7%
1 349
11.4%
- 345
11.3%
( 336
11.0%
) 336
11.0%
3 105
 
3.4%
4 89
 
2.9%
9 77
 
2.5%
5 76
 
2.5%
Other values (5) 184
 
6.0%
Hangul
ValueCountFrequency (%)
339
25.0%
339
25.0%
318
23.5%
318
23.5%
21
 
1.5%
21
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3061
69.3%
Hangul 1356
30.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 684
22.3%
0 480
15.7%
1 349
11.4%
- 345
11.3%
( 336
11.0%
) 336
11.0%
3 105
 
3.4%
4 89
 
2.9%
9 77
 
2.5%
5 76
 
2.5%
Other values (5) 184
 
6.0%
Hangul
ValueCountFrequency (%)
339
25.0%
339
25.0%
318
23.5%
318
23.5%
21
 
1.5%
21
 
1.5%

시군명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB
고양시: 338

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 고양시
2nd row: 고양시
3rd row: 고양시
4th row: 고양시
5th row: 고양시

Common Values

Value    Count    Frequency (%)
고양시    338    100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value    Count    Frequency (%)
고양시    338    100.0%

용도
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB
전기사업용: 338

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 전기사업용
2nd row: 전기사업용
3rd row: 전기사업용
4th row: 전기사업용
5th row: 전기사업용

Common Values

Value    Count    Frequency (%)
전기사업용    338    100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value    Count    Frequency (%)
전기사업용    338    100.0%

세부용도
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB
발전사업용: 338

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 발전사업용
2nd row: 발전사업용
3rd row: 발전사업용
4th row: 발전사업용
5th row: 발전사업용

Common Values

Value    Count    Frequency (%)
발전사업용    338    100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value    Count    Frequency (%)
발전사업용    338    100.0%

발전소명
Text

Distinct: 334
Distinct (%): 98.8%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB

Length

Max length: 26
Median length: 21
Mean length: 10.789941
Min length: 2

Characters and Unicode

Total characters: 3647
Distinct characters: 265
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 330
Unique (%): 97.6%

Sample

1st row: 중앙 태양광발전소
2nd row: 일산종하 태양광발전소
3rd row: GS칼텍스장항주유소태양광발전소
4th row: 김상봉 태양광 발전소
5th row: 설문1 태양광 발전소

Value    Count    Frequency (%)
태양광발전소    144    26.0%
발전소    18    3.3%
2호    5    0.9%
성은에너지    3    0.5%
태양광    3    0.5%
하늘소리    3    0.5%
세진프라스틱    3    0.5%
코스타단조    3    0.5%
1호기    3    0.5%
1호    3    0.5%
Other values (349)    365    66.0%

Most occurring characters

ValueCountFrequency (%)
339
 
9.3%
335
 
9.2%
332
 
9.1%
329
 
9.0%
301
 
8.3%
301
 
8.3%
215
 
5.9%
102
 
2.8%
1 55
 
1.5%
2 51
 
1.4%
Other values (255) 1287
35.3%

Most occurring categories

Value    Count    Frequency (%)
Other Letter    3200    87.7%
Space Separator    215    5.9%
Decimal Number    181    5.0%
Uppercase Letter    38    1.0%
Dash Punctuation    6    0.2%
Open Punctuation    3    0.1%
Close Punctuation    3    0.1%
Other Punctuation    1    < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
339
 
10.6%
335
 
10.5%
332
 
10.4%
329
 
10.3%
301
 
9.4%
301
 
9.4%
102
 
3.2%
37
 
1.2%
27
 
0.8%
26
 
0.8%
Other values (226) 1071
33.5%
Uppercase Letter
ValueCountFrequency (%)
S 8
21.1%
J 5
13.2%
C 4
10.5%
K 3
 
7.9%
O 3
 
7.9%
G 3
 
7.9%
B 2
 
5.3%
F 2
 
5.3%
A 2
 
5.3%
E 2
 
5.3%
Other values (4) 4
10.5%
Decimal Number
ValueCountFrequency (%)
1 55
30.4%
2 51
28.2%
3 28
15.5%
4 20
 
11.0%
5 10
 
5.5%
6 5
 
2.8%
7 4
 
2.2%
8 4
 
2.2%
9 3
 
1.7%
0 1
 
0.6%
Space Separator
ValueCountFrequency (%)
215
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3200
87.7%
Common 409
 
11.2%
Latin 38
 
1.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
339
 
10.6%
335
 
10.5%
332
 
10.4%
329
 
10.3%
301
 
9.4%
301
 
9.4%
102
 
3.2%
37
 
1.2%
27
 
0.8%
26
 
0.8%
Other values (226) 1071
33.5%
Common
ValueCountFrequency (%)
215
52.6%
1 55
 
13.4%
2 51
 
12.5%
3 28
 
6.8%
4 20
 
4.9%
5 10
 
2.4%
- 6
 
1.5%
6 5
 
1.2%
7 4
 
1.0%
8 4
 
1.0%
Other values (5) 11
 
2.7%
Latin
ValueCountFrequency (%)
S 8
21.1%
J 5
13.2%
C 4
10.5%
K 3
 
7.9%
O 3
 
7.9%
G 3
 
7.9%
B 2
 
5.3%
F 2
 
5.3%
A 2
 
5.3%
E 2
 
5.3%
Other values (4) 4
10.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3200
87.7%
ASCII 447
 
12.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
339
 
10.6%
335
 
10.5%
332
 
10.4%
329
 
10.3%
301
 
9.4%
301
 
9.4%
102
 
3.2%
37
 
1.2%
27
 
0.8%
26
 
0.8%
Other values (226) 1071
33.5%
ASCII
ValueCountFrequency (%)
215
48.1%
1 55
 
12.3%
2 51
 
11.4%
3 28
 
6.3%
4 20
 
4.5%
5 10
 
2.2%
S 8
 
1.8%
- 6
 
1.3%
J 5
 
1.1%
6 5
 
1.1%
Other values (19) 44
 
9.8%

설비 위치
Text

Distinct: 292
Distinct (%): 86.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB

Length

Max length: 73
Median length: 50
Mean length: 10.792899
Min length: 3

Characters and Unicode

Total characters: 3648
Distinct characters: 81
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 260
Unique (%): 76.9%

Sample

1st row: 가좌동 494-7
2nd row: 법곳동 209-19,27
3rd row: 장항동 541-1
4th row: 사리현동 183-4
5th row: 설문동 395, 395-1

Value    Count    Frequency (%)
설문동    45    6.1%
구산동    27    3.6%
성석동    24    3.2%
장항동    22    3.0%
덕이동    22    3.0%
지영동    21    2.8%
가좌동    19    2.6%
문봉동    15    2.0%
사리현동    15    2.0%
내유동    14    1.9%
Other values (380)    517    69.8%

Most occurring characters

ValueCountFrequency (%)
403
 
11.0%
1 381
 
10.4%
340
 
9.3%
- 308
 
8.4%
2 234
 
6.4%
3 189
 
5.2%
4 159
 
4.4%
6 158
 
4.3%
5 151
 
4.1%
0 131
 
3.6%
Other values (71) 1194
32.7%

Most occurring categories

Value    Count    Frequency (%)
Decimal Number    1767    48.4%
Other Letter    1070    29.3%
Space Separator    403    11.0%
Dash Punctuation    308    8.4%
Other Punctuation    96    2.6%
Open Punctuation    2    0.1%
Close Punctuation    2    0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
340
31.8%
62
 
5.8%
47
 
4.4%
42
 
3.9%
35
 
3.3%
35
 
3.3%
29
 
2.7%
28
 
2.6%
25
 
2.3%
24
 
2.2%
Other values (54) 403
37.7%
Decimal Number
ValueCountFrequency (%)
1 381
21.6%
2 234
13.2%
3 189
10.7%
4 159
9.0%
6 158
8.9%
5 151
 
8.5%
0 131
 
7.4%
7 125
 
7.1%
9 122
 
6.9%
8 117
 
6.6%
Other Punctuation
ValueCountFrequency (%)
, 93
96.9%
* 2
 
2.1%
. 1
 
1.0%
Space Separator
ValueCountFrequency (%)
403
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 308
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2578
70.7%
Hangul 1070
29.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
340
31.8%
62
 
5.8%
47
 
4.4%
42
 
3.9%
35
 
3.3%
35
 
3.3%
29
 
2.7%
28
 
2.6%
25
 
2.3%
24
 
2.2%
Other values (54) 403
37.7%
Common
ValueCountFrequency (%)
403
15.6%
1 381
14.8%
- 308
11.9%
2 234
9.1%
3 189
7.3%
4 159
 
6.2%
6 158
 
6.1%
5 151
 
5.9%
0 131
 
5.1%
7 125
 
4.8%
Other values (7) 339
13.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2578
70.7%
Hangul 1070
29.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
403
15.6%
1 381
14.8%
- 308
11.9%
2 234
9.1%
3 189
7.3%
4 159
 
6.2%
6 158
 
6.1%
5 151
 
5.9%
0 131
 
5.1%
7 125
 
4.8%
Other values (7) 339
13.1%
Hangul
ValueCountFrequency (%)
340
31.8%
62
 
5.8%
47
 
4.4%
42
 
3.9%
35
 
3.3%
35
 
3.3%
29
 
2.7%
28
 
2.6%
25
 
2.3%
24
 
2.2%
Other values (54) 403
37.7%

설비용량(KW)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 239
Distinct (%): 70.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 92.763831
Minimum: 5
Maximum: 999.92
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 3.1 KiB

Quantile statistics

Minimum: 5
5-th percentile: 10.3275
Q1: 29.6
Median: 66.99
Q3: 99
95-th percentile: 253.9085
Maximum: 999.92
Range: 994.92
Interquartile range (IQR): 69.4

Descriptive statistics

Standard deviation: 133.04356
Coefficient of variation (CV): 1.434218
Kurtosis: 26.253871
Mean: 92.763831
Median Absolute Deviation (MAD): 32.91
Skewness: 4.6947363
Sum: 31354.175
Variance: 17700.588
Monotonicity: Not monotonic
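
Several of the figures above are simple functions of the others. The sketch below recomputes them from the reported values as a consistency check; it uses the standard textbook definitions (range = max − min, IQR = Q3 − Q1, CV = standard deviation / mean, variance = standard deviation squared, mean = sum / count) and does not reproduce the profiler's own code.

```python
import math

# Values copied from the statistics reported for 설비용량(KW).
minimum, maximum = 5.0, 999.92
q1, q3 = 29.6, 99.0
mean, std = 92.763831, 133.04356
total, n = 31354.175, 338

value_range = maximum - minimum   # reported: 994.92
iqr = q3 - q1                     # reported: 69.4
cv = std / mean                   # reported: 1.434218
variance = std ** 2               # reported: 17700.588
mean_from_sum = total / n         # reported: 92.763831

print(f"Range: {value_range:.2f}")
print(f"IQR: {iqr:.1f}")
print(f"CV: {cv:.6f}")
print(f"Variance: {variance:.3f}")
print(f"Mean from sum: {mean_from_sum:.6f}")
```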
Histogram with fixed size bins (bins=50)
Most frequent values

Value    Count    Frequency (%)
99.0    12    3.6%
29.76    7    2.1%
99.9    6    1.8%
30.0    5    1.5%
98.28    5    1.5%
10.35    4    1.2%
9.0    4    1.2%
19.8    4    1.2%
29.6    4    1.2%
79.68    4    1.2%
Other values (229)    283    83.7%

Smallest values

Value    Count    Frequency (%)
5.0    1    0.3%
5.4    1    0.3%
9.0    4    1.2%
9.03    1    0.3%
9.5    1    0.3%
9.9    2    0.6%
10.08    1    0.3%
10.125    3    0.9%
10.2    3    0.9%
10.35    4    1.2%

Largest values

Value    Count    Frequency (%)
999.92    1    0.3%
994.175    1    0.3%
993.6    1    0.3%
978.4    1    0.3%
725.76    1    0.3%
702.0    1    0.3%
500.0    2    0.6%
499.8    1    0.3%
449.085    1    0.3%
385.0    1    0.3%

사업허가일
Text

Distinct: 223
Distinct (%): 66.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB

Length

Max length: 25
Median length: 10
Mean length: 10.16568
Min length: 5

Characters and Unicode

Total characters: 3436
Distinct characters: 14
Distinct categories: 4
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 157
Unique (%): 46.4%

Sample

1st row: 2006-07-06
2nd row: 2007-08-27
3rd row: 2008-09-09
4th row: 2012-07-11
5th row: 2012-07-20

Value    Count    Frequency (%)
2019-07-12    8    2.3%
2021-08-18    7    2.0%
2022-05-10    6    1.7%
2022-06-20    5    1.4%
2014-02-24    5    1.4%
2021-10-28    5    1.4%
2019-03-07    4    1.2%
2021-09-28    4    1.2%
2020-02-14    4    1.2%
2021-12-24    4    1.2%
Other values (218)    294    85.0%

Most occurring characters

ValueCountFrequency (%)
0 792
23.1%
2 772
22.5%
- 676
19.7%
1 523
15.2%
4 123
 
3.6%
8 113
 
3.3%
7 98
 
2.9%
9 85
 
2.5%
3 82
 
2.4%
6 79
 
2.3%
Other values (4) 93
 
2.7%

Most occurring categories

Value    Count    Frequency (%)
Decimal Number    2744    79.9%
Dash Punctuation    676    19.7%
Space Separator    13    0.4%
Other Punctuation    3    0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 792
28.9%
2 772
28.1%
1 523
19.1%
4 123
 
4.5%
8 113
 
4.1%
7 98
 
3.6%
9 85
 
3.1%
3 82
 
3.0%
6 79
 
2.9%
5 77
 
2.8%
Other Punctuation
ValueCountFrequency (%)
. 2
66.7%
, 1
33.3%
Dash Punctuation
ValueCountFrequency (%)
- 676
100.0%
Space Separator
ValueCountFrequency (%)
13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3436
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 792
23.1%
2 772
22.5%
- 676
19.7%
1 523
15.2%
4 123
 
3.6%
8 113
 
3.3%
7 98
 
2.9%
9 85
 
2.5%
3 82
 
2.4%
6 79
 
2.3%
Other values (4) 93
 
2.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3436
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 792
23.1%
2 772
22.5%
- 676
19.7%
1 523
15.2%
4 123
 
3.6%
8 113
 
3.3%
7 98
 
2.9%
9 85
 
2.5%
3 82
 
2.4%
6 79
 
2.3%
Other values (4) 93
 
2.7%

사업개시일
Text

MISSING 

Distinct: 216
Distinct (%): 75.5%
Missing: 52
Missing (%): 15.4%
Memory size: 2.8 KiB

Length

Max length: 14
Median length: 10
Mean length: 9.9965035
Min length: 5

Characters and Unicode

Total characters: 2859
Distinct characters: 23
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 173
Unique (%): 60.5%

Sample

1st row: 2006-09-16
2nd row: 2007-10-24
3rd row: 2008-11-18
4th row: 2012-10-23
5th row: 2012-09-27

Value    Count    Frequency (%)
2022-09-03    5    1.7%
2021-12-27    5    1.7%
2014-05-21    5    1.7%
2021-01-05    4    1.4%
2022-03-21    4    1.4%
2022-06-22    3    1.0%
2019-12-26    3    1.0%
2020-07-03    3    1.0%
2018-05-01    3    1.0%
2019-05-14    3    1.0%
Other values (206)    248    86.7%

Most occurring characters

ValueCountFrequency (%)
0 651
22.8%
2 640
22.4%
- 568
19.9%
1 462
16.2%
9 115
 
4.0%
5 78
 
2.7%
8 76
 
2.7%
7 70
 
2.4%
6 67
 
2.3%
4 64
 
2.2%
Other values (13) 68
 
2.4%

Most occurring categories

Value    Count    Frequency (%)
Decimal Number    2277    79.6%
Dash Punctuation    568    19.9%
Other Letter    9    0.3%
Other Punctuation    3    0.1%
Open Punctuation    1    < 0.1%
Close Punctuation    1    < 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 651
28.6%
2 640
28.1%
1 462
20.3%
9 115
 
5.1%
5 78
 
3.4%
8 76
 
3.3%
7 70
 
3.1%
6 67
 
2.9%
4 64
 
2.8%
3 54
 
2.4%
Other Letter
ValueCountFrequency (%)
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
Dash Punctuation
ValueCountFrequency (%)
- 568
100.0%
Other Punctuation
ValueCountFrequency (%)
. 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2850
99.7%
Hangul 9
 
0.3%

Most frequent character per script

Common
ValueCountFrequency (%)
0 651
22.8%
2 640
22.5%
- 568
19.9%
1 462
16.2%
9 115
 
4.0%
5 78
 
2.7%
8 76
 
2.7%
7 70
 
2.5%
6 67
 
2.4%
4 64
 
2.2%
Other values (4) 59
 
2.1%
Hangul
ValueCountFrequency (%)
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2850
99.7%
Hangul 9
 
0.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 651
22.8%
2 640
22.5%
- 568
19.9%
1 462
16.2%
9 115
 
4.0%
5 78
 
2.7%
8 76
 
2.7%
7 70
 
2.5%
6 67
 
2.4%
4 64
 
2.2%
Other values (4) 59
 
2.1%
Hangul
ValueCountFrequency (%)
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%

지목
Categorical

HIGH CORRELATION 

Distinct: 25
Distinct (%): 7.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB
대지
93 
공장용지
59 
34 
31 
창고용지
27 
Other values (20)
94 

Length

Max length: 9
Median length: 7
Mean length: 2.6094675
Min length: 1

Unique

Unique: 9
Unique (%): 2.7%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

ValueCountFrequency (%)
대지 93
27.5%
공장용지 59
17.5%
34
 
10.1%
31
 
9.2%
창고용지 27
 
8.0%
잡종지 23
 
6.8%
16
 
4.7%
도로 11
 
3.3%
<NA> 11
 
3.3%
목장용지 6
 
1.8%
Other values (15) 27
 
8.0%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
대지 94
27.4%
공장용지 60
17.5%
35
 
10.2%
32
 
9.3%
창고용지 27
 
7.9%
잡종지 24
 
7.0%
16
 
4.7%
도로 13
 
3.8%
na 11
 
3.2%
주유소용지 6
 
1.7%
Other values (11) 25
 
7.3%

데이터기준일
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB
2023-02-02: 338

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023-02-02
2nd row: 2023-02-02
3rd row: 2023-02-02
4th row: 2023-02-02
5th row: 2023-02-02

Common Values

Value    Count    Frequency (%)
2023-02-02    338    100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value    Count    Frequency (%)
2023-02-02    338    100.0%

Interactions


Correlations

                설비용량(KW)    지목
설비용량(KW)    1.000           0.816
지목            0.816           1.000

                설비용량(KW)    지목
설비용량(KW)    1.000           0.511
지목            0.511           1.000
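
The report does not state which association measure produced these matrices. One measure commonly used for a pair such as a categorical column and a binned numeric column is Cramér's V, computed from a contingency table of counts. The sketch below shows that calculation on toy tables only; it is an assumption for illustration, not a reconstruction of how 0.816 or 0.511 were obtained.

```python
import math

def cramers_v(table):
    """Cramér's V for a contingency table given as a list of rows of counts.

    Assumes every row and column has at least one observation, so that no
    expected count is zero.
    """
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]

    # Pearson chi-squared statistic: sum of (observed - expected)^2 / expected.
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k))

# Toy data: categories perfectly aligned vs. completely unrelated.
perfect = cramers_v([[10, 0], [0, 10]])
independent = cramers_v([[5, 5], [5, 5]])
print(perfect, independent)
```

Values near 1 indicate that knowing one variable almost determines the other, and values near 0 indicate no association.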

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

허가번호시군명용도세부용도발전소명설비 위치설비용량(KW)사업허가일사업개시일지목데이터기준일
0경기2-3고양시전기사업용발전사업용중앙 태양광발전소가좌동 494-75.02006-07-062006-09-16<NA>2023-02-02
1경기2-4고양시전기사업용발전사업용일산종하 태양광발전소법곳동 209-19,2763.362007-08-272007-10-24<NA>2023-02-02
2경기2-19고양시전기사업용발전사업용GS칼텍스장항주유소태양광발전소장항동 541-112.02008-09-092008-11-18<NA>2023-02-02
3제2012-111호(경기)고양시전기사업용발전사업용김상봉 태양광 발전소사리현동 183-420.02012-07-112012-10-23<NA>2023-02-02
4제2012-120호(경기)고양시전기사업용발전사업용설문1 태양광 발전소설문동 395, 395-199.02012-07-202012-09-27<NA>2023-02-02
5제2012-121호(경기)고양시전기사업용발전사업용설문2 태양광 발전소설문동 395, 395-199.02012-07-202012-09-27<NA>2023-02-02
6제2012-159호(경기)고양시전기사업용발전사업용민서 태양광발전소용두동 361-25734.682012-08-302012-10-11<NA>2023-02-02
7제2012-200호(경기)고양시전기사업용발전사업용다현 태양광발전소백석동 1127-650.02012-10-122013-04-05<NA>2023-02-02
8제2012-193호(경기)고양시전기사업용발전사업용성석 태양광발전소성석동 768-299.752012-10-122013-06-15<NA>2023-02-02
9제2013-21호(경기)고양시전기사업용발전사업용신화 태양광발전소주교동 93819.02013-02-222013-05-06대지2023-02-02
허가번호시군명용도세부용도발전소명설비 위치설비용량(KW)사업허가일사업개시일지목데이터기준일
328제2022-64호(고양)고양시전기사업용발전사업용마루비젼2태양광발전소문봉동 242-549.562022-11-15<NA>2023-02-02
329제2022-65호(고양)고양시전기사업용발전사업용마루비젼3태양광발전소문봉동 24249.562022-11-15<NA>2023-02-02
330제2022-66호(고양)고양시전기사업용발전사업용가나1호 태양광발전소덕이동 309-1799.122022-12-14<NA>2023-02-02
331제2022-67호(고양)고양시전기사업용발전사업용가나2호 태양광발전소덕이동 309-1799.122022-12-14<NA>2023-02-02
332제2023-1호(고양)고양시전기사업용발전사업용대성포 태양광발전소사리현동 4380.922023-01-02<NA>2023-02-02
333제2023-2호(고양)고양시전기사업용발전사업용밀라노푸드 태양광발전소구산동 449-4,598.62023-01-02<NA>2023-02-02
334제2023-3호(고양)고양시전기사업용발전사업용하늘소리 제1발전소대자동 60629.72023-01-13<NA>2023-02-02
335제2023-4호(고양)고양시전기사업용발전사업용하늘소리 제2발전소대자동 60629.72023-01-13<NA>2023-02-02
336제2023-5호(고양)고양시전기사업용발전사업용하늘소리 제3발전소대자동 60629.72023-01-13<NA>2023-02-02
337제2023-6호(고양)고양시전기사업용발전사업용일품 태양광발전소관산동 35579.462023-01-17<NA>2023-02-02

Duplicate rows

Most frequently occurring

허가번호시군명용도세부용도발전소명설비 위치설비용량(KW)사업허가일사업개시일지목데이터기준일# duplicates
0제2015-01호(고양)고양시전기사업용발전사업용성은에너지 태양광발전소구산동 1883-199.02015-01-132015-05-212023-02-022
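
The "# duplicates" column counts how many times the same full row occurs: here one row appears twice, which the overview reports as 1 duplicate row. A minimal sketch of that kind of check follows, using placeholder rows rather than records from this dataset; `find_duplicate_rows` is a name chosen for the example.

```python
from collections import Counter

def find_duplicate_rows(rows):
    """Map each row that occurs more than once to its number of occurrences."""
    counts = Counter(tuple(row) for row in rows)
    return {row: count for row, count in counts.items() if count > 1}

# Placeholder rows (hypothetical values, not taken from the dataset).
rows = [
    ("permit-1", "site A", 99.0),
    ("permit-2", "site B", 50.0),
    ("permit-1", "site A", 99.0),
]

duplicates = find_duplicate_rows(rows)
print(duplicates)
```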