Overview

Dataset statistics

Number of variables16
Number of observations518
Missing cells774
Missing cells (%)9.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory66.4 KiB
Average record size in memory131.3 B

Variable types

Numeric3
Text3
DateTime3
Categorical6
Boolean1

Dataset

Description강릉시 태양광 발전사업 허가현황에 대한 데이터로 허가번호, 허가일, 상호, 원동력종류, 설비용량, 공급전압, 설치장소, 설치면적, 설치구분, 지목, 농업진흥구역여부, 사업개시4 사업개시신고일, 상태표시, 허가취소일, 허가취소사유 등을 제공합니다.
Author강원도 강릉시
URLhttps://www.data.go.kr/data/15066899/fileData.do

Alerts

상태표시 is highly overall correlated with 허가취소사유High correlation
허가취소사유 is highly overall correlated with 설치면적(제곱미터) and 3 other fieldsHigh correlation
설치구분 is highly overall correlated with 지목 and 2 other fieldsHigh correlation
원동력종류 is highly overall correlated with 허가취소사유High correlation
순번 is highly overall correlated with 설치면적(제곱미터)High correlation
설비용량(KW) is highly overall correlated with 설치면적(제곱미터)High correlation
설치면적(제곱미터) is highly overall correlated with 순번 and 2 other fieldsHigh correlation
지목 is highly overall correlated with 설치구분High correlation
농업진흥구역여부 is highly overall correlated with 설치구분High correlation
원동력종류 is highly imbalanced (96.3%)Imbalance
공급전압(V) is highly imbalanced (61.3%)Imbalance
설치구분 is highly imbalanced (59.0%)Imbalance
농업진흥구역여부 is highly imbalanced (68.1%)Imbalance
허가취소사유 is highly imbalanced (51.0%)Imbalance
사업개시신고일 has 370 (71.4%) missing valuesMissing
허가취소일 has 404 (78.0%) missing valuesMissing
순번 has unique valuesUnique
허가번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 08:00:13.854697
Analysis finished2023-12-12 08:00:16.351130
Duration2.5 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct518
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean259.5
Minimum1
Maximum518
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.7 KiB
2023-12-12T17:00:16.731793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile26.85
Q1130.25
median259.5
Q3388.75
95-th percentile492.15
Maximum518
Range517
Interquartile range (IQR)258.5

Descriptive statistics

Standard deviation149.67799
Coefficient of variation (CV)0.57679379
Kurtosis-1.2
Mean259.5
Median Absolute Deviation (MAD)129.5
Skewness0
Sum134421
Variance22403.5
MonotonicityStrictly increasing
2023-12-12T17:00:16.882214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
326 1
 
0.2%
356 1
 
0.2%
355 1
 
0.2%
354 1
 
0.2%
353 1
 
0.2%
352 1
 
0.2%
351 1
 
0.2%
350 1
 
0.2%
349 1
 
0.2%
Other values (508) 508
98.1%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
518 1
0.2%
517 1
0.2%
516 1
0.2%
515 1
0.2%
514 1
0.2%
513 1
0.2%
512 1
0.2%
511 1
0.2%
510 1
0.2%
509 1
0.2%

허가번호
Text

UNIQUE 

Distinct518
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size4.2 KiB
2023-12-12T17:00:17.108247image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length23
Mean length22.988417
Min length22

Characters and Unicode

Total characters11908
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique518 ?
Unique (%)100.0%

Sample

1st row2018-4200000-38-5-00001
2nd row2018-4200000-38-5-00002
3rd row2018-4200000-38-5-00003
4th row2018-4200000-38-5-00004
5th row2018-4200000-38-5-00005
ValueCountFrequency (%)
2018-4200000-38-5-00001 1
 
0.2%
2019-4200000-38-5-00057 1
 
0.2%
2020-4200000-38-5-00020 1
 
0.2%
2020-4200000-38-5-00019 1
 
0.2%
2020-4200000-38-5-00018 1
 
0.2%
2020-4200000-38-5-00017 1
 
0.2%
2020-4200000-38-5-00016 1
 
0.2%
2020-4200000-38-5-00015 1
 
0.2%
2020-4200000-38-5-00014 1
 
0.2%
2020-4200000-38-5-00013 1
 
0.2%
Other values (508) 508
98.1%
2023-12-12T17:00:17.479144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 4702
39.5%
- 2072
17.4%
2 1409
 
11.8%
8 864
 
7.3%
1 642
 
5.4%
4 631
 
5.3%
3 631
 
5.3%
5 629
 
5.3%
9 146
 
1.2%
6 99
 
0.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 9836
82.6%
Dash Punctuation 2072
 
17.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 4702
47.8%
2 1409
 
14.3%
8 864
 
8.8%
1 642
 
6.5%
4 631
 
6.4%
3 631
 
6.4%
5 629
 
6.4%
9 146
 
1.5%
6 99
 
1.0%
7 83
 
0.8%
Dash Punctuation
ValueCountFrequency (%)
- 2072
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 11908
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 4702
39.5%
- 2072
17.4%
2 1409
 
11.8%
8 864
 
7.3%
1 642
 
5.4%
4 631
 
5.3%
3 631
 
5.3%
5 629
 
5.3%
9 146
 
1.2%
6 99
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 11908
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 4702
39.5%
- 2072
17.4%
2 1409
 
11.8%
8 864
 
7.3%
1 642
 
5.4%
4 631
 
5.3%
3 631
 
5.3%
5 629
 
5.3%
9 146
 
1.2%
6 99
 
0.8%
Distinct167
Distinct (%)32.2%
Missing0
Missing (%)0.0%
Memory size4.2 KiB
Minimum2018-01-02 00:00:00
Maximum2021-07-29 00:00:00
2023-12-12T17:00:17.648875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:00:17.786771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

상호
Text

Distinct503
Distinct (%)97.1%
Missing0
Missing (%)0.0%
Memory size4.2 KiB
2023-12-12T17:00:18.204198image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length16
Mean length9.9227799
Min length3

Characters and Unicode

Total characters5140
Distinct characters299
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique491 ?
Unique (%)94.8%

Sample

1st row건너들태양광발전소
2nd row신석태양광발전소
3rd row사이소재 태양광발전소
4th row선희태양광발전소
5th row코포스 태양광발전소
ValueCountFrequency (%)
태양광발전소 305
32.7%
발전소 23
 
2.5%
태양광 19
 
2.0%
방내 6
 
0.6%
비상 6
 
0.6%
낙풍태양광발전소 5
 
0.5%
주식회사 5
 
0.5%
1호 5
 
0.5%
향호솔라 4
 
0.4%
그린 4
 
0.4%
Other values (512) 551
59.1%
2023-12-12T17:00:18.709239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
458
 
8.9%
457
 
8.9%
447
 
8.7%
445
 
8.7%
437
 
8.5%
433
 
8.4%
416
 
8.1%
178
 
3.5%
1 73
 
1.4%
2 57
 
1.1%
Other values (289) 1739
33.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4401
85.6%
Space Separator 416
 
8.1%
Decimal Number 234
 
4.6%
Uppercase Letter 59
 
1.1%
Lowercase Letter 14
 
0.3%
Other Symbol 8
 
0.2%
Open Punctuation 4
 
0.1%
Close Punctuation 3
 
0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
458
 
10.4%
457
 
10.4%
447
 
10.2%
445
 
10.1%
437
 
9.9%
433
 
9.8%
178
 
4.0%
44
 
1.0%
43
 
1.0%
40
 
0.9%
Other values (255) 1419
32.2%
Uppercase Letter
ValueCountFrequency (%)
E 12
20.3%
G 7
11.9%
P 6
10.2%
V 6
10.2%
N 6
10.2%
S 5
8.5%
J 4
 
6.8%
K 2
 
3.4%
Y 2
 
3.4%
A 2
 
3.4%
Other values (5) 7
11.9%
Decimal Number
ValueCountFrequency (%)
1 73
31.2%
2 57
24.4%
3 30
12.8%
4 16
 
6.8%
5 16
 
6.8%
6 12
 
5.1%
8 8
 
3.4%
7 8
 
3.4%
0 8
 
3.4%
9 6
 
2.6%
Lowercase Letter
ValueCountFrequency (%)
g 6
42.9%
n 6
42.9%
k 1
 
7.1%
y 1
 
7.1%
Space Separator
ValueCountFrequency (%)
416
100.0%
Other Symbol
ValueCountFrequency (%)
8
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4409
85.8%
Common 658
 
12.8%
Latin 73
 
1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
458
 
10.4%
457
 
10.4%
447
 
10.1%
445
 
10.1%
437
 
9.9%
433
 
9.8%
178
 
4.0%
44
 
1.0%
43
 
1.0%
40
 
0.9%
Other values (256) 1427
32.4%
Latin
ValueCountFrequency (%)
E 12
16.4%
G 7
9.6%
g 6
8.2%
n 6
8.2%
P 6
8.2%
V 6
8.2%
N 6
8.2%
S 5
 
6.8%
J 4
 
5.5%
K 2
 
2.7%
Other values (9) 13
17.8%
Common
ValueCountFrequency (%)
416
63.2%
1 73
 
11.1%
2 57
 
8.7%
3 30
 
4.6%
4 16
 
2.4%
5 16
 
2.4%
6 12
 
1.8%
8 8
 
1.2%
7 8
 
1.2%
0 8
 
1.2%
Other values (4) 14
 
2.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4401
85.6%
ASCII 731
 
14.2%
None 8
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
458
 
10.4%
457
 
10.4%
447
 
10.2%
445
 
10.1%
437
 
9.9%
433
 
9.8%
178
 
4.0%
44
 
1.0%
43
 
1.0%
40
 
0.9%
Other values (255) 1419
32.2%
ASCII
ValueCountFrequency (%)
416
56.9%
1 73
 
10.0%
2 57
 
7.8%
3 30
 
4.1%
4 16
 
2.2%
5 16
 
2.2%
E 12
 
1.6%
6 12
 
1.6%
8 8
 
1.1%
7 8
 
1.1%
Other values (23) 83
 
11.4%
None
ValueCountFrequency (%)
8
100.0%

원동력종류
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size4.2 KiB
태양광
516 
풍력
 
2

Length

Max length3
Median length3
Mean length2.996139
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row태양광
2nd row태양광
3rd row태양광
4th row태양광
5th row태양광

Common Values

ValueCountFrequency (%)
태양광 516
99.6%
풍력 2
 
0.4%

Length

2023-12-12T17:00:18.848462image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:00:18.936289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
태양광 516
99.6%
풍력 2
 
0.4%

설비용량(KW)
Real number (ℝ)

HIGH CORRELATION 

Distinct137
Distinct (%)26.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean241.49847
Minimum10
Maximum1000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.7 KiB
2023-12-12T17:00:19.054819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10
5-th percentile19.791
Q189.625
median99
Q3200
95-th percentile998
Maximum1000
Range990
Interquartile range (IQR)110.375

Descriptive statistics

Standard deviation316.48036
Coefficient of variation (CV)1.310486
Kurtosis1.2590627
Mean241.49847
Median Absolute Deviation (MAD)33.96
Skewness1.6862716
Sum125096.21
Variance100159.82
MonotonicityNot monotonic
2023-12-12T17:00:19.176372image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
99.0 95
18.3%
100.0 46
 
8.9%
998.0 33
 
6.4%
98.0 24
 
4.6%
97.0 18
 
3.5%
99.68 17
 
3.3%
29.57 15
 
2.9%
20.0 14
 
2.7%
499.0 12
 
2.3%
97.9 10
 
1.9%
Other values (127) 234
45.2%
ValueCountFrequency (%)
10.0 1
 
0.2%
10.08 1
 
0.2%
13.0 1
 
0.2%
13.44 1
 
0.2%
14.0 1
 
0.2%
15.0 3
0.6%
15.3 1
 
0.2%
16.0 1
 
0.2%
16.38 1
 
0.2%
16.6 1
 
0.2%
ValueCountFrequency (%)
1000.0 6
 
1.2%
999.0 4
 
0.8%
998.58 1
 
0.2%
998.0 33
6.4%
996.0 4
 
0.8%
994.5 1
 
0.2%
994.0 2
 
0.4%
990.0 1
 
0.2%
986.0 4
 
0.8%
963.0 1
 
0.2%

공급전압(V)
Categorical

IMBALANCE 

Distinct5
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size4.2 KiB
220/380
414 
22900
85 
220/380
 
14
220/380
 
4
220/680
 
1

Length

Max length11
Median length7
Mean length6.7953668
Min length5

Unique

Unique1 ?
Unique (%)0.2%

Sample

1st row220/380
2nd row220/380
3rd row220/380
4th row220/380
5th row22900

Common Values

ValueCountFrequency (%)
220/380 414
79.9%
22900 85
 
16.4%
220/380 14
 
2.7%
220/380 4
 
0.8%
220/680 1
 
0.2%

Length

2023-12-12T17:00:19.286263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:00:19.376093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
220/380 432
83.4%
22900 85
 
16.4%
220/680 1
 
0.2%
Distinct333
Distinct (%)64.3%
Missing0
Missing (%)0.0%
Memory size4.2 KiB
2023-12-12T17:00:19.628379image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length92
Median length49
Mean length20.640927
Min length10

Characters and Unicode

Total characters10692
Distinct characters143
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique259 ?
Unique (%)50.0%

Sample

1st row강릉시 강동면 언별리 536-3
2nd row강릉시 신석동 230-2
3rd row강릉시 왕산면 송현리 282
4th row강릉시 박월동 407
5th row강릉시 왕산면 고단리 972, 973
ValueCountFrequency (%)
강릉시 502
 
20.9%
왕산면 119
 
5.0%
주문진읍 89
 
3.7%
강원도 83
 
3.5%
대기리 80
 
3.3%
향호리 73
 
3.0%
사천면 49
 
2.0%
연곡면 42
 
1.8%
강동면 36
 
1.5%
고단리 36
 
1.5%
Other values (546) 1290
53.8%
2023-12-12T17:00:20.034568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1884
17.6%
623
 
5.8%
1 602
 
5.6%
504
 
4.7%
502
 
4.7%
418
 
3.9%
- 394
 
3.7%
388
 
3.6%
2 378
 
3.5%
5 361
 
3.4%
Other values (133) 4638
43.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4978
46.6%
Decimal Number 3009
28.1%
Space Separator 1884
 
17.6%
Dash Punctuation 394
 
3.7%
Other Punctuation 361
 
3.4%
Close Punctuation 31
 
0.3%
Open Punctuation 31
 
0.3%
Uppercase Letter 2
 
< 0.1%
Final Punctuation 1
 
< 0.1%
Initial Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
623
 
12.5%
504
 
10.1%
502
 
10.1%
418
 
8.4%
388
 
7.8%
324
 
6.5%
173
 
3.5%
128
 
2.6%
100
 
2.0%
95
 
1.9%
Other values (113) 1723
34.6%
Decimal Number
ValueCountFrequency (%)
1 602
20.0%
2 378
12.6%
5 361
12.0%
3 305
10.1%
7 272
9.0%
6 249
8.3%
9 244
8.1%
4 239
 
7.9%
0 185
 
6.1%
8 174
 
5.8%
Other Punctuation
ValueCountFrequency (%)
, 359
99.4%
" 2
 
0.6%
Uppercase Letter
ValueCountFrequency (%)
E 1
50.0%
B 1
50.0%
Space Separator
ValueCountFrequency (%)
1884
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 394
100.0%
Close Punctuation
ValueCountFrequency (%)
) 31
100.0%
Open Punctuation
ValueCountFrequency (%)
( 31
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5712
53.4%
Hangul 4978
46.6%
Latin 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
623
 
12.5%
504
 
10.1%
502
 
10.1%
418
 
8.4%
388
 
7.8%
324
 
6.5%
173
 
3.5%
128
 
2.6%
100
 
2.0%
95
 
1.9%
Other values (113) 1723
34.6%
Common
ValueCountFrequency (%)
1884
33.0%
1 602
 
10.5%
- 394
 
6.9%
2 378
 
6.6%
5 361
 
6.3%
, 359
 
6.3%
3 305
 
5.3%
7 272
 
4.8%
6 249
 
4.4%
9 244
 
4.3%
Other values (8) 664
 
11.6%
Latin
ValueCountFrequency (%)
E 1
50.0%
B 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5712
53.4%
Hangul 4978
46.6%
Punctuation 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1884
33.0%
1 602
 
10.5%
- 394
 
6.9%
2 378
 
6.6%
5 361
 
6.3%
, 359
 
6.3%
3 305
 
5.3%
7 272
 
4.8%
6 249
 
4.4%
9 244
 
4.3%
Other values (8) 664
 
11.6%
Hangul
ValueCountFrequency (%)
623
 
12.5%
504
 
10.1%
502
 
10.1%
418
 
8.4%
388
 
7.8%
324
 
6.5%
173
 
3.5%
128
 
2.6%
100
 
2.0%
95
 
1.9%
Other values (113) 1723
34.6%
Punctuation
ValueCountFrequency (%)
1
50.0%
1
50.0%

설치면적(제곱미터)
Real number (ℝ)

HIGH CORRELATION 

Distinct383
Distinct (%)73.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3056.5726
Minimum51.42
Maximum27384
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.7 KiB
2023-12-12T17:00:20.226321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum51.42
5-th percentile99.34
Q1484.5
median1156.5
Q32787.5
95-th percentile13040.85
Maximum27384
Range27332.58
Interquartile range (IQR)2303

Descriptive statistics

Standard deviation4624.1004
Coefficient of variation (CV)1.5128384
Kurtosis5.3101394
Mean3056.5726
Median Absolute Deviation (MAD)802
Skewness2.2901768
Sum1583304.6
Variance21382304
MonotonicityNot monotonic
2023-12-12T17:00:20.353962image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
147.9 15
 
2.9%
11250.0 10
 
1.9%
1425.0 9
 
1.7%
1120.0 6
 
1.2%
860.0 6
 
1.2%
496.0 6
 
1.2%
100.69 5
 
1.0%
15000.0 5
 
1.0%
414.0 5
 
1.0%
319.0 4
 
0.8%
Other values (373) 447
86.3%
ValueCountFrequency (%)
51.42 1
 
0.2%
66.0 1
 
0.2%
72.0 1
 
0.2%
79.0 1
 
0.2%
81.0 1
 
0.2%
82.0 1
 
0.2%
85.0 1
 
0.2%
88.0 1
 
0.2%
92.0 2
0.4%
94.0 3
0.6%
ValueCountFrequency (%)
27384.0 1
0.2%
25255.0 1
0.2%
23767.0 1
0.2%
23418.0 1
0.2%
22000.0 1
0.2%
20993.0 1
0.2%
20000.0 1
0.2%
19653.0 1
0.2%
18604.0 1
0.2%
16500.0 1
0.2%

설치구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct8
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size4.2 KiB
지상
346 
건물
147 
건물+토지
 
8
건물(버섯재배사)
 
6
수상
 
4
Other values (3)
 
7

Length

Max length10
Median length2
Mean length2.1853282
Min length2

Unique

Unique1 ?
Unique (%)0.2%

Sample

1st row지상
2nd row지상
3rd row지상
4th row건물
5th row지상

Common Values

ValueCountFrequency (%)
지상 346
66.8%
건물 147
28.4%
건물+토지 8
 
1.5%
건물(버섯재배사) 6
 
1.2%
수상 4
 
0.8%
건물+지상 4
 
0.8%
건물+지상(주차장) 2
 
0.4%
지상 1
 
0.2%

Length

2023-12-12T17:00:20.497139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:00:20.624486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
지상 347
67.0%
건물 147
28.4%
건물+토지 8
 
1.5%
건물(버섯재배사 6
 
1.2%
수상 4
 
0.8%
건물+지상 4
 
0.8%
건물+지상(주차장 2
 
0.4%

지목
Categorical

HIGH CORRELATION 

Distinct29
Distinct (%)5.6%
Missing0
Missing (%)0.0%
Memory size4.2 KiB
임야
186 
95 
80 
73 
잡종지
19 
Other values (24)
65 

Length

Max length11
Median length7
Mean length1.8359073
Min length1

Unique

Unique15 ?
Unique (%)2.9%

Sample

1st row
2nd row
3rd row
4th row창고용지
5th row전+임야

Common Values

ValueCountFrequency (%)
임야 186
35.9%
95
18.3%
80
15.4%
73
 
14.1%
잡종지 19
 
3.7%
공장용지 15
 
2.9%
창고용지 14
 
2.7%
임야+전 4
 
0.8%
전+임야 4
 
0.8%
목장용지 4
 
0.8%
Other values (19) 24
 
4.6%

Length

2023-12-12T17:00:20.770370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
임야 186
35.9%
95
18.3%
80
15.4%
73
 
14.1%
잡종지 19
 
3.7%
공장용지 15
 
2.9%
창고용지 14
 
2.7%
임야+전 4
 
0.8%
전+임야 4
 
0.8%
목장용지 4
 
0.8%
Other values (19) 24
 
4.6%

농업진흥구역여부
Boolean

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size650.0 B
False
488 
True
 
30
ValueCountFrequency (%)
False 488
94.2%
True 30
 
5.8%
2023-12-12T17:00:20.874370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

사업개시신고일
Date

MISSING 

Distinct86
Distinct (%)58.1%
Missing370
Missing (%)71.4%
Memory size4.2 KiB
Minimum2018-06-14 00:00:00
Maximum2021-08-23 00:00:00
2023-12-12T17:00:20.990467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:00:21.155535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

상태표시
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size4.2 KiB
인허가
223 
사업개시
149 
허가취소
114 
공사계획
32 

Length

Max length4
Median length4
Mean length3.5694981
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row사업개시
2nd row인허가
3rd row인허가
4th row사업개시
5th row허가취소

Common Values

ValueCountFrequency (%)
인허가 223
43.1%
사업개시 149
28.8%
허가취소 114
22.0%
공사계획 32
 
6.2%

Length

2023-12-12T17:00:21.308745image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:00:21.413822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
인허가 223
43.1%
사업개시 149
28.8%
허가취소 114
22.0%
공사계획 32
 
6.2%

허가취소일
Date

MISSING 

Distinct46
Distinct (%)40.4%
Missing404
Missing (%)78.0%
Memory size4.2 KiB
Minimum2018-09-04 00:00:00
Maximum2021-09-08 00:00:00
2023-12-12T17:00:21.554142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:00:21.723649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=46)

허가취소사유
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size4.2 KiB
<NA>
404 
허가자진반납
113 
개발행위 불허가
 
1

Length

Max length8
Median length4
Mean length4.4440154
Min length4

Unique

Unique1 ?
Unique (%)0.2%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row허가자진반납

Common Values

ValueCountFrequency (%)
<NA> 404
78.0%
허가자진반납 113
 
21.8%
개발행위 불허가 1
 
0.2%

Length

2023-12-12T17:00:21.891497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:00:22.029148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 404
77.8%
허가자진반납 113
 
21.8%
개발행위 1
 
0.2%
불허가 1
 
0.2%

Interactions

2023-12-12T17:00:15.514031image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:00:14.918698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:00:15.217062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:00:15.618587image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:00:15.013124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:00:15.313172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:00:15.732664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:00:15.126118image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:00:15.401940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T17:00:22.136969image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번원동력종류설비용량(KW)공급전압(V)설치면적(제곱미터)설치구분지목농업진흥구역여부사업개시신고일상태표시허가취소일허가취소사유
순번1.0000.1730.5320.6130.5310.5100.6330.6480.9760.5970.9730.144
원동력종류0.1731.0000.1230.0900.3580.0000.0000.000NaN0.000NaNNaN
설비용량(KW)0.5320.1231.0000.8050.8760.3220.7750.1140.8710.2260.9430.000
공급전압(V)0.6130.0900.8051.0000.7490.2210.3500.0721.0000.2140.8930.110
설치면적(제곱미터)0.5310.3580.8760.7491.0000.5130.7680.0750.9670.1790.9090.668
설치구분0.5100.0000.3220.2210.5131.0000.8460.6930.6990.5520.9371.000
지목0.6330.0000.7750.3500.7680.8461.0000.2700.9280.6450.9460.000
농업진흥구역여부0.6480.0000.1140.0720.0750.6930.2701.0001.0000.2611.0000.000
사업개시신고일0.976NaN0.8711.0000.9670.6990.9281.0001.000NaNNaNNaN
상태표시0.5970.0000.2260.2140.1790.5520.6450.261NaN1.000NaNNaN
허가취소일0.973NaN0.9430.8930.9090.9370.9461.000NaNNaN1.0000.510
허가취소사유0.144NaN0.0000.1100.6681.0000.0000.000NaNNaN0.5101.000
2023-12-12T17:00:22.313198image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
상태표시허가취소사유지목공급전압(V)농업진흥구역여부설치구분원동력종류
상태표시1.0001.0000.3510.1760.1730.2710.000
허가취소사유1.0001.0000.0000.0710.0000.9871.000
지목0.3510.0001.0000.1720.2090.5180.000
공급전압(V)0.1760.0710.1721.0000.0880.1370.110
농업진흥구역여부0.1730.0000.2090.0881.0000.5260.000
설치구분0.2710.9870.5180.1370.5261.0000.000
원동력종류0.0001.0000.0000.1100.0000.0001.000
2023-12-12T17:00:22.468631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번설비용량(KW)설치면적(제곱미터)원동력종류공급전압(V)설치구분지목농업진흥구역여부상태표시허가취소사유
순번1.000-0.414-0.6140.1320.2980.2730.2790.5000.3990.149
설비용량(KW)-0.4141.0000.8440.0930.4600.1590.4030.0860.1360.000
설치면적(제곱미터)-0.6140.8441.0000.2730.4040.2750.3940.0570.1070.655
원동력종류0.1320.0930.2731.0000.1100.0000.0000.0000.0001.000
공급전압(V)0.2980.4600.4040.1101.0000.1370.1720.0880.1760.071
설치구분0.2730.1590.2750.0000.1371.0000.5180.5260.2710.987
지목0.2790.4030.3940.0000.1720.5181.0000.2090.3510.000
농업진흥구역여부0.5000.0860.0570.0000.0880.5260.2091.0000.1730.000
상태표시0.3990.1360.1070.0000.1760.2710.3510.1731.0001.000
허가취소사유0.1490.0000.6551.0000.0710.9870.0000.0001.0001.000

Missing values

2023-12-12T17:00:15.874809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:00:16.157054image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T17:00:16.286455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

순번허가번호허가일자상호원동력종류설비용량(KW)공급전압(V)설치장소설치면적(제곱미터)설치구분지목농업진흥구역여부사업개시신고일상태표시허가취소일허가취소사유
012018-4200000-38-5-000012018-01-02건너들태양광발전소태양광100.0220/380강릉시 강동면 언별리 536-32000.0지상N2019-10-28사업개시<NA><NA>
122018-4200000-38-5-000022018-01-03신석태양광발전소태양광100.0220/380강릉시 신석동 230-21756.0지상N<NA>인허가<NA><NA>
232018-4200000-38-5-000032018-01-12사이소재 태양광발전소태양광495.0220/380강릉시 왕산면 송현리 28212595.0지상N<NA>인허가<NA><NA>
342018-4200000-38-5-000042018-01-16선희태양광발전소태양광78.0220/380강릉시 박월동 407597.0건물창고용지N2018-09-28사업개시<NA><NA>
452018-4200000-38-5-000052018-01-17코포스 태양광발전소태양광677.022900강릉시 왕산면 고단리 972, 97319653.0지상전+임야N<NA>허가취소2020-07-22허가자진반납
562018-4200000-38-5-000062018-01-17우주태양광발전소태양광99.0220/380강릉시 성산면 관음리 624, 828-22826.0지상N2019-11-11사업개시<NA><NA>
672018-4200000-38-5-000072018-01-18Sky 태양광 발전소태양광298.0220/380강릉시 강동면 임곡리 67, 67-54668.0지상하천+임야N2018-09-28사업개시<NA><NA>
782018-4200000-38-5-000082018-01-29남양 태양광발전소태양광100.0220/380강릉시 옥계면 남양리 1203-3, 1203-41717.0지상대+전N2019-03-04사업개시<NA><NA>
892018-4200000-38-5-000092018-01-29에스팩토리 태양광발전소태양광1000.022900강릉시 강동면 모전리 산31311161.0지상임야N<NA>인허가<NA><NA>
9102018-4200000-38-5-000102018-01-29진영 태양광발전소태양광1000.022900강릉시 강동면 모전리 산31310916.0지상임야N<NA>인허가<NA><NA>
순번허가번호허가일자상호원동력종류설비용량(KW)공급전압(V)설치장소설치면적(제곱미터)설치구분지목농업진흥구역여부사업개시신고일상태표시허가취소일허가취소사유
5085092021-4200000-38-5-000482021-07-15향호3 태양광발전소태양광99.19220/380강원도 강릉시 주문진읍 향호리 810486.0지상N<NA>인허가<NA><NA>
5095102021-4200000-38-5-000492021-07-20삼팔 태양광발전소태양광95.76220/380강원도 강릉시 사천면 판교리 198-3488.0건물N<NA>인허가<NA><NA>
5105112021-4200000-38-5-000502021-07-21향호Eng1호 태양광발전소태양광81.0220/380강원도 강릉시 주문진읍 향호리 819-7260.0건물+토지N<NA>인허가<NA><NA>
5115122021-4200000-38-5-000512021-07-21향호Eng2호 태양광발전소태양광99.36220/380강원도 강릉시 주문진읍 향호리 819-7319.0건물+토지N<NA>인허가<NA><NA>
5125132021-4200000-38-5-000522021-07-21향호Eng3호 태양광발전소태양광99.36220/380강원도 강릉시 주문진읍 향호리 819-7319.0건물+토지N<NA>인허가<NA><NA>
5135142021-4200000-38-5-000532021-07-21향호Eng4호 태양광발전소태양광99.36220/380강원도 강릉시 주문진읍 향호리 819-3319.0건물+토지N<NA>인허가<NA><NA>
5145152021-4200000-38-5-000542021-07-21향호Eng5호 태양광발전소태양광99.36220/380강원도 강릉시 주문진읍 향호리 819-3319.0건물+토지N<NA>인허가<NA><NA>
5155162021-4200000-38-5-000552021-07-21향호Eng6호 태양광발전소태양광48.6220/380강원도 강릉시 주문진읍 향호리 819-3156.0건물+토지N<NA>인허가<NA><NA>
5165172021-4200000-38-5-000562021-07-29장현 태양광발전소태양광67.41220/380강원도 강릉시 모산로 209번길 25-1321.0건물+토지N<NA>인허가<NA><NA>
5175182021-4200000-38-5-000572021-07-29보권 태양광발전소태양광67.41220/380강원도 강릉시 모산로 209번길 25321.0건물+토지N<NA>인허가<NA><NA>