Overview

Dataset statistics

Number of variables8
Number of observations7557
Missing cells1
Missing cells (%)< 0.1%
Duplicate rows2
Duplicate rows (%)< 0.1%
Total size in memory487.2 KiB
Average record size in memory66.0 B

Variable types

Numeric2
Text4
Categorical2

Dataset

Description전기사업체(태양광) 현황으로 허가일자, 상호, 설비용량, 설치장소, 원동력종류, 상태, 사업개시일을 제공합니다.
Author충청북도
URLhttps://www.data.go.kr/data/15033245/fileData.do

Alerts

원동력의 종류 has constant value ""Constant
Dataset has 2 (< 0.1%) duplicate rowsDuplicates
상세영업상태명 is highly imbalanced (97.4%)Imbalance

Reproduction

Analysis started2023-12-12 12:40:23.488310
Analysis finished2023-12-12 12:40:25.749470
Duration2.26 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

인허가일자
Real number (ℝ)

Distinct1781
Distinct (%)23.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20182785
Minimum20050706
Maximum20221125
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size66.5 KiB
2023-12-12T21:40:25.819636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20050706
5-th percentile20131002
Q120170620
median20190410
Q320200928
95-th percentile20211214
Maximum20221125
Range170419
Interquartile range (IQR)30308

Descriptive statistics

Standard deviation26759.85
Coefficient of variation (CV)0.001325875
Kurtosis1.696053
Mean20182785
Median Absolute Deviation (MAD)10813
Skewness-1.1657764
Sum1.5252131 × 1011
Variance7.1608955 × 108
MonotonicityNot monotonic
2023-12-12T21:40:25.973349image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20190820 42
 
0.6%
20180919 35
 
0.5%
20170206 32
 
0.4%
20161017 31
 
0.4%
20160525 30
 
0.4%
20180927 29
 
0.4%
20180123 26
 
0.3%
20200611 26
 
0.3%
20191231 25
 
0.3%
20180911 24
 
0.3%
Other values (1771) 7257
96.0%
ValueCountFrequency (%)
20050706 1
< 0.1%
20060227 1
< 0.1%
20060425 1
< 0.1%
20060726 2
< 0.1%
20060906 1
< 0.1%
20060921 1
< 0.1%
20070228 2
< 0.1%
20070403 1
< 0.1%
20070509 2
< 0.1%
20070619 1
< 0.1%
ValueCountFrequency (%)
20221125 1
 
< 0.1%
20221107 1
 
< 0.1%
20221102 1
 
< 0.1%
20221028 1
 
< 0.1%
20221018 4
0.1%
20221006 1
 
< 0.1%
20221005 6
0.1%
20221004 1
 
< 0.1%
20220930 3
< 0.1%
20220929 1
 
< 0.1%
Distinct6950
Distinct (%)92.0%
Missing0
Missing (%)0.0%
Memory size59.2 KiB
2023-12-12T21:40:26.342383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length30
Mean length10.320895
Min length1

Characters and Unicode

Total characters77995
Distinct characters652
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6522 ?
Unique (%)86.3%

Sample

1st row중부철망태양광발전소
2nd row유한회사청주쏠라팜태양광발전소
3rd row유한회사남동태양광발전소
4th row유한회사엘케이에너지태양광발전소
5th row대성1호태양광발전소
ValueCountFrequency (%)
태양광발전소 5173
37.9%
발전소 135
 
1.0%
2호 70
 
0.5%
태양광 59
 
0.4%
1호 56
 
0.4%
주식회사 34
 
0.2%
3호 33
 
0.2%
태양광발전소2호기 18
 
0.1%
2호기 16
 
0.1%
5호 15
 
0.1%
Other values (6750) 8042
58.9%
2023-12-12T21:40:26.902998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7098
 
9.1%
7058
 
9.0%
7033
 
9.0%
7014
 
9.0%
7012
 
9.0%
6969
 
8.9%
6097
 
7.8%
2835
 
3.6%
1 1208
 
1.5%
2 1193
 
1.5%
Other values (642) 24478
31.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 66749
85.6%
Space Separator 6097
 
7.8%
Decimal Number 3867
 
5.0%
Uppercase Letter 493
 
0.6%
Open Punctuation 252
 
0.3%
Close Punctuation 252
 
0.3%
Lowercase Letter 151
 
0.2%
Dash Punctuation 86
 
0.1%
Other Symbol 20
 
< 0.1%
Other Punctuation 19
 
< 0.1%
Other values (2) 9
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7098
 
10.6%
7058
 
10.6%
7033
 
10.5%
7014
 
10.5%
7012
 
10.5%
6969
 
10.4%
2835
 
4.2%
521
 
0.8%
515
 
0.8%
489
 
0.7%
Other values (578) 20205
30.3%
Uppercase Letter
ValueCountFrequency (%)
S 72
14.6%
J 53
 
10.8%
K 35
 
7.1%
C 32
 
6.5%
M 32
 
6.5%
H 31
 
6.3%
P 25
 
5.1%
B 24
 
4.9%
G 24
 
4.9%
L 23
 
4.7%
Other values (13) 142
28.8%
Lowercase Letter
ValueCountFrequency (%)
e 23
15.2%
o 23
15.2%
k 19
12.6%
c 16
10.6%
p 15
9.9%
i 7
 
4.6%
r 7
 
4.6%
s 6
 
4.0%
n 6
 
4.0%
a 5
 
3.3%
Other values (9) 24
15.9%
Decimal Number
ValueCountFrequency (%)
1 1208
31.2%
2 1193
30.9%
3 538
13.9%
4 277
 
7.2%
5 217
 
5.6%
6 131
 
3.4%
0 96
 
2.5%
7 89
 
2.3%
8 61
 
1.6%
9 57
 
1.5%
Other Punctuation
ValueCountFrequency (%)
& 9
47.4%
" 8
42.1%
. 2
 
10.5%
Letter Number
ValueCountFrequency (%)
3
50.0%
2
33.3%
1
 
16.7%
Space Separator
ValueCountFrequency (%)
6097
100.0%
Open Punctuation
ValueCountFrequency (%)
( 252
100.0%
Close Punctuation
ValueCountFrequency (%)
) 252
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 86
100.0%
Other Symbol
ValueCountFrequency (%)
20
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 66768
85.6%
Common 10576
 
13.6%
Latin 650
 
0.8%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7098
 
10.6%
7058
 
10.6%
7033
 
10.5%
7014
 
10.5%
7012
 
10.5%
6969
 
10.4%
2835
 
4.2%
521
 
0.8%
515
 
0.8%
489
 
0.7%
Other values (578) 20224
30.3%
Latin
ValueCountFrequency (%)
S 72
 
11.1%
J 53
 
8.2%
K 35
 
5.4%
C 32
 
4.9%
M 32
 
4.9%
H 31
 
4.8%
P 25
 
3.8%
B 24
 
3.7%
G 24
 
3.7%
L 23
 
3.5%
Other values (35) 299
46.0%
Common
ValueCountFrequency (%)
6097
57.6%
1 1208
 
11.4%
2 1193
 
11.3%
3 538
 
5.1%
4 277
 
2.6%
( 252
 
2.4%
) 252
 
2.4%
5 217
 
2.1%
6 131
 
1.2%
0 96
 
0.9%
Other values (8) 315
 
3.0%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 66748
85.6%
ASCII 11220
 
14.4%
None 20
 
< 0.1%
Number Forms 6
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
7098
 
10.6%
7058
 
10.6%
7033
 
10.5%
7014
 
10.5%
7012
 
10.5%
6969
 
10.4%
2835
 
4.2%
521
 
0.8%
515
 
0.8%
489
 
0.7%
Other values (577) 20204
30.3%
ASCII
ValueCountFrequency (%)
6097
54.3%
1 1208
 
10.8%
2 1193
 
10.6%
3 538
 
4.8%
4 277
 
2.5%
( 252
 
2.2%
) 252
 
2.2%
5 217
 
1.9%
6 131
 
1.2%
0 96
 
0.9%
Other values (50) 959
 
8.5%
None
ValueCountFrequency (%)
20
100.0%
Number Forms
ValueCountFrequency (%)
3
50.0%
2
33.3%
1
 
16.7%
CJK
ValueCountFrequency (%)
1
100.0%
Distinct6036
Distinct (%)79.9%
Missing1
Missing (%)< 0.1%
Memory size59.2 KiB
2023-12-12T21:40:27.278448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length111
Median length84
Mean length26.9955
Min length1

Characters and Unicode

Total characters203978
Distinct characters427
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5195 ?
Unique (%)68.8%

Sample

1st row충청북도 보은군 보은읍 용암리 288번지 4호
2nd row충청북도 보은군 마로면 수문리 산 341-1 산 34-4, 341-6
3rd row충청북도 보은군 마로면 수문리 341번지 2호 (341-1,산34-1 포함)
4th row충청북도 보은군 마로면 수문리 산 34번지 1호
5th row충청북도 보은군 마로면 수문리 155번지 (156번지 포함)
ValueCountFrequency (%)
충청북도 7461
 
15.9%
청주시 1627
 
3.5%
옥천군 979
 
2.1%
음성군 846
 
1.8%
충주시 789
 
1.7%
보은군 748
 
1.6%
722
 
1.5%
청원구 680
 
1.5%
1호 678
 
1.4%
563
 
1.2%
Other values (6147) 31729
67.8%
2023-12-12T21:40:27.782954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
39430
 
19.3%
10326
 
5.1%
8292
 
4.1%
7860
 
3.9%
7795
 
3.8%
1 7356
 
3.6%
7060
 
3.5%
5332
 
2.6%
2 5108
 
2.5%
3 4679
 
2.3%
Other values (417) 100740
49.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 117411
57.6%
Space Separator 39430
 
19.3%
Decimal Number 37865
 
18.6%
Dash Punctuation 4301
 
2.1%
Other Punctuation 2959
 
1.5%
Open Punctuation 992
 
0.5%
Close Punctuation 990
 
0.5%
Uppercase Letter 28
 
< 0.1%
Lowercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10326
 
8.8%
8292
 
7.1%
7860
 
6.7%
7795
 
6.6%
7060
 
6.0%
5332
 
4.5%
4646
 
4.0%
4611
 
3.9%
3831
 
3.3%
2955
 
2.5%
Other values (382) 54703
46.6%
Uppercase Letter
ValueCountFrequency (%)
K 4
14.3%
C 4
14.3%
S 4
14.3%
G 2
7.1%
L 2
7.1%
T 2
7.1%
U 2
7.1%
R 2
7.1%
B 1
 
3.6%
A 1
 
3.6%
Other values (4) 4
14.3%
Decimal Number
ValueCountFrequency (%)
1 7356
19.4%
2 5108
13.5%
3 4679
12.4%
4 3639
9.6%
5 3577
9.4%
6 3334
8.8%
7 2760
 
7.3%
9 2576
 
6.8%
8 2568
 
6.8%
0 2268
 
6.0%
Other Punctuation
ValueCountFrequency (%)
, 2935
99.2%
. 13
 
0.4%
/ 11
 
0.4%
Open Punctuation
ValueCountFrequency (%)
( 878
88.5%
[ 114
 
11.5%
Close Punctuation
ValueCountFrequency (%)
) 876
88.5%
] 114
 
11.5%
Lowercase Letter
ValueCountFrequency (%)
k 1
50.0%
s 1
50.0%
Space Separator
ValueCountFrequency (%)
39430
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4301
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 117325
57.5%
Common 86537
42.4%
Han 86
 
< 0.1%
Latin 30
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10326
 
8.8%
8292
 
7.1%
7860
 
6.7%
7795
 
6.6%
7060
 
6.0%
5332
 
4.5%
4646
 
4.0%
4611
 
3.9%
3831
 
3.3%
2955
 
2.5%
Other values (376) 54617
46.6%
Common
ValueCountFrequency (%)
39430
45.6%
1 7356
 
8.5%
2 5108
 
5.9%
3 4679
 
5.4%
- 4301
 
5.0%
4 3639
 
4.2%
5 3577
 
4.1%
6 3334
 
3.9%
, 2935
 
3.4%
7 2760
 
3.2%
Other values (9) 9418
 
10.9%
Latin
ValueCountFrequency (%)
K 4
13.3%
C 4
13.3%
S 4
13.3%
G 2
 
6.7%
L 2
 
6.7%
T 2
 
6.7%
U 2
 
6.7%
R 2
 
6.7%
B 1
 
3.3%
A 1
 
3.3%
Other values (6) 6
20.0%
Han
ValueCountFrequency (%)
32
37.2%
31
36.0%
11
 
12.8%
10
 
11.6%
1
 
1.2%
1
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 117325
57.5%
ASCII 86567
42.4%
CJK 86
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
39430
45.5%
1 7356
 
8.5%
2 5108
 
5.9%
3 4679
 
5.4%
- 4301
 
5.0%
4 3639
 
4.2%
5 3577
 
4.1%
6 3334
 
3.9%
, 2935
 
3.4%
7 2760
 
3.2%
Other values (25) 9448
 
10.9%
Hangul
ValueCountFrequency (%)
10326
 
8.8%
8292
 
7.1%
7860
 
6.7%
7795
 
6.6%
7060
 
6.0%
5332
 
4.5%
4646
 
4.0%
4611
 
3.9%
3831
 
3.3%
2955
 
2.5%
Other values (376) 54617
46.6%
CJK
ValueCountFrequency (%)
32
37.2%
31
36.0%
11
 
12.8%
10
 
11.6%
1
 
1.2%
1
 
1.2%
Distinct2535
Distinct (%)33.5%
Missing0
Missing (%)0.0%
Memory size59.2 KiB
2023-12-12T21:40:28.041752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length54
Median length1
Mean length11.304618
Min length1

Characters and Unicode

Total characters85429
Distinct characters468
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2101 ?
Unique (%)27.8%

Sample

1st row
2nd row
3rd row
4th row
5th row
ValueCountFrequency (%)
충청북도 3191
 
18.0%
청주시 682
 
3.8%
음성군 395
 
2.2%
옥천군 358
 
2.0%
보은군 341
 
1.9%
충주시 338
 
1.9%
영동군 330
 
1.9%
청원구 249
 
1.4%
제천시 243
 
1.4%
진천군 243
 
1.4%
Other values (3632) 11365
64.1%
2023-12-12T21:40:28.456032image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
18866
22.1%
4354
 
5.1%
3613
 
4.2%
3355
 
3.9%
3340
 
3.9%
1 2700
 
3.2%
2052
 
2.4%
1971
 
2.3%
1904
 
2.2%
1902
 
2.2%
Other values (458) 41372
48.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 50115
58.7%
Space Separator 18866
 
22.1%
Decimal Number 12722
 
14.9%
Dash Punctuation 1553
 
1.8%
Other Punctuation 873
 
1.0%
Open Punctuation 629
 
0.7%
Close Punctuation 629
 
0.7%
Uppercase Letter 40
 
< 0.1%
Lowercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4354
 
8.7%
3613
 
7.2%
3355
 
6.7%
3340
 
6.7%
2052
 
4.1%
1971
 
3.9%
1904
 
3.8%
1902
 
3.8%
1324
 
2.6%
1319
 
2.6%
Other values (421) 24981
49.8%
Uppercase Letter
ValueCountFrequency (%)
S 7
17.5%
E 4
10.0%
C 4
10.0%
K 4
10.0%
R 3
 
7.5%
T 2
 
5.0%
U 2
 
5.0%
G 2
 
5.0%
D 2
 
5.0%
L 2
 
5.0%
Other values (6) 8
20.0%
Decimal Number
ValueCountFrequency (%)
1 2700
21.2%
2 1831
14.4%
3 1610
12.7%
4 1329
10.4%
5 1110
8.7%
6 958
 
7.5%
7 851
 
6.7%
8 803
 
6.3%
0 795
 
6.2%
9 735
 
5.8%
Other Punctuation
ValueCountFrequency (%)
, 870
99.7%
/ 2
 
0.2%
* 1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 570
90.6%
[ 59
 
9.4%
Close Punctuation
ValueCountFrequency (%)
) 570
90.6%
] 59
 
9.4%
Lowercase Letter
ValueCountFrequency (%)
k 1
50.0%
s 1
50.0%
Space Separator
ValueCountFrequency (%)
18866
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1553
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 50115
58.7%
Common 35272
41.3%
Latin 42
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4354
 
8.7%
3613
 
7.2%
3355
 
6.7%
3340
 
6.7%
2052
 
4.1%
1971
 
3.9%
1904
 
3.8%
1902
 
3.8%
1324
 
2.6%
1319
 
2.6%
Other values (421) 24981
49.8%
Common
ValueCountFrequency (%)
18866
53.5%
1 2700
 
7.7%
2 1831
 
5.2%
3 1610
 
4.6%
- 1553
 
4.4%
4 1329
 
3.8%
5 1110
 
3.1%
6 958
 
2.7%
, 870
 
2.5%
7 851
 
2.4%
Other values (9) 3594
 
10.2%
Latin
ValueCountFrequency (%)
S 7
16.7%
E 4
 
9.5%
C 4
 
9.5%
K 4
 
9.5%
R 3
 
7.1%
T 2
 
4.8%
U 2
 
4.8%
G 2
 
4.8%
D 2
 
4.8%
L 2
 
4.8%
Other values (8) 10
23.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 50115
58.7%
ASCII 35314
41.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
18866
53.4%
1 2700
 
7.6%
2 1831
 
5.2%
3 1610
 
4.6%
- 1553
 
4.4%
4 1329
 
3.8%
5 1110
 
3.1%
6 958
 
2.7%
, 870
 
2.5%
7 851
 
2.4%
Other values (27) 3636
 
10.3%
Hangul
ValueCountFrequency (%)
4354
 
8.7%
3613
 
7.2%
3355
 
6.7%
3340
 
6.7%
2052
 
4.1%
1971
 
3.9%
1904
 
3.8%
1902
 
3.8%
1324
 
2.6%
1319
 
2.6%
Other values (421) 24981
49.8%

원동력의 종류
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size59.2 KiB
태양광
7557 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row태양광
2nd row태양광
3rd row태양광
4th row태양광
5th row태양광

Common Values

ValueCountFrequency (%)
태양광 7557
100.0%

Length

2023-12-12T21:40:28.576007image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:40:28.668789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
태양광 7557
100.0%

설비용량(KW)
Real number (ℝ)

Distinct1732
Distinct (%)22.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean153.70137
Minimum3
Maximum3000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size66.5 KiB
2023-12-12T21:40:28.764419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3
5-th percentile18.72
Q150.35
median99
Q399.75
95-th percentile499.32
Maximum3000
Range2997
Interquartile range (IQR)49.4

Descriptive statistics

Standard deviation290.1816
Coefficient of variation (CV)1.8879571
Kurtosis37.728807
Mean153.70137
Median Absolute Deviation (MAD)6.25
Skewness5.5583574
Sum1161521.3
Variance84205.361
MonotonicityNot monotonic
2023-12-12T21:40:28.885128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
99.0 557
 
7.4%
99.9 298
 
3.9%
99.6 219
 
2.9%
99.45 179
 
2.4%
99.84 175
 
2.3%
19.8 158
 
2.1%
99.36 157
 
2.1%
97.92 154
 
2.0%
99.2 142
 
1.9%
99.96 121
 
1.6%
Other values (1722) 5397
71.4%
ValueCountFrequency (%)
3.0 2
 
< 0.1%
3.84 1
 
< 0.1%
5.0 1
 
< 0.1%
6.0 1
 
< 0.1%
6.58 1
 
< 0.1%
7.0 1
 
< 0.1%
8.1 1
 
< 0.1%
8.72 1
 
< 0.1%
9.0 9
0.1%
9.3 3
 
< 0.1%
ValueCountFrequency (%)
3000.0 2
< 0.1%
2997.0 2
< 0.1%
2996.3 1
< 0.1%
2995.47 1
< 0.1%
2994.84 1
< 0.1%
2990.0 1
< 0.1%
2988.6 1
< 0.1%
2984.85 2
< 0.1%
2965.92 1
< 0.1%
2821.09 1
< 0.1%
Distinct2021
Distinct (%)26.7%
Missing0
Missing (%)0.0%
Memory size59.2 KiB
2023-12-12T21:40:29.181085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length9.9997353
Min length8

Characters and Unicode

Total characters75568
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique733 ?
Unique (%)9.7%

Sample

1st row2016-07-01
2nd row2016-01-04
3rd row2015-09-01
4th row2016-01-14
5th row2015-03-20
ValueCountFrequency (%)
2021-12-31 50
 
0.7%
2021-12-29 44
 
0.6%
2020-12-30 44
 
0.6%
2020-12-29 41
 
0.5%
2019-01-28 39
 
0.5%
2020-12-21 35
 
0.5%
2020-12-14 33
 
0.4%
2022-06-29 31
 
0.4%
2019-05-03 28
 
0.4%
2021-12-27 25
 
0.3%
Other values (2011) 7187
95.1%
2023-12-12T21:40:29.619561image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 17865
23.6%
0 17697
23.4%
- 15112
20.0%
1 12060
16.0%
9 2640
 
3.5%
8 2020
 
2.7%
3 1861
 
2.5%
4 1644
 
2.2%
7 1596
 
2.1%
6 1559
 
2.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 60456
80.0%
Dash Punctuation 15112
 
20.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 17865
29.6%
0 17697
29.3%
1 12060
19.9%
9 2640
 
4.4%
8 2020
 
3.3%
3 1861
 
3.1%
4 1644
 
2.7%
7 1596
 
2.6%
6 1559
 
2.6%
5 1514
 
2.5%
Dash Punctuation
ValueCountFrequency (%)
- 15112
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 75568
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 17865
23.6%
0 17697
23.4%
- 15112
20.0%
1 12060
16.0%
9 2640
 
3.5%
8 2020
 
2.7%
3 1861
 
2.5%
4 1644
 
2.2%
7 1596
 
2.1%
6 1559
 
2.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 75568
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 17865
23.6%
0 17697
23.4%
- 15112
20.0%
1 12060
16.0%
9 2640
 
3.5%
8 2020
 
2.7%
3 1861
 
2.5%
4 1644
 
2.2%
7 1596
 
2.1%
6 1559
 
2.1%

상세영업상태명
Categorical

IMBALANCE 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size59.2 KiB
사업개시
7512 
공사진행
 
34
인허가
 
6
폐업
 
4
업종변경등록말소
 
1

Length

Max length8
Median length4
Mean length3.9986767
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row사업개시
2nd row사업개시
3rd row사업개시
4th row사업개시
5th row사업개시

Common Values

ValueCountFrequency (%)
사업개시 7512
99.4%
공사진행 34
 
0.4%
인허가 6
 
0.1%
폐업 4
 
0.1%
업종변경등록말소 1
 
< 0.1%

Length

2023-12-12T21:40:29.765288image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:40:29.871409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사업개시 7512
99.4%
공사진행 34
 
0.4%
인허가 6
 
0.1%
폐업 4
 
0.1%
업종변경등록말소 1
 
< 0.1%

Interactions

2023-12-12T21:40:25.029280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:40:24.782440image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:40:25.164298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:40:24.899397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:40:29.950864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인허가일자설비용량(KW)상세영업상태명
인허가일자1.0000.1700.442
설비용량(KW)0.1701.0000.000
상세영업상태명0.4420.0001.000
2023-12-12T21:40:30.062564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인허가일자설비용량(KW)상세영업상태명
인허가일자1.0000.0400.188
설비용량(KW)0.0401.0000.000
상세영업상태명0.1880.0001.000

Missing values

2023-12-12T21:40:25.546156image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:40:25.684132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

인허가일자법인(상호)명설치장소(지번)설치장소(도로명)원동력의 종류설비용량(KW)사업개시일상세영업상태명
020150226중부철망태양광발전소충청북도 보은군 보은읍 용암리 288번지 4호태양광97.22016-07-01사업개시
120150226유한회사청주쏠라팜태양광발전소충청북도 보은군 마로면 수문리 산 341-1 산 34-4, 341-6태양광99.22016-01-04사업개시
220150226유한회사남동태양광발전소충청북도 보은군 마로면 수문리 341번지 2호 (341-1,산34-1 포함)태양광98.562015-09-01사업개시
320150226유한회사엘케이에너지태양광발전소충청북도 보은군 마로면 수문리 산 34번지 1호태양광99.22016-01-14사업개시
420150223대성1호태양광발전소충청북도 보은군 마로면 수문리 155번지 (156번지 포함)태양광98.562015-03-20사업개시
520150223세훈태양광발전소충청북도 보은군 탄부면 벽지리 448번지 1호태양광99.322017-06-12사업개시
620150223단비태양광발전소충청북도 보은군 탄부면 벽지리 448번지 5호태양광99.322017-06-12사업개시
720150223suntech태양광발전소충청북도 보은군 탄부면 벽지리 448번지 6호태양광99.322017-06-12사업개시
820150206세중태양광발전소충청북도 보은군 마로면 세중리 171번지 3호태양광99.02015-08-21사업개시
920150206현계태양광발전소충청북도 보은군 보은읍 강신리 98번지 1호태양광99.02015-08-21사업개시
인허가일자법인(상호)명설치장소(지번)설치장소(도로명)원동력의 종류설비용량(KW)사업개시일상세영업상태명
754720120601(주)구미오창2태양광발전소충청북도 청주시 흥덕구 옥산면 남촌리 1114-1충청북도 청주시 흥덕구 옥산면 과학산업3로 29태양광2988.62012-06-29사업개시
754820180716그린1태양광발전소충청북도 청주시 청원구 북이면 대율리 142-2 ,143-1,144-2(토지위)태양광1396.722019-10-30사업개시
754920161013(주)에스엠이2호 태양광발전소충청북도 영동군 영동읍 산이리 산 18번지 11호태양광1997.282018-10-16사업개시
755020161004지티씨솔라7호 태양광발전소충청북도 단양군 적성면 하리 산 35번지태양광1302.02019-08-16사업개시
755120160930(유)네모1태양광발전소(용곡1태양광발전소)충청북도 청주시 상당구 미원면 용곡리 산 17번지태양광2016.02019-03-13사업개시
755220160923솔라원(주) 길탕2호 태양광발전소충청북도 보은군 산외면 길탕리 산 45-1태양광2016.02021-02-01사업개시
755320160923길탕1호 태양광발전소충청북도 보은군 산외면 길탕리 산 45-4태양광2016.02021-02-01사업개시
755420160922(유)네모8 태양광발전소(네모12태양광발전소)충청북도 청주시 상당구 미원면 중리 산 24번지 2호태양광1824.842019-05-14사업개시
755520140513우림에너지산업주식회사충청북도 제천시 신월동 579번지충청북도 제천시 세명로 65 (신월동)태양광1470.62014-07-01사업개시
755620140507보광 태양광발전소(주)충청북도 보은군 수한면 질신리 98번지태양광1200.02017-09-12사업개시

Duplicate rows

Most frequently occurring

인허가일자법인(상호)명설치장소(지번)설치장소(도로명)원동력의 종류설비용량(KW)사업개시일상세영업상태명# duplicates
020201210지현 태양광발전소충청북도 충주시 금가면 도촌리 187-3충청북도 충주시 금가면 섬말1길 8태양광19.02021-02-16사업개시2
120210331주치1호 태양광발전소충청북도 충주시 소태면 주치리 476-1충청북도 충주시 소태면 주치길 427태양광19.82021-04-30사업개시2