Overview

Dataset statistics

Number of variables8
Number of observations801
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory51.8 KiB
Average record size in memory66.2 B

Variable types

Numeric2
Text2
Categorical3
DateTime1

Dataset

Description경상남도 김해시 전기사업자 현황에 대한 자료로 순번, 상호, 설치장소소재지, 법인여부,영업구분,원동력의종류 등에 대한 항목으로 구성되어입니다.
Author경상남도 김해시
URLhttps://www.data.go.kr/data/15033959/fileData.do

Alerts

원동력의종류 is highly imbalanced (96.8%)Imbalance
순번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 01:23:47.906170
Analysis finished2023-12-12 01:23:49.005787
Duration1.1 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct801
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean401
Minimum1
Maximum801
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.2 KiB
2023-12-12T10:23:49.085501image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile41
Q1201
median401
Q3601
95-th percentile761
Maximum801
Range800
Interquartile range (IQR)400

Descriptive statistics

Standard deviation231.37308
Coefficient of variation (CV)0.57699021
Kurtosis-1.2
Mean401
Median Absolute Deviation (MAD)200
Skewness0
Sum321201
Variance53533.5
MonotonicityStrictly increasing
2023-12-12T10:23:49.270248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
539 1
 
0.1%
529 1
 
0.1%
530 1
 
0.1%
531 1
 
0.1%
532 1
 
0.1%
533 1
 
0.1%
534 1
 
0.1%
535 1
 
0.1%
536 1
 
0.1%
Other values (791) 791
98.8%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
801 1
0.1%
800 1
0.1%
799 1
0.1%
798 1
0.1%
797 1
0.1%
796 1
0.1%
795 1
0.1%
794 1
0.1%
793 1
0.1%
792 1
0.1%

상호
Text

Distinct787
Distinct (%)98.3%
Missing0
Missing (%)0.0%
Memory size6.4 KiB
2023-12-12T10:23:49.763205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length27
Mean length12.062422
Min length3

Characters and Unicode

Total characters9662
Distinct characters391
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique773 ?
Unique (%)96.5%

Sample

1st row김해 덕정초등학교 태양광발전소
2nd row김성대 태양광발전소
3rd row김보영 태양광발전소
4th row제영3호 태양광발전소
5th row제영2호 태양광발전소
ValueCountFrequency (%)
태양광발전소 593
38.8%
태양광 30
 
2.0%
발전소 29
 
1.9%
김해산단 21
 
1.4%
레일에너지 7
 
0.5%
jh 4
 
0.3%
태양광발전소1호 3
 
0.2%
주식회사 3
 
0.2%
덕산 3
 
0.2%
성진 3
 
0.2%
Other values (809) 832
54.5%
2023-12-12T10:23:50.411520image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
787
 
8.1%
778
 
8.1%
776
 
8.0%
772
 
8.0%
768
 
7.9%
754
 
7.8%
728
 
7.5%
189
 
2.0%
134
 
1.4%
1 124
 
1.3%
Other values (381) 3852
39.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8018
83.0%
Space Separator 728
 
7.5%
Decimal Number 399
 
4.1%
Uppercase Letter 123
 
1.3%
Open Punctuation 109
 
1.1%
Close Punctuation 108
 
1.1%
Lowercase Letter 97
 
1.0%
Dash Punctuation 74
 
0.8%
Other Symbol 3
 
< 0.1%
Other Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
787
 
9.8%
778
 
9.7%
776
 
9.7%
772
 
9.6%
768
 
9.6%
754
 
9.4%
189
 
2.4%
134
 
1.7%
113
 
1.4%
90
 
1.1%
Other values (331) 2857
35.6%
Uppercase Letter
ValueCountFrequency (%)
S 22
17.9%
H 15
12.2%
F 15
12.2%
J 8
 
6.5%
D 8
 
6.5%
C 7
 
5.7%
G 5
 
4.1%
B 5
 
4.1%
K 5
 
4.1%
M 5
 
4.1%
Other values (12) 28
22.8%
Decimal Number
ValueCountFrequency (%)
1 124
31.1%
2 120
30.1%
3 41
 
10.3%
6 28
 
7.0%
5 27
 
6.8%
4 23
 
5.8%
0 13
 
3.3%
7 10
 
2.5%
8 8
 
2.0%
9 5
 
1.3%
Lowercase Letter
ValueCountFrequency (%)
o 19
19.6%
k 16
16.5%
e 15
15.5%
p 15
15.5%
c 15
15.5%
s 4
 
4.1%
l 4
 
4.1%
a 4
 
4.1%
r 4
 
4.1%
w 1
 
1.0%
Other Punctuation
ValueCountFrequency (%)
& 1
50.0%
. 1
50.0%
Space Separator
ValueCountFrequency (%)
728
100.0%
Open Punctuation
ValueCountFrequency (%)
( 109
100.0%
Close Punctuation
ValueCountFrequency (%)
) 108
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 74
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8021
83.0%
Common 1420
 
14.7%
Latin 221
 
2.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
787
 
9.8%
778
 
9.7%
776
 
9.7%
772
 
9.6%
768
 
9.6%
754
 
9.4%
189
 
2.4%
134
 
1.7%
113
 
1.4%
90
 
1.1%
Other values (332) 2860
35.7%
Latin
ValueCountFrequency (%)
S 22
 
10.0%
o 19
 
8.6%
k 16
 
7.2%
H 15
 
6.8%
F 15
 
6.8%
e 15
 
6.8%
p 15
 
6.8%
c 15
 
6.8%
J 8
 
3.6%
D 8
 
3.6%
Other values (23) 73
33.0%
Common
ValueCountFrequency (%)
728
51.3%
1 124
 
8.7%
2 120
 
8.5%
( 109
 
7.7%
) 108
 
7.6%
- 74
 
5.2%
3 41
 
2.9%
6 28
 
2.0%
5 27
 
1.9%
4 23
 
1.6%
Other values (6) 38
 
2.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8018
83.0%
ASCII 1640
 
17.0%
None 3
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
787
 
9.8%
778
 
9.7%
776
 
9.7%
772
 
9.6%
768
 
9.6%
754
 
9.4%
189
 
2.4%
134
 
1.7%
113
 
1.4%
90
 
1.1%
Other values (331) 2857
35.6%
ASCII
ValueCountFrequency (%)
728
44.4%
1 124
 
7.6%
2 120
 
7.3%
( 109
 
6.6%
) 108
 
6.6%
- 74
 
4.5%
3 41
 
2.5%
6 28
 
1.7%
5 27
 
1.6%
4 23
 
1.4%
Other values (38) 258
 
15.7%
None
ValueCountFrequency (%)
3
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%
Distinct637
Distinct (%)79.5%
Missing0
Missing (%)0.0%
Memory size6.4 KiB
2023-12-12T10:23:50.838513image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length56
Median length37
Mean length24.97628
Min length19

Characters and Unicode

Total characters20006
Distinct characters167
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique544 ?
Unique (%)67.9%

Sample

1st row경상남도 김해시 대청로 31, 덕정초등학교 (관동동)
2nd row경상남도 김해시 진례면 고모로526번길 75
3rd row경상남도 김해시 진례면 서부로411번길 5
4th row경상남도 김해시 상동면 상동로197번길 2-24
5th row경상남도 김해시 상동면 상동로197번길 2-24
ValueCountFrequency (%)
경상남도 801
19.7%
김해시 801
19.7%
한림면 179
 
4.4%
진례면 152
 
3.7%
주촌면 113
 
2.8%
진영읍 107
 
2.6%
생림면 59
 
1.5%
상동면 56
 
1.4%
본산로269번길 24
 
0.6%
서부로 22
 
0.5%
Other values (842) 1748
43.0%
2023-12-12T10:23:51.337929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3261
 
16.3%
891
 
4.5%
888
 
4.4%
887
 
4.4%
1 853
 
4.3%
805
 
4.0%
803
 
4.0%
802
 
4.0%
802
 
4.0%
742
 
3.7%
Other values (157) 9272
46.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 11892
59.4%
Decimal Number 4149
 
20.7%
Space Separator 3261
 
16.3%
Dash Punctuation 409
 
2.0%
Open Punctuation 122
 
0.6%
Close Punctuation 122
 
0.6%
Other Punctuation 49
 
0.2%
Uppercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
891
 
7.5%
888
 
7.5%
887
 
7.5%
805
 
6.8%
803
 
6.8%
802
 
6.7%
802
 
6.7%
742
 
6.2%
572
 
4.8%
450
 
3.8%
Other values (139) 4250
35.7%
Decimal Number
ValueCountFrequency (%)
1 853
20.6%
2 609
14.7%
3 478
11.5%
4 373
9.0%
6 346
8.3%
7 318
 
7.7%
9 318
 
7.7%
5 316
 
7.6%
0 269
 
6.5%
8 269
 
6.5%
Other Punctuation
ValueCountFrequency (%)
, 48
98.0%
. 1
 
2.0%
Uppercase Letter
ValueCountFrequency (%)
G 1
50.0%
M 1
50.0%
Space Separator
ValueCountFrequency (%)
3261
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 409
100.0%
Open Punctuation
ValueCountFrequency (%)
( 122
100.0%
Close Punctuation
ValueCountFrequency (%)
) 122
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 11892
59.4%
Common 8112
40.5%
Latin 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
891
 
7.5%
888
 
7.5%
887
 
7.5%
805
 
6.8%
803
 
6.8%
802
 
6.7%
802
 
6.7%
742
 
6.2%
572
 
4.8%
450
 
3.8%
Other values (139) 4250
35.7%
Common
ValueCountFrequency (%)
3261
40.2%
1 853
 
10.5%
2 609
 
7.5%
3 478
 
5.9%
- 409
 
5.0%
4 373
 
4.6%
6 346
 
4.3%
7 318
 
3.9%
9 318
 
3.9%
5 316
 
3.9%
Other values (6) 831
 
10.2%
Latin
ValueCountFrequency (%)
G 1
50.0%
M 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 11892
59.4%
ASCII 8114
40.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3261
40.2%
1 853
 
10.5%
2 609
 
7.5%
3 478
 
5.9%
- 409
 
5.0%
4 373
 
4.6%
6 346
 
4.3%
7 318
 
3.9%
9 318
 
3.9%
5 316
 
3.9%
Other values (8) 833
 
10.3%
Hangul
ValueCountFrequency (%)
891
 
7.5%
888
 
7.5%
887
 
7.5%
805
 
6.8%
803
 
6.8%
802
 
6.7%
802
 
6.7%
742
 
6.2%
572
 
4.8%
450
 
3.8%
Other values (139) 4250
35.7%

법인여부
Categorical

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size6.4 KiB
개인
450 
법인
351 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row법인
2nd row개인
3rd row개인
4th row개인
5th row개인

Common Values

ValueCountFrequency (%)
개인 450
56.2%
법인 351
43.8%

Length

2023-12-12T10:23:51.509248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:23:51.622760image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
개인 450
56.2%
법인 351
43.8%

영업구분
Categorical

Distinct5
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size6.4 KiB
사업개시
470 
인허가
168 
인허가취소
105 
공사진행
48 
폐업
 
10

Length

Max length5
Median length4
Mean length3.8963795
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row인허가
2nd row인허가
3rd row인허가
4th row인허가
5th row인허가

Common Values

ValueCountFrequency (%)
사업개시 470
58.7%
인허가 168
 
21.0%
인허가취소 105
 
13.1%
공사진행 48
 
6.0%
폐업 10
 
1.2%

Length

2023-12-12T10:23:51.767029image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:23:51.913077image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사업개시 470
58.7%
인허가 168
 
21.0%
인허가취소 105
 
13.1%
공사진행 48
 
6.0%
폐업 10
 
1.2%

원동력의종류
Categorical

IMBALANCE 

Distinct4
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size6.4 KiB
태양광
796 
바이오가스
 
3
지열
 
1
가스엔진발전기
 
1

Length

Max length7
Median length3
Mean length3.011236
Min length2

Unique

Unique2 ?
Unique (%)0.2%

Sample

1st row태양광
2nd row태양광
3rd row태양광
4th row태양광
5th row태양광

Common Values

ValueCountFrequency (%)
태양광 796
99.4%
바이오가스 3
 
0.4%
지열 1
 
0.1%
가스엔진발전기 1
 
0.1%

Length

2023-12-12T10:23:52.082493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:23:52.225330image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
태양광 796
99.4%
바이오가스 3
 
0.4%
지열 1
 
0.1%
가스엔진발전기 1
 
0.1%
Distinct527
Distinct (%)65.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean157.30026
Minimum4
Maximum1000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.2 KiB
2023-12-12T10:23:52.363443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum4
5-th percentile19.2
Q152.29
median99.23
Q3177.3
95-th percentile534.1
Maximum1000
Range996
Interquartile range (IQR)125.01

Descriptive statistics

Standard deviation181.31277
Coefficient of variation (CV)1.152654
Kurtosis7.3533779
Mean157.30026
Median Absolute Deviation (MAD)52.82
Skewness2.6184959
Sum125997.51
Variance32874.321
MonotonicityNot monotonic
2023-12-12T10:23:52.531806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
99.0 18
 
2.2%
99.68 17
 
2.1%
99.45 13
 
1.6%
99.36 12
 
1.5%
99.84 10
 
1.2%
99.96 9
 
1.1%
29.58 9
 
1.1%
99.19 8
 
1.0%
97.2 8
 
1.0%
99.9 8
 
1.0%
Other values (517) 689
86.0%
ValueCountFrequency (%)
4.0 1
 
0.1%
5.0 1
 
0.1%
6.6 1
 
0.1%
9.0 1
 
0.1%
9.16 1
 
0.1%
9.36 1
 
0.1%
10.0 3
0.4%
10.36 2
0.2%
10.5 1
 
0.1%
11.0 2
0.2%
ValueCountFrequency (%)
1000.0 1
0.1%
999.46 1
0.1%
999.0 1
0.1%
998.4 2
0.2%
997.92 1
0.1%
996.8 1
0.1%
990.0 1
0.1%
985.53 1
0.1%
903.0 1
0.1%
898.56 1
0.1%
Distinct309
Distinct (%)38.6%
Missing0
Missing (%)0.0%
Memory size6.4 KiB
Minimum2006-09-05 00:00:00
Maximum2022-10-21 00:00:00
2023-12-12T10:23:52.730007image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:23:52.920098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-12T10:23:48.598053image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:23:48.387414image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:23:48.700032image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:23:48.500602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T10:23:53.054375image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번법인여부영업구분원동력의종류설비용량(키로와트)
순번1.0000.2820.7570.0740.214
법인여부0.2821.0000.1430.0000.428
영업구분0.7570.1431.0000.0000.065
원동력의종류0.0740.0000.0001.0000.262
설비용량(키로와트)0.2140.4280.0650.2621.000
2023-12-12T10:23:53.183685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
법인여부원동력의종류영업구분
법인여부1.0000.0000.174
원동력의종류0.0001.0000.000
영업구분0.1740.0001.000
2023-12-12T10:23:53.293088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번설비용량(키로와트)법인여부영업구분원동력의종류
순번1.000-0.1230.2130.4130.044
설비용량(키로와트)-0.1231.0000.3270.0260.159
법인여부0.2130.3271.0000.1740.000
영업구분0.4130.0260.1741.0000.000
원동력의종류0.0440.1590.0000.0001.000

Missing values

2023-12-12T10:23:48.824718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T10:23:48.953344image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번상호설치장소소재지법인여부영업구분원동력의종류설비용량(키로와트)허가일자
01김해 덕정초등학교 태양광발전소경상남도 김해시 대청로 31, 덕정초등학교 (관동동)법인인허가태양광175.912022-10-21
12김성대 태양광발전소경상남도 김해시 진례면 고모로526번길 75개인인허가태양광81.422022-10-21
23김보영 태양광발전소경상남도 김해시 진례면 서부로411번길 5개인인허가태양광99.762022-10-21
34제영3호 태양광발전소경상남도 김해시 상동면 상동로197번길 2-24개인인허가태양광99.962022-10-21
45제영2호 태양광발전소경상남도 김해시 상동면 상동로197번길 2-24개인인허가태양광99.962022-10-21
56제영4호 태양광발전소경상남도 김해시 상동면 상동로197번길 2-24개인인허가태양광99.962022-10-21
67제영1호 태양광발전소경상남도 김해시 상동면 상동로 338-21개인인허가태양광99.962022-10-21
78주촌스크린골프1호 태양광발전소경상남도 김해시 주촌면 선천로 85개인인허가태양광160.482022-10-21
89주촌스크린골프2호 태양광발전소경상남도 김해시 주촌면 선천로 85개인인허가태양광82.62022-10-21
910DS1호 태양광발전소경상남도 김해시 한림면 김해대로1538번길 46개인인허가태양광91.042022-10-07
순번상호설치장소소재지법인여부영업구분원동력의종류설비용량(키로와트)허가일자
791792성수태양광발전소경상남도 김해시 진례면 고모로442번길 11-11개인사업개시태양광30.02009-07-20
792793미소태양광발전소경상남도 김해시 진례면 고모로324번안길 45개인사업개시태양광21.02009-07-07
793794(주)제이에스디경상남도 김해시 장유로 77-24 (부곡동)법인사업개시태양광28.692009-03-05
794795(주)진례에너지경상남도 김해시 진례면 고모로560번길 32개인사업개시태양광200.02007-11-28
795796GS칼텍스(주) (GS칼텍스 그린주유소태양광발전소)경상남도 김해시 김해대로 2602, GM철강 (안동)개인인허가취소태양광26.02008-09-11
796797신용태양광발전소경상남도 김해시 진영읍 신용리 산 49 산51-2. 4개인인허가취소태양광1000.02007-07-25
797798여래태양광발전소경상남도 김해시 진영읍 여래리 172 , 172-8개인인허가취소태양광200.02007-01-31
798799내룡태양광발전소경상남도 김해시 진영읍 내룡리 391 , 398, 399, 400, 401, 403, 535, 536개인인허가취소태양광500.02007-01-24
799800창조1호 태양광발전소경상남도 김해시 가락로326번길 10(구산동)개인인허가취소태양광5.02006-10-18
800801(주)조은이엔씨경상남도 김해시 한림면 장재로 190-33개인인허가취소태양광4.02006-09-05