Overview

Dataset statistics

Number of variables3
Number of observations233
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.8 KiB
Average record size in memory25.5 B

Variable types

Numeric1
Categorical1
Text1

Dataset

Description한전KPS(주) 국내에 위치한 발전정비 현황에 관한 데이터입니다. (발주자, 호기명 등의 정보를 포함합니다.)
Author한전케이피에스주식회사
URLhttps://www.data.go.kr/data/15088221/fileData.do

Alerts

번호 is highly overall correlated with 발주자High correlation
발주자 is highly overall correlated with 번호High correlation
번호 has unique valuesUnique
호기명 has unique valuesUnique

Reproduction

Analysis started2024-03-14 17:48:39.532859
Analysis finished2024-03-14 17:48:40.377307
Duration0.84 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct233
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean117
Minimum1
Maximum233
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.2 KiB
2024-03-15T02:48:40.683275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile12.6
Q159
median117
Q3175
95-th percentile221.4
Maximum233
Range232
Interquartile range (IQR)116

Descriptive statistics

Standard deviation67.405489
Coefficient of variation (CV)0.57611529
Kurtosis-1.2
Mean117
Median Absolute Deviation (MAD)58
Skewness0
Sum27261
Variance4543.5
MonotonicityStrictly increasing
2024-03-15T02:48:41.126850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.4%
161 1
 
0.4%
149 1
 
0.4%
150 1
 
0.4%
151 1
 
0.4%
152 1
 
0.4%
153 1
 
0.4%
154 1
 
0.4%
155 1
 
0.4%
156 1
 
0.4%
Other values (223) 223
95.7%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
233 1
0.4%
232 1
0.4%
231 1
0.4%
230 1
0.4%
229 1
0.4%
228 1
0.4%
227 1
0.4%
226 1
0.4%
225 1
0.4%
224 1
0.4%

발주자
Categorical

HIGH CORRELATION 

Distinct16
Distinct (%)6.9%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
한국수력원자력
43 
남부발전
40 
중부발전
34 
서부발전
25 
동서발전
22 
Other values (11)
69 

Length

Max length16
Median length4
Mean length5.944206
Min length4

Unique

Unique2 ?
Unique (%)0.9%

Sample

1st row남동발전
2nd row남동발전
3rd row남동발전
4th row남동발전
5th row남동발전

Common Values

ValueCountFrequency (%)
한국수력원자력 43
18.5%
남부발전 40
17.2%
중부발전 34
14.6%
서부발전 25
10.7%
동서발전 22
9.4%
한국지역난방공사(주) 21
9.0%
남동발전 18
7.7%
GS파워주식회사 8
 
3.4%
동두천드림파워(주) 6
 
2.6%
한국지역난방공사㈜ 4
 
1.7%
Other values (6) 12
 
5.2%

Length

2024-03-15T02:48:41.422478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
한국수력원자력 43
18.5%
남부발전 40
17.2%
중부발전 34
14.6%
서부발전 25
10.7%
동서발전 22
9.4%
한국지역난방공사(주 21
9.0%
남동발전 18
7.7%
gs파워주식회사 8
 
3.4%
동두천드림파워(주 6
 
2.6%
한국지역난방공사㈜ 4
 
1.7%
Other values (6) 12
 
5.2%

호기명
Text

UNIQUE 

Distinct233
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2024-03-15T02:48:42.405416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length9
Mean length7.416309
Min length2

Characters and Unicode

Total characters1728
Distinct characters97
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique233 ?
Unique (%)100.0%

Sample

1st row삼천포 #1
2nd row삼천포 #2
3rd row삼천포 #3
4th row삼천포 #4
5th row삼천포 #5
ValueCountFrequency (%)
1 66
 
11.8%
gt 58
 
10.4%
2 42
 
7.5%
st 27
 
4.8%
3 22
 
3.9%
4 19
 
3.4%
인천 11
 
2.0%
신인천 10
 
1.8%
6 9
 
1.6%
태안 9
 
1.6%
Other values (88) 286
51.2%
2024-03-15T02:48:43.714540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
326
18.9%
# 198
 
11.5%
T 113
 
6.5%
1 78
 
4.5%
G 76
 
4.4%
2 58
 
3.4%
53
 
3.1%
S 38
 
2.2%
31
 
1.8%
31
 
1.8%
Other values (87) 726
42.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 730
42.2%
Space Separator 326
18.9%
Uppercase Letter 230
 
13.3%
Decimal Number 228
 
13.2%
Other Punctuation 210
 
12.2%
Close Punctuation 2
 
0.1%
Open Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
53
 
7.3%
31
 
4.2%
31
 
4.2%
29
 
4.0%
29
 
4.0%
28
 
3.8%
28
 
3.8%
24
 
3.3%
21
 
2.9%
20
 
2.7%
Other values (67) 436
59.7%
Decimal Number
ValueCountFrequency (%)
1 78
34.2%
2 58
25.4%
3 28
 
12.3%
4 25
 
11.0%
6 12
 
5.3%
5 11
 
4.8%
7 6
 
2.6%
8 6
 
2.6%
0 2
 
0.9%
9 2
 
0.9%
Uppercase Letter
ValueCountFrequency (%)
T 113
49.1%
G 76
33.0%
S 38
 
16.5%
C 2
 
0.9%
I 1
 
0.4%
Other Punctuation
ValueCountFrequency (%)
# 198
94.3%
/ 12
 
5.7%
Space Separator
ValueCountFrequency (%)
326
100.0%
Close Punctuation
ValueCountFrequency (%)
] 2
100.0%
Open Punctuation
ValueCountFrequency (%)
[ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 768
44.4%
Hangul 730
42.2%
Latin 230
 
13.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
53
 
7.3%
31
 
4.2%
31
 
4.2%
29
 
4.0%
29
 
4.0%
28
 
3.8%
28
 
3.8%
24
 
3.3%
21
 
2.9%
20
 
2.7%
Other values (67) 436
59.7%
Common
ValueCountFrequency (%)
326
42.4%
# 198
25.8%
1 78
 
10.2%
2 58
 
7.6%
3 28
 
3.6%
4 25
 
3.3%
6 12
 
1.6%
/ 12
 
1.6%
5 11
 
1.4%
7 6
 
0.8%
Other values (5) 14
 
1.8%
Latin
ValueCountFrequency (%)
T 113
49.1%
G 76
33.0%
S 38
 
16.5%
C 2
 
0.9%
I 1
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 998
57.8%
Hangul 730
42.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
326
32.7%
# 198
19.8%
T 113
 
11.3%
1 78
 
7.8%
G 76
 
7.6%
2 58
 
5.8%
S 38
 
3.8%
3 28
 
2.8%
4 25
 
2.5%
6 12
 
1.2%
Other values (10) 46
 
4.6%
Hangul
ValueCountFrequency (%)
53
 
7.3%
31
 
4.2%
31
 
4.2%
29
 
4.0%
29
 
4.0%
28
 
3.8%
28
 
3.8%
24
 
3.3%
21
 
2.9%
20
 
2.7%
Other values (67) 436
59.7%

Interactions

2024-03-15T02:48:39.698961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T02:48:43.971160image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호발주자
번호1.0000.927
발주자0.9271.000
2024-03-15T02:48:44.201555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호발주자
번호1.0000.704
발주자0.7041.000

Missing values

2024-03-15T02:48:40.005571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T02:48:40.252971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호발주자호기명
01남동발전삼천포 #1
12남동발전삼천포 #2
23남동발전삼천포 #3
34남동발전삼천포 #4
45남동발전삼천포 #5
56남동발전삼천포 #6
67고성그린파워(SPC)-남동발전고성하이 #1
78고성그린파워(SPC)-남동발전고성하이 #2
89남동발전영흥 #1
910남동발전영흥 #2
번호발주자호기명
223224한국지역난방공사(주)광교 ST #1
224225한국수력원자력신한울 1호기
225226중부발전세종열병합 GT #1
226227강릉에코파워(SPC)-남동발전강릉안인 #1
227228강릉에코파워(SPC)-남동발전강릉안인 #2
228229한국지역난방공사(주)동탄열병합 GT #1
229230한국지역난방공사(주)동탄열병합 GT #2
230231한국지역난방공사(주)동탄열병합 ST #1
231232한국지역난방공사(주)동탄열병합 ST #2
232233(주)거금솔라파크미정