Overview

Dataset statistics

Number of variables: 6
Number of observations: 24
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.3 KiB
Average record size in memory: 55.5 B

Variable types

Numeric: 1
Categorical: 4
Text: 1

Dataset

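Description: Transmission and substation direct-trade customer management (송변전 직거래 고객관리)
Author: Korea Electric Power Corporation (한국전력공사)
URL: https://www.data.go.kr/data/15053167/fileData.do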

Alerts

순번 is highly overall correlated with 설비 and 1 other field [High correlation]
설비 is highly overall correlated with 순번 and 1 other field [High correlation]
비용 is highly overall correlated with 주기 [High correlation]
주기 is highly overall correlated with 비용 [High correlation]
신청업체 is highly overall correlated with 순번 and 1 other field [High correlation]
순번 has unique values [Unique]

Reproduction

Analysis started: 2023-12-13 00:50:17.801609
Analysis finished: 2023-12-13 00:50:18.216177
Duration: 0.41 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
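
A report like this one could be regenerated roughly as sketched below. The CSV file name, the `cp949` encoding, and the report title are assumptions (none of them appears in this report); the dataset metadata is copied from the Dataset section above. The third-party imports sit inside the function so the snippet can be read and loaded without pandas or ydata-profiling installed.

```python
# Sketch only: file name, encoding, and title are assumptions, not taken from the report.
REPORT_KWARGS = {
    "title": "송변전 직거래 고객관리",  # assumed title
    "dataset": {
        "description": "송변전 직거래 고객관리",
        "author": "한국전력공사",
        "url": "https://www.data.go.kr/data/15053167/fileData.do",
    },
}


def build_report(csv_path, encoding="cp949"):
    """Load the CSV and return a ProfileReport (needs pandas and ydata-profiling)."""
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path, encoding=encoding)
    return ProfileReport(df, **REPORT_KWARGS)


# Usage with a hypothetical file name:
# build_report("direct_trade_customers.csv").to_file("report.html")
```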

Variables

순번 (sequence number)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 24
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 12.5
Minimum: 1
Maximum: 24
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 348.0 B

Quantile statistics

Minimum: 1
5th percentile: 2.15
Q1: 6.75
Median: 12.5
Q3: 18.25
95th percentile: 22.85
Maximum: 24
Range: 23
Interquartile range (IQR): 11.5
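
The quantile figures above are consistent with linear interpolation between order statistics over the values 1 to 24. The sketch below assumes that interpolation rule (the report does not state it); `percentile` is an illustrative helper, not part of any library.

```python
def percentile(values, q):
    """Return the q-quantile (0 <= q <= 1) using linear interpolation."""
    xs = sorted(values)
    pos = (len(xs) - 1) * q          # fractional position in the sorted data
    lower = int(pos)
    upper = min(lower + 1, len(xs) - 1)
    return xs[lower] + (xs[upper] - xs[lower]) * (pos - lower)


seq = list(range(1, 25))             # 순번 takes each value 1..24 exactly once
q1, median, q3 = (percentile(seq, q) for q in (0.25, 0.5, 0.75))
p5, p95 = percentile(seq, 0.05), percentile(seq, 0.95)
iqr = q3 - q1
value_range = max(seq) - min(seq)

print(round(p5, 2), q1, median, q3, round(p95, 2), iqr, value_range)
```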

Descriptive statistics

Standard deviation: 7.0710678
Coefficient of variation (CV): 0.56568542
Kurtosis: -1.2
Mean: 12.5
Median Absolute Deviation (MAD): 6
Skewness: 0
Sum: 300
Variance: 50
Monotonicity: Strictly increasing
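
These descriptive statistics can be checked by hand with the standard library. The sketch assumes the variance and standard deviation are sample estimates (dividing by n - 1) and that the kurtosis is the bias-corrected sample excess kurtosis; both assumptions match the printed figures, but the report itself does not name the estimators.

```python
import statistics

seq = list(range(1, 25))                 # 순번 takes each value 1..24 exactly once
n = len(seq)

mean = statistics.mean(seq)
variance = statistics.variance(seq)      # sample variance, divides by n - 1
std = statistics.stdev(seq)
cv = std / mean                          # coefficient of variation
med = statistics.median(seq)
mad = statistics.median([abs(x - med) for x in seq])

# Central moments (dividing by n), used for skewness and kurtosis.
m2 = sum((x - mean) ** 2 for x in seq) / n
m3 = sum((x - mean) ** 3 for x in seq) / n
m4 = sum((x - mean) ** 4 for x in seq) / n
skewness = m3 / m2 ** 1.5                # zero because the data are symmetric
g2 = m4 / m2 ** 2 - 3                    # population excess kurtosis
# Assumed estimator: bias-corrected sample excess kurtosis.
kurtosis = (n - 1) / ((n - 2) * (n - 3)) * ((n + 1) * g2 + 6)

print(mean, variance, round(std, 7), round(cv, 8), mad, skewness, round(kurtosis, 6))
```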
[Figure: Histogram with fixed size bins (bins=24)]

Common values

Value | Count | Frequency (%)
1 | 1 | 4.2%
14 | 1 | 4.2%
24 | 1 | 4.2%
23 | 1 | 4.2%
22 | 1 | 4.2%
21 | 1 | 4.2%
20 | 1 | 4.2%
19 | 1 | 4.2%
18 | 1 | 4.2%
17 | 1 | 4.2%
Other values (14) | 14 | 58.3%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 4.2%
2 | 1 | 4.2%
3 | 1 | 4.2%
4 | 1 | 4.2%
5 | 1 | 4.2%
6 | 1 | 4.2%
7 | 1 | 4.2%
8 | 1 | 4.2%
9 | 1 | 4.2%
10 | 1 | 4.2%

Maximum 10 values

Value | Count | Frequency (%)
24 | 1 | 4.2%
23 | 1 | 4.2%
22 | 1 | 4.2%
21 | 1 | 4.2%
20 | 1 | 4.2%
19 | 1 | 4.2%
18 | 1 | 4.2%
17 | 1 | 4.2%
16 | 1 | 4.2%
15 | 1 | 4.2%

설비 (equipment)
Categorical

HIGH CORRELATION 

Distinct: 6
Distinct (%): 25.0%
Missing: 0
Missing (%): 0.0%
Memory size: 324.0 B

Length

Max length: 6
Median length: 4
Mean length: 4.5416667
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 가공송전선로
2nd row: 가공송전선로
3rd row: 가공송전선로
4th row: 가공송전선로
5th row: 가공송전선로

Common Values

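Value | Count | Frequency (%)
가공송전선로 (overhead transmission line) | 6 | 25.0%
지중송전선로 (underground transmission line) | 5 | 20.8%
개폐장치 (switchgear) | 5 | 20.8%
변압기 (transformer) | 3 | 12.5%
공통 (common) | 3 | 12.5%
보호설비 (protection equipment) | 2 | 8.3%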
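
The frequency column is each count divided by the 24 observations, and Distinct (%) is the number of categories over the same total. A minimal check, using the counts from the Common Values table for 설비:

```python
# Counts copied from the Common Values table for 설비.
counts = {
    "가공송전선로": 6,
    "지중송전선로": 5,
    "개폐장치": 5,
    "변압기": 3,
    "공통": 3,
    "보호설비": 2,
}

total = sum(counts.values())                      # number of observations
freq_pct = {k: round(100 * v / total, 1) for k, v in counts.items()}
distinct_pct = round(100 * len(counts) / total, 1)

for value, pct in freq_pct.items():
    print(f"{value} | {counts[value]} | {pct}%")
print("Distinct (%):", distinct_pct)
```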

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

점검항목 (inspection item)
Text

Distinct: 21
Distinct (%): 87.5%
Missing: 0
Missing (%): 0.0%
Memory size: 324.0 B

Length

Max length: 16
Median length: 11.5
Mean length: 7.3333333
Min length: 4

Characters and Unicode

Total characters: 176
Distinct characters: 81
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
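
As a rough illustration of that kind of analysis, the sketch below counts Unicode general categories with Python's `unicodedata` module. It uses only the five 점검항목 values from the Sample section at the end of the report that contain Latin characters, so the Hangul (`Lo`) and space (`Zs`) totals are smaller than the report's; the choice of rows is an assumption made for brevity.

```python
import unicodedata
from collections import Counter

# Five 점검항목 values taken from the Sample section (the ones containing Latin text).
values = [
    "PD 측정",
    "GIS 부분방전 측정",
    "X-Ray 측정",
    "Gas중 수분 및 SO2 측정",
    "Gas 누기 측정",
]

# General category per character: Lo = Other Letter, Lu = Uppercase Letter,
# Ll = Lowercase Letter, Zs = Space Separator, Nd = Decimal Number, Pd = Dash Punctuation.
categories = Counter(unicodedata.category(ch) for text in values for ch in text)

for category, count in categories.most_common():
    print(category, count)
```

On these rows the uppercase, lowercase, digit, and dash counts agree with the category table (11, 6, 1, and 1), which suggests all Latin characters in the column come from these five values.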

Unique

Unique: 18
Unique (%): 75.0%

Sample

1st row: 보통순시
2nd row: 전선접속개소 편심측정
3rd row: 불량애자 검출
4th row: 기별점검
5th row: 고장순시

Value | Count | Frequency (%)
측정 | 6 | 12.8%
보통순시 | 2 | 4.3%
점검 | 2 | 4.3%
정밀점검 | 2 | 4.3%
(token not recovered) | 2 | 4.3%
확인점검 | 2 | 4.3%
고장순시 | 2 | 4.3%
gis | 1 | 2.1%
제어회로 | 1 | 2.1%
누설전류 | 1 | 2.1%
Other values (26) | 26 | 55.3%
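
The word table looks like a count of lowercased, whitespace-separated tokens (note `gis` rather than `GIS`). The sketch below applies that assumed tokenization to the 20 점검항목 values visible in the Sample section; four of the 24 rows are not shown in this report, so some totals fall short of the table.

```python
from collections import Counter

# The 20 점검항목 values visible in the Sample section (rows 11-14 are not shown).
rows = [
    "보통순시", "전선접속개소 편심측정", "불량애자 검출", "기별점검", "고장순시",
    "정밀점검", "보통순시", "맨홀점검", "케이블 및 접속함 점검", "PD 측정",
    "GIS 부분방전 측정", "X-Ray 측정", "Gas중 수분 및 SO2 측정", "Gas 누기 측정",
    "조작기구부 확인점검", "접속개소 과열측정", "피뢰기 누설전류 측정",
    "제어회로 점검", "일반점검", "정밀점검",
]

# Assumed tokenization: lowercase, then split on whitespace.
words = Counter(token for text in rows for token in text.lower().split())

for token, count in words.most_common(5):
    print(token, count)
```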

Most occurring characters

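Value | Count | Frequency (%)
(space) | 23 | 13.1%
(not recovered) | 10 | 5.7%
(not recovered) | 10 | 5.7%
(not recovered) | 9 | 5.1%
(not recovered) | 8 | 4.5%
(not recovered) | 6 | 3.4%
(not recovered) | 5 | 2.8%
(not recovered) | 4 | 2.3%
(not recovered) | 3 | 1.7%
G | 3 | 1.7%
Other values (71) | 95 | 54.0%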

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 134 | 76.1%
Space Separator | 23 | 13.1%
Uppercase Letter | 11 | 6.2%
Lowercase Letter | 6 | 3.4%
Decimal Number | 1 | 0.6%
Dash Punctuation | 1 | 0.6%

Most frequent character per category

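Other Letter

Value | Count | Frequency (%)
(not recovered) | 10 | 7.5%
(not recovered) | 10 | 7.5%
(not recovered) | 9 | 6.7%
(not recovered) | 8 | 6.0%
(not recovered) | 6 | 4.5%
(not recovered) | 5 | 3.7%
(not recovered) | 4 | 3.0%
(not recovered) | 3 | 2.2%
(not recovered) | 3 | 2.2%
(not recovered) | 3 | 2.2%
Other values (57) | 73 | 54.5%

Uppercase Letter

Value | Count | Frequency (%)
G | 3 | 27.3%
S | 2 | 18.2%
I | 1 | 9.1%
O | 1 | 9.1%
P | 1 | 9.1%
D | 1 | 9.1%
X | 1 | 9.1%
R | 1 | 9.1%

Lowercase Letter

Value | Count | Frequency (%)
a | 3 | 50.0%
s | 2 | 33.3%
y | 1 | 16.7%

Space Separator

Value | Count | Frequency (%)
(space) | 23 | 100.0%

Decimal Number

Value | Count | Frequency (%)
2 | 1 | 100.0%

Dash Punctuation

Value | Count | Frequency (%)
- | 1 | 100.0%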

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 134 | 76.1%
Common | 25 | 14.2%
Latin | 17 | 9.7%

Most frequent character per script

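Hangul

Value | Count | Frequency (%)
(not recovered) | 10 | 7.5%
(not recovered) | 10 | 7.5%
(not recovered) | 9 | 6.7%
(not recovered) | 8 | 6.0%
(not recovered) | 6 | 4.5%
(not recovered) | 5 | 3.7%
(not recovered) | 4 | 3.0%
(not recovered) | 3 | 2.2%
(not recovered) | 3 | 2.2%
(not recovered) | 3 | 2.2%
Other values (57) | 73 | 54.5%

Latin

Value | Count | Frequency (%)
G | 3 | 17.6%
a | 3 | 17.6%
s | 2 | 11.8%
S | 2 | 11.8%
I | 1 | 5.9%
O | 1 | 5.9%
y | 1 | 5.9%
P | 1 | 5.9%
D | 1 | 5.9%
X | 1 | 5.9%

Common

Value | Count | Frequency (%)
(space) | 23 | 92.0%
2 | 1 | 4.0%
- | 1 | 4.0%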

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 134 | 76.1%
ASCII | 42 | 23.9%

Most frequent character per block

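ASCII

Value | Count | Frequency (%)
(space) | 23 | 54.8%
G | 3 | 7.1%
a | 3 | 7.1%
s | 2 | 4.8%
S | 2 | 4.8%
I | 1 | 2.4%
O | 1 | 2.4%
2 | 1 | 2.4%
- | 1 | 2.4%
y | 1 | 2.4%
Other values (4) | 4 | 9.5%

Hangul

Value | Count | Frequency (%)
(not recovered) | 10 | 7.5%
(not recovered) | 10 | 7.5%
(not recovered) | 9 | 6.7%
(not recovered) | 8 | 6.0%
(not recovered) | 6 | 4.5%
(not recovered) | 5 | 3.7%
(not recovered) | 4 | 3.0%
(not recovered) | 3 | 2.2%
(not recovered) | 3 | 2.2%
(not recovered) | 3 | 2.2%
Other values (57) | 73 | 54.5%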

비용 (cost)
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 8.3%
Missing: 0
Missing (%): 0.0%
Memory size: 324.0 B

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 무상
2nd row: 유상
3rd row: 유상
4th row: 무상
5th row: 무상

Common Values

Value | Count | Frequency (%)
무상 (free of charge) | 17 | 70.8%
유상 (paid) | 7 | 29.2%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

주기 (inspection cycle)
Categorical

HIGH CORRELATION 

Distinct: 6
Distinct (%): 25.0%
Missing: 0
Missing (%): 0.0%
Memory size: 324.0 B

Length

Max length: 4
Median length: 4
Mean length: 3.5416667
Min length: 3

Unique

Unique: 1
Unique (%): 4.2%

Sample

1st row: 2회/년
2nd row: 요청시
3rd row: 요청시
4th row: 1회/년
5th row: 발생시

Common Values

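Value | Count | Frequency (%)
2회/년 (twice a year) | 10 | 41.7%
요청시 (on request) | 6 | 25.0%
1회/년 (once a year) | 3 | 12.5%
발생시 (on occurrence) | 2 | 8.3%
필요시 (as needed) | 2 | 8.3%
휴전시 (during a power outage) | 1 | 4.2%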

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

신청업체 (applicant company)
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 8.3%
Missing: 0
Missing (%): 0.0%
Memory size: 324.0 B

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value | Count | Frequency (%)
0 | 13 | 54.2%
1 | 11 | 45.8%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

Interactions

[Figure: interactions plot]

Correlations

[Figure: correlation heatmap]

 | 순번 | 설비 | 점검항목 | 비용 | 주기 | 신청업체
순번 | 1.000 | 0.931 | 0.687 | 0.805 | 0.542 | 0.905
설비 | 0.931 | 1.000 | 0.695 | 0.456 | 0.630 | 0.852
점검항목 | 0.687 | 0.695 | 1.000 | 0.701 | 0.823 | 0.000
비용 | 0.805 | 0.456 | 0.701 | 1.000 | 0.995 | 0.666
주기 | 0.542 | 0.630 | 0.823 | 0.995 | 1.000 | 0.746
신청업체 | 0.905 | 0.852 | 0.000 | 0.666 | 0.746 | 1.000

[Figure: correlation heatmap]

 | 비용 | 주기 | 신청업체 | 설비
비용 | 1.000 | 0.844 | 0.463 | 0.283
주기 | 0.844 | 1.000 | 0.495 | 0.251
신청업체 | 0.463 | 0.495 | 1.000 | 0.592
설비 | 0.283 | 0.251 | 0.592 | 1.000

[Figure: correlation heatmap]

 | 순번 | 설비 | 비용 | 주기 | 신청업체
순번 | 1.000 | 0.722 | 0.500 | 0.249 | 0.586
설비 | 0.722 | 1.000 | 0.283 | 0.251 | 0.592
비용 | 0.500 | 0.283 | 1.000 | 0.844 | 0.463
주기 | 0.249 | 0.251 | 0.844 | 1.000 | 0.495
신청업체 | 0.586 | 0.592 | 0.463 | 0.495 | 1.000
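
Association between two categorical columns is commonly measured with Cramér's V, computed from the chi-squared statistic of their contingency table. The sketch below implements the plain, uncorrected form. Whether the matrices above use this measure, a bias-corrected variant, or something else is an assumption, and the two contingency tables are made-up illustrations, so the output is not expected to reproduce the report's figures.

```python
import math


def cramers_v(table):
    """Plain (uncorrected) Cramér's V for a contingency table given as a list of rows."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]

    # Pearson chi-squared: sum over cells of (observed - expected)^2 / expected.
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            if expected > 0:
                chi2 += (observed - expected) ** 2 / expected

    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k)) if n > 0 and k > 0 else 0.0


# Hypothetical tables: one perfectly associated, one independent.
perfect = cramers_v([[10, 0], [0, 10]])
independent = cramers_v([[5, 5], [5, 5]])
print(perfect, independent)
```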

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

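First rows

 | 순번 | 설비 | 점검항목 | 비용 | 주기 | 신청업체
0 | 1 | 가공송전선로 | 보통순시 | 무상 | 2회/년 | 0
1 | 2 | 가공송전선로 | 전선접속개소 편심측정 | 유상 | 요청시 | 0
2 | 3 | 가공송전선로 | 불량애자 검출 | 유상 | 요청시 | 0
3 | 4 | 가공송전선로 | 기별점검 | 무상 | 1회/년 | 0
4 | 5 | 가공송전선로 | 고장순시 | 무상 | 발생시 | 0
5 | 6 | 가공송전선로 | 정밀점검 | 유상 | 요청시 | 0
6 | 7 | 지중송전선로 | 보통순시 | 무상 | 2회/년 | 0
7 | 8 | 지중송전선로 | 맨홀점검 | 유상 | 요청시 | 0
8 | 9 | 지중송전선로 | 케이블 및 접속함 점검 | 유상 | 요청시 | 0
9 | 10 | 지중송전선로 | PD 측정 | 유상 | 요청시 | 0

Last rows

 | 순번 | 설비 | 점검항목 | 비용 | 주기 | 신청업체
14 | 15 | 개폐장치 | GIS 부분방전 측정 | 무상 | 1회/년 | 1
15 | 16 | 개폐장치 | X-Ray 측정 | 유상 | 필요시 | 0
16 | 17 | 개폐장치 | Gas중 수분 및 SO2 측정 | 무상 | 2회/년 | 1
17 | 18 | 개폐장치 | Gas 누기 측정 | 무상 | 2회/년 | 1
18 | 19 | 개폐장치 | 조작기구부 확인점검 | 무상 | 2회/년 | 0
19 | 20 | 공통 | 접속개소 과열측정 | 무상 | 2회/년 | 1
20 | 21 | 공통 | 피뢰기 누설전류 측정 | 무상 | 2회/년 | 1
21 | 22 | 공통 | 제어회로 점검 | 무상 | 2회/년 | 1
22 | 23 | 보호설비 | 일반점검 | 무상 | 2회/년 | 1
23 | 24 | 보호설비 | 정밀점검 | 무상 | 휴전시 | 1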