Overview

Dataset statistics

Number of variables: 4
Number of observations: 115
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 3.9 KiB
Average record size in memory: 35.1 B

Variable types

Numeric: 1
Categorical: 2
Text: 1

Dataset

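Description: Yearly informatization (IT) project plans of Korea Western Power (한국서부발전) and the departments responsible for them. The provided fields are 순서 (sequence number), 년도 (year), 사업명 (project name), and 사업소명 (office or department name). Example record: 1,2019,고객중심의 사외홈페이지 전면개편,국정과제추진실
URL: https://www.data.go.kr/data/15067727/fileData.do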

Alerts

순서 is highly overall correlated with 년도 and 1 other field (High correlation)
년도 is highly overall correlated with 순서 and 1 other field (High correlation)
사업소명 is highly overall correlated with 순서 and 1 other field (High correlation)
순서 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 17:05:08.885078
Analysis finished: 2023-12-12 17:05:09.433387
Duration: 0.55 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
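
The reported duration can be cross-checked from the two timestamps above. The following is a minimal sketch using only the Python standard library; it assumes both timestamps are in the same time zone.

```python
from datetime import datetime

# Timestamps copied from the Reproduction section of the report.
started = datetime.fromisoformat("2023-12-12 17:05:08.885078")
finished = datetime.fromisoformat("2023-12-12 17:05:09.433387")

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```

Rounded to two decimals, the difference agrees with the Duration value shown in the report.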

Variables

순서
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 115
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 58
Minimum: 1
Maximum: 115
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.1 KiB

Quantile statistics

Minimum: 1
5-th percentile: 6.7
Q1: 29.5
Median: 58
Q3: 86.5
95-th percentile: 109.3
Maximum: 115
Range: 114
Interquartile range (IQR): 57
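
The quantiles above are consistent with linear interpolation between order statistics. The sketch below illustrates that calculation under two assumptions that the report does not state explicitly: that 순서 takes every integer from 1 to 115, and that linear interpolation is the method used. The helper name `quantile` is hypothetical.

```python
def quantile(sorted_values, q):
    """Return the q-quantile (0 <= q <= 1) using linear interpolation."""
    position = q * (len(sorted_values) - 1)
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])

# Assumption: 순서 is the sequence 1, 2, ..., 115.
values = list(range(1, 116))

q05 = quantile(values, 0.05)
q1 = quantile(values, 0.25)
median = quantile(values, 0.50)
q3 = quantile(values, 0.75)
q95 = quantile(values, 0.95)
iqr = q3 - q1

print(round(q05, 1), q1, median, q3, round(q95, 1), iqr)
```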

Descriptive statistics

Standard deviation: 33.341666
Coefficient of variation (CV): 0.5748563
Kurtosis: -1.2
Mean: 58
Median Absolute Deviation (MAD): 29
Skewness: 0
Sum: 6670
Variance: 1111.6667
Monotonicity: Strictly increasing
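
These descriptive statistics can be reproduced with the Python standard library. The sketch assumes that 순서 is the integer sequence 1 to 115 (consistent with the minimum, maximum, distinct count, and sum above) and that the variance is the sample variance with an n - 1 denominator.

```python
import statistics

# Assumption: 순서 is the sequence 1, 2, ..., 115.
values = list(range(1, 116))

mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance (n - 1 denominator)
std_dev = statistics.stdev(values)       # sample standard deviation
cv = std_dev / mean                      # coefficient of variation
median = statistics.median(values)
# Median absolute deviation: median of the absolute distances to the median.
mad = statistics.median(abs(v - median) for v in values)
total = sum(values)
strictly_increasing = all(a < b for a, b in zip(values, values[1:]))

print(mean, round(variance, 4), round(std_dev, 6), round(cv, 7), mad, total)
```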
Histogram with fixed size bins (bins=50)

Common Values

Value | Count | Frequency (%)
1 | 1 | 0.9%
74 | 1 | 0.9%
86 | 1 | 0.9%
85 | 1 | 0.9%
84 | 1 | 0.9%
83 | 1 | 0.9%
82 | 1 | 0.9%
81 | 1 | 0.9%
80 | 1 | 0.9%
79 | 1 | 0.9%
Other values (105) | 105 | 91.3%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 0.9%
2 | 1 | 0.9%
3 | 1 | 0.9%
4 | 1 | 0.9%
5 | 1 | 0.9%
6 | 1 | 0.9%
7 | 1 | 0.9%
8 | 1 | 0.9%
9 | 1 | 0.9%
10 | 1 | 0.9%

Maximum 10 values

Value | Count | Frequency (%)
115 | 1 | 0.9%
114 | 1 | 0.9%
113 | 1 | 0.9%
112 | 1 | 0.9%
111 | 1 | 0.9%
110 | 1 | 0.9%
109 | 1 | 0.9%
108 | 1 | 0.9%
107 | 1 | 0.9%
106 | 1 | 0.9%

년도
Categorical

HIGH CORRELATION 

Distinct: 5
Distinct (%): 4.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 KiB

Value | Count
2023 | 42
2021 | 24
2019 | 18
2022 | 16
2020 | 15

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2019
2nd row: 2019
3rd row: 2019
4th row: 2019
5th row: 2019

Common Values

Value | Count | Frequency (%)
2023 | 42 | 36.5%
2021 | 24 | 20.9%
2019 | 18 | 15.7%
2022 | 16 | 13.9%
2020 | 15 | 13.0%
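
Each frequency in this table is the count divided by the 115 observations. A small sketch of that arithmetic, with the counts copied from the table:

```python
# Counts per 년도 value, copied from the Common Values table.
counts = {"2023": 42, "2021": 24, "2019": 18, "2022": 16, "2020": 15}

total = sum(counts.values())

# Relative frequency in percent, rounded to one decimal as in the report.
frequencies = {year: round(100 * n / total, 1) for year, n in counts.items()}

for year, pct in frequencies.items():
    print(year, counts[year], f"{pct}%")
```

The counts sum to the number of observations, which matches the report's statement that the column has no missing cells.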

Length

Histogram of lengths of the category

사업명
Text

Distinct: 99
Distinct (%): 86.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 KiB

Length

Max length: 34
Median length: 28
Mean length: 18.052174
Min length: 10

Characters and Unicode

Total characters: 2076
Distinct characters: 256
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
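
As an illustration of these character properties, the sketch below counts Unicode general categories for one project name taken from the Sample rows of this report, using Python's standard `unicodedata` module. It covers only the general category, since the standard library has no direct lookup for script or block.

```python
import unicodedata
from collections import Counter

# One 사업명 value from the report's sample rows.
sample = "사무자동화설비(OA) 교체"

# General category per character: Lo = Other Letter, Lu = Uppercase Letter,
# Ps = Open Punctuation, Pe = Close Punctuation, Zs = Space Separator.
category_counts = Counter(unicodedata.category(ch) for ch in sample)

for category, count in category_counts.most_common():
    print(category, count)
```

Hangul syllables fall under Other Letter (Lo), which is why that category dominates the tables in this section.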

Unique

Unique: 86
Unique (%): 74.8%

Sample

1st row: 고객중심의 사외홈페이지 전면개편
2nd row: 모바일 오피스 구축
3rd row: ERP 서버 성능 개선
4th row: 사무자동화설비(OA) 교체
5th row: 사내 네트워크 설비 성능 개선

Words

Value | Count | Frequency (%)
구축 | 37 | 7.7%
교체 | 18 | 3.7%
고도화 | 16 | 3.3%
개선 | 15 | 3.1%
(value missing) | 14 | 2.9%
시스템 | 11 | 2.3%
cctv | 8 | 1.7%
노후 | 8 | 1.7%
개발 | 7 | 1.4%
확대 | 7 | 1.4%
Other values (233) | 342 | 70.8%

Most occurring characters

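Value | Count | Frequency (%)
(space) | 370 | 17.8%
(character missing) | 61 | 2.9%
(character missing) | 59 | 2.8%
(character missing) | 49 | 2.4%
(character missing) | 46 | 2.2%
(character missing) | 45 | 2.2%
(character missing) | 39 | 1.9%
(character missing) | 39 | 1.9%
(character missing) | 38 | 1.8%
(character missing) | 31 | 1.5%
Other values (246) | 1299 | 62.6%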

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1512 | 72.8%
Space Separator | 370 | 17.8%
Uppercase Letter | 135 | 6.5%
Lowercase Letter | 21 | 1.0%
Close Punctuation | 11 | 0.5%
Open Punctuation | 11 | 0.5%
Decimal Number | 9 | 0.4%
Other Punctuation | 4 | 0.2%
Dash Punctuation | 2 | 0.1%
Math Symbol | 1 | < 0.1%

Most frequent character per category

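Other Letter

Value | Count | Frequency (%)
(character missing) | 61 | 4.0%
(character missing) | 59 | 3.9%
(character missing) | 49 | 3.2%
(character missing) | 46 | 3.0%
(character missing) | 45 | 3.0%
(character missing) | 39 | 2.6%
(character missing) | 39 | 2.6%
(character missing) | 38 | 2.5%
(character missing) | 31 | 2.1%
(character missing) | 30 | 2.0%
Other values (201) | 1075 | 71.1%

Uppercase Letter

Value | Count | Frequency (%)
C | 28 | 20.7%
P | 19 | 14.1%
I | 16 | 11.9%
T | 15 | 11.1%
V | 12 | 8.9%
R | 7 | 5.2%
A | 6 | 4.4%
E | 5 | 3.7%
S | 5 | 3.7%
O | 4 | 3.0%
Other values (9) | 18 | 13.3%

Lowercase Letter

Value | Count | Frequency (%)
o | 4 | 19.0%
i | 4 | 19.0%
e | 3 | 14.3%
a | 2 | 9.5%
n | 2 | 9.5%
t | 1 | 4.8%
u | 1 | 4.8%
l | 1 | 4.8%
g | 1 | 4.8%
k | 1 | 4.8%

Decimal Number

Value | Count | Frequency (%)
5 | 2 | 22.2%
6 | 1 | 11.1%
2 | 1 | 11.1%
3 | 1 | 11.1%
9 | 1 | 11.1%
0 | 1 | 11.1%
1 | 1 | 11.1%
8 | 1 | 11.1%

Other Punctuation

Value | Count | Frequency (%)
, | 2 | 50.0%
# | 2 | 50.0%

Space Separator

Value | Count | Frequency (%)
(space) | 370 | 100.0%

Close Punctuation

Value | Count | Frequency (%)
) | 11 | 100.0%

Open Punctuation

Value | Count | Frequency (%)
( | 11 | 100.0%

Dash Punctuation

Value | Count | Frequency (%)
- | 2 | 100.0%

Math Symbol

Value | Count | Frequency (%)
~ | 1 | 100.0%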

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1512 | 72.8%
Common | 408 | 19.7%
Latin | 156 | 7.5%

Most frequent character per script

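Hangul

Value | Count | Frequency (%)
(character missing) | 61 | 4.0%
(character missing) | 59 | 3.9%
(character missing) | 49 | 3.2%
(character missing) | 46 | 3.0%
(character missing) | 45 | 3.0%
(character missing) | 39 | 2.6%
(character missing) | 39 | 2.6%
(character missing) | 38 | 2.5%
(character missing) | 31 | 2.1%
(character missing) | 30 | 2.0%
Other values (201) | 1075 | 71.1%

Latin

Value | Count | Frequency (%)
C | 28 | 17.9%
P | 19 | 12.2%
I | 16 | 10.3%
T | 15 | 9.6%
V | 12 | 7.7%
R | 7 | 4.5%
A | 6 | 3.8%
E | 5 | 3.2%
S | 5 | 3.2%
o | 4 | 2.6%
Other values (20) | 39 | 25.0%

Common

Value | Count | Frequency (%)
(space) | 370 | 90.7%
) | 11 | 2.7%
( | 11 | 2.7%
, | 2 | 0.5%
5 | 2 | 0.5%
# | 2 | 0.5%
- | 2 | 0.5%
~ | 1 | 0.2%
6 | 1 | 0.2%
2 | 1 | 0.2%
Other values (5) | 5 | 1.2%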

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1512 | 72.8%
ASCII | 564 | 27.2%

Most frequent character per block

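ASCII

Value | Count | Frequency (%)
(space) | 370 | 65.6%
C | 28 | 5.0%
P | 19 | 3.4%
I | 16 | 2.8%
T | 15 | 2.7%
V | 12 | 2.1%
) | 11 | 2.0%
( | 11 | 2.0%
R | 7 | 1.2%
A | 6 | 1.1%
Other values (35) | 69 | 12.2%

Hangul

Value | Count | Frequency (%)
(character missing) | 61 | 4.0%
(character missing) | 59 | 3.9%
(character missing) | 49 | 3.2%
(character missing) | 46 | 3.0%
(character missing) | 45 | 3.0%
(character missing) | 39 | 2.6%
(character missing) | 39 | 2.6%
(character missing) | 38 | 2.5%
(character missing) | 31 | 2.1%
(character missing) | 30 | 2.0%
Other values (201) | 1075 | 71.1%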

사업소명
Categorical

HIGH CORRELATION 

Distinct: 15
Distinct (%): 13.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 KiB

Value | Count
보안처 | 33
평택발전본부 | 17
정보기술처 | 17
정보전략처 | 12
발전기술처 | 11
Other values (10) | 25

Length

Max length: 7
Median length: 6
Mean length: 4.7478261
Min length: 3

Unique

Unique: 5
Unique (%): 4.3%

Sample

1st row: 국정과제추진실
2nd row: 보안처
3rd row: 보안처
4th row: 보안처
5th row: 보안처

Common Values

Value | Count | Frequency (%)
보안처 | 33 | 28.7%
평택발전본부 | 17 | 14.8%
정보기술처 | 17 | 14.8%
정보전략처 | 12 | 10.4%
발전기술처 | 11 | 9.6%
군산발전본부 | 7 | 6.1%
서인천발전본부 | 5 | 4.3%
태안발전본부 | 4 | 3.5%
발전운영처 | 2 | 1.7%
안전품질처 | 2 | 1.7%
Other values (5) | 5 | 4.3%

Length

Histogram of lengths of the category

Correlations

Variable | 순서 | 년도 | 사업명 | 사업소명
순서 | 1.000 | 0.993 | 0.428 | 0.875
년도 | 0.993 | 1.000 | 0.941 | 0.912
사업명 | 0.428 | 0.941 | 1.000 | 0.999
사업소명 | 0.875 | 0.912 | 0.999 | 1.000

Variable | 년도 | 사업소명
년도 | 1.000 | 0.602
사업소명 | 0.602 | 1.000

Variable | 순서 | 년도 | 사업소명
순서 | 1.000 | 0.860 | 0.549
년도 | 0.860 | 1.000 | 0.602
사업소명 | 0.549 | 0.602 | 1.000
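
The report does not say which coefficient each matrix uses. As a purely illustrative sketch, the code below computes the uncorrected Cramér's V, one common measure of association between two categorical columns such as 년도 and 사업소명. The function name `cramers_v` and the toy inputs are hypothetical, and the result is not expected to reproduce the values above, which may come from a different or bias-corrected coefficient.

```python
from collections import Counter
from math import sqrt

def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long categorical sequences."""
    n = len(xs)
    pair_counts = Counter(zip(xs, ys))
    x_counts = Counter(xs)
    y_counts = Counter(ys)
    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for x, x_n in x_counts.items():
        for y, y_n in y_counts.items():
            expected = x_n * y_n / n
            observed = pair_counts.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(x_counts), len(y_counts)) - 1
    return sqrt(chi2 / (n * k)) if k > 0 else 0.0

# Toy data (hypothetical, not taken from the dataset).
years = ["2019", "2019", "2023", "2023"]
perfectly_associated = cramers_v(years, ["A", "A", "B", "B"])
independent = cramers_v(years, ["A", "B", "A", "B"])
print(perfectly_associated, independent)
```

A value of 1 means one column fully determines the other, and 0 means the two columns are independent in the sample.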

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 순서 | 년도 | 사업명 | 사업소명
0 | 1 | 2019 | 고객중심의 사외홈페이지 전면개편 | 국정과제추진실
1 | 2 | 2019 | 모바일 오피스 구축 | 보안처
2 | 3 | 2019 | ERP 서버 성능 개선 | 보안처
3 | 4 | 2019 | 사무자동화설비(OA) 교체 | 보안처
4 | 5 | 2019 | 사내 네트워크 설비 성능 개선 | 보안처
5 | 6 | 2019 | 업무용 소프트웨어 구매 | 보안처
6 | 7 | 2019 | 인사노무시스템 웹 표준화 및 연말정산 패키지 개발 | 보안처
7 | 8 | 2019 | GENi 웹표준화 및 UX 신규 구축 | 보안처
8 | 9 | 2019 | 시스템 접근제어기능 및 패스워드 관리체계 강화 | 보안처
9 | 10 | 2019 | 빅데이터 기반보안 관제시스템 고도화 | 보안처

Last rows

Index | 순서 | 년도 | 사업명 | 사업소명
105 | 106 | 2023 | 인공지능 스마트 안전 CCTV감시시스템 확대 구축 | 평택발전본부
106 | 107 | 2023 | CCTV망 정보보안 인프라 개선 | 평택발전본부
107 | 108 | 2023 | IP교환기 성능 개선 | 평택발전본부
108 | 109 | 2023 | 소내 전광판설비 성능 개선 | 평택발전본부
109 | 110 | 2023 | 차세대 무선침입차단시스템(WIPS) 구축 | 평택발전본부
110 | 111 | 2023 | 노후 IP교환기 교체 | 서인천발전본부
111 | 112 | 2023 | 발전설비감시 CCTV 시스템 보강 | 서인천발전본부
112 | 113 | 2023 | 군산발전본부 네트워크 보안 강화 | 군산발전본부
113 | 114 | 2023 | 출입통제시스템 교체 | 군산발전본부
114 | 115 | 2023 | 정문 홍보용 전광판 교체 | 군산발전본부