Overview

Dataset statistics

Number of variables8
Number of observations23
Missing cells129
Missing cells (%)70.1%
Duplicate rows1
Duplicate rows (%)4.3%
Total size in memory1.7 KiB
Average record size in memory76.7 B

Variable types

Numeric1
Categorical1
Text1
Unsupported5

Dataset

Description대용량고객 고객교육정보제공 게시판정보입니다.
Author한국전력공사
URLhttps://www.data.go.kr/data/15069019/fileData.do

Alerts

Dataset has 1 (4.3%) duplicate rowsDuplicates
순번 is highly overall correlated with 연도High correlation
연도 is highly overall correlated with 순번High correlation
순번 has 7 (30.4%) missing valuesMissing
제목 has 7 (30.4%) missing valuesMissing
Unnamed: 3 has 23 (100.0%) missing valuesMissing
Unnamed: 4 has 23 (100.0%) missing valuesMissing
Unnamed: 5 has 23 (100.0%) missing valuesMissing
Unnamed: 6 has 23 (100.0%) missing valuesMissing
Unnamed: 7 has 23 (100.0%) missing valuesMissing
Unnamed: 3 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 15:45:33.355006
Analysis finished2023-12-12 15:45:34.142550
Duration0.79 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct16
Distinct (%)100.0%
Missing7
Missing (%)30.4%
Infinite0
Infinite (%)0.0%
Mean8.5
Minimum1
Maximum16
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size339.0 B
2023-12-13T00:45:34.228543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1.75
Q14.75
median8.5
Q312.25
95-th percentile15.25
Maximum16
Range15
Interquartile range (IQR)7.5

Descriptive statistics

Standard deviation4.7609523
Coefficient of variation (CV)0.56011203
Kurtosis-1.2
Mean8.5
Median Absolute Deviation (MAD)4
Skewness0
Sum136
Variance22.666667
MonotonicityStrictly increasing
2023-12-13T00:45:34.400129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%)
10 1
 
4.3%
16 1
 
4.3%
15 1
 
4.3%
14 1
 
4.3%
13 1
 
4.3%
12 1
 
4.3%
11 1
 
4.3%
1 1
 
4.3%
2 1
 
4.3%
8 1
 
4.3%
Other values (6) 6
26.1%
(Missing) 7
30.4%
ValueCountFrequency (%)
1 1
4.3%
2 1
4.3%
3 1
4.3%
4 1
4.3%
5 1
4.3%
6 1
4.3%
7 1
4.3%
8 1
4.3%
9 1
4.3%
10 1
4.3%
ValueCountFrequency (%)
16 1
4.3%
15 1
4.3%
14 1
4.3%
13 1
4.3%
12 1
4.3%
11 1
4.3%
10 1
4.3%
9 1
4.3%
8 1
4.3%
7 1
4.3%

연도
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)26.1%
Missing0
Missing (%)0.0%
Memory size316.0 B
<NA>
2010
2011
2007
2009

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique2 ?
Unique (%)8.7%

Sample

1st row2007
2nd row2007
3rd row2007
4th row2007
5th row2009

Common Values

ValueCountFrequency (%)
<NA> 7
30.4%
2010 5
21.7%
2011 5
21.7%
2007 4
17.4%
2009 1
 
4.3%
2013 1
 
4.3%

Length

2023-12-13T00:45:34.580890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:45:34.727900image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 7
30.4%
2010 5
21.7%
2011 5
21.7%
2007 4
17.4%
2009 1
 
4.3%
2013 1
 
4.3%

제목
Text

MISSING 

Distinct16
Distinct (%)100.0%
Missing7
Missing (%)30.4%
Memory size316.0 B
2023-12-13T00:45:34.983323image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length25
Mean length25.375
Min length17

Characters and Unicode

Total characters406
Distinct characters84
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique16 ?
Unique (%)100.0%

Sample

1st row베트남 전력기자재 기술규격 및 시장동향 워크샵
2nd row베트남 전력설비 표준, 시험 및 인증제도
3rd row베트남 배전자동화 실증사업 계획
4th row베트남 투자진출 절차 및 유의사항
5th row녹색성장 8대기술 국제표준화 추진 워크숍
ValueCountFrequency (%)
workshop 10
 
12.5%
시장동향 6
 
7.5%
중국 5
 
6.2%
기술규격및 5
 
6.2%
중국전력기자재 5
 
6.2%
교류 5
 
6.2%
기술정보 5
 
6.2%
베트남 4
 
5.0%
3
 
3.8%
워크숍 2
 
2.5%
Other values (30) 30
37.5%
2023-12-13T00:45:35.416075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
64
 
15.8%
18
 
4.4%
12
 
3.0%
12
 
3.0%
R 10
 
2.5%
10
 
2.5%
W 10
 
2.5%
o 10
 
2.5%
P 10
 
2.5%
O 10
 
2.5%
Other values (74) 240
59.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 225
55.4%
Uppercase Letter 65
 
16.0%
Space Separator 64
 
15.8%
Lowercase Letter 35
 
8.6%
Decimal Number 16
 
3.9%
Other Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
18
 
8.0%
12
 
5.3%
12
 
5.3%
10
 
4.4%
9
 
4.0%
8
 
3.6%
8
 
3.6%
8
 
3.6%
7
 
3.1%
7
 
3.1%
Other values (50) 126
56.0%
Uppercase Letter
ValueCountFrequency (%)
R 10
15.4%
W 10
15.4%
P 10
15.4%
O 10
15.4%
A 5
7.7%
T 5
7.7%
S 5
7.7%
H 5
7.7%
K 5
7.7%
Decimal Number
ValueCountFrequency (%)
2 4
25.0%
1 3
18.8%
3 3
18.8%
5 2
12.5%
4 2
12.5%
0 1
 
6.2%
8 1
 
6.2%
Lowercase Letter
ValueCountFrequency (%)
o 10
28.6%
r 5
14.3%
k 5
14.3%
h 5
14.3%
p 5
14.3%
s 5
14.3%
Space Separator
ValueCountFrequency (%)
64
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 225
55.4%
Latin 100
24.6%
Common 81
 
20.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
18
 
8.0%
12
 
5.3%
12
 
5.3%
10
 
4.4%
9
 
4.0%
8
 
3.6%
8
 
3.6%
8
 
3.6%
7
 
3.1%
7
 
3.1%
Other values (50) 126
56.0%
Latin
ValueCountFrequency (%)
R 10
 
10.0%
W 10
 
10.0%
o 10
 
10.0%
P 10
 
10.0%
O 10
 
10.0%
A 5
 
5.0%
T 5
 
5.0%
S 5
 
5.0%
r 5
 
5.0%
H 5
 
5.0%
Other values (5) 25
25.0%
Common
ValueCountFrequency (%)
64
79.0%
2 4
 
4.9%
1 3
 
3.7%
3 3
 
3.7%
5 2
 
2.5%
4 2
 
2.5%
0 1
 
1.2%
, 1
 
1.2%
8 1
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 225
55.4%
ASCII 181
44.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
64
35.4%
R 10
 
5.5%
W 10
 
5.5%
o 10
 
5.5%
P 10
 
5.5%
O 10
 
5.5%
A 5
 
2.8%
T 5
 
2.8%
S 5
 
2.8%
r 5
 
2.8%
Other values (14) 47
26.0%
Hangul
ValueCountFrequency (%)
18
 
8.0%
12
 
5.3%
12
 
5.3%
10
 
4.4%
9
 
4.0%
8
 
3.6%
8
 
3.6%
8
 
3.6%
7
 
3.1%
7
 
3.1%
Other values (50) 126
56.0%

Unnamed: 3
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing23
Missing (%)100.0%
Memory size339.0 B

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing23
Missing (%)100.0%
Memory size339.0 B

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing23
Missing (%)100.0%
Memory size339.0 B

Unnamed: 6
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing23
Missing (%)100.0%
Memory size339.0 B

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing23
Missing (%)100.0%
Memory size339.0 B

Interactions

2023-12-13T00:45:33.555099image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T00:45:35.527195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번연도제목
순번1.0000.7751.000
연도0.7751.0001.000
제목1.0001.0001.000
2023-12-13T00:45:35.635873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번연도
순번1.0000.584
연도0.5841.000

Missing values

2023-12-13T00:45:33.741679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T00:45:33.928632image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T00:45:34.052158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

순번연도제목Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7
012007베트남 전력기자재 기술규격 및 시장동향 워크샵<NA><NA><NA><NA><NA>
122007베트남 전력설비 표준, 시험 및 인증제도<NA><NA><NA><NA><NA>
232007베트남 배전자동화 실증사업 계획<NA><NA><NA><NA><NA>
342007베트남 투자진출 절차 및 유의사항<NA><NA><NA><NA><NA>
452009녹색성장 8대기술 국제표준화 추진 워크숍<NA><NA><NA><NA><NA>
562010중국 기술정보 교류 Workshop 제2강<NA><NA><NA><NA><NA>
672010중국 기술정보 교류 Workshop 제1강<NA><NA><NA><NA><NA>
782010중국 기술정보 교류 Workshop 제3강<NA><NA><NA><NA><NA>
892010중국 기술정보 교류 Workshop 제4강<NA><NA><NA><NA><NA>
9102010중국 기술정보 교류 Workshop 제5강<NA><NA><NA><NA><NA>
순번연도제목Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7
13142011중국전력기자재 기술규격및 시장동향 WORKSHOP PART4<NA><NA><NA><NA><NA>
14152011중국전력기자재 기술규격및 시장동향 WORKSHOP PART5<NA><NA><NA><NA><NA>
151620132013년도 제2차 국제표준위원회 워크숍<NA><NA><NA><NA><NA>
16<NA><NA><NA><NA><NA><NA><NA><NA>
17<NA><NA><NA><NA><NA><NA><NA><NA>
18<NA><NA><NA><NA><NA><NA><NA><NA>
19<NA><NA><NA><NA><NA><NA><NA><NA>
20<NA><NA><NA><NA><NA><NA><NA><NA>
21<NA><NA><NA><NA><NA><NA><NA><NA>
22<NA><NA><NA><NA><NA><NA><NA><NA>

Duplicate rows

Most frequently occurring

순번연도제목# duplicates
0<NA><NA><NA>7