Overview

Dataset statistics

Number of variables: 4
Number of observations: 142
Missing cells: 142
Missing cells (%): 25.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 4.8 KiB
Average record size in memory: 34.9 B
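
As a quick sanity check, the missing-cell percentage follows directly from the counts above: 142 missing cells out of 142 × 4 cells in total, which is consistent with one of the four columns (비고) being entirely empty. A minimal sketch in Python (the variable names are illustrative and not part of the report):

```python
# Values copied from the dataset statistics above.
n_variables = 4
n_observations = 142
missing_cells = 142

total_cells = n_variables * n_observations       # every row has one cell per variable
missing_pct = 100 * missing_cells / total_cells  # share of cells that are missing

print(f"{missing_pct:.1f}%")  # prints "25.0%"
```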

Variable types

Numeric: 1
Categorical: 1
Text: 1
Unsupported: 1

Dataset

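Description: A list of vocational-education e-learning content provided by Korea Polytechnics (한국폴리텍대학). The data fields provided are 분야 (field) and 콘텐츠명 (content name).
Author: 학교법인한국폴리텍 (Korea Polytechnics)
URL: https://www.data.go.kr/data/15053545/fileData.do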

Alerts

번호 is highly overall correlated with 분야 (High correlation)
분야 is highly overall correlated with 번호 (High correlation)
비고 has 142 (100.0%) missing values (Missing)
번호 has unique values (Unique)
비고 is an unsupported type, check if it needs cleaning or further analysis (Unsupported)

Reproduction

Analysis started: 2023-12-12 19:35:02.448187
Analysis finished: 2023-12-12 19:35:03.203421
Duration: 0.76 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
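
A report like this one can be regenerated with ydata-profiling. The sketch below is not the original script: the function name, the default output path, the report title, and the CP949 encoding (common for Korean public CSV files) are all assumptions.

```python
def build_report(csv_path, output_path="report.html"):
    """Profile a CSV file and write an HTML report (illustrative sketch)."""
    # Imported lazily so the function can be defined without the libraries installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the downloaded file is CP949-encoded; adjust if it is UTF-8.
    df = pd.read_csv(csv_path, encoding="cp949")

    # Assumption: default profiling settings; the title is a placeholder.
    report = ProfileReport(df, title="Korea Polytechnics e-learning content")
    report.to_file(output_path)
    return report
```

Calling `build_report("contents.csv")` (a hypothetical file name) would write `report.html` next to the script.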

Variables

번호 (number)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 142
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 71.5
Minimum: 1
Maximum: 142
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.4 KiB

Quantile statistics

Minimum: 1
5-th percentile: 8.05
Q1: 36.25
Median: 71.5
Q3: 106.75
95-th percentile: 134.95
Maximum: 142
Range: 141
Interquartile range (IQR): 70.5
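
The quantiles above are what linear interpolation between order statistics gives for the integers 1 to 142. The sketch below assumes that interpolation rule (the default in pandas and NumPy); the helper name `percentile` is illustrative.

```python
import math

def percentile(sorted_values, q):
    """Return the q-th quantile (0 <= q <= 1) using linear interpolation."""
    position = q * (len(sorted_values) - 1)  # fractional index into the sorted data
    lower = math.floor(position)
    upper = math.ceil(position)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])

values = list(range(1, 143))  # 번호 takes every integer from 1 to 142 exactly once

p05 = percentile(values, 0.05)
q1 = percentile(values, 0.25)
median = percentile(values, 0.50)
q3 = percentile(values, 0.75)
p95 = percentile(values, 0.95)
iqr = q3 - q1
```

Under this rule Q1 falls at index 0.25 × 141 = 35.25, a quarter of the way between 36 and 37, which gives 36.25.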

Descriptive statistics

Standard deviation: 41.135953
Coefficient of variation (CV): 0.57532802
Kurtosis: -1.2
Mean: 71.5
Median Absolute Deviation (MAD): 35.5
Skewness: 0
Sum: 10153
Variance: 1692.1667
Monotonicity: Strictly increasing
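
Most of these figures can be re-derived from the fact that 번호 is simply 1 to 142. The sketch below uses Python's standard `statistics` module and assumes the sample (n − 1) definition of variance, which matches the reported value; kurtosis and skewness are left out.

```python
import statistics

values = [float(v) for v in range(1, 143)]  # 번호: 1, 2, ..., 142

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std_dev = statistics.stdev(values)      # square root of the sample variance
cv = std_dev / mean                     # coefficient of variation
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)  # median absolute deviation
```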
Histogram with fixed size bins (bins=50)

Common values
Value | Count | Frequency (%)
1 | 1 | 0.7%
99 | 1 | 0.7%
93 | 1 | 0.7%
94 | 1 | 0.7%
95 | 1 | 0.7%
96 | 1 | 0.7%
97 | 1 | 0.7%
98 | 1 | 0.7%
100 | 1 | 0.7%
91 | 1 | 0.7%
Other values (132) | 132 | 93.0%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 0.7%
2 | 1 | 0.7%
3 | 1 | 0.7%
4 | 1 | 0.7%
5 | 1 | 0.7%
6 | 1 | 0.7%
7 | 1 | 0.7%
8 | 1 | 0.7%
9 | 1 | 0.7%
10 | 1 | 0.7%

Maximum 10 values
Value | Count | Frequency (%)
142 | 1 | 0.7%
141 | 1 | 0.7%
140 | 1 | 0.7%
139 | 1 | 0.7%
138 | 1 | 0.7%
137 | 1 | 0.7%
136 | 1 | 0.7%
135 | 1 | 0.7%
134 | 1 | 0.7%
133 | 1 | 0.7%

분야 (field)
Categorical

HIGH CORRELATION 

Distinct: 20
Distinct (%): 14.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB

교양: 31
전기전자: 18
영어: 17
정보통신: 17
기계: 13
Other values (15): 46

Length

Max length: 6
Median length: 2
Mean length: 2.9859155
Min length: 2

Unique

Unique: 5
Unique (%): 3.5%
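
The report separates "Distinct" (how many different values occur) from "Unique" (how many values occur exactly once), which is why 분야 has 20 distinct values but only 5 unique ones. A small sketch of the difference, using a made-up toy column rather than the real data:

```python
from collections import Counter

# Hypothetical toy column, not the actual 분야 values from the dataset.
column = ["OA", "OA", "교양", "교양", "교양", "패션", "표면처리"]

counts = Counter(column)
distinct = len(counts)                               # different values present
unique = sum(1 for c in counts.values() if c == 1)   # values seen exactly once

print(distinct, unique)  # prints "4 2"
```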

Sample

1st row: OA
2nd row: OA
3rd row: OA
4th row: OA
5th row: OA

Common Values

Value | Count | Frequency (%)
교양 | 31 | 21.8%
전기전자 | 18 | 12.7%
영어 | 17 | 12.0%
정보통신 | 17 | 12.0%
기계 | 13 | 9.2%
미디어디자인 | 9 | 6.3%
OA | 7 | 4.9%
자동차 | 5 | 3.5%
자동화 | 5 | 3.5%
바이오 | 4 | 2.8%
Other values (10) | 16 | 11.3%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
교양 | 31 | 21.8%
전기전자 | 18 | 12.7%
영어 | 17 | 12.0%
정보통신 | 17 | 12.0%
기계 | 13 | 9.2%
미디어디자인 | 9 | 6.3%
oa | 7 | 4.9%
자동차 | 5 | 3.5%
자동화 | 5 | 3.5%
바이오 | 4 | 2.8%
Other values (10) | 16 | 11.3%

콘텐츠명 (content name)
Text

Distinct: 141
Distinct (%): 99.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB

Length

Max length: 33
Median length: 26
Mean length: 16.880282
Min length: 5

Characters and Unicode

Total characters: 2397
Distinct characters: 328
Distinct categories: 12
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
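
To illustrate, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories for one of the 콘텐츠명 sample values in this report; the helper name `category_counts` is illustrative, and this is not necessarily how ydata-profiling computes its tables.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count Unicode general categories (e.g. 'Lo', 'Zs') of the characters in text."""
    return Counter(unicodedata.category(ch) for ch in text)

sample = "사무_업무의 달인을 인정받는 Excel 2007"  # a 콘텐츠명 sample value
counts = category_counts(sample)

# Two-letter codes map to the names used in this report, for example:
# Lo = Other Letter (Hangul syllables), Zs = Space Separator,
# Pc = Connector Punctuation ("_"), Lu/Ll = Uppercase/Lowercase Letter,
# Nd = Decimal Number.
for code, count in counts.most_common():
    print(code, count)
```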

Unique

Unique: 140
Unique (%): 98.6%

Sample

1st row: 사무_사무자동화산업기사
2nd row: 사무_업무의 달인을 인정받는 Excel 2007
3rd row: 사무_업무의 달인을 인정받는 Powerpoint2007
4th row: 사무_컴퓨터그래픽스운용기능사
5th row: 전기전자_Must Know PC Advanced

Value | Count | Frequency (%)
이용한 | 6 | 1.5%
위한 | 6 | 1.5%
영어_how | 6 | 1.5%
to | 6 | 1.5%
basic | 5 | 1.3%
기초 | 5 | 1.3%
(not rendered) | 5 | 1.3%
toeic | 4 | 1.0%
2 | 3 | 0.8%
전기전자_must | 3 | 0.8%
Other values (293) | 345 | 87.6%

Most occurring characters

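Value | Count | Frequency (%)
(space) | 258 | 10.8%
_ | 132 | 5.5%
(not rendered) | 97 | 4.0%
(not rendered) | 64 | 2.7%
(not rendered) | 51 | 2.1%
(not rendered) | 44 | 1.8%
(not rendered) | 38 | 1.6%
(not rendered) | 35 | 1.5%
C | 35 | 1.5%
(not rendered) | 33 | 1.4%
Other values (318) | 1610 | 67.2%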

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1535 | 64.0%
Space Separator | 258 | 10.8%
Uppercase Letter | 243 | 10.1%
Lowercase Letter | 136 | 5.7%
Connector Punctuation | 132 | 5.5%
Decimal Number | 39 | 1.6%
Other Punctuation | 22 | 0.9%
Close Punctuation | 11 | 0.5%
Open Punctuation | 11 | 0.5%
Dash Punctuation | 6 | 0.3%
Other values (2) | 4 | 0.2%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not rendered) | 97 | 6.3%
(not rendered) | 64 | 4.2%
(not rendered) | 51 | 3.3%
(not rendered) | 44 | 2.9%
(not rendered) | 38 | 2.5%
(not rendered) | 35 | 2.3%
(not rendered) | 33 | 2.1%
(not rendered) | 29 | 1.9%
(not rendered) | 28 | 1.8%
(not rendered) | 28 | 1.8%
Other values (255) | 1088 | 70.9%

Uppercase Letter
Value | Count | Frequency (%)
C | 35 | 14.4%
I | 23 | 9.5%
A | 19 | 7.8%
E | 19 | 7.8%
P | 17 | 7.0%
O | 17 | 7.0%
S | 16 | 6.6%
D | 15 | 6.2%
T | 14 | 5.8%
M | 11 | 4.5%
Other values (12) | 57 | 23.5%

Lowercase Letter
Value | Count | Frequency (%)
t | 20 | 14.7%
o | 14 | 10.3%
e | 12 | 8.8%
n | 11 | 8.1%
a | 11 | 8.1%
c | 9 | 6.6%
u | 9 | 6.6%
r | 8 | 5.9%
i | 7 | 5.1%
w | 7 | 5.1%
Other values (9) | 28 | 20.6%

Decimal Number
Value | Count | Frequency (%)
1 | 11 | 28.2%
0 | 9 | 23.1%
2 | 9 | 23.1%
3 | 6 | 15.4%
7 | 2 | 5.1%
6 | 1 | 2.6%
5 | 1 | 2.6%

Other Punctuation
Value | Count | Frequency (%)
/ | 14 | 63.6%
: | 4 | 18.2%
, | 1 | 4.5%
% | 1 | 4.5%
! | 1 | 4.5%
. | 1 | 4.5%

Close Punctuation
Value | Count | Frequency (%)
) | 10 | 90.9%
] | 1 | 9.1%

Open Punctuation
Value | Count | Frequency (%)
( | 10 | 90.9%
[ | 1 | 9.1%

Space Separator
Value | Count | Frequency (%)
(space) | 258 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 132 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 6 | 100.0%

Letter Number
Value | Count | Frequency (%)
(not rendered) | 2 | 100.0%

Math Symbol
Value | Count | Frequency (%)
+ | 2 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1535 | 64.0%
Common | 481 | 20.1%
Latin | 381 | 15.9%

Most frequent character per script

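Hangul
Value | Count | Frequency (%)
(not rendered) | 97 | 6.3%
(not rendered) | 64 | 4.2%
(not rendered) | 51 | 3.3%
(not rendered) | 44 | 2.9%
(not rendered) | 38 | 2.5%
(not rendered) | 35 | 2.3%
(not rendered) | 33 | 2.1%
(not rendered) | 29 | 1.9%
(not rendered) | 28 | 1.8%
(not rendered) | 28 | 1.8%
Other values (255) | 1088 | 70.9%

Latin
Value | Count | Frequency (%)
C | 35 | 9.2%
I | 23 | 6.0%
t | 20 | 5.2%
A | 19 | 5.0%
E | 19 | 5.0%
P | 17 | 4.5%
O | 17 | 4.5%
S | 16 | 4.2%
D | 15 | 3.9%
T | 14 | 3.7%
Other values (32) | 186 | 48.8%

Common
Value | Count | Frequency (%)
(space) | 258 | 53.6%
_ | 132 | 27.4%
/ | 14 | 2.9%
1 | 11 | 2.3%
) | 10 | 2.1%
( | 10 | 2.1%
0 | 9 | 1.9%
2 | 9 | 1.9%
- | 6 | 1.2%
3 | 6 | 1.2%
Other values (11) | 16 | 3.3%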

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1535 | 64.0%
ASCII | 860 | 35.9%
Number Forms | 2 | 0.1%

Most frequent character per block

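ASCII
Value | Count | Frequency (%)
(space) | 258 | 30.0%
_ | 132 | 15.3%
C | 35 | 4.1%
I | 23 | 2.7%
t | 20 | 2.3%
A | 19 | 2.2%
E | 19 | 2.2%
P | 17 | 2.0%
O | 17 | 2.0%
S | 16 | 1.9%
Other values (52) | 304 | 35.3%

Hangul
Value | Count | Frequency (%)
(not rendered) | 97 | 6.3%
(not rendered) | 64 | 4.2%
(not rendered) | 51 | 3.3%
(not rendered) | 44 | 2.9%
(not rendered) | 38 | 2.5%
(not rendered) | 35 | 2.3%
(not rendered) | 33 | 2.1%
(not rendered) | 29 | 1.9%
(not rendered) | 28 | 1.8%
(not rendered) | 28 | 1.8%
Other values (255) | 1088 | 70.9%

Number Forms
Value | Count | Frequency (%)
(not rendered) | 2 | 100.0%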

비고 (remarks)
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 142
Missing (%): 100.0%
Memory size: 1.4 KiB

Interactions


Correlations

Variable | 번호 | 분야
번호 | 1.000 | 0.973
분야 | 0.973 | 1.000

Variable | 번호 | 분야
번호 | 1.000 | 0.705
분야 | 0.705 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
Index | 번호 | 분야 | 콘텐츠명 | 비고
0 | 1 | OA | 사무_사무자동화산업기사 | <NA>
1 | 2 | OA | 사무_업무의 달인을 인정받는 Excel 2007 | <NA>
2 | 3 | OA | 사무_업무의 달인을 인정받는 Powerpoint2007 | <NA>
3 | 4 | OA | 사무_컴퓨터그래픽스운용기능사 | <NA>
4 | 5 | OA | 전기전자_Must Know PC Advanced | <NA>
5 | 6 | OA | 전기전자_Must Know PC Basic | <NA>
6 | 7 | OA | 전기전자_Must Know PC Intermediate | <NA>
7 | 8 | 교양 | 공통_경영학원론 | <NA>
8 | 9 | 교양 | 공통_경영학원론2 | <NA>
9 | 10 | 교양 | 공통_경제학 이해 | <NA>

Last rows
Index | 번호 | 분야 | 콘텐츠명 | 비고
132 | 133 | 정보통신 | 정보통신_사례로알아보는홈네트워크시공 | <NA>
133 | 134 | 정보통신 | 정보통신_윈도우웹서버구축/세상을 바꾸는 정보통신기술/운영체제 | <NA>
134 | 135 | 정보통신 | 정보통신_정보처리산업기사 | <NA>
135 | 136 | 정보통신 | 정보통신_홈네트워크건축도면의이해및설계 | <NA>
136 | 137 | 정보통신 | 정보통신_홈네트워크망설계및유지관리방법 | <NA>
137 | 138 | 패션 | 패션디자인_패션VMD/디자인 도식화/패션코디네이터 | <NA>
138 | 139 | 패션 | 패션디자인_패션디자인의 요소 및 원리 | <NA>
139 | 140 | 표면처리 | 표면처리_전기도금 | <NA>
140 | 141 | 환경화학 | 환경화학_신재생에너지 | <NA>
141 | 142 | 환경화학 | 환경화학_위험물산업기사 | <NA>