Overview

Dataset statistics

Number of variables5
Number of observations310
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory12.8 KiB
Average record size in memory42.4 B

Variable types

Text1
Categorical3
Numeric1

Dataset

Description한국산업안전보건공단 교육원에서 실시하는 사업 중 온라인(이러닝) 교육과정 콘텐츠 정보 목록에 대한 데이터입니다."23년 기준 강의 목록명과 정원 수 등을 확인하실 수 있습니다.
Author한국산업안전보건공단
URLhttps://www.data.go.kr/data/15118243/fileData.do

Alerts

교육연도 has constant value ""Constant
정원 is highly overall correlated with 교육대상High correlation
교육대상 is highly overall correlated with 정원 and 1 other fieldsHigh correlation
법정인정 교육시간_h is highly overall correlated with 교육대상High correlation

Reproduction

Analysis started2024-03-14 08:47:08.560288
Analysis finished2024-03-14 08:47:09.732026
Duration1.17 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct305
Distinct (%)98.4%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
2024-03-14T17:47:10.682270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length47
Median length35
Mean length26.658065
Min length9

Characters and Unicode

Total characters8264
Distinct characters336
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique300 ?
Unique (%)96.8%

Sample

1st row특수형태근로종사자 최초 노무제공 시 교육: 불도저
2nd row특수형태근로종사자 최초 노무제공 시 교육: 롤러
3rd row건축공사 작업안전
4th row전문가 특강: [추락재해 예방]
5th row['23년 공단직원 기타교육] 도·소매업 관리감독자 실무능력 향상
ValueCountFrequency (%)
23년 58
 
3.7%
교육 44
 
2.8%
최초 35
 
2.2%
35
 
2.2%
노무제공 35
 
2.2%
특수형태근로종사자 35
 
2.2%
공단직원 32
 
2.0%
안전관리자 28
 
1.8%
28
 
1.8%
고용노동부 23
 
1.5%
Other values (435) 1211
77.4%
2024-03-14T17:47:12.510718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1255
 
15.2%
249
 
3.0%
213
 
2.6%
206
 
2.5%
198
 
2.4%
196
 
2.4%
194
 
2.3%
191
 
2.3%
( 163
 
2.0%
) 163
 
2.0%
Other values (326) 5236
63.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5868
71.0%
Space Separator 1255
 
15.2%
Open Punctuation 254
 
3.1%
Close Punctuation 254
 
3.1%
Decimal Number 235
 
2.8%
Other Punctuation 225
 
2.7%
Lowercase Letter 72
 
0.9%
Uppercase Letter 49
 
0.6%
Letter Number 40
 
0.5%
Connector Punctuation 6
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
249
 
4.2%
213
 
3.6%
206
 
3.5%
198
 
3.4%
196
 
3.3%
194
 
3.3%
191
 
3.3%
153
 
2.6%
143
 
2.4%
142
 
2.4%
Other values (278) 3983
67.9%
Uppercase Letter
ValueCountFrequency (%)
O 13
26.5%
Z 11
22.4%
M 10
20.4%
P 5
 
10.2%
S 4
 
8.2%
B 2
 
4.1%
A 2
 
4.1%
L 1
 
2.0%
H 1
 
2.0%
Lowercase Letter
ValueCountFrequency (%)
i 20
27.8%
m 14
19.4%
n 10
13.9%
o 8
 
11.1%
p 5
 
6.9%
l 5
 
6.9%
a 5
 
6.9%
y 5
 
6.9%
Other Punctuation
ValueCountFrequency (%)
: 123
54.7%
' 51
22.7%
· 26
 
11.6%
, 15
 
6.7%
/ 5
 
2.2%
? 4
 
1.8%
. 1
 
0.4%
Letter Number
ValueCountFrequency (%)
12
30.0%
12
30.0%
7
17.5%
6
15.0%
1
 
2.5%
1
 
2.5%
1
 
2.5%
Decimal Number
ValueCountFrequency (%)
2 103
43.8%
3 85
36.2%
1 20
 
8.5%
0 17
 
7.2%
4 8
 
3.4%
6 2
 
0.9%
Open Punctuation
ValueCountFrequency (%)
( 163
64.2%
[ 80
31.5%
7
 
2.8%
4
 
1.6%
Close Punctuation
ValueCountFrequency (%)
) 163
64.2%
] 80
31.5%
7
 
2.8%
4
 
1.6%
Space Separator
ValueCountFrequency (%)
1255
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 6
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5868
71.0%
Common 2235
 
27.0%
Latin 161
 
1.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
249
 
4.2%
213
 
3.6%
206
 
3.5%
198
 
3.4%
196
 
3.3%
194
 
3.3%
191
 
3.3%
153
 
2.6%
143
 
2.4%
142
 
2.4%
Other values (278) 3983
67.9%
Common
ValueCountFrequency (%)
1255
56.2%
( 163
 
7.3%
) 163
 
7.3%
: 123
 
5.5%
2 103
 
4.6%
3 85
 
3.8%
[ 80
 
3.6%
] 80
 
3.6%
' 51
 
2.3%
· 26
 
1.2%
Other values (14) 106
 
4.7%
Latin
ValueCountFrequency (%)
i 20
12.4%
m 14
 
8.7%
O 13
 
8.1%
12
 
7.5%
12
 
7.5%
Z 11
 
6.8%
n 10
 
6.2%
M 10
 
6.2%
o 8
 
5.0%
7
 
4.3%
Other values (14) 44
27.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5868
71.0%
ASCII 2308
 
27.9%
None 48
 
0.6%
Number Forms 40
 
0.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1255
54.4%
( 163
 
7.1%
) 163
 
7.1%
: 123
 
5.3%
2 103
 
4.5%
3 85
 
3.7%
[ 80
 
3.5%
] 80
 
3.5%
' 51
 
2.2%
1 20
 
0.9%
Other values (26) 185
 
8.0%
Hangul
ValueCountFrequency (%)
249
 
4.2%
213
 
3.6%
206
 
3.5%
198
 
3.4%
196
 
3.3%
194
 
3.3%
191
 
3.3%
153
 
2.6%
143
 
2.4%
142
 
2.4%
Other values (278) 3983
67.9%
None
ValueCountFrequency (%)
· 26
54.2%
7
 
14.6%
7
 
14.6%
4
 
8.3%
4
 
8.3%
Number Forms
ValueCountFrequency (%)
12
30.0%
12
30.0%
7
17.5%
6
15.0%
1
 
2.5%
1
 
2.5%
1
 
2.5%

교육대상
Categorical

HIGH CORRELATION 

Distinct40
Distinct (%)12.9%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
교육희망자
42 
특수형태근로종사자 및 교육희망자
35 
공단직원
33 
안전관리자
28 
관리감독자 및 교육희망자
26 
Other values (35)
146 

Length

Max length33
Median length26
Mean length12.619355
Min length3

Unique

Unique16 ?
Unique (%)5.2%

Sample

1st row특수형태근로종사자 및 교육희망자
2nd row특수형태근로종사자 및 교육희망자
3rd row관리감독자 및 교육희망자
4th row교육희망자
5th row공단직원

Common Values

ValueCountFrequency (%)
교육희망자 42
13.5%
특수형태근로종사자 및 교육희망자 35
11.3%
공단직원 33
10.6%
안전관리자 28
 
9.0%
관리감독자 및 교육희망자 26
 
8.4%
2023년 민간위탁사업 수행요원(전 분야) 20
 
6.5%
안전보건관리책임자 20
 
6.5%
고용노동부 현업업무종사자 12
 
3.9%
고용노동부 11
 
3.5%
근로자 및 교육희망자 10
 
3.2%
Other values (30) 73
23.5%

Length

2024-03-14T17:47:12.949530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
교육희망자 117
 
15.2%
89
 
11.5%
특수형태근로종사자 35
 
4.5%
공단직원 33
 
4.3%
관리감독자 30
 
3.9%
안전관리자 29
 
3.8%
고용노동부 23
 
3.0%
근로자 21
 
2.7%
안전보건관리책임자 20
 
2.6%
분야 20
 
2.6%
Other values (90) 355
46.0%

교육연도
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
2023
310 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023
2nd row2023
3rd row2023
4th row2023
5th row2023

Common Values

ValueCountFrequency (%)
2023 310
100.0%

Length

2024-03-14T17:47:13.312155image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T17:47:13.476842image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023 310
100.0%

정원
Real number (ℝ)

HIGH CORRELATION 

Distinct17
Distinct (%)5.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6573.7
Minimum10
Maximum100000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.8 KiB
2024-03-14T17:47:13.634837image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10
5-th percentile45
Q1500
median10000
Q310000
95-th percentile10000
Maximum100000
Range99990
Interquartile range (IQR)9500

Descriptive statistics

Standard deviation9021.1478
Coefficient of variation (CV)1.372309
Kurtosis43.895865
Mean6573.7
Median Absolute Deviation (MAD)0
Skewness5.3237714
Sum2037847
Variance81381107
MonotonicityNot monotonic
2024-03-14T17:47:13.840849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=17)
ValueCountFrequency (%)
10000 157
50.6%
500 50
 
16.1%
50 24
 
7.7%
1500 20
 
6.5%
3000 11
 
3.5%
45 11
 
3.5%
100 8
 
2.6%
60 8
 
2.6%
50000 5
 
1.6%
40 5
 
1.6%
Other values (7) 11
 
3.5%
ValueCountFrequency (%)
10 2
 
0.6%
40 5
 
1.6%
45 11
 
3.5%
48 1
 
0.3%
50 24
7.7%
52 2
 
0.6%
60 8
 
2.6%
100 8
 
2.6%
500 50
16.1%
1000 1
 
0.3%
ValueCountFrequency (%)
100000 1
 
0.3%
50000 5
 
1.6%
10500 1
 
0.3%
10000 157
50.6%
5000 3
 
1.0%
3000 11
 
3.5%
1500 20
 
6.5%
1000 1
 
0.3%
500 50
 
16.1%
100 8
 
2.6%

법정인정 교육시간_h
Categorical

HIGH CORRELATION 

Distinct19
Distinct (%)6.1%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
6
53 
8
45 
0
43 
2
38 
개인선택(2H, 1H, 0.5H)
35 
Other values (14)
96 

Length

Max length18
Median length1
Mean length3.0870968
Min length1

Unique

Unique7 ?
Unique (%)2.3%

Sample

1st row개인선택(2H, 1H, 0.5H)
2nd row개인선택(2H, 1H, 0.5H)
3rd row8
4th row0
5th row8

Common Values

ValueCountFrequency (%)
6 53
17.1%
8 45
14.5%
0 43
13.9%
2 38
12.3%
개인선택(2H, 1H, 0.5H) 35
11.3%
4 26
8.4%
34 17
 
5.5%
24 17
 
5.5%
16 11
 
3.5%
5 8
 
2.6%
Other values (9) 17
 
5.5%

Length

2024-03-14T17:47:14.073737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
6 53
13.9%
8 45
11.8%
0 43
11.3%
2 38
10.0%
개인선택(2h 35
9.2%
1h 35
9.2%
0.5h 35
9.2%
4 26
6.8%
34 17
 
4.5%
24 17
 
4.5%
Other values (11) 36
9.5%

Interactions

2024-03-14T17:47:08.952028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-14T17:47:14.213468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
교육대상정원법정인정 교육시간_h
교육대상1.0000.9020.965
정원0.9021.0000.319
법정인정 교육시간_h0.9650.3191.000
2024-03-14T17:47:14.411196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
교육대상법정인정 교육시간_h
교육대상1.0000.658
법정인정 교육시간_h0.6581.000
2024-03-14T17:47:14.578975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
정원교육대상법정인정 교육시간_h
정원1.0000.6190.174
교육대상0.6191.0000.658
법정인정 교육시간_h0.1740.6581.000

Missing values

2024-03-14T17:47:09.282117image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T17:47:09.607260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

교육과정명교육대상교육연도정원법정인정 교육시간_h
0특수형태근로종사자 최초 노무제공 시 교육: 불도저특수형태근로종사자 및 교육희망자202310000개인선택(2H, 1H, 0.5H)
1특수형태근로종사자 최초 노무제공 시 교육: 롤러특수형태근로종사자 및 교육희망자202310000개인선택(2H, 1H, 0.5H)
2건축공사 작업안전관리감독자 및 교육희망자2023100008
3전문가 특강: [추락재해 예방]교육희망자2023100000
4['23년 공단직원 기타교육] 도·소매업 관리감독자 실무능력 향상공단직원2023100008
5(play) 과로와 스트레스: 낭만 집배원교육희망자2023100000
6[병무청] 전문연구요원 및 산업기능요원 대상 산업재해예방 이러닝교육(연구실)전문연구요원 및 산업기능요원202330008
7'23년 1학기 울산대학교 「안전공감 더하기」 이러닝 과정(1주차)울산대학교 오픈캠퍼스 9기 「안전공감 더하기+ 과정」 수강생20231001
8[필수] 민간위탁 수행요원 자기규율 예방체계Ⅳ: 위험성평가 및 진단2023년 민간위탁사업 수행요원(전 분야)202315002
9특수형태근로종사자 최초 노무제공 시 교육: 지게차특수형태근로종사자 및 교육희망자202310000개인선택(2H, 1H, 0.5H)
교육과정명교육대상교육연도정원법정인정 교육시간_h
300안전관리자 보수교육(건설업)안전관리자20234524
301안전보건관리책임자 신규교육(제조 및 기타업)안전보건관리책임자2023456
30223년 안전관리자 신규교육 (제조업및기타/ZOOM 실시간 비대면 교육)안전관리자20235234
303안전보건관리책임자 보수교육(건설업)안전보건관리책임자2023506
304안전관리자 신규교육(건설업)안전관리자20234534
305안전보건관리책임자 신규교육(건설업)안전보건관리책임자2023506
306안전보건관리책임자 신규교육(건설업)안전보건관리책임자2023456
307안전보건관리담당자 보수교육안전보건관리담당자2023458
30823년 안전관리자 보수교육 (건설업/Zoom,실시간 비대면 교육)안전관리자20235024
3092023년 관리책임자 신규(위탁교육)안전보건관리책임자2023106