Overview

Dataset statistics

Number of variables: 6
Number of observations: 48
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.4 KiB
Average record size in memory: 51.7 B

Variable types

DateTime: 1
Text: 1
Categorical: 4

Dataset

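Description: Data on the safety and health training that the Corporation conducts every year. It provides the training month (교육년월), training content (교육내용), training hours (교육시간), training audience (교육대상), training method (교육방법), and training instructor (교육강사).
Author: 예금보험공사 (Korea Deposit Insurance Corporation)
URL: https://www.data.go.kr/data/15112618/fileData.do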

Alerts

교육방법 is highly overall correlated with 교육강사 (High correlation)
교육대상 is highly overall correlated with 교육시간 (High correlation)
교육시간 is highly overall correlated with 교육대상 (High correlation)
교육강사 is highly overall correlated with 교육방법 (High correlation)
교육방법 is highly imbalanced (75.1%) (Imbalance)
교육강사 is highly imbalanced (85.4%) (Imbalance)
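
The report does not say how the imbalance percentages are computed. As a minimal sketch, one formula that reproduces both figures is one minus the normalized Shannon entropy of the class counts. The function name `imbalance_score` is hypothetical, and the counts are the ones listed under Common Values for each variable later in this report.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k) for class counts: 0 means balanced, 1 means a single class."""
    k = len(counts)
    if k < 2:
        return 0.0
    total = sum(counts)
    # Shannon entropy in bits, skipping empty classes
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)

method_counts = [45, 2, 1]   # 교육방법: 회람 / 회람, 영상시청 / 회람, 외부강의
instructor_counts = [47, 1]  # 교육강사: 관리감독자 / 관리감독자, 외부 강사

print(f"{imbalance_score(method_counts):.1%}")      # 75.1%
print(f"{imbalance_score(instructor_counts):.1%}")  # 85.4%
```

Under this assumed formula, a column where nearly every row holds the same value scores close to 100%, which is why both columns are flagged.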

Reproduction

Analysis started: 2024-03-14 13:07:11.557650
Analysis finished: 2024-03-14 13:07:12.509193
Duration: 0.95 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

교육년월
DateTime

Distinct: 24
Distinct (%): 50.0%
Missing: 0
Missing (%): 0.0%
Memory size: 512.0 B
Minimum: 2022-01-01 00:00:00
Maximum: 2023-12-01 00:00:00
Histogram with fixed size bins (bins=24)

교육내용
Text

Distinct: 47
Distinct (%): 97.9%
Missing: 0
Missing (%): 0.0%
Memory size: 512.0 B

Length

Max length: 40
Median length: 28.5
Mean length: 21.666667
Min length: 2
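
The length figures can be cross-checked against the character total reported for this variable: 1040 characters over 48 rows gives a mean of 21.666667. A minimal sketch of the same summary follows, assuming length is counted in Unicode code points (which is what Python's `len` does). The helper name `length_summary` and the four-value toy column are illustrative, not the full dataset.

```python
import statistics

def length_summary(values):
    """Summarize string lengths, counted in Unicode code points."""
    lengths = [len(v) for v in values]
    return {
        "max": max(lengths),
        "median": statistics.median(lengths),
        "mean": statistics.mean(lengths),
        "min": min(lengths),
        "total": sum(lengths),
    }

# Toy column built from four short 교육내용 values (not the full 48 rows)
toy = ["금연", "미세먼지", "비만", "수근관 증후군 예방"]
print(length_summary(toy))

# Consistency check on the reported figures: total characters / observations
print(round(1040 / 48, 6))  # 21.666667
```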

Characters and Unicode

Total characters: 1040
Distinct characters: 173
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
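
As a rough illustration of the category breakdown, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories for a short fragment taken from one of the 교육내용 values. The helper name `category_counts` is hypothetical, and the long names in `CATEGORY_NAMES` cover only the seven categories that appear in this report.

```python
import unicodedata
from collections import Counter

# Two-letter general-category codes mapped to the long names used in the report
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
    "Lu": "Uppercase Letter",
    "Nd": "Decimal Number",
    "Pe": "Close Punctuation",
    "Ps": "Open Punctuation",
}

def category_counts(text):
    """Count characters in text by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)

counts = category_counts("AED 사용방법")
for code, n in counts.most_common():
    print(CATEGORY_NAMES.get(code, code), n)
```

Precomposed Hangul syllables fall under Other Letter (`Lo`), which is why that category accounts for the 759 Hangul characters in this column.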

Unique

Unique: 46
Unique (%): 95.8%
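
Distinct and Unique measure different things: Distinct counts the different values present, while Unique counts the values that occur exactly once. With 48 rows, 47 distinct values, and 46 unique values, exactly one value must appear twice (46 × 1 + 1 × 2 = 48). A minimal sketch on a hypothetical toy column; the helper name `distinct_and_unique` is illustrative:

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of distinct values, number of values occurring exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique

toy = ["금연", "미세먼지", "비만", "미세먼지"]  # hypothetical column with one repeated value
print(distinct_and_unique(toy))  # (3, 2)
```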

Sample

1st row: 금연
2nd row: 산업안전보건법 및 일반관리에 관한 사항, 중대재해처벌법 이해 등
3rd row: 근골격계질환 예방을 위한 스트레칭
4th row: 근골격계 질환과 예방, 정리정돈과 재해예방 등
5th row: 미세먼지

Value | Count | Frequency (%)
예방 | 16 | 6.3%
[not recovered] | 15 | 5.9%
[not recovered] | 11 | 4.3%
관리 | 7 | 2.7%
직무스트레스 | 4 | 1.6%
이해 | 4 | 1.6%
사고 | 4 | 1.6%
안전 | 4 | 1.6%
재해예방 | 4 | 1.6%
근골격계질환 | 3 | 1.2%
Other values (142) | 183 | 71.8%

Most occurring characters

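Value | Count | Frequency (%)
(space) | 209 | 20.1%
, | 49 | 4.7%
[not recovered] | 34 | 3.3%
[not recovered] | 29 | 2.8%
[not recovered] | 24 | 2.3%
[not recovered] | 19 | 1.8%
[not recovered] | 19 | 1.8%
[not recovered] | 19 | 1.8%
[not recovered] | 18 | 1.7%
[not recovered] | 17 | 1.6%
Other values (163) | 603 | 58.0%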

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 759 | 73.0%
Space Separator | 209 | 20.1%
Other Punctuation | 50 | 4.8%
Uppercase Letter | 18 | 1.7%
Decimal Number | 2 | 0.2%
Close Punctuation | 1 | 0.1%
Open Punctuation | 1 | 0.1%

Most frequent character per category

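Other Letter
Value | Count | Frequency (%)
[not recovered] | 34 | 4.5%
[not recovered] | 29 | 3.8%
[not recovered] | 24 | 3.2%
[not recovered] | 19 | 2.5%
[not recovered] | 19 | 2.5%
[not recovered] | 19 | 2.5%
[not recovered] | 18 | 2.4%
[not recovered] | 17 | 2.2%
[not recovered] | 16 | 2.1%
[not recovered] | 16 | 2.1%
Other values (147) | 548 | 72.2%

Uppercase Letter
Value | Count | Frequency (%)
S | 4 | 22.2%
D | 4 | 22.2%
M | 2 | 11.1%
O | 2 | 11.1%
T | 2 | 11.1%
E | 1 | 5.6%
A | 1 | 5.6%
V | 1 | 5.6%
L | 1 | 5.6%

Other Punctuation
Value | Count | Frequency (%)
, | 49 | 98.0%
/ | 1 | 2.0%

Decimal Number
Value | Count | Frequency (%)
5 | 1 | 50.0%
3 | 1 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 209 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 1 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 1 | 100.0%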

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 759 | 73.0%
Common | 263 | 25.3%
Latin | 18 | 1.7%

Most frequent character per script

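Hangul
Value | Count | Frequency (%)
[not recovered] | 34 | 4.5%
[not recovered] | 29 | 3.8%
[not recovered] | 24 | 3.2%
[not recovered] | 19 | 2.5%
[not recovered] | 19 | 2.5%
[not recovered] | 19 | 2.5%
[not recovered] | 18 | 2.4%
[not recovered] | 17 | 2.2%
[not recovered] | 16 | 2.1%
[not recovered] | 16 | 2.1%
Other values (147) | 548 | 72.2%

Latin
Value | Count | Frequency (%)
S | 4 | 22.2%
D | 4 | 22.2%
M | 2 | 11.1%
O | 2 | 11.1%
T | 2 | 11.1%
E | 1 | 5.6%
A | 1 | 5.6%
V | 1 | 5.6%
L | 1 | 5.6%

Common
Value | Count | Frequency (%)
(space) | 209 | 79.5%
, | 49 | 18.6%
5 | 1 | 0.4%
/ | 1 | 0.4%
) | 1 | 0.4%
( | 1 | 0.4%
3 | 1 | 0.4%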

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 759 | 73.0%
ASCII | 281 | 27.0%

Most frequent character per block

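ASCII
Value | Count | Frequency (%)
(space) | 209 | 74.4%
, | 49 | 17.4%
S | 4 | 1.4%
D | 4 | 1.4%
M | 2 | 0.7%
O | 2 | 0.7%
T | 2 | 0.7%
5 | 1 | 0.4%
E | 1 | 0.4%
A | 1 | 0.4%
Other values (6) | 6 | 2.1%

Hangul
Value | Count | Frequency (%)
[not recovered] | 34 | 4.5%
[not recovered] | 29 | 3.8%
[not recovered] | 24 | 3.2%
[not recovered] | 19 | 2.5%
[not recovered] | 19 | 2.5%
[not recovered] | 19 | 2.5%
[not recovered] | 18 | 2.4%
[not recovered] | 17 | 2.2%
[not recovered] | 16 | 2.1%
[not recovered] | 16 | 2.1%
Other values (147) | 548 | 72.2%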

교육시간
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 4.2%
Missing: 0
Missing (%): 0.0%
Memory size: 512.0 B
1: 24
2: 24

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 2
3rd row: 1
4th row: 2
5th row: 1

Common Values

Value | Count | Frequency (%)
1 | 24 | 50.0%
2 | 24 | 50.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
1 | 24 | 50.0%
2 | 24 | 50.0%

교육대상
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 4.2%
Missing: 0
Missing (%): 0.0%
Memory size: 512.0 B
공사 전직원: 24
수급업체 전직원: 24

Length

Max length: 8
Median length: 7
Mean length: 7
Min length: 6

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 공사 전직원
2nd row: 수급업체 전직원
3rd row: 공사 전직원
4th row: 수급업체 전직원
5th row: 공사 전직원

Common Values

Value | Count | Frequency (%)
공사 전직원 | 24 | 50.0%
수급업체 전직원 | 24 | 50.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
전직원 | 48 | 50.0%
공사 | 24 | 25.0%
수급업체 | 24 | 25.0%

교육방법
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 3
Distinct (%): 6.2%
Missing: 0
Missing (%): 0.0%
Memory size: 512.0 B
회람: 45
회람, 영상시청: 2
회람, 외부강의: 1

Length

Max length: 8
Median length: 2
Mean length: 2.375
Min length: 2

Unique

Unique: 1
Unique (%): 2.1%

Sample

1st row: 회람
2nd row: 회람
3rd row: 회람
4th row: 회람
5th row: 회람

Common Values

Value | Count | Frequency (%)
회람 | 45 | 93.8%
회람, 영상시청 | 2 | 4.2%
회람, 외부강의 | 1 | 2.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
회람 | 48 | 94.1%
영상시청 | 2 | 3.9%
외부강의 | 1 | 2.0%

교육강사
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): 4.2%
Missing: 0
Missing (%): 0.0%
Memory size: 512.0 B
관리감독자: 47
관리감독자, 외부 강사: 1

Length

Max length: 12
Median length: 5
Mean length: 5.1458333
Min length: 5

Unique

Unique: 1
Unique (%): 2.1%

Sample

1st row: 관리감독자
2nd row: 관리감독자
3rd row: 관리감독자
4th row: 관리감독자
5th row: 관리감독자

Common Values

Value | Count | Frequency (%)
관리감독자 | 47 | 97.9%
관리감독자, 외부 강사 | 1 | 2.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
관리감독자 | 48 | 96.0%
외부 | 1 | 2.0%
강사 | 1 | 2.0%

Correlations

Variable | 교육년월 | 교육내용 | 교육시간 | 교육대상 | 교육방법 | 교육강사
교육년월 | 1.000 | 0.977 | 0.000 | 0.000 | 0.193 | 0.177
교육내용 | 0.977 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
교육시간 | 0.000 | 1.000 | 1.000 | 0.998 | 0.090 | 0.000
교육대상 | 0.000 | 1.000 | 0.998 | 1.000 | 0.090 | 0.000
교육방법 | 0.193 | 1.000 | 0.090 | 0.090 | 1.000 | 1.000
교육강사 | 0.177 | 1.000 | 0.000 | 0.000 | 1.000 | 1.000

Variable | 교육방법 | 교육대상 | 교육시간 | 교육강사
교육방법 | 1.000 | 0.144 | 0.144 | 0.989
교육대상 | 0.144 | 1.000 | 0.957 | 0.000
교육시간 | 0.144 | 0.957 | 1.000 | 0.000
교육강사 | 0.989 | 0.000 | 0.000 | 1.000

Variable | 교육시간 | 교육대상 | 교육방법 | 교육강사
교육시간 | 1.000 | 0.957 | 0.144 | 0.000
교육대상 | 0.957 | 1.000 | 0.144 | 0.000
교육방법 | 0.144 | 0.144 | 1.000 | 0.989
교육강사 | 0.000 | 0.000 | 0.989 | 1.000
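
The extracted text does not name the coefficient behind each matrix. As a sketch under stated assumptions, the 0.957 and 0.989 entries are consistent with a bias-corrected Cramér's V in which the chi-square statistic uses Yates' continuity correction for 2×2 tables. The function name `cramers_v_corrected` is hypothetical. The contingency tables are also assumptions: the first assumes that 교육시간 1 always pairs with 공사 전직원 and 2 with 수급업체 전직원 (as in the sample rows), and the second assumes that the single 회람, 외부강의 row is also the single 관리감독자, 외부 강사 row.

```python
import math

def cramers_v_corrected(table):
    """Bias-corrected Cramér's V for an r x k contingency table (list of rows)."""
    r, k = len(table), len(table[0])
    n = sum(sum(row) for row in table)
    row_tot = [sum(row) for row in table]
    col_tot = [sum(table[i][j] for i in range(r)) for j in range(k)]
    yates = (r - 1) * (k - 1) == 1  # continuity correction only when dof == 1
    chi2 = 0.0
    for i in range(r):
        for j in range(k):
            expected = row_tot[i] * col_tot[j] / n
            diff = abs(table[i][j] - expected)
            if yates:
                diff -= min(0.5, diff)
            chi2 += diff * diff / expected
    phi2 = chi2 / n
    # Bias correction of phi^2 and of the table dimensions
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    denom = min(k_corr - 1, r_corr - 1)
    return math.sqrt(phi2_corr / denom) if denom > 0 else 0.0

# Assumed cross-tabulations (rows x columns), see the assumptions above
hours_by_audience = [[24, 0], [0, 24]]            # 교육시간 x 교육대상
method_by_instructor = [[45, 0], [2, 0], [0, 1]]  # 교육방법 x 교육강사

print(round(cramers_v_corrected(hours_by_audience), 3))     # 0.957
print(round(cramers_v_corrected(method_by_instructor), 3))  # 0.989
```

Both pairs are perfectly associated under these assumed tables; the coefficient falls short of 1.000 only because of the continuity and bias corrections, which matter at n = 48.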

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Index | 교육년월 | 교육내용 | 교육시간 | 교육대상 | 교육방법 | 교육강사
0 | 2022-01 | 금연 | 1 | 공사 전직원 | 회람 | 관리감독자
1 | 2022-01 | 산업안전보건법 및 일반관리에 관한 사항, 중대재해처벌법 이해 등 | 2 | 수급업체 전직원 | 회람 | 관리감독자
2 | 2022-02 | 근골격계질환 예방을 위한 스트레칭 | 1 | 공사 전직원 | 회람 | 관리감독자
3 | 2022-02 | 근골격계 질환과 예방, 정리정돈과 재해예방 등 | 2 | 수급업체 전직원 | 회람 | 관리감독자
4 | 2022-03 | 미세먼지 | 1 | 공사 전직원 | 회람 | 관리감독자
5 | 2022-03 | 위험성 평가, 작업장소 이동 중 재해 예방, 미세먼지 예방 등 | 2 | 수급업체 전직원 | 회람 | 관리감독자
6 | 2022-04 | 비만 | 1 | 공사 전직원 | 회람 | 관리감독자
7 | 2022-04 | 산재보상보험법, 비상대응매뉴얼, VDT 작업 등 | 2 | 수급업체 전직원 | 회람 | 관리감독자
8 | 2022-05 | 수근관 증후군 예방 | 1 | 공사 전직원 | 회람 | 관리감독자
9 | 2022-05 | 사업장의 안전보건, 직무스트레스 관리, 3대 다발재해 예방 등 | 2 | 수급업체 전직원 | 회람 | 관리감독자

Index | 교육년월 | 교육내용 | 교육시간 | 교육대상 | 교육방법 | 교육강사
38 | 2023-08 | 응급처치요령, 집중호우 복구 및 작업재개 안전수칙 | 1 | 공사 전직원 | 회람 | 관리감독자
39 | 2023-08 | 전기재해예방, 전기 감전사고예방 | 2 | 수급업체 전직원 | 회람 | 관리감독자
40 | 2023-09 | 가을철 발열성질환 예방 수칙 | 1 | 공사 전직원 | 회람 | 관리감독자
41 | 2023-09 | 직무 스트레스 예방 관리, 소음작업 안전대책 | 2 | 수급업체 전직원 | 회람 | 관리감독자
42 | 2023-10 | 독감, 하임리히법, 소화기/소화전 사용방법 | 1 | 공사 전직원 | 회람, 영상시청 | 관리감독자
43 | 2023-10 | 보호구 사용 및 관리, 응급처치 | 2 | 수급업체 전직원 | 회람 | 관리감독자
44 | 2023-11 | 눈 건강을 위한 교육, AED 사용방법, 심폐소생술 교육 | 1 | 공사 전직원 | 회람 | 관리감독자
45 | 2023-11 | 소방안전, 양중기 위험요인 및 안전작업 방법 | 2 | 수급업체 전직원 | 회람 | 관리감독자
46 | 2023-12 | 절주, 겨울철 안전사고 교육(동상 등), 전기 관련 사고 대응요렁 | 1 | 공사 전직원 | 회람 | 관리감독자
47 | 2023-12 | 동절기 재해예방, 뇌심혈관질환 예방 | 2 | 수급업체 전직원 | 회람 | 관리감독자