Overview

Dataset statistics

Number of variables7
Number of observations441
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory25.5 KiB
Average record size in memory59.3 B

Variable types

Categorical2
Text1
DateTime2
Numeric2

Dataset

Description교육연수는 직원의 업무수행에 필요한 지식과 능력을 향상함으로써 개인의 역량을 강화하고 회사의 성장에 이바지 하기 위함이다.한국장학재단은 전직원 교육연수 실시 현황을 데이터로 제공하고 있습니다.
Author한국장학재단
URLhttps://www.data.go.kr/data/15099396/fileData.do

Alerts

과정년도 has constant value ""Constant
연수이수자수 is highly overall correlated with 교육시간(이수시간)High correlation
교육시간(이수시간) is highly overall correlated with 연수이수자수High correlation
교육과정명 has unique valuesUnique

Reproduction

Analysis started2024-03-23 05:42:28.282767
Analysis finished2024-03-23 05:42:30.124453
Duration1.84 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

과정년도
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size3.6 KiB
2023
441 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023
2nd row2023
3rd row2023
4th row2023
5th row2023

Common Values

ValueCountFrequency (%)
2023 441
100.0%

Length

2024-03-23T14:42:30.243746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-23T14:42:30.481674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023 441
100.0%

교육구분
Categorical

Distinct4
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size3.6 KiB
직무교육
337 
기본교육
52 
리더십교육
35 
기타교육
 
17

Length

Max length5
Median length4
Mean length4.0793651
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row리더십교육
2nd row리더십교육
3rd row리더십교육
4th row직무교육
5th row직무교육

Common Values

ValueCountFrequency (%)
직무교육 337
76.4%
기본교육 52
 
11.8%
리더십교육 35
 
7.9%
기타교육 17
 
3.9%

Length

2024-03-23T14:42:30.718441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-23T14:42:30.957514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
직무교육 337
76.4%
기본교육 52
 
11.8%
리더십교육 35
 
7.9%
기타교육 17
 
3.9%

교육과정명
Text

UNIQUE 

Distinct441
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size3.6 KiB
2024-03-23T14:42:31.427852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length52
Median length32
Mean length18.063492
Min length4

Characters and Unicode

Total characters7966
Distinct characters458
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique441 ?
Unique (%)100.0%

Sample

1st row(2차)감사실 조직문화 혁신 워크숍
2nd row(2차)국민소통부 조직문화 혁신 워크숍
3rd row(2차)디지털전략부 조직문화 혁신 워크숍
4th row(3차)공공기관 소방안전관리자 강습교육
5th row(감사일반)확인서,질문서,문답서 작성방법
ValueCountFrequency (%)
38
 
2.4%
혁신 27
 
1.7%
조직문화 27
 
1.7%
워크숍 26
 
1.6%
실무 26
 
1.6%
교육 22
 
1.4%
과정 19
 
1.2%
2023 19
 
1.2%
공공기관 17
 
1.1%
위한 16
 
1.0%
Other values (903) 1341
85.0%
2024-03-23T14:42:32.270986image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1138
 
14.3%
186
 
2.3%
146
 
1.8%
2 137
 
1.7%
121
 
1.5%
( 119
 
1.5%
) 119
 
1.5%
118
 
1.5%
110
 
1.4%
109
 
1.4%
Other values (448) 5663
71.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5592
70.2%
Space Separator 1138
 
14.3%
Decimal Number 338
 
4.2%
Uppercase Letter 248
 
3.1%
Lowercase Letter 211
 
2.6%
Open Punctuation 168
 
2.1%
Close Punctuation 168
 
2.1%
Other Punctuation 50
 
0.6%
Dash Punctuation 26
 
0.3%
Connector Punctuation 17
 
0.2%
Other values (2) 10
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
186
 
3.3%
146
 
2.6%
121
 
2.2%
118
 
2.1%
110
 
2.0%
109
 
1.9%
105
 
1.9%
89
 
1.6%
84
 
1.5%
83
 
1.5%
Other values (369) 4441
79.4%
Uppercase Letter
ValueCountFrequency (%)
S 46
18.5%
A 21
 
8.5%
O 20
 
8.1%
C 19
 
7.7%
F 15
 
6.0%
K 13
 
5.2%
P 12
 
4.8%
I 12
 
4.8%
L 12
 
4.8%
T 10
 
4.0%
Other values (14) 68
27.4%
Lowercase Letter
ValueCountFrequency (%)
e 36
17.1%
a 22
 
10.4%
i 18
 
8.5%
n 14
 
6.6%
r 13
 
6.2%
t 13
 
6.2%
l 12
 
5.7%
o 10
 
4.7%
g 9
 
4.3%
h 7
 
3.3%
Other values (14) 57
27.0%
Decimal Number
ValueCountFrequency (%)
2 137
40.5%
0 65
19.2%
3 63
18.6%
1 34
 
10.1%
4 15
 
4.4%
9 8
 
2.4%
7 5
 
1.5%
5 4
 
1.2%
6 4
 
1.2%
8 3
 
0.9%
Other Punctuation
ValueCountFrequency (%)
, 25
50.0%
· 8
 
16.0%
! 7
 
14.0%
/ 5
 
10.0%
. 3
 
6.0%
& 1
 
2.0%
: 1
 
2.0%
Letter Number
ValueCountFrequency (%)
2
25.0%
2
25.0%
2
25.0%
2
25.0%
Open Punctuation
ValueCountFrequency (%)
( 119
70.8%
[ 48
28.6%
1
 
0.6%
Close Punctuation
ValueCountFrequency (%)
) 119
70.8%
] 48
28.6%
1
 
0.6%
Space Separator
ValueCountFrequency (%)
1138
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 26
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 17
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5592
70.2%
Common 1907
 
23.9%
Latin 467
 
5.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
186
 
3.3%
146
 
2.6%
121
 
2.2%
118
 
2.1%
110
 
2.0%
109
 
1.9%
105
 
1.9%
89
 
1.6%
84
 
1.5%
83
 
1.5%
Other values (369) 4441
79.4%
Latin
ValueCountFrequency (%)
S 46
 
9.9%
e 36
 
7.7%
a 22
 
4.7%
A 21
 
4.5%
O 20
 
4.3%
C 19
 
4.1%
i 18
 
3.9%
F 15
 
3.2%
n 14
 
3.0%
r 13
 
2.8%
Other values (42) 243
52.0%
Common
ValueCountFrequency (%)
1138
59.7%
2 137
 
7.2%
( 119
 
6.2%
) 119
 
6.2%
0 65
 
3.4%
3 63
 
3.3%
] 48
 
2.5%
[ 48
 
2.5%
1 34
 
1.8%
- 26
 
1.4%
Other values (17) 110
 
5.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5592
70.2%
ASCII 2356
29.6%
None 10
 
0.1%
Number Forms 8
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1138
48.3%
2 137
 
5.8%
( 119
 
5.1%
) 119
 
5.1%
0 65
 
2.8%
3 63
 
2.7%
] 48
 
2.0%
[ 48
 
2.0%
S 46
 
2.0%
e 36
 
1.5%
Other values (62) 537
22.8%
Hangul
ValueCountFrequency (%)
186
 
3.3%
146
 
2.6%
121
 
2.2%
118
 
2.1%
110
 
2.0%
109
 
1.9%
105
 
1.9%
89
 
1.6%
84
 
1.5%
83
 
1.5%
Other values (369) 4441
79.4%
None
ValueCountFrequency (%)
· 8
80.0%
1
 
10.0%
1
 
10.0%
Number Forms
ValueCountFrequency (%)
2
25.0%
2
25.0%
2
25.0%
2
25.0%
Distinct164
Distinct (%)37.2%
Missing0
Missing (%)0.0%
Memory size3.6 KiB
Minimum2023-01-01 00:00:00
Maximum2023-12-19 00:00:00
2024-03-23T14:42:32.597975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T14:42:32.939169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct156
Distinct (%)35.4%
Missing0
Missing (%)0.0%
Memory size3.6 KiB
Minimum2023-01-12 00:00:00
Maximum2023-12-31 00:00:00
2024-03-23T14:42:33.253469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T14:42:33.606875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

연수이수자수
Real number (ℝ)

HIGH CORRELATION 

Distinct57
Distinct (%)12.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean22.129252
Minimum1
Maximum495
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.0 KiB
2024-03-23T14:42:33.893906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q36
95-th percentile61
Maximum495
Range494
Interquartile range (IQR)5

Descriptive statistics

Standard deviation80.714728
Coefficient of variation (CV)3.6474224
Kurtosis23.705787
Mean22.129252
Median Absolute Deviation (MAD)0
Skewness4.9240799
Sum9759
Variance6514.8673
MonotonicityNot monotonic
2024-03-23T14:42:34.300710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 232
52.6%
2 49
 
11.1%
3 24
 
5.4%
4 16
 
3.6%
6 10
 
2.3%
20 8
 
1.8%
7 6
 
1.4%
17 6
 
1.4%
10 5
 
1.1%
15 5
 
1.1%
Other values (47) 80
 
18.1%
ValueCountFrequency (%)
1 232
52.6%
2 49
 
11.1%
3 24
 
5.4%
4 16
 
3.6%
5 4
 
0.9%
6 10
 
2.3%
7 6
 
1.4%
8 3
 
0.7%
9 2
 
0.5%
10 5
 
1.1%
ValueCountFrequency (%)
495 3
0.7%
490 2
0.5%
485 1
 
0.2%
472 1
 
0.2%
436 1
 
0.2%
432 1
 
0.2%
420 1
 
0.2%
414 1
 
0.2%
407 1
 
0.2%
397 1
 
0.2%

교육시간(이수시간)
Real number (ℝ)

HIGH CORRELATION 

Distinct121
Distinct (%)27.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean69.020862
Minimum1
Maximum1802
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.0 KiB
2024-03-23T14:42:34.598096image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3
Q18
median16
Q345
95-th percentile293
Maximum1802
Range1801
Interquartile range (IQR)37

Descriptive statistics

Standard deviation188.3317
Coefficient of variation (CV)2.72862
Kurtosis41.57204
Mean69.020862
Median Absolute Deviation (MAD)11
Skewness5.999795
Sum30438.2
Variance35468.831
MonotonicityNot monotonic
2024-03-23T14:42:34.858576image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
16.0 36
 
8.2%
14.0 26
 
5.9%
7.0 24
 
5.4%
4.0 20
 
4.5%
3.0 19
 
4.3%
8.0 18
 
4.1%
12.0 16
 
3.6%
6.0 16
 
3.6%
35.0 14
 
3.2%
2.0 11
 
2.5%
Other values (111) 241
54.6%
ValueCountFrequency (%)
1.0 3
 
0.7%
2.0 11
2.5%
3.0 19
4.3%
4.0 20
4.5%
4.5 1
 
0.2%
5.0 9
 
2.0%
6.0 16
3.6%
7.0 24
5.4%
7.5 1
 
0.2%
8.0 18
4.1%
ValueCountFrequency (%)
1802.0 1
 
0.2%
1628.0 1
 
0.2%
1308.0 1
 
0.2%
1296.0 1
 
0.2%
1260.0 1
 
0.2%
1242.0 1
 
0.2%
882.0 1
 
0.2%
708.0 1
 
0.2%
495.0 3
0.7%
490.0 2
0.5%

Interactions

2024-03-23T14:42:29.297355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T14:42:28.851496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T14:42:29.489126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T14:42:29.049999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-23T14:42:35.065438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
교육구분연수이수자수교육시간(이수시간)
교육구분1.0000.4670.419
연수이수자수0.4671.0000.948
교육시간(이수시간)0.4190.9481.000
2024-03-23T14:42:35.229015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연수이수자수교육시간(이수시간)교육구분
연수이수자수1.0000.7190.222
교육시간(이수시간)0.7191.0000.197
교육구분0.2220.1971.000

Missing values

2024-03-23T14:42:29.762185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-23T14:42:30.021124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

과정년도교육구분교육과정명교육시작일교육종료일연수이수자수교육시간(이수시간)
02023리더십교육(2차)감사실 조직문화 혁신 워크숍2023-11-282023-11-281030.0
12023리더십교육(2차)국민소통부 조직문화 혁신 워크숍2023-12-062023-12-061560.0
22023리더십교육(2차)디지털전략부 조직문화 혁신 워크숍2023-12-072023-12-073552.5
32023직무교육(3차)공공기관 소방안전관리자 강습교육2023-08-282023-09-01280.0
42023직무교육(감사일반)확인서,질문서,문답서 작성방법2023-03-012023-11-3039.0
52023직무교육(신용분석/여신심사) 기술금융2023-06-192023-06-21124.0
62023직무교육(응용 및 활용) 보조사업자 집행정산과정2023-03-012023-06-3016.0
72023직무교육(체납처분) 국가채권관리 실무2023-03-012023-11-30324.0
82023직무교육[2023 재정교육] 국가재정법의 이해(2기)2023-04-012023-04-3013.0
92023직무교육[2023 재정교육] 예산의 이해(2기)2023-04-012023-04-3017.0
과정년도교육구분교육과정명교육시작일교육종료일연수이수자수교육시간(이수시간)
4312023직무교육행복이음 개인정보보호 교육2023-10-012023-10-10735.0
4322023직무교육행정기본법 해설2023-03-012023-11-3039.0
4332023직무교육행정안전부 2023년 데이터 역량강화교육2023-06-142023-06-16372.0
4342023직무교육행정안전부 데이터 역량강화 10월 교육2023-10-042023-10-05116.0
4352023직무교육행정안전부 데이터 역량강화(중급분석 3회차)2023-08-012023-08-03124.0
4362023직무교육행정쟁송 실무2023-02-012023-11-3015.0
4372023직무교육행정집행실무특강과정2023-03-012023-11-30113.0
4382023직무교육확인서, 질문서, 문답서 작성 방법2023-10-122023-10-2026.0
4392023직무교육회복탄력성을 높이는 긍정테라피2023-10-202023-10-2017.0
4402023직무교육회생과 파산에 관한 실무법률2023-08-282023-09-01134.0