Overview

Dataset statistics

Number of variables10
Number of observations344
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory28.7 KiB
Average record size in memory85.4 B

Variable types

Categorical4
Text1
DateTime2
Numeric3

Dataset

Description교육부 소속 공무원 및 17시 시․도 교육청 소속 지방공무원 및 교원의 교육연수 정보 제공공동활용기관,과정년도,과정운영구분,과정명,기수,교육시작일,교육종료일,승인인원,이수인원,이수율 항목을 제공
Author교육부 중앙교육연수원
URLhttps://www.data.go.kr/data/3039487/fileData.do

Alerts

공동활용기관 has constant value ""Constant
과정년도 has constant value ""Constant
승인인원 is highly overall correlated with 이수인원High correlation
이수인원 is highly overall correlated with 승인인원High correlation
과정운영구분 is highly imbalanced (75.5%)Imbalance
기수 is highly imbalanced (93.6%)Imbalance
과정명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 12:09:59.243570
Analysis finished2023-12-12 12:10:01.593014
Duration2.35 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

공동활용기관
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
중앙교육연수원
344 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row중앙교육연수원
2nd row중앙교육연수원
3rd row중앙교육연수원
4th row중앙교육연수원
5th row중앙교육연수원

Common Values

ValueCountFrequency (%)
중앙교육연수원 344
100.0%

Length

2023-12-12T21:10:01.686492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:10:01.814619image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
중앙교육연수원 344
100.0%

과정년도
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
2020
344 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020
2nd row2020
3rd row2020
4th row2020
5th row2020

Common Values

ValueCountFrequency (%)
2020 344
100.0%

Length

2023-12-12T21:10:01.937747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:10:02.070139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020 344
100.0%

과정운영구분
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
상시
330 
원격
 
14

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row상시
2nd row상시
3rd row상시
4th row상시
5th row상시

Common Values

ValueCountFrequency (%)
상시 330
95.9%
원격 14
 
4.1%

Length

2023-12-12T21:10:02.195874image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:10:02.312603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
상시 330
95.9%
원격 14
 
4.1%

과정명
Text

UNIQUE 

Distinct344
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
2023-12-12T21:10:02.653313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length67
Median length38
Mean length22.636628
Min length7

Characters and Unicode

Total characters7787
Distinct characters445
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique344 ?
Unique (%)100.0%

Sample

1st row(2015 개정 국어과 교육과정)'한 학기 한 권 읽기'
2nd row(고등학교)어울림 학교폭력예방프로그램(갈등해결, 감정조절, 자기존중감)
3rd row(고등학교)어울림 학교폭력예방프로그램(공감, 의사소통, 학교폭력 인식 및 대처)
4th row(공통)교사와 학생이 함께 성장하는 민주시민교육
5th row(긴급지원 신고의무자 교육)긴급복지지원 신고의무자편
ValueCountFrequency (%)
34
 
2.0%
위한 26
 
1.5%
과정 22
 
1.3%
평가 22
 
1.3%
이해 21
 
1.2%
수업 19
 
1.1%
중심 19
 
1.1%
돕는 18
 
1.1%
발달을 17
 
1.0%
성장과 17
 
1.0%
Other values (901) 1492
87.4%
2023-12-12T21:10:03.205482image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1363
 
17.5%
294
 
3.8%
199
 
2.6%
) 172
 
2.2%
( 172
 
2.2%
129
 
1.7%
128
 
1.6%
111
 
1.4%
102
 
1.3%
95
 
1.2%
Other values (435) 5022
64.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5644
72.5%
Space Separator 1363
 
17.5%
Close Punctuation 174
 
2.2%
Open Punctuation 174
 
2.2%
Decimal Number 174
 
2.2%
Other Punctuation 143
 
1.8%
Uppercase Letter 53
 
0.7%
Dash Punctuation 33
 
0.4%
Lowercase Letter 21
 
0.3%
Math Symbol 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
294
 
5.2%
199
 
3.5%
129
 
2.3%
128
 
2.3%
111
 
2.0%
102
 
1.8%
95
 
1.7%
94
 
1.7%
92
 
1.6%
91
 
1.6%
Other values (387) 4309
76.3%
Uppercase Letter
ValueCountFrequency (%)
S 11
20.8%
N 7
13.2%
U 5
9.4%
W 4
 
7.5%
O 4
 
7.5%
P 4
 
7.5%
E 3
 
5.7%
C 3
 
5.7%
T 3
 
5.7%
R 2
 
3.8%
Other values (4) 7
13.2%
Decimal Number
ValueCountFrequency (%)
2 58
33.3%
0 52
29.9%
1 24
13.8%
5 16
 
9.2%
4 8
 
4.6%
3 6
 
3.4%
9 6
 
3.4%
6 3
 
1.7%
7 1
 
0.6%
Other Punctuation
ValueCountFrequency (%)
, 60
42.0%
· 36
25.2%
! 18
 
12.6%
? 16
 
11.2%
' 10
 
7.0%
. 1
 
0.7%
% 1
 
0.7%
/ 1
 
0.7%
Lowercase Letter
ValueCountFrequency (%)
l 6
28.6%
a 4
19.0%
p 3
14.3%
k 3
14.3%
y 2
 
9.5%
e 1
 
4.8%
w 1
 
4.8%
i 1
 
4.8%
Close Punctuation
ValueCountFrequency (%)
) 172
98.9%
2
 
1.1%
Open Punctuation
ValueCountFrequency (%)
( 172
98.9%
2
 
1.1%
Letter Number
ValueCountFrequency (%)
2
50.0%
2
50.0%
Space Separator
ValueCountFrequency (%)
1363
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 33
100.0%
Math Symbol
ValueCountFrequency (%)
~ 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5641
72.4%
Common 2065
 
26.5%
Latin 78
 
1.0%
Han 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
294
 
5.2%
199
 
3.5%
129
 
2.3%
128
 
2.3%
111
 
2.0%
102
 
1.8%
95
 
1.7%
94
 
1.7%
92
 
1.6%
91
 
1.6%
Other values (385) 4306
76.3%
Common
ValueCountFrequency (%)
1363
66.0%
) 172
 
8.3%
( 172
 
8.3%
, 60
 
2.9%
2 58
 
2.8%
0 52
 
2.5%
· 36
 
1.7%
- 33
 
1.6%
1 24
 
1.2%
! 18
 
0.9%
Other values (14) 77
 
3.7%
Latin
ValueCountFrequency (%)
S 11
14.1%
N 7
 
9.0%
l 6
 
7.7%
U 5
 
6.4%
a 4
 
5.1%
W 4
 
5.1%
O 4
 
5.1%
P 4
 
5.1%
E 3
 
3.8%
C 3
 
3.8%
Other values (14) 27
34.6%
Han
ValueCountFrequency (%)
2
66.7%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5641
72.4%
ASCII 2099
 
27.0%
None 40
 
0.5%
Number Forms 4
 
0.1%
CJK 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1363
64.9%
) 172
 
8.2%
( 172
 
8.2%
, 60
 
2.9%
2 58
 
2.8%
0 52
 
2.5%
- 33
 
1.6%
1 24
 
1.1%
! 18
 
0.9%
? 16
 
0.8%
Other values (33) 131
 
6.2%
Hangul
ValueCountFrequency (%)
294
 
5.2%
199
 
3.5%
129
 
2.3%
128
 
2.3%
111
 
2.0%
102
 
1.8%
95
 
1.7%
94
 
1.7%
92
 
1.6%
91
 
1.6%
Other values (385) 4306
76.3%
None
ValueCountFrequency (%)
· 36
90.0%
2
 
5.0%
2
 
5.0%
CJK
ValueCountFrequency (%)
2
66.7%
1
33.3%
Number Forms
ValueCountFrequency (%)
2
50.0%
2
50.0%

기수
Categorical

IMBALANCE 

Distinct3
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
1
340 
2
 
3
3
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique1 ?
Unique (%)0.3%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 340
98.8%
2 3
 
0.9%
3 1
 
0.3%

Length

2023-12-12T21:10:03.367861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:10:03.479845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 340
98.8%
2 3
 
0.9%
3 1
 
0.3%
Distinct44
Distinct (%)12.8%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
Minimum2020-01-05 00:00:00
Maximum2020-11-16 00:00:00
2023-12-12T21:10:03.621504image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:10:03.819280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=44)
Distinct19
Distinct (%)5.5%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
Minimum2020-03-13 00:00:00
Maximum2020-12-13 00:00:00
2023-12-12T21:10:03.976157image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:10:04.106142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=19)

승인인원
Real number (ℝ)

HIGH CORRELATION 

Distinct311
Distinct (%)90.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4374.3837
Minimum1
Maximum194202
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.2 KiB
2023-12-12T21:10:04.272580image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile40
Q1165.75
median481.5
Q31464.25
95-th percentile13094.9
Maximum194202
Range194201
Interquartile range (IQR)1298.5

Descriptive statistics

Standard deviation17975.475
Coefficient of variation (CV)4.1092588
Kurtosis54.987242
Mean4374.3837
Median Absolute Deviation (MAD)403.5
Skewness6.937319
Sum1504788
Variance3.231177 × 108
MonotonicityNot monotonic
2023-12-12T21:10:04.433673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
109 3
 
0.9%
123 3
 
0.9%
40 3
 
0.9%
25 3
 
0.9%
57 3
 
0.9%
165 2
 
0.6%
62 2
 
0.6%
374 2
 
0.6%
613 2
 
0.6%
229 2
 
0.6%
Other values (301) 319
92.7%
ValueCountFrequency (%)
1 1
 
0.3%
5 2
0.6%
12 1
 
0.3%
13 1
 
0.3%
17 1
 
0.3%
19 1
 
0.3%
21 1
 
0.3%
23 1
 
0.3%
25 3
0.9%
28 1
 
0.3%
ValueCountFrequency (%)
194202 1
0.3%
130411 1
0.3%
130287 1
0.3%
99534 1
0.3%
98644 1
0.3%
77082 1
0.3%
73296 1
0.3%
69184 1
0.3%
57775 1
0.3%
33733 1
0.3%

이수인원
Real number (ℝ)

HIGH CORRELATION 

Distinct304
Distinct (%)88.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4155.9826
Minimum1
Maximum185327
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.2 KiB
2023-12-12T21:10:04.584129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile33.45
Q1143.75
median419
Q31355.75
95-th percentile12104.3
Maximum185327
Range185326
Interquartile range (IQR)1212

Descriptive statistics

Standard deviation17418.185
Coefficient of variation (CV)4.1911111
Kurtosis54.078318
Mean4155.9826
Median Absolute Deviation (MAD)347
Skewness6.9120531
Sum1429658
Variance3.0339316 × 108
MonotonicityNot monotonic
2023-12-12T21:10:04.763484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
80 3
 
0.9%
72 3
 
0.9%
93 3
 
0.9%
181 3
 
0.9%
306 2
 
0.6%
22 2
 
0.6%
28 2
 
0.6%
110 2
 
0.6%
369 2
 
0.6%
145 2
 
0.6%
Other values (294) 320
93.0%
ValueCountFrequency (%)
1 1
0.3%
2 1
0.3%
5 1
0.3%
11 2
0.6%
12 1
0.3%
16 1
0.3%
17 1
0.3%
18 1
0.3%
21 1
0.3%
22 2
0.6%
ValueCountFrequency (%)
185327 1
0.3%
129640 1
0.3%
128288 1
0.3%
97882 1
0.3%
92590 1
0.3%
76125 1
0.3%
70979 1
0.3%
67317 1
0.3%
53826 1
0.3%
33389 1
0.3%

이수율
Real number (ℝ)

Distinct36
Distinct (%)10.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean89.148256
Minimum40
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.2 KiB
2023-12-12T21:10:04.939469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum40
5-th percentile77.15
Q186
median90
Q394
95-th percentile98
Maximum100
Range60
Interquartile range (IQR)8

Descriptive statistics

Standard deviation7.1838495
Coefficient of variation (CV)0.080583175
Kurtosis8.7255331
Mean89.148256
Median Absolute Deviation (MAD)4
Skewness-2.0783614
Sum30667
Variance51.607694
MonotonicityNot monotonic
2023-12-12T21:10:05.071897image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=36)
ValueCountFrequency (%)
90 33
 
9.6%
89 25
 
7.3%
92 24
 
7.0%
87 23
 
6.7%
91 23
 
6.7%
93 22
 
6.4%
95 22
 
6.4%
94 21
 
6.1%
86 17
 
4.9%
96 15
 
4.4%
Other values (26) 119
34.6%
ValueCountFrequency (%)
40 1
0.3%
57 1
0.3%
58 1
0.3%
62 1
0.3%
64 1
0.3%
65 1
0.3%
68 2
0.6%
70 1
0.3%
71 1
0.3%
74 1
0.3%
ValueCountFrequency (%)
100 11
3.2%
99 4
 
1.2%
98 8
 
2.3%
97 9
 
2.6%
96 15
4.4%
95 22
6.4%
94 21
6.1%
93 22
6.4%
92 24
7.0%
91 23
6.7%

Interactions

2023-12-12T21:10:00.891003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:09:59.745339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:10:00.098166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:10:01.024529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:09:59.864324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:10:00.229869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:10:01.135128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:09:59.982477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:10:00.752575image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:10:05.172446image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과정운영구분기수교육시작일교육종료일승인인원이수인원이수율
과정운영구분1.0000.1530.9991.0000.0000.0000.368
기수0.1531.0000.7970.9140.5400.5600.000
교육시작일0.9990.7971.0000.9870.8200.8470.855
교육종료일1.0000.9140.9871.0000.0000.0000.642
승인인원0.0000.5400.8200.0001.0000.9820.000
이수인원0.0000.5600.8470.0000.9821.0000.000
이수율0.3680.0000.8550.6420.0000.0001.000
2023-12-12T21:10:05.278875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기수과정운영구분
기수1.0000.252
과정운영구분0.2521.000
2023-12-12T21:10:05.367456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
승인인원이수인원이수율과정운영구분기수
승인인원1.0000.9990.1630.0000.426
이수인원0.9991.0000.1980.0000.423
이수율0.1630.1981.0000.3640.000
과정운영구분0.0000.0000.3641.0000.252
기수0.4260.4230.0000.2521.000

Missing values

2023-12-12T21:10:01.318530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:10:01.514248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

공동활용기관과정년도과정운영구분과정명기수교육시작일교육종료일승인인원이수인원이수율
0중앙교육연수원2020상시(2015 개정 국어과 교육과정)'한 학기 한 권 읽기'12020-01-062020-12-134518354278
1중앙교육연수원2020상시(고등학교)어울림 학교폭력예방프로그램(갈등해결, 감정조절, 자기존중감)12020-01-062020-12-132580214483
2중앙교육연수원2020상시(고등학교)어울림 학교폭력예방프로그램(공감, 의사소통, 학교폭력 인식 및 대처)12020-01-062020-12-131592137887
3중앙교육연수원2020상시(공통)교사와 학생이 함께 성장하는 민주시민교육12020-01-062020-12-132564217185
4중앙교육연수원2020상시(긴급지원 신고의무자 교육)긴급복지지원 신고의무자편12020-05-232020-12-1313041112964099
5중앙교육연수원2020상시(사회복지)공감과 소통을 위한 교실 속 다문화교육12020-01-062020-12-137104598284
6중앙교육연수원2020상시(사회복지)저출산·고령화 해법을 위한 인구교육12020-01-062020-12-134265400994
7중앙교육연수원2020상시(사회복지)탈북학생 교육의 실제12020-01-062020-12-132068192093
8중앙교육연수원2020상시(성희롱 예방)공직자를 위한 성희롱 예방교육12020-02-262020-12-13193081836595
9중앙교육연수원2020상시(성희롱·성폭력·성매매·가정폭력 예방교육)공공기관 폭력 예방교육12020-01-062020-12-13216562111798
공동활용기관과정년도과정운영구분과정명기수교육시작일교육종료일승인인원이수인원이수율
334중앙교육연수원2020원격(소방공무원)교수설계 및 강의능력 향상과정12020-05-252020-06-027979100
335중앙교육연수원2020원격(자격연수) 2020년 희소교과 1급 정교사 자격연수 특별연수과정12020-08-032020-08-2330830699
336중앙교육연수원2020원격(중앙교육연수원)교육부 개인정보 보호지침 및 안전성 확보 기준 이해12020-07-062020-08-101008484
337중앙교육연수원2020원격(파라과이한국학교)학교 안전사고 예방교육12020-07-132020-08-155240
338중앙교육연수원2020원격(호치민시한국국제학교)공감과 소통을 위한 교실 속 다문화교육12020-05-252020-07-31131292
339중앙교육연수원2020원격(호치민시한국국제학교)공직자를 위한 성희롱 예방교육12020-10-122020-12-041159381
340중앙교육연수원2020원격(호치민시한국국제학교)모두가 함께하는 나침반 안전교육12020-05-252020-07-31251768
341중앙교육연수원2020원격2020년 6·7급 승진후보자 역량향상과정32020-03-052020-11-19646094
342중앙교육연수원2020원격2020년 교육부 전입자 역량강화과정12020-03-052020-03-13121192
343중앙교육연수원2020원격충남대학교 - 안전한 학교 만들기 체험연수(학교 관리자 대상)12020-11-022020-12-0439538898