Overview

Dataset statistics

Number of variables9
Number of observations1280
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory95.1 KiB
Average record size in memory76.1 B

Variable types

Numeric3
Categorical2
Text1
DateTime3

Dataset

Description한국기술교육대학교 온라인평생교육원 스마트 직업훈련 플랫폼 (STEP)에 대한 LMS 학기 개설 이력을 제공합니다.
Author한국기술교육대학교
URLhttps://www.data.go.kr/data/15090844/fileData.do

Alerts

기관아이디 has constant value ""Constant
아이디 is highly overall correlated with 년도High correlation
년도 is highly overall correlated with 아이디High correlation
아이디 has unique valuesUnique
기수 has 13 (1.0%) zerosZeros

Reproduction

Analysis started2023-12-12 10:02:22.424909
Analysis finished2023-12-12 10:02:24.238402
Duration1.81 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

아이디
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct1280
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2038.0187
Minimum5
Maximum4561
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size11.4 KiB
2023-12-12T19:02:24.318077image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum5
5-th percentile70.95
Q1816.5
median1957
Q33248
95-th percentile4324.3
Maximum4561
Range4556
Interquartile range (IQR)2431.5

Descriptive statistics

Standard deviation1398.9926
Coefficient of variation (CV)0.68644738
Kurtosis-1.2460246
Mean2038.0187
Median Absolute Deviation (MAD)1207
Skewness0.16492834
Sum2608664
Variance1957180.4
MonotonicityStrictly increasing
2023-12-12T19:02:24.500715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5 1
 
0.1%
1966 1
 
0.1%
2837 1
 
0.1%
2833 1
 
0.1%
2829 1
 
0.1%
2825 1
 
0.1%
2821 1
 
0.1%
2819 1
 
0.1%
2815 1
 
0.1%
2811 1
 
0.1%
Other values (1270) 1270
99.2%
ValueCountFrequency (%)
5 1
0.1%
6 1
0.1%
9 1
0.1%
10 1
0.1%
11 1
0.1%
12 1
0.1%
13 1
0.1%
14 1
0.1%
15 1
0.1%
16 1
0.1%
ValueCountFrequency (%)
4561 1
0.1%
4558 1
0.1%
4555 1
0.1%
4552 1
0.1%
4549 1
0.1%
4546 1
0.1%
4543 1
0.1%
4540 1
0.1%
4537 1
0.1%
4534 1
0.1%

기관아이디
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size10.1 KiB
1
1280 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 1280
100.0%

Length

2023-12-12T19:02:24.636417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:02:24.747416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 1280
100.0%

년도
Real number (ℝ)

HIGH CORRELATION 

Distinct11
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2019.4344
Minimum2013
Maximum2023
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size11.4 KiB
2023-12-12T19:02:24.855746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2013
5-th percentile2015
Q12018
median2020
Q32021
95-th percentile2023
Maximum2023
Range10
Interquartile range (IQR)3

Descriptive statistics

Standard deviation2.4112856
Coefficient of variation (CV)0.0011940401
Kurtosis-0.90795922
Mean2019.4344
Median Absolute Deviation (MAD)2
Skewness-0.2717673
Sum2584876
Variance5.8142983
MonotonicityNot monotonic
2023-12-12T19:02:24.969748image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
2020 222
17.3%
2022 167
13.0%
2018 161
12.6%
2021 151
11.8%
2023 144
11.2%
2019 142
11.1%
2016 98
7.7%
2017 98
7.7%
2015 93
7.3%
2013 2
 
0.2%
ValueCountFrequency (%)
2013 2
 
0.2%
2014 2
 
0.2%
2015 93
7.3%
2016 98
7.7%
2017 98
7.7%
2018 161
12.6%
2019 142
11.1%
2020 222
17.3%
2021 151
11.8%
2022 167
13.0%
ValueCountFrequency (%)
2023 144
11.2%
2022 167
13.0%
2021 151
11.8%
2020 222
17.3%
2019 142
11.1%
2018 161
12.6%
2017 98
7.7%
2016 98
7.7%
2015 93
7.3%
2014 2
 
0.2%

기수
Real number (ℝ)

ZEROS 

Distinct90
Distinct (%)7.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean67.738281
Minimum0
Maximum12121
Zeros13
Zeros (%)1.0%
Negative0
Negative (%)0.0%
Memory size11.4 KiB
2023-12-12T19:02:25.138198image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q12
median5
Q312
95-th percentile100
Maximum12121
Range12121
Interquartile range (IQR)10

Descriptive statistics

Standard deviation488.5794
Coefficient of variation (CV)7.2127517
Kurtosis352.52312
Mean67.738281
Median Absolute Deviation (MAD)4
Skewness16.488122
Sum86705
Variance238709.83
MonotonicityNot monotonic
2023-12-12T19:02:25.723503image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 296
23.1%
2 158
12.3%
3 94
 
7.3%
4 72
 
5.6%
5 63
 
4.9%
6 54
 
4.2%
7 47
 
3.7%
8 42
 
3.3%
9 37
 
2.9%
10 34
 
2.7%
Other values (80) 383
29.9%
ValueCountFrequency (%)
0 13
 
1.0%
1 296
23.1%
2 158
12.3%
3 94
 
7.3%
4 72
 
5.6%
5 63
 
4.9%
6 54
 
4.2%
7 47
 
3.7%
8 42
 
3.3%
9 37
 
2.9%
ValueCountFrequency (%)
12121 1
 
0.1%
8194 1
 
0.1%
2021 1
 
0.1%
2020 1
 
0.1%
2019 3
0.2%
2018 3
0.2%
2017 3
0.2%
2016 5
0.4%
2015 3
0.2%
2014 3
0.2%

이름
Text

Distinct1013
Distinct (%)79.1%
Missing0
Missing (%)0.0%
Memory size10.1 KiB
2023-12-12T19:02:26.135606image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length29
Mean length12.309375
Min length2

Characters and Unicode

Total characters15756
Distinct characters366
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique927 ?
Unique (%)72.4%

Sample

1st row13기
2nd row22기
3rd row0기 (삼성협력사)
4th row1기
5th row1기 (한국기술교육대학교)
ValueCountFrequency (%)
직무 45
 
2.4%
핵심 43
 
2.3%
삼성협력사 39
 
2.1%
1기 36
 
1.9%
2020-1 30
 
1.6%
주)광진 25
 
1.3%
2기 22
 
1.2%
3기 18
 
1.0%
2022-1 16
 
0.9%
6기 16
 
0.9%
Other values (1023) 1592
84.6%
2023-12-12T19:02:26.800124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 1892
 
12.0%
0 1243
 
7.9%
1 1076
 
6.8%
) 921
 
5.8%
( 921
 
5.8%
- 900
 
5.7%
608
 
3.9%
533
 
3.4%
493
 
3.1%
3 290
 
1.8%
Other values (356) 6879
43.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6672
42.3%
Decimal Number 5365
34.1%
Close Punctuation 927
 
5.9%
Open Punctuation 927
 
5.9%
Dash Punctuation 900
 
5.7%
Space Separator 608
 
3.9%
Uppercase Letter 252
 
1.6%
Lowercase Letter 69
 
0.4%
Other Punctuation 19
 
0.1%
Connector Punctuation 9
 
0.1%
Other values (2) 8
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
533
 
8.0%
493
 
7.4%
217
 
3.3%
200
 
3.0%
193
 
2.9%
184
 
2.8%
183
 
2.7%
181
 
2.7%
133
 
2.0%
108
 
1.6%
Other values (299) 4247
63.7%
Uppercase Letter
ValueCountFrequency (%)
S 48
19.0%
P 37
14.7%
L 30
11.9%
I 24
9.5%
C 23
9.1%
T 22
8.7%
K 15
 
6.0%
A 14
 
5.6%
H 9
 
3.6%
B 6
 
2.4%
Other values (9) 24
9.5%
Lowercase Letter
ValueCountFrequency (%)
t 16
23.2%
a 11
15.9%
s 10
14.5%
b 9
13.0%
e 8
11.6%
i 5
 
7.2%
o 3
 
4.3%
m 3
 
4.3%
l 1
 
1.4%
g 1
 
1.4%
Other values (2) 2
 
2.9%
Decimal Number
ValueCountFrequency (%)
2 1892
35.3%
0 1243
23.2%
1 1076
20.1%
3 290
 
5.4%
8 192
 
3.6%
9 191
 
3.6%
7 150
 
2.8%
4 125
 
2.3%
5 108
 
2.0%
6 98
 
1.8%
Other Punctuation
ValueCountFrequency (%)
% 5
26.3%
· 4
21.1%
/ 4
21.1%
. 2
 
10.5%
, 2
 
10.5%
& 1
 
5.3%
! 1
 
5.3%
Close Punctuation
ValueCountFrequency (%)
) 921
99.4%
] 6
 
0.6%
Open Punctuation
ValueCountFrequency (%)
( 921
99.4%
[ 6
 
0.6%
Dash Punctuation
ValueCountFrequency (%)
- 900
100.0%
Space Separator
ValueCountFrequency (%)
608
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 9
100.0%
Math Symbol
ValueCountFrequency (%)
+ 6
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 8761
55.6%
Hangul 6674
42.4%
Latin 321
 
2.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
533
 
8.0%
493
 
7.4%
217
 
3.3%
200
 
3.0%
193
 
2.9%
184
 
2.8%
183
 
2.7%
181
 
2.7%
133
 
2.0%
108
 
1.6%
Other values (300) 4249
63.7%
Latin
ValueCountFrequency (%)
S 48
15.0%
P 37
11.5%
L 30
 
9.3%
I 24
 
7.5%
C 23
 
7.2%
T 22
 
6.9%
t 16
 
5.0%
K 15
 
4.7%
A 14
 
4.4%
a 11
 
3.4%
Other values (21) 81
25.2%
Common
ValueCountFrequency (%)
2 1892
21.6%
0 1243
14.2%
1 1076
12.3%
) 921
10.5%
( 921
10.5%
- 900
10.3%
608
 
6.9%
3 290
 
3.3%
8 192
 
2.2%
9 191
 
2.2%
Other values (15) 527
 
6.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 9078
57.6%
Hangul 6672
42.3%
None 6
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 1892
20.8%
0 1243
13.7%
1 1076
11.9%
) 921
10.1%
( 921
10.1%
- 900
9.9%
608
 
6.7%
3 290
 
3.2%
8 192
 
2.1%
9 191
 
2.1%
Other values (45) 844
9.3%
Hangul
ValueCountFrequency (%)
533
 
8.0%
493
 
7.4%
217
 
3.3%
200
 
3.0%
193
 
2.9%
184
 
2.8%
183
 
2.7%
181
 
2.7%
133
 
2.0%
108
 
1.6%
Other values (299) 4247
63.7%
None
ValueCountFrequency (%)
· 4
66.7%
2
33.3%
Distinct459
Distinct (%)35.9%
Missing0
Missing (%)0.0%
Memory size10.1 KiB
Minimum2015-01-21 00:00:00
Maximum2023-10-01 00:00:00
2023-12-12T19:02:27.060095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:02:27.370188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct544
Distinct (%)42.5%
Missing0
Missing (%)0.0%
Memory size10.1 KiB
Minimum2015-02-12 23:59:59
Maximum2023-10-25 23:59:59
2023-12-12T19:02:27.563478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:02:27.724786image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

등록 국가
Categorical

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size10.1 KiB
KR
1122 
UNKNOWN
158 

Length

Max length7
Median length2
Mean length2.6171875
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowKR
2nd rowKR
3rd rowKR
4th rowKR
5th rowKR

Common Values

ValueCountFrequency (%)
KR 1122
87.7%
UNKNOWN 158
 
12.3%

Length

2023-12-12T19:02:27.909667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:02:28.037665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
kr 1122
87.7%
unknown 158
 
12.3%
Distinct1111
Distinct (%)86.8%
Missing0
Missing (%)0.0%
Memory size10.1 KiB
Minimum2016-09-28 07:08:14
Maximum2023-09-22 17:38:23
2023-12-12T19:02:28.151968image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:02:28.308939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-12T19:02:23.571670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:02:22.886266image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:02:23.240467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:02:23.740594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:02:22.992548image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:02:23.362128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:02:23.878033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:02:23.102313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:02:23.469666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T19:02:28.405171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
아이디년도기수등록 국가
아이디1.0000.9770.3810.302
년도0.9771.0000.2430.337
기수0.3810.2431.0000.150
등록 국가0.3020.3370.1501.000
2023-12-12T19:02:28.496833image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
아이디년도기수등록 국가
아이디1.0000.992-0.2160.231
년도0.9921.000-0.2590.258
기수-0.216-0.2591.0000.099
등록 국가0.2310.2580.0991.000

Missing values

2023-12-12T19:02:24.024735image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T19:02:24.182821image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

아이디기관아이디년도기수이름과정 시작 일시과정 종료 일시등록 국가등록 일시
05120131313기2015-10-26 00:00:002015-10-26 23:59:59KR2016-09-28 07:08:14
16120132222기2016-01-18 00:00:002016-01-19 23:59:59KR2016-09-28 07:08:14
291201500기 (삼성협력사)2015-11-01 00:00:002015-11-02 23:59:59KR2016-09-28 07:08:14
3101201511기2015-02-01 00:00:002015-12-21 23:59:59KR2016-09-28 07:08:14
4111201511기 (한국기술교육대학교)2015-03-19 00:00:002015-03-27 23:59:59KR2016-09-28 07:08:14
5121201511기 (삼성협력사)2015-03-23 00:00:002015-10-02 23:59:59KR2016-09-28 07:08:14
6131201511기 (지오메디칼)2015-04-01 00:00:002015-04-02 23:59:59KR2016-09-28 07:08:14
7141201511기 ((주)광진)2015-04-01 00:00:002015-08-02 23:59:59KR2016-09-28 07:08:14
8151201511기 (이화전기공업(주))2015-05-01 00:00:002015-05-02 23:59:59KR2016-09-28 07:08:14
9161201511기 (진영지앤티)2015-06-01 00:00:002015-06-02 23:59:59KR2016-09-28 07:08:14
아이디기관아이디년도기수이름과정 시작 일시과정 종료 일시등록 국가등록 일시
127045341202322023-2차(공무원연금공단)2023-09-13 00:00:002023-09-13 23:59:59KR2023-09-13 15:00:59
127145371202392023-9차(삼성전자 협력회사)2023-10-01 00:00:002023-10-25 23:59:59KR2023-09-19 18:12:53
127245401202392023-9차(앰코테크놀로지코리아)2023-10-01 00:00:002023-10-25 23:59:59KR2023-09-19 18:50:53
12734543120231002023-100차(스마트양성-테스트)2023-09-17 00:00:002023-09-17 23:59:59KR2023-09-20 10:26:53
127445461202382023-8차(글로벌상생협력센터)2023-09-20 00:00:002023-09-20 23:59:59KR2023-09-21 16:39:18
127545491202372023-7차(현대제철)2023-10-01 00:00:002023-10-25 23:59:59KR2023-09-22 10:57:47
127645521202332023-3차(대덕전자)2023-10-01 00:00:002023-10-25 23:59:59KR2023-09-22 11:17:03
12774555120231002023-100차(히타치-테스트)2023-09-21 00:00:002023-09-21 23:59:59KR2023-09-22 14:20:58
127845581202342023-4차(유라코퍼레이션)2023-09-21 00:00:002023-09-21 23:59:59KR2023-09-22 15:19:16
12794561120231616기2023-10-01 00:00:002023-10-14 23:59:59KR2023-09-22 17:38:23