Overview

Dataset statistics

Number of variables8
Number of observations65
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.4 KiB
Average record size in memory70.0 B

Variable types

Numeric3
Categorical3
Text1
DateTime1

Dataset

Description한국기술교육대학교 온라인평생교육원 스마트 직업훈련 플랫폼 (STEP)에 대한 LMS 과정 아이템 관련 내용을 제공합니다.
Author한국기술교육대학교
URLhttps://www.data.go.kr/data/15091002/fileData.do

Alerts

타입 코드 has constant value ""Constant
아이디 is highly overall correlated with 과정 아이디 and 2 other fieldsHigh correlation
과정 아이디 is highly overall correlated with 아이디 and 2 other fieldsHigh correlation
요구 과목 코드 수 is highly overall correlated with 아이디 and 2 other fieldsHigh correlation
등록 국가 is highly overall correlated with 아이디 and 2 other fieldsHigh correlation
아이디 has unique valuesUnique
등록 일시 has unique valuesUnique
요구 과목 코드 수 has 5 (7.7%) zerosZeros

Reproduction

Analysis started2023-12-12 12:59:29.124174
Analysis finished2023-12-12 12:59:30.751315
Duration1.63 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

아이디
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct65
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean92.676923
Minimum3
Maximum199
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size717.0 B
2023-12-12T21:59:30.850464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3
5-th percentile11.2
Q141
median81
Q3148
95-th percentile189.4
Maximum199
Range196
Interquartile range (IQR)107

Descriptive statistics

Standard deviation60.789881
Coefficient of variation (CV)0.65593331
Kurtosis-1.3318213
Mean92.676923
Median Absolute Deviation (MAD)51
Skewness0.25926197
Sum6024
Variance3695.4096
MonotonicityStrictly increasing
2023-12-12T21:59:31.015024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3 1
 
1.5%
151 1
 
1.5%
94 1
 
1.5%
97 1
 
1.5%
100 1
 
1.5%
103 1
 
1.5%
109 1
 
1.5%
117 1
 
1.5%
125 1
 
1.5%
130 1
 
1.5%
Other values (55) 55
84.6%
ValueCountFrequency (%)
3 1
1.5%
5 1
1.5%
8 1
1.5%
11 1
1.5%
12 1
1.5%
15 1
1.5%
18 1
1.5%
21 1
1.5%
27 1
1.5%
29 1
1.5%
ValueCountFrequency (%)
199 1
1.5%
196 1
1.5%
193 1
1.5%
190 1
1.5%
187 1
1.5%
184 1
1.5%
181 1
1.5%
178 1
1.5%
175 1
1.5%
172 1
1.5%

과정 아이디
Real number (ℝ)

HIGH CORRELATION 

Distinct47
Distinct (%)72.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean73.538462
Minimum2
Maximum172
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size717.0 B
2023-12-12T21:59:31.174699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile8.2
Q125
median61
Q3127
95-th percentile162.4
Maximum172
Range170
Interquartile range (IQR)102

Descriptive statistics

Standard deviation55.327174
Coefficient of variation (CV)0.75235697
Kurtosis-1.4113493
Mean73.538462
Median Absolute Deviation (MAD)45
Skewness0.34994571
Sum4780
Variance3061.0962
MonotonicityNot monotonic
2023-12-12T21:59:31.318371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=47)
ValueCountFrequency (%)
9 3
 
4.6%
16 3
 
4.6%
46 2
 
3.1%
26 2
 
3.1%
106 2
 
3.1%
5 2
 
3.1%
145 2
 
3.1%
61 2
 
3.1%
34 2
 
3.1%
30 2
 
3.1%
Other values (37) 43
66.2%
ValueCountFrequency (%)
2 1
 
1.5%
5 2
3.1%
8 1
 
1.5%
9 3
4.6%
12 1
 
1.5%
16 3
4.6%
20 1
 
1.5%
21 2
3.1%
24 2
3.1%
25 2
3.1%
ValueCountFrequency (%)
172 1
1.5%
169 1
1.5%
166 1
1.5%
163 1
1.5%
160 1
1.5%
157 1
1.5%
154 1
1.5%
148 1
1.5%
145 2
3.1%
142 2
3.1%

타입 코드
Categorical

CONSTANT 

Distinct1
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size652.0 B
일반
65 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반
2nd row일반
3rd row일반
4th row일반
5th row일반

Common Values

ValueCountFrequency (%)
일반 65
100.0%

Length

2023-12-12T21:59:31.454224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:59:31.536253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반 65
100.0%

표시 순서
Categorical

Distinct3
Distinct (%)4.6%
Missing0
Missing (%)0.0%
Memory size652.0 B
1
48 
2
15 
3
 
2

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 48
73.8%
2 15
 
23.1%
3 2
 
3.1%

Length

2023-12-12T21:59:31.625703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:59:31.715004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 48
73.8%
2 15
 
23.1%
3 2
 
3.1%

제목
Text

Distinct37
Distinct (%)56.9%
Missing0
Missing (%)0.0%
Memory size652.0 B
2023-12-12T21:59:31.876049image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length15
Mean length6.0769231
Min length2

Characters and Unicode

Total characters395
Distinct characters85
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique29 ?
Unique (%)44.6%

Sample

1st row빅데이터 분석 실무 활용 중급 집체교육
2nd row파트 1
3rd row파트 2
4th row필수
5th row선택 과목
ValueCountFrequency (%)
필수 24
18.3%
과정 22
16.8%
과목 11
 
8.4%
수강 7
 
5.3%
패키지 7
 
5.3%
테스트 6
 
4.6%
직무전문 3
 
2.3%
선택 3
 
2.3%
test 3
 
2.3%
1 3
 
2.3%
Other values (34) 42
32.1%
2023-12-12T21:59:32.179384image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
66
16.7%
37
 
9.4%
34
 
8.6%
26
 
6.6%
25
 
6.3%
13
 
3.3%
9
 
2.3%
8
 
2.0%
8
 
2.0%
8
 
2.0%
Other values (75) 161
40.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 297
75.2%
Space Separator 66
 
16.7%
Lowercase Letter 12
 
3.0%
Decimal Number 8
 
2.0%
Other Punctuation 7
 
1.8%
Uppercase Letter 2
 
0.5%
Open Punctuation 1
 
0.3%
Close Punctuation 1
 
0.3%
Connector Punctuation 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
37
 
12.5%
34
 
11.4%
26
 
8.8%
25
 
8.4%
13
 
4.4%
9
 
3.0%
8
 
2.7%
8
 
2.7%
8
 
2.7%
7
 
2.4%
Other values (61) 122
41.1%
Lowercase Letter
ValueCountFrequency (%)
t 6
50.0%
e 3
25.0%
s 3
25.0%
Other Punctuation
ValueCountFrequency (%)
? 3
42.9%
# 2
28.6%
. 2
28.6%
Decimal Number
ValueCountFrequency (%)
2 4
50.0%
1 4
50.0%
Uppercase Letter
ValueCountFrequency (%)
U 1
50.0%
X 1
50.0%
Space Separator
ValueCountFrequency (%)
66
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 297
75.2%
Common 84
 
21.3%
Latin 14
 
3.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
37
 
12.5%
34
 
11.4%
26
 
8.8%
25
 
8.4%
13
 
4.4%
9
 
3.0%
8
 
2.7%
8
 
2.7%
8
 
2.7%
7
 
2.4%
Other values (61) 122
41.1%
Common
ValueCountFrequency (%)
66
78.6%
2 4
 
4.8%
1 4
 
4.8%
? 3
 
3.6%
# 2
 
2.4%
. 2
 
2.4%
( 1
 
1.2%
) 1
 
1.2%
_ 1
 
1.2%
Latin
ValueCountFrequency (%)
t 6
42.9%
e 3
21.4%
s 3
21.4%
U 1
 
7.1%
X 1
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 294
74.4%
ASCII 98
 
24.8%
Compat Jamo 3
 
0.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
66
67.3%
t 6
 
6.1%
2 4
 
4.1%
1 4
 
4.1%
? 3
 
3.1%
e 3
 
3.1%
s 3
 
3.1%
# 2
 
2.0%
. 2
 
2.0%
U 1
 
1.0%
Other values (4) 4
 
4.1%
Hangul
ValueCountFrequency (%)
37
 
12.6%
34
 
11.6%
26
 
8.8%
25
 
8.5%
13
 
4.4%
9
 
3.1%
8
 
2.7%
8
 
2.7%
8
 
2.7%
7
 
2.4%
Other values (58) 119
40.5%
Compat Jamo
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

요구 과목 코드 수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct6
Distinct (%)9.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0307692
Minimum0
Maximum5
Zeros5
Zeros (%)7.7%
Negative0
Negative (%)0.0%
Memory size717.0 B
2023-12-12T21:59:32.300502image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11
median2
Q33
95-th percentile4
Maximum5
Range5
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.2496153
Coefficient of variation (CV)0.61534088
Kurtosis-0.87163216
Mean2.0307692
Median Absolute Deviation (MAD)1
Skewness0.28728053
Sum132
Variance1.5615385
MonotonicityNot monotonic
2023-12-12T21:59:32.432436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
1 23
35.4%
3 16
24.6%
2 12
18.5%
4 8
 
12.3%
0 5
 
7.7%
5 1
 
1.5%
ValueCountFrequency (%)
0 5
 
7.7%
1 23
35.4%
2 12
18.5%
3 16
24.6%
4 8
 
12.3%
5 1
 
1.5%
ValueCountFrequency (%)
5 1
 
1.5%
4 8
 
12.3%
3 16
24.6%
2 12
18.5%
1 23
35.4%
0 5
 
7.7%

등록 국가
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)3.1%
Missing0
Missing (%)0.0%
Memory size652.0 B
KR
42 
UNKNOWN
23 

Length

Max length7
Median length2
Mean length3.7692308
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowKR
2nd rowKR
3rd rowKR
4th rowKR
5th rowKR

Common Values

ValueCountFrequency (%)
KR 42
64.6%
UNKNOWN 23
35.4%

Length

2023-12-12T21:59:32.546120image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:59:32.656399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
kr 42
64.6%
unknown 23
35.4%

등록 일시
Date

UNIQUE 

Distinct65
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size652.0 B
Minimum2017-12-04 19:43:47
Maximum2023-08-21 13:32:11
2023-12-12T21:59:32.780098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:59:32.903917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-12T21:59:30.228279image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:59:29.367795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:59:29.625213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:59:30.332347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:59:29.451971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:59:29.749231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:59:30.428362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:59:29.534029image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:59:30.132236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:59:32.990824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
아이디과정 아이디표시 순서제목요구 과목 코드 수등록 국가등록 일시
아이디1.0000.9610.0000.8510.5530.9841.000
과정 아이디0.9611.0000.3140.9180.5870.9871.000
표시 순서0.0000.3141.0000.0000.5680.1481.000
제목0.8510.9180.0001.0000.8020.9841.000
요구 과목 코드 수0.5530.5870.5680.8021.0000.9461.000
등록 국가0.9840.9870.1480.9840.9461.0001.000
등록 일시1.0001.0001.0001.0001.0001.0001.000
2023-12-12T21:59:33.092087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
등록 국가표시 순서
등록 국가1.0000.241
표시 순서0.2411.000
2023-12-12T21:59:33.183249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
아이디과정 아이디요구 과목 코드 수표시 순서등록 국가
아이디1.0000.9950.6540.0000.830
과정 아이디0.9951.0000.6510.1750.831
요구 과목 코드 수0.6540.6511.0000.2730.766
표시 순서0.0000.1750.2731.0000.241
등록 국가0.8300.8310.7660.2411.000

Missing values

2023-12-12T21:59:30.550141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:59:30.697416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

아이디과정 아이디타입 코드표시 순서제목요구 과목 코드 수등록 국가등록 일시
032일반1빅데이터 분석 실무 활용 중급 집체교육1KR2017-12-04 19:43:47
155일반1파트 11KR2017-12-07 17:10:58
285일반1파트 21KR2017-12-07 17:11:46
3118일반1필수2KR2018-04-16 21:56:29
4129일반1선택 과목0KR2018-05-04 18:55:30
5159일반3필수 수강 과목1KR2018-05-04 18:56:04
6189일반2필수 수강 과목1KR2018-05-04 18:56:20
72112일반1필수 수강 과목5KR2018-05-04 19:02:30
82716일반1선택 과목0KR2018-05-24 16:56:46
92916일반2필수 수강 과목1KR2018-05-24 16:57:08
아이디과정 아이디타입 코드표시 순서제목요구 과목 코드 수등록 국가등록 일시
55172145일반1필수과정3UNKNOWN2023-08-21 11:02:46
56175145일반2보조 과정0UNKNOWN2023-08-21 11:04:38
57178148일반1필수 과정4UNKNOWN2023-08-21 11:17:33
58181154일반1필수 과정3UNKNOWN2023-08-21 11:20:26
59184157일반1필수 과정3UNKNOWN2023-08-21 11:24:17
60187160일반1필수 과정3UNKNOWN2023-08-21 13:12:09
61190163일반1필수 과정4UNKNOWN2023-08-21 13:15:27
62193166일반1필수 과정3UNKNOWN2023-08-21 13:18:00
63196169일반1필수 과정4UNKNOWN2023-08-21 13:28:54
64199172일반1필수 과정4UNKNOWN2023-08-21 13:32:11