Overview

Dataset statistics

Number of variables11
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows33
Duplicate rows (%)0.3%
Total size in memory966.8 KiB
Average record size in memory99.0 B

Variable types

Numeric3
Categorical4
Text3
Unsupported1

Dataset

Description대학의 연도별 교육과정 정보를 공공데이터 포털을 통해 제공합니다. 2022년 기준으로 10개 대학(강원대학교, 경북대학교, 경상국립대학교, 부산대학교, 서울대학교, 전남대학교, 전북대학교, 제주대학교, 충남대학교, 충북대학교)의 교육과정 데이터가 시범적으로 서비스 됩니다.
Author한국교육학술정보원
URLhttps://www.data.go.kr/data/15113535/fileData.do

Alerts

Dataset has 33 (0.3%) duplicate rowsDuplicates
학점 is highly overall correlated with 이론시간High correlation
이론시간 is highly overall correlated with 학점High correlation
대학교명 is highly overall correlated with 과목구분High correlation
과목구분 is highly overall correlated with 대학교명High correlation
학기 is highly imbalanced (51.0%)Imbalance
이론시간 is highly skewed (γ1 = 27.03271406)Skewed
실습시간 is an unsupported type, check if it needs cleaning or further analysisUnsupported
학점 has 211 (2.1%) zerosZeros
이론시간 has 1564 (15.6%) zerosZeros

Reproduction

Analysis started2024-04-21 02:22:58.725397
Analysis finished2024-04-21 02:23:03.039917
Duration4.31 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연도
Real number (ℝ)

Distinct10
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2022.0731
Minimum2018
Maximum2027
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-21T11:23:03.096545image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2018
5-th percentile2021
Q12022
median2022
Q32022
95-th percentile2024
Maximum2027
Range9
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.89276878
Coefficient of variation (CV)0.00044151162
Kurtosis5.8477868
Mean2022.0731
Median Absolute Deviation (MAD)0
Skewness0.47393582
Sum20220731
Variance0.79703609
MonotonicityNot monotonic
2024-04-21T11:23:03.204408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
2022 8337
83.4%
2024 378
 
3.8%
2025 337
 
3.4%
2023 272
 
2.7%
2020 250
 
2.5%
2021 213
 
2.1%
2019 206
 
2.1%
2026 3
 
< 0.1%
2027 3
 
< 0.1%
2018 1
 
< 0.1%
ValueCountFrequency (%)
2018 1
 
< 0.1%
2019 206
 
2.1%
2020 250
 
2.5%
2021 213
 
2.1%
2022 8337
83.4%
2023 272
 
2.7%
2024 378
 
3.8%
2025 337
 
3.4%
2026 3
 
< 0.1%
2027 3
 
< 0.1%
ValueCountFrequency (%)
2027 3
 
< 0.1%
2026 3
 
< 0.1%
2025 337
 
3.4%
2024 378
 
3.8%
2023 272
 
2.7%
2022 8337
83.4%
2021 213
 
2.1%
2020 250
 
2.5%
2019 206
 
2.1%
2018 1
 
< 0.1%

대학교명
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
충남대학교
1448 
전남대학교
1229 
경상국립대학교
1132 
부산대학교
1101 
경북대학교
1021 
Other values (5)
4069 

Length

Max length7
Median length5
Mean length5.2264
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울대학교
2nd row서울대학교
3rd row경상국립대학교
4th row서울대학교
5th row경북대학교

Common Values

ValueCountFrequency (%)
충남대학교 1448
14.5%
전남대학교 1229
12.3%
경상국립대학교 1132
11.3%
부산대학교 1101
11.0%
경북대학교 1021
10.2%
강원대학교 922
9.2%
제주대학교 858
8.6%
전북대학교 852
8.5%
충북대학교 757
7.6%
서울대학교 680
6.8%

Length

2024-04-21T11:23:03.375849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T11:23:03.582577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
충남대학교 1448
14.5%
전남대학교 1229
12.3%
경상국립대학교 1132
11.3%
부산대학교 1101
11.0%
경북대학교 1021
10.2%
강원대학교 922
9.2%
제주대학교 858
8.6%
전북대학교 852
8.5%
충북대학교 757
7.6%
서울대학교 680
6.8%
Distinct68
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-21T11:23:03.858060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length4
Mean length5.3101
Min length3

Characters and Unicode

Total characters53101
Distinct characters104
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row공과대학
2nd row사범대학
3rd row농업생명과학대학
4th row사회과학대학
5th row인문대학
ValueCountFrequency (%)
공과대학 1348
13.5%
사범대학 1089
 
10.9%
인문대학 804
 
8.0%
농업생명과학대학 726
 
7.3%
자연과학대학 685
 
6.9%
단과대구분없음 562
 
5.6%
예술대학 525
 
5.2%
사회과학대학 494
 
4.9%
약학대학 235
 
2.4%
공학대학 224
 
2.2%
Other values (58) 3308
33.1%
2024-04-21T11:23:04.255744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
12208
23.0%
9446
17.8%
5275
 
9.9%
2302
 
4.3%
1729
 
3.3%
1371
 
2.6%
1091
 
2.1%
1089
 
2.1%
1085
 
2.0%
1055
 
2.0%
Other values (94) 16450
31.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 52521
98.9%
Uppercase Letter 366
 
0.7%
Other Punctuation 200
 
0.4%
Letter Number 14
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12208
23.2%
9446
18.0%
5275
 
10.0%
2302
 
4.4%
1729
 
3.3%
1371
 
2.6%
1091
 
2.1%
1089
 
2.1%
1085
 
2.1%
1055
 
2.0%
Other values (88) 15870
30.2%
Uppercase Letter
ValueCountFrequency (%)
I 183
50.0%
T 109
29.8%
A 74
20.2%
Letter Number
ValueCountFrequency (%)
10
71.4%
4
 
28.6%
Other Punctuation
ValueCountFrequency (%)
· 200
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 52521
98.9%
Latin 380
 
0.7%
Common 200
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12208
23.2%
9446
18.0%
5275
 
10.0%
2302
 
4.4%
1729
 
3.3%
1371
 
2.6%
1091
 
2.1%
1089
 
2.1%
1085
 
2.1%
1055
 
2.0%
Other values (88) 15870
30.2%
Latin
ValueCountFrequency (%)
I 183
48.2%
T 109
28.7%
A 74
19.5%
10
 
2.6%
4
 
1.1%
Common
ValueCountFrequency (%)
· 200
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 52521
98.9%
ASCII 366
 
0.7%
None 200
 
0.4%
Number Forms 14
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
12208
23.2%
9446
18.0%
5275
 
10.0%
2302
 
4.4%
1729
 
3.3%
1371
 
2.6%
1091
 
2.1%
1089
 
2.1%
1085
 
2.1%
1055
 
2.0%
Other values (88) 15870
30.2%
None
ValueCountFrequency (%)
· 200
100.0%
ASCII
ValueCountFrequency (%)
I 183
50.0%
T 109
29.8%
A 74
20.2%
Number Forms
ValueCountFrequency (%)
10
71.4%
4
 
28.6%
Distinct888
Distinct (%)8.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-21T11:23:04.504174image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length23
Mean length6.4735
Min length2

Characters and Unicode

Total characters64735
Distinct characters311
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique74 ?
Unique (%)0.7%

Sample

1st row재료공학부
2nd row윤리교육과
3rd row원예과학부
4th row인류학과
5th row독어독문학과
ValueCountFrequency (%)
기타모집단위 184
 
1.7%
수의학과 131
 
1.2%
간호학과 121
 
1.1%
약학과 111
 
1.0%
체육교육과 106
 
1.0%
의학과 103
 
1.0%
연계전공 96
 
0.9%
철학과 84
 
0.8%
정치외교학과 80
 
0.8%
사학과 76
 
0.7%
Other values (895) 9493
89.7%
2024-04-21T11:23:04.865359image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8731
 
13.5%
7190
 
11.1%
4859
 
7.5%
2276
 
3.5%
1609
 
2.5%
1409
 
2.2%
1322
 
2.0%
960
 
1.5%
906
 
1.4%
895
 
1.4%
Other values (301) 34578
53.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 62598
96.7%
Space Separator 587
 
0.9%
Close Punctuation 472
 
0.7%
Open Punctuation 472
 
0.7%
Other Punctuation 269
 
0.4%
Uppercase Letter 215
 
0.3%
Decimal Number 55
 
0.1%
Lowercase Letter 45
 
0.1%
Dash Punctuation 17
 
< 0.1%
Math Symbol 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8731
 
13.9%
7190
 
11.5%
4859
 
7.8%
2276
 
3.6%
1609
 
2.6%
1409
 
2.3%
1322
 
2.1%
960
 
1.5%
906
 
1.4%
895
 
1.4%
Other values (270) 32441
51.8%
Uppercase Letter
ValueCountFrequency (%)
I 80
37.2%
T 57
26.5%
A 23
 
10.7%
E 19
 
8.8%
S 10
 
4.7%
B 6
 
2.8%
O 5
 
2.3%
M 5
 
2.3%
P 4
 
1.9%
W 3
 
1.4%
Lowercase Letter
ValueCountFrequency (%)
o 15
33.3%
i 10
22.2%
y 5
 
11.1%
t 5
 
11.1%
l 5
 
11.1%
b 5
 
11.1%
Decimal Number
ValueCountFrequency (%)
5 28
50.9%
6 14
25.5%
4 6
 
10.9%
2 6
 
10.9%
1 1
 
1.8%
Other Punctuation
ValueCountFrequency (%)
· 101
37.5%
. 92
34.2%
/ 74
27.5%
& 2
 
0.7%
Space Separator
ValueCountFrequency (%)
587
100.0%
Close Punctuation
ValueCountFrequency (%)
) 472
100.0%
Open Punctuation
ValueCountFrequency (%)
( 472
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 17
100.0%
Math Symbol
ValueCountFrequency (%)
+ 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 62598
96.7%
Common 1877
 
2.9%
Latin 260
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8731
 
13.9%
7190
 
11.5%
4859
 
7.8%
2276
 
3.6%
1609
 
2.6%
1409
 
2.3%
1322
 
2.1%
960
 
1.5%
906
 
1.4%
895
 
1.4%
Other values (270) 32441
51.8%
Latin
ValueCountFrequency (%)
I 80
30.8%
T 57
21.9%
A 23
 
8.8%
E 19
 
7.3%
o 15
 
5.8%
i 10
 
3.8%
S 10
 
3.8%
B 6
 
2.3%
y 5
 
1.9%
t 5
 
1.9%
Other values (7) 30
 
11.5%
Common
ValueCountFrequency (%)
587
31.3%
) 472
25.1%
( 472
25.1%
· 101
 
5.4%
. 92
 
4.9%
/ 74
 
3.9%
5 28
 
1.5%
- 17
 
0.9%
6 14
 
0.7%
4 6
 
0.3%
Other values (4) 14
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 62593
96.7%
ASCII 2036
 
3.1%
None 101
 
0.2%
Compat Jamo 5
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
8731
 
13.9%
7190
 
11.5%
4859
 
7.8%
2276
 
3.6%
1609
 
2.6%
1409
 
2.3%
1322
 
2.1%
960
 
1.5%
906
 
1.4%
895
 
1.4%
Other values (269) 32436
51.8%
ASCII
ValueCountFrequency (%)
587
28.8%
) 472
23.2%
( 472
23.2%
. 92
 
4.5%
I 80
 
3.9%
/ 74
 
3.6%
T 57
 
2.8%
5 28
 
1.4%
A 23
 
1.1%
E 19
 
0.9%
Other values (20) 132
 
6.5%
None
ValueCountFrequency (%)
· 101
100.0%
Compat Jamo
ValueCountFrequency (%)
5
100.0%
Distinct7433
Distinct (%)74.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-21T11:23:05.142905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length44
Median length27
Mean length7.3164
Min length2

Characters and Unicode

Total characters73164
Distinct characters639
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6485 ?
Unique (%)64.8%

Sample

1st row고분자재료화학
2nd row정치사상교육론
3rd row도시농업학
4th row한국 민속 문화의 이해
5th row독일문장구조의이해
ValueCountFrequency (%)
241
 
1.9%
1 195
 
1.6%
2 183
 
1.5%
101
 
0.8%
실습 75
 
0.6%
이해 65
 
0.5%
전공진로설계 64
 
0.5%
i 59
 
0.5%
실험 47
 
0.4%
ii 44
 
0.4%
Other values (7708) 11405
91.4%
2024-04-21T11:23:05.537209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3882
 
5.3%
2502
 
3.4%
1600
 
2.2%
1226
 
1.7%
1195
 
1.6%
1176
 
1.6%
1079
 
1.5%
1036
 
1.4%
1028
 
1.4%
999
 
1.4%
Other values (629) 57441
78.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 65313
89.3%
Space Separator 2502
 
3.4%
Decimal Number 2025
 
2.8%
Uppercase Letter 1419
 
1.9%
Open Punctuation 463
 
0.6%
Close Punctuation 463
 
0.6%
Letter Number 290
 
0.4%
Other Punctuation 264
 
0.4%
Lowercase Letter 251
 
0.3%
Dash Punctuation 167
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3882
 
5.9%
1600
 
2.4%
1226
 
1.9%
1195
 
1.8%
1176
 
1.8%
1079
 
1.7%
1036
 
1.6%
1028
 
1.6%
999
 
1.5%
983
 
1.5%
Other values (551) 51109
78.3%
Uppercase Letter
ValueCountFrequency (%)
I 783
55.2%
C 76
 
5.4%
A 64
 
4.5%
V 58
 
4.1%
S 52
 
3.7%
D 48
 
3.4%
T 43
 
3.0%
P 41
 
2.9%
B 30
 
2.1%
L 28
 
2.0%
Other values (16) 196
 
13.8%
Lowercase Letter
ValueCountFrequency (%)
e 37
14.7%
o 24
 
9.6%
i 21
 
8.4%
a 20
 
8.0%
t 19
 
7.6%
n 18
 
7.2%
r 13
 
5.2%
s 12
 
4.8%
l 12
 
4.8%
u 12
 
4.8%
Other values (10) 63
25.1%
Decimal Number
ValueCountFrequency (%)
1 767
37.9%
2 744
36.7%
3 201
 
9.9%
4 115
 
5.7%
6 68
 
3.4%
5 68
 
3.4%
7 26
 
1.3%
8 25
 
1.2%
9 6
 
0.3%
0 5
 
0.2%
Letter Number
ValueCountFrequency (%)
141
48.6%
126
43.4%
8
 
2.8%
5
 
1.7%
5
 
1.7%
2
 
0.7%
2
 
0.7%
1
 
0.3%
Other Punctuation
ValueCountFrequency (%)
: 116
43.9%
· 99
37.5%
, 23
 
8.7%
. 10
 
3.8%
/ 8
 
3.0%
& 6
 
2.3%
* 1
 
0.4%
; 1
 
0.4%
Math Symbol
ValueCountFrequency (%)
+ 6
85.7%
~ 1
 
14.3%
Space Separator
ValueCountFrequency (%)
2502
100.0%
Open Punctuation
ValueCountFrequency (%)
( 463
100.0%
Close Punctuation
ValueCountFrequency (%)
) 463
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 167
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 65313
89.3%
Common 5891
 
8.1%
Latin 1960
 
2.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3882
 
5.9%
1600
 
2.4%
1226
 
1.9%
1195
 
1.8%
1176
 
1.8%
1079
 
1.7%
1036
 
1.6%
1028
 
1.6%
999
 
1.5%
983
 
1.5%
Other values (551) 51109
78.3%
Latin
ValueCountFrequency (%)
I 783
39.9%
141
 
7.2%
126
 
6.4%
C 76
 
3.9%
A 64
 
3.3%
V 58
 
3.0%
S 52
 
2.7%
D 48
 
2.4%
T 43
 
2.2%
P 41
 
2.1%
Other values (44) 528
26.9%
Common
ValueCountFrequency (%)
2502
42.5%
1 767
 
13.0%
2 744
 
12.6%
( 463
 
7.9%
) 463
 
7.9%
3 201
 
3.4%
- 167
 
2.8%
: 116
 
2.0%
4 115
 
2.0%
· 99
 
1.7%
Other values (14) 254
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 65309
89.3%
ASCII 7462
 
10.2%
Number Forms 290
 
0.4%
None 99
 
0.1%
Compat Jamo 4
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3882
 
5.9%
1600
 
2.4%
1226
 
1.9%
1195
 
1.8%
1176
 
1.8%
1079
 
1.7%
1036
 
1.6%
1028
 
1.6%
999
 
1.5%
983
 
1.5%
Other values (550) 51105
78.3%
ASCII
ValueCountFrequency (%)
2502
33.5%
I 783
 
10.5%
1 767
 
10.3%
2 744
 
10.0%
( 463
 
6.2%
) 463
 
6.2%
3 201
 
2.7%
- 167
 
2.2%
: 116
 
1.6%
4 115
 
1.5%
Other values (59) 1141
15.3%
Number Forms
ValueCountFrequency (%)
141
48.6%
126
43.4%
8
 
2.8%
5
 
1.7%
5
 
1.7%
2
 
0.7%
2
 
0.7%
1
 
0.3%
None
ValueCountFrequency (%)
· 99
100.0%
Compat Jamo
ValueCountFrequency (%)
4
100.0%

학년
Categorical

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
3
2908 
2
2496 
4
2305 
1
1604 
전학년
594 
Other values (2)
 
93

Length

Max length3
Median length1
Mean length1.1188
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3
2nd row3
3rd row4
4th row3
5th row3

Common Values

ValueCountFrequency (%)
3 2908
29.1%
2 2496
25.0%
4 2305
23.1%
1 1604
16.0%
전학년 594
 
5.9%
5 71
 
0.7%
6 22
 
0.2%

Length

2024-04-21T11:23:05.697279image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T11:23:05.814614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3 2908
29.1%
2 2496
25.0%
4 2305
23.1%
1 1604
16.0%
전학년 594
 
5.9%
5 71
 
0.7%
6 22
 
0.2%

학기
Categorical

IMBALANCE 

Distinct12
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1학기
4428 
2학기
4213 
1,2학기
 
426
학기구분없음
 
277
전학기
 
265
Other values (7)
 
391

Length

Max length9
Median length3
Mean length3.363
Min length3

Unique

Unique3 ?
Unique (%)< 0.1%

Sample

1st row1학기
2nd row2학기
3rd row1학기
4th row2학기
5th row1학기

Common Values

ValueCountFrequency (%)
1학기 4428
44.3%
2학기 4213
42.1%
1,2학기 426
 
4.3%
학기구분없음 277
 
2.8%
전학기 265
 
2.6%
1,2학기동시개설 204
 
2.0%
동하계계절학기 166
 
1.7%
하계계절학기 9
 
0.1%
동계계절학기 9
 
0.1%
겨울학기 1
 
< 0.1%
Other values (2) 2
 
< 0.1%

Length

2024-04-21T11:23:05.936247image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
1학기 4428
44.3%
2학기 4213
42.1%
1,2학기 426
 
4.3%
학기구분없음 277
 
2.8%
전학기 265
 
2.6%
1,2학기동시개설 204
 
2.0%
동하계계절학기 166
 
1.7%
하계계절학기 9
 
0.1%
동계계절학기 9
 
0.1%
겨울학기 1
 
< 0.1%
Other values (2) 2
 
< 0.1%

학점
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct18
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.76555
Minimum0
Maximum18
Zeros211
Zeros (%)2.1%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-21T11:23:06.049439image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q13
median3
Q33
95-th percentile3
Maximum18
Range18
Interquartile range (IQR)0

Descriptive statistics

Standard deviation1.3113658
Coefficient of variation (CV)0.47417901
Kurtosis50.31811
Mean2.76555
Median Absolute Deviation (MAD)0
Skewness5.402237
Sum27655.5
Variance1.7196802
MonotonicityNot monotonic
2024-04-21T11:23:06.168550image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
3.0 7236
72.4%
2.0 1565
 
15.7%
1.0 624
 
6.2%
0.0 211
 
2.1%
6.0 120
 
1.2%
0.5 66
 
0.7%
15.0 64
 
0.6%
4.0 58
 
0.6%
1.5 30
 
0.3%
5.0 11
 
0.1%
Other values (8) 15
 
0.1%
ValueCountFrequency (%)
0.0 211
 
2.1%
0.5 66
 
0.7%
1.0 624
 
6.2%
1.5 30
 
0.3%
2.0 1565
 
15.7%
2.5 1
 
< 0.1%
3.0 7236
72.4%
4.0 58
 
0.6%
5.0 11
 
0.1%
5.5 1
 
< 0.1%
ValueCountFrequency (%)
18.0 1
 
< 0.1%
15.0 64
0.6%
13.0 1
 
< 0.1%
12.0 5
 
0.1%
9.0 3
 
< 0.1%
8.0 2
 
< 0.1%
6.5 1
 
< 0.1%
6.0 120
1.2%
5.5 1
 
< 0.1%
5.0 11
 
0.1%

이론시간
Real number (ℝ)

HIGH CORRELATION  SKEWED  ZEROS 

Distinct34
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.44475
Minimum0
Maximum200
Zeros1564
Zeros (%)15.6%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-21T11:23:06.309373image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q12
median3
Q33
95-th percentile3
Maximum200
Range200
Interquartile range (IQR)1

Descriptive statistics

Standard deviation3.6555586
Coefficient of variation (CV)1.4952689
Kurtosis1109.4315
Mean2.44475
Median Absolute Deviation (MAD)0
Skewness27.032714
Sum24447.5
Variance13.363109
MonotonicityNot monotonic
2024-04-21T11:23:06.456880image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=34)
ValueCountFrequency (%)
3.0 6008
60.1%
2.0 1985
 
19.9%
0.0 1564
 
15.6%
1.0 370
 
3.7%
4.0 13
 
0.1%
32.0 12
 
0.1%
10.0 4
 
< 0.1%
36.0 4
 
< 0.1%
16.0 4
 
< 0.1%
45.0 3
 
< 0.1%
Other values (24) 33
 
0.3%
ValueCountFrequency (%)
0.0 1564
 
15.6%
0.5 3
 
< 0.1%
1.0 370
 
3.7%
1.5 1
 
< 0.1%
2.0 1985
 
19.9%
3.0 6008
60.1%
3.5 1
 
< 0.1%
4.0 13
 
0.1%
4.5 1
 
< 0.1%
5.0 2
 
< 0.1%
ValueCountFrequency (%)
200.0 1
 
< 0.1%
120.0 1
 
< 0.1%
102.0 1
 
< 0.1%
92.0 1
 
< 0.1%
75.0 1
 
< 0.1%
69.0 1
 
< 0.1%
60.0 1
 
< 0.1%
56.0 1
 
< 0.1%
53.0 1
 
< 0.1%
45.0 3
< 0.1%

실습시간
Unsupported

REJECTED  UNSUPPORTED 

Missing0
Missing (%)0.0%
Memory size156.2 KiB

과목구분
Categorical

HIGH CORRELATION 

Distinct47
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
전공선택
3223 
전선
1285 
전공
1223 
전공필수
1134 
전공(심화)
473 
Other values (42)
2662 

Length

Max length13
Median length11
Mean length3.6685
Min length2

Unique

Unique4 ?
Unique (%)< 0.1%

Sample

1st row전선
2nd row전선
3rd row전공선택
4th row전선
5th row전공

Common Values

ValueCountFrequency (%)
전공선택 3223
32.2%
전선 1285
 
12.8%
전공 1223
 
12.2%
전공필수 1134
 
11.3%
전공(심화) 473
 
4.7%
전필 421
 
4.2%
전공(핵심) 388
 
3.9%
교양(핵심) 249
 
2.5%
전공(기초) 231
 
2.3%
교양 208
 
2.1%
Other values (37) 1165
 
11.7%

Length

2024-04-21T11:23:06.594826image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
전공선택 3223
32.2%
전선 1285
 
12.8%
전공 1223
 
12.2%
전공필수 1134
 
11.3%
전공(심화 473
 
4.7%
전필 421
 
4.2%
전공(핵심 388
 
3.9%
교양(핵심 249
 
2.5%
전공(기초 231
 
2.3%
교양 208
 
2.1%
Other values (37) 1165
 
11.7%

Interactions

2024-04-21T11:23:02.426410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T11:23:01.749299image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T11:23:02.114760image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T11:23:02.526833image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T11:23:01.906244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T11:23:02.215302image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T11:23:02.635396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T11:23:02.012637image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T11:23:02.320720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T11:23:06.675951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도대학교명단과대학명학년학기학점이론시간과목구분
연도1.0000.7180.5560.5530.3570.3330.0000.594
대학교명0.7181.0000.8570.2540.5770.3460.0920.915
단과대학명0.5560.8571.0000.5900.4900.3480.3120.822
학년0.5530.2540.5901.0000.5890.3310.0000.767
학기0.3570.5770.4900.5891.0000.4950.0560.839
학점0.3330.3460.3480.3310.4951.0000.1970.420
이론시간0.0000.0920.3120.0000.0560.1971.0000.053
과목구분0.5940.9150.8220.7670.8390.4200.0531.000
2024-04-21T11:23:06.788441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
학기대학교명학년과목구분
학기1.0000.2860.3360.442
대학교명0.2861.0000.1310.619
학년0.3360.1311.0000.436
과목구분0.4420.6190.4361.000
2024-04-21T11:23:06.883305image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도학점이론시간대학교명학년학기과목구분
연도1.0000.056-0.1370.4060.3360.1510.242
학점0.0561.0000.6030.1180.1620.2460.164
이론시간-0.1370.6031.0000.0450.0000.0280.046
대학교명0.4060.1180.0451.0000.1310.2860.619
학년0.3360.1620.0000.1311.0000.3360.436
학기0.1510.2460.0280.2860.3361.0000.442
과목구분0.2420.1640.0460.6190.4360.4421.000

Missing values

2024-04-21T11:23:02.798672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T11:23:02.964567image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연도대학교명단과대학명학부 과명과목명학년학기학점이론시간실습시간과목구분
292012022서울대학교공과대학재료공학부고분자재료화학31학기3.03.00전선
302452022서울대학교사범대학윤리교육과정치사상교육론32학기3.03.00전선
154422025경상국립대학교농업생명과학대학원예과학부도시농업학41학기2.02.00전공선택
306352022서울대학교사회과학대학인류학과한국 민속 문화의 이해32학기3.03.00전선
121172020경북대학교인문대학독어독문학과독일문장구조의이해31학기3.03.00전공
37002022강원대학교단과대구분없음기타모집단위지식재산권의철학21학기3.03.00전공선택
391932022전남대학교자연과학대학수학과미분기하1및실습32학기3.02.02전필
334762022전남대학교경영대학경제학부도시·지역관광개발론22학기3.03.00전선
508392022제주대학교인문대학철학과정보기술철학41학기3.03.00전공
283052022부산대학교자연과학대학지질환경과학과3영역 : 문학과예술전학년전학기3.03.00교양선택
연도대학교명단과대학명학부 과명과목명학년학기학점이론시간실습시간과목구분
107702022경북대학교생태환경대학축산생명공학과화학 II12학기3.03.00교양
130792022경북대학교IT대학전자공학부화학 I11,2학기동시개설3.03.00교양
270432022부산대학교의과대학의학과의학연구(I)11학기2.00.04전공필수
459102022전북대학교예술대학음악과부전공실기 4(작곡)22학기1.00.02전공선택
162122023경상국립대학교농업생명과학대학산림환경자원학전공(통합대학)산림자원경제학22학기3.03.00전공선택
123582022경북대학교인문대학철학과교양한문11학기3.03.00교양
111012022경북대학교수의과대학수의학과수의화학111학기3.03.00전공
597602022충남대학교인문대학영어영문학과처음 만나는 역사학11,2학기3.03.00교양(핵심)
364742022전남대학교사범대학체육교육과학교현장실습41학기2.00.04전필
334452022전남대학교경영대학경제학부금융경제학31학기3.03.00전선

Duplicate rows

Most frequently occurring

연도대학교명단과대학명학부 과명과목명학년학기학점이론시간과목구분# duplicates
12022강원대학교단과대구분없음기타모집단위운영체제31학기3.03.0전공선택3
22022강원대학교단과대구분없음기타모집단위인공지능32학기3.03.0전공선택3
92022제주대학교교육대학초등교육과진로와취·창업상담Ⅰ-13학기구분없음0.00.0전공필수3
122022제주대학교교육대학초등교육과진로와학업설계상담Ⅰ-21학기구분없음0.00.0전공필수3
132022제주대학교교육대학초등교육과진로와학업설계상담Ⅱ-12학기구분없음0.00.0전공필수3
02022강원대학교단과대구분없음기타모집단위SW국내단기현장실습(8주)42학기6.00.0전공선택2
32022부산대학교약학대학약학전공임상약물치료학(III)52학기3.03.0전공필수2
42022전남대학교AI융합대학인공지능학부C++프로그래밍및실습22학기3.02.0전선2
52022전남대학교공과대학기계공학부공학설계입문12학기3.02.0전필2
62022전남대학교공과대학기계공학부내연기관32학기3.03.0전선2