Overview

Dataset statistics

Number of variables4
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows961
Duplicate rows (%)9.6%
Total size in memory390.6 KiB
Average record size in memory40.0 B

Variable types

Categorical2
DateTime1
Text1

Dataset

Description다문화 및 탈북가정 자녀와 대학생 간 멘토링 시, 대학생의 학업 시간표와 중복된 시간에는 멘토링 활동이 불가능합니다. 이에, 대학생의 학업 시간표가 어떤 시간대에 주로 있는지 데이터를 공개하오니 멘토링 계획 수립 시 참고하시길 바랍니다.
URLhttps://www.data.go.kr/data/15100460/fileData.do

Alerts

Dataset has 961 (9.6%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 09:15:48.608973
Analysis finished2023-12-12 09:15:49.146413
Duration0.54 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

수업구분명
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
정기학기
5222 
정기2학기
4630 
계절학기
 
111
계절2학기
 
37

Length

Max length5
Median length4
Mean length4.4667
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row정기학기
2nd row정기학기
3rd row정기학기
4th row정기학기
5th row정기학기

Common Values

ValueCountFrequency (%)
정기학기 5222
52.2%
정기2학기 4630
46.3%
계절학기 111
 
1.1%
계절2학기 37
 
0.4%

Length

2023-12-12T18:15:49.232518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:15:49.404337image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정기학기 5222
52.2%
정기2학기 4630
46.3%
계절학기 111
 
1.1%
계절2학기 37
 
0.4%

요일구분명
Categorical

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2498 
2188 
2173 
2112 
976 
Other values (2)
 
53

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
2498
25.0%
2188
21.9%
2173
21.7%
2112
21.1%
976
 
9.8%
37
 
0.4%
16
 
0.2%

Length

2023-12-12T18:15:49.550577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:15:49.705070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2498
25.0%
2188
21.9%
2173
21.7%
2112
21.1%
976
 
9.8%
37
 
0.4%
16
 
0.2%
Distinct68
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2023-12-12 07:00:00
Maximum2023-12-12 22:55:00
2023-12-12T18:15:50.190585image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:15:50.369815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct119
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T18:15:50.696160image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length5
Mean length4.9998
Min length4

Characters and Unicode

Total characters49998
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique27 ?
Unique (%)0.3%

Sample

1st row10:50
2nd row12:00
3rd row10:50
4th row11:45
5th row18:00
ValueCountFrequency (%)
12:00 712
 
7.1%
13:00 588
 
5.9%
15:00 574
 
5.7%
15:50 515
 
5.1%
16:00 459
 
4.6%
12:50 457
 
4.6%
11:00 449
 
4.5%
11:50 448
 
4.5%
18:00 433
 
4.3%
17:00 399
 
4.0%
Other values (109) 4966
49.7%
2023-12-12T18:15:51.195845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 12819
25.6%
1 12153
24.3%
: 9998
20.0%
5 6105
12.2%
2 2078
 
4.2%
4 2046
 
4.1%
3 1730
 
3.5%
6 1299
 
2.6%
7 1115
 
2.2%
8 539
 
1.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 40000
80.0%
Other Punctuation 9998
 
20.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 12819
32.0%
1 12153
30.4%
5 6105
15.3%
2 2078
 
5.2%
4 2046
 
5.1%
3 1730
 
4.3%
6 1299
 
3.2%
7 1115
 
2.8%
8 539
 
1.3%
9 116
 
0.3%
Other Punctuation
ValueCountFrequency (%)
: 9998
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 49998
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 12819
25.6%
1 12153
24.3%
: 9998
20.0%
5 6105
12.2%
2 2078
 
4.2%
4 2046
 
4.1%
3 1730
 
3.5%
6 1299
 
2.6%
7 1115
 
2.2%
8 539
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 49998
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 12819
25.6%
1 12153
24.3%
: 9998
20.0%
5 6105
12.2%
2 2078
 
4.2%
4 2046
 
4.1%
3 1730
 
3.5%
6 1299
 
2.6%
7 1115
 
2.2%
8 539
 
1.1%

Correlations

2023-12-12T18:15:51.311636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
수업구분명요일구분명학업시작시각
수업구분명1.0000.0440.112
요일구분명0.0441.0000.219
학업시작시각0.1120.2191.000
2023-12-12T18:15:51.415300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
요일구분명수업구분명
요일구분명1.0000.030
수업구분명0.0301.000
2023-12-12T18:15:51.526849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
수업구분명요일구분명
수업구분명1.0000.030
요일구분명0.0301.000

Missing values

2023-12-12T18:15:48.922716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:15:49.083727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

수업구분명요일구분명학업시작시각학업종료시각
25719정기학기09:0010:50
16896정기학기10:3012:00
27031정기학기09:0010:50
21134정기학기10:3011:45
21286정기학기15:0018:00
6837정기2학기15:0015:50
17148정기학기09:0010:50
4940정기2학기15:0016:30
13130정기2학기16:0016:50
25949정기학기09:0009:50
수업구분명요일구분명학업시작시각학업종료시각
1873정기2학기09:0012:00
21834정기학기10:3011:45
2434정기2학기13:0014:30
2543정기2학기16:3017:45
960정기2학기15:0017:00
10849정기2학기12:0013:15
1559정기학기16:0017:50
11758정기2학기15:0016:00
7534정기2학기09:0011:45
201정기2학기07:0008:00

Duplicate rows

Most frequently occurring

수업구분명요일구분명학업시작시각학업종료시각# duplicates
678정기학기11:0012:5058
604정기학기13:3014:4555
810정기학기13:3014:4555
921정기학기14:0015:5054
300정기2학기10:3011:4553
933정기학기15:0016:1551
216정기2학기11:0013:0050
817정기학기14:0015:5049
610정기학기14:0015:5048
711정기학기14:0015:5047