Overview

Dataset statistics

Number of variables4
Number of observations120
Missing cells0
Missing cells (%)0.0%
Duplicate rows40
Duplicate rows (%)33.3%
Total size in memory4.2 KiB
Average record size in memory36.1 B

Variable types

Categorical3
DateTime1

Dataset

Description학교 수업중 프로젝트 교과와 관련하여 사용하는 데이터로 프로젝트 평가를 위해 사용하는 여러값 중 년도, 학기, 중분류코드, 처리일자를 제공
URLhttps://www.data.go.kr/data/15093787/fileData.do

Alerts

Dataset has 40 (33.3%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 13:41:09.591954
Analysis finished2023-12-12 13:41:09.876660
Duration0.28 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

년도
Categorical

Distinct5
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2017
36 
2018
36 
2019
24 
2016
12 
2020
12 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2016
2nd row2016
3rd row2016
4th row2016
5th row2016

Common Values

ValueCountFrequency (%)
2017 36
30.0%
2018 36
30.0%
2019 24
20.0%
2016 12
 
10.0%
2020 12
 
10.0%

Length

2023-12-12T22:41:09.940438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:41:10.059470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2017 36
30.0%
2018 36
30.0%
2019 24
20.0%
2016 12
 
10.0%
2020 12
 
10.0%

학기
Categorical

Distinct3
Distinct (%)2.5%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
1
48 
3
36 
2
36 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3
2nd row3
3rd row3
4th row3
5th row3

Common Values

ValueCountFrequency (%)
1 48
40.0%
3 36
30.0%
2 36
30.0%

Length

2023-12-12T22:41:10.194739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:41:10.634782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 48
40.0%
3 36
30.0%
2 36
30.0%

중분류코드
Categorical

Distinct5
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
1
30 
2
30 
3
30 
4
20 
5
10 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row2
5th row2

Common Values

ValueCountFrequency (%)
1 30
25.0%
2 30
25.0%
3 30
25.0%
4 20
16.7%
5 10
 
8.3%

Length

2023-12-12T22:41:10.757287image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:41:10.876875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 30
25.0%
2 30
25.0%
3 30
25.0%
4 20
16.7%
5 10
 
8.3%
Distinct10
Distinct (%)8.3%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
Minimum2017-02-21 00:00:00
Maximum2020-06-04 00:00:00
2023-12-12T22:41:10.987579image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:41:11.111413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=10)

Correlations

2023-12-12T22:41:11.186448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
년도학기중분류코드처리일자
년도1.0000.5270.0001.000
학기0.5271.0000.0001.000
중분류코드0.0000.0001.0000.000
처리일자1.0001.0000.0001.000
2023-12-12T22:41:11.290405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
중분류코드학기년도
중분류코드1.0000.0000.000
학기0.0001.0000.462
년도0.0000.4621.000
2023-12-12T22:41:11.381658image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
년도학기중분류코드
년도1.0000.4620.000
학기0.4621.0000.000
중분류코드0.0000.0001.000

Missing values

2023-12-12T22:41:09.754679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T22:41:09.839192image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

년도학기중분류코드처리일자
02016312017-02-21
12016312017-02-21
22016312017-02-21
32016322017-02-21
42016322017-02-21
52016322017-02-21
62016332017-02-21
72016332017-02-21
82016332017-02-21
92016342017-02-21
년도학기중분류코드처리일자
1102020112020-06-04
1112020122020-06-04
1122020122020-06-04
1132020122020-06-04
1142020132020-06-04
1152020132020-06-04
1162020132020-06-04
1172020142020-06-04
1182020142020-06-04
1192020152020-06-04

Duplicate rows

Most frequently occurring

년도학기중분류코드처리일자# duplicates
02016312017-02-213
12016322017-02-213
22016332017-02-213
42017112017-03-073
52017122017-03-073
62017132017-03-073
82017212017-07-063
92017222017-07-063
102017232017-07-063
122017312017-06-123