Overview

Dataset statistics

Number of variables3
Number of observations26
Missing cells0
Missing cells (%)0.0%
Duplicate rows2
Duplicate rows (%)7.7%
Total size in memory808.0 B
Average record size in memory31.1 B

Variable types

Numeric1
Categorical1
DateTime1

Dataset

Description온라인 원서접수 일정관리를 위해 사용하는 데이터로 온라인 입시원서 접수를 받기위해 필요한 년도, 학기, 처리일자 데이터
URLhttps://www.data.go.kr/data/15093628/fileData.do

Alerts

Dataset has 2 (7.7%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 17:09:38.178894
Analysis finished2023-12-12 17:09:38.517418
Duration0.34 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

년도
Real number (ℝ)

Distinct11
Distinct (%)42.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2017.3846
Minimum2012
Maximum2023
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size366.0 B
2023-12-13T02:09:38.610308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2012
5-th percentile2013
Q12015.25
median2017.5
Q32019.75
95-th percentile2022
Maximum2023
Range11
Interquartile range (IQR)4.5

Descriptive statistics

Standard deviation3.1505799
Coefficient of variation (CV)0.0015617151
Kurtosis-0.94103229
Mean2017.3846
Median Absolute Deviation (MAD)2.5
Skewness-0.019136598
Sum52452
Variance9.9261538
MonotonicityNot monotonic
2023-12-13T02:09:38.737223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
2013 4
15.4%
2016 4
15.4%
2018 3
11.5%
2019 3
11.5%
2015 2
7.7%
2017 2
7.7%
2020 2
7.7%
2021 2
7.7%
2022 2
7.7%
2012 1
 
3.8%
ValueCountFrequency (%)
2012 1
 
3.8%
2013 4
15.4%
2015 2
7.7%
2016 4
15.4%
2017 2
7.7%
2018 3
11.5%
2019 3
11.5%
2020 2
7.7%
2021 2
7.7%
2022 2
7.7%
ValueCountFrequency (%)
2023 1
 
3.8%
2022 2
7.7%
2021 2
7.7%
2020 2
7.7%
2019 3
11.5%
2018 3
11.5%
2017 2
7.7%
2016 4
15.4%
2015 2
7.7%
2013 4
15.4%

학기
Categorical

Distinct3
Distinct (%)11.5%
Missing0
Missing (%)0.0%
Memory size340.0 B
1
21 
2
3
 
2

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row3
4th row1
5th row3

Common Values

ValueCountFrequency (%)
1 21
80.8%
2 3
 
11.5%
3 2
 
7.7%

Length

2023-12-13T02:09:38.869541image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:09:38.956618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 21
80.8%
2 3
 
11.5%
3 2
 
7.7%
Distinct23
Distinct (%)88.5%
Missing0
Missing (%)0.0%
Memory size340.0 B
Minimum2013-05-03 00:00:00
Maximum2022-04-06 00:00:00
2023-12-13T02:09:39.040138image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:09:39.152673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=23)

Interactions

2023-12-13T02:09:38.274623image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T02:09:39.451138image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
년도학기처리일자
년도1.0000.2051.000
학기0.2051.0000.537
처리일자1.0000.5371.000
2023-12-13T02:09:39.533018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
년도학기
년도1.0000.000
학기0.0001.000

Missing values

2023-12-13T02:09:38.379823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T02:09:38.475542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

년도학기처리일자
0201212013-05-03
1201312013-07-31
2201332013-10-16
3201312014-01-03
4201332014-01-03
5201512015-01-02
6201512015-06-08
7201612016-01-13
8201622016-05-25
9201612016-08-16
년도학기처리일자
16201922019-01-30
17201912019-02-07
18202012019-12-19
19201912020-02-17
20202012020-03-19
21202112020-05-22
22202112020-05-22
23202212022-04-05
24202212022-04-05
25202312022-04-06

Duplicate rows

Most frequently occurring

년도학기처리일자# duplicates
0202112020-05-222
1202212022-04-052