Overview

Dataset statistics

Number of variables4
Number of observations500
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory16.7 KiB
Average record size in memory34.3 B

Variable types

Numeric1
Categorical3

Dataset

Description본 데이터는 이어드림 스쿨 관련 데이터입니다. 이어드림 스쿨의 전공별, 학력별 선발(입교)현황 정보를 확인할 수 있습니다.
Author중소벤처기업진흥공단
URLhttps://www.data.go.kr/data/15124547/fileData.do

Alerts

연번 is highly overall correlated with 입교년도High correlation
입교년도 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 21:57:08.437775
Analysis finished2023-12-12 21:57:08.783898
Duration0.35 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct500
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean250.5
Minimum1
Maximum500
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.5 KiB
2023-12-13T06:57:08.847890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile25.95
Q1125.75
median250.5
Q3375.25
95-th percentile475.05
Maximum500
Range499
Interquartile range (IQR)249.5

Descriptive statistics

Standard deviation144.48183
Coefficient of variation (CV)0.57677378
Kurtosis-1.2
Mean250.5
Median Absolute Deviation (MAD)125
Skewness0
Sum125250
Variance20875
MonotonicityStrictly increasing
2023-12-13T06:57:08.982058image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
331 1
 
0.2%
344 1
 
0.2%
343 1
 
0.2%
342 1
 
0.2%
341 1
 
0.2%
340 1
 
0.2%
339 1
 
0.2%
338 1
 
0.2%
337 1
 
0.2%
Other values (490) 490
98.0%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
500 1
0.2%
499 1
0.2%
498 1
0.2%
497 1
0.2%
496 1
0.2%
495 1
0.2%
494 1
0.2%
493 1
0.2%
492 1
0.2%
491 1
0.2%

입교년도
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
2022
200 
2023
200 
2021
100 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021
2nd row2021
3rd row2021
4th row2021
5th row2021

Common Values

ValueCountFrequency (%)
2022 200
40.0%
2023 200
40.0%
2021 100
20.0%

Length

2023-12-13T06:57:09.113604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:57:09.194173image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022 200
40.0%
2023 200
40.0%
2021 100
20.0%

전공
Categorical

Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
비전공
367 
전공
133 

Length

Max length3
Median length3
Mean length2.734
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row비전공
2nd row비전공
3rd row비전공
4th row비전공
5th row비전공

Common Values

ValueCountFrequency (%)
비전공 367
73.4%
전공 133
 
26.6%

Length

2023-12-13T06:57:09.681967image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:57:09.809945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
비전공 367
73.4%
전공 133
 
26.6%

학력
Categorical

Distinct4
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
대학교_4년제
355 
고등학교
88 
대학교_2·3년제
40 
대학원_석사
 
17

Length

Max length9
Median length7
Mean length6.598
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대학교_4년제
2nd row대학교_4년제
3rd row대학교_4년제
4th row대학교_4년제
5th row대학교_2·3년제

Common Values

ValueCountFrequency (%)
대학교_4년제 355
71.0%
고등학교 88
 
17.6%
대학교_2·3년제 40
 
8.0%
대학원_석사 17
 
3.4%

Length

2023-12-13T06:57:09.957817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:57:10.104361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대학교_4년제 355
71.0%
고등학교 88
 
17.6%
대학교_2·3년제 40
 
8.0%
대학원_석사 17
 
3.4%

Interactions

2023-12-13T06:57:08.578939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T06:57:10.199266image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번입교년도전공학력
연번1.0001.0000.1040.081
입교년도1.0001.0000.0000.065
전공0.1040.0001.0000.211
학력0.0810.0650.2111.000
2023-12-13T06:57:10.317190image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
학력입교년도전공
학력1.0000.0620.140
입교년도0.0621.0000.000
전공0.1400.0001.000
2023-12-13T06:57:10.456196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번입교년도전공학력
연번1.0000.9930.0790.048
입교년도0.9931.0000.0000.062
전공0.0790.0001.0000.140
학력0.0480.0620.1401.000

Missing values

2023-12-13T06:57:08.676081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T06:57:08.747631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번입교년도전공학력
012021비전공대학교_4년제
122021비전공대학교_4년제
232021비전공대학교_4년제
342021비전공대학교_4년제
452021비전공대학교_2·3년제
562021비전공대학교_4년제
672021비전공대학교_2·3년제
782021전공대학원_석사
892021비전공대학교_4년제
9102021비전공대학교_4년제
연번입교년도전공학력
4904912023비전공대학교_4년제
4914922023비전공대학원_석사
4924932023비전공고등학교
4934942023비전공대학교_4년제
4944952023전공대학교_4년제
4954962023비전공대학교_4년제
4964972023전공대학교_4년제
4974982023전공대학교_4년제
4984992023전공대학교_4년제
4995002023비전공대학교_2·3년제