Overview

Dataset statistics

Number of variables: 7
Number of observations: 387
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 22.4 KiB
Average record size in memory: 59.3 B
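
The average record size follows from the total size and the row count. A minimal check, assuming the report rounds to one decimal place and uses 1 KiB = 1024 bytes:

```python
# Figures copied from the "Dataset statistics" block.
total_size_kib = 22.4
n_observations = 387

# 1 KiB = 1024 bytes, so convert before dividing by the row count.
total_size_bytes = total_size_kib * 1024
avg_record_bytes = total_size_bytes / n_observations

print(f"{avg_record_bytes:.1f} B")  # prints "59.3 B"
```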

Variable types

Categorical: 4
Text: 1
Numeric: 2

Dataset

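Description: Admission quota, average admission fee, and average tuition for each university, based on the university information disclosure published every April (branch schools and campuses are aggregated into the main campus).
URL: https://www.data.go.kr/data/3071171/fileData.do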

Alerts

평균입학금(원) (average admission fee, KRW) has constant value "0" (Constant)
평균등록금(원) (average tuition, KRW) is highly overall correlated with 학제별 (institution type) and 1 other field (High correlation)
학제별 is highly overall correlated with 평균등록금(원) (High correlation)
설립별 (founding type) is highly overall correlated with 평균등록금(원) (High correlation)
평균등록금(원) has 6 (1.6%) zeros (Zeros)
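
The Constant and Zeros alerts reduce to simple counts. The sketch below is not ydata-profiling's own code; the helper names and the stand-in columns are hypothetical and only mirror the row counts in the alerts (387 rows, 6 zeros).

```python
def zeros_share(values):
    """Return (count, percent) of entries equal to zero."""
    zeros = sum(1 for v in values if v == 0)
    return zeros, 100 * zeros / len(values)


def is_constant(values):
    """A column is constant when it holds exactly one distinct value."""
    return len(set(values)) == 1


# Hypothetical stand-in columns sized like the report: 387 rows, 6 of them zero.
tuition = [0] * 6 + [2432000] * 381   # placeholder values, not the real data
admission_fee = [0] * 387             # every row is 0, as the alert reports

count, pct = zeros_share(tuition)
print(count, f"{pct:.1f}%")        # 6 1.6%
print(is_constant(admission_fee))  # True
```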

Reproduction

Analysis started: 2023-12-12 15:16:20.928020
Analysis finished: 2023-12-12 15:16:22.206666
Duration: 1.28 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
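
A report like this one can be regenerated with ydata-profiling's `ProfileReport`. The sketch below is an illustration only: the function name, the CSV path, the `cp949` encoding, and the report title are assumptions and do not come from the report or its config.json.

```python
def build_report(csv_path, out_path="report.html", encoding="cp949"):
    """Profile a CSV with ydata-profiling and write an HTML report."""
    # Imported lazily so the function can be defined without the packages installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # The encoding is a guess; adjust it to match the downloaded file.
    df = pd.read_csv(csv_path, encoding=encoding)
    profile = ProfileReport(df, title="University tuition profile")
    profile.to_file(out_path)
    return profile
```

Calling `build_report("universities.csv")` with a local copy of the dataset (the file name is hypothetical) would write `report.html` next to the script.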

Variables

학제별 (institution type)
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 3.2 KiB

대학 (university): 219
전문대학 (junior college): 168

Length

Max length: 4
Median length: 2
Mean length: 2.8682171
Min length: 2
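
The mean length is the character length of each category weighted by its row count. A short check using only the two counts reported for 학제별:

```python
# Category counts from the 학제별 overview: value -> number of rows.
counts = {"대학": 219, "전문대학": 168}

total_rows = sum(counts.values())
# Weight each value's character length by how often it occurs.
mean_length = sum(len(value) * n for value, n in counts.items()) / total_rows

print(total_rows)             # 387
print(round(mean_length, 7))  # 2.8682171
```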

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 대학
2nd row: 대학
3rd row: 대학
4th row: 대학
5th row: 대학

Common Values

Value | Count | Frequency (%)
대학 | 219 | 56.6%
전문대학 | 168 | 43.4%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

설립별 (founding type)
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 3.2 KiB

사립 (private): 332
국공립 (national/public): 55

Length

Max length: 3
Median length: 2
Mean length: 2.1421189
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 국공립
2nd row: 국공립
3rd row: 국공립
4th row: 국공립
5th row: 국공립

Common Values

Value | Count | Frequency (%)
사립 | 332 | 85.8%
국공립 | 55 | 14.2%
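
The frequency column and the Distinct (%) figure can be recomputed from the counts alone. A minimal sketch using `collections.Counter`; the column is rebuilt from the reported counts, so the row order is not the real one:

```python
from collections import Counter

# Rebuild the 설립별 column from the counts in the table (row order is arbitrary here).
column = ["사립"] * 332 + ["국공립"] * 55

counts = Counter(column)
total = sum(counts.values())
for value, n in counts.most_common():
    print(value, n, f"{100 * n / total:.1f}%")
# 사립 332 85.8%
# 국공립 55 14.2%

# Distinct (%) is the number of different values over the number of rows.
print(f"{100 * len(counts) / total:.1f}%")  # 0.5%
```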

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

대학명 (university name)
Text

Distinct: 386
Distinct (%): 99.7%
Missing: 0
Missing (%): 0.0%
Memory size: 3.2 KiB

Length

Max length: 19
Median length: 18
Mean length: 7.1757106
Min length: 4

Characters and Unicode

Total characters: 2777
Distinct characters: 188
Distinct categories: 3
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
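
As an illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below classifies one university name taken from the sample rows of this report; it covers categories only, since the standard library has no script or block lookup.

```python
import unicodedata
from collections import Counter

# One name taken from the sample rows at the end of the report.
name = "한국폴리텍 VII 대학 창원캠퍼스"

# unicodedata.category returns the two-letter general category of a character:
# "Lo" = Other Letter, "Lu" = Uppercase Letter, "Zs" = Space Separator.
categories = Counter(unicodedata.category(ch) for ch in name)

print(len(name))         # 18
print(dict(categories))  # {'Lo': 12, 'Zs': 3, 'Lu': 3}
```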

Unique

Unique: 385
Unique (%): 99.5%
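
Distinct counts different values, while Unique counts values that occur exactly once. With 387 rows, 386 distinct values imply that one name appears twice, which leaves 385 unique names. A sketch with a hypothetical stand-in column (the names are placeholders, not the real data):

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (distinct, unique): different values vs. values seen exactly once."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique


# Hypothetical stand-in for the name column: 385 one-off names plus one name used twice.
names = [f"name-{i}" for i in range(385)] + ["repeated", "repeated"]

print(len(names))                  # 387
print(distinct_and_unique(names))  # (386, 385)
```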

Sample

1st row: 강릉원주대학교
2nd row: 강원대학교
3rd row: 경북대학교
4th row: 경상국립대학교
5th row: 경인교육대학교

Words

Value | Count | Frequency (%)
한국폴리텍 | 32 | 6.7%
대학 | 28 | 5.8%
v | 5 | 1.0%
vii | 5 | 1.0%
특성화대학 | 4 | 0.8%
i | 4 | 0.8%
iv | 4 | 0.8%
ii | 4 | 0.8%
iii | 3 | 0.6%
vi | 3 | 0.6%
Other values (386) | 387 | 80.8%

Most occurring characters

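Value | Count | Frequency (%)
(glyph lost) | 415 | 14.9%
(glyph lost) | 405 | 14.6%
(glyph lost) | 355 | 12.8%
(space) | 92 | 3.3%
(glyph lost) | 78 | 2.8%
(glyph lost) | 71 | 2.6%
I | 39 | 1.4%
(glyph lost) | 34 | 1.2%
(glyph lost) | 33 | 1.2%
(glyph lost) | 33 | 1.2%
Other values (178) | 1222 | 44.0%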

Most occurring categories

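Value | Count | Frequency (%)
Other Letter | 2627 | 94.6%
Space Separator | 92 | 3.3%
Uppercase Letter | 58 | 2.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(glyph lost) | 415 | 15.8%
(glyph lost) | 405 | 15.4%
(glyph lost) | 355 | 13.5%
(glyph lost) | 78 | 3.0%
(glyph lost) | 71 | 2.7%
(glyph lost) | 34 | 1.3%
(glyph lost) | 33 | 1.3%
(glyph lost) | 33 | 1.3%
(glyph lost) | 32 | 1.2%
(glyph lost) | 32 | 1.2%
Other values (173) | 1139 | 43.4%

Uppercase Letter
Value | Count | Frequency (%)
I | 39 | 67.2%
V | 17 | 29.3%
C | 1 | 1.7%
T | 1 | 1.7%

Space Separator
Value | Count | Frequency (%)
(space) | 92 | 100.0%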

Most occurring scripts

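Value | Count | Frequency (%)
Hangul | 2627 | 94.6%
Common | 92 | 3.3%
Latin | 58 | 2.1%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(glyph lost) | 415 | 15.8%
(glyph lost) | 405 | 15.4%
(glyph lost) | 355 | 13.5%
(glyph lost) | 78 | 3.0%
(glyph lost) | 71 | 2.7%
(glyph lost) | 34 | 1.3%
(glyph lost) | 33 | 1.3%
(glyph lost) | 33 | 1.3%
(glyph lost) | 32 | 1.2%
(glyph lost) | 32 | 1.2%
Other values (173) | 1139 | 43.4%

Latin
Value | Count | Frequency (%)
I | 39 | 67.2%
V | 17 | 29.3%
C | 1 | 1.7%
T | 1 | 1.7%

Common
Value | Count | Frequency (%)
(space) | 92 | 100.0%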

Most occurring blocks

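Value | Count | Frequency (%)
Hangul | 2627 | 94.6%
ASCII | 150 | 5.4%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
(glyph lost) | 415 | 15.8%
(glyph lost) | 405 | 15.4%
(glyph lost) | 355 | 13.5%
(glyph lost) | 78 | 3.0%
(glyph lost) | 71 | 2.7%
(glyph lost) | 34 | 1.3%
(glyph lost) | 33 | 1.3%
(glyph lost) | 33 | 1.3%
(glyph lost) | 32 | 1.2%
(glyph lost) | 32 | 1.2%
Other values (173) | 1139 | 43.4%

ASCII
Value | Count | Frequency (%)
(space) | 92 | 61.3%
I | 39 | 26.0%
V | 17 | 11.3%
C | 1 | 0.7%
T | 1 | 0.7%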

지역별 (region)
Categorical

Distinct: 17
Distinct (%): 4.4%
Missing: 0
Missing (%): 0.0%
Memory size: 3.2 KiB

경기: 64
서울: 63
경북: 37
충남: 26
부산: 25
Other values (12): 172

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 강원
2nd row: 강원
3rd row: 대구
4th row: 경남
5th row: 인천

Common Values

Value | Count | Frequency (%)
경기 | 64 | 16.5%
서울 | 63 | 16.3%
경북 | 37 | 9.6%
충남 | 26 | 6.7%
부산 | 25 | 6.5%
경남 | 23 | 5.9%
전남 | 21 | 5.4%
전북 | 21 | 5.4%
강원 | 19 | 4.9%
광주 | 18 | 4.7%
Other values (7) | 70 | 18.1%
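
The "Other values" row is whatever remains after the ten listed regions are subtracted from the 387 rows. A quick check of that arithmetic, using only the counts shown for 지역별:

```python
# Region counts listed in the 지역별 "Common Values" table (top 10 only).
top10 = {"경기": 64, "서울": 63, "경북": 37, "충남": 26, "부산": 25,
         "경남": 23, "전남": 21, "전북": 21, "강원": 19, "광주": 18}
n_rows = 387

# Rows not covered by the ten listed regions fall into the "Other values" bucket.
other = n_rows - sum(top10.values())
print(other, f"{100 * other / n_rows:.1f}%")  # 70 18.1%
```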

Length

[Figure: histogram of lengths of the category]

입학정원 합(명) (total admission quota, persons)
Real number (ℝ)

Distinct: 367
Distinct (%): 94.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 5147.8966
Minimum: 0
Maximum: 232190
Zeros: 3
Zeros (%): 0.8%
Negative: 0
Negative (%): 0.0%
Memory size: 3.5 KiB

Quantile statistics

Minimum: 0
5-th percentile: 203.7
Q1: 1140.5
Median: 3143
Q3: 6311
95-th percentile: 15512.6
Maximum: 232190
Range: 232190
Interquartile range (IQR): 5170.5
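
The range and IQR follow directly from the quantiles. The sketch below also applies Tukey's 1.5 × IQR upper fence to show how far the maximum sits from the bulk of the data; that rule is an assumption added here and is not part of the report.

```python
q1, q3 = 1140.5, 6311.0   # quartiles of 입학정원 합(명) from the report
minimum, maximum = 0, 232190

iqr = q3 - q1
value_range = maximum - minimum
upper_fence = q3 + 1.5 * iqr  # Tukey's rule, an assumption added for illustration

print(iqr)                    # 5170.5
print(value_range)            # 232190
print(upper_fence)            # 14066.75
print(maximum > upper_fence)  # True
```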

Descriptive statistics

Standard deviation: 12431.344
Coefficient of variation (CV): 2.4148394
Kurtosis: 289.81882
Mean: 5147.8966
Median Absolute Deviation (MAD): 2305
Skewness: 15.921301
Sum: 1992236
Variance: 1.5453831 × 10^8
Monotonicity: Not monotonic
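
Several of these figures are tied together: the mean is the sum over the 387 rows, the coefficient of variation is the standard deviation over the mean, and the variance is the squared standard deviation. A short consistency check using the reported values:

```python
n = 387
total = 1992236   # "Sum" from the report
std = 12431.344   # "Standard deviation" from the report

mean = total / n
cv = std / mean        # coefficient of variation
variance = std ** 2

print(round(mean, 4))     # 5147.8966
print(round(cv, 4))       # 2.4148
print(f"{variance:.4e}")  # 1.5454e+08
```
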

[Figure: histogram with fixed size bins (bins=50)]

Common values

Value | Count | Frequency (%)
200 | 4 | 1.0%
360 | 3 | 0.8%
50 | 3 | 0.8%
0 | 3 | 0.8%
480 | 3 | 0.8%
720 | 2 | 0.5%
560 | 2 | 0.5%
2973 | 2 | 0.5%
160 | 2 | 0.5%
8710 | 2 | 0.5%
Other values (357) | 361 | 93.3%

Minimum 10 values

Value | Count | Frequency (%)
0 | 3 | 0.8%
36 | 1 | 0.3%
50 | 3 | 0.8%
100 | 2 | 0.5%
120 | 1 | 0.3%
128 | 1 | 0.3%
145 | 1 | 0.3%
160 | 2 | 0.5%
174 | 1 | 0.3%
200 | 4 | 1.0%

Maximum 10 values

Value | Count | Frequency (%)
232190 | 1 | 0.3%
21423 | 1 | 0.3%
19869 | 1 | 0.3%
19823 | 1 | 0.3%
19346 | 1 | 0.3%
19070 | 1 | 0.3%
18692 | 1 | 0.3%
18606 | 1 | 0.3%
18374 | 1 | 0.3%
18255 | 1 | 0.3%

평균입학금(원) (average admission fee, KRW)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 3.2 KiB

0: 387

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value | Count | Frequency (%)
0 | 387 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

평균등록금(원) (average tuition, KRW)
Real number (ℝ)

HIGH CORRELATION, ZEROS

Distinct: 363
Distinct (%): 93.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 5673873.3
Minimum: 0
Maximum: 9034616
Zeros: 6
Zeros (%): 1.6%
Negative: 0
Negative (%): 0.0%
Memory size: 3.5 KiB

Quantile statistics

Minimum: 0
5-th percentile: 2195019.9
Q1: 4276230.5
Median: 6211367
Q3: 7171223
95-th percentile: 8201451.9
Maximum: 9034616
Range: 9034616
Interquartile range (IQR): 2894992.5

Descriptive statistics

Standard deviation: 2014170.8
Coefficient of variation (CV): 0.35499044
Kurtosis: -0.2462094
Mean: 5673873.3
Median Absolute Deviation (MAD): 1058757
Skewness: -0.79162306
Sum: 2.195789 × 10^9
Variance: 4.0568839 × 10^12
Monotonicity: Not monotonic
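
The six zero rows pull the mean down. If those zeros stand for unreported tuition rather than genuinely free tuition (an assumption the report does not confirm), the mean over the remaining rows can be estimated from the reported sum. The sum is printed with only seven significant digits, so the results are approximate.

```python
n_rows = 387
n_zeros = 6
total = 2.195789e9  # "Sum" as printed in the report (rounded to 7 significant digits)

mean_all = total / n_rows
# Zero rows add nothing to the sum, so dropping them only shrinks the denominator.
mean_nonzero = total / (n_rows - n_zeros)

print(round(mean_all))      # 5673873
print(round(mean_nonzero))  # 5763226
```
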

[Figure: histogram with fixed size bins (bins=50)]

Common values

Value | Count | Frequency (%)
2432000 | 13 | 3.4%
0 | 6 | 1.6%
2164000 | 5 | 1.3%
2494000 | 4 | 1.0%
4262795 | 1 | 0.3%
5925590 | 1 | 0.3%
5363375 | 1 | 0.3%
5813980 | 1 | 0.3%
6353625 | 1 | 0.3%
6682105 | 1 | 0.3%
Other values (353) | 353 | 91.2%

Minimum 10 values

Value | Count | Frequency (%)
0 | 6 | 1.6%
760969 | 1 | 0.3%
1215000 | 1 | 0.3%
1760000 | 1 | 0.3%
1868552 | 1 | 0.3%
1934625 | 1 | 0.3%
2000000 | 1 | 0.3%
2060000 | 1 | 0.3%
2164000 | 5 | 1.3%
2174000 | 1 | 0.3%

Maximum 10 values

Value | Count | Frequency (%)
9034616 | 1 | 0.3%
9012272 | 1 | 0.3%
9000000 | 1 | 0.3%
8815848 | 1 | 0.3%
8786286 | 1 | 0.3%
8742198 | 1 | 0.3%
8543761 | 1 | 0.3%
8529148 | 1 | 0.3%
8510869 | 1 | 0.3%
8448561 | 1 | 0.3%

Interactions

[Figures: interaction plots]

Correlations

[Figure: correlation heatmap]

Variable | 학제별 | 설립별 | 지역별 | 입학정원 합(명) | 평균등록금(원)
학제별 | 1.000 | 0.345 | 0.246 | 0.000 | 0.821
설립별 | 0.345 | 1.000 | 0.093 | 0.018 | 0.891
지역별 | 0.246 | 0.093 | 1.000 | 0.000 | 0.335
입학정원 합(명) | 0.000 | 0.018 | 0.000 | 1.000 | 0.451
평균등록금(원) | 0.821 | 0.891 | 0.335 | 0.451 | 1.000

[Figure: correlation heatmap]

Variable | 학제별 | 지역별 | 설립별
학제별 | 1.000 | 0.216 | 0.224
지역별 | 0.216 | 1.000 | 0.081
설립별 | 0.224 | 0.081 | 1.000

[Figure: correlation heatmap]

Variable | 입학정원 합(명) | 평균등록금(원) | 학제별 | 설립별 | 지역별
입학정원 합(명) | 1.000 | 0.444 | 0.000 | 0.011 | 0.000
평균등록금(원) | 0.444 | 1.000 | 0.648 | 0.719 | 0.135
학제별 | 0.000 | 0.648 | 1.000 | 0.224 | 0.216
설립별 | 0.011 | 0.719 | 0.224 | 1.000 | 0.081
지역별 | 0.000 | 0.135 | 0.216 | 0.081 | 1.000
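
The High correlation alerts can be traced back to the first correlation matrix by listing the variable pairs whose coefficient passes a cutoff. The 0.5 threshold below is an assumption chosen for illustration; the report does not state the value it used.

```python
from itertools import combinations

columns = ["학제별", "설립별", "지역별", "입학정원 합(명)", "평균등록금(원)"]
# First correlation matrix from the report, row by row.
matrix = [
    [1.000, 0.345, 0.246, 0.000, 0.821],
    [0.345, 1.000, 0.093, 0.018, 0.891],
    [0.246, 0.093, 1.000, 0.000, 0.335],
    [0.000, 0.018, 0.000, 1.000, 0.451],
    [0.821, 0.891, 0.335, 0.451, 1.000],
]
THRESHOLD = 0.5  # assumed cutoff, not stated in the report

# Visit each unordered pair once (upper triangle of the matrix).
high_pairs = [
    (columns[i], columns[j], matrix[i][j])
    for i, j in combinations(range(len(columns)), 2)
    if abs(matrix[i][j]) >= THRESHOLD
]
for a, b, r in high_pairs:
    print(a, b, r)
# 학제별 평균등록금(원) 0.821
# 설립별 평균등록금(원) 0.891
```

Under that assumed cutoff, the two pairs found match the alerts: 평균등록금(원) is flagged against 학제별 and one other field, 설립별.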

Missing values

[Figure: nullity bar chart] A simple visualization of nullity by column.
[Figure: nullity matrix] The nullity matrix is a data-dense display that lets you quickly pick out visual patterns in data completion.

Sample

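First rows

Index | 학제별 | 설립별 | 대학명 | 지역별 | 입학정원 합(명) | 평균입학금(원) | 평균등록금(원)
0 | 대학 | 국공립 | 강릉원주대학교 | 강원 | 7259 | 0 | 4262795
1 | 대학 | 국공립 | 강원대학교 | 강원 | 18255 | 0 | 4145285
2 | 대학 | 국공립 | 경북대학교 | 대구 | 19070 | 0 | 4499843
3 | 대학 | 국공립 | 경상국립대학교 | 경남 | 17329 | 0 | 4042102
4 | 대학 | 국공립 | 경인교육대학교 | 인천 | 2392 | 0 | 3316000
5 | 대학 | 국공립 | 공주교육대학교 | 충남 | 1415 | 0 | 3424000
6 | 대학 | 국공립 | 공주대학교 | 충남 | 11115 | 0 | 3828317
7 | 대학 | 국공립 | 광주과학기술원 | 광주 | 800 | 0 | 2060000
8 | 대학 | 국공립 | 광주교육대학교 | 광주 | 1303 | 0 | 3614682
9 | 대학 | 국공립 | 군산대학교 | 전북 | 6888 | 0 | 3918285

Last rows

Index | 학제별 | 설립별 | 대학명 | 지역별 | 입학정원 합(명) | 평균입학금(원) | 평균등록금(원)
377 | 전문대학 | 사립 | 한국폴리텍 VII 대학 창원캠퍼스 | 경남 | 1180 | 0 | 2432000
378 | 전문대학 | 사립 | 한국폴리텍 특성화대학 로봇캠퍼스 | 경북 | 200 | 0 | 2164000
379 | 전문대학 | 사립 | 한국폴리텍 특성화대학 바이오캠퍼스 | 충남 | 360 | 0 | 2595000
380 | 전문대학 | 사립 | 한국폴리텍 특성화대학 섬유패션캠퍼스 | 대구 | 305 | 0 | 2292344
381 | 전문대학 | 사립 | 한국폴리텍 특성화대학 항공캠퍼스 | 경남 | 440 | 0 | 2433568
382 | 전문대학 | 사립 | 한림성심대학교 | 강원 | 2348 | 0 | 5741603
383 | 전문대학 | 사립 | 한양여자대학교 | 서울 | 6287 | 0 | 6189253
384 | 전문대학 | 사립 | 한영대학교 | 전남 | 1196 | 0 | 5611605
385 | 전문대학 | 사립 | 혜전대학교 | 충남 | 2603 | 0 | 5828352
386 | 전문대학 | 사립 | 호산대학교 | 경북 | 1852 | 0 | 5801199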