Overview

Dataset statistics

Number of variables: 7
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 673.8 KiB
Average record size in memory: 69.0 B
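
The average record size appears to be the total in-memory size divided by the number of observations. The sketch below checks that arithmetic with the two figures reported above; the variable names are illustrative, and the 1 KiB = 1024 bytes conversion is an assumption about how the report rounds its figures.

```python
# Figures copied from the "Dataset statistics" block of the report.
total_size_kib = 673.8      # "Total size in memory"
n_observations = 10_000     # "Number of observations"

# ASSUMPTION: 1 KiB = 1024 bytes (binary prefix).
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"{avg_record_bytes:.1f} B")
```

Rounded to one decimal place, the result agrees with the 69.0 B shown in the report.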

Variable types

Numeric: 2
Categorical: 5

Dataset

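Description: Provides information on award amounts for the National Excellence Scholarship (Science and Engineering) (국가우수장학금(이공계)), including amounts by province/city (시도), establishment type (설립구분), and year of study (학년). The scholarship is a program that actively steers outstanding students toward science and engineering and develops them into a core national talent pool. For details, see the Korea Student Aid Foundation (한국장학재단) website (http://www.kosaf.go.kr/ko/scholar.do?pg=scholarship05_06_01).
URL: https://www.data.go.kr/data/15114952/fileData.do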

Alerts

연도 has constant value "2022" (Constant)
순번 is highly overall correlated with 시도 (High correlation)
지원금액 is highly overall correlated with 설립구분 (High correlation)
시도 is highly overall correlated with 순번 (High correlation)
설립구분 is highly overall correlated with 지원금액 (High correlation)
순번 has unique values (Unique)
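
The Constant and Unique alerts can be illustrated with a simple distinct-count check. The sketch below is not the profiling tool's own implementation; `flag_columns` is a hypothetical helper, and the 20 rows are the ones listed in the Sample section of this report, so the result only illustrates the idea on a small subset.

```python
COLUMNS = ["순번", "시도", "설립구분", "학년", "연도", "학기", "지원금액"]

# Rows copied from the Sample section of the report.
ROWS = [
    (9912, "경북", "사립", "2", "2022", "2", 4075000),
    (6924, "충남", "국립", "4", "2022", "2", 1712000),
    (11891, "서울", "국립", "4", "2022", "2", 2998000),
    (10459, "부산", "국립", "4", "2022", "2", 2041500),
    (11265, "부산", "국립", "4", "2022", "2", 2364000),
    (3582, "충북", "국립", "2", "2022", "1", 1437000),
    (3043, "경북", "사립", "3", "2022", "1", 5290000),
    (9456, "서울", "사립", "1", "2022", "1", 4857000),
    (4562, "부산", "사립", "3", "2022", "2", 3245000),
    (10382, "부산", "국립", "3", "2022", "1", 2173500),
    (6565, "서울", "사립", "3", "2022", "2", 4201000),
    (265, "충북", "사립", "1", "2022", "2", 4491000),
    (10384, "부산", "국립", "3", "2022", "2", 2173500),
    (4110, "서울", "공립", "1", "2022", "1", 1228500),
    (13388, "울산", "국립", "4", "2022", "2", 3144000),
    (3358, "대전", "국립", "3", "2022", "2", 3433000),
    (7655, "전남", "국립", "1", "2022", "2", 1887000),
    (5011, "대전", "국립", "3", "2022", "1", 2059000),
    (12507, "충남", "사립", "1", "2022", "1", 4445000),
    (1042, "대구", "국립", "3", "2022", "1", 2331000),
]


def flag_columns(columns, rows):
    """Return {column: "Constant" or "Unique"} for columns that trip either check."""
    flags = {}
    for i, name in enumerate(columns):
        distinct = {row[i] for row in rows}
        if len(distinct) == 1:
            flags[name] = "Constant"      # every row holds the same value
        elif len(distinct) == len(rows):
            flags[name] = "Unique"        # no value repeats
    return flags


print(flag_columns(COLUMNS, ROWS))
```

On these rows the check flags 순번 as Unique and 연도 as Constant, matching the alerts above. A column can look unique on a small subset by chance, so the check is only meaningful on the full data.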

Reproduction

Analysis started: 2023-12-12 05:09:31.428499
Analysis finished: 2023-12-12 05:09:32.941537
Duration: 1.51 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6923.5429
Minimum: 1
Maximum: 13850
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 709.95
Q1: 3464.75
Median: 6856.5
Q3: 10427.25
95-th percentile: 13160.1
Maximum: 13850
Range: 13849
Interquartile range (IQR): 6962.5

Descriptive statistics

Standard deviation: 4005.9831
Coefficient of variation (CV): 0.57860306
Kurtosis: -1.209365
Mean: 6923.5429
Median Absolute Deviation (MAD): 3477.5
Skewness: 0.010339944
Sum: 69235429
Variance: 16047901
Monotonicity: Not monotonic
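
Several of the figures for 순번 follow from others by simple arithmetic. The sketch below recomputes the range, IQR, coefficient of variation, variance, and sum from the reported minimum, maximum, quartiles, mean, and standard deviation. The variable names are illustrative, and because the inputs are already rounded in the report, the derived values agree only up to rounding.

```python
# Values copied from the report for 순번.
n = 10_000
mean = 6923.5429
std = 4005.9831
minimum, maximum = 1, 13850
q1, q3 = 3464.75, 10427.25

value_range = maximum - minimum   # 13849
iqr = q3 - q1                     # 6962.5
cv = std / mean                   # coefficient of variation = std / mean
variance = std ** 2               # variance = std squared
total = mean * n                  # sum = mean * number of observations

print(value_range, iqr, cv, variance, total)
```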
Histogram with fixed size bins (bins=50)

Common values

Value                  Count   Frequency (%)
9912                   1       < 0.1%
4764                   1       < 0.1%
7967                   1       < 0.1%
10409                  1       < 0.1%
9656                   1       < 0.1%
7107                   1       < 0.1%
11196                  1       < 0.1%
8753                   1       < 0.1%
13022                  1       < 0.1%
2862                   1       < 0.1%
Other values (9990)    9990    99.9%

Minimum 10 values

Value    Count   Frequency (%)
1        1       < 0.1%
3        1       < 0.1%
4        1       < 0.1%
6        1       < 0.1%
7        1       < 0.1%
8        1       < 0.1%
9        1       < 0.1%
10       1       < 0.1%
11       1       < 0.1%
12       1       < 0.1%

Maximum 10 values

Value    Count   Frequency (%)
13850    1       < 0.1%
13849    1       < 0.1%
13848    1       < 0.1%
13846    1       < 0.1%
13845    1       < 0.1%
13844    1       < 0.1%
13843    1       < 0.1%
13840    1       < 0.1%
13839    1       < 0.1%
13838    1       < 0.1%

시도
Categorical

HIGH CORRELATION 

Distinct: 17
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

서울: 2279
부산: 1167
경북: 856
대전: 733
대구: 696
Other values (12): 4269

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 경북
2nd row: 충남
3rd row: 서울
4th row: 부산
5th row: 부산

Common Values

Value               Count   Frequency (%)
서울                2279    22.8%
부산                1167    11.7%
경북                856     8.6%
대전                733     7.3%
대구                696     7.0%
경기                654     6.5%
충남                619     6.2%
광주                572     5.7%
충북                569     5.7%
경남                450     4.5%
Other values (7)    1405    14.1%
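
The Frequency (%) column is each count divided by the 10000 observations, and the "Other values" row holds whatever the ten listed categories leave over. A minimal sketch of that arithmetic, using the counts reported for 시도 (the variable names are illustrative):

```python
# Counts copied from the Common Values table for 시도.
n_observations = 10_000
top_counts = {
    "서울": 2279, "부산": 1167, "경북": 856, "대전": 733, "대구": 696,
    "경기": 654, "충남": 619, "광주": 572, "충북": 569, "경남": 450,
}

# Rows not covered by the ten listed categories fall into "Other values".
other_count = n_observations - sum(top_counts.values())

# Frequency (%) = 100 * count / total.
frequencies = {name: 100 * count / n_observations for name, count in top_counts.items()}

print(other_count)
print(f"{frequencies['서울']:.1f}%")
```

The remainder comes out to 1405, the count shown for the seven remaining categories.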

Length

Histogram of lengths of the category

설립구분
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

국립: 5279
사립: 4671
공립: 50

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 사립
2nd row: 국립
3rd row: 국립
4th row: 국립
5th row: 국립

Common Values

Value    Count   Frequency (%)
국립     5279    52.8%
사립     4671    46.7%
공립     50      0.5%

Length

Histogram of lengths of the category

Common Values (Plot)


학년
Categorical

Distinct: 5
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

3: 3650
4: 3150
1: 1825
2: 1336
5: 39

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2
2nd row: 4
3rd row: 4
4th row: 4
5th row: 4

Common Values

Value    Count   Frequency (%)
3        3650    36.5%
4        3150    31.5%
1        1825    18.2%
2        1336    13.4%
5        39      0.4%

Length

Histogram of lengths of the category

Common Values (Plot)


연도
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

2022: 10000

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2022
2nd row: 2022
3rd row: 2022
4th row: 2022
5th row: 2022

Common Values

Value    Count    Frequency (%)
2022     10000    100.0%

Length

Histogram of lengths of the category

Common Values (Plot)


학기
Categorical

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

1: 5123
2: 4877

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2
2nd row: 2
3rd row: 2
4th row: 2
5th row: 2

Common Values

Value    Count   Frequency (%)
1        5123    51.2%
2        4877    48.8%

Length

Histogram of lengths of the category

Common Values (Plot)


지원금액
Real number (ℝ)

HIGH CORRELATION 

Distinct: 473
Distinct (%): 4.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3172009.3
Minimum: 753000
Maximum: 7214000
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 753000
5-th percentile: 1711790
Q1: 2179000
Median: 2998000
Q3: 4199000
95-th percentile: 4684000
Maximum: 7214000
Range: 6461000
Interquartile range (IQR): 2020000

Descriptive statistics

Standard deviation: 1113574.7
Coefficient of variation (CV): 0.35106288
Kurtosis: -0.89136895
Mean: 3172009.3
Median Absolute Deviation (MAD): 948000
Skewness: 0.29742887
Sum: 3.1720093 × 10^10
Variance: 1.2400487 × 10^12
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Common values

Value                 Count   Frequency (%)
2364000               435     4.3%
2331000               329     3.3%
3433000               283     2.8%
2179000               228     2.3%
2173500               211     2.1%
2220000               207     2.1%
4075000               204     2.0%
2422000               200     2.0%
4630000               196     2.0%
2790000               167     1.7%
Other values (463)    7540    75.4%

Minimum 10 values

Value      Count   Frequency (%)
753000     1       < 0.1%
923400     1       < 0.1%
1030000    69      0.7%
1126500    4       < 0.1%
1228500    7       0.1%
1350500    34      0.3%
1385100    2       < 0.1%
1386500    1       < 0.1%
1437000    40      0.4%
1460000    25      0.2%

Maximum 10 values

Value      Count   Frequency (%)
7214000    1       < 0.1%
7213000    1       < 0.1%
7130000    3       < 0.1%
7080000    2       < 0.1%
7057000    1       < 0.1%
7018000    2       < 0.1%
6993000    3       < 0.1%
6992000    3       < 0.1%
6914000    1       < 0.1%
6869000    11      0.1%

Interactions


Correlations

            순번     시도     설립구분   학년     학기     지원금액
순번        1.000    0.847    0.483      0.116    0.000    0.700
시도        0.847    1.000    0.687      0.117    0.000    0.691
설립구분    0.483    0.687    1.000      0.061    0.000    0.836
학년        0.116    0.117    0.061      1.000    0.055    0.116
학기        0.000    0.000    0.000      0.055    1.000    0.000
지원금액    0.700    0.691    0.836      0.116    0.000    1.000
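
The two variable pairs named in the Alerts section are the two largest off-diagonal entries of this first correlation matrix. The sketch below lists the pairs above a cutoff; `pairs_above` is a hypothetical helper, and the 0.8 cutoff is an illustrative assumption chosen to isolate those two pairs, not a setting stated in the report.

```python
from itertools import combinations

# First correlation matrix from the report (values copied from the source).
NAMES = ["순번", "시도", "설립구분", "학년", "학기", "지원금액"]
MATRIX = [
    [1.000, 0.847, 0.483, 0.116, 0.000, 0.700],
    [0.847, 1.000, 0.687, 0.117, 0.000, 0.691],
    [0.483, 0.687, 1.000, 0.061, 0.000, 0.836],
    [0.116, 0.117, 0.061, 1.000, 0.055, 0.116],
    [0.000, 0.000, 0.000, 0.055, 1.000, 0.000],
    [0.700, 0.691, 0.836, 0.116, 0.000, 1.000],
]


def pairs_above(names, matrix, threshold):
    """List (a, b, value) for each unordered pair whose correlation exceeds threshold."""
    return [
        (names[i], names[j], matrix[i][j])
        for i, j in combinations(range(len(names)), 2)
        if matrix[i][j] > threshold
    ]


# ASSUMPTION: 0.8 is an illustrative cutoff, not a value taken from the report.
for a, b, value in pairs_above(NAMES, MATRIX, threshold=0.8):
    print(a, b, value)
```

With this cutoff the sketch lists 순번 with 시도 (0.847) and 설립구분 with 지원금액 (0.836). Lowering the cutoff to 0.5 would add three more pairs, so the choice of threshold matters when reading the alerts.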

            설립구분   학년     시도     학기
설립구분    1.000      0.046    0.491    0.000
학년        0.046      1.000    0.060    0.068
시도        0.491      0.060    1.000    0.000
학기        0.000      0.068    0.000    1.000

            순번     지원금액   시도     설립구분   학년     학기
순번        1.000    0.325      0.540    0.331      0.048    0.000
지원금액    0.325    1.000      0.354    0.745      0.048    0.000
시도        0.540    0.354      1.000    0.491      0.060    0.000
설립구분    0.331    0.745      0.491    1.000      0.046    0.000
학년        0.048    0.048      0.060    0.046      1.000    0.068
학기        0.000    0.000      0.000    0.000      0.068    1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

(index)   순번     시도   설립구분   학년   연도   학기   지원금액
9911      9912     경북   사립       2      2022   2      4075000
6923      6924     충남   국립       4      2022   2      1712000
11890     11891    서울   국립       4      2022   2      2998000
10458     10459    부산   국립       4      2022   2      2041500
11264     11265    부산   국립       4      2022   2      2364000
3581      3582     충북   국립       2      2022   1      1437000
3042      3043     경북   사립       3      2022   1      5290000
9455      9456     서울   사립       1      2022   1      4857000
4561      4562     부산   사립       3      2022   2      3245000
10381     10382    부산   국립       3      2022   1      2173500

Last rows

(index)   순번     시도   설립구분   학년   연도   학기   지원금액
6564      6565     서울   사립       3      2022   2      4201000
264       265      충북   사립       1      2022   2      4491000
10383     10384    부산   국립       3      2022   2      2173500
4109      4110     서울   공립       1      2022   1      1228500
13387     13388    울산   국립       4      2022   2      3144000
3357      3358     대전   국립       3      2022   2      3433000
7654      7655     전남   국립       1      2022   2      1887000
5010      5011     대전   국립       3      2022   1      2059000
12506     12507    충남   사립       1      2022   1      4445000
1041      1042     대구   국립       3      2022   1      2331000