Overview

Dataset statistics

Number of variables: 7
Number of observations: 7179
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 427.8 KiB
Average record size in memory: 61.0 B

Variable types

Numeric: 2
Categorical: 5

Dataset

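Description: This dataset reports the amounts awarded under the Humanities 100-Year Scholarship (인문100년 장학금), broken down by province/city, establishment type, year of study, and so on. The Humanities 100-Year Scholarship is a program that provides tuition support to outstanding students in the humanities and social sciences in order to foster talent with a grounding in the humanities. For details, see the Korea Student Aid Foundation (한국장학재단) website (http://www.kosaf.go.kr/ko/scholar.do?pg=scholarship05_07_01).
URL: https://www.data.go.kr/data/15114953/fileData.do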

Alerts

연도 (year) has constant value "2022" [Constant]
지원금액 (award amount) is highly overall correlated with 설립구분 (establishment type) [High correlation]
설립구분 is highly overall correlated with 지원금액 [High correlation]
설립구분 is highly imbalanced (51.1%) [Imbalance]
순번 (sequence number) has unique values [Unique]

Reproduction

Analysis started: 2023-12-12 19:42:27.446192
Analysis finished: 2023-12-12 19:42:28.801036
Duration: 1.35 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
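
A report of this kind can be regenerated along the following lines. This is a minimal sketch, not the configuration actually used here (the config.json is not reproduced in this text). The function name build_report, the file paths, the report title, and the cp949 encoding are all assumptions for illustration; only ProfileReport and its to_file method come from ydata-profiling itself.

```python
def build_report(csv_path, html_path="report.html", encoding="cp949"):
    """Profile a CSV file and write an HTML report; returns the output path.

    csv_path, html_path and encoding are hypothetical defaults. The encoding
    is a guess that is common for Korean public data files, not a fact
    stated in this report.
    """
    # Imported lazily so this sketch loads even where the optional
    # dependencies are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path, encoding=encoding)
    report = ProfileReport(df, title="인문100년 장학금 지원금액 현황")
    report.to_file(html_path)
    return html_path
```

Calling build_report("scholarship.csv") on a local copy of the file downloaded from the URL above would write report.html next to it; the file name is a placeholder.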

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct: 7179
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3590
Minimum: 1
Maximum: 7179
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 63.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 359.9
Q1: 1795.5
Median: 3590
Q3: 5384.5
95-th percentile: 6820.1
Maximum: 7179
Range: 7178
Interquartile range (IQR): 3589

Descriptive statistics

Standard deviation: 2072.5431
Coefficient of variation (CV): 0.57731006
Kurtosis: -1.2
Mean: 3590
Median Absolute Deviation (MAD): 1795
Skewness: 0
Sum: 25772610
Variance: 4295435
Monotonicity: Strictly increasing

[Figure: histogram with fixed size bins (bins=50)]
Common Values

Value                  Count   Frequency (%)
1                      1       < 0.1%
4937                   1       < 0.1%
4795                   1       < 0.1%
4794                   1       < 0.1%
4793                   1       < 0.1%
4792                   1       < 0.1%
4791                   1       < 0.1%
4790                   1       < 0.1%
4789                   1       < 0.1%
4788                   1       < 0.1%
Other values (7169)    7169    99.9%

Minimum 10 values

Value   Count   Frequency (%)
1       1       < 0.1%
2       1       < 0.1%
3       1       < 0.1%
4       1       < 0.1%
5       1       < 0.1%
6       1       < 0.1%
7       1       < 0.1%
8       1       < 0.1%
9       1       < 0.1%
10      1       < 0.1%

Maximum 10 values

Value   Count   Frequency (%)
7179    1       < 0.1%
7178    1       < 0.1%
7177    1       < 0.1%
7176    1       < 0.1%
7175    1       < 0.1%
7174    1       < 0.1%
7173    1       < 0.1%
7172    1       < 0.1%
7171    1       < 0.1%
7170    1       < 0.1%
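
Because 순번 is simply the sequence 1 to 7179, its reported statistics can be checked by hand. The sketch below recomputes them with the standard library only. The percentile helper is hypothetical and assumes linear interpolation between neighbouring values, and the variance is assumed to be the sample variance (n - 1 denominator); both assumptions reproduce the figures listed above.

```python
import statistics

n = 7179                          # number of observations in the overview
values = list(range(1, n + 1))    # 순번 runs 1..n, strictly increasing


def percentile(sorted_vals, q):
    """Percentile by linear interpolation between neighbouring values."""
    pos = q * (len(sorted_vals) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(sorted_vals) - 1)
    return sorted_vals[lo] + (pos - lo) * (sorted_vals[hi] - sorted_vals[lo])


total = sum(values)
mean = statistics.fmean(values)
variance = statistics.variance(values)    # sample variance, n - 1 denominator
std = variance ** 0.5
cv = std / mean                           # coefficient of variation
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)
p05, q1, q3, p95 = (percentile(values, q) for q in (0.05, 0.25, 0.75, 0.95))
iqr = q3 - q1

print(total, mean, variance, round(std, 4), round(cv, 8))
print(p05, q1, median, q3, p95, iqr, mad)
```

The closed forms for a sequence 1..n give the same numbers: the sum is n(n + 1)/2 and the sample variance is n(n + 1)/12.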

시도
Categorical

Distinct: 17
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 56.2 KiB

Value               Count
서울                2134
경기                859
부산                725
경북                537
충남                473
Other values (12)   2451

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 대전
2nd row: 대전
3rd row: 대전
4th row: 대전
5th row: 대전

Common Values

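Value               Count   Frequency (%)
서울 (Seoul)        2134    29.7%
경기 (Gyeonggi)     859     12.0%
부산 (Busan)        725     10.1%
경북 (Gyeongbuk)    537     7.5%
충남 (Chungnam)     473     6.6%
대전 (Daejeon)      385     5.4%
전북 (Jeonbuk)      329     4.6%
충북 (Chungbuk)     318     4.4%
강원 (Gangwon)      289     4.0%
경남 (Gyeongnam)    263     3.7%
Other values (7)    867     12.1%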

Length

[Figure: histogram of lengths of the category]
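
The Frequency (%) column and the "Other values" row follow directly from the counts and the 7179 observations. A minimal sketch of that arithmetic, using the ten 시도 counts listed under Common Values; the helper name frequency_table is hypothetical:

```python
n_rows = 7179  # number of observations in the overview

# Ten most frequent 시도 values and their counts, as listed in the report.
top_counts = {
    "서울": 2134, "경기": 859, "부산": 725, "경북": 537, "충남": 473,
    "대전": 385, "전북": 329, "충북": 318, "강원": 289, "경남": 263,
}


def frequency_table(counts, total):
    """Return (value, count, percent) rows plus a trailing 'Other values' row."""
    rows = [(value, count, round(100 * count / total, 1))
            for value, count in counts.items()]
    other = total - sum(counts.values())   # rows not covered by the listed values
    rows.append(("Other values", other, round(100 * other / total, 1)))
    return rows


table = frequency_table(top_counts, n_rows)
for value, count, pct in table:
    print(f"{value}\t{count}\t{pct}%")
```

The remainder, 867 rows, is spread over the seven 시도 values that are not listed individually.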

설립구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 56.2 KiB

Value   Count
사립    5689
국립    1449
공립    41

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 사립
2nd row: 사립
3rd row: 사립
4th row: 사립
5th row: 사립

Common Values

Value             Count   Frequency (%)
사립 (private)    5689    79.2%
국립 (national)   1449    20.2%
공립 (public)     41      0.6%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]
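
The 51.1% figure in the Imbalance alert can be reproduced from the three class counts. The sketch below assumes the score is one minus the Shannon entropy of the class distribution divided by its maximum possible value, log(number of classes). That formula is an assumption that happens to match the reported number; it is not stated anywhere in this report, and the function name imbalance_score is hypothetical.

```python
import math

# Class counts for 설립구분 as listed under Common Values.
counts = {"사립": 5689, "국립": 1449, "공립": 41}


def imbalance_score(class_counts):
    """1 - normalized Shannon entropy: 0 = perfectly balanced, 1 = one class only."""
    if len(class_counts) < 2:
        return 0.0                      # a single class has no balance to measure
    total = sum(class_counts.values())
    probs = [c / total for c in class_counts.values() if c > 0]
    entropy = -sum(p * math.log(p) for p in probs)
    return 1 - entropy / math.log(len(class_counts))


score = imbalance_score(counts)
print(f"{score:.1%}")
```

A perfectly even three-way split would score 0, so a value above one half reflects how strongly the 사립 class dominates.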

학년
Categorical

Distinct: 4
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 56.2 KiB

Value   Count
3       3242
4       2203
1       1003
2       731

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 1
3rd row: 1
4th row: 1
5th row: 1

Common Values

Value   Count   Frequency (%)
3       3242    45.2%
4       2203    30.7%
1       1003    14.0%
2       731     10.2%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

연도
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 56.2 KiB

Value   Count
2022    7179

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2022
2nd row: 2022
3rd row: 2022
4th row: 2022
5th row: 2022

Common Values

Value   Count   Frequency (%)
2022    7179    100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

학기
Categorical

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 56.2 KiB

Value   Count
2       4121
1       3058

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2
2nd row: 1
3rd row: 2
4th row: 1
5th row: 2

Common Values

Value   Count   Frequency (%)
2       4121    57.4%
1       3058    42.6%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

지원금액
Real number (ℝ)

HIGH CORRELATION 

Distinct: 983
Distinct (%): 13.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3705121
Minimum: 568000
Maximum: 9564000
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 63.2 KiB

Quantile statistics

Minimum: 568000
5-th percentile: 1659000
Q1: 2918000
Median: 3403000
Q3: 4699000
95-th percentile: 6013000
Maximum: 9564000
Range: 8996000
Interquartile range (IQR): 1781000

Descriptive statistics

Standard deviation: 1382468
Coefficient of variation (CV): 0.37312358
Kurtosis: 0.70636788
Mean: 3705121
Median Absolute Deviation (MAD): 540000
Skewness: 0.7816891
Sum: 2.6599064 × 10^10
Variance: 1.9112178 × 10^12
Monotonicity: Not monotonic

[Figure: histogram with fixed size bins (bins=50)]
Common Values

Value                 Count   Frequency (%)
3560000               124     1.7%
3360500               113     1.6%
2913000               105     1.5%
3469000               102     1.4%
2857000               97      1.4%
3406000               91      1.3%
3513000               84      1.2%
2882000               80      1.1%
3351000               77      1.1%
3147000               76      1.1%
Other values (973)    6230    86.8%

Minimum 10 values

Value     Count   Frequency (%)
568000    3       < 0.1%
1022000   20      0.3%
1437000   19      0.3%
1460000   16      0.2%
1512000   13      0.2%
1544000   14      0.2%
1551000   7       0.1%
1585000   9       0.1%
1586000   11      0.2%
1596600   21      0.3%

Maximum 10 values

Value     Count   Frequency (%)
9564000   1       < 0.1%
9369000   2       < 0.1%
9344000   1       < 0.1%
9055000   1       < 0.1%
8869000   1       < 0.1%
8860000   1       < 0.1%
8791000   1       < 0.1%
8750000   2       < 0.1%
8666000   2       < 0.1%
8648200   1       < 0.1%
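
Several of the 지원금액 figures are derived from others, which gives a quick internal consistency check. The sketch below recomputes the range, IQR, coefficient of variation, sum and variance from the rounded values printed in this report, so the results agree with the reported ones only up to rounding. The variable names are illustrative.

```python
# Values copied from the 지원금액 statistics in this report (already rounded).
n = 7179
minimum, maximum = 568_000, 9_564_000
q1, q3 = 2_918_000, 4_699_000
mean, std = 3_705_121, 1_382_468

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range
cv = std / mean                   # Coefficient of variation
total = mean * n                  # Sum, approximate because the mean is rounded
variance = std ** 2               # Variance, approximate because std is rounded

print(value_range, iqr, round(cv, 6))
print(f"{total:.7e}", f"{variance:.7e}")
```

Each recomputed value agrees with the corresponding entry in the report to the precision shown there.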

Interactions

[Figures: four interaction plots; images not included]

Correlations

            순번    시도    설립구분  학년    학기    지원금액
순번        1.000   0.751   0.461     0.026   0.000   0.445
시도        0.751   1.000   0.560     0.087   0.000   0.546
설립구분    0.461   0.560   1.000     0.025   0.000   0.750
학년        0.026   0.087   0.025     1.000   0.236   0.235
학기        0.000   0.000   0.000     0.236   1.000   0.216
지원금액    0.445   0.546   0.750     0.235   0.216   1.000

            학년    설립구분  학기    시도
학년        1.000   0.023     0.157   0.049
설립구분    0.023   1.000     0.000   0.364
학기        0.157   0.000     1.000   0.000
시도        0.049   0.364     0.000   1.000

            순번    지원금액  시도    설립구분  학년    학기
순번        1.000   0.158     0.412   0.312     0.016   0.000
지원금액    0.158   1.000     0.246   0.617     0.142   0.166
시도        0.412   0.246     1.000   0.364     0.049   0.000
설립구분    0.312   0.617     0.364   1.000     0.023   0.000
학년        0.016   0.142     0.049   0.023     1.000   0.157
학기        0.000   0.166     0.000   0.000     0.157   1.000
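
The two High correlation alerts can be cross-checked against the last matrix in this section. The sketch below assumes that a pair is flagged when its coefficient is at least 0.5; that threshold is an assumption and is not stated in this report. The helper name flagged_pairs is hypothetical.

```python
from itertools import combinations

# Last correlation matrix in this section, copied row by row.
names = ["순번", "지원금액", "시도", "설립구분", "학년", "학기"]
matrix = [
    [1.000, 0.158, 0.412, 0.312, 0.016, 0.000],
    [0.158, 1.000, 0.246, 0.617, 0.142, 0.166],
    [0.412, 0.246, 1.000, 0.364, 0.049, 0.000],
    [0.312, 0.617, 0.364, 1.000, 0.023, 0.000],
    [0.016, 0.142, 0.049, 0.023, 1.000, 0.157],
    [0.000, 0.166, 0.000, 0.000, 0.157, 1.000],
]

THRESHOLD = 0.5  # assumed cut-off for a "high correlation" alert


def flagged_pairs(names, matrix, threshold):
    """List each unordered pair of variables whose coefficient meets the threshold."""
    return [
        (names[i], names[j], matrix[i][j])
        for i, j in combinations(range(len(names)), 2)
        if matrix[i][j] >= threshold
    ]


pairs = flagged_pairs(names, matrix, THRESHOLD)
print(pairs)
```

Under that assumption the only flagged pair is 지원금액 and 설립구분 (0.617), which matches the alerts listed in the overview.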

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

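Columns: 순번 (sequence number), 시도 (province/city), 설립구분 (establishment type), 학년 (year of study), 연도 (year), 학기 (semester), 지원금액 (award amount).

First rows

Index   순번   시도   설립구분   학년   연도   학기   지원금액
0       1      대전   사립       1      2022   2      3204000
1       2      대전   사립       1      2022   1      3204000
2       3      대전   사립       1      2022   2      3422000
3       4      대전   사립       1      2022   1      3422000
4       5      대전   사립       1      2022   2      5922000
5       6      대전   사립       1      2022   1      5922000
6       7      대전   사립       2      2022   2      5204000
7       8      대전   사립       2      2022   1      5204000
8       9      대전   사립       1      2022   2      3204000
9       10     대전   사립       3      2022   2      3422000

Last rows

Index   순번   시도   설립구분   학년   연도   학기   지원금액
7169    7170   경기   사립       3      2022   2      3750000
7170    7171   경기   사립       3      2022   1      5750000
7171    7172   경기   사립       4      2022   2      3750000
7172    7173   경기   사립       4      2022   1      3750000
7173    7174   경기   사립       4      2022   2      5750000
7174    7175   경기   사립       4      2022   1      5750000
7175    7176   경기   사립       4      2022   1      3750000
7176    7177   경기   사립       4      2022   2      5750000
7177    7178   경기   사립       4      2022   1      5750000
7178    7179   경기   사립       3      2022   1      3750000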