Overview

Dataset statistics

Number of variables8
Number of observations4324
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory283.0 KiB
Average record size in memory67.0 B

Variable types

Numeric1
Categorical6
Boolean1

Dataset

Description복권기금 꿈사다리 장학사업은 복권기금을 재원으로 저소득층 우수 중·고생을 발굴, 대학까지 연계지원하는 국가장학제도로서, 꿈 장학금과 SOS 장학금의 2가지 유형이 있습니다. 꿈장학금은 대한민국 국적을 가진 국내 중학교 1학년~고등학교 3학년 재학생 중 저소득층 가정의 역량과 잠재력을 갖춘 학생을 선발하여 매월 장학금 지급 및 교육프로그램을 지원합니다. ※ 상세 정보는 한국장학재단 홈페이지(https://www.kosaf.go.kr/ko/scholar.do?pg=scholarship05_17_01)를 참고바랍니다.
URLhttps://www.data.go.kr/data/15107488/fileData.do

Alerts

상품유형 has constant value ""Constant
연번 is highly overall correlated with 학제High correlation
학제 is highly overall correlated with 연번High correlation
저소득구분 is highly imbalanced (92.4%)Imbalance
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-11 23:42:29.834769
Analysis finished2023-12-11 23:42:30.772047
Duration0.94 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct4324
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2162.5
Minimum1
Maximum4324
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size38.1 KiB
2023-12-12T08:42:30.877864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile217.15
Q11081.75
median2162.5
Q33243.25
95-th percentile4107.85
Maximum4324
Range4323
Interquartile range (IQR)2161.5

Descriptive statistics

Standard deviation1248.3756
Coefficient of variation (CV)0.57728352
Kurtosis-1.2
Mean2162.5
Median Absolute Deviation (MAD)1081
Skewness0
Sum9350650
Variance1558441.7
MonotonicityStrictly increasing
2023-12-12T08:42:31.070549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
< 0.1%
2890 1
 
< 0.1%
2876 1
 
< 0.1%
2877 1
 
< 0.1%
2878 1
 
< 0.1%
2879 1
 
< 0.1%
2880 1
 
< 0.1%
2881 1
 
< 0.1%
2882 1
 
< 0.1%
2883 1
 
< 0.1%
Other values (4314) 4314
99.8%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
4324 1
< 0.1%
4323 1
< 0.1%
4322 1
< 0.1%
4321 1
< 0.1%
4320 1
< 0.1%
4319 1
< 0.1%
4318 1
< 0.1%
4317 1
< 0.1%
4316 1
< 0.1%
4315 1
< 0.1%

상품유형
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size33.9 KiB
꿈장학금
4324 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row꿈장학금
2nd row꿈장학금
3rd row꿈장학금
4th row꿈장학금
5th row꿈장학금

Common Values

ValueCountFrequency (%)
꿈장학금 4324
100.0%

Length

2023-12-12T08:42:31.238196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:42:31.366655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
꿈장학금 4324
100.0%
Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size33.9 KiB
2022
1836 
2019
1113 
2021
733 
2020
642 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022
2nd row2022
3rd row2021
4th row2022
5th row2022

Common Values

ValueCountFrequency (%)
2022 1836
42.5%
2019 1113
25.7%
2021 733
 
17.0%
2020 642
 
14.8%

Length

2023-12-12T08:42:31.464457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:42:31.636996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022 1836
42.5%
2019 1113
25.7%
2021 733
 
17.0%
2020 642
 
14.8%

성별
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size33.9 KiB
여성
2653 
남성
1671 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row여성
2nd row남성
3rd row남성
4th row남성
5th row남성

Common Values

ValueCountFrequency (%)
여성 2653
61.4%
남성 1671
38.6%

Length

2023-12-12T08:42:31.788576image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:42:31.918449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
여성 2653
61.4%
남성 1671
38.6%

학년
Categorical

Distinct3
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size33.9 KiB
3
1680 
2
1533 
1
1111 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3
2nd row2
3rd row3
4th row3
5th row3

Common Values

ValueCountFrequency (%)
3 1680
38.9%
2 1533
35.5%
1 1111
25.7%

Length

2023-12-12T08:42:32.057493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:42:32.196669image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3 1680
38.9%
2 1533
35.5%
1 1111
25.7%

지역
Categorical

Distinct17
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size33.9 KiB
경기
794 
서울
714 
경북
346 
경남
322 
부산
263 
Other values (12)
1885 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울
2nd row서울
3rd row서울
4th row서울
5th row서울

Common Values

ValueCountFrequency (%)
경기 794
18.4%
서울 714
16.5%
경북 346
8.0%
경남 322
7.4%
부산 263
 
6.1%
충남 254
 
5.9%
전남 250
 
5.8%
전북 235
 
5.4%
강원 199
 
4.6%
인천 188
 
4.3%
Other values (7) 759
17.6%

Length

2023-12-12T08:42:32.332807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기 794
18.4%
서울 714
16.5%
경북 346
8.0%
경남 322
7.4%
부산 263
 
6.1%
충남 254
 
5.9%
전남 250
 
5.8%
전북 235
 
5.4%
강원 199
 
4.6%
인천 188
 
4.3%
Other values (7) 759
17.6%

학제
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size33.9 KiB
고등학교
2495 
중학교
994 
대학교
835 

Length

Max length4
Median length4
Mean length3.577012
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row고등학교
2nd row중학교
3rd row중학교
4th row중학교
5th row중학교

Common Values

ValueCountFrequency (%)
고등학교 2495
57.7%
중학교 994
 
23.0%
대학교 835
 
19.3%

Length

2023-12-12T08:42:32.475560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:42:32.618345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
고등학교 2495
57.7%
중학교 994
 
23.0%
대학교 835
 
19.3%

저소득구분
Boolean

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size4.4 KiB
True
4284 
False
 
40
ValueCountFrequency (%)
True 4284
99.1%
False 40
 
0.9%
2023-12-12T08:42:32.737233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Interactions

2023-12-12T08:42:30.373658image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T08:42:32.822072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번최초선발연도성별학년지역학제저소득구분
연번1.0000.4180.0580.3180.8050.7450.066
최초선발연도0.4181.0000.0190.2310.0740.4300.052
성별0.0580.0191.0000.0130.0440.0150.000
학년0.3180.2310.0131.0000.1400.6340.027
지역0.8050.0740.0440.1401.0000.2760.000
학제0.7450.4300.0150.6340.2761.0000.032
저소득구분0.0660.0520.0000.0270.0000.0321.000
2023-12-12T08:42:32.958761image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
학년학제성별지역최초선발연도저소득구분
학년1.0000.2990.0210.0750.2200.045
학제0.2991.0000.0250.1540.4230.053
성별0.0210.0251.0000.0390.0120.000
지역0.0750.1540.0391.0000.0410.000
최초선발연도0.2200.4230.0120.0411.0000.034
저소득구분0.0450.0530.0000.0000.0341.000
2023-12-12T08:42:33.078475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번최초선발연도성별학년지역학제저소득구분
연번1.0000.2620.0450.2010.4770.6140.051
최초선발연도0.2621.0000.0120.2200.0410.4230.034
성별0.0450.0121.0000.0210.0390.0250.000
학년0.2010.2200.0211.0000.0750.2990.045
지역0.4770.0410.0390.0751.0000.1540.000
학제0.6140.4230.0250.2990.1541.0000.053
저소득구분0.0510.0340.0000.0450.0000.0531.000

Missing values

2023-12-12T08:42:30.528023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T08:42:30.692421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번상품유형최초선발연도성별학년지역학제저소득구분
01꿈장학금2022여성3서울고등학교Y
12꿈장학금2022남성2서울중학교Y
23꿈장학금2021남성3서울중학교Y
34꿈장학금2022남성3서울중학교Y
45꿈장학금2022남성3서울중학교Y
56꿈장학금2021여성3서울중학교Y
67꿈장학금2022여성2서울고등학교Y
78꿈장학금2021여성3서울고등학교Y
89꿈장학금2019남성3서울고등학교Y
910꿈장학금2022여성3서울고등학교Y
연번상품유형최초선발연도성별학년지역학제저소득구분
43144315꿈장학금2019여성1강원대학교Y
43154316꿈장학금2020여성1서울대학교Y
43164317꿈장학금2020여성2서울대학교Y
43174318꿈장학금2020남성2서울대학교Y
43184319꿈장학금2019여성1서울대학교Y
43194320꿈장학금2020남성1광주대학교Y
43204321꿈장학금2019여성2서울대학교Y
43214322꿈장학금2019남성1경북대학교Y
43224323꿈장학금2020여성2강원대학교Y
43234324꿈장학금2019여성2전북대학교Y