Overview

Dataset statistics

Number of variables: 9
Number of observations: 1672
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 125.9 KiB
Average record size in memory: 77.1 B
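
The average record size appears to be the total in-memory size divided by the number of observations. A minimal sketch of that arithmetic, using only the figures reported in the table (the variable names are illustrative):

```python
# Figures copied from the "Dataset statistics" table.
total_size_kib = 125.9
n_observations = 1672

# 1 KiB = 1024 bytes
total_size_bytes = total_size_kib * 1024
average_record_bytes = total_size_bytes / n_observations

print(f"{average_record_bytes:.1f} B")  # prints "77.1 B"
```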

Variable types

Categorical: 5
Text: 1
Numeric: 3

Dataset

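Description: Detailed disbursement information for mentoring under the 대학생 청소년교육지원장학금 (University Student Youth Education Support Scholarship). The dataset provides the product name (상품명), year (연도), semester (학기), university name (대학명), work category (근로내역대분류), work subcategory (근로내역소분류), number of attendance records (출근부건수), number of recipients (대상인원수), amount disbursed (집행금액), and related fields. Further information about the scholarship is available on the 한국장학재단 website (http://www.koaf.go.kr) under 인재육성 > 대학생지식멘토링 > 대학생 청소년교육지원장학금.
Author: 한국장학재단 (Korea Student Aid Foundation)
URL: https://www.data.go.kr/data/15070474/fileData.do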

Alerts

상품명 has constant value "대학생 청소년교육지원 장학금" (Constant)
연도 has constant value "2022" (Constant)
학기 has constant value "2" (Constant)
근로내역대분류 is highly overall correlated with 근로내역소분류 (High correlation)
근로내역소분류 is highly overall correlated with 근로내역대분류 (High correlation)
출근부건수 is highly overall correlated with 대상인원수 and 1 other field (High correlation)
대상인원수 is highly overall correlated with 출근부건수 and 1 other field (High correlation)
집행금액(원) is highly overall correlated with 출근부건수 and 1 other field (High correlation)

Reproduction

Analysis started: 2024-04-06 08:35:20.395087
Analysis finished: 2024-04-06 08:35:23.742123
Duration: 3.35 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
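
The reported duration is consistent with the difference between the two timestamps. A short sketch of that check with the standard library (the variable names are illustrative):

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" block.
started = datetime.fromisoformat("2024-04-06 08:35:20.395087")
finished = datetime.fromisoformat("2024-04-06 08:35:23.742123")

duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")  # prints "3.35 seconds"
```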

Variables

상품명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 13.2 KiB

Value | Count
대학생 청소년교육지원 장학금 | 1672

Length

Max length: 15
Median length: 15
Mean length: 15
Min length: 15

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 대학생 청소년교육지원 장학금
2nd row: 대학생 청소년교육지원 장학금
3rd row: 대학생 청소년교육지원 장학금
4th row: 대학생 청소년교육지원 장학금
5th row: 대학생 청소년교육지원 장학금

Common Values

Value | Count | Frequency (%)
대학생 청소년교육지원 장학금 | 1672 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
대학생 | 1672 | 33.3%
청소년교육지원 | 1672 | 33.3%
장학금 | 1672 | 33.3%
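
The three equal shares in the last table are what one would get by splitting every value on whitespace and counting the resulting words. The sketch below reproduces the numbers under that assumption; it is not taken from the profiling tool's source, and the variable names are illustrative.

```python
from collections import Counter

# The column holds the same value in all 1672 rows.
values = ["대학생 청소년교육지원 장학금"] * 1672

# Assumption: words are obtained by splitting on whitespace.
word_counts = Counter(word for value in values for word in value.split())
total_words = sum(word_counts.values())

# Share of each word among all words, in percent.
shares = {word: round(100 * count / total_words, 1)
          for word, count in word_counts.items()}

for word, count in word_counts.items():
    print(word, count, f"{shares[word]}%")
```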

연도
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 13.2 KiB

Value | Count
2022 | 1672

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2022
2nd row: 2022
3rd row: 2022
4th row: 2022
5th row: 2022

Common Values

Value | Count | Frequency (%)
2022 | 1672 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2022 | 1672 | 100.0%

학기
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 13.2 KiB

Value | Count
2 | 1672

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2
2nd row: 2
3rd row: 2
4th row: 2
5th row: 2

Common Values

Value | Count | Frequency (%)
2 | 1672 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2 | 1672 | 100.0%

대학명
Text

Distinct: 180
Distinct (%): 10.8%
Missing: 0
Missing (%): 0.0%
Memory size: 13.2 KiB

Length

Max length: 18
Median length: 17
Mean length: 12.783493
Min length: 5

Characters and Unicode

Total characters: 21374
Distinct characters: 158
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
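
As an illustration of such character properties, the sketch below classifies the characters of one sample value from this column by Unicode general category, using Python's standard `unicodedata` module. The standard library exposes the general category but not script or block names, so only categories are shown. The variable names are illustrative, and this is not necessarily how the profiling tool computes its figures.

```python
import unicodedata
from collections import Counter

# One value from the 대학명 column.
sample = "부산대학교(본교) 학부"

# General category codes: Lo = Other Letter, Ps = Open Punctuation,
# Pe = Close Punctuation, Zs = Space Separator.
category_counts = Counter(unicodedata.category(ch) for ch in sample)

for code, count in category_counts.items():
    print(code, count)
```

The four codes that appear here correspond to the four categories the report lists for this variable.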

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 부산대학교(본교) 학부
2nd row: 부산대학교(본교) 학부
3rd row: 부산대학교(본교) 학부
4th row: 부산대학교(본교) 학부
5th row: 부산대학교(본교) 학부

Value | Count | Frequency (%)
학부 | 1396 | 44.9%
중앙대학교 | 18 | 0.6%
경상국립대학교(본교 | 13 | 0.4%
우석대학교(본교 | 13 | 0.4%
영남대학교(본교 | 13 | 0.4%
국립순천대학교(본교 | 13 | 0.4%
경북대학교(본교 | 13 | 0.4%
삼육대학교(본교 | 13 | 0.4%
전남대학교(본교 | 12 | 0.4%
대진대학교(본교 | 12 | 0.4%
Other values (174) | 1592 | 51.2%

Most occurring characters

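Value | Count | Frequency (%)
[Hangul character lost] | 3275 | 15.3%
[Hangul character lost] | 3155 | 14.8%
[Hangul character lost] | 1776 | 8.3%
( | 1620 | 7.6%
) | 1620 | 7.6%
[Hangul character lost] | 1491 | 7.0%
[Hangul character lost] | 1478 | 6.9%
(space) | 1436 | 6.7%
[Hangul character lost] | 317 | 1.5%
[Hangul character lost] | 204 | 1.0%
Other values (148) | 5002 | 23.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 16698 | 78.1%
Open Punctuation | 1620 | 7.6%
Close Punctuation | 1620 | 7.6%
Space Separator | 1436 | 6.7%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[Hangul character lost] | 3275 | 19.6%
[Hangul character lost] | 3155 | 18.9%
[Hangul character lost] | 1776 | 10.6%
[Hangul character lost] | 1491 | 8.9%
[Hangul character lost] | 1478 | 8.9%
[Hangul character lost] | 317 | 1.9%
[Hangul character lost] | 204 | 1.2%
[Hangul character lost] | 176 | 1.1%
[Hangul character lost] | 164 | 1.0%
[Hangul character lost] | 137 | 0.8%
Other values (145) | 4525 | 27.1%

Open Punctuation
Value | Count | Frequency (%)
( | 1620 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 1620 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 1436 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 16698 | 78.1%
Common | 4676 | 21.9%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
[Hangul character lost] | 3275 | 19.6%
[Hangul character lost] | 3155 | 18.9%
[Hangul character lost] | 1776 | 10.6%
[Hangul character lost] | 1491 | 8.9%
[Hangul character lost] | 1478 | 8.9%
[Hangul character lost] | 317 | 1.9%
[Hangul character lost] | 204 | 1.2%
[Hangul character lost] | 176 | 1.1%
[Hangul character lost] | 164 | 1.0%
[Hangul character lost] | 137 | 0.8%
Other values (145) | 4525 | 27.1%

Common
Value | Count | Frequency (%)
( | 1620 | 34.6%
) | 1620 | 34.6%
(space) | 1436 | 30.7%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 16698 | 78.1%
ASCII | 4676 | 21.9%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
[Hangul character lost] | 3275 | 19.6%
[Hangul character lost] | 3155 | 18.9%
[Hangul character lost] | 1776 | 10.6%
[Hangul character lost] | 1491 | 8.9%
[Hangul character lost] | 1478 | 8.9%
[Hangul character lost] | 317 | 1.9%
[Hangul character lost] | 204 | 1.2%
[Hangul character lost] | 176 | 1.1%
[Hangul character lost] | 164 | 1.0%
[Hangul character lost] | 137 | 0.8%
Other values (145) | 4525 | 27.1%

ASCII
Value | Count | Frequency (%)
( | 1620 | 34.6%
) | 1620 | 34.6%
(space) | 1436 | 30.7%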

근로내역대분류
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 13.2 KiB

Value | Count
활동기관구분 | 669
학년 | 661
남여 | 342

Length

Max length: 6
Median length: 2
Mean length: 3.6004785
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 활동기관구분
2nd row: 활동기관구분
3rd row: 활동기관구분
4th row: 활동기관구분
5th row: 남여

Common Values

Value | Count | Frequency (%)
활동기관구분 | 669 | 40.0%
학년 | 661 | 39.5%
남여 | 342 | 20.5%
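
The frequency and distinct percentages for this variable follow directly from the counts. A small sketch of that arithmetic, with illustrative variable names:

```python
# Counts copied from the "Common Values" table for 근로내역대분류.
counts = {"활동기관구분": 669, "학년": 661, "남여": 342}

n_observations = sum(counts.values())  # 1672, the number of rows

# Frequency (%) of each value, rounded to one decimal place.
frequency_pct = {value: round(100 * count / n_observations, 1)
                 for value, count in counts.items()}

# Distinct (%) = number of distinct values / number of rows.
distinct_pct = round(100 * len(counts) / n_observations, 1)

print(frequency_pct)
print(distinct_pct)
```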

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
활동기관구분 | 669 | 40.0%
학년 | 661 | 39.5%
남여 | 342 | 20.5%

근로내역소분류
Categorical

HIGH CORRELATION 

Distinct: 14
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Memory size: 13.2 KiB

Value | Count
[value lost] | 180
지역아동센터 | 171
2학년 | 170
3학년 | 164
[value lost] | 162
Other values (9) | 825

Length

Max length: 12
Median length: 6
Mean length: 3.5257177
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 초등학교
2nd row: 중학교
3rd row: 고등학교
4th row: 기타
5th row: [value lost]

Common Values

Value | Count | Frequency (%)
[value lost] | 180 | 10.8%
지역아동센터 | 171 | 10.2%
2학년 | 170 | 10.2%
3학년 | 164 | 9.8%
[value lost] | 162 | 9.7%
1학년 | 160 | 9.6%
기타 | 155 | 9.3%
4학년 | 153 | 9.2%
VMS/1365등록기관 | 118 | 7.1%
초등학교 | 91 | 5.4%
Other values (4) | 148 | 8.9%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
[value lost] | 180 | 10.8%
지역아동센터 | 171 | 10.2%
2학년 | 170 | 10.2%
3학년 | 164 | 9.8%
[value lost] | 162 | 9.7%
1학년 | 160 | 9.6%
기타 | 155 | 9.3%
4학년 | 153 | 9.2%
vms/1365등록기관 | 118 | 7.1%
초등학교 | 91 | 5.4%
Other values (4) | 148 | 8.9%

출근부건수
Real number (ℝ)

HIGH CORRELATION 

Distinct: 1114
Distinct (%): 66.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1344.9707
Minimum: 1
Maximum: 36261
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 14.8 KiB

Quantile statistics

Minimum: 1
5-th percentile: 29
Q1: 167.75
Median: 488
Q3: 1313
95-th percentile: 5520.85
Maximum: 36261
Range: 36260
Interquartile range (IQR): 1145.25

Descriptive statistics

Standard deviation: 2806.1646
Coefficient of variation (CV): 2.0864132
Kurtosis: 52.080806
Mean: 1344.9707
Median Absolute Deviation (MAD): 397
Skewness: 6.0842181
Sum: 2248791
Variance: 7874559.8
Monotonicity: Not monotonic
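
Several of these statistics are simple functions of the others. The sketch below recomputes the mean, coefficient of variation, variance, interquartile range, and range for 출근부건수 from the reported sum, standard deviation, quartiles, and extremes. It works only from the summary figures, not from the raw data, and the variable names are illustrative.

```python
# Figures copied from the report for 출근부건수.
n_observations = 1672
total = 2248791              # Sum
std = 2806.1646              # Standard deviation
q1, q3 = 167.75, 1313        # Q1 and Q3
minimum, maximum = 1, 36261

mean = total / n_observations    # reported: 1344.9707
cv = std / mean                  # reported: 2.0864132
variance = std ** 2              # reported: 7874559.8
iqr = q3 - q1                    # reported: 1145.25
value_range = maximum - minimum  # reported: 36260

print(round(mean, 4), round(cv, 4), round(variance, 1), iqr, value_range)
```

The small differences in the last digits come from rounding in the reported inputs.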
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
32 | 10 | 0.6%
18 | 7 | 0.4%
10 | 6 | 0.4%
26 | 6 | 0.4%
45 | 6 | 0.4%
249 | 6 | 0.4%
124 | 6 | 0.4%
24 | 6 | 0.4%
75 | 6 | 0.4%
29 | 6 | 0.4%
Other values (1104) | 1607 | 96.1%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 0.1%
2 | 1 | 0.1%
3 | 1 | 0.1%
4 | 1 | 0.1%
5 | 3 | 0.2%
6 | 5 | 0.3%
7 | 1 | 0.1%
8 | 5 | 0.3%
9 | 4 | 0.2%
10 | 6 | 0.4%

Maximum 10 values

Value | Count | Frequency (%)
36261 | 1 | 0.1%
32429 | 1 | 0.1%
31540 | 1 | 0.1%
31539 | 1 | 0.1%
25880 | 1 | 0.1%
24940 | 1 | 0.1%
22007 | 1 | 0.1%
20292 | 1 | 0.1%
17337 | 1 | 0.1%
16493 | 1 | 0.1%

대상인원수
Real number (ℝ)

HIGH CORRELATION 

Distinct: 147
Distinct (%): 8.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 22.840311
Minimum: 1
Maximum: 587
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 14.8 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 3
Median: 9
Q3: 23
95-th percentile: 87
Maximum: 587
Range: 586
Interquartile range (IQR): 20

Descriptive statistics

Standard deviation: 43.773124
Coefficient of variation (CV): 1.9164855
Kurtosis: 52.865293
Mean: 22.840311
Median Absolute Deviation (MAD): 7
Skewness: 5.9895364
Sum: 38189
Variance: 1916.0864
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
1 | 193 | 11.5%
2 | 136 | 8.1%
3 | 121 | 7.2%
4 | 88 | 5.3%
5 | 77 | 4.6%
6 | 70 | 4.2%
7 | 69 | 4.1%
8 | 58 | 3.5%
11 | 48 | 2.9%
10 | 47 | 2.8%
Other values (137) | 765 | 45.8%

Minimum 10 values

Value | Count | Frequency (%)
1 | 193 | 11.5%
2 | 136 | 8.1%
3 | 121 | 7.2%
4 | 88 | 5.3%
5 | 77 | 4.6%
6 | 70 | 4.2%
7 | 69 | 4.1%
8 | 58 | 3.5%
9 | 46 | 2.8%
10 | 47 | 2.8%

Maximum 10 values

Value | Count | Frequency (%)
587 | 1 | 0.1%
575 | 1 | 0.1%
463 | 1 | 0.1%
431 | 1 | 0.1%
363 | 1 | 0.1%
360 | 1 | 0.1%
355 | 1 | 0.1%
304 | 1 | 0.1%
297 | 1 | 0.1%
294 | 1 | 0.1%

집행금액(원)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 1549
Distinct (%): 92.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 76574716
Minimum: 12500
Maximum: 2.0035938 × 10^9
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 14.8 KiB

Quantile statistics

Minimum: 12500
5-th percentile: 995000
Q1: 8367187.5
Median: 25643750
Q3: 70870312
95-th percentile: 3.064475 × 10^8
Maximum: 2.0035938 × 10^9
Range: 2.0035812 × 10^9
Interquartile range (IQR): 62503125

Descriptive statistics

Standard deviation: 1.6627589 × 10^8
Coefficient of variation (CV): 2.1714202
Kurtosis: 49.729797
Mean: 76574716
Median Absolute Deviation (MAD): 21587500
Skewness: 6.0116935
Sum: 1.2803293 × 10^11
Variance: 2.764767 × 10^16
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
2962500 | 4 | 0.2%
6431250 | 3 | 0.2%
6412500 | 3 | 0.2%
16150000 | 3 | 0.2%
943750 | 3 | 0.2%
2256250 | 3 | 0.2%
2343750 | 3 | 0.2%
887500 | 3 | 0.2%
3150000 | 3 | 0.2%
281250 | 3 | 0.2%
Other values (1539) | 1641 | 98.1%

Minimum 10 values

Value | Count | Frequency (%)
12500 | 1 | 0.1%
68750 | 1 | 0.1%
81250 | 1 | 0.1%
93750 | 1 | 0.1%
137500 | 1 | 0.1%
143750 | 1 | 0.1%
162500 | 1 | 0.1%
168750 | 1 | 0.1%
175000 | 1 | 0.1%
187500 | 1 | 0.1%

Maximum 10 values

Value | Count | Frequency (%)
2003593750 | 1 | 0.1%
1952418750 | 1 | 0.1%
1921418750 | 1 | 0.1%
1703975000 | 1 | 0.1%
1539168750 | 1 | 0.1%
1526281250 | 1 | 0.1%
1508618750 | 1 | 0.1%
1068775000 | 1 | 0.1%
1046981250 | 1 | 0.1%
1035462500 | 1 | 0.1%

Interactions

[Figures: 9 interaction plots; image content not recoverable]

Correlations

Variable | 근로내역대분류 | 근로내역소분류 | 출근부건수 | 대상인원수 | 집행금액(원)
근로내역대분류 | 1.000 | 1.000 | 0.165 | 0.217 | 0.182
근로내역소분류 | 1.000 | 1.000 | 0.136 | 0.125 | 0.151
출근부건수 | 0.165 | 0.136 | 1.000 | 0.908 | 0.925
대상인원수 | 0.217 | 0.125 | 0.908 | 1.000 | 0.957
집행금액(원) | 0.182 | 0.151 | 0.925 | 0.957 | 1.000

Variable | 근로내역대분류 | 근로내역소분류
근로내역대분류 | 1.000 | 0.997
근로내역소분류 | 0.997 | 1.000

Variable | 출근부건수 | 대상인원수 | 집행금액(원) | 근로내역대분류 | 근로내역소분류
출근부건수 | 1.000 | 0.944 | 0.975 | 0.099 | 0.055
대상인원수 | 0.944 | 1.000 | 0.885 | 0.097 | 0.053
집행금액(원) | 0.975 | 0.885 | 1.000 | 0.081 | 0.064
근로내역대분류 | 0.099 | 0.097 | 0.081 | 1.000 | 0.997
근로내역소분류 | 0.055 | 0.053 | 0.064 | 0.997 | 1.000
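
The "High correlation" alerts near the top of the report line up with the first matrix in this section if a pair is flagged whenever its coefficient reaches some cutoff. The sketch below applies a cutoff of 0.5 to that matrix. The cutoff value and the helper names are assumptions made for illustration; the report itself does not state the threshold it used.

```python
from collections import defaultdict

# Upper triangle of the first correlation matrix in this section.
correlations = {
    ("근로내역대분류", "근로내역소분류"): 1.000,
    ("근로내역대분류", "출근부건수"): 0.165,
    ("근로내역대분류", "대상인원수"): 0.217,
    ("근로내역대분류", "집행금액(원)"): 0.182,
    ("근로내역소분류", "출근부건수"): 0.136,
    ("근로내역소분류", "대상인원수"): 0.125,
    ("근로내역소분류", "집행금액(원)"): 0.151,
    ("출근부건수", "대상인원수"): 0.908,
    ("출근부건수", "집행금액(원)"): 0.925,
    ("대상인원수", "집행금액(원)"): 0.957,
}

THRESHOLD = 0.5  # assumed cutoff, not taken from the report

# For each variable, collect the variables it is highly correlated with.
partners = defaultdict(list)
for (left, right), value in correlations.items():
    if abs(value) >= THRESHOLD:
        partners[left].append(right)
        partners[right].append(left)

for variable, others in partners.items():
    print(variable, "->", ", ".join(others))
```

With this cutoff, each of the three numeric variables is flagged against the other two, and the two categorical variables are flagged against each other, which matches the five alerts.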

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Index | 상품명 | 연도 | 학기 | 대학명 | 근로내역대분류 | 근로내역소분류 | 출근부건수, 대상인원수, 집행금액(원) [digits run together in the source]
0 | 대학생 청소년교육지원 장학금 | 2022 | 2 | 부산대학교(본교) 학부 | 활동기관구분 | 초등학교 | 12123531100000
1 | 대학생 청소년교육지원 장학금 | 2022 | 2 | 부산대학교(본교) 학부 | 활동기관구분 | 중학교 | 24717865593750
2 | 대학생 청소년교육지원 장학금 | 2022 | 2 | 부산대학교(본교) 학부 | 활동기관구분 | 고등학교 | 8872422793750
3 | 대학생 청소년교육지원 장학금 | 2022 | 2 | 부산대학교(본교) 학부 | 활동기관구분 | 기타 | 23903773062500
4 | 대학생 청소년교육지원 장학금 | 2022 | 2 | 부산대학교(본교) 학부 | 남여 | [value lost] | 443395120537500
5 | 대학생 청소년교육지원 장학금 | 2022 | 2 | 부산대학교(본교) 학부 | 학년 | 2학년 | 26306371081250
6 | 대학생 청소년교육지원 장학금 | 2022 | 2 | 부산대학교(본교) 학부 | 학년 | 3학년 | 26935974993750
7 | 대학생 청소년교육지원 장학금 | 2022 | 2 | 부산대학교(본교) 학부 | 학년 | 4학년 | 5931018637500
8 | 대학생 청소년교육지원 장학금 | 2022 | 2 | 부산대학교(본교) 학부 | 남여 | [value lost] | 25276972012500
9 | 대학생 청소년교육지원 장학금 | 2022 | 2 | 부산대학교(본교) 학부 | 학년 | 1학년 | 10443227837500

Index | 상품명 | 연도 | 학기 | 대학명 | 근로내역대분류 | 근로내역소분류 | 출근부건수, 대상인원수, 집행금액(원) [digits run together in the source]
1662 | 대학생 청소년교육지원 장학금 | 2022 | 2 | 호남대학교(본교) 학부 | 활동기관구분 | 지역아동센터 | 6248113392100000
1663 | 대학생 청소년교육지원 장학금 | 2022 | 2 | 호남대학교(본교) 학부 | 활동기관구분 | VMS/1365등록기관 | 13712180762500
1664 | 대학생 청소년교육지원 장학금 | 2022 | 2 | 호남대학교(본교) 학부 | 활동기관구분 | 기타 | 224734124750000
1665 | 대학생 청소년교육지원 장학금 | 2022 | 2 | 호남대학교(본교) 학부 | 학년 | 1학년 | 8961755525000
1666 | 대학생 청소년교육지원 장학금 | 2022 | 2 | 호남대학교(본교) 학부 | 활동기관구분 | 초등학교 | 5301225481250
1667 | 대학생 청소년교육지원 장학금 | 2022 | 2 | 호남대학교(본교) 학부 | 학년 | 3학년 | 404758243368750
1668 | 대학생 청소년교육지원 장학금 | 2022 | 2 | 호남대학교(본교) 학부 | 학년 | 4학년 | 404568246600000
1669 | 대학생 청소년교육지원 장학금 | 2022 | 2 | 호남대학교(본교) 학부 | 남여 | [value lost] | 232139133531250
1670 | 대학생 청소년교육지원 장학금 | 2022 | 2 | 호남대학교(본교) 학부 | 남여 | [value lost] | 8646135523637500
1671 | 대학생 청소년교육지원 장학금 | 2022 | 2 | 호남대학교(본교) 학부 | 학년 | 2학년 | 197931111675000