Overview

Dataset statistics

Number of variables5
Number of observations72
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.2 KiB
Average record size in memory45.8 B

Variable types

Categorical2
Numeric3

Dataset

Description□사업명: 체육인재 장학지원사업 □사업내용: 학업의지가 높고 성장잠재력을 보유한 저소득층 학생선수 대상 장학금 지원 □지원대상: 저소득층 가정의 초·중·고 학생 선수 1,266명(2023년 기준이며 매년 변동됨) □지원내용 ㅇ지원금액: 1인당 매월 40만원 이내(사용부문 한정) ㅇ지원방식: 바우처 지원(매월 정액 포인트 지급 후 익월 정산) ㅇ지원기간: 5월 ∼ 차년도 2월 (최대 10개월) ㅇ장학금 사용부문 ① 스포츠 부문: 스포츠 용품 및 의류, 스포츠시설, 프로그램 수강료 등 ② 학업 부문: 서적 및 수업교재, 입시 및 보습학원, 체육학원 및 무술도장 등 이 사업을 통해 도출한 체육인재 장학지원사업 장학금 사용부문 통계 정보를 공개합니다.
URLhttps://www.data.go.kr/data/15118807/fileData.do

Alerts

평균사용건수 is highly overall correlated with 총사용건수 and 2 other fieldsHigh correlation
총사용건수 is highly overall correlated with 평균사용건수 and 2 other fieldsHigh correlation
사용비율 is highly overall correlated with 평균사용건수 and 2 other fieldsHigh correlation
업종명 is highly overall correlated with 평균사용건수 and 2 other fieldsHigh correlation
평균사용건수 has 14 (19.4%) zerosZeros
총사용건수 has 14 (19.4%) zerosZeros
사용비율 has 38 (52.8%) zerosZeros

Reproduction

Analysis started2023-12-12 03:57:59.624636
Analysis finished2023-12-12 03:58:01.328777
Duration1.7 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기준연도
Categorical

Distinct3
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size708.0 B
2020
24 
2021
24 
2022
24 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020
2nd row2020
3rd row2020
4th row2020
5th row2020

Common Values

ValueCountFrequency (%)
2020 24
33.3%
2021 24
33.3%
2022 24
33.3%

Length

2023-12-12T12:58:01.438316image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:58:01.595030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020 24
33.3%
2021 24
33.3%
2022 24
33.3%

업종명
Categorical

HIGH CORRELATION 

Distinct24
Distinct (%)33.3%
Missing0
Missing (%)0.0%
Memory size708.0 B
골프장
 
3
공공편의서비스
 
3
교육및 교구
 
3
기타교육,교습,학원
 
3
기타레저,스포츠용품
 
3
Other values (19)
57 

Length

Max length13
Median length8
Mean length6.6666667
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row골프장
2nd row공공편의서비스
3rd row교육및 교구
4th row기타교육,교습,학원
5th row기타레저,스포츠용품

Common Values

ValueCountFrequency (%)
골프장 3
 
4.2%
공공편의서비스 3
 
4.2%
교육및 교구 3
 
4.2%
기타교육,교습,학원 3
 
4.2%
기타레저,스포츠용품 3
 
4.2%
기타서비스 3
 
4.2%
기타 취미,레저, 서비스 3
 
4.2%
당구장 3
 
4.2%
무술도장 등(학원) 3
 
4.2%
문구,사무용품 3
 
4.2%
Other values (14) 42
58.3%

Length

2023-12-12T12:58:01.780665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
골프장 3
 
3.3%
공공편의서비스 3
 
3.3%
인쇄.복사 3
 
3.3%
판촉물 3
 
3.3%
구두,신발 3
 
3.3%
헬스클럽 3
 
3.3%
총포류 3
 
3.3%
초,중,고등학교 3
 
3.3%
전문스포츠용품점 3
 
3.3%
자전거 3
 
3.3%
Other values (20) 60
66.7%

평균사용건수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct55
Distinct (%)76.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean138.87222
Minimum0
Maximum1442.5
Zeros14
Zeros (%)19.4%
Negative0
Negative (%)0.0%
Memory size780.0 B
2023-12-12T12:58:01.990783image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11.15
median15
Q361.65
95-th percentile1120.95
Maximum1442.5
Range1442.5
Interquartile range (IQR)60.5

Descriptive statistics

Standard deviation339.97591
Coefficient of variation (CV)2.4481203
Kurtosis8.0656622
Mean138.87222
Median Absolute Deviation (MAD)15
Skewness3.0370075
Sum9998.8
Variance115583.62
MonotonicityNot monotonic
2023-12-12T12:58:02.245495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 14
 
19.4%
37.0 2
 
2.8%
0.1 2
 
2.8%
1.2 2
 
2.8%
12.6 2
 
2.8%
51.8 1
 
1.4%
12.4 1
 
1.4%
3.1 1
 
1.4%
27.1 1
 
1.4%
16.7 1
 
1.4%
Other values (45) 45
62.5%
ValueCountFrequency (%)
0.0 14
19.4%
0.1 2
 
2.8%
0.4 1
 
1.4%
1.0 1
 
1.4%
1.2 2
 
2.8%
2.5 1
 
1.4%
3.1 1
 
1.4%
3.8 1
 
1.4%
5.5 1
 
1.4%
5.7 1
 
1.4%
ValueCountFrequency (%)
1442.5 1
1.4%
1395.6 1
1.4%
1345.3 1
1.4%
1127.0 1
1.4%
1116.0 1
1.4%
923.0 1
1.4%
313.7 1
1.4%
298.0 1
1.4%
242.8 1
1.4%
163.6 1
1.4%

총사용건수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct55
Distinct (%)76.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1249.8333
Minimum0
Maximum14425
Zeros14
Zeros (%)19.4%
Negative0
Negative (%)0.0%
Memory size780.0 B
2023-12-12T12:58:02.439209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q111.5
median150
Q3616.5
95-th percentile10148
Maximum14425
Range14425
Interquartile range (IQR)605

Descriptive statistics

Standard deviation3192.9263
Coefficient of variation (CV)2.5546817
Kurtosis10.52768
Mean1249.8333
Median Absolute Deviation (MAD)150
Skewness3.385363
Sum89988
Variance10194778
MonotonicityNot monotonic
2023-12-12T12:58:02.630200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 14
 
19.4%
370 2
 
2.8%
1 2
 
2.8%
12 2
 
2.8%
126 2
 
2.8%
518 1
 
1.4%
124 1
 
1.4%
31 1
 
1.4%
271 1
 
1.4%
167 1
 
1.4%
Other values (45) 45
62.5%
ValueCountFrequency (%)
0 14
19.4%
1 2
 
2.8%
4 1
 
1.4%
10 1
 
1.4%
12 2
 
2.8%
25 1
 
1.4%
31 1
 
1.4%
38 1
 
1.4%
55 1
 
1.4%
57 1
 
1.4%
ValueCountFrequency (%)
14425 1
1.4%
13956 1
1.4%
13453 1
1.4%
11270 1
1.4%
9230 1
1.4%
3137 1
1.4%
2980 1
1.4%
2428 1
1.4%
1636 1
1.4%
1578 1
1.4%

사용비율
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct14
Distinct (%)19.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.1111111
Minimum0
Maximum45
Zeros38
Zeros (%)52.8%
Negative0
Negative (%)0.0%
Memory size780.0 B
2023-12-12T12:58:02.779763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q32
95-th percentile32.45
Maximum45
Range45
Interquartile range (IQR)2

Descriptive statistics

Standard deviation10.25115
Coefficient of variation (CV)2.4935229
Kurtosis8.0695407
Mean4.1111111
Median Absolute Deviation (MAD)0
Skewness3.027968
Sum296
Variance105.08607
MonotonicityNot monotonic
2023-12-12T12:58:02.949566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%)
0 38
52.8%
1 12
 
16.7%
2 7
 
9.7%
3 3
 
4.2%
5 2
 
2.8%
9 2
 
2.8%
33 1
 
1.4%
40 1
 
1.4%
32 1
 
1.4%
41 1
 
1.4%
Other values (4) 4
 
5.6%
ValueCountFrequency (%)
0 38
52.8%
1 12
 
16.7%
2 7
 
9.7%
3 3
 
4.2%
4 1
 
1.4%
5 2
 
2.8%
8 1
 
1.4%
9 2
 
2.8%
30 1
 
1.4%
32 1
 
1.4%
ValueCountFrequency (%)
45 1
1.4%
41 1
1.4%
40 1
1.4%
33 1
1.4%
32 1
1.4%
30 1
1.4%
9 2
2.8%
8 1
1.4%
5 2
2.8%
4 1
1.4%

Interactions

2023-12-12T12:58:00.705622image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:57:59.875497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:58:00.310938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:58:00.837237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:58:00.018913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:58:00.452841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:58:00.965278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:58:00.191839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:58:00.583575image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T12:58:03.064463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준연도업종명평균사용건수총사용건수사용비율
기준연도1.0000.0000.0000.0000.000
업종명0.0001.0000.9390.9010.841
평균사용건수0.0000.9391.0000.9991.000
총사용건수0.0000.9010.9991.0000.967
사용비율0.0000.8411.0000.9671.000
2023-12-12T12:58:03.205692image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종명기준연도
업종명1.0000.000
기준연도0.0001.000
2023-12-12T12:58:03.672633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
평균사용건수총사용건수사용비율기준연도업종명
평균사용건수1.0000.9990.9240.0000.585
총사용건수0.9991.0000.9230.0000.518
사용비율0.9240.9231.0000.0000.516
기준연도0.0000.0000.0001.0000.000
업종명0.5850.5180.5160.0001.000

Missing values

2023-12-12T12:58:01.136010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T12:58:01.275895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기준연도업종명평균사용건수총사용건수사용비율
02020골프장15.11510
12020공공편의서비스0.000
22020교육및 교구37.03701
32020기타교육,교습,학원37.03701
42020기타레저,스포츠용품1127.01127033
52020기타서비스0.000
62020기타 취미,레저, 서비스99.79973
72020당구장0.000
82020무술도장 등(학원)43.84381
92020문구,사무용품157.815785
기준연도업종명평균사용건수총사용건수사용비율
622022일반의류1.2120
632022입시학원, 보습학원50.35032
642022자전거5.5550
652022전문스포츠용품점242.824288
662022초,중,고등학교13.41340
672022총포류6.0600
682022헬스클럽28.12811
692022구두,신발0.110
702022판촉물. 인쇄.복사1.0100
712022기타상품판매점0.110