Overview

Dataset statistics

Number of variables4
Number of observations136
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.6 KiB
Average record size in memory35.0 B

Variable types

Categorical3
Numeric1

Dataset

Description- 시도 및 학교별 학생 1인당 월평균 사교육비 정보를 제공합니다. - 학생 1인당 월평균 사교육비는 우리나라 초중고 전체 학생(사교육을 받지 않은 학생 포함)을 대상으로 한 평균 금액입니다. - 데이터 제공처: KOSIS 국가통계포털
Author제주특별자치도 미래성장과
URLhttps://www.jejudatahub.net/data/view/data/886

Alerts

사교육비(만원) is highly overall correlated with 학교 구분High correlation
학교 구분 is highly overall correlated with 사교육비(만원)High correlation

Reproduction

Analysis started2023-12-11 20:17:21.192371
Analysis finished2023-12-11 20:17:21.767211
Duration0.57 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기준연도
Categorical

Distinct2
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2020
68 
2019
68 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020
2nd row2020
3rd row2020
4th row2020
5th row2020

Common Values

ValueCountFrequency (%)
2020 68
50.0%
2019 68
50.0%

Length

2023-12-12T05:17:21.815970image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T05:17:21.902273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020 68
50.0%
2019 68
50.0%

시도
Categorical

Distinct17
Distinct (%)12.5%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
서울
 
8
부산
 
8
대구
 
8
인천
 
8
광주
 
8
Other values (12)
96 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울
2nd row부산
3rd row대구
4th row인천
5th row광주

Common Values

ValueCountFrequency (%)
서울 8
 
5.9%
부산 8
 
5.9%
대구 8
 
5.9%
인천 8
 
5.9%
광주 8
 
5.9%
대전 8
 
5.9%
울산 8
 
5.9%
세종 8
 
5.9%
경기 8
 
5.9%
강원 8
 
5.9%
Other values (7) 56
41.2%

Length

2023-12-12T05:17:21.989821image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
서울 8
 
5.9%
강원 8
 
5.9%
경남 8
 
5.9%
경북 8
 
5.9%
전남 8
 
5.9%
전북 8
 
5.9%
충남 8
 
5.9%
충북 8
 
5.9%
경기 8
 
5.9%
부산 8
 
5.9%
Other values (7) 56
41.2%

학교 구분
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
초등학교
34 
중학교
34 
고등학교
34 
일반고
34 

Length

Max length4
Median length3.5
Mean length3.5
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row초등학교
2nd row초등학교
3rd row초등학교
4th row초등학교
5th row초등학교

Common Values

ValueCountFrequency (%)
초등학교 34
25.0%
중학교 34
25.0%
고등학교 34
25.0%
일반고 34
25.0%

Length

2023-12-12T05:17:22.076138image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T05:17:22.154793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
초등학교 34
25.0%
중학교 34
25.0%
고등학교 34
25.0%
일반고 34
25.0%

사교육비(만원)
Real number (ℝ)

HIGH CORRELATION 

Distinct117
Distinct (%)86.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean47.002941
Minimum21.8
Maximum87.3
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2023-12-12T05:17:22.244123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum21.8
5-th percentile27.075
Q137.95
median46.65
Q353.975
95-th percentile66.85
Maximum87.3
Range65.5
Interquartile range (IQR)16.025

Descriptive statistics

Standard deviation13.408554
Coefficient of variation (CV)0.28527053
Kurtosis0.26009504
Mean47.002941
Median Absolute Deviation (MAD)8.65
Skewness0.43589103
Sum6392.4
Variance179.78932
MonotonicityNot monotonic
2023-12-12T05:17:22.353218image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
60.6 2
 
1.5%
27.0 2
 
1.5%
41.3 2
 
1.5%
51.4 2
 
1.5%
48.6 2
 
1.5%
46.8 2
 
1.5%
46.1 2
 
1.5%
53.1 2
 
1.5%
45.0 2
 
1.5%
49.0 2
 
1.5%
Other values (107) 116
85.3%
ValueCountFrequency (%)
21.8 1
0.7%
23.3 1
0.7%
23.8 1
0.7%
24.3 1
0.7%
26.2 1
0.7%
27.0 2
1.5%
27.1 1
0.7%
27.6 1
0.7%
27.7 1
0.7%
28.0 1
0.7%
ValueCountFrequency (%)
87.3 1
0.7%
85.6 1
0.7%
82.9 1
0.7%
81.9 1
0.7%
69.9 1
0.7%
68.8 1
0.7%
67.0 1
0.7%
66.8 1
0.7%
66.3 1
0.7%
65.0 1
0.7%

Interactions

2023-12-12T05:17:21.338195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T05:17:22.444325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준연도시도학교 구분사교육비(만원)
기준연도1.0000.0000.0000.000
시도0.0001.0000.0000.610
학교 구분0.0000.0001.0000.769
사교육비(만원)0.0000.6100.7691.000
2023-12-12T05:17:22.531931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준연도학교 구분시도
기준연도1.0000.0000.000
학교 구분0.0001.0000.000
시도0.0000.0001.000
2023-12-12T05:17:22.605437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사교육비(만원)기준연도시도학교 구분
사교육비(만원)1.0000.0000.2890.618
기준연도0.0001.0000.0000.000
시도0.2890.0001.0000.000
학교 구분0.6180.0000.0001.000

Missing values

2023-12-12T05:17:21.677159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T05:17:21.741158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기준연도시도학교 구분사교육비(만원)
02020서울초등학교42.4
12020부산초등학교29.3
22020대구초등학교29.6
32020인천초등학교32.8
42020광주초등학교34.2
52020대전초등학교30.2
62020울산초등학교27.7
72020세종초등학교30.7
82020경기초등학교32.0
92020강원초등학교28.0
기준연도시도학교 구분사교육비(만원)
1262019세종일반고59.9
1272019경기일반고66.3
1282019강원일반고48.2
1292019충북일반고49.0
1302019충남일반고48.2
1312019전북일반고51.4
1322019전남일반고44.0
1332019경북일반고43.5
1342019경남일반고46.0
1352019제주일반고52.7