Overview

Dataset statistics

Number of variables2
Number of observations1329
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory23.5 KiB
Average record size in memory18.1 B

Variable types

Numeric2

Dataset

Description한국기술교육대학교 온라인평생교육원 스마트 직업훈련 플랫폼 (STEP)에 대한 학습자 그룹 관련 내용을 제공합니다.
Author한국기술교육대학교
URLhttps://www.data.go.kr/data/15091064/fileData.do

Alerts

그룹ID is highly overall correlated with 과정IDHigh correlation
과정ID is highly overall correlated with 그룹IDHigh correlation
그룹ID has unique valuesUnique

Reproduction

Analysis started2023-12-12 13:07:20.093228
Analysis finished2023-12-12 13:07:20.638201
Duration0.54 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

그룹ID
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct1329
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2165.7524
Minimum8
Maximum4165
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size11.8 KiB
2023-12-12T22:07:20.756713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum8
5-th percentile374.2
Q11171
median2167
Q33163
95-th percentile3959.8
Maximum4165
Range4157
Interquartile range (IQR)1992

Descriptive statistics

Standard deviation1153.629
Coefficient of variation (CV)0.53266892
Kurtosis-1.189058
Mean2165.7524
Median Absolute Deviation (MAD)996
Skewness-0.0071414363
Sum2878285
Variance1330859.9
MonotonicityNot monotonic
2023-12-12T22:07:20.942740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
8 1
 
0.1%
3202 1
 
0.1%
3196 1
 
0.1%
3193 1
 
0.1%
3190 1
 
0.1%
3187 1
 
0.1%
3184 1
 
0.1%
3181 1
 
0.1%
3178 1
 
0.1%
3175 1
 
0.1%
Other values (1319) 1319
99.2%
ValueCountFrequency (%)
8 1
0.1%
10 1
0.1%
12 1
0.1%
19 1
0.1%
22 1
0.1%
25 1
0.1%
28 1
0.1%
31 1
0.1%
34 1
0.1%
35 1
0.1%
ValueCountFrequency (%)
4165 1
0.1%
4156 1
0.1%
4153 1
0.1%
4150 1
0.1%
4147 1
0.1%
4144 1
0.1%
4141 1
0.1%
4138 1
0.1%
4135 1
0.1%
4132 1
0.1%

과정ID
Real number (ℝ)

HIGH CORRELATION 

Distinct47
Distinct (%)3.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean107510.36
Minimum33796
Maximum112555
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size11.8 KiB
2023-12-12T22:07:21.055561image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum33796
5-th percentile107959
Q1107986
median108019
Q3108034
95-th percentile108061
Maximum112555
Range78759
Interquartile range (IQR)48

Descriptive statistics

Standard deviation5656.7683
Coefficient of variation (CV)0.05261603
Kurtosis125.76143
Mean107510.36
Median Absolute Deviation (MAD)18
Skewness-11.186113
Sum1.4288127 × 108
Variance31999027
MonotonicityIncreasing
2023-12-12T22:07:21.183035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=47)
ValueCountFrequency (%)
108010 137
 
10.3%
108016 137
 
10.3%
108061 64
 
4.8%
108037 59
 
4.4%
108034 59
 
4.4%
108046 59
 
4.4%
108031 59
 
4.4%
108028 59
 
4.4%
108025 59
 
4.4%
108022 59
 
4.4%
Other values (37) 578
43.5%
ValueCountFrequency (%)
33796 3
 
0.2%
48032 4
 
0.3%
48072 2
 
0.2%
57426 1
 
0.1%
61677 1
 
0.1%
107959 57
4.3%
107965 57
4.3%
107971 57
4.3%
107974 57
4.3%
107980 57
4.3%
ValueCountFrequency (%)
112555 1
0.1%
108421 1
0.1%
108307 1
0.1%
108304 1
0.1%
108301 1
0.1%
108295 1
0.1%
108292 1
0.1%
108289 1
0.1%
108286 1
0.1%
108283 1
0.1%

Interactions

2023-12-12T22:07:20.351167image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:07:20.154516image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:07:20.435397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:07:20.243309image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T22:07:21.266921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
그룹ID과정ID
그룹ID1.0000.319
과정ID0.3191.000
2023-12-12T22:07:21.347838image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
그룹ID과정ID
그룹ID1.0000.540
과정ID0.5401.000

Missing values

2023-12-12T22:07:20.552090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T22:07:20.614381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

그룹ID과정ID
0833796
11033796
21233796
32548032
42848032
53148032
63448032
71948072
82248072
93557426
그룹ID과정ID
13194120108283
13204144108286
13214108108289
13224111108292
13234105108295
13244132108301
13254147108304
13264150108307
13274153108421
13284165112555