Overview

Dataset statistics

Number of variables: 2
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 10
Duplicate rows (%): 0.1%
Total size in memory: 244.1 KiB
Average record size in memory: 25.0 B

Variable types

Numeric: 1
Categorical: 1

Dataset

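Description: Provides information on favorite-course categories for the Smart Vocational Training Platform (STEP) of the Online Lifelong Education Institute at Korea University of Technology and Education (한국기술교육대학교).
Author: Korea University of Technology and Education (한국기술교육대학교)
URL: https://www.data.go.kr/data/15090903/fileData.do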

Alerts

마이그레이션 원천 구분 has constant value "OLEIPORTAL" (Constant)
Dataset has 10 (0.1%) duplicate rows (Duplicates)

Reproduction

Analysis started: 2023-12-12 14:53:32.212089
Analysis finished: 2023-12-12 14:53:32.522162
Duration: 0.31 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
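
The reported duration can be checked against the two timestamps. The following is a minimal sketch using only Python's standard library; the variable names are illustrative and not part of the report.

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" block of the report.
started = datetime.fromisoformat("2023-12-12 14:53:32.212089")
finished = datetime.fromisoformat("2023-12-12 14:53:32.522162")

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"Duration: {duration:.2f} seconds")
```

Rounded to two decimals, the difference agrees with the 0.31 seconds shown in the report.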

Variables

과목 카테고리 코드 (course category code)
Numeric

Distinct: 10
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 920.1187
Minimum: 130
Maximum: 1140
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 130
5-th percentile: 130
Q1: 1108
Median: 1116
Q3: 1120
95-th percentile: 1140
Maximum: 1140
Range: 1010
Interquartile range (IQR): 12
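
Because the column has only ten distinct values, the quantiles above can be recomputed from the value counts alone. The sketch below assumes linear interpolation between order statistics (the pandas default, which the report is assumed to use); `COUNTS`, `value_at`, and `quantile` are illustrative names, not part of ydata-profiling.

```python
from bisect import bisect_right
from itertools import accumulate

# Value -> count, copied from the report's frequency table.
COUNTS = {130: 727, 150: 1314, 1108: 824, 1114: 499, 1115: 1445,
          1116: 207, 1117: 204, 1119: 1936, 1120: 1895, 1140: 949}

VALUES = sorted(COUNTS)
# CUMULATIVE[i] = number of observations <= VALUES[i]
CUMULATIVE = list(accumulate(COUNTS[v] for v in VALUES))
N = CUMULATIVE[-1]


def value_at(rank):
    """Return the observation at 0-based position `rank` in sorted order."""
    return VALUES[bisect_right(CUMULATIVE, rank)]


def quantile(q):
    """Quantile by linear interpolation between neighbouring observations."""
    pos = q * (N - 1)
    lo = int(pos)
    hi = min(lo + 1, N - 1)
    frac = pos - lo
    return value_at(lo) + frac * (value_at(hi) - value_at(lo))


q1, median, q3 = quantile(0.25), quantile(0.5), quantile(0.75)
iqr = q3 - q1
value_range = quantile(1.0) - quantile(0.0)
print(q1, median, q3, iqr, value_range)
```

Under that assumption the results agree with the listed Q1, median, Q3, IQR, and range. The 5-th percentile equals the minimum because 727 of the 10000 observations (7.3%) share the value 130.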

Descriptive statistics

Standard deviation: 393.70857
Coefficient of variation (CV): 0.42788889
Kurtosis: 0.15706197
Mean: 920.1187
Median Absolute Deviation (MAD): 4
Skewness: -1.467655
Sum: 9201187
Variance: 155006.44
Monotonicity: Not monotonic
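
Several of these figures can be cross-checked from the value counts. The sketch below is a standard-library recomputation, assuming the report uses the sample variance (n - 1 denominator) and defines MAD as the median of absolute deviations from the median; all names are illustrative.

```python
import math
import statistics

# Value -> count, copied from the report's frequency table.
COUNTS = {130: 727, 150: 1314, 1108: 824, 1114: 499, 1115: 1445,
          1116: 207, 1117: 204, 1119: 1936, 1120: 1895, 1140: 949}

n = sum(COUNTS.values())
total = sum(v * c for v, c in COUNTS.items())
mean = total / n

# Sample variance: squared deviations weighted by count, divided by n - 1.
variance = sum(c * (v - mean) ** 2 for v, c in COUNTS.items()) / (n - 1)
std = math.sqrt(variance)
cv = std / mean  # coefficient of variation

# MAD: expand the counts into individual observations, then take the
# median of the absolute deviations from the median.
data = [v for v, c in COUNTS.items() for _ in range(c)]
med = statistics.median(data)
mad = statistics.median(abs(x - med) for x in data)

print(total, mean, round(variance, 2), round(std, 5), round(cv, 8), mad)
```

The large gap between the standard deviation (about 394) and the MAD (4) reflects the two low codes, 130 and 150, which together account for about 20% of the rows and sit far from the cluster around 1108 to 1140.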
Histogram with fixed size bins (bins=10)
Common values
Value  Count  Frequency (%)
1119   1936   19.4%
1120   1895   18.9%
1115   1445   14.4%
150    1314   13.1%
1140   949    9.5%
1108   824    8.2%
130    727    7.3%
1114   499    5.0%
1116   207    2.1%
1117   204    2.0%
Minimum 10 values
Value  Count  Frequency (%)
130    727    7.3%
150    1314   13.1%
1108   824    8.2%
1114   499    5.0%
1115   1445   14.4%
1116   207    2.1%
1117   204    2.0%
1119   1936   19.4%
1120   1895   18.9%
1140   949    9.5%
Maximum 10 values
Value  Count  Frequency (%)
1140   949    9.5%
1120   1895   18.9%
1119   1936   19.4%
1117   204    2.0%
1116   207    2.1%
1115   1445   14.4%
1114   499    5.0%
1108   824    8.2%
150    1314   13.1%
130    727    7.3%

마이그레이션 원천 구분 (migration source type)
Categorical

CONSTANT

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Value       Count
OLEIPORTAL  10000

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: OLEIPORTAL
2nd row: OLEIPORTAL
3rd row: OLEIPORTAL
4th row: OLEIPORTAL
5th row: OLEIPORTAL

Common Values

Value       Count  Frequency (%)
OLEIPORTAL  10000  100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value       Count  Frequency (%)
oleiportal  10000  100.0%

Interactions


Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

(index)  과목 카테고리 코드  마이그레이션 원천 구분
13717    1108                OLEIPORTAL
56509    150                 OLEIPORTAL
13029    1120                OLEIPORTAL
28014    130                 OLEIPORTAL
49675    1114                OLEIPORTAL
62073    1108                OLEIPORTAL
72467    1120                OLEIPORTAL
78576    1120                OLEIPORTAL
49945    1120                OLEIPORTAL
43181    1116                OLEIPORTAL

(index)  과목 카테고리 코드  마이그레이션 원천 구분
6699     150                 OLEIPORTAL
79417    1120                OLEIPORTAL
67884    150                 OLEIPORTAL
13972    150                 OLEIPORTAL
1008     1140                OLEIPORTAL
80221    1140                OLEIPORTAL
74309    1120                OLEIPORTAL
62231    1108                OLEIPORTAL
13894    1115                OLEIPORTAL
59337    1119                OLEIPORTAL

Duplicate rows

Most frequently occurring

(index)  과목 카테고리 코드  마이그레이션 원천 구분  # duplicates
7        1119                OLEIPORTAL              1936
8        1120                OLEIPORTAL              1895
4        1115                OLEIPORTAL              1445
1        150                 OLEIPORTAL              1314
9        1140                OLEIPORTAL              949
2        1108                OLEIPORTAL              824
0        130                 OLEIPORTAL              727
3        1114                OLEIPORTAL              499
5        1116                OLEIPORTAL              207
6        1117                OLEIPORTAL              204
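
The "10 (0.1%) duplicate rows" figure appears to count distinct duplicated row patterns rather than individual repeated rows. That reading is an inference from the numbers above, not something the report states. The sketch below rebuilds the rows from the value counts and groups them with the standard library; all names are illustrative.

```python
from collections import Counter

# Value -> count, copied from the report's frequency table.
COUNTS = {130: 727, 150: 1314, 1108: 824, 1114: 499, 1115: 1445,
          1116: 207, 1117: 204, 1119: 1936, 1120: 1895, 1140: 949}

# Rebuild the two-column rows; the second column is constant.
rows = [(code, "OLEIPORTAL") for code, c in COUNTS.items() for _ in range(c)]

# Group identical rows and keep the patterns that occur more than once.
group_sizes = Counter(rows)
duplicated = {row: k for row, k in group_sizes.items() if k > 1}

n_groups = len(duplicated)
share = n_groups / len(rows)
print(n_groups, f"{share:.1%}")

# Largest groups first, mirroring the "Most frequently occurring" table.
for (code, source), k in group_sizes.most_common(3):
    print(code, source, k)
```

With a constant second column, every row is determined by its category code, so each of the ten codes forms one duplicated pattern and 10 / 10000 gives the reported 0.1%.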