Overview

Dataset statistics

Number of variables: 7
Number of observations: 10000
Missing cells: 1
Missing cells (%): < 0.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 664.1 KiB
Average record size in memory: 68.0 B
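
The derived figures in this block follow from the raw counts. The sketch below reproduces them by plain arithmetic; the variable names are illustrative, and it assumes 1 KiB = 1024 bytes.

```python
# Figures copied from the "Dataset statistics" block.
n_variables = 7
n_observations = 10_000
missing_cells = 1
total_size_kib = 664.1

# Every observation has one cell per variable.
total_cells = n_variables * n_observations

# Share of missing cells, as a percentage of all cells.
missing_pct = 100 * missing_cells / total_cells

# Average bytes per record, assuming 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.4f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

A single missing cell out of 70000 is far below the 0.1% display threshold, which is why the report shows "< 0.1%".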

Variable types

Numeric: 2
Categorical: 4
DateTime: 1

Dataset

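Description: Provides information on survey requests for the smart vocational training platform (STEP) of the Online Lifelong Education Institute at Korea University of Technology and Education.
Author: Korea University of Technology and Education
URL: https://www.data.go.kr/data/15090921/fileData.do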

Alerts

타입 코드 is highly overall correlated with 제목 and 1 other field [High correlation]
과목 평가항목 타입 코드 is highly overall correlated with 타입 코드 [High correlation]
아이디 is highly overall correlated with 과목 아이디 and 2 other fields [High correlation]
과목 아이디 is highly overall correlated with 아이디 and 1 other field [High correlation]
제목 is highly overall correlated with 아이디 and 2 other fields [High correlation]
등록 국가 is highly overall correlated with 아이디 [High correlation]
타입 코드 is highly imbalanced (91.9%) [Imbalance]
아이디 has unique values [Unique]
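
The 91.9% imbalance figure for 타입 코드 is consistent with an entropy-based score, 1 - H / ln(k), where H is the Shannon entropy of the class frequencies and k is the number of classes. The sketch below applies that formula to the counts from the Common Values table. That the profiler computes the score exactly this way is an assumption, and imbalance_score is a hypothetical helper name.

```python
import math

# Counts for 타입 코드, copied from its Common Values table.
counts = {"3": 9759, "1": 188, "7": 36, "2": 16, "9": 1}


def imbalance_score(value_counts):
    """Return 1 - H / ln(k): 0 for a uniform distribution, near 1 when one class dominates."""
    k = len(value_counts)
    if k < 2:
        raise ValueError("need at least two classes")
    total = sum(value_counts.values())
    # Shannon entropy in nats; zero counts contribute nothing.
    entropy = -sum(
        (c / total) * math.log(c / total) for c in value_counts.values() if c > 0
    )
    return 1 - entropy / math.log(k)


score = imbalance_score(counts)
print(f"{score:.1%}")
```

Because 97.6% of rows share the value 3, the entropy is small relative to ln(5) and the score lands close to 1.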

Reproduction

Analysis started: 2023-12-12 22:31:08.959085
Analysis finished: 2023-12-12 22:31:09.911949
Duration: 0.95 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
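
The reported duration is the difference between the two timestamps. A minimal check with the standard library, assuming both timestamps are in the same time zone:

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"

# Timestamps copied from the "Reproduction" block.
started = datetime.strptime("2023-12-12 22:31:08.959085", FMT)
finished = datetime.strptime("2023-12-12 22:31:09.911949", FMT)

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```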

Variables

아이디
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 123405.29
Minimum: 26
Maximum: 261322
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 26
5-th percentile: 14231.8
Q1: 61317.5
Median: 124549
Q3: 178744
95-th percentile: 246769.75
Maximum: 261322
Range: 261296
Interquartile range (IQR): 117426.5

Descriptive statistics

Standard deviation: 71961.078
Coefficient of variation (CV): 0.583128
Kurtosis: -1.0455744
Mean: 123405.29
Median Absolute Deviation (MAD): 60898
Skewness: 0.14413059
Sum: 1.2340529 × 10^9
Variance: 5.1783967 × 10^9
Monotonicity: Not monotonic
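
Several of the 아이디 statistics are derived from others in the same report. The sketch below recomputes them from the rounded figures shown, so the results agree only to the displayed precision; the variable names are illustrative.

```python
# Figures for 아이디 copied from the report (already rounded for display).
n = 10_000
minimum, maximum = 26, 261_322
q1, q3 = 61_317.5, 178_744
mean, std = 123_405.29, 71_961.078

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range
cv = std / mean                   # Coefficient of variation
variance = std ** 2               # Variance is the squared standard deviation
total = mean * n                  # Sum = mean x number of non-missing values

print(value_range, iqr)
print(f"CV: {cv:.6f}")
print(f"variance: {variance:.7e}")
print(f"sum: {total:.7e}")
```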
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
86425 | 1 | < 0.1%
14065 | 1 | < 0.1%
217965 | 1 | < 0.1%
16420 | 1 | < 0.1%
158355 | 1 | < 0.1%
110008 | 1 | < 0.1%
120679 | 1 | < 0.1%
209157 | 1 | < 0.1%
92926 | 1 | < 0.1%
150857 | 1 | < 0.1%
Other values (9990) | 9990 | 99.9%

Value | Count | Frequency (%)
26 | 1 | < 0.1%
56 | 1 | < 0.1%
128 | 1 | < 0.1%
215 | 1 | < 0.1%
227 | 1 | < 0.1%
230 | 1 | < 0.1%
245 | 1 | < 0.1%
254 | 1 | < 0.1%
269 | 1 | < 0.1%
311 | 1 | < 0.1%

Value | Count | Frequency (%)
261322 | 1 | < 0.1%
261298 | 1 | < 0.1%
261274 | 1 | < 0.1%
261250 | 1 | < 0.1%
261235 | 1 | < 0.1%
261187 | 1 | < 0.1%
261151 | 1 | < 0.1%
261145 | 1 | < 0.1%
261139 | 1 | < 0.1%
261130 | 1 | < 0.1%

타입 코드
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 5
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
3 | 9759
1 | 188
7 | 36
2 | 16
9 | 1

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 3
2nd row: 3
3rd row: 3
4th row: 3
5th row: 3

Common Values

Value | Count | Frequency (%)
3 | 9759 | 97.6%
1 | 188 | 1.9%
7 | 36 | 0.4%
2 | 16 | 0.2%
9 | 1 | < 0.1%

Length

Histogram of lengths of the category

제목
Categorical

HIGH CORRELATION 

Distinct: 46
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
평생연수 기수제 설문(주관식 선택 입력)_20210517 | 3547
평생연수 설문_20200327 | 1999
과정 설문(강사 만족도 포함) | 1839
평생연수 기수제 설문(주관식 선택)_20230306 | 681
평생연수 과정 설문 | 367
Other values (41) | 1567

Length

Max length: 48
Median length: 42
Mean length: 23.1602
Min length: 7

Unique

Unique: 13
Unique (%): 0.1%

Sample

1st row: 평생연수 설문_20200327
2nd row: 평생연수 기수제 설문(주관식 선택 입력)_20210517
3rd row: 평생연수 기수제 설문(주관식 선택 입력)_20210517
4th row: 평생연수 기수제 설문(주관식 선택)_20230306
5th row: 과정 설문(강사 만족도 포함)

Common Values

Value | Count | Frequency (%)
평생연수 기수제 설문(주관식 선택 입력)_20210517 | 3547 | 35.5%
평생연수 설문_20200327 | 1999 | 20.0%
과정 설문(강사 만족도 포함) | 1839 | 18.4%
평생연수 기수제 설문(주관식 선택)_20230306 | 681 | 6.8%
평생연수 과정 설문 | 367 | 3.7%
현대자동차 협력사 대상 이러닝 (2022년 NEW) | 271 | 2.7%
평생연수 과정 설문(학습기간 선호도 추가) | 264 | 2.6%
평생연수 과정 설문(수료기준 변경) | 163 | 1.6%
평생연수 과정 설문(수료기준 변경)_191209 | 161 | 1.6%
평생연수 과정 설문(수료기준 변경)_카테고리 설정 | 90 | 0.9%
Other values (36) | 618 | 6.2%

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
평생연수 | 7416 | 18.3%
기수제 | 4228 | 10.4%
설문(주관식 | 4228 | 10.4%
선택 | 3547 | 8.8%
입력)_20210517 | 3547 | 8.8%
과정 | 3163 | 7.8%
설문_20200327 | 1999 | 4.9%
만족도 | 1932 | 4.8%
포함 | 1862 | 4.6%
설문(강사 | 1839 | 4.5%
Other values (97) | 6766 | 16.7%
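
The word counts above appear to come from splitting each 제목 value on whitespace and weighting every token by the number of rows that carry the title. The sketch below reproduces three of the counts from the two 기수제 titles; the exact tokenization the profiler uses is an assumption. The 포함 entry suggests it also strips punctuation from the edges of a token, which this sketch does not model.

```python
from collections import Counter

# Two 제목 values and their row counts, copied from the Common Values table.
title_counts = {
    "평생연수 기수제 설문(주관식 선택 입력)_20210517": 3547,
    "평생연수 기수제 설문(주관식 선택)_20230306": 681,
}

word_counts = Counter()
for title, rows in title_counts.items():
    # Split on whitespace only; punctuation inside a token stays attached.
    for token in title.split():
        word_counts[token] += rows

print(word_counts["기수제"])
print(word_counts["설문(주관식"])
print(word_counts["선택"])
```

Only the first title contributes a bare 선택 token; in the second title the token is 선택)_20230306, which is why 선택 stays at 3547 while 기수제 reaches 4228.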

과목 아이디
Real number (ℝ)

HIGH CORRELATION 

Distinct: 9922
Distinct (%): 99.2%
Missing: 1
Missing (%): < 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 162952.64
Minimum: 2608
Maximum: 324964
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 2608
5-th percentile: 20645.2
Q1: 85021.5
Median: 169111
Q3: 234517
95-th percentile: 303905.8
Maximum: 324964
Range: 322356
Interquartile range (IQR): 149495.5

Descriptive statistics

Standard deviation: 89673.384
Coefficient of variation (CV): 0.55030335
Kurtosis: -1.1646886
Mean: 162952.64
Median Absolute Deviation (MAD): 78138
Skewness: -0.032911229
Sum: 1.6293635 × 10^9
Variance: 8.0413158 × 10^9
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
2608 | 4 | < 0.1%
16818 | 3 | < 0.1%
16629 | 2 | < 0.1%
214285 | 2 | < 0.1%
66491 | 2 | < 0.1%
180761 | 2 | < 0.1%
214185 | 2 | < 0.1%
16662 | 2 | < 0.1%
297901 | 2 | < 0.1%
53001 | 2 | < 0.1%
Other values (9912) | 9976 | 99.8%

Value | Count | Frequency (%)
2608 | 4 | < 0.1%
9005 | 1 | < 0.1%
9009 | 1 | < 0.1%
9013 | 1 | < 0.1%
9016 | 1 | < 0.1%
9020 | 1 | < 0.1%
9061 | 1 | < 0.1%
9073 | 1 | < 0.1%
9074 | 1 | < 0.1%
9091 | 1 | < 0.1%

Value | Count | Frequency (%)
324964 | 1 | < 0.1%
324943 | 1 | < 0.1%
324913 | 1 | < 0.1%
324856 | 1 | < 0.1%
324844 | 1 | < 0.1%
324790 | 1 | < 0.1%
324760 | 1 | < 0.1%
324730 | 1 | < 0.1%
324709 | 1 | < 0.1%
324547 | 1 | < 0.1%

과목 평가항목 타입 코드
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
10 | 6861
<NA> | 3126
30 | 13

Length

Max length: 4
Median length: 2
Mean length: 2.6252
Min length: 2
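
These length statistics can be recovered from the value counts, and doing so hints at how the column is stored. A maximum length of 4 together with Missing: 0 suggests that "<NA>" is held as a literal four-character string rather than as a true missing value. That reading is an inference from the report, not something it states. A minimal sketch:

```python
# Value counts for 과목 평가항목 타입 코드, copied from its Common Values table.
value_counts = {"10": 6861, "<NA>": 3126, "30": 13}

total_rows = sum(value_counts.values())

# Total characters across all rows, treating "<NA>" as an ordinary 4-character string.
total_chars = sum(len(value) * count for value, count in value_counts.items())

mean_length = total_chars / total_rows
min_length = min(len(value) for value in value_counts)
max_length = max(len(value) for value in value_counts)

print(f"mean length: {mean_length:.4f}")
print(f"min length: {min_length}, max length: {max_length}")
```

The same calculation on 등록 국가 (KR: 7383, UNKNOWN: 2617) gives 33085 / 10000 = 3.3085, matching that column's mean length.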

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 10
2nd row: 10
3rd row: 10
4th row: 10
5th row: <NA>

Common Values

Value | Count | Frequency (%)
10 | 6861 | 68.6%
<NA> | 3126 | 31.3%
30 | 13 | 0.1%

Length

Histogram of lengths of the category

등록 국가
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
KR | 7383
UNKNOWN | 2617

Length

Max length: 7
Median length: 2
Mean length: 3.3085
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: UNKNOWN
2nd row: KR
3rd row: KR
4th row: KR
5th row: KR

Common Values

Value | Count | Frequency (%)
KR | 7383 | 73.8%
UNKNOWN | 2617 | 26.2%

Length

Histogram of lengths of the category

등록 일시
DateTime

Distinct: 618
Distinct (%): 6.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2016-09-29 11:13:23
Maximum: 2023-04-27 13:46:57
Histogram with fixed size bins (bins=50)

Correlations

Variable | 아이디 | 타입 코드 | 제목 | 과목 아이디 | 과목 평가항목 타입 코드 | 등록 국가
아이디 | 1.000 | 0.211 | 0.921 | 0.988 | 0.104 | 0.669
타입 코드 | 0.211 | 1.000 | 0.910 | 0.229 | NaN | 0.019
제목 | 0.921 | 0.910 | 1.000 | 0.913 | 0.401 | 0.490
과목 아이디 | 0.988 | 0.229 | 0.913 | 1.000 | 0.105 | 0.615
과목 평가항목 타입 코드 | 0.104 | NaN | 0.401 | 0.105 | 1.000 | 0.000
등록 국가 | 0.669 | 0.019 | 0.490 | 0.615 | 0.000 | 1.000

Variable | 제목 | 타입 코드 | 과목 평가항목 타입 코드 | 등록 국가
제목 | 1.000 | 0.691 | 0.318 | 0.391
타입 코드 | 0.691 | 1.000 | 1.000 | 0.023
과목 평가항목 타입 코드 | 0.318 | 1.000 | 1.000 | 0.000
등록 국가 | 0.391 | 0.023 | 0.000 | 1.000

Variable | 아이디 | 과목 아이디 | 타입 코드 | 제목 | 과목 평가항목 타입 코드 | 등록 국가
아이디 | 1.000 | 0.999 | 0.089 | 0.636 | 0.080 | 0.520
과목 아이디 | 0.999 | 1.000 | 0.097 | 0.607 | 0.082 | 0.476
타입 코드 | 0.089 | 0.097 | 1.000 | 0.691 | 1.000 | 0.023
제목 | 0.636 | 0.607 | 0.691 | 1.000 | 0.318 | 0.391
과목 평가항목 타입 코드 | 0.080 | 0.082 | 1.000 | 0.318 | 1.000 | 0.000
등록 국가 | 0.520 | 0.476 | 0.023 | 0.391 | 0.000 | 1.000

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

(index) | 아이디 | 타입 코드 | 제목 | 과목 아이디 | 과목 평가항목 타입 코드 | 등록 국가 | 등록 일시
33989 | 86425 | 3 | 평생연수 설문_20200327 | 119890 | 10 | UNKNOWN | 2020-05-27 09:57:33
84760 | 222865 | 3 | 평생연수 기수제 설문(주관식 선택 입력)_20210517 | 288301 | 10 | KR | 2023-02-03 16:35:37
65154 | 159977 | 3 | 평생연수 기수제 설문(주관식 선택 입력)_20210517 | 216539 | 10 | KR | 2022-03-29 15:50:30
90199 | 244621 | 3 | 평생연수 기수제 설문(주관식 선택)_20230306 | 282817 | 10 | KR | 2023-03-07 10:49:44
22378 | 60162 | 3 | 과정 설문(강사 만족도 포함) | 84852 | <NA> | KR | 2019-05-29 16:36:55
27938 | 69316 | 3 | 평생연수 과정 설문(수료기준 변경)_카테고리 설정 | 92947 | 10 | KR | 2019-12-06 11:34:03
19002 | 49678 | 3 | 평생연수 과정 설문(학습 충실도 문항 추가) | 66573 | 10 | KR | 2019-01-29 17:38:33
39842 | 103984 | 3 | 평생연수 설문_20200327 | 142279 | 10 | UNKNOWN | 2020-09-09 11:06:05
17395 | 46469 | 3 | 평생연수 과정 설문(학습기간 선호도 추가) | 61742 | 10 | KR | 2018-11-28 11:34:19
10543 | 31189 | 1 | 과정 설문(강사 만족도 포함) | 37754 | <NA> | KR | 2018-04-05 10:48:54

(index) | 아이디 | 타입 코드 | 제목 | 과목 아이디 | 과목 평가항목 타입 코드 | 등록 국가 | 등록 일시
36276 | 93286 | 3 | 평생연수 설문_20200327 | 129007 | 10 | UNKNOWN | 2020-07-09 08:52:04
76639 | 192532 | 3 | 평생연수 기수제 설문(주관식 선택 입력)_20210517 | 248482 | 10 | UNKNOWN | 2022-10-13 09:28:03
95233 | 259876 | 3 | 평생연수 기수제 설문(주관식 선택)_20230306 | 322843 | 10 | KR | 2023-04-27 13:27:44
42345 | 111493 | 3 | 평생연수 설문_20200327 | 156946 | 10 | KR | 2020-10-23 15:35:05
50592 | 130825 | 3 | 평생연수 기수제 설문(주관식 선택 입력)_20210517 | 181481 | <NA> | KR | 2021-05-28 11:19:06
187 | 558 | 3 | e-koreatech 이러닝 강의 만족도 | 10061 | 10 | KR | 2016-09-29 13:15:11
78048 | 196759 | 3 | 평생연수 기수제 설문(주관식 선택 입력)_20210517 | 255481 | 10 | UNKNOWN | 2022-11-11 09:05:53
93944 | 256009 | 3 | 현대자동차 협력사 대상 이러닝 (2022년 NEW) | 316891 | 10 | KR | 2023-03-29 11:22:27
36815 | 94903 | 3 | 평생연수 설문_20200327 | 130894 | 10 | UNKNOWN | 2020-07-30 09:38:16
59173 | 148007 | 3 | 평생연수 기수제 설문(주관식 선택 입력)_20210517 | 199481 | 10 | UNKNOWN | 2021-11-09 15:54:01