Overview

Dataset statistics

Number of variables7
Number of observations4035
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory224.7 KiB
Average record size in memory57.0 B

Variable types

Text4
Numeric1
Categorical1
Boolean1

Dataset

Description한국자산관리공사에서 운영하는 교육과정 별 설문 문항(질문, 구분, 응답 필수여부)에 대해 정리한 데이터를 제공합니다.
Author한국자산관리공사
URLhttps://www.data.go.kr/data/15111494/fileData.do

Alerts

설문유형 is highly overall correlated with 응답필수여부High correlation
응답필수여부 is highly overall correlated with 설문유형High correlation
응답필수여부 is highly imbalanced (57.0%)Imbalance

Reproduction

Analysis started2023-12-12 06:14:10.303851
Analysis finished2023-12-12 06:14:11.058133
Duration0.75 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct289
Distinct (%)7.2%
Missing0
Missing (%)0.0%
Memory size31.7 KiB
2023-12-12T15:14:11.453636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length4
Mean length4
Min length4

Characters and Unicode

Total characters16140
Distinct characters11
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowR068
2nd rowR068
3rd rowR068
4th rowR068
5th rowR069
ValueCountFrequency (%)
r233 45
 
1.1%
r239 40
 
1.0%
r068 38
 
0.9%
r033 38
 
0.9%
r227 37
 
0.9%
r224 32
 
0.8%
r070 31
 
0.8%
r158 30
 
0.7%
r186 30
 
0.7%
r225 29
 
0.7%
Other values (279) 3685
91.3%
2023-12-12T15:14:12.041050image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
R 4035
25.0%
2 2413
15.0%
0 2075
12.9%
1 1964
12.2%
3 953
 
5.9%
5 854
 
5.3%
6 841
 
5.2%
7 813
 
5.0%
4 804
 
5.0%
8 739
 
4.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 12105
75.0%
Uppercase Letter 4035
 
25.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 2413
19.9%
0 2075
17.1%
1 1964
16.2%
3 953
 
7.9%
5 854
 
7.1%
6 841
 
6.9%
7 813
 
6.7%
4 804
 
6.6%
8 739
 
6.1%
9 649
 
5.4%
Uppercase Letter
ValueCountFrequency (%)
R 4035
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 12105
75.0%
Latin 4035
 
25.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 2413
19.9%
0 2075
17.1%
1 1964
16.2%
3 953
 
7.9%
5 854
 
7.1%
6 841
 
6.9%
7 813
 
6.7%
4 804
 
6.6%
8 739
 
6.1%
9 649
 
5.4%
Latin
ValueCountFrequency (%)
R 4035
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 16140
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
R 4035
25.0%
2 2413
15.0%
0 2075
12.9%
1 1964
12.2%
3 953
 
5.9%
5 854
 
5.3%
6 841
 
5.2%
7 813
 
5.0%
4 804
 
5.0%
8 739
 
4.6%
Distinct238
Distinct (%)5.9%
Missing0
Missing (%)0.0%
Memory size31.7 KiB
2023-12-12T15:14:12.478323image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length4
Mean length4
Min length4

Characters and Unicode

Total characters16140
Distinct characters11
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowS079
2nd rowS079
3rd rowS079
4th rowS079
5th rowS080
ValueCountFrequency (%)
s239 67
 
1.7%
s243 66
 
1.6%
s246 65
 
1.6%
s240 63
 
1.6%
s079 59
 
1.5%
s080 58
 
1.4%
s040 57
 
1.4%
s245 50
 
1.2%
s248 50
 
1.2%
s242 50
 
1.2%
Other values (228) 3450
85.5%
2023-12-12T15:14:12.977598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
S 4035
25.0%
2 2454
15.2%
0 2082
12.9%
1 1766
10.9%
4 1134
 
7.0%
5 1030
 
6.4%
3 876
 
5.4%
6 813
 
5.0%
9 684
 
4.2%
7 640
 
4.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 12105
75.0%
Uppercase Letter 4035
 
25.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 2454
20.3%
0 2082
17.2%
1 1766
14.6%
4 1134
9.4%
5 1030
8.5%
3 876
 
7.2%
6 813
 
6.7%
9 684
 
5.7%
7 640
 
5.3%
8 626
 
5.2%
Uppercase Letter
ValueCountFrequency (%)
S 4035
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 12105
75.0%
Latin 4035
 
25.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 2454
20.3%
0 2082
17.2%
1 1766
14.6%
4 1134
9.4%
5 1030
8.5%
3 876
 
7.2%
6 813
 
6.7%
9 684
 
5.7%
7 640
 
5.3%
8 626
 
5.2%
Latin
ValueCountFrequency (%)
S 4035
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 16140
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
S 4035
25.0%
2 2454
15.2%
0 2082
12.9%
1 1766
10.9%
4 1134
 
7.0%
5 1030
 
6.4%
3 876
 
5.4%
6 813
 
5.0%
9 684
 
4.2%
7 640
 
4.0%
Distinct659
Distinct (%)16.3%
Missing0
Missing (%)0.0%
Memory size31.7 KiB
2023-12-12T15:14:13.335614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length5
Mean length5
Min length5

Characters and Unicode

Total characters20175
Distinct characters11
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique375 ?
Unique (%)9.3%

Sample

1st rowQ0001
2nd rowQ0002
3rd rowQ0003
4th rowQ0004
5th rowQ0005
ValueCountFrequency (%)
q0116 389
 
9.6%
q0070 235
 
5.8%
q0009 224
 
5.6%
q0556 166
 
4.1%
q0006 123
 
3.0%
q0554 95
 
2.4%
q0612 83
 
2.1%
q0606 83
 
2.1%
q0191 67
 
1.7%
q0282 65
 
1.6%
Other values (649) 2505
62.1%
2023-12-12T15:14:13.774369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 6169
30.6%
Q 4035
20.0%
1 1901
 
9.4%
6 1670
 
8.3%
5 1215
 
6.0%
2 1089
 
5.4%
3 1004
 
5.0%
4 906
 
4.5%
7 803
 
4.0%
9 753
 
3.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 16140
80.0%
Uppercase Letter 4035
 
20.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 6169
38.2%
1 1901
 
11.8%
6 1670
 
10.3%
5 1215
 
7.5%
2 1089
 
6.7%
3 1004
 
6.2%
4 906
 
5.6%
7 803
 
5.0%
9 753
 
4.7%
8 630
 
3.9%
Uppercase Letter
ValueCountFrequency (%)
Q 4035
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 16140
80.0%
Latin 4035
 
20.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 6169
38.2%
1 1901
 
11.8%
6 1670
 
10.3%
5 1215
 
7.5%
2 1089
 
6.7%
3 1004
 
6.2%
4 906
 
5.6%
7 803
 
5.0%
9 753
 
4.7%
8 630
 
3.9%
Latin
ValueCountFrequency (%)
Q 4035
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 20175
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 6169
30.6%
Q 4035
20.0%
1 1901
 
9.4%
6 1670
 
8.3%
5 1215
 
6.0%
2 1089
 
5.4%
3 1004
 
5.0%
4 906
 
4.5%
7 803
 
4.0%
9 753
 
3.7%

문항번호
Real number (ℝ)

Distinct45
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8.7692689
Minimum1
Maximum45
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size35.6 KiB
2023-12-12T15:14:13.913133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q14
median8
Q312
95-th percentile21
Maximum45
Range44
Interquartile range (IQR)8

Descriptive statistics

Standard deviation6.4834595
Coefficient of variation (CV)0.73933866
Kurtosis3.3920268
Mean8.7692689
Median Absolute Deviation (MAD)4
Skewness1.5183
Sum35384
Variance42.035248
MonotonicityNot monotonic
2023-12-12T15:14:14.367207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=45)
ValueCountFrequency (%)
2 290
 
7.2%
1 290
 
7.2%
3 289
 
7.2%
4 289
 
7.2%
5 282
 
7.0%
6 281
 
7.0%
7 273
 
6.8%
8 267
 
6.6%
9 262
 
6.5%
10 256
 
6.3%
Other values (35) 1256
31.1%
ValueCountFrequency (%)
1 290
7.2%
2 290
7.2%
3 289
7.2%
4 289
7.2%
5 282
7.0%
6 281
7.0%
7 273
6.8%
8 267
6.6%
9 262
6.5%
10 256
6.3%
ValueCountFrequency (%)
45 1
 
< 0.1%
44 1
 
< 0.1%
43 1
 
< 0.1%
42 1
 
< 0.1%
41 1
 
< 0.1%
40 2
 
< 0.1%
39 2
 
< 0.1%
38 4
0.1%
37 5
0.1%
36 5
0.1%
Distinct642
Distinct (%)15.9%
Missing0
Missing (%)0.0%
Memory size31.7 KiB
2023-12-12T15:14:14.781514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length135
Median length87
Mean length23.72342
Min length2

Characters and Unicode

Total characters95724
Distinct characters396
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique359 ?
Unique (%)8.9%

Sample

1st row이 강의의 운영(수업공지,자료전달, 환경설정, 출석관리)은 적절하였습니까? [수업공지]
2nd row이 강의의 운영(수업공지,자료전달, 환경설정, 출석관리)은 적절하였습니까? [자료전달]
3rd row이 강의의 운영(수업공지,자료전달, 환경설정, 출석관리)은 적절하였습니까? [환경설정]
4th row이 강의의 운영(수업공지,자료전달, 환경설정, 출석관리)은 적절하였습니까? [출석관리]
5th row 희망하는 강의 범위를 모두 선택하여 주십시오.
ValueCountFrequency (%)
만족도 1206
 
5.2%
903
 
3.9%
주십시오 751
 
3.3%
강사 750
 
3.2%
귀하의 695
 
3.0%
676
 
2.9%
대한 633
 
2.7%
선택하여 559
 
2.4%
454
 
2.0%
향상에 447
 
1.9%
Other values (962) 16021
69.4%
2023-12-12T15:14:15.497997image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
19185
 
20.0%
3119
 
3.3%
1917
 
2.0%
1905
 
2.0%
1774
 
1.9%
1690
 
1.8%
1520
 
1.6%
1501
 
1.6%
1475
 
1.5%
1431
 
1.5%
Other values (386) 60207
62.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 67646
70.7%
Space Separator 19185
 
20.0%
Other Punctuation 3740
 
3.9%
Decimal Number 2656
 
2.8%
Uppercase Letter 989
 
1.0%
Close Punctuation 607
 
0.6%
Open Punctuation 607
 
0.6%
Lowercase Letter 241
 
0.3%
Dash Punctuation 42
 
< 0.1%
Other Symbol 6
 
< 0.1%
Other values (2) 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3119
 
4.6%
1917
 
2.8%
1905
 
2.8%
1774
 
2.6%
1690
 
2.5%
1520
 
2.2%
1501
 
2.2%
1475
 
2.2%
1431
 
2.1%
1420
 
2.1%
Other values (326) 49894
73.8%
Lowercase Letter
ValueCountFrequency (%)
n 39
16.2%
e 29
12.0%
s 27
11.2%
t 23
9.5%
i 20
8.3%
a 16
6.6%
r 14
 
5.8%
o 14
 
5.8%
y 11
 
4.6%
g 9
 
3.7%
Other values (9) 39
16.2%
Uppercase Letter
ValueCountFrequency (%)
P 876
88.6%
I 26
 
2.6%
A 25
 
2.5%
B 18
 
1.8%
M 9
 
0.9%
E 8
 
0.8%
S 6
 
0.6%
T 6
 
0.6%
Q 6
 
0.6%
K 2
 
0.2%
Other values (5) 7
 
0.7%
Decimal Number
ValueCountFrequency (%)
1 656
24.7%
0 526
19.8%
2 272
10.2%
8 207
 
7.8%
4 198
 
7.5%
5 172
 
6.5%
3 170
 
6.4%
6 160
 
6.0%
9 152
 
5.7%
7 143
 
5.4%
Other Punctuation
ValueCountFrequency (%)
? 970
25.9%
. 967
25.9%
: 861
23.0%
, 689
18.4%
· 240
 
6.4%
/ 9
 
0.2%
& 4
 
0.1%
Close Punctuation
ValueCountFrequency (%)
] 450
74.1%
) 157
 
25.9%
Open Punctuation
ValueCountFrequency (%)
[ 450
74.1%
( 157
 
25.9%
Space Separator
ValueCountFrequency (%)
19185
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 42
100.0%
Other Symbol
ValueCountFrequency (%)
6
100.0%
Math Symbol
ValueCountFrequency (%)
> 3
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 67646
70.7%
Common 26848
 
28.0%
Latin 1230
 
1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3119
 
4.6%
1917
 
2.8%
1905
 
2.8%
1774
 
2.6%
1690
 
2.5%
1520
 
2.2%
1501
 
2.2%
1475
 
2.2%
1431
 
2.1%
1420
 
2.1%
Other values (326) 49894
73.8%
Latin
ValueCountFrequency (%)
P 876
71.2%
n 39
 
3.2%
e 29
 
2.4%
s 27
 
2.2%
I 26
 
2.1%
A 25
 
2.0%
t 23
 
1.9%
i 20
 
1.6%
B 18
 
1.5%
a 16
 
1.3%
Other values (24) 131
 
10.7%
Common
ValueCountFrequency (%)
19185
71.5%
? 970
 
3.6%
. 967
 
3.6%
: 861
 
3.2%
, 689
 
2.6%
1 656
 
2.4%
0 526
 
2.0%
] 450
 
1.7%
[ 450
 
1.7%
2 272
 
1.0%
Other values (16) 1822
 
6.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 67646
70.7%
ASCII 27832
29.1%
None 240
 
0.3%
Misc Symbols 6
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
19185
68.9%
? 970
 
3.5%
. 967
 
3.5%
P 876
 
3.1%
: 861
 
3.1%
, 689
 
2.5%
1 656
 
2.4%
0 526
 
1.9%
] 450
 
1.6%
[ 450
 
1.6%
Other values (48) 2202
 
7.9%
Hangul
ValueCountFrequency (%)
3119
 
4.6%
1917
 
2.8%
1905
 
2.8%
1774
 
2.6%
1690
 
2.5%
1520
 
2.2%
1501
 
2.2%
1475
 
2.2%
1431
 
2.1%
1420
 
2.1%
Other values (326) 49894
73.8%
None
ValueCountFrequency (%)
· 240
100.0%
Misc Symbols
ValueCountFrequency (%)
6
100.0%

설문유형
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size31.7 KiB
10점 척도형
1244 
5점 척도형
883 
객관형
868 
서술형
340 
10점 척도형 단답형
271 
Other values (8)
429 

Length

Max length11
Median length7
Mean length5.4356877
Min length3

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row객관형
2nd row객관형
3rd row객관형
4th row객관형
5th row객관형

Common Values

ValueCountFrequency (%)
10점 척도형 1244
30.8%
5점 척도형 883
21.9%
객관형 868
21.5%
서술형 340
 
8.4%
10점 척도형 단답형 271
 
6.7%
주관식 229
 
5.7%
객관식 147
 
3.6%
선택형 25
 
0.6%
척도형 13
 
0.3%
4점 척도형 10
 
0.2%
Other values (3) 5
 
0.1%

Length

2023-12-12T15:14:15.699475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
척도형 2421
36.1%
10점 1515
22.6%
5점 883
 
13.1%
객관형 868
 
12.9%
서술형 340
 
5.1%
단답형 274
 
4.1%
주관식 229
 
3.4%
객관식 147
 
2.2%
선택형 26
 
0.4%
4점 10
 
0.1%
Other values (2) 2
 
< 0.1%

응답필수여부
Boolean

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size4.1 KiB
True
3680 
False
 
355
ValueCountFrequency (%)
True 3680
91.2%
False 355
 
8.8%
2023-12-12T15:14:15.861035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Interactions

2023-12-12T15:14:10.776802image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T15:14:15.949686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
문항번호설문유형응답필수여부
문항번호1.0000.4100.367
설문유형0.4101.0000.980
응답필수여부0.3670.9801.000
2023-12-12T15:14:16.063449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
설문유형응답필수여부
설문유형1.0000.879
응답필수여부0.8791.000
2023-12-12T15:14:16.192642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
문항번호설문유형응답필수여부
문항번호1.0000.1840.275
설문유형0.1841.0000.879
응답필수여부0.2750.8791.000

Missing values

2023-12-12T15:14:10.896501image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:14:11.010352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

설문관리번호과정번호문항관리번호문항번호설문문항설문유형응답필수여부
0R068S079Q00015이 강의의 운영(수업공지,자료전달, 환경설정, 출석관리)은 적절하였습니까? [수업공지]객관형Y
1R068S079Q00026이 강의의 운영(수업공지,자료전달, 환경설정, 출석관리)은 적절하였습니까? [자료전달]객관형Y
2R068S079Q00037이 강의의 운영(수업공지,자료전달, 환경설정, 출석관리)은 적절하였습니까? [환경설정]객관형Y
3R068S079Q00048이 강의의 운영(수업공지,자료전달, 환경설정, 출석관리)은 적절하였습니까? [출석관리]객관형Y
4R069S080Q000524희망하는 강의 범위를 모두 선택하여 주십시오.객관형Y
5R004S004Q000611교육과정에 대한 개선의견서술형N
6R018S019Q000612교육과정에 대한 개선의견서술형N
7R019S020Q000613교육과정에 대한 개선의견서술형N
8R020S021Q000613교육과정에 대한 개선의견서술형N
9R021S023Q000610교육과정에 대한 개선의견서술형N
설문관리번호과정번호문항관리번호문항번호설문문항설문유형응답필수여부
4025R067S079Q065617다음 PM 업무 수행을 위한 요구역량 중 실습을 진행해 보고 싶은 분야를 한가지만 선택하여 주십시오.객관형Y
4026R010S011Q06579본 교육과정에 대한 의견 및 바라는 점이 있다면 자유롭게 말씀해 주시기 바랍니다.서술형N
4027R013S014Q06577본 교육과정에 대한 의견 및 바라는 점이 있다면 자유롭게 말씀해 주시기 바랍니다.서술형N
4028R034S043Q06579본 교육과정에 대한 의견 및 바라는 점이 있다면 자유롭게 말씀해 주시기 바랍니다.서술형N
4029R052S063Q065713본 교육과정에 대한 의견 및 바라는 점이 있다면 자유롭게 말씀해 주시기 바랍니다.서술형N
4030R074S084Q06577본 교육과정에 대한 의견 및 바라는 점이 있다면 자유롭게 말씀해 주시기 바랍니다.서술형N
4031R082S094Q06588개발범위 정의를 위한 요구사항 관련 프로세스를 순서대로 설명할 수 있다.5점 척도형Y
4032R085S095Q06589개발범위 정의를 위한 요구사항 관련 프로세스를 순서대로 설명할 수 있다.5점 척도형Y
4033R083S094Q06598비즈니스 프로세스 모델링을 위해 활용할 수 있는 모델링 기법을 2개 이상 기술하시오.객관형Y
4034R083S094Q066092003년 설립된 IIBA(International Institute of Business Analysis)에서 제공하는 비즈니스 분석 영역의 지식체계 프레임워크는 무엇인가?객관형Y