Overview

Dataset statistics

Number of variables: 4
Number of observations: 295
Missing cells: 1
Missing cells (%): 0.1%
Duplicate rows: 55
Duplicate rows (%): 18.6%
Total size in memory: 9.3 KiB
Average record size in memory: 32.4 B
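
The two percentages above can be checked from the raw counts. The following is a minimal sketch, assuming (the report does not state this) that the missing-cell share is taken over all cells, i.e. observations times variables, and the duplicate share over all observations. The helper name `pct` is illustrative only.

```python
def pct(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)


n_vars, n_obs = 4, 295
missing_cells, duplicate_rows = 1, 55

# Assumed denominators: all cells for missing values, all rows for duplicates.
missing_pct = pct(missing_cells, n_vars * n_obs)
duplicate_pct = pct(duplicate_rows, n_obs)
print(missing_pct, duplicate_pct)
```

Under these assumptions both results agree with the figures reported above after rounding.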

Variable types

Categorical: 3
Text: 1

Dataset

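Description: One-time dataset on commissioned training provided to Gwangju Metropolitan City public officials (telephone, online, and language institute courses). Main fields: course name, training period, training institution, number of participants.
Author: 광주광역시 (Gwangju Metropolitan City)
URL: https://www.data.go.kr/data/15118935/fileData.do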

Alerts

Dataset has 55 (18.6%) duplicate rows [Duplicates]
수업기간 is highly overall correlated with 위탁교육구분 and 1 other field [High correlation]
위탁교육구분 is highly overall correlated with 과정구분 and 1 other field [High correlation]
과정구분 is highly overall correlated with 위탁교육구분 and 1 other field [High correlation]

Reproduction

Analysis started: 2024-03-15 00:58:00.112330
Analysis finished: 2024-03-15 00:58:01.015793
Duration: 0.9 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

위탁교육구분
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

Value                   Count
온라인어학              206
전화외국어              60
관내 대학교 언어교육원  29

Length

Max length: 12
Median length: 5
Mean length: 5.6881356
Min length: 5
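
The length statistics can be reproduced from the category counts listed for this variable. The sketch below assumes that length is measured in Unicode code points of precomposed Hangul (what Python's `len` returns for these strings); `mean_length` is an illustrative helper, not part of any profiling library.

```python
# Category counts copied from the report for 위탁교육구분.
counts = {"온라인어학": 206, "전화외국어": 60, "관내 대학교 언어교육원": 29}


def mean_length(value_counts: dict) -> float:
    """Count-weighted mean of len(value), measured in code points."""
    total = sum(value_counts.values())
    return sum(len(value) * n for value, n in value_counts.items()) / total


longest = max(len(value) for value in counts)
shortest = min(len(value) for value in counts)
print(shortest, longest, mean_length(counts))
```

The weighted mean works out to 1678 / 295, which matches the reported mean length to the precision shown.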

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 전화외국어
2nd row: 전화외국어
3rd row: 전화외국어
4th row: 전화외국어
5th row: 전화외국어

Common Values

Value                   Count  Frequency (%)
온라인어학              206    69.8%
전화외국어              60     20.3%
관내 대학교 언어교육원  29     9.8%

Length

Histogram of lengths of the category

Common Values (Plot)

Value       Count  Frequency (%)
온라인어학  206    58.4%
전화외국어  60     17.0%
관내        29     8.2%
대학교      29     8.2%
언어교육원  29     8.2%
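
The percentages in this table differ from those under Common Values because the plot counts whitespace-separated words rather than whole category values, so the three-word category contributes three tokens per row. The following sketch reproduces the figures under that assumption; `word_frequencies` is an illustrative name, and the exact tokenizer used by the profiling tool is not shown in the report.

```python
from collections import Counter

# Category counts copied from the Common Values table for 위탁교육구분.
counts = {"온라인어학": 206, "전화외국어": 60, "관내 대학교 언어교육원": 29}


def word_frequencies(value_counts: dict) -> dict:
    """Split each value on whitespace and return each word's share in percent."""
    words = Counter()
    for value, n in value_counts.items():
        for word in value.split():
            words[word] += n
    total = sum(words.values())
    return {word: round(100 * c / total, 1) for word, c in words.items()}


freq = word_frequencies(counts)
print(freq)
```

With 206 + 60 + 3 × 29 = 353 tokens in total, the shares agree with the table above.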

과정구분
Categorical

HIGH CORRELATION 

Distinct: 7
Distinct (%): 2.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

Value             Count
영어              193
전화영어          47
일본어            32
전화일본어        9
중국어            7
Other values (2)  7

Length

Max length: 6
Median length: 2
Mean length: 2.6237288
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 전화영어
2nd row: 전화영어
3rd row: 전화영어
4th row: 전화영어
5th row: 전화영어

Common Values

Value        Count  Frequency (%)
영어         193    65.4%
전화영어     47     15.9%
일본어       32     10.8%
전화일본어   9      3.1%
중국어       7      2.4%
전화중국어   4      1.4%
기타 외국어  3      1.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value       Count  Frequency (%)
영어        193    64.8%
전화영어    47     15.8%
일본어      32     10.7%
전화일본어  9      3.0%
중국어      7      2.3%
전화중국어  4      1.3%
기타        3      1.0%
외국어      3      1.0%

강좌명
Text

Distinct: 142
Distinct (%): 48.3%
Missing: 1
Missing (%): 0.3%
Memory size: 2.4 KiB

Length

Max length: 60
Median length: 41
Mean length: 22.295918
Min length: 9

Characters and Unicode

Total characters: 6555
Distinct characters: 302
Distinct categories: 10
Distinct scripts: 5
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 81
Unique (%): 27.6%

Sample

1st row: [60Days Master] pre-Basic English > Book.01
2nd row: Level 2 > A
3rd row: Daily Talk English > Book.01
4th row: Conversation by topic > Pre-intermediate
5th row: The Debate Club > Topics 1
Value               Count  Frequency (%)
[not preserved]     81     5.8%
1/2                 44     3.1%
ets                 40     2.9%
toeic               39     2.8%
english             38     2.7%
ybm                 30     2.1%
1                   26     1.9%
전남대학교          25     1.8%
초간단              24     1.7%
영어회화            22     1.6%
Other values (309)  1031   73.6%

Most occurring characters

Value               Count  Frequency (%)
(space)             1107   16.9%
[not preserved]     159    2.4%
0                   155    2.4%
e                   148    2.3%
E                   132    2.0%
1                   130    2.0%
i                   129    2.0%
2                   124    1.9%
s                   118    1.8%
T                   114    1.7%
Other values (292)  4239   64.7%

Most occurring categories

Value              Count  Frequency (%)
Other Letter       2335   35.6%
Lowercase Letter   1243   19.0%
Space Separator    1107   16.9%
Uppercase Letter   771    11.8%
Decimal Number     543    8.3%
Other Punctuation  157    2.4%
Open Punctuation   130    2.0%
Close Punctuation  130    2.0%
Math Symbol        98     1.5%
Dash Punctuation   41     0.6%

Most frequent character per category

Other Letter
Value               Count  Frequency (%)
[not preserved]     159    6.8%
[not preserved]     91     3.9%
[not preserved]     81     3.5%
[not preserved]     80     3.4%
[not preserved]     58     2.5%
[not preserved]     57     2.4%
[not preserved]     56     2.4%
[not preserved]     51     2.2%
[not preserved]     44     1.9%
[not preserved]     43     1.8%
Other values (222)  1615   69.2%

Lowercase Letter
Value              Count  Frequency (%)
e                  148    11.9%
i                  129    10.4%
s                  118    9.5%
a                  98     7.9%
n                  95     7.6%
o                  80     6.4%
l                  77     6.2%
r                  75     6.0%
t                  74     6.0%
h                  54     4.3%
Other values (13)  295    23.7%

Uppercase Letter
Value              Count  Frequency (%)
E                  132    17.1%
T                  114    14.8%
C                  73     9.5%
S                  63     8.2%
B                  62     8.0%
I                  52     6.7%
O                  46     6.0%
M                  44     5.7%
N                  36     4.7%
Y                  30     3.9%
Other values (13)  119    15.4%

Decimal Number
Value  Count  Frequency (%)
0      155    28.5%
1      130    23.9%
2      124    22.8%
5      50     9.2%
3      21     3.9%
9      20     3.7%
6      18     3.3%
8      12     2.2%
7      8      1.5%
4      5      0.9%

Other Punctuation
Value  Count  Frequency (%)
/      74     47.1%
?      40     25.5%
.      22     14.0%
&      9      5.7%
!      9      5.7%
,      3      1.9%

Open Punctuation
Value  Count  Frequency (%)
(      89     68.5%
[      41     31.5%

Close Punctuation
Value  Count  Frequency (%)
)      89     68.5%
]      41     31.5%

Math Symbol
Value  Count  Frequency (%)
>      60     61.2%
+      38     38.8%

Space Separator
Value    Count  Frequency (%)
(space)  1107   100.0%

Dash Punctuation
Value  Count  Frequency (%)
-      41     100.0%

Most occurring scripts

Value     Count  Frequency (%)
Hangul    2327   35.5%
Common    2206   33.7%
Latin     2014   30.7%
Hiragana  4      0.1%
Han       4      0.1%

Most frequent character per script

Hangul
Value               Count  Frequency (%)
[not preserved]     159    6.8%
[not preserved]     91     3.9%
[not preserved]     81     3.5%
[not preserved]     80     3.4%
[not preserved]     58     2.5%
[not preserved]     57     2.4%
[not preserved]     56     2.4%
[not preserved]     51     2.2%
[not preserved]     44     1.9%
[not preserved]     43     1.8%
Other values (218)  1607   69.1%

Latin
Value              Count  Frequency (%)
e                  148    7.3%
E                  132    6.6%
i                  129    6.4%
s                  118    5.9%
T                  114    5.7%
a                  98     4.9%
n                  95     4.7%
o                  80     4.0%
l                  77     3.8%
r                  75     3.7%
Other values (36)  948    47.1%

Common
Value              Count  Frequency (%)
(space)            1107   50.2%
0                  155    7.0%
1                  130    5.9%
2                  124    5.6%
(                  89     4.0%
)                  89     4.0%
/                  74     3.4%
>                  60     2.7%
5                  50     2.3%
]                  41     1.9%
Other values (14)  287    13.0%

Hiragana
Value            Count  Frequency (%)
[not preserved]  2      50.0%
[not preserved]  2      50.0%

Han
Value            Count  Frequency (%)
[not preserved]  2      50.0%
[not preserved]  2      50.0%

Most occurring blocks

Value     Count  Frequency (%)
ASCII     4220   64.4%
Hangul    2327   35.5%
Hiragana  4      0.1%
CJK       4      0.1%

Most frequent character per block

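ASCII
Value              Count  Frequency (%)
(space)            1107   26.2%
0                  155    3.7%
e                  148    3.5%
E                  132    3.1%
1                  130    3.1%
i                  129    3.1%
2                  124    2.9%
s                  118    2.8%
T                  114    2.7%
a                  98     2.3%
Other values (60)  1965   46.6%

Hangul
Value               Count  Frequency (%)
[not preserved]     159    6.8%
[not preserved]     91     3.9%
[not preserved]     81     3.5%
[not preserved]     80     3.4%
[not preserved]     58     2.5%
[not preserved]     57     2.4%
[not preserved]     56     2.4%
[not preserved]     51     2.2%
[not preserved]     44     1.9%
[not preserved]     43     1.8%
Other values (218)  1607   69.1%

Hiragana
Value            Count  Frequency (%)
[not preserved]  2      50.0%
[not preserved]  2      50.0%

CJK
Value            Count  Frequency (%)
[not preserved]  2      50.0%
[not preserved]  2      50.0%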

수업기간
Categorical

HIGH CORRELATION 

Distinct: 13
Distinct (%): 4.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

Value                    Count
2023-03-01~2023-03-31    71
2023-04-01~2023-04-30    48
2023-03-13 ~ 2023-06-07  45
2023-05-01~2023-05-31    44
2023-06-01~2023-06-30    43
Other values (8)         44

Length

Max length: 23
Median length: 21
Mean length: 19.735593
Min length: 4

Unique

Unique: 4
Unique (%): 1.4%

Sample

1st row: 2023-03-15 ~ 2023-06-09
2nd row: 2023-03-14 ~ 2023-06-08
3rd row: 2023-03-13 ~ 2023-06-09
4th row: 2023-03-14 ~ 2023-06-08
5th row: 2023-03-14 ~ 2023-06-08

Common Values

Value                    Count  Frequency (%)
2023-03-01~2023-03-31    71     24.1%
2023-04-01~2023-04-30    48     16.3%
2023-03-13 ~ 2023-06-07  45     15.3%
2023-05-01~2023-05-31    44     14.9%
2023-06-01~2023-06-30    43     14.6%
<NA>                     29     9.8%
2023-03-13 ~ 2023-06-09  5      1.7%
2023-03-14 ~ 2023-06-08  3      1.0%
2023-03-13 ~ 2023-06-08  3      1.0%
2023-03-15 ~ 2023-06-09  1      0.3%
Other values (3)         3      1.0%
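
The values of 수업기간 are written in two styles, with and without spaces around `~`, and the literal text `<NA>` appears 29 times even though the variable reports 0 missing values and a minimum length of 4, which suggests it is stored as an ordinary string. The sketch below shows one way such values could be normalized into start and end dates before analysis. The function name `parse_period` and the choice to return `None` for non-ranges are assumptions made for illustration.

```python
import re
from datetime import date

# Two ISO dates separated by "~", with optional whitespace around it.
RANGE = re.compile(r"^\s*(\d{4}-\d{2}-\d{2})\s*~\s*(\d{4}-\d{2}-\d{2})\s*$")


def parse_period(text: str):
    """Return (start, end) as dates, or None if text is not a date range."""
    match = RANGE.match(text)
    if match is None:
        return None  # e.g. the literal "<NA>" placeholder
    start, end = (date.fromisoformat(part) for part in match.groups())
    return start, end


print(parse_period("2023-03-01~2023-03-31"))
print(parse_period("2023-03-13 ~ 2023-06-07"))
print(parse_period("<NA>"))
```

Parsing both spellings into the same representation would also let a period be treated as two date fields rather than one free-form category.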

Length

Histogram of lengths of the category

Common Values (Plot)

Value                  Count  Frequency (%)
2023-03-01~2023-03-31  71     17.1%
[not preserved]        60     14.5%
2023-03-13             54     13.0%
2023-04-01~2023-04-30  48     11.6%
2023-06-07             45     10.8%
2023-05-01~2023-05-31  44     10.6%
2023-06-01~2023-06-30  43     10.4%
na                     29     7.0%
2023-06-09             6      1.4%
2023-06-08             6      1.4%
Other values (5)       9      2.2%

Correlations

              위탁교육구분  과정구분  수업기간
위탁교육구분  1.000         0.770     1.000
과정구분      0.770         1.000     0.801
수업기간      1.000         0.801     1.000

              수업기간  위탁교육구분  과정구분
수업기간      1.000     0.981         0.554
위탁교육구분  0.981     1.000         0.704
과정구분      0.554     0.704         1.000

              위탁교육구분  과정구분  수업기간
위탁교육구분  1.000         0.704     0.981
과정구분      0.704         1.000     0.554
수업기간      0.981         0.554     1.000
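
The report does not label which coefficient each matrix shows. For pairs of categorical variables, Cramér's V is one commonly used measure of association on a 0 to 1 scale. The sketch below assumes that measure in its plain, uncorrected form, so it is not guaranteed to reproduce the values above; `cramers_v` is an illustrative name.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys) -> float:
    """Uncorrected Cramér's V for two equal-length sequences of labels."""
    n = len(xs)
    row = Counter(xs)
    col = Counter(ys)
    joint = Counter(zip(xs, ys))

    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for r, row_total in row.items():
        for c, col_total in col.items():
            expected = row_total * col_total / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(row), len(col)) - 1
    if k == 0:
        return 0.0  # convention chosen here for a constant column
    return sqrt(chi2 / (n * k))


perfect = cramers_v(["a", "a", "b", "b"], ["x", "x", "y", "y"])
unrelated = cramers_v(["a", "a", "b", "b"], ["x", "y", "x", "y"])
print(perfect, unrelated)
```

A value of 1 means one variable fully determines the other. That would be consistent with 수업기간 and 위탁교육구분 scoring 1.000 in the first matrix if each training period appears under only one training type.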

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Index  위탁교육구분  과정구분  강좌명  수업기간
0  전화외국어  전화영어  [60Days Master] pre-Basic English > Book.01  2023-03-15 ~ 2023-06-09
1  전화외국어  전화영어  Level 2 > A  2023-03-14 ~ 2023-06-08
2  전화외국어  전화영어  Daily Talk English > Book.01  2023-03-13 ~ 2023-06-09
3  전화외국어  전화영어  Conversation by topic > Pre-intermediate  2023-03-14 ~ 2023-06-08
4  전화외국어  전화영어  The Debate Club > Topics 1  2023-03-14 ~ 2023-06-08
5  전화외국어  전화영어  Intermediate(edit) > Intermediate IELTs - Book.1  2023-03-13 ~ 2023-06-07
6  전화외국어  전화영어  Travel > Travel 2  2023-03-13 ~ 2023-06-07
7  전화외국어  전화영어  Level 2 > A  2023-03-13 ~ 2023-06-07
8  전화외국어  전화영어  [60Days Master] pre-Basic English > Book.01  2023-03-13 ~ 2023-06-07
9  전화외국어  전화영어  [60Days Master] pre-Basic English > Book.01  2023-03-13 ~ 2023-06-07

Index  위탁교육구분  과정구분  강좌명  수업기간
285  관내 대학교 언어교육원  영어  전남대학교 영어회화(온라인 화상교육)  <NA>
286  관내 대학교 언어교육원  영어  호남대학교 스피킹중급(상)  <NA>
287  관내 대학교 언어교육원  영어  전남대학교 Essay Writing  <NA>
288  관내 대학교 언어교육원  영어  전남대학교 온라인 영어회화  <NA>
289  관내 대학교 언어교육원  영어  전남대학교 영어회화  <NA>
290  관내 대학교 언어교육원  영어  전남대학교 영어회화  <NA>
291  관내 대학교 언어교육원  영어  전남대학교 온라인 영어회화  <NA>
292  관내 대학교 언어교육원  영어  전남대학교 The Art of Speaking  <NA>
293  관내 대학교 언어교육원  영어  전남대학교 온라인 영작문  <NA>
294  관내 대학교 언어교육원  영어  조선대학교 영어회화인터뷰  <NA>

Duplicate rows

Most frequently occurring

Index  위탁교육구분  과정구분  강좌명  수업기간  # duplicates
21  온라인어학  영어  YBM 초간단 여행영어  2023-03-01~2023-03-31  8
11  온라인어학  영어  ETS TOEIC? 단기공략 550+ (1/2)  2023-03-01~2023-03-31  5
50  전화외국어  전화영어  [60Days Master] pre-Basic English > Book.01  2023-03-13 ~ 2023-06-07  5
2  관내 대학교 언어교육원  영어  전남대학교 영어회화  <NA>  4
3  관내 대학교 언어교육원  영어  전남대학교 온라인 영어회화  <NA>  4
46  전화외국어  전화영어  Level 2 > A  2023-03-13 ~ 2023-06-07  4
52  전화외국어  전화영어  두근두근 영어회화 > 초급  2023-03-13 ~ 2023-06-07  4
0  관내 대학교 언어교육원  영어  전남대학교 Essay Writing  <NA>  3
7  온라인어학  영어  10분컷 회화-회사&일상 편  2023-05-01~2023-05-31  3
15  온라인어학  영어  ETS TOEIC? 단기공략 650+ (1/2)  2023-03-01~2023-03-31  3
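
A table like the one above can be derived by grouping identical rows and keeping the groups that occur more than once. The sketch below uses three illustrative rows built from values in this report, not the full dataset, and `duplicate_groups` is a hypothetical helper. Whether the report's figure of 55 counts distinct duplicated rows or all repeated occurrences is not stated here, so the example prints both quantities.

```python
from collections import Counter


def duplicate_groups(rows) -> dict:
    """Map each row that occurs more than once to its number of occurrences."""
    counts = Counter(tuple(row) for row in rows)
    return {row: n for row, n in counts.items() if n > 1}


# Illustrative rows only; the real dataset has 295 observations.
rows = [
    ("온라인어학", "영어", "YBM 초간단 여행영어", "2023-03-01~2023-03-31"),
    ("온라인어학", "영어", "YBM 초간단 여행영어", "2023-03-01~2023-03-31"),
    ("전화외국어", "전화영어", "Level 2 > A", "2023-03-13 ~ 2023-06-07"),
]

groups = duplicate_groups(rows)
n_distinct_duplicated = len(groups)        # distinct rows that repeat
n_rows_in_groups = sum(groups.values())    # all occurrences of those rows
print(n_distinct_duplicated, n_rows_in_groups)
```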