Overview

Dataset statistics

Number of variables6
Number of observations35
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.8 KiB
Average record size in memory53.8 B

Variable types

Categorical2
Text1
DateTime2
Numeric1

Dataset

Description경남 지역 9개소 여성새로일하기센터에서 진행되는 총 40개 과정의 맞춤형 직업교육훈련과정으로, 직업전문·직무소양·취업준비 등 취업이 용이한 여성교육강좌
Author경상남도
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15048088

Alerts

모집인원 is highly imbalanced (76.5%)Imbalance
훈련과정명 has unique valuesUnique

Reproduction

Analysis started2023-12-11 00:22:24.672535
Analysis finished2023-12-11 00:22:25.219544
Duration0.55 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

센터명
Categorical

Distinct9
Distinct (%)25.7%
Missing0
Missing (%)0.0%
Memory size412.0 B
경남여성새로일하기센터(산단)
마산여성새로일하기센터
창원여성새로일하기센터
김해여성새로일하기센터
양산여성새로일하기센터
Other values (4)
11 

Length

Max length15
Median length11
Mean length12.057143
Min length11

Unique

Unique1 ?
Unique (%)2.9%

Sample

1st row거제여성새로일하기센터
2nd row거제여성새로일하기센터
3rd row거제여성새로일하기센터
4th row경남여성새로일하기센터(산단)
5th row경남여성새로일하기센터(산단)

Common Values

ValueCountFrequency (%)
경남여성새로일하기센터(산단) 6
17.1%
마산여성새로일하기센터 5
14.3%
창원여성새로일하기센터 5
14.3%
김해여성새로일하기센터 4
11.4%
양산여성새로일하기센터 4
11.4%
진주여성새로일하기센터 4
11.4%
거제여성새로일하기센터 3
8.6%
김해시동부여성새로일하기센터 3
8.6%
김해시동부여성새로일하기센터 1
 
2.9%

Length

2023-12-11T09:22:25.291508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:22:25.426881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경남여성새로일하기센터(산단 6
17.1%
마산여성새로일하기센터 5
14.3%
창원여성새로일하기센터 5
14.3%
김해여성새로일하기센터 4
11.4%
양산여성새로일하기센터 4
11.4%
진주여성새로일하기센터 4
11.4%
김해시동부여성새로일하기센터 4
11.4%
거제여성새로일하기센터 3
8.6%

훈련과정명
Text

UNIQUE 

Distinct35
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size412.0 B
2023-12-11T09:22:25.676834image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length18
Mean length14.057143
Min length7

Characters and Unicode

Total characters492
Distinct characters133
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique35 ?
Unique (%)100.0%

Sample

1st row미래농업 융복합 창업가 양성과정
2nd row메이커 융합 크리에이티브 전문가 양성과정
3rd row유휴사회복지사 재취업 양성과정
4th row방위산업분야 기술인력 양성과정(1)
5th row방위산업분야 기술인력 양성과정(2)
ValueCountFrequency (%)
양성과정 11
 
12.8%
양성 5
 
5.8%
전문가 3
 
3.5%
맞춤형 2
 
2.3%
방위산업분야 2
 
2.3%
기술인력 2
 
2.3%
창업 2
 
2.3%
미래농업 1
 
1.2%
전산세무회계사무원양성 1
 
1.2%
커스텀 1
 
1.2%
Other values (56) 56
65.1%
2023-12-11T09:22:26.099066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
52
 
10.6%
23
 
4.7%
22
 
4.5%
22
 
4.5%
20
 
4.1%
18
 
3.7%
15
 
3.0%
13
 
2.6%
12
 
2.4%
9
 
1.8%
Other values (123) 286
58.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 421
85.6%
Space Separator 52
 
10.6%
Uppercase Letter 7
 
1.4%
Open Punctuation 3
 
0.6%
Close Punctuation 3
 
0.6%
Other Punctuation 2
 
0.4%
Decimal Number 2
 
0.4%
Math Symbol 1
 
0.2%
Lowercase Letter 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
23
 
5.5%
22
 
5.2%
22
 
5.2%
20
 
4.8%
18
 
4.3%
15
 
3.6%
13
 
3.1%
12
 
2.9%
9
 
2.1%
8
 
1.9%
Other values (108) 259
61.5%
Uppercase Letter
ValueCountFrequency (%)
S 2
28.6%
E 1
14.3%
R 1
14.3%
P 1
14.3%
N 1
14.3%
K 1
14.3%
Other Punctuation
ValueCountFrequency (%)
· 1
50.0%
& 1
50.0%
Decimal Number
ValueCountFrequency (%)
2 1
50.0%
1 1
50.0%
Space Separator
ValueCountFrequency (%)
52
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 421
85.6%
Common 63
 
12.8%
Latin 8
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
23
 
5.5%
22
 
5.2%
22
 
5.2%
20
 
4.8%
18
 
4.3%
15
 
3.6%
13
 
3.1%
12
 
2.9%
9
 
2.1%
8
 
1.9%
Other values (108) 259
61.5%
Common
ValueCountFrequency (%)
52
82.5%
( 3
 
4.8%
) 3
 
4.8%
+ 1
 
1.6%
· 1
 
1.6%
2 1
 
1.6%
1 1
 
1.6%
& 1
 
1.6%
Latin
ValueCountFrequency (%)
S 2
25.0%
E 1
12.5%
R 1
12.5%
P 1
12.5%
N 1
12.5%
e 1
12.5%
K 1
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 421
85.6%
ASCII 70
 
14.2%
None 1
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
52
74.3%
( 3
 
4.3%
) 3
 
4.3%
S 2
 
2.9%
E 1
 
1.4%
R 1
 
1.4%
P 1
 
1.4%
+ 1
 
1.4%
N 1
 
1.4%
e 1
 
1.4%
Other values (4) 4
 
5.7%
Hangul
ValueCountFrequency (%)
23
 
5.5%
22
 
5.2%
22
 
5.2%
20
 
4.8%
18
 
4.3%
15
 
3.6%
13
 
3.1%
12
 
2.9%
9
 
2.1%
8
 
1.9%
Other values (108) 259
61.5%
None
ValueCountFrequency (%)
· 1
100.0%
Distinct26
Distinct (%)74.3%
Missing0
Missing (%)0.0%
Memory size412.0 B
Minimum2023-03-13 00:00:00
Maximum2023-10-04 00:00:00
2023-12-11T09:22:26.234272image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:22:26.359912image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=26)
Distinct25
Distinct (%)71.4%
Missing0
Missing (%)0.0%
Memory size412.0 B
Minimum2023-05-03 00:00:00
Maximum2023-11-27 00:00:00
2023-12-11T09:22:26.478400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:22:26.629381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)

교육시간
Real number (ℝ)

Distinct22
Distinct (%)62.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean186.85714
Minimum120
Maximum296
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size447.0 B
2023-12-11T09:22:26.749426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum120
5-th percentile142.8
Q1156
median180
Q3204
95-th percentile266
Maximum296
Range176
Interquartile range (IQR)48

Descriptive statistics

Standard deviation41.838726
Coefficient of variation (CV)0.22390755
Kurtosis0.38675273
Mean186.85714
Median Absolute Deviation (MAD)24
Skewness0.94013324
Sum6540
Variance1750.479
MonotonicityNot monotonic
2023-12-11T09:22:26.881657image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=22)
ValueCountFrequency (%)
180 4
 
11.4%
160 4
 
11.4%
156 3
 
8.6%
200 3
 
8.6%
208 2
 
5.7%
144 2
 
5.7%
184 2
 
5.7%
148 1
 
2.9%
260 1
 
2.9%
234 1
 
2.9%
Other values (12) 12
34.3%
ValueCountFrequency (%)
120 1
 
2.9%
140 1
 
2.9%
144 2
5.7%
146 1
 
2.9%
148 1
 
2.9%
152 1
 
2.9%
156 3
8.6%
160 4
11.4%
167 1
 
2.9%
180 4
11.4%
ValueCountFrequency (%)
296 1
 
2.9%
280 1
 
2.9%
260 1
 
2.9%
245 1
 
2.9%
240 1
 
2.9%
238 1
 
2.9%
234 1
 
2.9%
208 2
5.7%
200 3
8.6%
192 1
 
2.9%

모집인원
Categorical

IMBALANCE 

Distinct3
Distinct (%)8.6%
Missing0
Missing (%)0.0%
Memory size412.0 B
20
33 
22
 
1
16
 
1

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique2 ?
Unique (%)5.7%

Sample

1st row20
2nd row20
3rd row20
4th row20
5th row20

Common Values

ValueCountFrequency (%)
20 33
94.3%
22 1
 
2.9%
16 1
 
2.9%

Length

2023-12-11T09:22:27.000522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:22:27.094071image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
20 33
94.3%
22 1
 
2.9%
16 1
 
2.9%

Interactions

2023-12-11T09:22:24.944402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T09:22:27.159783image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
센터명훈련과정명교육시작일교육종료일교육시간모집인원
센터명1.0001.0000.6840.7910.4820.000
훈련과정명1.0001.0001.0001.0001.0001.000
교육시작일0.6841.0001.0000.8740.6151.000
교육종료일0.7911.0000.8741.0000.0000.000
교육시간0.4821.0000.6150.0001.0000.488
모집인원0.0001.0001.0000.0000.4881.000
2023-12-11T09:22:27.526625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
센터명모집인원
센터명1.0000.000
모집인원0.0001.000
2023-12-11T09:22:27.604357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
교육시간센터명모집인원
교육시간1.0000.2400.318
센터명0.2401.0000.000
모집인원0.3180.0001.000

Missing values

2023-12-11T09:22:25.060361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T09:22:25.180609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

센터명훈련과정명교육시작일교육종료일교육시간모집인원
0거제여성새로일하기센터미래농업 융복합 창업가 양성과정2023-03-132023-06-1614820
1거제여성새로일하기센터메이커 융합 크리에이티브 전문가 양성과정2023-04-122023-08-1114020
2거제여성새로일하기센터유휴사회복지사 재취업 양성과정2023-07-172023-10-1815220
3경남여성새로일하기센터(산단)방위산업분야 기술인력 양성과정(1)2023-03-272023-05-0315620
4경남여성새로일하기센터(산단)방위산업분야 기술인력 양성과정(2)2023-10-042023-11-1015620
5경남여성새로일하기센터(산단)e커머스 온라인홍보마케팅 과정2023-05-082023-07-3118020
6경남여성새로일하기센터(산단)품질검사 실무인력 양성과정2023-05-082023-07-3123820
7경남여성새로일하기센터(산단)중소기업 캐드회계사무원 양성과정2023-04-052023-07-2529620
8경남여성새로일하기센터(산단)미래형 농촌체험교육장 창업과정2023-08-082023-11-2724020
9김해시동부여성새로일하기센터온라인마케팅사무원 양성과정2023-05-122023-09-0624520
센터명훈련과정명교육시작일교육종료일교육시간모집인원
25양산여성새로일하기센터산모신생아건강관리사 양성2023-04-202023-06-0212020
26진주여성새로일하기센터사무디자인실무활용2023-04-172023-06-3020020
27진주여성새로일하기센터경리회계사무원 양성2023-04-172023-06-2820020
28진주여성새로일하기센터노인맞춤형 생활지원사2023-04-032023-06-1420020
29진주여성새로일하기센터캔들 토탈 공예 창업2023-03-272023-05-2316020
30창원여성새로일하기센터공동주택 경리실무 전문가 양성과정2023-05-022023-07-2723420
31창원여성새로일하기센터디지털(영상+SNS)마케팅전문가 양성과정2023-04-102023-07-1426016
32창원여성새로일하기센터온·오프라인 창업과 홍보마케팅 교육과정2023-04-052023-06-0216020
33창원여성새로일하기센터실버케어 맞춤형 전문가 양성과정2023-07-032023-09-1218020
34창원여성새로일하기센터재가노인시설 사회복지사 양성과정2023-04-172023-06-1616020