Overview

Dataset statistics

Number of variables16
Number of observations227
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory28.9 KiB
Average record size in memory130.6 B

Variable types

Categorical10
Numeric2
Text4

Dataset

Description한국보건복지인재원 보건산업분야 교육의 운영정보 및 계획 정보 제공 (과정명, 교육기간, 교육대상, 교육방식, 교육시간, 계획인원, 권역구분 등의 항목 제공 ) 단, 계획이므로 변동될 수 있음
Author한국보건복지인재원
URLhttps://www.data.go.kr/data/15086206/fileData.do

Alerts

교육대상 has constant value ""Constant
본부 has constant value ""Constant
분야 has constant value ""Constant
부서 is highly overall correlated with 교육방식 and 4 other fieldsHigh correlation
세부교육분야 is highly overall correlated with 교육재원 and 2 other fieldsHigh correlation
교육재원 is highly overall correlated with 교육방식 and 4 other fieldsHigh correlation
계획인원 is highly overall correlated with 교육시간High correlation
교육방식 is highly overall correlated with 교육재원 and 3 other fieldsHigh correlation
교육일수 is highly overall correlated with 교육방식 and 3 other fieldsHigh correlation
교육시간 is highly overall correlated with 계획인원 and 5 other fieldsHigh correlation
교육재원 is highly imbalanced (51.3%)Imbalance
교육일수 is highly imbalanced (52.2%)Imbalance

Reproduction

Analysis started2023-12-12 02:07:10.798208
Analysis finished2023-12-12 02:07:12.980560
Duration2.18 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

교육방식
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
라이브
167 
집합
45 
혼합
 
15

Length

Max length3
Median length3
Mean length2.7356828
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row혼합
2nd row혼합
3rd row혼합
4th row혼합
5th row라이브

Common Values

ValueCountFrequency (%)
라이브 167
73.6%
집합 45
 
19.8%
혼합 15
 
6.6%

Length

2023-12-12T11:07:13.055938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:07:13.188716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
라이브 167
73.6%
집합 45
 
19.8%
혼합 15
 
6.6%

교육재원
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
민경
203 
수탁
24 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row민경
2nd row민경
3rd row민경
4th row민경
5th row민경

Common Values

ValueCountFrequency (%)
민경 203
89.4%
수탁 24
 
10.6%

Length

2023-12-12T11:07:13.334283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:07:13.449968image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
민경 203
89.4%
수탁 24
 
10.6%

교육대상
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
민간
227 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row민간
2nd row민간
3rd row민간
4th row민간
5th row민간

Common Values

ValueCountFrequency (%)
민간 227
100.0%

Length

2023-12-12T11:07:13.593696image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:07:13.731849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
민간 227
100.0%

기수
Real number (ℝ)

Distinct10
Distinct (%)4.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.9779736
Minimum1
Maximum10
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.1 KiB
2023-12-12T11:07:13.873900image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q32
95-th percentile5
Maximum10
Range9
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.6467854
Coefficient of variation (CV)0.83256187
Kurtosis8.3991152
Mean1.9779736
Median Absolute Deviation (MAD)0
Skewness2.7223976
Sum449
Variance2.7119021
MonotonicityNot monotonic
2023-12-12T11:07:14.081810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
1 120
52.9%
2 63
27.8%
3 20
 
8.8%
4 8
 
3.5%
5 5
 
2.2%
6 3
 
1.3%
7 2
 
0.9%
8 2
 
0.9%
9 2
 
0.9%
10 2
 
0.9%
ValueCountFrequency (%)
1 120
52.9%
2 63
27.8%
3 20
 
8.8%
4 8
 
3.5%
5 5
 
2.2%
6 3
 
1.3%
7 2
 
0.9%
8 2
 
0.9%
9 2
 
0.9%
10 2
 
0.9%
ValueCountFrequency (%)
10 2
 
0.9%
9 2
 
0.9%
8 2
 
0.9%
7 2
 
0.9%
6 3
 
1.3%
5 5
 
2.2%
4 8
 
3.5%
3 20
 
8.8%
2 63
27.8%
1 120
52.9%
Distinct120
Distinct (%)52.9%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2023-12-12T11:07:14.402655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length38
Median length29
Mean length13.779736
Min length8

Characters and Unicode

Total characters3128
Distinct characters237
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique57 ?
Unique (%)25.1%

Sample

1st row바이오의약품개발입문과정
2nd row바이오의약품개발입문과정
3rd row보건산업분야일자리교육강사양성과정
4th row보건산업분야일자리교육강사양성과정
5th row보건산업일자리아카데미
ValueCountFrequency (%)
보건산업일자리아카데미 10
 
4.3%
글로벌헬스케어직업탐색과정 10
 
4.3%
국제진료간호사임상영어회화과정 6
 
2.6%
의료용어표준심화과정 5
 
2.2%
의약품후보물질발굴전문과정 5
 
2.2%
외국인환자유치사업역량강화과정 4
 
1.7%
의료기기입문마스터과정 4
 
1.7%
의료기기품질경영시스템)iso13485:2016)실습과정 4
 
1.7%
의약품gmp기본과정 3
 
1.3%
의약품qbd기본과정 3
 
1.3%
Other values (113) 176
76.5%
2023-12-12T11:07:14.950242image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
220
 
7.0%
215
 
6.9%
124
 
4.0%
105
 
3.4%
92
 
2.9%
77
 
2.5%
74
 
2.4%
64
 
2.0%
61
 
2.0%
56
 
1.8%
Other values (227) 2040
65.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2835
90.6%
Uppercase Letter 123
 
3.9%
Lowercase Letter 87
 
2.8%
Decimal Number 39
 
1.2%
Other Punctuation 16
 
0.5%
Close Punctuation 15
 
0.5%
Open Punctuation 7
 
0.2%
Dash Punctuation 3
 
0.1%
Space Separator 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
220
 
7.8%
215
 
7.6%
124
 
4.4%
105
 
3.7%
92
 
3.2%
77
 
2.7%
74
 
2.6%
64
 
2.3%
61
 
2.2%
56
 
2.0%
Other values (178) 1747
61.6%
Lowercase Letter
ValueCountFrequency (%)
e 12
13.8%
n 9
10.3%
t 9
10.3%
a 9
10.3%
b 7
8.0%
c 6
 
6.9%
o 6
 
6.9%
r 4
 
4.6%
i 4
 
4.6%
s 4
 
4.6%
Other values (7) 17
19.5%
Uppercase Letter
ValueCountFrequency (%)
D 15
12.2%
P 15
12.2%
M 13
10.6%
R 13
10.6%
C 13
10.6%
G 10
8.1%
Q 9
7.3%
O 7
5.7%
S 6
 
4.9%
I 5
 
4.1%
Other values (6) 17
13.8%
Decimal Number
ValueCountFrequency (%)
1 10
25.6%
2 5
12.8%
8 4
 
10.3%
5 4
 
10.3%
0 4
 
10.3%
6 4
 
10.3%
3 4
 
10.3%
4 4
 
10.3%
Other Punctuation
ValueCountFrequency (%)
& 7
43.8%
: 4
25.0%
· 4
25.0%
/ 1
 
6.2%
Close Punctuation
ValueCountFrequency (%)
) 15
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Space Separator
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2835
90.6%
Latin 210
 
6.7%
Common 83
 
2.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
220
 
7.8%
215
 
7.6%
124
 
4.4%
105
 
3.7%
92
 
3.2%
77
 
2.7%
74
 
2.6%
64
 
2.3%
61
 
2.2%
56
 
2.0%
Other values (178) 1747
61.6%
Latin
ValueCountFrequency (%)
D 15
 
7.1%
P 15
 
7.1%
M 13
 
6.2%
R 13
 
6.2%
C 13
 
6.2%
e 12
 
5.7%
G 10
 
4.8%
n 9
 
4.3%
Q 9
 
4.3%
t 9
 
4.3%
Other values (23) 92
43.8%
Common
ValueCountFrequency (%)
) 15
18.1%
1 10
12.0%
( 7
 
8.4%
& 7
 
8.4%
2 5
 
6.0%
8 4
 
4.8%
5 4
 
4.8%
: 4
 
4.8%
0 4
 
4.8%
· 4
 
4.8%
Other values (6) 19
22.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2833
90.6%
ASCII 289
 
9.2%
None 4
 
0.1%
Compat Jamo 2
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
220
 
7.8%
215
 
7.6%
124
 
4.4%
105
 
3.7%
92
 
3.2%
77
 
2.7%
74
 
2.6%
64
 
2.3%
61
 
2.2%
56
 
2.0%
Other values (177) 1745
61.6%
ASCII
ValueCountFrequency (%)
D 15
 
5.2%
P 15
 
5.2%
) 15
 
5.2%
M 13
 
4.5%
R 13
 
4.5%
C 13
 
4.5%
e 12
 
4.2%
G 10
 
3.5%
1 10
 
3.5%
n 9
 
3.1%
Other values (38) 164
56.7%
None
ValueCountFrequency (%)
· 4
100.0%
Compat Jamo
ValueCountFrequency (%)
2
100.0%
Distinct120
Distinct (%)52.9%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2023-12-12T11:07:15.289813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length38
Median length29
Mean length13.779736
Min length8

Characters and Unicode

Total characters3128
Distinct characters237
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique57 ?
Unique (%)25.1%

Sample

1st row바이오의약품개발입문과정
2nd row바이오의약품개발입문과정
3rd row보건산업분야일자리교육강사양성과정
4th row보건산업분야일자리교육강사양성과정
5th row보건산업일자리아카데미
ValueCountFrequency (%)
보건산업일자리아카데미 10
 
4.3%
글로벌헬스케어직업탐색과정 10
 
4.3%
국제진료간호사임상영어회화과정 6
 
2.6%
의료용어표준심화과정 5
 
2.2%
의약품후보물질발굴전문과정 5
 
2.2%
외국인환자유치사업역량강화과정 4
 
1.7%
의료기기입문마스터과정 4
 
1.7%
의료기기품질경영시스템)iso13485:2016)실습과정 4
 
1.7%
의약품gmp기본과정 3
 
1.3%
의약품qbd기본과정 3
 
1.3%
Other values (113) 176
76.5%
2023-12-12T11:07:15.807630image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
220
 
7.0%
215
 
6.9%
124
 
4.0%
105
 
3.4%
92
 
2.9%
77
 
2.5%
74
 
2.4%
64
 
2.0%
61
 
2.0%
56
 
1.8%
Other values (227) 2040
65.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2835
90.6%
Uppercase Letter 123
 
3.9%
Lowercase Letter 87
 
2.8%
Decimal Number 39
 
1.2%
Other Punctuation 16
 
0.5%
Close Punctuation 15
 
0.5%
Open Punctuation 7
 
0.2%
Dash Punctuation 3
 
0.1%
Space Separator 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
220
 
7.8%
215
 
7.6%
124
 
4.4%
105
 
3.7%
92
 
3.2%
77
 
2.7%
74
 
2.6%
64
 
2.3%
61
 
2.2%
56
 
2.0%
Other values (178) 1747
61.6%
Lowercase Letter
ValueCountFrequency (%)
e 12
13.8%
n 9
10.3%
t 9
10.3%
a 9
10.3%
b 7
8.0%
c 6
 
6.9%
o 6
 
6.9%
r 4
 
4.6%
i 4
 
4.6%
s 4
 
4.6%
Other values (7) 17
19.5%
Uppercase Letter
ValueCountFrequency (%)
D 15
12.2%
P 15
12.2%
M 13
10.6%
R 13
10.6%
C 13
10.6%
G 10
8.1%
Q 9
7.3%
O 7
5.7%
S 6
 
4.9%
I 5
 
4.1%
Other values (6) 17
13.8%
Decimal Number
ValueCountFrequency (%)
1 10
25.6%
2 5
12.8%
8 4
 
10.3%
5 4
 
10.3%
0 4
 
10.3%
6 4
 
10.3%
3 4
 
10.3%
4 4
 
10.3%
Other Punctuation
ValueCountFrequency (%)
& 7
43.8%
: 4
25.0%
· 4
25.0%
/ 1
 
6.2%
Close Punctuation
ValueCountFrequency (%)
) 15
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Space Separator
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2835
90.6%
Latin 210
 
6.7%
Common 83
 
2.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
220
 
7.8%
215
 
7.6%
124
 
4.4%
105
 
3.7%
92
 
3.2%
77
 
2.7%
74
 
2.6%
64
 
2.3%
61
 
2.2%
56
 
2.0%
Other values (178) 1747
61.6%
Latin
ValueCountFrequency (%)
D 15
 
7.1%
P 15
 
7.1%
M 13
 
6.2%
R 13
 
6.2%
C 13
 
6.2%
e 12
 
5.7%
G 10
 
4.8%
n 9
 
4.3%
Q 9
 
4.3%
t 9
 
4.3%
Other values (23) 92
43.8%
Common
ValueCountFrequency (%)
) 15
18.1%
1 10
12.0%
( 7
 
8.4%
& 7
 
8.4%
2 5
 
6.0%
8 4
 
4.8%
5 4
 
4.8%
: 4
 
4.8%
0 4
 
4.8%
· 4
 
4.8%
Other values (6) 19
22.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2833
90.6%
ASCII 289
 
9.2%
None 4
 
0.1%
Compat Jamo 2
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
220
 
7.8%
215
 
7.6%
124
 
4.4%
105
 
3.7%
92
 
3.2%
77
 
2.7%
74
 
2.6%
64
 
2.3%
61
 
2.2%
56
 
2.0%
Other values (177) 1745
61.6%
ASCII
ValueCountFrequency (%)
D 15
 
5.2%
P 15
 
5.2%
) 15
 
5.2%
M 13
 
4.5%
R 13
 
4.5%
C 13
 
4.5%
e 12
 
4.2%
G 10
 
3.5%
1 10
 
3.5%
n 9
 
3.1%
Other values (38) 164
56.7%
None
ValueCountFrequency (%)
· 4
100.0%
Compat Jamo
ValueCountFrequency (%)
2
100.0%

본부
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
보건인재양성본부
227 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row보건인재양성본부
2nd row보건인재양성본부
3rd row보건인재양성본부
4th row보건인재양성본부
5th row보건인재양성본부

Common Values

ValueCountFrequency (%)
보건인재양성본부 227
100.0%

Length

2023-12-12T11:07:15.998150image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:07:16.124331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
보건인재양성본부 227
100.0%

부서
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
제약의료기기화장품교육단
129 
국제교육부
48 
보건산업교육부
26 
정밀의료인재양성부
24 

Length

Max length12
Median length12
Mean length9.6299559
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row보건산업교육부
2nd row보건산업교육부
3rd row보건산업교육부
4th row보건산업교육부
5th row보건산업교육부

Common Values

ValueCountFrequency (%)
제약의료기기화장품교육단 129
56.8%
국제교육부 48
 
21.1%
보건산업교육부 26
 
11.5%
정밀의료인재양성부 24
 
10.6%

Length

2023-12-12T11:07:16.264686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:07:16.402966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
제약의료기기화장품교육단 129
56.8%
국제교육부 48
 
21.1%
보건산업교육부 26
 
11.5%
정밀의료인재양성부 24
 
10.6%

분야
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
보건산업
227 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row보건산업
2nd row보건산업
3rd row보건산업
4th row보건산업
5th row보건산업

Common Values

ValueCountFrequency (%)
보건산업 227
100.0%

Length

2023-12-12T11:07:16.526457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:07:16.657405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
보건산업 227
100.0%

세부교육분야
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)3.1%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
제약
72 
글로벌헬스케어
48 
화장품
27 
의료기기
26 
정밀의료
24 
Other values (2)
30 

Length

Max length7
Median length6
Mean length3.8281938
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row제약
2nd row제약
3rd row보건산업일반
4th row보건산업일반
5th row보건산업일반

Common Values

ValueCountFrequency (%)
제약 72
31.7%
글로벌헬스케어 48
21.1%
화장품 27
 
11.9%
의료기기 26
 
11.5%
정밀의료 24
 
10.6%
병원 18
 
7.9%
보건산업일반 12
 
5.3%

Length

2023-12-12T11:07:16.757348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:07:16.896620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
제약 72
31.7%
글로벌헬스케어 48
21.1%
화장품 27
 
11.9%
의료기기 26
 
11.5%
정밀의료 24
 
10.6%
병원 18
 
7.9%
보건산업일반 12
 
5.3%
Distinct102
Distinct (%)44.9%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2023-12-12T11:07:17.176363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length7.9735683
Min length2

Characters and Unicode

Total characters1810
Distinct characters15
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique56 ?
Unique (%)24.7%

Sample

1st row2021-04-19
2nd row2021-08-03
3rd row미정
4th row미정
5th row2021-06-04
ValueCountFrequency (%)
미정 57
25.1%
2021-05-27 4
 
1.8%
2021-08-10 4
 
1.8%
2021-06-29 4
 
1.8%
2021-06-04 4
 
1.8%
2021-09-07 3
 
1.3%
2021-07-20 3
 
1.3%
2021-04-19 3
 
1.3%
2021-05-25 3
 
1.3%
2021-08-24 3
 
1.3%
Other values (92) 139
61.2%
2023-12-12T11:07:17.647831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 419
23.1%
0 405
22.4%
- 339
18.7%
1 252
13.9%
58
 
3.2%
58
 
3.2%
7 46
 
2.5%
5 42
 
2.3%
8 42
 
2.3%
6 40
 
2.2%
Other values (5) 109
 
6.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1353
74.8%
Dash Punctuation 339
 
18.7%
Other Letter 116
 
6.4%
Open Punctuation 1
 
0.1%
Close Punctuation 1
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 419
31.0%
0 405
29.9%
1 252
18.6%
7 46
 
3.4%
5 42
 
3.1%
8 42
 
3.1%
6 40
 
3.0%
9 38
 
2.8%
3 35
 
2.6%
4 34
 
2.5%
Other Letter
ValueCountFrequency (%)
58
50.0%
58
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 339
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1694
93.6%
Hangul 116
 
6.4%

Most frequent character per script

Common
ValueCountFrequency (%)
2 419
24.7%
0 405
23.9%
- 339
20.0%
1 252
14.9%
7 46
 
2.7%
5 42
 
2.5%
8 42
 
2.5%
6 40
 
2.4%
9 38
 
2.2%
3 35
 
2.1%
Other values (3) 36
 
2.1%
Hangul
ValueCountFrequency (%)
58
50.0%
58
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1694
93.6%
Hangul 116
 
6.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 419
24.7%
0 405
23.9%
- 339
20.0%
1 252
14.9%
7 46
 
2.7%
5 42
 
2.5%
8 42
 
2.5%
6 40
 
2.4%
9 38
 
2.2%
3 35
 
2.1%
Other values (3) 36
 
2.1%
Hangul
ValueCountFrequency (%)
58
50.0%
58
50.0%
Distinct112
Distinct (%)49.3%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2023-12-12T11:07:18.054459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length7.938326
Min length2

Characters and Unicode

Total characters1802
Distinct characters15
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique68 ?
Unique (%)30.0%

Sample

1st row2021-06-02
2nd row2021-09-06
3rd row미정
4th row미정
5th row2021-06-02
ValueCountFrequency (%)
미정 58
25.6%
2021-08-25 4
 
1.8%
2021-05-28 4
 
1.8%
2021-06-04 4
 
1.8%
2021-07-16 3
 
1.3%
2021-06-02 3
 
1.3%
2021-05-13 3
 
1.3%
2021-07-07 3
 
1.3%
2021-08-11 3
 
1.3%
2021-06-30 3
 
1.3%
Other values (102) 139
61.2%
2023-12-12T11:07:18.578060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 412
22.9%
0 396
22.0%
- 337
18.7%
1 265
14.7%
59
 
3.3%
59
 
3.3%
6 48
 
2.7%
8 41
 
2.3%
7 41
 
2.3%
3 40
 
2.2%
Other values (5) 104
 
5.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1345
74.6%
Dash Punctuation 337
 
18.7%
Other Letter 118
 
6.5%
Open Punctuation 1
 
0.1%
Close Punctuation 1
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 412
30.6%
0 396
29.4%
1 265
19.7%
6 48
 
3.6%
8 41
 
3.0%
7 41
 
3.0%
3 40
 
3.0%
4 37
 
2.8%
9 34
 
2.5%
5 31
 
2.3%
Other Letter
ValueCountFrequency (%)
59
50.0%
59
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 337
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1684
93.5%
Hangul 118
 
6.5%

Most frequent character per script

Common
ValueCountFrequency (%)
2 412
24.5%
0 396
23.5%
- 337
20.0%
1 265
15.7%
6 48
 
2.9%
8 41
 
2.4%
7 41
 
2.4%
3 40
 
2.4%
4 37
 
2.2%
9 34
 
2.0%
Other values (3) 33
 
2.0%
Hangul
ValueCountFrequency (%)
59
50.0%
59
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1684
93.5%
Hangul 118
 
6.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 412
24.5%
0 396
23.5%
- 337
20.0%
1 265
15.7%
6 48
 
2.9%
8 41
 
2.4%
7 41
 
2.4%
3 40
 
2.4%
4 37
 
2.2%
9 34
 
2.0%
Other values (3) 33
 
2.0%
Hangul
ValueCountFrequency (%)
59
50.0%
59
50.0%

교육일수
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct14
Distinct (%)6.2%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2
136 
1
56 
45
 
11
3
 
8
미정
 
2
Other values (9)
14 

Length

Max length2
Median length1
Mean length1.0969163
Min length1

Unique

Unique4 ?
Unique (%)1.8%

Sample

1st row45
2nd row45
3rd row미정
4th row미정
5th row2

Common Values

ValueCountFrequency (%)
2 136
59.9%
1 56
24.7%
45 11
 
4.8%
3 8
 
3.5%
미정 2
 
0.9%
16 2
 
0.9%
5 2
 
0.9%
43 2
 
0.9%
12 2
 
0.9%
11 2
 
0.9%
Other values (4) 4
 
1.8%

Length

2023-12-12T11:07:18.747541image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2 136
59.9%
1 56
24.7%
45 11
 
4.8%
3 8
 
3.5%
미정 2
 
0.9%
16 2
 
0.9%
5 2
 
0.9%
43 2
 
0.9%
12 2
 
0.9%
11 2
 
0.9%
Other values (4) 4
 
1.8%

교육시간
Categorical

HIGH CORRELATION 

Distinct30
Distinct (%)13.2%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
16
118 
8
15 
4
 
11
21
 
9
12
 
9
Other values (25)
65 

Length

Max length3
Median length2
Mean length1.8502203
Min length1

Unique

Unique7 ?
Unique (%)3.1%

Sample

1st row66
2nd row66
3rd row미정
4th row미정
5th row5

Common Values

ValueCountFrequency (%)
16 118
52.0%
8 15
 
6.6%
4 11
 
4.8%
21 9
 
4.0%
12 9
 
4.0%
6 7
 
3.1%
10 7
 
3.1%
7 6
 
2.6%
40 5
 
2.2%
66 4
 
1.8%
Other values (20) 36
 
15.9%

Length

2023-12-12T11:07:18.934368image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
16 118
52.0%
8 15
 
6.6%
4 11
 
4.8%
21 9
 
4.0%
12 9
 
4.0%
6 7
 
3.1%
10 7
 
3.1%
7 6
 
2.6%
40 5
 
2.2%
66 4
 
1.8%
Other values (20) 36
 
15.9%

계획인원
Real number (ℝ)

HIGH CORRELATION 

Distinct11
Distinct (%)4.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean25.837004
Minimum5
Maximum85
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.1 KiB
2023-12-12T11:07:19.095534image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum5
5-th percentile15
Q120
median25
Q330
95-th percentile50
Maximum85
Range80
Interquartile range (IQR)10

Descriptive statistics

Standard deviation9.487059
Coefficient of variation (CV)0.36718881
Kurtosis8.2598745
Mean25.837004
Median Absolute Deviation (MAD)5
Skewness2.070521
Sum5865
Variance90.004288
MonotonicityNot monotonic
2023-12-12T11:07:19.251723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
20 79
34.8%
30 74
32.6%
25 40
17.6%
15 11
 
4.8%
50 10
 
4.4%
10 5
 
2.2%
5 2
 
0.9%
40 2
 
0.9%
60 2
 
0.9%
85 1
 
0.4%
ValueCountFrequency (%)
5 2
 
0.9%
10 5
 
2.2%
15 11
 
4.8%
20 79
34.8%
25 40
17.6%
30 74
32.6%
40 2
 
0.9%
50 10
 
4.4%
55 1
 
0.4%
60 2
 
0.9%
ValueCountFrequency (%)
85 1
 
0.4%
60 2
 
0.9%
55 1
 
0.4%
50 10
 
4.4%
40 2
 
0.9%
30 74
32.6%
25 40
17.6%
20 79
34.8%
15 11
 
4.8%
10 5
 
2.2%

권역구분
Categorical

Distinct4
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
미정
122 
서울
103 
부산
 
1
충북
 
1

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique2 ?
Unique (%)0.9%

Sample

1st row서울
2nd row서울
3rd row서울
4th row서울
5th row서울

Common Values

ValueCountFrequency (%)
미정 122
53.7%
서울 103
45.4%
부산 1
 
0.4%
충북 1
 
0.4%

Length

2023-12-12T11:07:19.446391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:07:19.557717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
미정 122
53.7%
서울 103
45.4%
부산 1
 
0.4%
충북 1
 
0.4%

Interactions

2023-12-12T11:07:12.360734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:07:12.122835image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:07:12.466980image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:07:12.236664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T11:07:19.699015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
교육방식교육재원기수부서세부교육분야교육일수교육시간계획인원권역구분
교육방식1.0000.3640.1840.6390.5350.8740.9440.4870.232
교육재원0.3641.0000.0001.0001.0000.7490.9570.3370.446
기수0.1840.0001.0000.1820.2470.0000.0000.2640.000
부서0.6391.0000.1821.0000.9220.8980.9830.7490.660
세부교육분야0.5351.0000.2470.9221.0000.8410.9040.6610.392
교육일수0.8740.7490.0000.8980.8411.0000.9800.5460.000
교육시간0.9440.9570.0000.9830.9040.9801.0000.9000.420
계획인원0.4870.3370.2640.7490.6610.5460.9001.0000.223
권역구분0.2320.4460.0000.6600.3920.0000.4200.2231.000
2023-12-12T11:07:19.945573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
부서교육일수권역구분교육방식세부교육분야교육재원교육시간
부서1.0000.7370.3120.6600.8900.9960.868
교육일수0.7371.0000.0000.7430.4710.5850.804
권역구분0.3120.0001.0000.2200.2770.2990.215
교육방식0.6600.7430.2201.0000.4190.5780.716
세부교육분야0.8900.4710.2770.4191.0000.9890.632
교육재원0.9960.5850.2990.5780.9891.0000.796
교육시간0.8680.8040.2150.7160.6320.7961.000
2023-12-12T11:07:20.120921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기수계획인원교육방식교육재원부서세부교육분야교육일수교육시간권역구분
기수1.0000.3030.1090.0000.1070.1260.0000.0000.000
계획인원0.3031.0000.3500.2490.4130.4310.2720.6050.100
교육방식0.1090.3501.0000.5780.6600.4190.7430.7160.220
교육재원0.0000.2490.5781.0000.9960.9890.5850.7960.299
부서0.1070.4130.6600.9961.0000.8900.7370.8680.312
세부교육분야0.1260.4310.4190.9890.8901.0000.4710.6320.277
교육일수0.0000.2720.7430.5850.7370.4711.0000.8040.000
교육시간0.0000.6050.7160.7960.8680.6320.8041.0000.215
권역구분0.0000.1000.2200.2990.3120.2770.0000.2151.000

Missing values

2023-12-12T11:07:12.657731image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T11:07:12.897193image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

교육방식교육재원교육대상기수과정명실과정명본부부서분야세부교육분야교육기간(시작)교육기간(종료)교육일수교육시간계획인원권역구분
0혼합민경민간1바이오의약품개발입문과정바이오의약품개발입문과정보건인재양성본부보건산업교육부보건산업제약2021-04-192021-06-02456620서울
1혼합민경민간2바이오의약품개발입문과정바이오의약품개발입문과정보건인재양성본부보건산업교육부보건산업제약2021-08-032021-09-06456620서울
2혼합민경민간1보건산업분야일자리교육강사양성과정보건산업분야일자리교육강사양성과정보건인재양성본부보건산업교육부보건산업보건산업일반미정미정미정미정5서울
3혼합민경민간2보건산업분야일자리교육강사양성과정보건산업분야일자리교육강사양성과정보건인재양성본부보건산업교육부보건산업보건산업일반미정미정미정미정5서울
4라이브민경민간1보건산업일자리아카데미보건산업일자리아카데미보건인재양성본부보건산업교육부보건산업보건산업일반2021-06-042021-06-022550서울
5라이브민경민간2보건산업일자리아카데미보건산업일자리아카데미보건인재양성본부보건산업교육부보건산업보건산업일반미정미정1450서울
6라이브민경민간3보건산업일자리아카데미보건산업일자리아카데미보건인재양성본부보건산업교육부보건산업보건산업일반미정미정1450서울
7라이브민경민간4보건산업일자리아카데미보건산업일자리아카데미보건인재양성본부보건산업교육부보건산업보건산업일반미정미정1450서울
8라이브민경민간5보건산업일자리아카데미보건산업일자리아카데미보건인재양성본부보건산업교육부보건산업보건산업일반미정미정1450서울
9라이브민경민간6보건산업일자리아카데미보건산업일자리아카데미보건인재양성본부보건산업교육부보건산업보건산업일반미정미정1450서울
교육방식교육재원교육대상기수과정명실과정명본부부서분야세부교육분야교육기간(시작)교육기간(종료)교육일수교육시간계획인원권역구분
217라이브민경민간1화장품ㆍ의약품융합기술법령및규제심화과정화장품ㆍ의약품융합기술법령및규제심화과정보건인재양성본부제약의료기기화장품교육단보건산업화장품2021-05-272021-05-2821620서울
218집합민경민간1임상시험ProjectManagement역량강화심화과정임상시험ProjectManagement역량강화심화과정보건인재양성본부제약의료기기화장품교육단보건산업제약미정미정21620미정
219집합민경민간1전문의약품신약연구심화과정전문의약품신약연구심화과정보건인재양성본부제약의료기기화장품교육단보건산업제약2021-06-142021-06-1521620미정
220라이브민경민간1기능성화장품전략과정기능성화장품전략과정보건인재양성본부제약의료기기화장품교육단보건산업화장품2021-03-232021-03-2421625서울
221라이브민경민간4의약품후보물질발굴전문과정의약품후보물질발굴전문과정보건인재양성본부제약의료기기화장품교육단보건산업제약2021-07-062021-07-0721625서울
222라이브민경민간5의약품후보물질발굴전문과정의약품후보물질발굴전문과정보건인재양성본부제약의료기기화장품교육단보건산업제약2021-10-192021-10-2021625서울
223집합민경민간1K-뷰티 디지털마케팅 크리에이터 심화과정K-뷰티 디지털마케팅 크리에이터 심화과정보건인재양성본부제약의료기기화장품교육단보건산업화장품2021-06-292021-06-3021620미정
224라이브민경민간2의약품경제성평가실무실습과정의약품경제성평가실무실습과정보건인재양성본부제약의료기기화장품교육단보건산업제약2021-07-082021-07-0921620미정
225라이브민경민간1비임상시험전문과정비임상시험전문과정보건인재양성본부제약의료기기화장품교육단보건산업제약2021-06-242021-06-2521630서울
226라이브민경민간2비임상시험전문과정비임상시험전문과정보건인재양성본부제약의료기기화장품교육단보건산업제약2021-07-152021-07-1621630서울