Overview

Dataset statistics

Number of variables8
Number of observations707
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory45.7 KiB
Average record size in memory66.2 B

Variable types

Categorical3
Text2
DateTime2
Numeric1

Dataset

Description특성화고 인력양성 교육 및 중소기업 취업연계를 이루어낸 특성화고 명단 및 관련 시작일, 종료일, 참여인원, 교육내용 등 상세정보
URLhttps://www.data.go.kr/data/15047641/fileData.do

Alerts

년도 has constant value ""Constant

Reproduction

Analysis started2023-12-13 00:57:31.612393
Analysis finished2023-12-13 00:57:32.348315
Duration0.74 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

년도
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size5.7 KiB
2023
707 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023
2nd row2023
3rd row2023
4th row2023
5th row2023

Common Values

ValueCountFrequency (%)
2023 707
100.0%

Length

2023-12-13T09:57:32.622108image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T09:57:32.690391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023 707
100.0%

학교
Text

Distinct193
Distinct (%)27.3%
Missing0
Missing (%)0.0%
Memory size5.7 KiB
2023-12-13T09:57:32.853570image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length11
Mean length8.9377652
Min length6

Characters and Unicode

Total characters6319
Distinct characters167
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3 ?
Unique (%)0.4%

Sample

1st row강릉정보공업고등학교
2nd row강릉정보공업고등학교
3rd row강릉정보공업고등학교
4th row강서공업고등학교
5th row강서공업고등학교
ValueCountFrequency (%)
세그루패션디자인고등학교 10
 
1.4%
유성생명과학고등학교 7
 
1.0%
창원기계공업고등학교 7
 
1.0%
춘천한샘고등학교 7
 
1.0%
수원공업고등학교 7
 
1.0%
춘천기계공업고등학교 6
 
0.8%
천안상업고등학교 6
 
0.8%
분당경영고등학교 6
 
0.8%
성동공업고등학교 6
 
0.8%
충주상업고등학교 6
 
0.8%
Other values (183) 639
90.4%
2023-12-13T09:57:33.171434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
807
 
12.8%
709
 
11.2%
707
 
11.2%
707
 
11.2%
297
 
4.7%
221
 
3.5%
144
 
2.3%
112
 
1.8%
98
 
1.6%
92
 
1.5%
Other values (157) 2425
38.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6311
99.9%
Lowercase Letter 4
 
0.1%
Uppercase Letter 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
807
 
12.8%
709
 
11.2%
707
 
11.2%
707
 
11.2%
297
 
4.7%
221
 
3.5%
144
 
2.3%
112
 
1.8%
98
 
1.6%
92
 
1.5%
Other values (154) 2417
38.3%
Uppercase Letter
ValueCountFrequency (%)
T 2
50.0%
I 2
50.0%
Lowercase Letter
ValueCountFrequency (%)
e 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6311
99.9%
Latin 8
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
807
 
12.8%
709
 
11.2%
707
 
11.2%
707
 
11.2%
297
 
4.7%
221
 
3.5%
144
 
2.3%
112
 
1.8%
98
 
1.6%
92
 
1.5%
Other values (154) 2417
38.3%
Latin
ValueCountFrequency (%)
e 4
50.0%
T 2
25.0%
I 2
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6311
99.9%
ASCII 8
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
807
 
12.8%
709
 
11.2%
707
 
11.2%
707
 
11.2%
297
 
4.7%
221
 
3.5%
144
 
2.3%
112
 
1.8%
98
 
1.6%
92
 
1.5%
Other values (154) 2417
38.3%
ASCII
ValueCountFrequency (%)
e 4
50.0%
T 2
25.0%
I 2
25.0%

소재지
Categorical

Distinct16
Distinct (%)2.3%
Missing0
Missing (%)0.0%
Memory size5.7 KiB
경기
131 
서울
101 
부산
75 
전남
56 
대구
50 
Other values (11)
294 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row강원
2nd row강원
3rd row강원
4th row서울
5th row서울

Common Values

ValueCountFrequency (%)
경기 131
18.5%
서울 101
14.3%
부산 75
10.6%
전남 56
7.9%
대구 50
 
7.1%
경남 43
 
6.1%
충북 43
 
6.1%
인천 38
 
5.4%
충남 37
 
5.2%
강원 36
 
5.1%
Other values (6) 97
13.7%

Length

2023-12-13T09:57:33.281567image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기 131
18.5%
서울 101
14.3%
부산 75
10.6%
전남 56
7.9%
대구 50
 
7.1%
경남 43
 
6.1%
충북 43
 
6.1%
인천 38
 
5.4%
충남 37
 
5.2%
강원 36
 
5.1%
Other values (6) 97
13.7%

전공
Categorical

Distinct19
Distinct (%)2.7%
Missing0
Missing (%)0.0%
Memory size5.7 KiB
경영,회계,사무
136 
기계
125 
전기, 전자
122 
문화,예술,디자인,방송
56 
음식서비스
51 
Other values (14)
217 

Length

Max length19
Median length13
Mean length7.8896747
Min length3

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row경영,회계,사무
2nd row음식서비스
3rd row미용,숙박,여행,오락 및 스포츠
4th row건설
5th row전기, 전자

Common Values

ValueCountFrequency (%)
경영,회계,사무 136
19.2%
기계 125
17.7%
전기, 전자 122
17.3%
문화,예술,디자인,방송 56
7.9%
음식서비스 51
 
7.2%
미용,숙박,여행,오락 및 스포츠 48
 
6.8%
정보통신 39
 
5.5%
건설 29
 
4.1%
화학 23
 
3.3%
보건 및 의료 14
 
2.0%
Other values (9) 64
9.1%

Length

2023-12-13T09:57:33.380992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경영,회계,사무 136
12.9%
기계 125
11.8%
전기 122
11.6%
전자 122
11.6%
89
8.4%
문화,예술,디자인,방송 56
 
5.3%
음식서비스 51
 
4.8%
미용,숙박,여행,오락 48
 
4.5%
스포츠 48
 
4.5%
정보통신 39
 
3.7%
Other values (22) 220
20.8%
Distinct70
Distinct (%)9.9%
Missing0
Missing (%)0.0%
Memory size5.7 KiB
Minimum2023-03-02 00:00:00
Maximum2023-08-28 00:00:00
2023-12-13T09:57:33.477772image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T09:57:33.579457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct70
Distinct (%)9.9%
Missing0
Missing (%)0.0%
Memory size5.7 KiB
Minimum2023-06-21 00:00:00
Maximum2024-01-05 00:00:00
2023-12-13T09:57:33.680682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T09:57:33.780854image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

참여인원
Real number (ℝ)

Distinct23
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8.7708628
Minimum1
Maximum45
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size6.3 KiB
2023-12-13T09:57:33.878147image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5
Q16
median8
Q310
95-th percentile16
Maximum45
Range44
Interquartile range (IQR)4

Descriptive statistics

Standard deviation3.9297802
Coefficient of variation (CV)0.44804944
Kurtosis11.288976
Mean8.7708628
Median Absolute Deviation (MAD)2
Skewness2.1390258
Sum6201
Variance15.443172
MonotonicityNot monotonic
2023-12-13T09:57:33.961076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=23)
ValueCountFrequency (%)
5 124
17.5%
7 92
13.0%
8 89
12.6%
6 77
10.9%
10 68
9.6%
9 63
8.9%
11 41
 
5.8%
12 33
 
4.7%
14 21
 
3.0%
15 19
 
2.7%
Other values (13) 80
11.3%
ValueCountFrequency (%)
1 2
 
0.3%
3 4
 
0.6%
4 15
 
2.1%
5 124
17.5%
6 77
10.9%
7 92
13.0%
8 89
12.6%
9 63
8.9%
10 68
9.6%
11 41
 
5.8%
ValueCountFrequency (%)
45 1
 
0.1%
26 2
 
0.3%
24 2
 
0.3%
23 1
 
0.1%
20 3
 
0.4%
19 5
 
0.7%
18 11
1.6%
17 5
 
0.7%
16 11
1.6%
15 19
2.7%
Distinct642
Distinct (%)90.8%
Missing0
Missing (%)0.0%
Memory size5.7 KiB
2023-12-13T09:57:34.198065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length28
Mean length8.9024045
Min length2

Characters and Unicode

Total characters6294
Distinct characters319
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique603 ?
Unique (%)85.3%

Sample

1st row사무일반 및 디자인 과정
2nd row외식산업전문인 양성과정
3rd row전문미용인 양성과정
4th row건축시공과정
5th row전자 통신 정보융합과정
ValueCountFrequency (%)
67
 
5.6%
과정 53
 
4.4%
스마트 24
 
2.0%
실무과정 20
 
1.7%
사무행정 13
 
1.1%
제조 12
 
1.0%
제작 12
 
1.0%
스마트공장 10
 
0.8%
운용 9
 
0.8%
양성과정 9
 
0.8%
Other values (747) 968
80.9%
2023-12-13T09:57:34.552424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
493
 
7.8%
411
 
6.5%
371
 
5.9%
220
 
3.5%
172
 
2.7%
168
 
2.7%
165
 
2.6%
161
 
2.6%
133
 
2.1%
131
 
2.1%
Other values (309) 3869
61.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5467
86.9%
Space Separator 493
 
7.8%
Uppercase Letter 169
 
2.7%
Lowercase Letter 56
 
0.9%
Decimal Number 40
 
0.6%
Other Punctuation 32
 
0.5%
Open Punctuation 14
 
0.2%
Close Punctuation 14
 
0.2%
Dash Punctuation 7
 
0.1%
Letter Number 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
411
 
7.5%
371
 
6.8%
220
 
4.0%
172
 
3.1%
168
 
3.1%
165
 
3.0%
161
 
2.9%
133
 
2.4%
131
 
2.4%
117
 
2.1%
Other values (258) 3418
62.5%
Uppercase Letter
ValueCountFrequency (%)
C 23
13.6%
T 18
10.7%
I 17
10.1%
D 13
 
7.7%
M 12
 
7.1%
S 12
 
7.1%
E 12
 
7.1%
P 12
 
7.1%
A 9
 
5.3%
N 8
 
4.7%
Other values (10) 33
19.5%
Lowercase Letter
ValueCountFrequency (%)
e 8
14.3%
i 7
12.5%
o 6
10.7%
n 6
10.7%
t 6
10.7%
a 6
10.7%
l 2
 
3.6%
g 2
 
3.6%
y 2
 
3.6%
r 2
 
3.6%
Other values (6) 9
16.1%
Other Punctuation
ValueCountFrequency (%)
· 16
50.0%
, 11
34.4%
/ 3
 
9.4%
& 1
 
3.1%
. 1
 
3.1%
Decimal Number
ValueCountFrequency (%)
2 18
45.0%
3 14
35.0%
0 7
 
17.5%
1 1
 
2.5%
Letter Number
ValueCountFrequency (%)
1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
493
100.0%
Open Punctuation
ValueCountFrequency (%)
( 14
100.0%
Close Punctuation
ValueCountFrequency (%)
) 14
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5467
86.9%
Common 600
 
9.5%
Latin 227
 
3.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
411
 
7.5%
371
 
6.8%
220
 
4.0%
172
 
3.1%
168
 
3.1%
165
 
3.0%
161
 
2.9%
133
 
2.4%
131
 
2.4%
117
 
2.1%
Other values (258) 3418
62.5%
Latin
ValueCountFrequency (%)
C 23
 
10.1%
T 18
 
7.9%
I 17
 
7.5%
D 13
 
5.7%
M 12
 
5.3%
S 12
 
5.3%
E 12
 
5.3%
P 12
 
5.3%
A 9
 
4.0%
N 8
 
3.5%
Other values (28) 91
40.1%
Common
ValueCountFrequency (%)
493
82.2%
2 18
 
3.0%
· 16
 
2.7%
( 14
 
2.3%
3 14
 
2.3%
) 14
 
2.3%
, 11
 
1.8%
- 7
 
1.2%
0 7
 
1.2%
/ 3
 
0.5%
Other values (3) 3
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5465
86.8%
ASCII 809
 
12.9%
None 16
 
0.3%
Compat Jamo 2
 
< 0.1%
Number Forms 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
493
60.9%
C 23
 
2.8%
2 18
 
2.2%
T 18
 
2.2%
I 17
 
2.1%
( 14
 
1.7%
3 14
 
1.7%
) 14
 
1.7%
D 13
 
1.6%
M 12
 
1.5%
Other values (38) 173
 
21.4%
Hangul
ValueCountFrequency (%)
411
 
7.5%
371
 
6.8%
220
 
4.0%
172
 
3.1%
168
 
3.1%
165
 
3.0%
161
 
2.9%
133
 
2.4%
131
 
2.4%
117
 
2.1%
Other values (257) 3416
62.5%
None
ValueCountFrequency (%)
· 16
100.0%
Compat Jamo
ValueCountFrequency (%)
2
100.0%
Number Forms
ValueCountFrequency (%)
1
50.0%
1
50.0%

Interactions

2023-12-13T09:57:32.135016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T09:57:34.628384image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
소재지전공시작일종료일참여인원
소재지1.0000.4330.8830.8890.204
전공0.4331.0000.6430.5900.343
시작일0.8830.6431.0000.9880.446
종료일0.8890.5900.9881.0000.555
참여인원0.2040.3430.4460.5551.000
2023-12-13T09:57:34.707856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
소재지전공
소재지1.0000.150
전공0.1501.000
2023-12-13T09:57:34.774291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
참여인원소재지전공
참여인원1.0000.0960.172
소재지0.0961.0000.150
전공0.1720.1501.000

Missing values

2023-12-13T09:57:32.216778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T09:57:32.306588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

년도학교소재지전공시작일종료일참여인원교육내용
02023강릉정보공업고등학교강원경영,회계,사무2023-07-102023-08-1010사무일반 및 디자인 과정
12023강릉정보공업고등학교강원음식서비스2023-07-102023-08-109외식산업전문인 양성과정
22023강릉정보공업고등학교강원미용,숙박,여행,오락 및 스포츠2023-07-102023-08-1011전문미용인 양성과정
32023강서공업고등학교서울건설2023-07-032023-08-3110건축시공과정
42023강서공업고등학교서울전기, 전자2023-07-032023-08-318전자 통신 정보융합과정
52023강서공업고등학교서울화학2023-07-032023-08-316환경측정 분석 및 화학제품제조과정
62023강원생명과학고등학교강원기계2023-07-312023-09-015IOT그린전기차
72023강원생명과학고등학교강원농림어업2023-07-192023-08-045미래농산업
82023강원생명과학고등학교강원식품가공2023-07-092023-07-2810바이오식품가공
92023강원생명과학고등학교강원전기, 전자2023-07-312023-08-256스마트전기전자
년도학교소재지전공시작일종료일참여인원교육내용
6972023해운대공업고등학교부산기계2023-07-172023-09-0110기계
6982023해운대공업고등학교부산전기, 전자2023-07-172023-08-3116전기전자
6992023해운대관광고등학교부산음식서비스2023-05-012023-09-278제과실무과정
7002023해운대관광고등학교부산음식서비스2023-05-012023-09-278조리실무과정
7012023해운대관광고등학교부산미용,숙박,여행,오락 및 스포츠2023-05-012023-09-278호텔서비스실무과정
7022023홍성공업고등학교충남기계2023-07-202023-08-109기계부품가공설비과정
7032023홍성공업고등학교충남전기, 전자2023-07-202023-08-108전기제어시스템운용과정
7042023휘경공업고등학교서울건설2023-07-102023-08-045건설시공과정
7052023휘경공업고등학교서울기계2023-07-102023-08-048기계가공과정
7062023휘경공업고등학교서울전기, 전자2023-07-102023-08-043전자제품생산과정