Overview

Dataset statistics

Number of variables: 9
Number of observations: 37
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.8 KiB
Average record size in memory: 78.5 B

Variable types

Numeric: 2
Categorical: 4
Text: 1
DateTime: 2

Dataset

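Description: Vocational training courses in Jeonbuk State (전북특별자치도) funded by the Ministry of Gender Equality and Family (여성가족부). Fields include Saeil center (새일센터), course type (과정유형), subtype (세부유형), training course name (교육훈련 과정명), headcount (인원), training institution (교육기관), training hours (교육 시간), and others.
Author: Jeonbuk State (전북특별자치도)
URL: https://www.data.go.kr/data/15055714/fileData.do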

Alerts

번호 is highly overall correlated with 새일센터 [High correlation]
교육 시간 is highly overall correlated with 세부유형 [High correlation]
새일센터 is highly overall correlated with 번호 and 1 other field [High correlation]
과정유형 is highly overall correlated with 세부유형 [High correlation]
세부유형 is highly overall correlated with 교육 시간 and 1 other field [High correlation]
인원 is highly overall correlated with 새일센터 [High correlation]
번호 has unique values [Unique]
교육훈련 과정명 has unique values [Unique]

Reproduction

Analysis started: 2024-03-15 00:56:03.497053
Analysis finished: 2024-03-15 00:56:06.059912
Duration: 2.56 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
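
The reported duration can be checked against the two timestamps above. The following is a minimal sketch using only the Python standard library; the variable names are illustrative and not part of the report.

```python
from datetime import datetime

# Timestamps copied from the "Analysis started" and "Analysis finished" entries.
started = datetime.fromisoformat("2024-03-15 00:56:03.497053")
finished = datetime.fromisoformat("2024-03-15 00:56:06.059912")

# Elapsed wall-clock time in seconds, including the microsecond part.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```

Rounded to two decimals, the difference agrees with the Duration entry.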

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 37
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 19
Minimum: 1
Maximum: 37
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 461.0 B

Quantile statistics

Minimum: 1
5th percentile: 2.8
Q1: 10
Median: 19
Q3: 28
95th percentile: 35.2
Maximum: 37
Range: 36
Interquartile range (IQR): 18

Descriptive statistics

Standard deviation: 10.824355
Coefficient of variation (CV): 0.56970291
Kurtosis: -1.2
Mean: 19
Median Absolute Deviation (MAD): 9
Skewness: 0
Sum: 703
Variance: 117.16667
Monotonicity: Strictly increasing
Histogram with fixed size bins (bins=37)

Common values
Value | Count | Frequency (%)
1 | 1 | 2.7%
29 | 1 | 2.7%
22 | 1 | 2.7%
23 | 1 | 2.7%
24 | 1 | 2.7%
25 | 1 | 2.7%
26 | 1 | 2.7%
27 | 1 | 2.7%
28 | 1 | 2.7%
30 | 1 | 2.7%
Other values (27) | 27 | 73.0%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 2.7%
2 | 1 | 2.7%
3 | 1 | 2.7%
4 | 1 | 2.7%
5 | 1 | 2.7%
6 | 1 | 2.7%
7 | 1 | 2.7%
8 | 1 | 2.7%
9 | 1 | 2.7%
10 | 1 | 2.7%

Maximum 10 values
Value | Count | Frequency (%)
37 | 1 | 2.7%
36 | 1 | 2.7%
35 | 1 | 2.7%
34 | 1 | 2.7%
33 | 1 | 2.7%
32 | 1 | 2.7%
31 | 1 | 2.7%
30 | 1 | 2.7%
29 | 1 | 2.7%
28 | 1 | 2.7%
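
Because 번호 is simply the sequence 1 to 37, its quantile and descriptive statistics can be recomputed by hand. The sketch below assumes the sample variance (n - 1 denominator) and linearly interpolated percentiles, which is consistent with the figures reported above. The `percentile` helper is illustrative and not part of the report.

```python
import math
import statistics

def percentile(sorted_values, q):
    """Linearly interpolated percentile (q in [0, 1]) of pre-sorted data."""
    pos = q * (len(sorted_values) - 1)
    lower = math.floor(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    return sorted_values[lower] + (pos - lower) * (sorted_values[upper] - sorted_values[lower])

values = list(range(1, 38))  # 번호 runs from 1 to 37 with no gaps

mean = statistics.mean(values)
median = statistics.median(values)
variance = statistics.variance(values)  # sample variance, n - 1 denominator
std_dev = math.sqrt(variance)
cv = std_dev / mean                     # coefficient of variation
mad = statistics.median(abs(v - median) for v in values)  # median absolute deviation

print("sum:", sum(values), "mean:", mean, "median:", median)
print("variance:", round(variance, 5), "std:", round(std_dev, 6), "cv:", round(cv, 8))
print("p5:", round(percentile(values, 0.05), 1), "p95:", round(percentile(values, 0.95), 1))
print("MAD:", mad)
```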

새일센터
Categorical

HIGH CORRELATION 

Distinct: 8
Distinct (%): 21.6%
Missing: 0
Missing (%): 0.0%
Memory size: 424.0 B
Top categories (bar chart): 전북새일 (11), 전주새일, 익산새일(산단형), 군산새일, 정읍새일, Other values (3)

Length

Max length: 9
Median length: 4
Mean length: 4.6756757
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 전북새일
2nd row: 전북새일
3rd row: 전북새일
4th row: 전북새일
5th row: 전북새일

Common Values

Value | Count | Frequency (%)
전북새일 | 11 | 29.7%
전주새일 | 6 | 16.2%
익산새일(산단형) | 5 | 13.5%
군산새일 | 3 | 8.1%
정읍새일 | 3 | 8.1%
남원새일 | 3 | 8.1%
김제새일 | 3 | 8.1%
완주새일 | 3 | 8.1%

Length

Histogram of lengths of the category
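
The Frequency (%) column and the mean length reported for 새일센터 both follow from the category counts. The sketch below recomputes them from the counts in the Common Values table. The variable names are illustrative.

```python
# Category counts copied from the Common Values table for 새일센터.
counts = {
    "전북새일": 11, "전주새일": 6, "익산새일(산단형)": 5, "군산새일": 3,
    "정읍새일": 3, "남원새일": 3, "김제새일": 3, "완주새일": 3,
}

total = sum(counts.values())

# Share of each category, in percent, rounded to one decimal as in the report.
shares = {name: round(100 * n / total, 1) for name, n in counts.items()}

# Mean label length, weighting each label by how often it occurs.
mean_length = sum(len(name) * n for name, n in counts.items()) / total

for name, share in shares.items():
    print(name, counts[name], f"{share}%")
print("mean length:", round(mean_length, 7))
```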


과정유형
Categorical

HIGH CORRELATION 

Distinct: 5
Distinct (%): 13.5%
Missing: 0
Missing (%): 0.0%
Memory size: 424.0 B
Top categories (bar chart): 일반 (18), 전문 (11), 취약계층, 창업, 취약/계층 (1)

Length

Max length: 5
Median length: 2
Mean length: 2.3513514
Min length: 2

Unique

Unique: 1
Unique (%): 2.7%

Sample

1st row: 전문
2nd row: 전문
3rd row: 전문
4th row: 전문
5th row: 전문

Common Values

Value | Count | Frequency (%)
일반 | 18 | 48.6%
전문 | 11 | 29.7%
취약계층 | 5 | 13.5%
창업 | 2 | 5.4%
취약/계층 | 1 | 2.7%

Length

Histogram of lengths of the category
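
The Common Values for 과정유형 list both 취약계층 (5) and 취약/계층 (1), and the second label accounts for the single unique value. If the two labels denote the same category, they could be merged before further analysis. The report does not confirm this, so it is treated here as an assumption. The sketch below merges them through an explicit alias map, so legitimate slashes in other columns (for example 결혼/이민 in 세부유형) are not affected. `ALIASES` is a hypothetical name.

```python
from collections import Counter

# Counts copied from the Common Values table for 과정유형.
counts = Counter({"일반": 18, "전문": 11, "취약계층": 5, "창업": 2, "취약/계층": 1})

# Assumption: "취약/계층" is a spelling variant of "취약계층", not a separate category.
ALIASES = {"취약/계층": "취약계층"}

merged = Counter()
for label, n in counts.items():
    merged[ALIASES.get(label, label)] += n

for label, n in merged.most_common():
    print(label, n)
```

Under that assumption the column would have four distinct values instead of five.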


세부유형
Categorical

HIGH CORRELATION 

Distinct: 7
Distinct (%): 18.9%
Missing: 0
Missing (%): 0.0%
Memory size: 424.0 B
Top categories (bar chart): 일반 (15), 기업, 결혼/이민, 역량, 기술, Other values (2)

Length

Max length: 5
Median length: 2
Mean length: 2.4054054
Min length: 2

Unique

Unique: 1
Unique (%): 2.7%

Sample

1st row: 기업
2nd row: 기업
3rd row: 기업
4th row: 기업
5th row: 기업

Common Values

Value | Count | Frequency (%)
일반 | 15 | 40.5%
기업 | 9 | 24.3%
결혼/이민 | 5 | 13.5%
역량 | 3 | 8.1%
기술 | 2 | 5.4%
창업 | 2 | 5.4%
장애 | 1 | 2.7%

Length

Histogram of lengths of the category


교육훈련 과정명
Text

UNIQUE

Distinct: 37
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 424.0 B

Length

Max length: 20
Median length: 15
Mean length: 10.783784
Min length: 5

Characters and Unicode

Total characters: 399
Distinct characters: 129
Distinct categories: 5
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 37
Unique (%): 100.0%

Sample

1st row: LED반도체산업분야 제조인력 양성과정
2nd row: 법무사무원 양성교육
3rd row: 성장동력산업 멀티품질관리원 양성
4th row: 자동차부품소개 제조인력 양성교육
5th row: 전기전자부품 품질검사원 양성교육

Words

Value | Count | Frequency (%)
양성교육 | 5 | 8.5%
양성과정 | 3 | 5.1%
제조인력 | 3 | 5.1%
새일역량교육 | 3 | 5.1%
단체급식조리사 | 2 | 3.4%
양성 | 2 | 3.4%
품질검사원 | 2 | 3.4%
한식푸드컨설턴트 | 1 | 1.7%
복합섬유봉제교육 | 1 | 1.7%
스마트전산마스터과정 | 1 | 1.7%
Other values (36) | 36 | 61.0%

Most occurring characters

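Value | Count | Frequency (%)
(space) | 22 | 5.5%
(not preserved) | 17 | 4.3%
(not preserved) | 15 | 3.8%
(not preserved) | 15 | 3.8%
(not preserved) | 15 | 3.8%
(not preserved) | 12 | 3.0%
(not preserved) | 11 | 2.8%
(not preserved) | 9 | 2.3%
(not preserved) | 9 | 2.3%
(not preserved) | 8 | 2.0%
Other values (119) | 266 | 66.7%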

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 363 | 91.0%
Space Separator | 22 | 5.5%
Uppercase Letter | 9 | 2.3%
Decimal Number | 4 | 1.0%
Other Punctuation | 1 | 0.3%

Most frequent character per category

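Other Letter
Value | Count | Frequency (%)
(not preserved) | 17 | 4.7%
(not preserved) | 15 | 4.1%
(not preserved) | 15 | 4.1%
(not preserved) | 15 | 4.1%
(not preserved) | 12 | 3.3%
(not preserved) | 11 | 3.0%
(not preserved) | 9 | 2.5%
(not preserved) | 9 | 2.5%
(not preserved) | 8 | 2.2%
(not preserved) | 8 | 2.2%
Other values (108) | 244 | 67.2%

Uppercase Letter
Value | Count | Frequency (%)
D | 3 | 33.3%
I | 2 | 22.2%
T | 1 | 11.1%
Y | 1 | 11.1%
L | 1 | 11.1%
E | 1 | 11.1%

Decimal Number
Value | Count | Frequency (%)
3 | 2 | 50.0%
1 | 1 | 25.0%
2 | 1 | 25.0%

Space Separator
Value | Count | Frequency (%)
(space) | 22 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
& | 1 | 100.0%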

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 363 | 91.0%
Common | 27 | 6.8%
Latin | 9 | 2.3%

Most frequent character per script

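Hangul
Value | Count | Frequency (%)
(not preserved) | 17 | 4.7%
(not preserved) | 15 | 4.1%
(not preserved) | 15 | 4.1%
(not preserved) | 15 | 4.1%
(not preserved) | 12 | 3.3%
(not preserved) | 11 | 3.0%
(not preserved) | 9 | 2.5%
(not preserved) | 9 | 2.5%
(not preserved) | 8 | 2.2%
(not preserved) | 8 | 2.2%
Other values (108) | 244 | 67.2%

Latin
Value | Count | Frequency (%)
D | 3 | 33.3%
I | 2 | 22.2%
T | 1 | 11.1%
Y | 1 | 11.1%
L | 1 | 11.1%
E | 1 | 11.1%

Common
Value | Count | Frequency (%)
(space) | 22 | 81.5%
3 | 2 | 7.4%
& | 1 | 3.7%
1 | 1 | 3.7%
2 | 1 | 3.7%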

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 363 | 91.0%
ASCII | 36 | 9.0%

Most frequent character per block

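ASCII
Value | Count | Frequency (%)
(space) | 22 | 61.1%
D | 3 | 8.3%
I | 2 | 5.6%
3 | 2 | 5.6%
& | 1 | 2.8%
T | 1 | 2.8%
Y | 1 | 2.8%
L | 1 | 2.8%
E | 1 | 2.8%
1 | 1 | 2.8%

Hangul
Value | Count | Frequency (%)
(not preserved) | 17 | 4.7%
(not preserved) | 15 | 4.1%
(not preserved) | 15 | 4.1%
(not preserved) | 15 | 4.1%
(not preserved) | 12 | 3.3%
(not preserved) | 11 | 3.0%
(not preserved) | 9 | 2.5%
(not preserved) | 9 | 2.5%
(not preserved) | 8 | 2.2%
(not preserved) | 8 | 2.2%
Other values (108) | 244 | 67.2%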

인원
Categorical

HIGH CORRELATION 

Distinct: 4
Distinct (%): 10.8%
Missing: 0
Missing (%): 0.0%
Memory size: 424.0 B
Top categories (bar chart): 20 (25), 22, 15, 24

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 20
2nd row: 20
3rd row: 20
4th row: 20
5th row: 20

Common Values

Value | Count | Frequency (%)
20 | 25 | 67.6%
22 | 5 | 13.5%
15 | 4 | 10.8%
24 | 3 | 8.1%

Length

Histogram of lengths of the category


교육시작일
DateTime

Distinct: 22
Distinct (%): 59.5%
Missing: 0
Missing (%): 0.0%
Memory size: 424.0 B
Minimum: 2016-03-02 00:00:00
Maximum: 2016-09-21 00:00:00
Histogram with fixed size bins (bins=22)

교육종료일
DateTime

Distinct: 33
Distinct (%): 89.2%
Missing: 0
Missing (%): 0.0%
Memory size: 424.0 B
Minimum: 2016-04-12 00:00:00
Maximum: 2016-10-20 00:00:00
Histogram with fixed size bins (bins=33)

교육 시간
Real number (ℝ)

HIGH CORRELATION 

Distinct: 20
Distinct (%): 54.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 152.21622
Minimum: 20
Maximum: 260
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 461.0 B

Quantile statistics

Minimum: 20
5th percentile: 20
Q1: 120
Median: 164
Q3: 200
95th percentile: 223.2
Maximum: 260
Range: 240
Interquartile range (IQR): 80

Descriptive statistics

Standard deviation: 58.210411
Coefficient of variation (CV): 0.38241925
Kurtosis: 0.28790636
Mean: 152.21622
Median Absolute Deviation (MAD): 44
Skewness: -0.68630838
Sum: 5632
Variance: 3388.452
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=20)

Common values
Value | Count | Frequency (%)
120 | 9 | 24.3%
200 | 4 | 10.8%
184 | 3 | 8.1%
160 | 3 | 8.1%
20 | 3 | 8.1%
166 | 1 | 2.7%
164 | 1 | 2.7%
100 | 1 | 2.7%
180 | 1 | 2.7%
240 | 1 | 2.7%
Other values (10) | 10 | 27.0%

Minimum 10 values
Value | Count | Frequency (%)
20 | 3 | 8.1%
80 | 1 | 2.7%
92 | 1 | 2.7%
100 | 1 | 2.7%
120 | 9 | 24.3%
160 | 3 | 8.1%
164 | 1 | 2.7%
166 | 1 | 2.7%
174 | 1 | 2.7%
176 | 1 | 2.7%

Maximum 10 values
Value | Count | Frequency (%)
260 | 1 | 2.7%
240 | 1 | 2.7%
219 | 1 | 2.7%
214 | 1 | 2.7%
205 | 1 | 2.7%
202 | 1 | 2.7%
200 | 4 | 10.8%
188 | 1 | 2.7%
184 | 3 | 8.1%
180 | 1 | 2.7%
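
The tables of the ten smallest and ten largest 교육 시간 values together cover all 20 distinct values, so the full column of 37 observations can be rebuilt from them and the summary statistics checked. This is a minimal sketch with illustrative variable names, and it assumes the sample variance (n - 1 denominator).

```python
import statistics

# (value, count) pairs copied from the ten-smallest and ten-largest tables above.
lowest = [(20, 3), (80, 1), (92, 1), (100, 1), (120, 9),
          (160, 3), (164, 1), (166, 1), (174, 1), (176, 1)]
highest = [(260, 1), (240, 1), (219, 1), (214, 1), (205, 1),
           (202, 1), (200, 4), (188, 1), (184, 3), (180, 1)]

# Expand the frequency table back into one entry per observation.
hours = sorted(value for value, count in lowest + highest for _ in range(count))

total = sum(hours)
mean = total / len(hours)
median = statistics.median(hours)
variance = statistics.variance(hours)  # sample variance, n - 1 denominator
mad = statistics.median(abs(h - median) for h in hours)

print("n:", len(hours), "sum:", total, "mean:", round(mean, 5))
print("median:", median, "variance:", round(variance, 3), "MAD:", mad)
```

The rebuilt column reproduces the reported sum, mean, median, variance, and MAD, which suggests the two tables do not overlap and nothing is missing between them.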

Correlations

Variable | 번호 | 새일센터 | 과정유형 | 세부유형 | 교육훈련 과정명 | 인원 | 교육시작일 | 교육종료일 | 교육 시간
번호 | 1.000 | 0.876 | 0.665 | 0.740 | 1.000 | 0.571 | 0.490 | 0.896 | 0.664
새일센터 | 0.876 | 1.000 | 0.000 | 0.354 | 1.000 | 0.940 | 0.000 | 0.844 | 0.629
과정유형 | 0.665 | 0.000 | 1.000 | 0.899 | 1.000 | 0.000 | 0.909 | 0.000 | 0.554
세부유형 | 0.740 | 0.354 | 0.899 | 1.000 | 1.000 | 0.000 | 0.769 | 0.939 | 0.738
교육훈련 과정명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
인원 | 0.571 | 0.940 | 0.000 | 0.000 | 1.000 | 1.000 | 0.000 | 0.826 | 0.443
교육시작일 | 0.490 | 0.000 | 0.909 | 0.769 | 1.000 | 0.000 | 1.000 | 0.955 | 0.594
교육종료일 | 0.896 | 0.844 | 0.000 | 0.939 | 1.000 | 0.826 | 0.955 | 1.000 | 0.888
교육 시간 | 0.664 | 0.629 | 0.554 | 0.738 | 1.000 | 0.443 | 0.594 | 0.888 | 1.000

Variable | 새일센터 | 과정유형 | 세부유형 | 인원
새일센터 | 1.000 | 0.000 | 0.179 | 0.636
과정유형 | 0.000 | 1.000 | 0.817 | 0.000
세부유형 | 0.179 | 0.817 | 1.000 | 0.000
인원 | 0.636 | 0.000 | 0.000 | 1.000

Variable | 번호 | 교육 시간 | 새일센터 | 과정유형 | 세부유형 | 인원
번호 | 1.000 | 0.283 | 0.650 | 0.335 | 0.486 | 0.360
교육 시간 | 0.283 | 1.000 | 0.385 | 0.388 | 0.518 | 0.350
새일센터 | 0.650 | 0.385 | 1.000 | 0.000 | 0.179 | 0.636
과정유형 | 0.335 | 0.388 | 0.000 | 1.000 | 0.817 | 0.000
세부유형 | 0.486 | 0.518 | 0.179 | 0.817 | 1.000 | 0.000
인원 | 0.360 | 0.350 | 0.636 | 0.000 | 0.000 | 1.000
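
The High correlation entries under Alerts are consistent with flagging every pair in the six-variable matrix above whose coefficient is at least 0.5. The report does not state the threshold, so the 0.5 cut-off in the sketch below is an assumption, and the variable names are illustrative.

```python
from itertools import combinations

# Six-variable correlation matrix copied from the table above.
names = ["번호", "교육 시간", "새일센터", "과정유형", "세부유형", "인원"]
matrix = [
    [1.000, 0.283, 0.650, 0.335, 0.486, 0.360],
    [0.283, 1.000, 0.385, 0.388, 0.518, 0.350],
    [0.650, 0.385, 1.000, 0.000, 0.179, 0.636],
    [0.335, 0.388, 0.000, 1.000, 0.817, 0.000],
    [0.486, 0.518, 0.179, 0.817, 1.000, 0.000],
    [0.360, 0.350, 0.636, 0.000, 0.000, 1.000],
]

THRESHOLD = 0.5  # assumed cut-off; not stated in the report

# Visit each unordered pair once and keep those at or above the threshold.
high_pairs = [
    (names[i], names[j], matrix[i][j])
    for i, j in combinations(range(len(names)), 2)
    if matrix[i][j] >= THRESHOLD
]

for first, second, coefficient in high_pairs:
    print(f"{first} <-> {second}: {coefficient:.3f}")
```

With this cut-off, four pairs are flagged, and each variable's partners match the alerts: 새일센터 pairs with 번호 and 인원, while 세부유형 pairs with 교육 시간 and 과정유형.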

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

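First rows

Index | 번호 | 새일센터 | 과정유형 | 세부유형 | 교육훈련 과정명 | 인원 | 교육시작일 | 교육종료일 | 교육 시간
0 | 1 | 전북새일 | 전문 | 기업 | LED반도체산업분야 제조인력 양성과정 | 20 | 2016-04-04 | 2016-05-10 | 120
1 | 2 | 전북새일 | 전문 | 기업 | 법무사무원 양성교육 | 20 | 2016-03-28 | 2016-06-29 | 260
2 | 3 | 전북새일 | 전문 | 기업 | 성장동력산업 멀티품질관리원 양성 | 20 | 2016-04-11 | 2016-05-17 | 120
3 | 4 | 전북새일 | 전문 | 기업 | 자동차부품소개 제조인력 양성교육 | 20 | 2016-09-01 | 2016-10-11 | 120
4 | 5 | 전북새일 | 전문 | 기업 | 전기전자부품 품질검사원 양성교육 | 20 | 2016-03-28 | 2016-05-02 | 120
5 | 6 | 전북새일 | 전문 | 기업 | 탄소산업분야 제조인력 양성과정 | 20 | 2016-09-05 | 2016-10-13 | 120
6 | 7 | 전북새일 | 취약계층 | 결혼/이민 | 객실관리사 양성교육 | 20 | 2016-06-07 | 2016-06-30 | 80
7 | 8 | 전북새일 | 일반 | 일반 | 단체급식조리사 양성교육 | 20 | 2016-05-23 | 2016-07-04 | 120
8 | 9 | 전북새일 | 일반 | 역량 | 새일역량교육 1 | 20 | 2016-04-06 | 2016-04-12 | 20
9 | 10 | 전북새일 | 일반 | 역량 | 새일역량교육 2 | 20 | 2016-05-09 | 2016-05-13 | 20

Last rows

Index | 번호 | 새일센터 | 과정유형 | 세부유형 | 교육훈련 과정명 | 인원 | 교육시작일 | 교육종료일 | 교육 시간
27 | 28 | 정읍새일 | 일반 | 일반 | 한식푸드컨설턴트 | 24 | 2016-04-04 | 2016-07-14 | 205
28 | 29 | 남원새일 | 창업 | 창업 | DIY나무코디네이터양성과정 | 20 | 2016-04-18 | 2016-09-02 | 174
29 | 30 | 남원새일 | 일반 | 일반 | 방과후아동지도사양성과정 | 20 | 2016-04-04 | 2016-09-05 | 219
30 | 31 | 남원새일 | 일반 | 일반 | 사무행정실무과정 | 20 | 2016-04-04 | 2016-07-26 | 240
31 | 32 | 김제새일 | 취약계층 | 결혼/이민 | 네일아트국가자격증 | 22 | 2016-03-02 | 2016-05-24 | 184
32 | 33 | 김제새일 | 일반 | 일반 | 로봇과학방과후지도사 | 24 | 2016-04-18 | 2016-06-20 | 180
33 | 34 | 김제새일 | 일반 | 일반 | 커리어IT실무자 | 20 | 2016-05-09 | 2016-07-08 | 184
34 | 35 | 완주새일 | 일반 | 일반 | 생산제조품질관리원 | 15 | 2016-05-18 | 2016-06-16 | 120
35 | 36 | 완주새일 | 일반 | 일반 | 자동차부품제조양성과정 | 15 | 2016-09-21 | 2016-10-20 | 120
36 | 37 | 완주새일 | 창업 | 창업 | 폐백이야기 | 15 | 2016-04-21 | 2016-05-27 | 100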