Overview

Dataset statistics

Number of variables5
Number of observations52
Missing cells0
Missing cells (%)0.0%
Duplicate rows9
Duplicate rows (%)17.3%
Total size in memory2.2 KiB
Average record size in memory43.5 B

Variable types

Categorical5

Dataset

Description년도: 기간제근로자가 채용된 연 근무시작월: 기간제근로자가 근무시작월 성별: 기간제근로자의 성별 * 기간제 근로자 지원자의 연령, 성별, 채용월에 대한 정보 제공
URLhttps://www.data.go.kr/data/15077171/fileData.do

Alerts

Dataset has 9 (17.3%) duplicate rowsDuplicates
년도 is highly overall correlated with 분야High correlation
근무시작월 is highly overall correlated with 분야High correlation
분야 is highly overall correlated with 년도 and 1 other fieldsHigh correlation

Reproduction

Analysis started2023-12-12 11:48:10.041873
Analysis finished2023-12-12 11:48:10.492409
Duration0.45 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

년도
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)5.8%
Missing0
Missing (%)0.0%
Memory size548.0 B
2023
20 
2022
17 
2021
15 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021
2nd row2021
3rd row2021
4th row2021
5th row2021

Common Values

ValueCountFrequency (%)
2023 20
38.5%
2022 17
32.7%
2021 15
28.8%

Length

2023-12-12T20:48:10.571840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:48:10.783835image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023 20
38.5%
2022 17
32.7%
2021 15
28.8%

근무시작월
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)13.5%
Missing0
Missing (%)0.0%
Memory size548.0 B
02월
19 
03월
11 
02월03월
06월
07월
Other values (2)

Length

Max length6
Median length3
Mean length3.4615385
Min length3

Unique

Unique2 ?
Unique (%)3.8%

Sample

1st row02월
2nd row02월
3rd row02월
4th row02월
5th row02월

Common Values

ValueCountFrequency (%)
02월 19
36.5%
03월 11
21.2%
02월03월 8
15.4%
06월 7
 
13.5%
07월 5
 
9.6%
01월 1
 
1.9%
04월 1
 
1.9%

Length

2023-12-12T20:48:11.162292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:48:11.271600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
02월 19
36.5%
03월 11
21.2%
02월03월 8
15.4%
06월 7
 
13.5%
07월 5
 
9.6%
01월 1
 
1.9%
04월 1
 
1.9%

성별
Categorical

Distinct2
Distinct (%)3.8%
Missing0
Missing (%)0.0%
Memory size548.0 B
40 
12 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
40
76.9%
12
 
23.1%

Length

2023-12-12T20:48:11.415407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:48:11.514184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
40
76.9%
12
 
23.1%

나이
Categorical

Distinct5
Distinct (%)9.6%
Missing0
Missing (%)0.0%
Memory size548.0 B
20대
29 
30대
10 
40대
50대
10대
 
1

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique1 ?
Unique (%)1.9%

Sample

1st row20대
2nd row40대
3rd row30대
4th row20대
5th row20대

Common Values

ValueCountFrequency (%)
20대 29
55.8%
30대 10
 
19.2%
40대 9
 
17.3%
50대 3
 
5.8%
10대 1
 
1.9%

Length

2023-12-12T20:48:11.670230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:48:11.819993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
20대 29
55.8%
30대 10
 
19.2%
40대 9
 
17.3%
50대 3
 
5.8%
10대 1
 
1.9%

분야
Categorical

HIGH CORRELATION 

Distinct8
Distinct (%)15.4%
Missing0
Missing (%)0.0%
Memory size548.0 B
식중독균
10 
식품미생물
질병원인체
IGRA(잠복결핵)
식품분석
Other values (3)

Length

Max length10
Median length5
Mean length5.2884615
Min length2

Unique

Unique1 ?
Unique (%)1.9%

Sample

1st row식품미생물
2nd row식품미생물
3rd row식품미생물
4th row식품미생물
5th row식품미생물

Common Values

ValueCountFrequency (%)
식중독균 10
19.2%
식품미생물 9
17.3%
질병원인체 9
17.3%
IGRA(잠복결핵) 9
17.3%
식품분석 9
17.3%
조리 3
 
5.8%
위해미생물 2
 
3.8%
먹는물 1
 
1.9%

Length

2023-12-12T20:48:11.987707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:48:12.162727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
식중독균 10
19.2%
식품미생물 9
17.3%
질병원인체 9
17.3%
igra(잠복결핵 9
17.3%
식품분석 9
17.3%
조리 3
 
5.8%
위해미생물 2
 
3.8%
먹는물 1
 
1.9%

Correlations

2023-12-12T20:48:12.275200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
년도근무시작월성별나이분야
년도1.0000.5680.0000.0000.765
근무시작월0.5681.0000.0000.2990.816
성별0.0000.0001.0000.1370.000
나이0.0000.2990.1371.0000.392
분야0.7650.8160.0000.3921.000
2023-12-12T20:48:12.393027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
근무시작월년도성별분야나이
근무시작월1.0000.4360.0000.6070.186
년도0.4361.0000.0000.6370.000
성별0.0000.0001.0000.0000.158
분야0.6070.6370.0001.0000.239
나이0.1860.0000.1580.2391.000
2023-12-12T20:48:12.510984image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
년도근무시작월성별나이분야
년도1.0000.4360.0000.0000.637
근무시작월0.4361.0000.0000.1860.607
성별0.0000.0001.0000.1580.000
나이0.0000.1860.1581.0000.239
분야0.6370.6070.0000.2391.000

Missing values

2023-12-12T20:48:10.326216image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T20:48:10.456296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

년도근무시작월성별나이분야
0202102월20대식품미생물
1202102월40대식품미생물
2202102월30대식품미생물
3202102월20대식품미생물
4202102월20대식품미생물
5202102월20대식품미생물
6202102월03월40대질병원인체
7202102월03월30대질병원인체
8202102월03월30대질병원인체
9202102월03월20대질병원인체
년도근무시작월성별나이분야
42202303월30대IGRA(잠복결핵)
43202303월20대IGRA(잠복결핵)
44202303월20대IGRA(잠복결핵)
45202304월30대먹는물
46202306월40대식중독균
47202306월20대식중독균
48202306월20대식중독균
49202306월50대식중독균
50202306월20대식품미생물
51202306월40대식품미생물

Duplicate rows

Most frequently occurring

년도근무시작월성별나이분야# duplicates
0202102월20대식품미생물3
1202103월20대질병원인체3
6202302월20대식품분석3
8202303월20대IGRA(잠복결핵)3
2202202월30대식중독균2
3202202월03월20대IGRA(잠복결핵)2
4202207월50대조리2
5202302월20대식품분석2
7202302월40대식품분석2