Overview

Dataset statistics

Number of variables5
Number of observations50
Missing cells0
Missing cells (%)0.0%
Duplicate rows2
Duplicate rows (%)4.0%
Total size in memory2.1 KiB
Average record size in memory42.6 B

Variable types

Text2
DateTime2
Categorical1

Dataset

Description한국기계연구원의 연구관리 분야에서 사업/과제계획서참여연구원파견자를 관리하는 테이블 정보(파견자, 파견시작일자, 파견종료일자 등을 관리)
URLhttps://www.data.go.kr/data/15078068/fileData.do

Alerts

작성일 has constant value ""Constant
Dataset has 2 (4.0%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 09:40:54.310678
Analysis finished2023-12-12 09:40:54.861076
Duration0.55 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct30
Distinct (%)60.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
2023-12-12T18:40:54.996603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters150
Distinct characters31
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique15 ?
Unique (%)30.0%

Sample

1st row*선*
2nd row*대*
3rd row*관*
4th row*필*
5th row*진*
ValueCountFrequency (%)
6
 
12.0%
3
 
6.0%
2
 
4.0%
2
 
4.0%
2
 
4.0%
2
 
4.0%
2
 
4.0%
2
 
4.0%
2
 
4.0%
2
 
4.0%
Other values (20) 25
50.0%
2023-12-12T18:40:55.351834image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 100
66.7%
6
 
4.0%
3
 
2.0%
2
 
1.3%
2
 
1.3%
2
 
1.3%
2
 
1.3%
2
 
1.3%
2
 
1.3%
2
 
1.3%
Other values (21) 27
 
18.0%

Most occurring categories

ValueCountFrequency (%)
Other Punctuation 100
66.7%
Other Letter 50
33.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6
 
12.0%
3
 
6.0%
2
 
4.0%
2
 
4.0%
2
 
4.0%
2
 
4.0%
2
 
4.0%
2
 
4.0%
2
 
4.0%
2
 
4.0%
Other values (20) 25
50.0%
Other Punctuation
ValueCountFrequency (%)
* 100
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 100
66.7%
Hangul 50
33.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6
 
12.0%
3
 
6.0%
2
 
4.0%
2
 
4.0%
2
 
4.0%
2
 
4.0%
2
 
4.0%
2
 
4.0%
2
 
4.0%
2
 
4.0%
Other values (20) 25
50.0%
Common
ValueCountFrequency (%)
* 100
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 100
66.7%
Hangul 50
33.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 100
100.0%
Hangul
ValueCountFrequency (%)
6
 
12.0%
3
 
6.0%
2
 
4.0%
2
 
4.0%
2
 
4.0%
2
 
4.0%
2
 
4.0%
2
 
4.0%
2
 
4.0%
2
 
4.0%
Other values (20) 25
50.0%
Distinct41
Distinct (%)82.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
Minimum2016-12-26 00:00:00
Maximum2022-01-04 00:00:00
2023-12-12T18:40:55.525269image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:40:55.696665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=41)
Distinct38
Distinct (%)76.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
Minimum2017-06-25 00:00:00
Maximum2024-01-03 00:00:00
2023-12-12T18:40:55.874885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:40:56.045993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=38)

비고
Text

Distinct34
Distinct (%)68.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
2023-12-12T18:40:56.328877image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length46
Median length32
Mean length21.52
Min length4

Characters and Unicode

Total characters1076
Distinct characters137
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique27 ?
Unique (%)54.0%

Sample

1st row중기 기술연수
2nd row화학(연) CCP 융합연구단 파견
3rd row화학(연) CCP 융합연구단 파견
4th row한국에너지기술연구원FEP융합연구단 파견
5th row국제공동연구
ValueCountFrequency (%)
파견 8
 
5.7%
중기연수 8
 
5.7%
한국전자통신연구원 7
 
5.0%
한국산업기술진흥원 7
 
5.0%
소재부품장비 7
 
5.0%
융합혁신지원단 7
 
5.0%
기업지원데스크 7
 
5.0%
dmc융합연구단 7
 
5.0%
파견(미국 7
 
5.0%
university 3
 
2.1%
Other values (58) 73
51.8%
2023-12-12T18:40:56.792260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
91
 
8.5%
55
 
5.1%
54
 
5.0%
36
 
3.3%
34
 
3.2%
32
 
3.0%
28
 
2.6%
27
 
2.5%
) 23
 
2.1%
( 23
 
2.1%
Other values (127) 673
62.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 692
64.3%
Lowercase Letter 143
 
13.3%
Uppercase Letter 97
 
9.0%
Space Separator 91
 
8.5%
Close Punctuation 23
 
2.1%
Open Punctuation 23
 
2.1%
Other Punctuation 7
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
55
 
7.9%
54
 
7.8%
36
 
5.2%
34
 
4.9%
32
 
4.6%
28
 
4.0%
27
 
3.9%
21
 
3.0%
20
 
2.9%
20
 
2.9%
Other values (83) 365
52.7%
Lowercase Letter
ValueCountFrequency (%)
i 19
13.3%
n 14
9.8%
o 14
9.8%
t 14
9.8%
a 11
7.7%
e 11
7.7%
s 10
 
7.0%
r 10
 
7.0%
v 7
 
4.9%
y 6
 
4.2%
Other values (10) 27
18.9%
Uppercase Letter
ValueCountFrequency (%)
C 18
18.6%
M 12
12.4%
D 10
10.3%
P 9
9.3%
U 8
8.2%
A 6
 
6.2%
L 4
 
4.1%
S 4
 
4.1%
R 4
 
4.1%
N 4
 
4.1%
Other values (9) 18
18.6%
Other Punctuation
ValueCountFrequency (%)
& 4
57.1%
, 3
42.9%
Space Separator
ValueCountFrequency (%)
91
100.0%
Close Punctuation
ValueCountFrequency (%)
) 23
100.0%
Open Punctuation
ValueCountFrequency (%)
( 23
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 692
64.3%
Latin 240
 
22.3%
Common 144
 
13.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
55
 
7.9%
54
 
7.8%
36
 
5.2%
34
 
4.9%
32
 
4.6%
28
 
4.0%
27
 
3.9%
21
 
3.0%
20
 
2.9%
20
 
2.9%
Other values (83) 365
52.7%
Latin
ValueCountFrequency (%)
i 19
 
7.9%
C 18
 
7.5%
n 14
 
5.8%
o 14
 
5.8%
t 14
 
5.8%
M 12
 
5.0%
a 11
 
4.6%
e 11
 
4.6%
s 10
 
4.2%
D 10
 
4.2%
Other values (29) 107
44.6%
Common
ValueCountFrequency (%)
91
63.2%
) 23
 
16.0%
( 23
 
16.0%
& 4
 
2.8%
, 3
 
2.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 691
64.2%
ASCII 384
35.7%
Compat Jamo 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
91
23.7%
) 23
 
6.0%
( 23
 
6.0%
i 19
 
4.9%
C 18
 
4.7%
n 14
 
3.6%
o 14
 
3.6%
t 14
 
3.6%
M 12
 
3.1%
a 11
 
2.9%
Other values (34) 145
37.8%
Hangul
ValueCountFrequency (%)
55
 
8.0%
54
 
7.8%
36
 
5.2%
34
 
4.9%
32
 
4.6%
28
 
4.1%
27
 
3.9%
21
 
3.0%
20
 
2.9%
20
 
2.9%
Other values (82) 364
52.7%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

작성일
Categorical

CONSTANT 

Distinct1
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
2023-07-28
50 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-07-28
2nd row2023-07-28
3rd row2023-07-28
4th row2023-07-28
5th row2023-07-28

Common Values

ValueCountFrequency (%)
2023-07-28 50
100.0%

Length

2023-12-12T18:40:56.955095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:40:57.412760image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-07-28 50
100.0%

Correlations

2023-12-12T18:40:57.510690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
파견자파견시작일자파견종료일자비고
파견자1.0000.9210.8520.809
파견시작일자0.9211.0000.9980.998
파견종료일자0.8520.9981.0000.994
비고0.8090.9980.9941.000

Missing values

2023-12-12T18:40:54.672410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:40:54.803750image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

파견자파견시작일자파견종료일자비고작성일
0*선*2016-12-262017-06-25중기 기술연수2023-07-28
1*대*2017-01-012017-12-31화학(연) CCP 융합연구단 파견2023-07-28
2*관*2017-01-012017-12-31화학(연) CCP 융합연구단 파견2023-07-28
3*필*2017-01-012018-12-31한국에너지기술연구원FEP융합연구단 파견2023-07-28
4*진*2017-01-222017-07-21국제공동연구2023-07-28
5*재*2017-06-192018-06-17국가과학기술연구회2023-07-28
6*봉*2017-05-152017-08-11해외학위파견2023-07-28
7*희*2017-07-012017-12-31연구연가 파견(미국 마이애미대학교)2023-07-28
8*택*2017-08-282018-02-27중기연수 파견(미국 미네소타대학교)2023-07-28
9*병*2017-12-042018-06-03중기연수 파견(미국 퍼듀대학교)2023-07-28
파견자파견시작일자파견종료일자비고작성일
40*재*2020-02-102022-11-30한국전자통신연구원 DMC융합연구단2023-07-28
41*승*2020-02-102022-11-30한국전자통신연구원 DMC융합연구단2023-07-28
42*아*2020-02-102022-11-30한국전자통신연구원 DMC융합연구단2023-07-28
43*승*2020-02-102022-11-30한국전자통신연구원 DMC융합연구단2023-07-28
44*성*2020-02-102022-11-30한국전자통신연구원 DMC융합연구단2023-07-28
45*신*2020-08-182020-11-13한국산업기술진흥원 소재부품장비 융합혁신지원단 기업지원데스크2023-07-28
46*동*2020-11-162021-02-10한국산업기술진흥원 소재부품장비 융합혁신지원단 기업지원데스크2023-07-28
47*태*2021-02-152021-05-14한국산업기술진흥원 소재부품장비 융합혁신지원단 기업지원데스크2023-07-28
48*상*2021-04-012021-06-30육아휴직2023-07-28
49*유*2021-05-172022-01-03한국산업기술진흥원 소재부품장비 융합혁신지원단 기업지원데스크2023-07-28

Duplicate rows

Most frequently occurring

파견자파견시작일자파견종료일자비고작성일# duplicates
0*승*2020-02-102022-11-30한국전자통신연구원 DMC융합연구단2023-07-282
1*태*2021-02-152021-05-14한국산업기술진흥원 소재부품장비 융합혁신지원단 기업지원데스크2023-07-282