Overview

Dataset statistics

Number of variables5
Number of observations2469
Missing cells0
Missing cells (%)0.0%
Duplicate rows4
Duplicate rows (%)0.2%
Total size in memory96.6 KiB
Average record size in memory40.1 B

Variable types

DateTime1
Text2
Categorical2

Dataset

Description한국기계연구원에서 수행한 연구과제의 성과 등에 관련한 언론보도 목록 정보(보도일자,제목, 매체명, 매체구분 및 분류항목 등)
URLhttps://www.data.go.kr/data/15048732/fileData.do

Alerts

Dataset has 4 (0.2%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 22:23:01.209181
Analysis finished2023-12-12 22:23:02.138581
Duration0.93 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct429
Distinct (%)17.4%
Missing0
Missing (%)0.0%
Memory size19.4 KiB
Minimum2020-01-01 00:00:00
Maximum2023-07-27 00:00:00
2023-12-13T07:23:02.194690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:23:02.316085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

제목
Text

Distinct2162
Distinct (%)87.6%
Missing0
Missing (%)0.0%
Memory size19.4 KiB
2023-12-13T07:23:02.587487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length101
Median length92
Mean length33.874848
Min length9

Characters and Unicode

Total characters83637
Distinct characters870
Distinct categories15 ?
Distinct scripts4 ?
Distinct blocks11 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1967 ?
Unique (%)79.7%

Sample

1st row과학계 신년 희망 "경자년, 민첩하고 부지런한 해로"
2nd row[신년 인터뷰] 김창기 한국기계연구원 연구위원
3rd row김현석·홍순국·김형국 사장 '공학계 명예의 전당' 올랐다
4th row이들의 1년 계획 과학 100년 밝힌다
5th row스스로 빛내고 알아서 선명하게…디스플레이가 살아 숨쉰다
ValueCountFrequency (%)
개발 681
 
3.9%
기계연 582
 
3.3%
로봇 251
 
1.4%
기술 235
 
1.3%
성공 125
 
0.7%
국산화 106
 
0.6%
기계연구원 102
 
0.6%
미세먼지 101
 
0.6%
포토뉴스 87
 
0.5%
한국기계연구원 79
 
0.4%
Other values (4471) 15249
86.7%
2023-12-13T07:23:02.967451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
17268
 
20.6%
2597
 
3.1%
e 1523
 
1.8%
1369
 
1.6%
1336
 
1.6%
' 1330
 
1.6%
, 1289
 
1.5%
1037
 
1.2%
996
 
1.2%
937
 
1.1%
Other values (860) 53955
64.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 46471
55.6%
Space Separator 17268
 
20.6%
Lowercase Letter 11028
 
13.2%
Other Punctuation 4370
 
5.2%
Uppercase Letter 2374
 
2.8%
Decimal Number 1294
 
1.5%
Dash Punctuation 239
 
0.3%
Open Punctuation 211
 
0.3%
Close Punctuation 210
 
0.3%
Math Symbol 39
 
< 0.1%
Other values (5) 133
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2597
 
5.6%
1369
 
2.9%
1336
 
2.9%
1037
 
2.2%
996
 
2.1%
937
 
2.0%
786
 
1.7%
669
 
1.4%
565
 
1.2%
562
 
1.2%
Other values (758) 35617
76.6%
Lowercase Letter
ValueCountFrequency (%)
e 1523
13.8%
a 916
 
8.3%
t 889
 
8.1%
o 876
 
7.9%
s 829
 
7.5%
r 802
 
7.3%
i 774
 
7.0%
n 713
 
6.5%
l 570
 
5.2%
c 440
 
4.0%
Other values (16) 2696
24.4%
Uppercase Letter
ValueCountFrequency (%)
I 235
 
9.9%
S 210
 
8.8%
A 204
 
8.6%
D 200
 
8.4%
M 169
 
7.1%
L 146
 
6.1%
K 143
 
6.0%
T 133
 
5.6%
R 119
 
5.0%
N 107
 
4.5%
Other values (16) 708
29.8%
Other Punctuation
ValueCountFrequency (%)
' 1330
30.4%
, 1289
29.5%
. 696
15.9%
· 459
 
10.5%
" 295
 
6.8%
206
 
4.7%
% 62
 
1.4%
: 15
 
0.3%
& 10
 
0.2%
3
 
0.1%
Other values (3) 5
 
0.1%
Decimal Number
ValueCountFrequency (%)
0 437
33.8%
1 290
22.4%
2 206
15.9%
9 105
 
8.1%
3 97
 
7.5%
5 50
 
3.9%
4 37
 
2.9%
6 26
 
2.0%
8 26
 
2.0%
7 20
 
1.5%
Math Symbol
ValueCountFrequency (%)
18
46.2%
6
 
15.4%
6
 
15.4%
~ 5
 
12.8%
+ 2
 
5.1%
= 2
 
5.1%
Other Symbol
ValueCountFrequency (%)
12
37.5%
8
25.0%
5
15.6%
5
15.6%
1
 
3.1%
1
 
3.1%
Other Number
ValueCountFrequency (%)
7
77.8%
1
 
11.1%
1
 
11.1%
Open Punctuation
ValueCountFrequency (%)
( 106
50.2%
[ 105
49.8%
Close Punctuation
ValueCountFrequency (%)
] 105
50.0%
) 105
50.0%
Final Punctuation
ValueCountFrequency (%)
27
71.1%
11
28.9%
Initial Punctuation
ValueCountFrequency (%)
24
70.6%
10
29.4%
Modifier Symbol
ValueCountFrequency (%)
` 18
90.0%
¨ 2
 
10.0%
Space Separator
ValueCountFrequency (%)
17268
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 239
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 46300
55.4%
Common 23763
28.4%
Latin 13402
 
16.0%
Han 172
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2597
 
5.6%
1369
 
3.0%
1336
 
2.9%
1037
 
2.2%
996
 
2.2%
937
 
2.0%
786
 
1.7%
669
 
1.4%
565
 
1.2%
562
 
1.2%
Other values (732) 35446
76.6%
Latin
ValueCountFrequency (%)
e 1523
 
11.4%
a 916
 
6.8%
t 889
 
6.6%
o 876
 
6.5%
s 829
 
6.2%
r 802
 
6.0%
i 774
 
5.8%
n 713
 
5.3%
l 570
 
4.3%
c 440
 
3.3%
Other values (42) 5070
37.8%
Common
ValueCountFrequency (%)
17268
72.7%
' 1330
 
5.6%
, 1289
 
5.4%
. 696
 
2.9%
· 459
 
1.9%
0 437
 
1.8%
" 295
 
1.2%
1 290
 
1.2%
- 239
 
1.0%
206
 
0.9%
Other values (39) 1254
 
5.3%
Han
ValueCountFrequency (%)
73
42.4%
14
 
8.1%
13
 
7.6%
11
 
6.4%
11
 
6.4%
11
 
6.4%
9
 
5.2%
4
 
2.3%
4
 
2.3%
3
 
1.7%
Other values (17) 19
 
11.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 46297
55.4%
ASCII 36350
43.5%
None 472
 
0.6%
Punctuation 281
 
0.3%
CJK 171
 
0.2%
Arrows 30
 
< 0.1%
CJK Compat 26
 
< 0.1%
Letterlike Symbols 5
 
< 0.1%
Compat Jamo 2
 
< 0.1%
Enclosed Alphanum 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
17268
47.5%
e 1523
 
4.2%
' 1330
 
3.7%
, 1289
 
3.5%
a 916
 
2.5%
t 889
 
2.4%
o 876
 
2.4%
s 829
 
2.3%
r 802
 
2.2%
i 774
 
2.1%
Other values (71) 9854
27.1%
Hangul
ValueCountFrequency (%)
2597
 
5.6%
1369
 
3.0%
1336
 
2.9%
1037
 
2.2%
996
 
2.2%
937
 
2.0%
786
 
1.7%
669
 
1.4%
565
 
1.2%
562
 
1.2%
Other values (730) 35443
76.6%
None
ValueCountFrequency (%)
· 459
97.2%
7
 
1.5%
3
 
0.6%
¨ 2
 
0.4%
1
 
0.2%
Punctuation
ValueCountFrequency (%)
206
73.3%
27
 
9.6%
24
 
8.5%
11
 
3.9%
10
 
3.6%
3
 
1.1%
CJK
ValueCountFrequency (%)
73
42.7%
14
 
8.2%
13
 
7.6%
11
 
6.4%
11
 
6.4%
11
 
6.4%
9
 
5.3%
4
 
2.3%
4
 
2.3%
3
 
1.8%
Other values (16) 18
 
10.5%
Arrows
ValueCountFrequency (%)
18
60.0%
6
 
20.0%
6
 
20.0%
CJK Compat
ValueCountFrequency (%)
12
46.2%
8
30.8%
5
19.2%
1
 
3.8%
Letterlike Symbols
ValueCountFrequency (%)
5
100.0%
Compat Jamo
ValueCountFrequency (%)
2
100.0%
Enclosed Alphanum
ValueCountFrequency (%)
1
50.0%
1
50.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%

매체
Text

Distinct328
Distinct (%)13.3%
Missing0
Missing (%)0.0%
Memory size19.4 KiB
2023-12-13T07:23:03.241286image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length32
Mean length5.8562171
Min length3

Characters and Unicode

Total characters14459
Distinct characters273
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique166 ?
Unique (%)6.7%

Sample

1st row이데일리
2nd row투데이에너지
3rd row한국경제
4th row중도일보
5th row동아일보
ValueCountFrequency (%)
연합뉴스 127
 
4.7%
전자신문 111
 
4.1%
헤럴드경제 92
 
3.4%
매일경제 91
 
3.3%
머니투데이 77
 
2.8%
ytn 65
 
2.4%
대덕넷 65
 
2.4%
news1 63
 
2.3%
디지털타임스 62
 
2.3%
충청뉴스 58
 
2.1%
Other values (376) 1919
70.3%
2023-12-13T07:23:03.627898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
736
 
5.1%
582
 
4.0%
468
 
3.2%
466
 
3.2%
393
 
2.7%
385
 
2.7%
369
 
2.6%
348
 
2.4%
341
 
2.4%
e 300
 
2.1%
Other values (263) 10071
69.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9896
68.4%
Lowercase Letter 1834
 
12.7%
Uppercase Letter 1725
 
11.9%
Space Separator 348
 
2.4%
Open Punctuation 235
 
1.6%
Close Punctuation 235
 
1.6%
Decimal Number 138
 
1.0%
Other Punctuation 46
 
0.3%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
736
 
7.4%
582
 
5.9%
468
 
4.7%
466
 
4.7%
393
 
4.0%
385
 
3.9%
369
 
3.7%
341
 
3.4%
264
 
2.7%
245
 
2.5%
Other values (194) 5647
57.1%
Lowercase Letter
ValueCountFrequency (%)
e 300
16.4%
s 207
11.3%
i 192
10.5%
a 146
 
8.0%
r 133
 
7.3%
o 114
 
6.2%
n 109
 
5.9%
u 81
 
4.4%
l 73
 
4.0%
t 66
 
3.6%
Other values (19) 413
22.5%
Uppercase Letter
ValueCountFrequency (%)
N 244
14.1%
B 209
12.1%
S 198
11.5%
T 198
11.5%
E 121
 
7.0%
C 91
 
5.3%
K 87
 
5.0%
W 83
 
4.8%
A 81
 
4.7%
Y 72
 
4.2%
Other values (17) 341
19.8%
Decimal Number
ValueCountFrequency (%)
1 63
45.7%
2 33
23.9%
4 33
23.9%
0 3
 
2.2%
3 3
 
2.2%
6 3
 
2.2%
Other Punctuation
ValueCountFrequency (%)
/ 25
54.3%
. 17
37.0%
& 4
 
8.7%
Space Separator
ValueCountFrequency (%)
348
100.0%
Open Punctuation
ValueCountFrequency (%)
( 235
100.0%
Close Punctuation
ValueCountFrequency (%)
) 235
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9896
68.4%
Latin 3554
 
24.6%
Common 1004
 
6.9%
Cyrillic 5
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
736
 
7.4%
582
 
5.9%
468
 
4.7%
466
 
4.7%
393
 
4.0%
385
 
3.9%
369
 
3.7%
341
 
3.4%
264
 
2.7%
245
 
2.5%
Other values (194) 5647
57.1%
Latin
ValueCountFrequency (%)
e 300
 
8.4%
N 244
 
6.9%
B 209
 
5.9%
s 207
 
5.8%
S 198
 
5.6%
T 198
 
5.6%
i 192
 
5.4%
a 146
 
4.1%
r 133
 
3.7%
E 121
 
3.4%
Other values (41) 1606
45.2%
Common
ValueCountFrequency (%)
348
34.7%
( 235
23.4%
) 235
23.4%
1 63
 
6.3%
2 33
 
3.3%
4 33
 
3.3%
/ 25
 
2.5%
. 17
 
1.7%
& 4
 
0.4%
0 3
 
0.3%
Other values (3) 8
 
0.8%
Cyrillic
ValueCountFrequency (%)
Ф 1
20.0%
о 1
20.0%
к 1
20.0%
у 1
20.0%
с 1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9896
68.4%
ASCII 4558
31.5%
Cyrillic 5
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
736
 
7.4%
582
 
5.9%
468
 
4.7%
466
 
4.7%
393
 
4.0%
385
 
3.9%
369
 
3.7%
341
 
3.4%
264
 
2.7%
245
 
2.5%
Other values (194) 5647
57.1%
ASCII
ValueCountFrequency (%)
348
 
7.6%
e 300
 
6.6%
N 244
 
5.4%
( 235
 
5.2%
) 235
 
5.2%
B 209
 
4.6%
s 207
 
4.5%
S 198
 
4.3%
T 198
 
4.3%
i 192
 
4.2%
Other values (54) 2192
48.1%
Cyrillic
ValueCountFrequency (%)
Ф 1
20.0%
о 1
20.0%
к 1
20.0%
у 1
20.0%
с 1
20.0%

매체구분
Categorical

Distinct6
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size19.4 KiB
신문
1091 
인터넷
929 
통신
234 
TV방송
183 
월간지
 
31

Length

Max length5
Median length2
Mean length2.5382746
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row신문
2nd row신문
3rd row신문
4th row신문
5th row신문

Common Values

ValueCountFrequency (%)
신문 1091
44.2%
인터넷 929
37.6%
통신 234
 
9.5%
TV방송 183
 
7.4%
월간지 31
 
1.3%
라디오방송 1
 
< 0.1%

Length

2023-12-13T07:23:03.742351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:23:03.824869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
신문 1091
44.2%
인터넷 929
37.6%
통신 234
 
9.5%
tv방송 183
 
7.4%
월간지 31
 
1.3%
라디오방송 1
 
< 0.1%

분류항목
Categorical

Distinct4
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size19.4 KiB
연구성과
1519 
일반
705 
기획특집
212 
기고칼럼
 
33

Length

Max length4
Median length4
Mean length3.4289186
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row기획특집
2nd row기획특집
3rd row일반
4th row기획특집
5th row기획특집

Common Values

ValueCountFrequency (%)
연구성과 1519
61.5%
일반 705
28.6%
기획특집 212
 
8.6%
기고칼럼 33
 
1.3%

Length

2023-12-13T07:23:03.930837image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:23:04.022280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
연구성과 1519
61.5%
일반 705
28.6%
기획특집 212
 
8.6%
기고칼럼 33
 
1.3%

Correlations

2023-12-13T07:23:04.075947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
매체구분분류항목
매체구분1.0000.182
분류항목0.1821.000
2023-12-13T07:23:04.139232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
매체구분분류항목
매체구분1.0000.118
분류항목0.1181.000
2023-12-13T07:23:04.204813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
매체구분분류항목
매체구분1.0000.118
분류항목0.1181.000

Missing values

2023-12-13T07:23:02.022681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T07:23:02.106286image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

보도일자제목매체매체구분분류항목
02020-01-01과학계 신년 희망 "경자년, 민첩하고 부지런한 해로"이데일리신문기획특집
12020-01-02[신년 인터뷰] 김창기 한국기계연구원 연구위원투데이에너지신문기획특집
22020-01-07김현석·홍순국·김형국 사장 '공학계 명예의 전당' 올랐다한국경제신문일반
32020-01-08이들의 1년 계획 과학 100년 밝힌다중도일보신문기획특집
42020-01-13스스로 빛내고 알아서 선명하게…디스플레이가 살아 숨쉰다동아일보신문기획특집
52020-01-13스스로 빛내고 알아서 선명하게...디스플레이가 살아 숨쉰다동아사이언스인터넷기획특집
62020-01-13박천홍 한국기계연구원장이 '기계설비신문'에 바란다기계설비신문인터넷기획특집
72020-01-14꿈의 디스플레이' 마이크로 LED 생산성 1,000배↑YTNTV방송기획특집
82020-01-14'꿈의 디스플레이' 마이크로 LED 생산성 1,000배↑YTN 사이언스TV방송기획특집
92020-01-15용접학회-용접조합, 공동 신년인사회 개최전자신문신문기획특집
보도일자제목매체매체구분분류항목
24592023-07-11"3D 바이오프린팅으로 면역세포 강화해 암 치료"팍스넷인터넷연구성과
24602023-07-113D 바이오프린팅 기술로 ‘암세포’ 없앤다헤럴드경제신문연구성과
24612023-07-11NK면역세포 기능 키워 암세포 없앤다...국내연구진 3D바이오프린팅기술 개발헤럴드경제신문연구성과
24622023-07-12기계연구원, 생명공학연구원과 ‘암세포 제거’ 3D바이오프린팅 기술 개발기계신문인터넷연구성과
24632023-07-12‘암 치료 효과 향상’ 기계연, NK세포 3D 바이오프린팅 기술 개발사이언스타임즈인터넷연구성과
24642023-07-123D 프린팅 신기술 잇단 개발…맞춤형 치료 기대이헬스통신인터넷연구성과
24652023-07-12NK세포 담은 하이드로젤 3D 프린팅 기술 개발코리아헬스로그인터넷연구성과
24662023-07-133D 바이오프린팅 기술 세계 최초 개발산업종합저널인터넷연구성과
24672023-07-13기계硏, 암세포 제거 가능한 3D 바이오프린팅 기술 세계 최초 개발철강금속신문신문연구성과
24682023-07-27[과학게시판] 과학 똑동아사이언스인터넷일반

Duplicate rows

Most frequently occurring

보도일자제목매체매체구분분류항목# duplicates
02020-01-22(포토뉴스) 사람 손처럼 움직이는 로봇 손연합뉴스통신연구성과2
12020-04-14검증 안 된 '이동형 음압기' 기준 마련 속도전자신문신문기획특집2
22020-04-21(포토뉴스) 국방부-환경부-과기정통부, 군 차량 미세먼지 저감 위해 맞손연합뉴스통신연구성과2
32020-05-19(포토뉴스) 인사말 하는 강건용 한국자동차공학회 회장매일경제신문기획특집2