Overview

Dataset statistics

Number of variables13
Number of observations3052
Missing cells2891
Missing cells (%)7.3%
Duplicate rows15
Duplicate rows (%)0.5%
Total size in memory316.1 KiB
Average record size in memory106.0 B

Variable types

Text8
Categorical1
DateTime2
Numeric2

Dataset

Description농림식품RnD 타연구개발사업활용 성과 정보 데이터. 과제명, 주관기관, 연구사업명, 연구제목, 연구기관명 등의 항목으로 구성
Author농림식품기술기획평가원
URLhttps://www.data.go.kr/data/15126293/fileData.do

Alerts

Dataset has 15 (0.5%) duplicate rowsDuplicates
기준년도 is highly overall correlated with 성과활용년도High correlation
성과활용년도 is highly overall correlated with 기준년도High correlation
내역사업명 has 1962 (64.3%) missing valuesMissing
연구사업명 has 129 (4.2%) missing valuesMissing
연구기관명 has 88 (2.9%) missing valuesMissing
본연구와의관계 has 702 (23.0%) missing valuesMissing

Reproduction

Analysis started2024-03-15 00:11:51.878310
Analysis finished2024-03-15 00:11:57.273348
Duration5.4 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct1586
Distinct (%)52.0%
Missing0
Missing (%)0.0%
Memory size24.0 KiB
2024-03-15T09:11:59.013422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length8
Mean length8.0589777
Min length8

Characters and Unicode

Total characters24596
Distinct characters24
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique913 ?
Unique (%)29.9%

Sample

1st row105150-3
2nd row102050-3
3rd row102050-3
4th row103041-2
5th row105139-3
ValueCountFrequency (%)
914008-4 28
 
0.9%
301008-3 16
 
0.5%
300005-4 16
 
0.5%
609002-5 16
 
0.5%
397011-3 13
 
0.4%
315089-3 13
 
0.4%
199045-3 12
 
0.4%
707002-1 12
 
0.4%
199024-3 11
 
0.4%
107009-3 11
 
0.4%
Other values (1576) 2904
95.2%
2024-03-15T09:12:00.959268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 5105
20.8%
1 3880
15.8%
3 3057
12.4%
- 3022
12.3%
2 2734
11.1%
9 1753
 
7.1%
5 1266
 
5.1%
4 1226
 
5.0%
8 853
 
3.5%
7 815
 
3.3%
Other values (14) 885
 
3.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 21500
87.4%
Dash Punctuation 3022
 
12.3%
Uppercase Letter 74
 
0.3%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
G 18
24.3%
C 16
21.6%
B 15
20.3%
S 14
18.9%
Y 2
 
2.7%
N 2
 
2.7%
D 1
 
1.4%
F 1
 
1.4%
U 1
 
1.4%
A 1
 
1.4%
Other values (3) 3
 
4.1%
Decimal Number
ValueCountFrequency (%)
0 5105
23.7%
1 3880
18.0%
3 3057
14.2%
2 2734
12.7%
9 1753
 
8.2%
5 1266
 
5.9%
4 1226
 
5.7%
8 853
 
4.0%
7 815
 
3.8%
6 811
 
3.8%
Dash Punctuation
ValueCountFrequency (%)
- 3022
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 24522
99.7%
Latin 74
 
0.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
G 18
24.3%
C 16
21.6%
B 15
20.3%
S 14
18.9%
Y 2
 
2.7%
N 2
 
2.7%
D 1
 
1.4%
F 1
 
1.4%
U 1
 
1.4%
A 1
 
1.4%
Other values (3) 3
 
4.1%
Common
ValueCountFrequency (%)
0 5105
20.8%
1 3880
15.8%
3 3057
12.5%
- 3022
12.3%
2 2734
11.1%
9 1753
 
7.1%
5 1266
 
5.2%
4 1226
 
5.0%
8 853
 
3.5%
7 815
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 24596
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 5105
20.8%
1 3880
15.8%
3 3057
12.4%
- 3022
12.3%
2 2734
11.1%
9 1753
 
7.1%
5 1266
 
5.1%
4 1226
 
5.0%
8 853
 
3.5%
7 815
 
3.3%
Other values (14) 885
 
3.6%

과제구분
Categorical

Distinct37
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size24.0 KiB
첨단기술개발과제
838 
현장애로기술개발과제
593 
농생명산업기술개발
382 
고부가가치식품기술개발
236 
현장적용기술개발
183 
Other values (32)
820 

Length

Max length25
Median length24
Mean length9.0933814
Min length4

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row수산실용화기술개발
2nd row수산실용화기술개발
3rd row수산실용화기술개발
4th row현장애로기술개발과제
5th row현장적용기술개발

Common Values

ValueCountFrequency (%)
첨단기술개발과제 838
27.5%
현장애로기술개발과제 593
19.4%
농생명산업기술개발 382
12.5%
고부가가치식품기술개발 236
 
7.7%
현장적용기술개발 183
 
6.0%
첨단생산기술개발 173
 
5.7%
기획연구 168
 
5.5%
기술사업화지원 70
 
2.3%
수출전략기술개발 69
 
2.3%
수산실용화기술개발 62
 
2.0%
Other values (27) 278
 
9.1%

Length

2024-03-15T09:12:01.265879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
첨단기술개발과제 838
24.7%
현장애로기술개발과제 593
17.5%
농생명산업기술개발 382
11.3%
고부가가치식품기술개발 236
 
7.0%
현장적용기술개발 183
 
5.4%
첨단생산기술개발 173
 
5.1%
기획연구 168
 
5.0%
기술사업화지원 70
 
2.1%
수출전략기술개발 69
 
2.0%
수산실용화기술개발 62
 
1.8%
Other values (51) 615
18.1%

내역사업명
Text

MISSING 

Distinct55
Distinct (%)5.0%
Missing1962
Missing (%)64.3%
Memory size24.0 KiB
2024-03-15T09:12:02.055887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length20
Mean length10.550459
Min length5

Characters and Unicode

Total characters11500
Distinct characters173
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8 ?
Unique (%)0.7%

Sample

1st row생명자원 부가가치 제고기술
2nd row생명자원 부가가치 제고기술
3rd row생명자원 부가가치 제고기술
4th row기능성강화식품
5th row생명자원 부가가치 제고기술
ValueCountFrequency (%)
생명자원 335
17.0%
제고기술 304
15.5%
부가가치 304
15.5%
기능성강화식품 172
 
8.8%
첨단농자재생산 100
 
5.1%
수출전략형상품개발 64
 
3.3%
ict융복합시스템 57
 
2.9%
현장연계고부가가치사업화 49
 
2.5%
산업화미생물유전체전략연구 41
 
2.1%
생산·관리기술 31
 
1.6%
Other values (88) 508
25.9%
2024-03-15T09:12:03.383315image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
875
 
7.6%
743
 
6.5%
587
 
5.1%
531
 
4.6%
456
 
4.0%
382
 
3.3%
372
 
3.2%
366
 
3.2%
366
 
3.2%
361
 
3.1%
Other values (163) 6461
56.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10083
87.7%
Space Separator 875
 
7.6%
Lowercase Letter 240
 
2.1%
Uppercase Letter 237
 
2.1%
Other Punctuation 47
 
0.4%
Decimal Number 18
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
743
 
7.4%
587
 
5.8%
531
 
5.3%
456
 
4.5%
382
 
3.8%
372
 
3.7%
366
 
3.6%
366
 
3.6%
361
 
3.6%
361
 
3.6%
Other values (147) 5558
55.1%
Uppercase Letter
ValueCountFrequency (%)
C 57
24.1%
T 57
24.1%
I 57
24.1%
S 30
12.7%
G 30
12.7%
R 3
 
1.3%
D 3
 
1.3%
Lowercase Letter
ValueCountFrequency (%)
e 90
37.5%
d 60
25.0%
o 30
 
12.5%
n 30
 
12.5%
l 30
 
12.5%
Other Punctuation
ValueCountFrequency (%)
· 44
93.6%
& 3
 
6.4%
Space Separator
ValueCountFrequency (%)
875
100.0%
Decimal Number
ValueCountFrequency (%)
1 18
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 10083
87.7%
Common 940
 
8.2%
Latin 477
 
4.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
743
 
7.4%
587
 
5.8%
531
 
5.3%
456
 
4.5%
382
 
3.8%
372
 
3.7%
366
 
3.6%
366
 
3.6%
361
 
3.6%
361
 
3.6%
Other values (147) 5558
55.1%
Latin
ValueCountFrequency (%)
e 90
18.9%
d 60
12.6%
C 57
11.9%
T 57
11.9%
I 57
11.9%
S 30
 
6.3%
o 30
 
6.3%
n 30
 
6.3%
G 30
 
6.3%
l 30
 
6.3%
Other values (2) 6
 
1.3%
Common
ValueCountFrequency (%)
875
93.1%
· 44
 
4.7%
1 18
 
1.9%
& 3
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10083
87.7%
ASCII 1373
 
11.9%
None 44
 
0.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
875
63.7%
e 90
 
6.6%
d 60
 
4.4%
C 57
 
4.2%
T 57
 
4.2%
I 57
 
4.2%
S 30
 
2.2%
o 30
 
2.2%
n 30
 
2.2%
G 30
 
2.2%
Other values (5) 57
 
4.2%
Hangul
ValueCountFrequency (%)
743
 
7.4%
587
 
5.8%
531
 
5.3%
456
 
4.5%
382
 
3.8%
372
 
3.7%
366
 
3.6%
366
 
3.6%
361
 
3.6%
361
 
3.6%
Other values (147) 5558
55.1%
None
ValueCountFrequency (%)
· 44
100.0%
Distinct1583
Distinct (%)51.9%
Missing0
Missing (%)0.0%
Memory size24.0 KiB
2024-03-15T09:12:04.656000image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length97
Median length63
Mean length33.447248
Min length7

Characters and Unicode

Total characters102081
Distinct characters752
Distinct categories12 ?
Distinct scripts5 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique909 ?
Unique (%)29.8%

Sample

1st row해양환경내 다이옥신 오염 신속 검출 기술 개발
2nd row수산 동·식물로부터 유전자 조작에 의한 천연 단백분해효소 저해제 (protease inhibitor)의 대량생산 및 수산제품 응용기술개발
3rd row수산 동·식물로부터 유전자 조작에 의한 천연 단백분해효소 저해제 (protease inhibitor)의 대량생산 및 수산제품 응용기술개발
4th rowDairy beef의 비육기술 개발에 관한 연구
5th row지렁이를 이용한 유기성 자원의 처리와 이용
ValueCountFrequency (%)
개발 1748
 
7.1%
1528
 
6.2%
이용한 603
 
2.5%
위한 599
 
2.4%
연구 493
 
2.0%
기술 266
 
1.1%
시스템 231
 
0.9%
관한 203
 
0.8%
기능성 191
 
0.8%
기술개발 161
 
0.7%
Other values (5572) 18519
75.5%
2024-03-15T09:12:06.329906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
21537
 
21.1%
2536
 
2.5%
2456
 
2.4%
2094
 
2.1%
2067
 
2.0%
1727
 
1.7%
1706
 
1.7%
1535
 
1.5%
1232
 
1.2%
1184
 
1.2%
Other values (742) 64007
62.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 73915
72.4%
Space Separator 21565
 
21.1%
Lowercase Letter 3735
 
3.7%
Uppercase Letter 1508
 
1.5%
Other Punctuation 560
 
0.5%
Open Punctuation 267
 
0.3%
Close Punctuation 267
 
0.3%
Decimal Number 150
 
0.1%
Dash Punctuation 109
 
0.1%
Initial Punctuation 3
 
< 0.1%
Other values (2) 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2536
 
3.4%
2456
 
3.3%
2094
 
2.8%
2067
 
2.8%
1727
 
2.3%
1706
 
2.3%
1535
 
2.1%
1232
 
1.7%
1184
 
1.6%
1181
 
1.6%
Other values (661) 56197
76.0%
Lowercase Letter
ValueCountFrequency (%)
e 387
10.4%
i 378
10.1%
a 341
 
9.1%
o 288
 
7.7%
r 283
 
7.6%
s 277
 
7.4%
t 264
 
7.1%
c 225
 
6.0%
n 219
 
5.9%
l 196
 
5.2%
Other values (16) 877
23.5%
Uppercase Letter
ValueCountFrequency (%)
C 157
 
10.4%
P 147
 
9.7%
A 140
 
9.3%
S 138
 
9.2%
D 111
 
7.4%
I 92
 
6.1%
N 89
 
5.9%
R 82
 
5.4%
M 76
 
5.0%
T 75
 
5.0%
Other values (15) 401
26.6%
Other Punctuation
ValueCountFrequency (%)
, 352
62.9%
· 81
 
14.5%
. 48
 
8.6%
' 29
 
5.2%
/ 28
 
5.0%
& 10
 
1.8%
# 4
 
0.7%
; 4
 
0.7%
% 3
 
0.5%
: 1
 
0.2%
Decimal Number
ValueCountFrequency (%)
1 34
22.7%
2 33
22.0%
6 24
16.0%
0 21
14.0%
3 14
9.3%
9 8
 
5.3%
5 8
 
5.3%
8 5
 
3.3%
7 3
 
2.0%
Close Punctuation
ValueCountFrequency (%)
) 259
97.0%
7
 
2.6%
] 1
 
0.4%
Space Separator
ValueCountFrequency (%)
21537
99.9%
  28
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 266
99.6%
[ 1
 
0.4%
Dash Punctuation
ValueCountFrequency (%)
- 109
100.0%
Initial Punctuation
ValueCountFrequency (%)
3
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 73907
72.4%
Common 22923
 
22.5%
Latin 5238
 
5.1%
Han 8
 
< 0.1%
Greek 5
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2536
 
3.4%
2456
 
3.3%
2094
 
2.8%
2067
 
2.8%
1727
 
2.3%
1706
 
2.3%
1535
 
2.1%
1232
 
1.7%
1184
 
1.6%
1181
 
1.6%
Other values (656) 56189
76.0%
Latin
ValueCountFrequency (%)
e 387
 
7.4%
i 378
 
7.2%
a 341
 
6.5%
o 288
 
5.5%
r 283
 
5.4%
s 277
 
5.3%
t 264
 
5.0%
c 225
 
4.3%
n 219
 
4.2%
l 196
 
3.7%
Other values (40) 2380
45.4%
Common
ValueCountFrequency (%)
21537
94.0%
, 352
 
1.5%
( 266
 
1.2%
) 259
 
1.1%
- 109
 
0.5%
· 81
 
0.4%
. 48
 
0.2%
1 34
 
0.1%
2 33
 
0.1%
' 29
 
0.1%
Other values (20) 175
 
0.8%
Han
ValueCountFrequency (%)
2
25.0%
2
25.0%
2
25.0%
1
12.5%
1
12.5%
Greek
ValueCountFrequency (%)
α 5
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 73888
72.4%
ASCII 28041
 
27.5%
None 121
 
0.1%
Compat Jamo 19
 
< 0.1%
CJK 6
 
< 0.1%
Punctuation 4
 
< 0.1%
CJK Compat Ideographs 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
21537
76.8%
e 387
 
1.4%
i 378
 
1.3%
, 352
 
1.3%
a 341
 
1.2%
o 288
 
1.0%
r 283
 
1.0%
s 277
 
1.0%
( 266
 
0.9%
t 264
 
0.9%
Other values (65) 3668
 
13.1%
Hangul
ValueCountFrequency (%)
2536
 
3.4%
2456
 
3.3%
2094
 
2.8%
2067
 
2.8%
1727
 
2.3%
1706
 
2.3%
1535
 
2.1%
1232
 
1.7%
1184
 
1.6%
1181
 
1.6%
Other values (655) 56170
76.0%
None
ValueCountFrequency (%)
· 81
66.9%
  28
 
23.1%
7
 
5.8%
α 5
 
4.1%
Compat Jamo
ValueCountFrequency (%)
19
100.0%
Punctuation
ValueCountFrequency (%)
3
75.0%
1
 
25.0%
CJK
ValueCountFrequency (%)
2
33.3%
2
33.3%
2
33.3%
CJK Compat Ideographs
ValueCountFrequency (%)
1
50.0%
1
50.0%
Distinct512
Distinct (%)16.8%
Missing0
Missing (%)0.0%
Memory size24.0 KiB
2024-03-15T09:12:07.183884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length19
Mean length7.3558322
Min length2

Characters and Unicode

Total characters22450
Distinct characters353
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique227 ?
Unique (%)7.4%

Sample

1st row(주)네오엔비즈
2nd row강릉원주대학교
3rd row강릉원주대학교
4th row농협중앙회 축산연구소
5th row연세대학교(원주)
ValueCountFrequency (%)
한국식품연구원 211
 
6.0%
서울대학교 192
 
5.5%
산학협력단 164
 
4.7%
경상대학교 122
 
3.5%
경북대학교 118
 
3.4%
강원대학교 104
 
3.0%
전남대학교 98
 
2.8%
충남대학교 98
 
2.8%
건국대학교(서울 77
 
2.2%
충북대학교 77
 
2.2%
Other values (506) 2247
64.1%
2024-03-15T09:12:08.370419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2149
 
9.6%
1826
 
8.1%
1729
 
7.7%
730
 
3.3%
) 705
 
3.1%
( 705
 
3.1%
658
 
2.9%
589
 
2.6%
570
 
2.5%
552
 
2.5%
Other values (343) 12237
54.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 20554
91.6%
Close Punctuation 705
 
3.1%
Open Punctuation 705
 
3.1%
Space Separator 461
 
2.1%
Uppercase Letter 23
 
0.1%
Other Punctuation 1
 
< 0.1%
Decimal Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2149
 
10.5%
1826
 
8.9%
1729
 
8.4%
730
 
3.6%
658
 
3.2%
589
 
2.9%
570
 
2.8%
552
 
2.7%
476
 
2.3%
430
 
2.1%
Other values (328) 10845
52.8%
Uppercase Letter
ValueCountFrequency (%)
H 6
26.1%
N 3
13.0%
K 3
13.0%
G 3
13.0%
C 2
 
8.7%
A 2
 
8.7%
S 1
 
4.3%
P 1
 
4.3%
B 1
 
4.3%
J 1
 
4.3%
Close Punctuation
ValueCountFrequency (%)
) 705
100.0%
Open Punctuation
ValueCountFrequency (%)
( 705
100.0%
Space Separator
ValueCountFrequency (%)
461
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%
Decimal Number
ValueCountFrequency (%)
1 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 20554
91.6%
Common 1873
 
8.3%
Latin 23
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2149
 
10.5%
1826
 
8.9%
1729
 
8.4%
730
 
3.6%
658
 
3.2%
589
 
2.9%
570
 
2.8%
552
 
2.7%
476
 
2.3%
430
 
2.1%
Other values (328) 10845
52.8%
Latin
ValueCountFrequency (%)
H 6
26.1%
N 3
13.0%
K 3
13.0%
G 3
13.0%
C 2
 
8.7%
A 2
 
8.7%
S 1
 
4.3%
P 1
 
4.3%
B 1
 
4.3%
J 1
 
4.3%
Common
ValueCountFrequency (%)
) 705
37.6%
( 705
37.6%
461
24.6%
& 1
 
0.1%
1 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 20554
91.6%
ASCII 1896
 
8.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2149
 
10.5%
1826
 
8.9%
1729
 
8.4%
730
 
3.6%
658
 
3.2%
589
 
2.9%
570
 
2.8%
552
 
2.7%
476
 
2.3%
430
 
2.1%
Other values (328) 10845
52.8%
ASCII
ValueCountFrequency (%)
) 705
37.2%
( 705
37.2%
461
24.3%
H 6
 
0.3%
N 3
 
0.2%
K 3
 
0.2%
G 3
 
0.2%
C 2
 
0.1%
A 2
 
0.1%
S 1
 
0.1%
Other values (5) 5
 
0.3%
Distinct310
Distinct (%)10.2%
Missing0
Missing (%)0.0%
Memory size24.0 KiB
Minimum1994-12-01 00:00:00
Maximum2022-04-01 00:00:00
2024-03-15T09:12:08.740901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T09:12:09.179248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct451
Distinct (%)14.8%
Missing0
Missing (%)0.0%
Memory size24.0 KiB
Minimum1995-11-30 00:00:00
Maximum2026-12-31 00:00:00
2024-03-15T09:12:09.590773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T09:12:10.090992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

기준년도
Real number (ℝ)

HIGH CORRELATION 

Distinct32
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2007.5374
Minimum1991
Maximum2024
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size26.9 KiB
2024-03-15T09:12:10.356427image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1991
5-th percentile1998
Q12002
median2006
Q32013
95-th percentile2019
Maximum2024
Range33
Interquartile range (IQR)11

Descriptive statistics

Standard deviation6.8430152
Coefficient of variation (CV)0.0034086615
Kurtosis-0.8810905
Mean2007.5374
Median Absolute Deviation (MAD)5
Skewness0.25714292
Sum6127004
Variance46.826857
MonotonicityNot monotonic
2024-03-15T09:12:10.843161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=32)
ValueCountFrequency (%)
2005 200
 
6.6%
2004 190
 
6.2%
2003 179
 
5.9%
2006 155
 
5.1%
2000 154
 
5.0%
2012 152
 
5.0%
2002 147
 
4.8%
2001 134
 
4.4%
1999 131
 
4.3%
2017 126
 
4.1%
Other values (22) 1484
48.6%
ValueCountFrequency (%)
1991 3
 
0.1%
1994 13
 
0.4%
1995 24
 
0.8%
1996 38
 
1.2%
1997 57
 
1.9%
1998 108
3.5%
1999 131
4.3%
2000 154
5.0%
2001 134
4.4%
2002 147
4.8%
ValueCountFrequency (%)
2024 7
 
0.2%
2023 14
 
0.5%
2022 21
 
0.7%
2021 31
 
1.0%
2020 44
 
1.4%
2019 70
2.3%
2018 107
3.5%
2017 126
4.1%
2016 110
3.6%
2015 85
2.8%

성과활용년도
Real number (ℝ)

HIGH CORRELATION 

Distinct32
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2007.8244
Minimum1991
Maximum2024
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size26.9 KiB
2024-03-15T09:12:11.172186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1991
5-th percentile1998
Q12002
median2006
Q32014
95-th percentile2020
Maximum2024
Range33
Interquartile range (IQR)12

Descriptive statistics

Standard deviation7.1181872
Coefficient of variation (CV)0.003545224
Kurtosis-1.0129951
Mean2007.8244
Median Absolute Deviation (MAD)6
Skewness0.24013103
Sum6127880
Variance50.668589
MonotonicityNot monotonic
2024-03-15T09:12:11.560144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=32)
ValueCountFrequency (%)
2005 200
 
6.6%
2004 190
 
6.2%
2003 179
 
5.9%
2000 154
 
5.0%
2006 150
 
4.9%
2002 147
 
4.8%
2001 134
 
4.4%
2017 134
 
4.4%
1999 131
 
4.3%
2016 131
 
4.3%
Other values (22) 1502
49.2%
ValueCountFrequency (%)
1991 3
 
0.1%
1994 13
 
0.4%
1995 24
 
0.8%
1996 38
 
1.2%
1997 57
 
1.9%
1998 108
3.5%
1999 131
4.3%
2000 154
5.0%
2001 134
4.4%
2002 147
4.8%
ValueCountFrequency (%)
2024 1
 
< 0.1%
2023 24
 
0.8%
2022 25
 
0.8%
2021 51
 
1.7%
2020 57
1.9%
2019 84
2.8%
2018 105
3.4%
2017 134
4.4%
2016 131
4.3%
2015 123
4.0%

연구사업명
Text

MISSING 

Distinct1928
Distinct (%)66.0%
Missing129
Missing (%)4.2%
Memory size24.0 KiB
2024-03-15T09:12:12.573707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length80
Median length51
Mean length13.119398
Min length2

Characters and Unicode

Total characters38348
Distinct characters613
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1571 ?
Unique (%)53.7%

Sample

1st row국립환경과학원 자체 사업
2nd row한일국제협력사업
3rd row지역R&D클러스터구축사업
4th row연구용역과제
5th row예비 기술창업자 육성사업
ValueCountFrequency (%)
146
 
2.2%
개발 128
 
1.9%
연구 95
 
1.4%
사업 87
 
1.3%
농림기술개발사업 78
 
1.2%
지원사업 54
 
0.8%
농림기술개발 37
 
0.6%
이용한 36
 
0.5%
위한 35
 
0.5%
농림부 33
 
0.5%
Other values (3064) 5963
89.1%
2024-03-15T09:12:14.287394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3815
 
9.9%
1891
 
4.9%
1564
 
4.1%
1501
 
3.9%
1077
 
2.8%
1074
 
2.8%
1065
 
2.8%
1052
 
2.7%
1013
 
2.6%
796
 
2.1%
Other values (603) 23500
61.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 32173
83.9%
Space Separator 3815
 
9.9%
Decimal Number 728
 
1.9%
Uppercase Letter 613
 
1.6%
Lowercase Letter 482
 
1.3%
Open Punctuation 181
 
0.5%
Close Punctuation 181
 
0.5%
Other Punctuation 163
 
0.4%
Connector Punctuation 7
 
< 0.1%
Math Symbol 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1891
 
5.9%
1564
 
4.9%
1501
 
4.7%
1077
 
3.3%
1074
 
3.3%
1065
 
3.3%
1052
 
3.3%
1013
 
3.1%
796
 
2.5%
636
 
2.0%
Other values (530) 20504
63.7%
Lowercase Letter
ValueCountFrequency (%)
e 88
18.3%
o 52
10.8%
d 44
9.1%
n 41
8.5%
r 40
8.3%
l 34
 
7.1%
i 31
 
6.4%
a 24
 
5.0%
t 20
 
4.1%
s 19
 
3.9%
Other values (12) 89
18.5%
Uppercase Letter
ValueCountFrequency (%)
R 87
14.2%
S 60
9.8%
I 57
9.3%
C 56
9.1%
P 51
8.3%
G 51
8.3%
D 42
6.9%
A 41
6.7%
T 37
 
6.0%
B 29
 
4.7%
Other values (12) 102
16.6%
Decimal Number
ValueCountFrequency (%)
2 234
32.1%
1 176
24.2%
0 172
23.6%
8 31
 
4.3%
9 26
 
3.6%
5 21
 
2.9%
3 19
 
2.6%
4 17
 
2.3%
7 16
 
2.2%
6 16
 
2.2%
Other Punctuation
ValueCountFrequency (%)
, 51
31.3%
& 38
23.3%
/ 33
20.2%
· 9
 
5.5%
; 8
 
4.9%
# 7
 
4.3%
" 6
 
3.7%
' 5
 
3.1%
: 4
 
2.5%
. 2
 
1.2%
Open Punctuation
ValueCountFrequency (%)
( 179
98.9%
[ 1
 
0.6%
1
 
0.6%
Close Punctuation
ValueCountFrequency (%)
) 178
98.3%
] 2
 
1.1%
1
 
0.6%
Space Separator
ValueCountFrequency (%)
3815
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 7
100.0%
Math Symbol
ValueCountFrequency (%)
+ 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 32170
83.9%
Common 5080
 
13.2%
Latin 1095
 
2.9%
Han 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1891
 
5.9%
1564
 
4.9%
1501
 
4.7%
1077
 
3.3%
1074
 
3.3%
1065
 
3.3%
1052
 
3.3%
1013
 
3.1%
796
 
2.5%
636
 
2.0%
Other values (527) 20501
63.7%
Latin
ValueCountFrequency (%)
e 88
 
8.0%
R 87
 
7.9%
S 60
 
5.5%
I 57
 
5.2%
C 56
 
5.1%
o 52
 
4.7%
P 51
 
4.7%
G 51
 
4.7%
d 44
 
4.0%
D 42
 
3.8%
Other values (34) 507
46.3%
Common
ValueCountFrequency (%)
3815
75.1%
2 234
 
4.6%
( 179
 
3.5%
) 178
 
3.5%
1 176
 
3.5%
0 172
 
3.4%
, 51
 
1.0%
& 38
 
0.7%
/ 33
 
0.6%
8 31
 
0.6%
Other values (19) 173
 
3.4%
Han
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 32165
83.9%
ASCII 6164
 
16.1%
None 11
 
< 0.1%
Compat Jamo 5
 
< 0.1%
CJK 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3815
61.9%
2 234
 
3.8%
( 179
 
2.9%
) 178
 
2.9%
1 176
 
2.9%
0 172
 
2.8%
e 88
 
1.4%
R 87
 
1.4%
S 60
 
1.0%
I 57
 
0.9%
Other values (60) 1118
 
18.1%
Hangul
ValueCountFrequency (%)
1891
 
5.9%
1564
 
4.9%
1501
 
4.7%
1077
 
3.3%
1074
 
3.3%
1065
 
3.3%
1052
 
3.3%
1013
 
3.1%
796
 
2.5%
636
 
2.0%
Other values (522) 20496
63.7%
None
ValueCountFrequency (%)
· 9
81.8%
1
 
9.1%
1
 
9.1%
CJK
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Compat Jamo
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%
Distinct2821
Distinct (%)92.7%
Missing10
Missing (%)0.3%
Memory size24.0 KiB
2024-03-15T09:12:15.408561image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length161
Median length77
Mean length28.060815
Min length2

Characters and Unicode

Total characters85361
Distinct characters836
Distinct categories13 ?
Distinct scripts5 ?
Distinct blocks8 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2644 ?
Unique (%)86.9%

Sample

1st row내분비계장애물질에 대한 생물학적 분석법 적용연구(II)
2nd row해조류 유래 새로운 효소저해제의 정제 및 새로운 기능성 물질로서의 이용
3rd row기능성 저염 수산발효식품 개발
4th row비육우에너지사료의 원가절감을 위한 수용성유화제 활용검토
5th row지렁이 분변토를 이용한 채소 육묘용 유기상토의 제조
ValueCountFrequency (%)
개발 1061
 
5.4%
970
 
4.9%
이용한 396
 
2.0%
위한 387
 
2.0%
연구 379
 
1.9%
기술 136
 
0.7%
기능성 128
 
0.7%
시스템 108
 
0.5%
관한 102
 
0.5%
활용한 98
 
0.5%
Other values (8283) 15888
80.8%
2024-03-15T09:12:16.967191image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
16721
 
19.6%
1756
 
2.1%
1699
 
2.0%
1620
 
1.9%
1466
 
1.7%
1296
 
1.5%
1276
 
1.5%
1128
 
1.3%
993
 
1.2%
986
 
1.2%
Other values (826) 56420
66.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 62430
73.1%
Space Separator 16721
 
19.6%
Lowercase Letter 3611
 
4.2%
Uppercase Letter 1355
 
1.6%
Decimal Number 488
 
0.6%
Other Punctuation 355
 
0.4%
Open Punctuation 195
 
0.2%
Close Punctuation 190
 
0.2%
Final Punctuation 4
 
< 0.1%
Initial Punctuation 4
 
< 0.1%
Other values (3) 8
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1756
 
2.8%
1699
 
2.7%
1620
 
2.6%
1466
 
2.3%
1296
 
2.1%
1276
 
2.0%
1128
 
1.8%
993
 
1.6%
986
 
1.6%
903
 
1.4%
Other values (736) 49307
79.0%
Lowercase Letter
ValueCountFrequency (%)
e 350
 
9.7%
i 347
 
9.6%
o 337
 
9.3%
a 315
 
8.7%
t 261
 
7.2%
n 260
 
7.2%
r 258
 
7.1%
s 221
 
6.1%
l 188
 
5.2%
c 161
 
4.5%
Other values (18) 913
25.3%
Uppercase Letter
ValueCountFrequency (%)
C 150
11.1%
A 136
 
10.0%
P 110
 
8.1%
S 109
 
8.0%
I 108
 
8.0%
T 88
 
6.5%
R 82
 
6.1%
M 78
 
5.8%
D 69
 
5.1%
B 56
 
4.1%
Other values (16) 369
27.2%
Other Punctuation
ValueCountFrequency (%)
, 200
56.3%
. 48
 
13.5%
/ 40
 
11.3%
· 19
 
5.4%
: 11
 
3.1%
& 10
 
2.8%
# 7
 
2.0%
; 7
 
2.0%
' 5
 
1.4%
" 4
 
1.1%
Decimal Number
ValueCountFrequency (%)
1 114
23.4%
2 112
23.0%
0 100
20.5%
5 33
 
6.8%
9 31
 
6.4%
3 25
 
5.1%
7 22
 
4.5%
8 21
 
4.3%
6 17
 
3.5%
4 13
 
2.7%
Open Punctuation
ValueCountFrequency (%)
( 193
99.0%
1
 
0.5%
1
 
0.5%
Close Punctuation
ValueCountFrequency (%)
) 188
98.9%
1
 
0.5%
1
 
0.5%
Math Symbol
ValueCountFrequency (%)
+ 2
50.0%
< 1
25.0%
> 1
25.0%
Other Symbol
ValueCountFrequency (%)
2
66.7%
1
33.3%
Space Separator
ValueCountFrequency (%)
16721
100.0%
Final Punctuation
ValueCountFrequency (%)
4
100.0%
Initial Punctuation
ValueCountFrequency (%)
4
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 62428
73.1%
Common 17963
 
21.0%
Latin 4964
 
5.8%
Greek 3
 
< 0.1%
Han 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1756
 
2.8%
1699
 
2.7%
1620
 
2.6%
1466
 
2.3%
1296
 
2.1%
1276
 
2.0%
1128
 
1.8%
993
 
1.6%
986
 
1.6%
903
 
1.4%
Other values (734) 49305
79.0%
Latin
ValueCountFrequency (%)
e 350
 
7.1%
i 347
 
7.0%
o 337
 
6.8%
a 315
 
6.3%
t 261
 
5.3%
n 260
 
5.2%
r 258
 
5.2%
s 221
 
4.5%
l 188
 
3.8%
c 161
 
3.2%
Other values (43) 2266
45.6%
Common
ValueCountFrequency (%)
16721
93.1%
, 200
 
1.1%
( 193
 
1.1%
) 188
 
1.0%
1 114
 
0.6%
2 112
 
0.6%
0 100
 
0.6%
. 48
 
0.3%
/ 40
 
0.2%
5 33
 
0.2%
Other values (24) 214
 
1.2%
Han
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Greek
ValueCountFrequency (%)
α 2
66.7%
μ 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 62417
73.1%
ASCII 22893
 
26.8%
None 27
 
< 0.1%
Compat Jamo 10
 
< 0.1%
Punctuation 8
 
< 0.1%
CJK 3
 
< 0.1%
Geometric Shapes 2
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
16721
73.0%
e 350
 
1.5%
i 347
 
1.5%
o 337
 
1.5%
a 315
 
1.4%
t 261
 
1.1%
n 260
 
1.1%
r 258
 
1.1%
s 221
 
1.0%
, 200
 
0.9%
Other values (68) 3623
 
15.8%
Hangul
ValueCountFrequency (%)
1756
 
2.8%
1699
 
2.7%
1620
 
2.6%
1466
 
2.3%
1296
 
2.1%
1276
 
2.0%
1128
 
1.8%
993
 
1.6%
986
 
1.6%
903
 
1.4%
Other values (724) 49294
79.0%
None
ValueCountFrequency (%)
· 19
70.4%
α 2
 
7.4%
1
 
3.7%
μ 1
 
3.7%
1
 
3.7%
1
 
3.7%
1
 
3.7%
1
 
3.7%
Punctuation
ValueCountFrequency (%)
4
50.0%
4
50.0%
Geometric Shapes
ValueCountFrequency (%)
2
100.0%
Compat Jamo
ValueCountFrequency (%)
2
20.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
CJK
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Number Forms
ValueCountFrequency (%)
1
100.0%

연구기관명
Text

MISSING 

Distinct971
Distinct (%)32.8%
Missing88
Missing (%)2.9%
Memory size24.0 KiB
2024-03-15T09:12:17.807436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length98
Median length63
Mean length7.2736167
Min length2

Characters and Unicode

Total characters21559
Distinct characters442
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique661 ?
Unique (%)22.3%

Sample

1st row국립환경연구원
2nd row강릉원주대학교
3rd row강릉원주대학교
4th row농협중앙회 축산연구원
5th row연세대학교 산학협력단
ValueCountFrequency (%)
한국식품연구원 141
 
4.1%
서울대학교 115
 
3.4%
산학협력단 93
 
2.7%
건국대학교 88
 
2.6%
강원대학교 86
 
2.5%
경북대학교 83
 
2.4%
농촌진흥청 80
 
2.3%
경상대학교 79
 
2.3%
충북대학교 59
 
1.7%
전남대학교 54
 
1.6%
Other values (988) 2539
74.3%
2024-03-15T09:12:19.118620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1632
 
7.6%
1425
 
6.6%
1245
 
5.8%
781
 
3.6%
664
 
3.1%
663
 
3.1%
611
 
2.8%
586
 
2.7%
467
 
2.2%
462
 
2.1%
Other values (432) 13023
60.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 19800
91.8%
Space Separator 462
 
2.1%
Close Punctuation 385
 
1.8%
Open Punctuation 381
 
1.8%
Lowercase Letter 276
 
1.3%
Uppercase Letter 126
 
0.6%
Other Punctuation 121
 
0.6%
Decimal Number 5
 
< 0.1%
Other Symbol 2
 
< 0.1%
Modifier Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1632
 
8.2%
1425
 
7.2%
1245
 
6.3%
781
 
3.9%
664
 
3.4%
663
 
3.3%
611
 
3.1%
586
 
3.0%
467
 
2.4%
453
 
2.3%
Other values (372) 11273
56.9%
Lowercase Letter
ValueCountFrequency (%)
a 34
12.3%
e 31
11.2%
i 29
10.5%
s 26
9.4%
t 24
8.7%
o 23
8.3%
n 23
8.3%
r 17
 
6.2%
c 10
 
3.6%
y 9
 
3.3%
Other values (15) 50
18.1%
Uppercase Letter
ValueCountFrequency (%)
S 13
 
10.3%
C 9
 
7.1%
F 9
 
7.1%
B 9
 
7.1%
N 8
 
6.3%
K 8
 
6.3%
L 7
 
5.6%
H 7
 
5.6%
E 7
 
5.6%
P 7
 
5.6%
Other values (13) 42
33.3%
Other Punctuation
ValueCountFrequency (%)
, 101
83.5%
/ 8
 
6.6%
. 5
 
4.1%
& 5
 
4.1%
; 2
 
1.7%
Decimal Number
ValueCountFrequency (%)
1 3
60.0%
2 2
40.0%
Space Separator
ValueCountFrequency (%)
462
100.0%
Close Punctuation
ValueCountFrequency (%)
) 385
100.0%
Open Punctuation
ValueCountFrequency (%)
( 381
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 19800
91.8%
Common 1355
 
6.3%
Latin 402
 
1.9%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1632
 
8.2%
1425
 
7.2%
1245
 
6.3%
781
 
3.9%
664
 
3.4%
663
 
3.3%
611
 
3.1%
586
 
3.0%
467
 
2.4%
453
 
2.3%
Other values (371) 11273
56.9%
Latin
ValueCountFrequency (%)
a 34
 
8.5%
e 31
 
7.7%
i 29
 
7.2%
s 26
 
6.5%
t 24
 
6.0%
o 23
 
5.7%
n 23
 
5.7%
r 17
 
4.2%
S 13
 
3.2%
c 10
 
2.5%
Other values (38) 172
42.8%
Common
ValueCountFrequency (%)
462
34.1%
) 385
28.4%
( 381
28.1%
, 101
 
7.5%
/ 8
 
0.6%
. 5
 
0.4%
& 5
 
0.4%
1 3
 
0.2%
; 2
 
0.1%
2 2
 
0.1%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 19797
91.8%
ASCII 1757
 
8.1%
None 2
 
< 0.1%
CJK 2
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1632
 
8.2%
1425
 
7.2%
1245
 
6.3%
781
 
3.9%
664
 
3.4%
663
 
3.3%
611
 
3.1%
586
 
3.0%
467
 
2.4%
453
 
2.3%
Other values (369) 11270
56.9%
ASCII
ValueCountFrequency (%)
462
26.3%
) 385
21.9%
( 381
21.7%
, 101
 
5.7%
a 34
 
1.9%
e 31
 
1.8%
i 29
 
1.7%
s 26
 
1.5%
t 24
 
1.4%
o 23
 
1.3%
Other values (49) 261
14.9%
None
ValueCountFrequency (%)
2
100.0%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

본연구와의관계
Text

MISSING 

Distinct1778
Distinct (%)75.7%
Missing702
Missing (%)23.0%
Memory size24.0 KiB
2024-03-15T09:12:20.425775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length169
Median length102
Mean length19.154468
Min length1

Characters and Unicode

Total characters45013
Distinct characters697
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1568 ?
Unique (%)66.7%

Sample

1st row유전자재조합세포 이용 생물학적 분석법 동일 원리
2nd row해조류 유래 탄수화물저해제의 정제 및 산업화
3rd row응용
4th row연구결과참조
5th row지렁이 분변토를 이용한 상토 제조
ValueCountFrequency (%)
357
 
3.3%
활용 351
 
3.3%
279
 
2.6%
연구 221
 
2.1%
개발 139
 
1.3%
위한 95
 
0.9%
응용 86
 
0.8%
기술 78
 
0.7%
연구에서 77
 
0.7%
적용 70
 
0.6%
Other values (4583) 9024
83.7%
2024-03-15T09:12:22.418535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8508
 
18.9%
1331
 
3.0%
1321
 
2.9%
1252
 
2.8%
1024
 
2.3%
773
 
1.7%
690
 
1.5%
651
 
1.4%
581
 
1.3%
530
 
1.2%
Other values (687) 28352
63.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 34567
76.8%
Space Separator 8508
 
18.9%
Lowercase Letter 921
 
2.0%
Uppercase Letter 463
 
1.0%
Other Punctuation 223
 
0.5%
Decimal Number 198
 
0.4%
Open Punctuation 65
 
0.1%
Close Punctuation 64
 
0.1%
Math Symbol 2
 
< 0.1%
Initial Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1331
 
3.9%
1321
 
3.8%
1252
 
3.6%
1024
 
3.0%
773
 
2.2%
690
 
2.0%
651
 
1.9%
581
 
1.7%
530
 
1.5%
528
 
1.5%
Other values (611) 25886
74.9%
Lowercase Letter
ValueCountFrequency (%)
e 119
12.9%
a 85
 
9.2%
t 70
 
7.6%
i 70
 
7.6%
s 68
 
7.4%
o 64
 
6.9%
l 55
 
6.0%
r 53
 
5.8%
n 50
 
5.4%
m 48
 
5.2%
Other values (16) 239
26.0%
Uppercase Letter
ValueCountFrequency (%)
S 47
 
10.2%
A 41
 
8.9%
P 40
 
8.6%
C 38
 
8.2%
R 34
 
7.3%
T 33
 
7.1%
M 30
 
6.5%
D 25
 
5.4%
I 21
 
4.5%
G 20
 
4.3%
Other values (14) 134
28.9%
Decimal Number
ValueCountFrequency (%)
2 54
27.3%
1 49
24.7%
0 26
13.1%
3 17
 
8.6%
4 15
 
7.6%
9 14
 
7.1%
6 11
 
5.6%
8 9
 
4.5%
5 2
 
1.0%
7 1
 
0.5%
Other Punctuation
ValueCountFrequency (%)
, 137
61.4%
. 62
27.8%
/ 8
 
3.6%
& 5
 
2.2%
% 5
 
2.2%
· 2
 
0.9%
: 2
 
0.9%
# 1
 
0.4%
; 1
 
0.4%
Math Symbol
ValueCountFrequency (%)
~ 1
50.0%
+ 1
50.0%
Space Separator
ValueCountFrequency (%)
8508
100.0%
Open Punctuation
ValueCountFrequency (%)
( 65
100.0%
Close Punctuation
ValueCountFrequency (%)
) 64
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 34567
76.8%
Common 9062
 
20.1%
Latin 1384
 
3.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1331
 
3.9%
1321
 
3.8%
1252
 
3.6%
1024
 
3.0%
773
 
2.2%
690
 
2.0%
651
 
1.9%
581
 
1.7%
530
 
1.5%
528
 
1.5%
Other values (611) 25886
74.9%
Latin
ValueCountFrequency (%)
e 119
 
8.6%
a 85
 
6.1%
t 70
 
5.1%
i 70
 
5.1%
s 68
 
4.9%
o 64
 
4.6%
l 55
 
4.0%
r 53
 
3.8%
n 50
 
3.6%
m 48
 
3.5%
Other values (40) 702
50.7%
Common
ValueCountFrequency (%)
8508
93.9%
, 137
 
1.5%
( 65
 
0.7%
) 64
 
0.7%
. 62
 
0.7%
2 54
 
0.6%
1 49
 
0.5%
0 26
 
0.3%
3 17
 
0.2%
4 15
 
0.2%
Other values (16) 65
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 34472
76.6%
ASCII 10442
 
23.2%
Compat Jamo 95
 
0.2%
None 2
 
< 0.1%
Punctuation 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8508
81.5%
, 137
 
1.3%
e 119
 
1.1%
a 85
 
0.8%
t 70
 
0.7%
i 70
 
0.7%
s 68
 
0.7%
( 65
 
0.6%
) 64
 
0.6%
o 64
 
0.6%
Other values (63) 1192
 
11.4%
Hangul
ValueCountFrequency (%)
1331
 
3.9%
1321
 
3.8%
1252
 
3.6%
1024
 
3.0%
773
 
2.2%
690
 
2.0%
651
 
1.9%
581
 
1.7%
530
 
1.5%
528
 
1.5%
Other values (610) 25791
74.8%
Compat Jamo
ValueCountFrequency (%)
95
100.0%
None
ValueCountFrequency (%)
· 2
100.0%
Punctuation
ValueCountFrequency (%)
1
50.0%
1
50.0%

Interactions

2024-03-15T09:11:55.634444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T09:11:55.263511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T09:11:55.910053image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T09:11:55.446437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T09:12:22.704681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과제구분내역사업명기준년도성과활용년도
과제구분1.0000.9990.8250.799
내역사업명0.9991.0000.8040.718
기준년도0.8250.8041.0000.997
성과활용년도0.7990.7180.9971.000
2024-03-15T09:12:23.019653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준년도성과활용년도과제구분
기준년도1.0000.9940.460
성과활용년도0.9941.0000.426
과제구분0.4600.4261.000

Missing values

2024-03-15T09:11:56.262780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T09:11:56.708466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-15T09:11:57.069484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

과제관리번호과제구분내역사업명과제명주관기관과제시작년도과제종료년도기준년도성과활용년도연구사업명연구제목연구기관명본연구와의관계
0105150-3수산실용화기술개발<NA>해양환경내 다이옥신 오염 신속 검출 기술 개발(주)네오엔비즈2005-09-142007-09-1420092009국립환경과학원 자체 사업내분비계장애물질에 대한 생물학적 분석법 적용연구(II)국립환경연구원유전자재조합세포 이용 생물학적 분석법 동일 원리
1102050-3수산실용화기술개발<NA>수산 동·식물로부터 유전자 조작에 의한 천연 단백분해효소 저해제 (protease inhibitor)의 대량생산 및 수산제품 응용기술개발강릉원주대학교2002-10-152006-10-1320042004한일국제협력사업해조류 유래 새로운 효소저해제의 정제 및 새로운 기능성 물질로서의 이용강릉원주대학교해조류 유래 탄수화물저해제의 정제 및 산업화
2102050-3수산실용화기술개발<NA>수산 동·식물로부터 유전자 조작에 의한 천연 단백분해효소 저해제 (protease inhibitor)의 대량생산 및 수산제품 응용기술개발강릉원주대학교2002-10-152006-10-1320052005지역R&D클러스터구축사업기능성 저염 수산발효식품 개발강릉원주대학교응용
3103041-2현장애로기술개발과제<NA>Dairy beef의 비육기술 개발에 관한 연구농협중앙회 축산연구소2003-07-152005-07-1420092009연구용역과제비육우에너지사료의 원가절감을 위한 수용성유화제 활용검토농협중앙회 축산연구원연구결과참조
4105139-3현장적용기술개발<NA>지렁이를 이용한 유기성 자원의 처리와 이용연세대학교(원주)2005-04-252008-04-2420102010예비 기술창업자 육성사업지렁이 분변토를 이용한 채소 육묘용 유기상토의 제조연세대학교 산학협력단지렁이 분변토를 이용한 상토 제조
5307009-3고부가가치식품기술개발생명자원 부가가치 제고기술연의 수확후 저장·가공기술 개발 및 기능성의 임상적 연구목포대학교2007-05-302010-05-2920102010제조현장 녹색화 기술 개발사업연잎발효 농축액 특허기술을 활용하고 식품가공 부산물을 이용한 미생물 발효 기능성 사료개발농업회사법인연잎발효농축액 발효기술을
6103047-3현장애로기술개발과제<NA>유용 천연물(녹차, 들깻잎, 인산, 갈근, 톳, 김 등)을 첨가한 목장형 자연치즈 개발순천대학교2003-07-152006-07-1420102010산학연 공동기술개발사업쌀 막걸리와 요구르트 유산균을 이용한 기능성 발효 식품 상품화중소기업청총괄책임자
7202116-3첨단기술개발과제<NA>뇨에서 백혈구 조절인자를 생산하는 형질전환 가축(소)생산한경대학교2002-10-152005-10-1420092009바이오그린 21생리활성물질 분석한경대학교형질전환 가축으로부터 생산된 물질의 활성분석
8107009-3농생명산업기술개발생명자원 부가가치 제고기술한국 참다래 고당도 대과 골드 및 그린 신품종 육성과 조기 실용화전라남도농업기술원2007-05-302010-05-2920082008국내 신품종 전남권 우량묘 증식, 공급 및 재배단지 조성골드키위『해금』고접갱신 적기전남농업기술원개발신품종의 증식연구
9204137-3현장적용기술개발<NA>가금인플루엔자 유전자재조합 백신개발 및 방제연구건국대학교(서울)2004-05-252007-05-2420072007용역연구사업조류 분변시료를 이용한 조류인플루엔자 감염 철새종류 조사국립수의과학검역원기초자료
과제관리번호과제구분내역사업명과제명주관기관과제시작년도과제종료년도기준년도성과활용년도연구사업명연구제목연구기관명본연구와의관계
3042213001044CG500Golden Seed 프로젝트Golden Seed 프로젝트수출대상국 수요 맞춤형 감자 신품종 육성국립식량과학원 고령지농업연구소2013-07-252016-12-3120162017동남아시아 및 중국 동남부지역 적응 수출용 감자품종 육성감자 생산력검정본시험 선발국립식량과학원 고령지농업연구소본 연구
3043213002044CGU00Golden Seed 프로젝트Golden Seed 프로젝트토경재배용 블로키 타입 파프리카 품종 개발아시아종묘(주)2013-07-252016-12-3120162016채소종자사업단국내토경재배용품종개발농업회사법인 삼성종묘<NA>
3044213005044SBD20Golden Seed 프로젝트Golden Seed 프로젝트육질이 우수한 닭 종자개발을 위한 남부권역 교배조합 검정 시험경남과학기술대학교산학협력단2014-07-182016-12-3120162019GSP 종계신품종 토종닭 생산을 위한 교배조합 능력검정 시험 및 생산체계 확립경남과학기술대학교<NA>
3045213005042SB210Golden Seed 프로젝트Golden Seed 프로젝트수입대체 및 수출대비용 부계 우량계통 조성국립축산과학원 축산자원개발부2013-07-172017-05-0220172018두록 참조돈군을 통한 GSP 씨돼지 개량 효율 제고(213010052SB720)두록 참조돈군을 통한 GSP 씨돼지 개량 효율 제고국립축산과학원GSP 2단계 사업
3046213001041SB320Golden Seed 프로젝트Golden Seed 프로젝트지중해 연안지역 적응 조숙 중단립형 벼품종 개발경북대학교 산학협력단2013-07-252017-05-0220172017수출용 중ㆍ장립형 다수성 벼 품종 개발 및 수출기반 조성지중해 연안 및 고위도 지역 적응 수출용 다수성 중단립종 벼 품종 개발경북대학교 산학협력단2단계 위탁과제로 수행
3047213003043SBY10Golden Seed 프로젝트Golden Seed 프로젝트표고의 주요형질, 내병성 및 버섯발이 온도 판별 관련 분자마커 개발충북대학교 산학협력단2015-06-192017-02-2820162017품종보호, 수입대체용 표고 신품종 개발을 위한 분자마커품종보호, 수입대체용 표고 신품종 개발을 위한 분자마커충북대학교<NA>
3048213005044SBB10Golden Seed 프로젝트Golden Seed 프로젝트수출 확대 저해요인 실시간 파악분석 및 해결책 제시건국대학교 산학협력단2013-07-252016-12-3120162018GSP 프로젝트토종닭 해외 현지 보급 확대를 위한 기관 및 대학 협력체계 구축한국축산경제연구원2단계 연구
3049213003044SBF20Golden Seed 프로젝트Golden Seed 프로젝트동남아 청고병 및 TYLCV 복합내병계 토마토 품종 육성아시아종묘(주)2013-07-252016-12-3120162018차세대바이오그린21_농생물게놈활용연구사업토마토 육종집단의 형질특성검정 및 유전체육종 기반 구축 및 활용아시아종묘<NA>
3050213008051SB110Golden Seed 프로젝트Golden Seed 프로젝트수산종자 장거리 수송기술 개발 및 수출 경쟁력 강화국립수산과학원2017-01-012021-12-3120212023수산종자산업 디지털 혁신 기술개발 사업수산종자 검인증 기술개발한국수산식품안전연구소종자 관리기준 연구결과를 종자 생산이력 관리에 적용 연구
3051213010055SB510Golden Seed 프로젝트Golden Seed 프로젝트GSP 참여종돈장 통합육종시스템 확립 및 적용정피엔씨연구소2017-01-012021-12-3120212023한돈 농가별 맞춤형 종돈 공급체계 실증사업을 위한 연구한돈 농가별 맞춤형 종돈 공급체계 실증사업을 위한 연구(주)정피엔씨연구소추가 연구 활용

Duplicate rows

Most frequently occurring

과제관리번호과제구분내역사업명과제명주관기관과제시작년도과제종료년도기준년도성과활용년도연구사업명연구제목연구기관명본연구와의관계# duplicates
8300011-3기획연구<NA>종계의 생산성 향상을 위한 기술 개발고려대학교(서울)2000-07-212003-07-2020002000종계의 생산성 향상종계의 생산성 향상고려대학교<NA>4
13397004-3기획연구<NA>김치의 고품질 상품화 기술 개발한국식품연구원1997-10-082000-10-0720002000<NA><NA><NA><NA>4
2109158-3고부가가치식품기술개발기능성강화식품폐계육을 이용한 조미소재 개발 및 펩티드 함유 고부가가치 제품 산업화진주산업대학교2009-04-102012-04-0920122015농촌진흥청식육·가공품판매업 활성화를 위한 제품 제조법 및 위생관리 기술개발경남과학기술대학교 산학협력단<NA>3
3109158-3고부가가치식품기술개발기능성강화식품폐계육을 이용한 조미소재 개발 및 펩티드 함유 고부가가치 제품 산업화진주산업대학교2009-04-102012-04-0920122015한국연구재단 중점연구소 사업탐색된 천연 기능성 소재를 활용한 육제품 개발경남과학기술대학교 산학협력단<NA>3
0107088-4농생명산업기술개발생명자원 부가가치 제고기술수정란이식 기법에 의한 한우의 초고도 유전형질 선발 &#8231; 증식엠트렌(주)2007-05-302011-05-2920112016농림생명산업기술개발유전자 가위 기술을 활용한 락토페린 생성 젖소 개발광개토한우농업법인(주)락토페린 생성 형질 송아지 생산에 수정란이식기술 확용2
1107120-1수산실용화기술개발<NA>새우 양식에 피해를주는 바이러스및 기타 병원균 제거 양식수 제조 시스템 개발한국과학기술연구원2007-09-202008-09-1920092009수산특정THM을 생성하지 않는 녹색 친환경 수산 양식수 제조 시스템 개발 및 현장 적용 연구한국과학기술연구원후속연구2
4120055-1농축산자재산업화기술개발에너지절감자재수중 플라즈마 방전 방식을 이용한 고효율 농업용 보일러 개발(주)지에이(GA)2020-04-292021-04-2820212021기술사업화지원사업수중 플라즈마 방전 방식 농업용 보일러 상품화 기술 개발(주)지에이후속 상품화 기술개발과제2
5195069-2현장애로기술개발과제<NA>공정육묘 온실의 표준모델과 자동화 시스템 개발과 활용기술 개발경상대학교1995-12-301997-12-2919991999<NA>온실설계,경량형 온실표준화,단동생력화 온실<NA><NA>2
6203130-3첨단기술개발과제<NA>소경목을 이용한 수도작용 지효성비료 개발강원대학교2003-07-152006-12-3120042004신진연구소경목을 이용한 수도작용 지효성비료 개발강원대학교신진연구학생2
7295154-5첨단기술개발과제<NA>식물공장의 최적 배양액 관리 자동화 시스템 개발서울시립대학교1995-12-212000-12-2020002000생물반응기를 이용한 고품질 국화묘의 대량생산기술 개발현장애로서울시립대학교<NA>2