Overview

Dataset statistics

Number of variables3
Number of observations772
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory18.2 KiB
Average record size in memory24.2 B

Variable types

Text3

Dataset

Description농림축산식품부 농림축산검역본부에서 수행하고 있는 농림축산검역검사기술개발사업의 연구 성과에 대하여 수행년도, 사업구분, 과제분류, 과제명, 연구책임자 등에 대한 정보를 제공하고 있음
URLhttps://www.data.go.kr/data/3072967/fileData.do

Alerts

과제번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 09:39:56.034398
Analysis finished2023-12-12 09:39:56.621964
Duration0.59 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

과제번호
Text

UNIQUE 

Distinct772
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size6.2 KiB
2023-12-12T18:39:56.763510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length20
Mean length18.63342
Min length8

Characters and Unicode

Total characters14385
Distinct characters30
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique772 ?
Unique (%)100.0%

Sample

1st rowB-FS06-2005-06-01
2nd rowB-AD16-2004-06-03
3rd rowF-AD16-2006-06-04
4th rowM-AD18-2004-05-04
5th rowB-AD14-2005-06-01
ValueCountFrequency (%)
b-fs06-2005-06-01 1
 
0.1%
m-1541778-2013-14-01 1
 
0.1%
i-1541766-2013-15-01 1
 
0.1%
b-1541785-2013-15-03 1
 
0.1%
b-1543069-2014-15-02 1
 
0.1%
b-1543073-2014-15-01 1
 
0.1%
b-1543073-2014-15-02 1
 
0.1%
b-1543082-2015-15-01 1
 
0.1%
b-1543083-2014-15-01 1
 
0.1%
b-1543084-2014-15-02 1
 
0.1%
Other values (763) 763
98.7%
2023-12-12T18:39:57.076706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 3081
21.4%
1 2675
18.6%
0 2439
17.0%
2 1189
 
8.3%
4 714
 
5.0%
5 679
 
4.7%
3 631
 
4.4%
7 500
 
3.5%
8 490
 
3.4%
6 318
 
2.2%
Other values (20) 1669
11.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 9836
68.4%
Dash Punctuation 3081
 
21.4%
Uppercase Letter 1461
 
10.2%
Lowercase Letter 5
 
< 0.1%
Connector Punctuation 1
 
< 0.1%
Space Separator 1
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A 268
18.3%
D 268
18.3%
B 259
17.7%
Z 225
15.4%
F 119
8.1%
S 77
 
5.3%
N 66
 
4.5%
M 61
 
4.2%
I 44
 
3.0%
C 43
 
2.9%
Other values (2) 31
 
2.1%
Decimal Number
ValueCountFrequency (%)
1 2675
27.2%
0 2439
24.8%
2 1189
12.1%
4 714
 
7.3%
5 679
 
6.9%
3 631
 
6.4%
7 500
 
5.1%
8 490
 
5.0%
6 318
 
3.2%
9 201
 
2.0%
Lowercase Letter
ValueCountFrequency (%)
n 1
20.0%
v 1
20.0%
r 1
20.0%
q 1
20.0%
s 1
20.0%
Dash Punctuation
ValueCountFrequency (%)
- 3081
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 12919
89.8%
Latin 1466
 
10.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
A 268
18.3%
D 268
18.3%
B 259
17.7%
Z 225
15.3%
F 119
8.1%
S 77
 
5.3%
N 66
 
4.5%
M 61
 
4.2%
I 44
 
3.0%
C 43
 
2.9%
Other values (7) 36
 
2.5%
Common
ValueCountFrequency (%)
- 3081
23.8%
1 2675
20.7%
0 2439
18.9%
2 1189
 
9.2%
4 714
 
5.5%
5 679
 
5.3%
3 631
 
4.9%
7 500
 
3.9%
8 490
 
3.8%
6 318
 
2.5%
Other values (3) 203
 
1.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 14385
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 3081
21.4%
1 2675
18.6%
0 2439
17.0%
2 1189
 
8.3%
4 714
 
5.0%
5 679
 
4.7%
3 631
 
4.4%
7 500
 
3.5%
8 490
 
3.4%
6 318
 
2.2%
Other values (20) 1669
11.6%
Distinct770
Distinct (%)99.7%
Missing0
Missing (%)0.0%
Memory size6.2 KiB
2023-12-12T18:39:57.408174image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length81
Median length57
Mean length33.415803
Min length6

Characters and Unicode

Total characters25797
Distinct characters528
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique769 ?
Unique (%)99.6%

Sample

1st row동물용 생약제제의 기준 및 시험방법 설정에 관한 연구
2nd row웨스트나일열 항체 진단법 개발 연구
3rd row웨스트나일열 바이러스 비구조단백질의 기능분석 연구(기초)
4th row돼지오제스키병 유전자재조합 생독백신의 산업화연구
5th row돼지인플루엔자 혈청학적조사 및 분리바이러스의 항원성 분석 연구
ValueCountFrequency (%)
425
 
7.1%
연구 297
 
4.9%
개발 198
 
3.3%
관한 126
 
2.1%
국내 85
 
1.4%
조사 77
 
1.3%
이용한 72
 
1.2%
위한 63
 
1.0%
대한 57
 
0.9%
구제역 52
 
0.9%
Other values (2331) 4554
75.8%
2023-12-12T18:39:57.983741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5298
 
20.5%
530
 
2.1%
426
 
1.7%
383
 
1.5%
381
 
1.5%
364
 
1.4%
341
 
1.3%
341
 
1.3%
341
 
1.3%
323
 
1.3%
Other values (518) 17069
66.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 18406
71.3%
Space Separator 5298
 
20.5%
Lowercase Letter 1000
 
3.9%
Uppercase Letter 637
 
2.5%
Open Punctuation 131
 
0.5%
Close Punctuation 131
 
0.5%
Decimal Number 88
 
0.3%
Other Punctuation 86
 
0.3%
Dash Punctuation 16
 
0.1%
Modifier Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
530
 
2.9%
426
 
2.3%
383
 
2.1%
381
 
2.1%
364
 
2.0%
341
 
1.9%
341
 
1.9%
341
 
1.9%
323
 
1.8%
314
 
1.7%
Other values (448) 14662
79.7%
Lowercase Letter
ValueCountFrequency (%)
i 121
12.1%
e 95
 
9.5%
a 92
 
9.2%
s 77
 
7.7%
r 74
 
7.4%
o 72
 
7.2%
p 55
 
5.5%
t 55
 
5.5%
l 54
 
5.4%
c 51
 
5.1%
Other values (15) 254
25.4%
Uppercase Letter
ValueCountFrequency (%)
A 75
11.8%
P 72
11.3%
S 66
 
10.4%
R 49
 
7.7%
D 38
 
6.0%
E 37
 
5.8%
C 37
 
5.8%
I 36
 
5.7%
M 31
 
4.9%
V 31
 
4.9%
Other values (14) 165
25.9%
Decimal Number
ValueCountFrequency (%)
1 24
27.3%
2 20
22.7%
3 13
14.8%
0 9
 
10.2%
5 5
 
5.7%
6 5
 
5.7%
4 5
 
5.7%
8 3
 
3.4%
7 2
 
2.3%
9 2
 
2.3%
Other Punctuation
ValueCountFrequency (%)
, 55
64.0%
· 13
 
15.1%
/ 10
 
11.6%
. 5
 
5.8%
: 3
 
3.5%
Space Separator
ValueCountFrequency (%)
5298
100.0%
Open Punctuation
ValueCountFrequency (%)
( 131
100.0%
Close Punctuation
ValueCountFrequency (%)
) 131
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 16
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 18406
71.3%
Common 5754
 
22.3%
Latin 1637
 
6.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
530
 
2.9%
426
 
2.3%
383
 
2.1%
381
 
2.1%
364
 
2.0%
341
 
1.9%
341
 
1.9%
341
 
1.9%
323
 
1.8%
314
 
1.7%
Other values (448) 14662
79.7%
Latin
ValueCountFrequency (%)
i 121
 
7.4%
e 95
 
5.8%
a 92
 
5.6%
s 77
 
4.7%
A 75
 
4.6%
r 74
 
4.5%
P 72
 
4.4%
o 72
 
4.4%
S 66
 
4.0%
p 55
 
3.4%
Other values (39) 838
51.2%
Common
ValueCountFrequency (%)
5298
92.1%
( 131
 
2.3%
) 131
 
2.3%
, 55
 
1.0%
1 24
 
0.4%
2 20
 
0.3%
- 16
 
0.3%
3 13
 
0.2%
· 13
 
0.2%
/ 10
 
0.2%
Other values (11) 43
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 18405
71.3%
ASCII 7378
28.6%
None 13
 
0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5298
71.8%
( 131
 
1.8%
) 131
 
1.8%
i 121
 
1.6%
e 95
 
1.3%
a 92
 
1.2%
s 77
 
1.0%
A 75
 
1.0%
r 74
 
1.0%
P 72
 
1.0%
Other values (59) 1212
 
16.4%
Hangul
ValueCountFrequency (%)
530
 
2.9%
426
 
2.3%
383
 
2.1%
381
 
2.1%
364
 
2.0%
341
 
1.9%
341
 
1.9%
341
 
1.9%
323
 
1.8%
314
 
1.7%
Other values (447) 14661
79.7%
None
ValueCountFrequency (%)
· 13
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Distinct291
Distinct (%)37.7%
Missing0
Missing (%)0.0%
Memory size6.2 KiB
2023-12-12T18:39:58.388083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length3
Mean length2.9870466
Min length2

Characters and Unicode

Total characters2306
Distinct characters156
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique136 ?
Unique (%)17.6%

Sample

1st row이명헌
2nd row나진주
3rd row나진주
4th row양동군
5th row최은진
ValueCountFrequency (%)
강승원 15
 
1.9%
양동군 14
 
1.8%
윤하정 11
 
1.4%
이광직 11
 
1.4%
조윤상 11
 
1.4%
최강석 11
 
1.4%
박종현 10
 
1.3%
이광녕 10
 
1.3%
안동준 10
 
1.3%
강환구 10
 
1.3%
Other values (280) 659
85.4%
2023-12-12T18:39:59.052008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
117
 
5.1%
114
 
4.9%
79
 
3.4%
65
 
2.8%
61
 
2.6%
56
 
2.4%
55
 
2.4%
54
 
2.3%
50
 
2.2%
47
 
2.0%
Other values (146) 1608
69.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2297
99.6%
Space Separator 9
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
117
 
5.1%
114
 
5.0%
79
 
3.4%
65
 
2.8%
61
 
2.7%
56
 
2.4%
55
 
2.4%
54
 
2.4%
50
 
2.2%
47
 
2.0%
Other values (145) 1599
69.6%
Space Separator
ValueCountFrequency (%)
9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2297
99.6%
Common 9
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
117
 
5.1%
114
 
5.0%
79
 
3.4%
65
 
2.8%
61
 
2.7%
56
 
2.4%
55
 
2.4%
54
 
2.4%
50
 
2.2%
47
 
2.0%
Other values (145) 1599
69.6%
Common
ValueCountFrequency (%)
9
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2297
99.6%
ASCII 9
 
0.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
117
 
5.1%
114
 
5.0%
79
 
3.4%
65
 
2.8%
61
 
2.7%
56
 
2.4%
55
 
2.4%
54
 
2.4%
50
 
2.2%
47
 
2.0%
Other values (145) 1599
69.6%
ASCII
ValueCountFrequency (%)
9
100.0%

Missing values

2023-12-12T18:39:56.510978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:39:56.588624image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

과제번호과제명연구책임자
0B-FS06-2005-06-01동물용 생약제제의 기준 및 시험방법 설정에 관한 연구이명헌
1B-AD16-2004-06-03웨스트나일열 항체 진단법 개발 연구나진주
2F-AD16-2006-06-04웨스트나일열 바이러스 비구조단백질의 기능분석 연구(기초)나진주
3M-AD18-2004-05-04돼지오제스키병 유전자재조합 생독백신의 산업화연구양동군
4B-AD14-2005-06-01돼지인플루엔자 혈청학적조사 및 분리바이러스의 항원성 분석 연구최은진
5B-AD14-2006-08-02PRRSV 바이러스의 역상유전자시스템을 이용한 병원성 분석 및 재조합 백신용 바이러스 작성송재영
6M-AD14-2005-07-02돼지콜레라 및 이유후전신소모성증후군(PMWS) 관련 바이러스 원인체 조사 및 유전자 분석송재영
7N-AD14-2006-06-06제주지역 PRRS감염실태조사(기본)최은진
8N-AD15-2003-07-02닭전염성기관지염 및 조류 인플루엔자 유전자 모니터링전우진
9P-AD15-2006-06-03진단액 생산 및 검정기술 표준화 연구: 조류질병(기획)권준헌
과제번호과제명연구책임자
762N-1543386-2017-19-01구제역 혈청 은행 구축표현미
763N-1543386-2018-19-01파일롯(100리터) 규모 장비를 이용한 구제역 백신항원 제조공정 표준화 및 항원비축고영준
764B-1543068-2018-19-01야생조류에서 HPAI 바이러스 검출시 가금사육 농장의 발생위험도 평가모델 개발윤하정
765B-1543073-2018-19-02거점소독시설 유효성 평가에 관한 연구정우석
766B-1543073-2018-19-03소독제 효력시험지침 개선에 관한 연구김영욱
767B-1543418-2018-19-01조류인플루엔자 정밀진단 효율화 연구이광녕
768I-1543068-2018-19-01한-러 야생조류 위치추적기 부착 및 이동경로 추적 연구윤하정
769B-1543081-2019-20-06가축 항생제 사용 가이드라인 및 교육 컨텐츠 개발임숙경
770B-1543084-2016-18-01국내유행 닭전염성기관지염 바이러스 변이유형 분석 및 특성연구이지연
771N-1543069-2015-99-03소 아보바이러스 매개 모기검색을 통한 경보시스템 운영이경기