Overview

Dataset statistics

Number of variables7
Number of observations343
Missing cells1
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory18.9 KiB
Average record size in memory56.4 B

Variable types

Text6
DateTime1

Dataset

Description수행년도, 사업구분, 과제분류, 과제명, 연구책임자 등
Author농림축산검역본부
URLhttps://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20220214000000001864

Alerts

최종보고서번호 has unique valuesUnique
과제계획서번호 has unique valuesUnique
연차계획서번호 has unique valuesUnique

Reproduction

Analysis started2023-12-11 03:09:44.287982
Analysis finished2023-12-11 03:09:45.222062
Duration0.93 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct343
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
2023-12-11T12:09:45.393121image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length23
Mean length23.247813
Min length14

Characters and Unicode

Total characters7974
Distinct characters30
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique343 ?
Unique (%)100.0%

Sample

1st row2007B-FS06-2005-06-0101
2nd row2007F-AD16-2006-06-0401
3rd row2007M-AD18-2004-05-0401
4th row2007B-AD16-2004-06-0301
5th row2007M-AD14-2005-07-0201
ValueCountFrequency (%)
2007b-fs06-2005-06-0101 1
 
0.3%
2011b-fs09-2010-11-0202 1
 
0.3%
2011z-ad14-2011-11-0201 1
 
0.3%
2011z-ad13-2010-11-0302 1
 
0.3%
2011z-ad14-2011-11-0101 1
 
0.3%
2011z-ad13-2011-11-0402 1
 
0.3%
2011z-ad20-2011-11-0101 1
 
0.3%
2011b-ad14-2011-11-0201 1
 
0.3%
2011z-ad13-2010-11-0502 1
 
0.3%
2010b-fs09-2009-10-0202 1
 
0.3%
Other values (334) 334
97.1%
2023-12-11T12:09:45.891069image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 2167
27.2%
- 1365
17.1%
1 1286
16.1%
2 1107
13.9%
7 244
 
3.1%
A 237
 
3.0%
D 237
 
3.0%
8 173
 
2.2%
9 147
 
1.8%
3 145
 
1.8%
Other values (20) 866
 
10.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 5638
70.7%
Dash Punctuation 1365
 
17.1%
Uppercase Letter 964
 
12.1%
Lowercase Letter 5
 
0.1%
Connector Punctuation 1
 
< 0.1%
Space Separator 1
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A 237
24.6%
D 237
24.6%
Z 110
11.4%
F 101
10.5%
B 98
10.2%
S 74
 
7.7%
M 33
 
3.4%
N 22
 
2.3%
C 20
 
2.1%
P 17
 
1.8%
Other values (2) 15
 
1.6%
Decimal Number
ValueCountFrequency (%)
0 2167
38.4%
1 1286
22.8%
2 1107
19.6%
7 244
 
4.3%
8 173
 
3.1%
9 147
 
2.6%
3 145
 
2.6%
4 125
 
2.2%
6 124
 
2.2%
5 120
 
2.1%
Lowercase Letter
ValueCountFrequency (%)
n 1
20.0%
v 1
20.0%
r 1
20.0%
q 1
20.0%
s 1
20.0%
Dash Punctuation
ValueCountFrequency (%)
- 1365
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 7005
87.8%
Latin 969
 
12.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
A 237
24.5%
D 237
24.5%
Z 110
11.4%
F 101
10.4%
B 98
10.1%
S 74
 
7.6%
M 33
 
3.4%
N 22
 
2.3%
C 20
 
2.1%
P 17
 
1.8%
Other values (7) 20
 
2.1%
Common
ValueCountFrequency (%)
0 2167
30.9%
- 1365
19.5%
1 1286
18.4%
2 1107
15.8%
7 244
 
3.5%
8 173
 
2.5%
9 147
 
2.1%
3 145
 
2.1%
4 125
 
1.8%
6 124
 
1.8%
Other values (3) 122
 
1.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7974
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 2167
27.2%
- 1365
17.1%
1 1286
16.1%
2 1107
13.9%
7 244
 
3.1%
A 237
 
3.0%
D 237
 
3.0%
8 173
 
2.2%
9 147
 
1.8%
3 145
 
1.8%
Other values (20) 866
 
10.9%
Distinct146
Distinct (%)42.6%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
Minimum2007-05-22 00:00:00
Maximum2013-10-10 00:00:00
2023-12-11T12:09:46.380571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:09:46.587811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct343
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
2023-12-11T12:09:46.885511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length17
Mean length17.247813
Min length8

Characters and Unicode

Total characters5916
Distinct characters30
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique343 ?
Unique (%)100.0%

Sample

1st rowB-FS06-2005-06-01
2nd rowF-AD16-2006-06-04
3rd rowM-AD18-2004-05-04
4th rowB-AD16-2004-06-03
5th rowM-AD14-2005-07-02
ValueCountFrequency (%)
b-fs06-2005-06-01 1
 
0.3%
b-fs09-2010-11-02 1
 
0.3%
z-ad14-2011-11-02 1
 
0.3%
z-ad13-2010-11-03 1
 
0.3%
z-ad14-2011-11-01 1
 
0.3%
z-ad13-2011-11-04 1
 
0.3%
z-ad20-2011-11-01 1
 
0.3%
b-ad14-2011-11-02 1
 
0.3%
z-ad13-2010-11-05 1
 
0.3%
b-fs09-2009-10-02 1
 
0.3%
Other values (334) 334
97.1%
2023-12-11T12:09:47.359772image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 1365
23.1%
0 1271
21.5%
1 907
15.3%
2 548
9.3%
A 237
 
4.0%
D 237
 
4.0%
7 169
 
2.9%
6 120
 
2.0%
4 119
 
2.0%
9 118
 
2.0%
Other values (20) 825
13.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3580
60.5%
Dash Punctuation 1365
 
23.1%
Uppercase Letter 964
 
16.3%
Lowercase Letter 5
 
0.1%
Connector Punctuation 1
 
< 0.1%
Space Separator 1
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A 237
24.6%
D 237
24.6%
Z 110
11.4%
F 101
10.5%
B 98
10.2%
S 74
 
7.7%
M 33
 
3.4%
N 22
 
2.3%
C 20
 
2.1%
P 17
 
1.8%
Other values (2) 15
 
1.6%
Decimal Number
ValueCountFrequency (%)
0 1271
35.5%
1 907
25.3%
2 548
15.3%
7 169
 
4.7%
6 120
 
3.4%
4 119
 
3.3%
9 118
 
3.3%
5 117
 
3.3%
8 116
 
3.2%
3 95
 
2.7%
Lowercase Letter
ValueCountFrequency (%)
n 1
20.0%
v 1
20.0%
r 1
20.0%
q 1
20.0%
s 1
20.0%
Dash Punctuation
ValueCountFrequency (%)
- 1365
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4947
83.6%
Latin 969
 
16.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
A 237
24.5%
D 237
24.5%
Z 110
11.4%
F 101
10.4%
B 98
10.1%
S 74
 
7.6%
M 33
 
3.4%
N 22
 
2.3%
C 20
 
2.1%
P 17
 
1.8%
Other values (7) 20
 
2.1%
Common
ValueCountFrequency (%)
- 1365
27.6%
0 1271
25.7%
1 907
18.3%
2 548
11.1%
7 169
 
3.4%
6 120
 
2.4%
4 119
 
2.4%
9 118
 
2.4%
5 117
 
2.4%
8 116
 
2.3%
Other values (3) 97
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5916
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 1365
23.1%
0 1271
21.5%
1 907
15.3%
2 548
9.3%
A 237
 
4.0%
D 237
 
4.0%
7 169
 
2.9%
6 120
 
2.0%
4 119
 
2.0%
9 118
 
2.0%
Other values (20) 825
13.9%
Distinct343
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
2023-12-11T12:09:47.696982image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length19
Mean length19.247813
Min length10

Characters and Unicode

Total characters6602
Distinct characters30
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique343 ?
Unique (%)100.0%

Sample

1st rowB-FS06-2005-06-0101
2nd rowF-AD16-2006-06-0401
3rd rowM-AD18-2004-05-0401
4th rowB-AD16-2004-06-0301
5th rowM-AD14-2005-07-0201
ValueCountFrequency (%)
b-fs06-2005-06-0101 1
 
0.3%
b-fs09-2010-11-0202 1
 
0.3%
z-ad14-2011-11-0201 1
 
0.3%
z-ad13-2010-11-0302 1
 
0.3%
z-ad14-2011-11-0101 1
 
0.3%
z-ad13-2011-11-0402 1
 
0.3%
z-ad20-2011-11-0101 1
 
0.3%
b-ad14-2011-11-0201 1
 
0.3%
z-ad13-2010-11-0502 1
 
0.3%
b-fs09-2009-10-0202 1
 
0.3%
Other values (334) 334
97.1%
2023-12-11T12:09:48.179145image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 1614
24.4%
- 1365
20.7%
1 1044
15.8%
2 693
10.5%
A 237
 
3.6%
D 237
 
3.6%
7 169
 
2.6%
3 145
 
2.2%
4 125
 
1.9%
6 122
 
1.8%
Other values (20) 851
12.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4266
64.6%
Dash Punctuation 1365
 
20.7%
Uppercase Letter 964
 
14.6%
Lowercase Letter 5
 
0.1%
Connector Punctuation 1
 
< 0.1%
Space Separator 1
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A 237
24.6%
D 237
24.6%
Z 110
11.4%
F 101
10.5%
B 98
10.2%
S 74
 
7.7%
M 33
 
3.4%
N 22
 
2.3%
C 20
 
2.1%
P 17
 
1.8%
Other values (2) 15
 
1.6%
Decimal Number
ValueCountFrequency (%)
0 1614
37.8%
1 1044
24.5%
2 693
16.2%
7 169
 
4.0%
3 145
 
3.4%
4 125
 
2.9%
6 122
 
2.9%
5 120
 
2.8%
9 118
 
2.8%
8 116
 
2.7%
Lowercase Letter
ValueCountFrequency (%)
n 1
20.0%
v 1
20.0%
r 1
20.0%
q 1
20.0%
s 1
20.0%
Dash Punctuation
ValueCountFrequency (%)
- 1365
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5633
85.3%
Latin 969
 
14.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
A 237
24.5%
D 237
24.5%
Z 110
11.4%
F 101
10.4%
B 98
10.1%
S 74
 
7.6%
M 33
 
3.4%
N 22
 
2.3%
C 20
 
2.1%
P 17
 
1.8%
Other values (7) 20
 
2.1%
Common
ValueCountFrequency (%)
0 1614
28.7%
- 1365
24.2%
1 1044
18.5%
2 693
12.3%
7 169
 
3.0%
3 145
 
2.6%
4 125
 
2.2%
6 122
 
2.2%
5 120
 
2.1%
9 118
 
2.1%
Other values (3) 118
 
2.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 6602
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 1614
24.4%
- 1365
20.7%
1 1044
15.8%
2 693
10.5%
A 237
 
3.6%
D 237
 
3.6%
7 169
 
2.6%
3 145
 
2.2%
4 125
 
1.9%
6 122
 
1.8%
Other values (20) 851
12.9%
Distinct340
Distinct (%)99.1%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
2023-12-11T12:09:48.549875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length79
Median length52
Mean length33.699708
Min length12

Characters and Unicode

Total characters11559
Distinct characters450
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique337 ?
Unique (%)98.3%

Sample

1st row동물용 생약제제의 기준 및 시험방법 설정에 관한 연구
2nd row웨스트나일열 바이러스 비구조단백질의 기능분석 연구(기초)
3rd row돼지오제스키병 유전자재조합 생독백신의 산업화연구
4th row웨스트나일열 항체 진단법 개발 연구
5th row돼지콜레라 및 이유후전신소모성증후군(PMWS) 관련 바이러스 원인체 조사 및 유전자 분석
ValueCountFrequency (%)
189
 
7.1%
연구 133
 
5.0%
개발 79
 
3.0%
관한 57
 
2.2%
국내 38
 
1.4%
조사 33
 
1.2%
이용한 33
 
1.2%
바이러스 28
 
1.1%
대한 28
 
1.1%
위한 24
 
0.9%
Other values (1259) 2004
75.7%
2023-12-11T12:09:49.172727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2342
 
20.3%
238
 
2.1%
191
 
1.7%
190
 
1.6%
190
 
1.6%
161
 
1.4%
160
 
1.4%
153
 
1.3%
145
 
1.3%
137
 
1.2%
Other values (440) 7652
66.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8197
70.9%
Space Separator 2342
 
20.3%
Lowercase Letter 579
 
5.0%
Uppercase Letter 245
 
2.1%
Open Punctuation 60
 
0.5%
Close Punctuation 60
 
0.5%
Other Punctuation 37
 
0.3%
Decimal Number 27
 
0.2%
Dash Punctuation 12
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
238
 
2.9%
191
 
2.3%
190
 
2.3%
190
 
2.3%
161
 
2.0%
160
 
2.0%
153
 
1.9%
145
 
1.8%
137
 
1.7%
129
 
1.6%
Other values (378) 6503
79.3%
Lowercase Letter
ValueCountFrequency (%)
i 70
12.1%
e 56
9.7%
a 50
 
8.6%
o 48
 
8.3%
s 46
 
7.9%
r 44
 
7.6%
p 35
 
6.0%
t 33
 
5.7%
l 31
 
5.4%
c 30
 
5.2%
Other values (13) 136
23.5%
Uppercase Letter
ValueCountFrequency (%)
A 31
12.7%
P 26
10.6%
D 22
 
9.0%
S 21
 
8.6%
C 18
 
7.3%
R 17
 
6.9%
N 15
 
6.1%
V 15
 
6.1%
E 13
 
5.3%
I 10
 
4.1%
Other values (12) 57
23.3%
Decimal Number
ValueCountFrequency (%)
2 10
37.0%
1 8
29.6%
3 3
 
11.1%
0 2
 
7.4%
8 1
 
3.7%
5 1
 
3.7%
9 1
 
3.7%
4 1
 
3.7%
Other Punctuation
ValueCountFrequency (%)
, 22
59.5%
· 7
 
18.9%
/ 5
 
13.5%
: 2
 
5.4%
. 1
 
2.7%
Space Separator
ValueCountFrequency (%)
2342
100.0%
Open Punctuation
ValueCountFrequency (%)
( 60
100.0%
Close Punctuation
ValueCountFrequency (%)
) 60
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8197
70.9%
Common 2538
 
22.0%
Latin 824
 
7.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
238
 
2.9%
191
 
2.3%
190
 
2.3%
190
 
2.3%
161
 
2.0%
160
 
2.0%
153
 
1.9%
145
 
1.8%
137
 
1.7%
129
 
1.6%
Other values (378) 6503
79.3%
Latin
ValueCountFrequency (%)
i 70
 
8.5%
e 56
 
6.8%
a 50
 
6.1%
o 48
 
5.8%
s 46
 
5.6%
r 44
 
5.3%
p 35
 
4.2%
t 33
 
4.0%
A 31
 
3.8%
l 31
 
3.8%
Other values (35) 380
46.1%
Common
ValueCountFrequency (%)
2342
92.3%
( 60
 
2.4%
) 60
 
2.4%
, 22
 
0.9%
- 12
 
0.5%
2 10
 
0.4%
1 8
 
0.3%
· 7
 
0.3%
/ 5
 
0.2%
3 3
 
0.1%
Other values (7) 9
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8197
70.9%
ASCII 3355
29.0%
None 7
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2342
69.8%
i 70
 
2.1%
( 60
 
1.8%
) 60
 
1.8%
e 56
 
1.7%
a 50
 
1.5%
o 48
 
1.4%
s 46
 
1.4%
r 44
 
1.3%
p 35
 
1.0%
Other values (51) 544
 
16.2%
Hangul
ValueCountFrequency (%)
238
 
2.9%
191
 
2.3%
190
 
2.3%
190
 
2.3%
161
 
2.0%
160
 
2.0%
153
 
1.9%
145
 
1.8%
137
 
1.7%
129
 
1.6%
Other values (378) 6503
79.3%
None
ValueCountFrequency (%)
· 7
100.0%
Distinct169
Distinct (%)49.3%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
2023-12-11T12:09:49.642868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length2.9737609
Min length2

Characters and Unicode

Total characters1020
Distinct characters125
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique84 ?
Unique (%)24.5%

Sample

1st row이명헌
2nd row나진주
3rd row양동군
4th row나진주
5th row송재영
ValueCountFrequency (%)
강승원 10
 
2.9%
현방훈 9
 
2.6%
조성준 6
 
1.7%
정상희 5
 
1.5%
정병열 5
 
1.5%
박선일 5
 
1.5%
강환구 5
 
1.5%
최강석 5
 
1.5%
윤하정 5
 
1.5%
이윤정 5
 
1.5%
Other values (159) 283
82.5%
2023-12-11T12:09:50.306238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
46
 
4.5%
35
 
3.4%
35
 
3.4%
32
 
3.1%
29
 
2.8%
27
 
2.6%
26
 
2.5%
25
 
2.5%
25
 
2.5%
22
 
2.2%
Other values (115) 718
70.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1020
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
46
 
4.5%
35
 
3.4%
35
 
3.4%
32
 
3.1%
29
 
2.8%
27
 
2.6%
26
 
2.5%
25
 
2.5%
25
 
2.5%
22
 
2.2%
Other values (115) 718
70.4%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1020
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
46
 
4.5%
35
 
3.4%
35
 
3.4%
32
 
3.1%
29
 
2.8%
27
 
2.6%
26
 
2.5%
25
 
2.5%
25
 
2.5%
22
 
2.2%
Other values (115) 718
70.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1020
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
46
 
4.5%
35
 
3.4%
35
 
3.4%
32
 
3.1%
29
 
2.8%
27
 
2.6%
26
 
2.5%
25
 
2.5%
25
 
2.5%
22
 
2.2%
Other values (115) 718
70.4%
Distinct342
Distinct (%)100.0%
Missing1
Missing (%)0.3%
Memory size2.8 KiB
2023-12-11T12:09:50.633856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length67
Median length48
Mean length26.774854
Min length5

Characters and Unicode

Total characters9157
Distinct characters419
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique342 ?
Unique (%)100.0%

Sample

1st row06 최종 보고서(화학, 수정).hwp
2nd row2006년 연차실적보고서_기초_신종실.hwp
3rd row2006년 최종연구보고서(오제스키산업화).hwp
4th row2006년 연차실적보고서_중점_신종실.hwp
5th row06년 PMWS 최종결과보고서.hwp
ValueCountFrequency (%)
24
 
2.7%
2008년 17
 
1.9%
2007년 16
 
1.8%
연구과제 14
 
1.6%
2010년 13
 
1.5%
최종결과보고서.hwp 10
 
1.1%
최종 9
 
1.0%
연구보고서 8
 
0.9%
최종보고서.hwp 8
 
0.9%
2012년 7
 
0.8%
Other values (650) 753
85.7%
2023-12-11T12:09:51.148915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
539
 
5.9%
0 406
 
4.4%
. 380
 
4.1%
p 354
 
3.9%
h 331
 
3.6%
w 329
 
3.6%
313
 
3.4%
308
 
3.4%
305
 
3.3%
2 293
 
3.2%
Other values (409) 5599
61.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4713
51.5%
Lowercase Letter 1264
 
13.8%
Decimal Number 1208
 
13.2%
Space Separator 539
 
5.9%
Other Punctuation 395
 
4.3%
Open Punctuation 267
 
2.9%
Close Punctuation 265
 
2.9%
Uppercase Letter 223
 
2.4%
Connector Punctuation 189
 
2.1%
Dash Punctuation 92
 
1.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
313
 
6.6%
308
 
6.5%
305
 
6.5%
257
 
5.5%
248
 
5.3%
180
 
3.8%
153
 
3.2%
143
 
3.0%
141
 
3.0%
98
 
2.1%
Other values (339) 2567
54.5%
Lowercase Letter
ValueCountFrequency (%)
p 354
28.0%
h 331
26.2%
w 329
26.0%
i 27
 
2.1%
e 24
 
1.9%
s 21
 
1.7%
o 21
 
1.7%
l 19
 
1.5%
a 16
 
1.3%
d 16
 
1.3%
Other values (14) 106
 
8.4%
Uppercase Letter
ValueCountFrequency (%)
P 27
12.1%
V 25
11.2%
R 21
9.4%
S 19
 
8.5%
D 15
 
6.7%
A 15
 
6.7%
C 14
 
6.3%
B 13
 
5.8%
I 13
 
5.8%
N 11
 
4.9%
Other values (10) 50
22.4%
Decimal Number
ValueCountFrequency (%)
0 406
33.6%
2 293
24.3%
1 282
23.3%
8 50
 
4.1%
7 43
 
3.6%
9 39
 
3.2%
6 30
 
2.5%
4 25
 
2.1%
3 25
 
2.1%
5 15
 
1.2%
Other Punctuation
ValueCountFrequency (%)
. 380
96.2%
, 10
 
2.5%
' 3
 
0.8%
% 1
 
0.3%
· 1
 
0.3%
Open Punctuation
ValueCountFrequency (%)
( 242
90.6%
[ 22
 
8.2%
3
 
1.1%
Close Punctuation
ValueCountFrequency (%)
) 240
90.6%
] 22
 
8.3%
3
 
1.1%
Math Symbol
ValueCountFrequency (%)
+ 1
50.0%
~ 1
50.0%
Space Separator
ValueCountFrequency (%)
539
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 189
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 92
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4713
51.5%
Common 2957
32.3%
Latin 1487
 
16.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
313
 
6.6%
308
 
6.5%
305
 
6.5%
257
 
5.5%
248
 
5.3%
180
 
3.8%
153
 
3.2%
143
 
3.0%
141
 
3.0%
98
 
2.1%
Other values (339) 2567
54.5%
Latin
ValueCountFrequency (%)
p 354
23.8%
h 331
22.3%
w 329
22.1%
P 27
 
1.8%
i 27
 
1.8%
V 25
 
1.7%
e 24
 
1.6%
R 21
 
1.4%
s 21
 
1.4%
o 21
 
1.4%
Other values (34) 307
20.6%
Common
ValueCountFrequency (%)
539
18.2%
0 406
13.7%
. 380
12.9%
2 293
9.9%
1 282
9.5%
( 242
8.2%
) 240
8.1%
_ 189
 
6.4%
- 92
 
3.1%
8 50
 
1.7%
Other values (16) 244
8.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4713
51.5%
ASCII 4437
48.5%
None 7
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
539
12.1%
0 406
 
9.2%
. 380
 
8.6%
p 354
 
8.0%
h 331
 
7.5%
w 329
 
7.4%
2 293
 
6.6%
1 282
 
6.4%
( 242
 
5.5%
) 240
 
5.4%
Other values (57) 1041
23.5%
Hangul
ValueCountFrequency (%)
313
 
6.6%
308
 
6.5%
305
 
6.5%
257
 
5.5%
248
 
5.3%
180
 
3.8%
153
 
3.2%
143
 
3.0%
141
 
3.0%
98
 
2.1%
Other values (339) 2567
54.5%
None
ValueCountFrequency (%)
3
42.9%
3
42.9%
· 1
 
14.3%

Missing values

2023-12-11T12:09:45.008998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T12:09:45.158720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

최종보고서번호제출일과제계획서번호연차계획서번호과제명연구책임자최종보고서_파일명
02007B-FS06-2005-06-01012007-05-22B-FS06-2005-06-01B-FS06-2005-06-0101동물용 생약제제의 기준 및 시험방법 설정에 관한 연구이명헌06 최종 보고서(화학, 수정).hwp
12007F-AD16-2006-06-04012007-05-23F-AD16-2006-06-04F-AD16-2006-06-0401웨스트나일열 바이러스 비구조단백질의 기능분석 연구(기초)나진주2006년 연차실적보고서_기초_신종실.hwp
22007M-AD18-2004-05-04012007-05-23M-AD18-2004-05-04M-AD18-2004-05-0401돼지오제스키병 유전자재조합 생독백신의 산업화연구양동군2006년 최종연구보고서(오제스키산업화).hwp
32007B-AD16-2004-06-03012007-05-23B-AD16-2004-06-03B-AD16-2004-06-0301웨스트나일열 항체 진단법 개발 연구나진주2006년 연차실적보고서_중점_신종실.hwp
42007M-AD14-2005-07-02012007-05-28M-AD14-2005-07-02M-AD14-2005-07-0201돼지콜레라 및 이유후전신소모성증후군(PMWS) 관련 바이러스 원인체 조사 및 유전자 분석송재영06년 PMWS 최종결과보고서.hwp
52007P-AD15-2006-06-03012007-05-28P-AD15-2006-06-03P-AD15-2006-06-0301진단액 생산 및 검정기술 표준화 연구: 조류질병(기획)권준헌총괄연구과제 보고서(진단액 생산 및 검정기술표준화-취합).hwp
62007N-AD15-2003-07-02012007-05-28N-AD15-2003-07-02N-AD15-2003-07-0201닭전염성기관지염 및 조류 인플루엔자 유전자 모니터링전우진2007 AI-IB 기본과제 최종보고서.hwp
72007N-AD14-2006-06-06012007-05-28N-AD14-2006-06-06N-AD14-2006-06-0601제주지역 PRRS감염실태조사(기본)최은진06년 제주 PRRSV 최종결과보고서.hwp
82007B-AD14-2005-06-01012007-05-28B-AD14-2005-06-01B-AD14-2005-06-0101돼지인플루엔자 혈청학적조사 및 분리바이러스의 항원성 분석 연구최은진06년 SIV 최종결과보고서.hwp
92007B-AD14-2006-08-02012007-05-28B-AD14-2006-08-02B-AD14-2006-08-0201PRRSV 바이러스의 역상유전자시스템을 이용한 병원성 분석 및 재조합 백신용 바이러스 작성송재영06년 PRRSV 역상유전자 과제 연차실적보고서.hwp
최종보고서번호제출일과제계획서번호연차계획서번호과제명연구책임자최종보고서_파일명
3332012Z-1541779-2012-12-01012013-02-05Z-1541779-2012-12-01Z-1541779-2012-12-0101구제역백신센터 건립 타당성 조사지인배구제역백신센터_연구결과보고서(제출용)(1).hwp
3342012Z-FS03-2011-12-03022013-02-06Z-FS03-2011-12-03Z-FS03-2011-12-0302축산물의 유통기한 설정실험지표 개발 및 효과적인 커뮤니케이션 방법개발김진만용역사업_연구결과보고서_유통기한_완.hwp
3352012Z-1541745-2011-12-01022013-02-06Z-1541745-2011-12-01Z-1541745-2011-12-0102과실파리 발생예측 시스템 개발김동순과실파리류_발생예찰_시스템_개발_완료보고서.pdf
3362011Z-AD20-2011-12-01012013-02-06Z-AD20-2011-12-01Z-AD20-2011-12-0101비둘기 분변 유래 효모양 병원성진균에 대한 인수공통전염성질병 방제기술 구축장경수【붙임 4】연구결과보고-장경수 (부산가톨릭대학교)-3.hwp
3372012Z-1541745-2012-12-02012013-02-06Z-1541745-2012-12-02Z-1541745-2012-12-0201광견병 발생지역 인근 야생동물 예찰 및 환경생태학적 연구이한수테스트용.hwp
3382012P-AD21-2011-14-01022013-02-07P-AD21-2011-14-01P-AD21-2011-14-0102구제역 음성 동물(소 및 돼지) 공급 기반 구축 및 구제역 백신 효능평가김연희2012년구제역_최종보고서(최종-제본용).hwp
3392012Z-1542057-2012-12-01012013-02-07Z-1542057-2012-12-01Z-1542057-2012-12-0101동물용의약품 환경영향평가의 지침개발김의경검역원_연구결과보고서 경상대 20130102 (최종보고서).hwp
3402012P-AD16-2011-13-01012013-02-12P-AD16-2011-13-01P-AD16-2011-13-0101국내분리 구제역바이러스 O형(SEA지역형)을 이용한 백신종독 개발연구탁동섭국내분리 구제역바이러스 O형 (SEA지역형)을 이용한 백신종독 개발연구(2012 년말)-최종.hwp
3412012Z-AD21-2011-12-02022013-02-13Z-AD21-2011-12-02Z-AD21-2011-12-0202국내 오리농장의 방역위생 실태조사 및 질병 발생동향 분석장형관130206-장형관-검역검사본부용역과제-연구결과보고서(제출용).hwp
3422012Z-1541777-2011-12-03022013-10-10Z-1541777-2011-12-03Z-1541777-2011-12-0302돼지유래 iPS 신기술 구축 및 개발연구우흥명58.우흥명_돼지유래 iPS 신기술 구축 및 개발연구.hwp