Dataset statistics
Number of variables | 4 |
---|---|
Number of observations | 1014 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 21 |
Duplicate rows (%) | 2.1% |
Total size in memory | 31.8 KiB |
Average record size in memory | 32.1 B |
Variable types
Text | 3 |
---|---|
Categorical | 1 |
Dataset
Description | 생물종의 유전정보 분석 관련 바코드 유전자 및 계통 분석에 기반이 되는 프라이머 서열의 정의, 활용 및 라이브러리 정보 관련 자료 입니다. |
---|---|
Author | 환경부 국립생물자원관 |
URL | https://www.data.go.kr/data/15067613/fileData.do |
Dataset has 21 (2.1%) duplicate rows | Duplicates |
Reproduction
Analysis started | 2023-12-12 06:23:57.454126 |
---|---|
Analysis finished | 2023-12-12 06:23:57.855312 |
Duration | 0.4 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
프라이머명
Text
Distinct | 888 |
---|---|
Distinct (%) | 87.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 8.1 KiB |
Length
Max length | 23 |
---|---|
Median length | 18 |
Mean length | 7.199211 |
Min length | 2 |
Characters and Unicode
Total characters | 7300 |
---|---|
Distinct characters | 70 |
Distinct categories | 10 ? |
Distinct scripts | 3 ? |
Distinct blocks | 3 ? |
Unique
Unique | 801 ? |
---|---|
Unique (%) | 79.0% |
Sample
1st row | Dorid_COI_3F |
---|---|
2nd row | 1F-spionid-LCO |
3rd row | 18S329 |
4th row | 18SL |
5th row | 18S R8 |
Value | Count | Frequency (%) |
psbaf | 6 | 0.6% |
ycf1 | 5 | 0.5% |
m13_trnh | 5 | 0.5% |
trnhr | 5 | 0.5% |
its4 | 5 | 0.5% |
m13_its1a | 5 | 0.5% |
m13_its4 | 5 | 0.5% |
18s | 5 | 0.5% |
m13_psba | 5 | 0.5% |
lco1490 | 4 | 0.4% |
Other values (885) | 1007 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 589 | 8.1% |
R | 397 | 5.4% |
_ | 396 | 5.4% |
F | 383 | 5.2% |
2 | 282 | 3.9% |
- | 250 | 3.4% |
r | 249 | 3.4% |
t | 247 | 3.4% |
a | 236 | 3.2% |
L | 231 | 3.2% |
Other values (60) | 4040 |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 2564 | |
Decimal Number | 2007 | |
Lowercase Letter | 2004 | |
Connector Punctuation | 396 | 5.4% |
Dash Punctuation | 250 | 3.4% |
Space Separator | 50 | 0.7% |
Other Punctuation | 10 | 0.1% |
Open Punctuation | 9 | 0.1% |
Close Punctuation | 9 | 0.1% |
Letter Number | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
R | 397 | |
F | 383 | |
L | 231 | |
S | 187 | 7.3% |
C | 164 | 6.4% |
K | 156 | 6.1% |
I | 129 | 5.0% |
T | 126 | 4.9% |
A | 116 | 4.5% |
H | 105 | 4.1% |
Other values (16) | 570 |
Lowercase Letter
Value | Count | Frequency (%) |
r | 249 | |
t | 247 | |
a | 236 | |
m | 175 | |
c | 166 | 8.3% |
b | 165 | 8.2% |
n | 104 | 5.2% |
s | 78 | 3.9% |
p | 77 | 3.8% |
e | 72 | 3.6% |
Other values (16) | 435 |
Decimal Number
Value | Count | Frequency (%) |
1 | 589 | |
2 | 282 | |
3 | 196 | 9.8% |
4 | 172 | 8.6% |
8 | 158 | 7.9% |
0 | 137 | 6.8% |
9 | 123 | 6.1% |
5 | 121 | 6.0% |
6 | 121 | 6.0% |
7 | 108 | 5.4% |
Other Punctuation
Value | Count | Frequency (%) |
. | 9 | |
: | 1 | 10.0% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 396 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 250 |
Space Separator
Value | Count | Frequency (%) |
50 |
Open Punctuation
Value | Count | Frequency (%) |
( | 9 |
Close Punctuation
Value | Count | Frequency (%) |
) | 9 |
Letter Number
Value | Count | Frequency (%) |
Ⅱ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 4567 | |
Common | 2731 | |
Greek | 2 | < 0.1% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
R | 397 | 8.7% |
F | 383 | 8.4% |
r | 249 | 5.5% |
t | 247 | 5.4% |
a | 236 | 5.2% |
L | 231 | 5.1% |
S | 187 | 4.1% |
m | 175 | 3.8% |
c | 166 | 3.6% |
b | 165 | 3.6% |
Other values (42) | 2131 |
Common
Value | Count | Frequency (%) |
1 | 589 | |
_ | 396 | |
2 | 282 | |
- | 250 | |
3 | 196 | 7.2% |
4 | 172 | 6.3% |
8 | 158 | 5.8% |
0 | 137 | 5.0% |
9 | 123 | 4.5% |
5 | 121 | 4.4% |
Other values (7) | 307 |
Greek
Value | Count | Frequency (%) |
β | 2 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 7297 | |
None | 2 | < 0.1% |
Number Forms | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 589 | 8.1% |
R | 397 | 5.4% |
_ | 396 | 5.4% |
F | 383 | 5.2% |
2 | 282 | 3.9% |
- | 250 | 3.4% |
r | 249 | 3.4% |
t | 247 | 3.4% |
a | 236 | 3.2% |
L | 231 | 3.2% |
Other values (58) | 4037 |
None
Value | Count | Frequency (%) |
β | 2 |
Number Forms
Value | Count | Frequency (%) |
Ⅱ | 1 |
방향
Categorical
Distinct | 2 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 8.1 KiB |
reverse | |
---|---|
forward |
Length
Max length | 7 |
---|---|
Median length | 7 |
Mean length | 7 |
Min length | 7 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | forward |
---|---|
2nd row | forward |
3rd row | reverse |
4th row | forward |
5th row | reverse |
Common Values
Value | Count | Frequency (%) |
reverse | 514 | |
forward | 500 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
reverse | 514 | |
forward | 500 |
마커명
Text
Distinct | 60 |
---|---|
Distinct (%) | 5.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 8.1 KiB |
Value | Count | Frequency (%) |
coi | 172 | |
matk | 148 | |
rbcl | 140 | |
its | 88 | 8.0% |
rrna | 85 | 7.7% |
cytb | 55 | 5.0% |
18s | 52 | 4.7% |
16s | 38 | 3.5% |
28s | 33 | 3.0% |
trnh-psba | 32 | 2.9% |
Other values (51) | 258 |
Most occurring characters
Value | Count | Frequency (%) |
S | 319 | 7.1% |
r | 314 | 6.9% |
t | 287 | 6.3% |
I | 279 | 6.2% |
b | 259 | 5.7% |
C | 251 | 5.6% |
L | 186 | 4.1% |
O | 180 | 4.0% |
c | 175 | 3.9% |
a | 162 | 3.6% |
Other values (44) | 2108 |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 2072 | |
Lowercase Letter | 1858 | |
Decimal Number | 365 | 8.1% |
Dash Punctuation | 124 | 2.7% |
Space Separator | 99 | 2.2% |
Other Punctuation | 2 | < 0.1% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
S | 319 | |
I | 279 | |
C | 251 | |
L | 186 | |
O | 180 | |
K | 157 | |
A | 138 | |
T | 108 | 5.2% |
R | 100 | 4.8% |
N | 91 | 4.4% |
Other values (12) | 263 |
Lowercase Letter
Value | Count | Frequency (%) |
r | 314 | |
t | 287 | |
b | 259 | |
c | 175 | |
a | 162 | |
m | 152 | |
p | 99 | 5.3% |
n | 92 | 5.0% |
s | 68 | 3.7% |
y | 64 | 3.4% |
Other values (11) | 186 |
Decimal Number
Value | Count | Frequency (%) |
1 | 140 | |
8 | 85 | |
2 | 76 | |
6 | 47 | 12.9% |
3 | 8 | 2.2% |
5 | 5 | 1.4% |
4 | 2 | 0.5% |
9 | 2 | 0.5% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 124 |
Space Separator
Value | Count | Frequency (%) |
99 |
Other Punctuation
Value | Count | Frequency (%) |
/ | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 3916 | |
Common | 590 | 13.1% |
Greek | 14 | 0.3% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
S | 319 | 8.1% |
r | 314 | 8.0% |
t | 287 | 7.3% |
I | 279 | 7.1% |
b | 259 | 6.6% |
C | 251 | 6.4% |
L | 186 | 4.7% |
O | 180 | 4.6% |
c | 175 | 4.5% |
a | 162 | 4.1% |
Other values (32) | 1504 |
Common
Value | Count | Frequency (%) |
1 | 140 | |
- | 124 | |
99 | ||
8 | 85 | |
2 | 76 | |
6 | 47 | 8.0% |
3 | 8 | 1.4% |
5 | 5 | 0.8% |
/ | 2 | 0.3% |
4 | 2 | 0.3% |
Greek
Value | Count | Frequency (%) |
α | 14 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 4506 | |
None | 14 | 0.3% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
S | 319 | 7.1% |
r | 314 | 7.0% |
t | 287 | 6.4% |
I | 279 | 6.2% |
b | 259 | 5.7% |
C | 251 | 5.6% |
L | 186 | 4.1% |
O | 180 | 4.0% |
c | 175 | 3.9% |
a | 162 | 3.6% |
Other values (43) | 2094 |
None
Value | Count | Frequency (%) |
α | 14 |
대상 분류군
Text
Distinct | 190 |
---|---|
Distinct (%) | 18.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 8.1 KiB |
Length
Max length | 64 |
---|---|
Median length | 37 |
Mean length | 20.99211 |
Min length | 4 |
Characters and Unicode
Total characters | 21286 |
---|---|
Distinct characters | 67 |
Distinct categories | 8 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 24 ? |
---|---|
Unique (%) | 2.4% |
Sample
1st row | Pseudopolydora gigeriosa |
---|---|
2nd row | Polychaeta Grube, 1850 |
3rd row | Crustacea Brnnich, 1772 |
4th row | Crustacea Brnnich, 1772 |
5th row | Crustacea Brnnich, 1772 |
Value | Count | Frequency (%) |
ex | 92 | 3.2% |
88 | 3.1% | |
plantae | 75 | 2.6% |
fungi | 73 | 2.6% |
l | 57 | 2.0% |
linnaeus | 32 | 1.1% |
wettstein | 29 | 1.0% |
rhodophyta | 29 | 1.0% |
1922 | 29 | 1.0% |
insecta | 28 | 1.0% |
Other values (438) | 2325 |
Most occurring characters
Value | Count | Frequency (%) |
a | 2164 | 10.2% |
1843 | 8.7% | |
e | 1632 | 7.7% |
i | 1275 | 6.0% |
r | 997 | 4.7% |
n | 967 | 4.5% |
o | 957 | 4.5% |
t | 931 | 4.4% |
s | 875 | 4.1% |
l | 722 | 3.4% |
Other values (57) | 8923 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 14681 | |
Uppercase Letter | 2108 | 9.9% |
Space Separator | 1843 | 8.7% |
Decimal Number | 1600 | 7.5% |
Other Punctuation | 830 | 3.9% |
Open Punctuation | 100 | 0.5% |
Close Punctuation | 100 | 0.5% |
Dash Punctuation | 24 | 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
a | 2164 | |
e | 1632 | |
i | 1275 | 8.7% |
r | 997 | 6.8% |
n | 967 | 6.6% |
o | 957 | 6.5% |
t | 931 | 6.3% |
s | 875 | 6.0% |
l | 722 | 4.9% |
c | 637 | 4.3% |
Other values (16) | 3524 |
Uppercase Letter
Value | Count | Frequency (%) |
C | 235 | 11.1% |
P | 211 | 10.0% |
L | 200 | 9.5% |
A | 172 | 8.2% |
R | 138 | 6.5% |
F | 130 | 6.2% |
S | 109 | 5.2% |
B | 106 | 5.0% |
M | 104 | 4.9% |
H | 99 | 4.7% |
Other values (14) | 604 |
Decimal Number
Value | Count | Frequency (%) |
1 | 461 | |
8 | 248 | |
9 | 211 | |
2 | 157 | 9.8% |
7 | 150 | 9.4% |
0 | 122 | 7.6% |
5 | 76 | 4.8% |
4 | 61 | 3.8% |
6 | 60 | 3.8% |
3 | 54 | 3.4% |
Other Punctuation
Value | Count | Frequency (%) |
. | 431 | |
, | 311 | |
& | 88 | 10.6% |
Space Separator
Value | Count | Frequency (%) |
1843 |
Open Punctuation
Value | Count | Frequency (%) |
( | 100 |
Close Punctuation
Value | Count | Frequency (%) |
) | 100 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 24 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 16789 | |
Common | 4497 | 21.1% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
a | 2164 | 12.9% |
e | 1632 | 9.7% |
i | 1275 | 7.6% |
r | 997 | 5.9% |
n | 967 | 5.8% |
o | 957 | 5.7% |
t | 931 | 5.5% |
s | 875 | 5.2% |
l | 722 | 4.3% |
c | 637 | 3.8% |
Other values (40) | 5632 |
Common
Value | Count | Frequency (%) |
1843 | ||
1 | 461 | 10.3% |
. | 431 | 9.6% |
, | 311 | 6.9% |
8 | 248 | 5.5% |
9 | 211 | 4.7% |
2 | 157 | 3.5% |
7 | 150 | 3.3% |
0 | 122 | 2.7% |
( | 100 | 2.2% |
Other values (7) | 463 | 10.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 21286 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
a | 2164 | 10.2% |
1843 | 8.7% | |
e | 1632 | 7.7% |
i | 1275 | 6.0% |
r | 997 | 4.7% |
n | 967 | 4.5% |
o | 957 | 4.5% |
t | 931 | 4.4% |
s | 875 | 4.1% |
l | 722 | 3.4% |
Other values (57) | 8923 |
방향 | 마커명 | |
---|---|---|
방향 | 1.000 | 0.000 |
마커명 | 0.000 | 1.000 |
프라이머명 | 방향 | 마커명 | 대상 분류군 | |
---|---|---|---|---|
0 | Dorid_COI_3F | forward | COI | Pseudopolydora gigeriosa |
1 | 1F-spionid-LCO | forward | COI | Polychaeta Grube, 1850 |
2 | 18S329 | reverse | 18S rRNA | Crustacea Brnnich, 1772 |
3 | 18SL | forward | 18S rRNA | Crustacea Brnnich, 1772 |
4 | 18S R8 | reverse | 18S rRNA | Crustacea Brnnich, 1772 |
5 | 18S F2 | forward | 18S rRNA | Crustacea Brnnich, 1772 |
6 | 18S F2 | forward | 18S rRNA | Crustacea Brnnich, 1772 |
7 | psbA | reverse | trnH-psbA | Elaeocarpus L. |
8 | trnL | forward | trnL-F | Elaeocarpus L. |
9 | mtDNA_ext(Cytb)R | reverse | Cytb | Aves Linnaeus, 1758 |
프라이머명 | 방향 | 마커명 | 대상 분류군 | |
---|---|---|---|---|
1004 | rbcL_902R | reverse | rbcL | Lamiales Bromhead |
1005 | rbcL_26F | forward | rbcL | Lamiales Bromhead |
1006 | Am β-tubulin-R | reverse | Tubulin | Amanita Pers. 1797 |
1007 | Am β-tubulin F | forward | Tubulin | Amanita Pers. 1797 |
1008 | Am-7R-DK | reverse | RPB2 | Amanita Pers. 1797 |
1009 | Am-6F-DK | forward | RPB2 | Amanita Pers. 1797 |
1010 | LROR-DK | forward | LSU | Amanita Pers. 1797 |
1011 | LR5-DK | reverse | LSU | Amanita Pers. 1797 |
1012 | ITS4-DK | reverse | ITS | Amanita Pers. 1797 |
1013 | ITS1-DK | forward | ITS | Amanita Pers. 1797 |
Most frequently occurring
프라이머명 | 방향 | 마커명 | 대상 분류군 | # duplicates | |
---|---|---|---|---|---|
4 | 28sFF | reverse | 28S rRNA | Insecta | 3 |
0 | 1055F | forward | 18S rRNA | Protozoa | 2 |
1 | 1055R | reverse | 18S rRNA | Protozoa | 2 |
2 | 18S F2 | forward | 18S rRNA | Crustacea Brnnich, 1772 | 2 |
3 | 28sDD | forward | 28S rRNA | Insecta | 2 |
5 | BTUB4Rd | reverse | Tubulin | Fungi | 2 |
6 | COI2F | forward | COI | Acari | 2 |
7 | ITS4 | reverse | ITS | Plantae | 2 |
8 | LCOech1aF1 | forward | COI | Echinodermata Klein, 1734 | 2 |
9 | M13_ITS1a | forward | ITS | Rubia argyi (H. Lv. & Vaniot) H. Hara ex Lauener & D.K. Ferguson | 2 |