Skip to main content

Table 5 Protein names and lengths (in aminoacids, aa) for the five most redundant hits in each transcriptome

From: Comparative description of ten transcriptomes of newly sequenced invertebrates and efficiency estimation of genomic sampling in non-model taxa

# Hits Protein name and [species name] Putative transposable element Protein length (aa) Accession number
Petrosia ficiformis     
x9 PREDICTED: hypothetical protein LOC100641198 [Amphimedon queenslandica] - 673 XP_003382742
x9 PREDICTED: hypothetical protein LOC100639583 [Amphimedon queenslandica] yes 1768 XP_003390293
x10 PREDICTED: RING finger protein 213-like [Amphimedon queenslandica] - 5361 XP_003389786
x12 ankyrin 2,3/unc44 [Aedes aegypti] - 789 XP_001649474
x16 PREDICTED: hypothetical protein LOC100637079 [Amphimedon queenslandica] - 41943 XP_003386025
Crella elegans     
x25 Collagen protein [Suberites domuncula] - 282 CAC81019
x36 aggregation factor protein 3, form C [Microciona prolifera] - 2205 AAC33162
x38 PREDICTED: deleted in malignant brain tumors 1 protein-like [Amphimedon queenslandica] - 3131 XP_003389240
x46 PREDICTED: hypothetical protein LOC100640736 [Amphimedon queenslandica] - 5715 XP_003383871
x193 PREDICTED: hypothetical protein LOC100637079 [Amphimedon queenslandica] - 41943 XP_003386025
Cephalothrix hongkongiensis     
x14 pol-like protein [Ciona intestinalis] yes 1235 BAC82623
x14 pol-like protein [Ciona intestinalis] yes 1263 BAC82626
x15 PREDICTED: similar to ORF2-encoded protein, partial [Hydra magnipapillata] yes 372 XP_002155414
x15 PREDICTED: Pao retrotransposon peptidase family protein-like [Saccoglossus kowalevskii] - 1559 XP_002731015
x23 putative zinc finger protein [Schistosoma mansoni] - 486 CCD80531
Cerebratulus marginatus     
x9 PREDICTED: hypothetical protein LOC497165 [Danio rerio] yes 2265 XP_003200870
x11 ORF2-encoded protein [Danio rerio] yes 1027 BAE46429
x11 PREDICTED: similar to ORF2-encoded protein, partial [Strongylocentrotus purpuratus] yes 1117 XP_001187755
x11 PREDICTED: similar to ORF2-encoded protein [Strongylocentrotus purpuratus] yes 1124 XP_001189850
x11 PREDICTED: hypothetical protein LOC100535924 [Danio rerio] - 1448 XP_003199942
Octopus vulgaris     
x38 PREDICTED: hypothetical protein LOC100609033 [Pan troglodytes] yes 255 XP_003317434
x44 PREDICTED: hypothetical protein LOC100597269 [Nomascus leucogenys] yes 220 XP_003276349
x57 PREDICTED: hypothetical protein LOC100414382, partial [Callithrix jacchus] yes 178 XP_002762361
x57 PREDICTED: zinc finger protein 91-like [Acyrthosiphon pisum] - 818 XP_003243211
x90 PREDICTED: hypothetical protein LOC100608502, partial [Pan troglodytes] yes 211 XP_003315526
Chiton olivaceus     
x16 predicted protein [Nematostella vectensis] yes 1079 XP_001630327
x17 PREDICTED: similar to tyrosine recombinase [Strongylocentrotus purpuratus] - 461 XP_001183896
x22 pol-like protein [Biomphalaria glabrata] yes 1222 ABN58714
x29 hypothetical protein EAI_13357 [Harpegnathos saltator] - 172 EFN88744
x48 PREDICTED: similar to ORF2-encoded protein, partial [Hydra magnipapillata] yes 372 XP_002155414
Sipunculus nudus     
x7 dopamine beta hydroxylase-like protein, partial [Pomatoceros lamarckii] - 504 ADB11406
x7 pol-like protein [Ciona intestinalis] yes 1263 BAC82626
x7 PREDICTED: similar to transposase [Strongylocentrotus purpuratus] yes 1312 XP_001193486
x9 pol-like protein [Ciona intestinalis] yes 1235 BAC82623
x11 lectin 1B [Arenicola marina] - 243 ADO22714
Hormogaster samnitica     
x15 leechCAM [Hirudo medicinalis] - 858 AAC47655
x15 pannexin 4 [Aplysia californica] - 413 NP_001191576
x16 predicted protein [Nematostella vectensis] - 2047 XP_001624963
x19 hypothetical protein CBG_27119 [Caenorhabditis briggsae AF16] - 224 CAR99373
x24 tractin [Hirudo medicinalis] - 1880 AAC47654
Metasiro americanus     
x14 transglutaminase [Limulus polyphemus] - 764 2012342A
x15 putative reverse transcriptase [Takifugu rubripes] yes 851 AAK58879
x30 hypothetical protein BRAFLDRAFT_210900 [Branchiostoma floridae] - 489 XP_002611360
x39 hypothetical protein BRAFLDRAFT_79800 [Branchiostoma floridae] - 512 XP_002597956
x53 hypothetical protein BRAFLDRAFT_89523 [Branchiostoma floridae] - 396 XP_002590717
Alipes grandidieri     
x55 PREDICTED: similar to predicted protein [Hydra magnipapillata] yes 1371 XP_002161911
x56 Transposable element Tcb1 transposase [Salmo salar] yes 281 ACN11475
x57 hypothetical protein TcasGA2_TC002110 [Tribolium castaneum] yes 346 EEZ99596
x58 hypothetical protein EAG_05969 [Camponotus floridanus] yes 282 EFN71217
x123 hypothetical protein TcasGA2_TC000717 [Tribolium castaneum] yes 346 EEZ98274
  1. Their putative transposable element nature is indicated, as well as the Genbank accession number for each protein.