Memórias do Instituto Oswaldo Cruz
On-line version ISSN 1678-8060
FARIA-CAMPOS, Alessandra C et al. Production of full-length cDNA sequences by sequencing and analysis of expressed sequence tags from Schistosoma mansoni. Mem. Inst. Oswaldo Cruz [online]. 2006, vol.101, suppl.1, pp. 161-165. ISSN 1678-8060. http://dx.doi.org/10.1590/S0074-02762006000900026.
The number of sequences generated by genome projects has increased exponentially, but gene characterization has not followed at the same rate. Sequencing and analysis of full-length cDNAs is an important step in gene characterization that has been used nowadays by several research groups. In this work, we have selected Schistosoma mansoni clones for full-length sequencing, using an algorithm that investigates the presence of the initial methionine in the parasite sequence based on the positions of alignment start between two sequences. BLAST searches to produce such alignments have been performed using parasite expressed sequence tags produced by Minas Gerais Genome Network against sequences from the database Eukaryotic Cluster of Orthologous Groups (KOG). This procedure has allowed the selection of clones representing 398 proteins which have not been deposited as S. mansoni complete CDS in any public database. Dedicated sequencing of 96 of such clones with reads from both 5' and 3' ends has been performed. These reads have been assembled using PHRAP, resulting in the production of 33 full-length sequences that represent novel S. mansoni proteins. These results shall contribute to construct a more complete view of the biology of this important parasite.
Keywords : expressed sequence tags; sequencing; Schistosoma; full-length cDNA.