Print version ISSN 1415-4757
Genet. Mol. Biol. vol.35 no.2 São Paulo 2012
Ana-Paula Christoff (I); Elgion L.S. Loreto (II); Lenira M.N. Sepel (II)
(I) Curso de Ciências Biológicas, Centro de Ciências Naturais e Exatas, Universidade Federal de Santa Maria, Santa Maria, RS, Brazil
(II) Departamento de Biologia, Centro de Ciências Naturais e Exatas, Universidade Federal de Santa Maria, Santa Maria, RS, Brazil
Tip100 is an Ac-like transposable element that belongs to the hAT superfamily. First discovered in Ipomoea purpurea (common morning glory), it was classified as an autonomous element capable of movement within the genome. As Tip100 data were already available in databases, the sequences of related elements in ten additional species of Ipomoea and five commercial varieties were isolated and analyzed. Evolutionary analysis based on sequence diversity in nuclear ribosomal internal transcribed spacers (ITS) was also applied, in order to compare the evolution of these elements with that of their host species in the genus Ipomoea. Tip100 sequences were found in I. purpurea, I. nil, I. indica and I. alba, all of which showed high levels of similarity. The results of phylogenetic analysis of transposon sequences were congruent with the phylogenetic topology obtained for ITS sequences, thereby demonstrating that Tip100 is restricted to a particular group of species within Ipomoea. We hypothesize that Tip100 was probably acquired from a common ancestor and has been transmitted vertically within this genus.
Key words: hAT, transposable elements, Ac-Ds, Ipomoea, genome evolution, ITS.
Transposable elements (TEs), also referred to as "jumping genes" owing to their ability to move within the genome, are important sources of genetic variability that have contributed to genome evolution (Biémont and Vieira, 2006; Slotkin and Martienssen, 2007; Naito et al., 2009; Blumenstiel, 2010). TEs are extremely variable in sequence, molecular organization and replication mechanism, and these characteristics have been used to classify them in a hierarchical manner (Wicker et al., 2007). Some transposable elements can also be domesticated by their host genomes, thereby contributing to important processes in the organism (Knon et al., 2009).
The transposable element Tip100 was initially identified in Ipomoea purpurea by Habu et al. (1998). It is a class II transposable element that moves through a DNA intermediate, and is classified in the order TIR and the superfamily hAT (Wicker et al., 2007). It possesses 11 bp-long terminal inverted repeats (TIRs), produces 8 bp target site duplications (TSDs) as by-products of mobilization, and has a conserved hATC domain (hAT family dimerization domain) in the transposase, all of which are characteristic features of the hAT superfamily (Kempken and Windhofer, 2001; Rubin et al., 2001; Arensburger et al., 2011).
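The structural hallmarks described above (terminal inverted repeats and target site duplications) can be checked computationally. The following is a minimal Python sketch of such a check, not a method used in this study. The toy sequences are hypothetical and are not the real Tip100 sequence, and the functions only detect perfect repeats, whereas real TIRs are often imperfect.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T, N only)."""
    complement = {"A": "T", "C": "G", "G": "C", "T": "A", "N": "N"}
    return "".join(complement[base] for base in reversed(seq.upper()))


def has_perfect_tir(element: str, tir_len: int = 11) -> bool:
    """True if the first tir_len bases equal the reverse complement of the last tir_len bases."""
    element = element.upper()
    if len(element) < 2 * tir_len:
        return False
    return element[:tir_len] == reverse_complement(element[-tir_len:])


def has_tsd(left_flank: str, right_flank: str, tsd_len: int = 8) -> bool:
    """True if the tsd_len bases just before the element match the tsd_len bases just after it."""
    if len(left_flank) < tsd_len or len(right_flank) < tsd_len:
        return False
    return left_flank[-tsd_len:].upper() == right_flank[:tsd_len].upper()


# Hypothetical toy data for illustration only (not taken from Tip100).
TOY_TIR = "CAGGGTTCAAC"  # 11 bp
toy_element = TOY_TIR + "ACGTACGTACGTTTGACC" + reverse_complement(TOY_TIR)
toy_tsd = "ATGCATGC"     # 8 bp
toy_left_flank = "TT" + toy_tsd
toy_right_flank = toy_tsd + "GG"
```

The default lengths (11 bp and 8 bp) follow the values reported for Tip100; a practical implementation would allow a small number of mismatches in the TIR comparison.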
Tip100 is an autonomous, freely moving element in I. purpurea (Ishikawa et al., 2002), and the color variegation patterns observed in the flowers of some strains have been attributed to it. Habu et al. (1998) demonstrated that this TE is inserted into either the 5' regulatory region or the intron of the Chalcone Synthase D gene (CHS-D). Its presence in this gene, which encodes the enzyme responsible for the first step of anthocyanin production, can induce modification of flower color. Recurrent somatic excision of Tip100 from the CHS-D gene can generate the variegated patterns observed in some I. purpurea plants (Iida et al., 2004). Likewise, many other transposable elements are capable of affecting the genes of the anthocyanin pathway (Park et al., 2004).
The genus Ipomoea is a member of the Convolvulaceae, one of the large families of Solanales. It includes numerous species that are mainly distributed in the Americas (Austin and Huáman, 1996; Austin and Bianchini, 1998). Some are simply weeds, whereas others are economically important, viz., sweet potatoes and ornamental plants, such as the morning glories I. purpurea and I. nil. Plants of this genus are appropriate biological models for research because of their exceptional diversity in morphology and habitat use, and their extensive experimental versatility (Stefanović et al., 2003; Clegg and Durbin, 2003).
In plants, nuclear ribosomal internal transcribed spacers (ITS) comprise one of the most useful sequences for phylogenetic studies at the species level (Feliner and Rosselló, 2007). Results from previous studies on Ipomoea using ITS sequences (Miller et al., 1999, 2004; Manos et al., 2001) are congruent with those based on morphological characteristics (McDonald and Mabry, 1992; Austin and Huáman, 1996; Austin and Bianchini, 1998).
In the present study, Tip100-related elements in ten Ipomoea species and five commercial cultivars were investigated. These species are representative of the three subgenera of Ipomoea, namely Eriospermum, Ipomoea and Quamoclit (Austin, 1975; Austin and Huáman, 1996; Austin and Bianchini, 1998). Our aim was to shed light on how the Tip100 transposable element is distributed among Ipomoea species, and how it may have evolved, by comparing the phylogenetic relationships of TE sequences with the host-species phylogeny.
Materials and Methods
DNA was extracted from 0.1 g of germinated plant-leaf tissue, according to the protocol described by Oliveira et al. (2009). The species examined in this study and their origins are shown in Table 1.
PCR cloning and sequencing of Tip100 sequences
Primers, designed with Oligo 4.1 software (Rychlik, 1992), were based on the Tip100 sequence from I. purpurea (Habu et al., 1998). The forward primer (5'-CGTTCTCCTTTTGTTGGTGT-3') anneals in the putative regulatory region of the element at positions 621-640, and the reverse primer (5'-GCTTCTCAATGGGGCACTTC-3') anneals in the first region of the transposase ORF at positions 1526-1545. A non-coding sequence region was chosen, as this part is expected to be more variable and thus phylogenetically more informative. PCR assays were performed in 10 µL volumes with 20 ng of genomic DNA, 0.2 U Taq DNA polymerase (Invitrogen), 1X Reaction Buffer, 1.5 mM of MgCl2 and 200 pmol of each primer. The following thermocycler amplification program was used: 94 °C for 5 min, 30 cycles at 94 °C for 45 s, 55 °C for 30 s and 72 °C for 60 s, followed by a final extension step at 72 °C for 7 min. The amplified fragments were cloned using the TA Cloning Kit pCR 2.1 Vector (Invitrogen). Plasmid DNA was isolated by miniprep alkaline lysis (Sambrook and Russell, 2001), and then precipitated with 13% PEG and 1.6 M NaCl. A total of 35 plasmids from all the species and varieties were selected for direct sequencing of both strands in a MegaBACE 500 automatic sequencer. The dideoxy chain-termination reaction was implemented with the DYEnamic ET kit (GE Healthcare). To obtain sequences for each clone, reads were assembled using Gap4 software from the Staden Package (Staden, 1996), with assembly continuing until a confidence value higher than 30 was obtained. The Tip100 sequence described by Habu et al. (1998) (GenBank AB004906) was also included in the analysis. All the new sequences obtained in this study were deposited in GenBank (Accession No: HM014415-HM014422).
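As a quick consistency check on the primer design, the expected product size can be derived from the annealing coordinates given above. The short Python sketch below assumes that the stated positions are 1-based and inclusive on the reference Tip100 sequence; the function names are illustrative only.

```python
# Primer sequences and annealing positions as stated in the text.
FORWARD_PRIMER = "CGTTCTCCTTTTGTTGGTGT"
REVERSE_PRIMER = "GCTTCTCAATGGGGCACTTC"
FORWARD_SITE = (621, 640)    # assumed 1-based, inclusive
REVERSE_SITE = (1526, 1545)  # assumed 1-based, inclusive


def site_length(site: tuple) -> int:
    """Number of bases covered by an inclusive (start, end) interval."""
    start, end = site
    return end - start + 1


def amplicon_length(forward_site: tuple, reverse_site: tuple) -> int:
    """Product size from the first base of the forward site to the last base of the reverse site."""
    return reverse_site[1] - forward_site[0] + 1


def gc_fraction(primer: str) -> float:
    """Fraction of G and C bases in a primer."""
    primer = primer.upper()
    return sum(base in "GC" for base in primer) / len(primer)


expected_product = amplicon_length(FORWARD_SITE, REVERSE_SITE)
```

Under these assumptions each primer spans 20 nt, matching its annealing interval, and the expected product is 925 bp, in line with the fragment of approximately 900 bp reported for positive amplifications.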
Analysis of transposon sequences
The identity of the cloned sequences was determined by BLAST searches (Altschul et al., 1990) in the NCBI and RepBase databases. Nucleotide sequences were aligned using Clustal W (Thompson et al., 1994), with default parameters. Cons software (Rice et al., 2000) was used to obtain consensus sequences of clones that diverged by less than 8.5% and belonged to the same species or variety. Mega 4 software (Tamura et al., 2007) was used to calculate sequence divergences with the Tamura 3-parameter model.
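For readers unfamiliar with the Tamura 3-parameter distance, the sketch below shows one way to compute it for a pair of aligned sequences in Python. It is an illustration, not the Mega 4 implementation. It uses d = -h ln(1 - P/h - Q) - (1/2)(1 - h) ln(1 - 2Q), with h = 2θ(1 - θ), where P and Q are the proportions of transitions and transversions and θ is the G+C content. Pooling θ over both sequences and skipping sites with gaps or ambiguous bases are simplifying assumptions made here.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def tamura3_distance(seq1: str, seq2: str) -> float:
    """Tamura 3-parameter distance between two aligned DNA sequences.

    Sites containing anything other than A, C, G or T in either sequence
    are skipped. GC content (theta) is pooled over both sequences, which
    is an assumption of this sketch.
    """
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")

    transitions = transversions = compared = gc_count = 0
    for a, b in zip(seq1.upper(), seq2.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue
        compared += 1
        gc_count += (a in "GC") + (b in "GC")
        if a == b:
            continue
        pair = {a, b}
        if pair <= PURINES or pair <= PYRIMIDINES:
            transitions += 1
        else:
            transversions += 1

    if compared == 0:
        raise ValueError("no comparable sites")

    p = transitions / compared
    q = transversions / compared
    theta = gc_count / (2 * compared)
    h = 2 * theta * (1 - theta)

    # When h is 0 the sequences contain only A/T or only G/C, so no
    # transitions are possible and the first term vanishes.
    first_arg = 1 - p / h - q if h > 0 else 1.0
    second_arg = 1 - 2 * q
    if first_arg <= 0 or second_arg <= 0:
        raise ValueError("sequences too divergent for this model")

    first_term = -h * math.log(first_arg) if h > 0 else 0.0
    return first_term - 0.5 * (1 - h) * math.log(second_arg)
```

When θ equals 0.5 the expression reduces to the Kimura 2-parameter distance, which provides a convenient sanity check. Distances obtained in this way could then be compared with the 8.5% threshold used for grouping clones into consensus sequences.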
Phylogenetic analysis using Bayesian criteria was done in MrBayes 3.1.2 software (Huelsenbeck and Ronquist, 2001). The HKY evolutionary model was chosen with the MrModelTest 2.2 software (Nylander, 2004) implemented in PAUP 4.0b10 (Swofford, 2003), using the Akaike information criterion (AIC) (Akaike, 1974). Two independent runs of four heated Monte Carlo Markov chains (MCMC) were carried out, each for 1,000,000 generations. Results were saved every 100 generations.
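Model choice by AIC follows the formula AIC = 2k - 2 ln L, where k is the number of free parameters and ln L is the maximized log-likelihood; the model with the lowest AIC is preferred. The Python sketch below illustrates the calculation. The log-likelihood values are hypothetical placeholders, not results from this study, and the parameter counts cover substitution-model parameters only, on the assumption that branch lengths are shared by all candidate models.

```python
def aic(log_likelihood: float, n_parameters: int) -> float:
    """Akaike information criterion: AIC = 2k - 2 ln L."""
    return 2 * n_parameters - 2 * log_likelihood


def best_model_by_aic(candidates: dict) -> str:
    """Return the name of the candidate with the lowest AIC.

    candidates maps model name -> (log-likelihood, number of free parameters).
    """
    return min(candidates, key=lambda name: aic(*candidates[name]))


# Hypothetical log-likelihoods for illustration only.
# Free substitution-model parameters: JC = 0; HKY = 3 base frequencies + 1
# transition/transversion ratio; GTR+G = 3 base frequencies + 5 relative
# rates + 1 gamma shape parameter.
example_candidates = {
    "JC": (-2500.0, 0),
    "HKY": (-2400.0, 4),
    "GTR+G": (-2398.0, 9),
}

selected = best_model_by_aic(example_candidates)
```

With these placeholder values the richer GTR+G model fits slightly better but is penalized for its extra parameters, so HKY is selected.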
PCR and sequencing of internal transcribed spacers (ITS)
The primers used to amplify ITS sequences, viz., ITS92 (5'-AAGGTTTCCGTAGGTGAAC-3') and ITS75 (5'-TATGCTTAAACTCAGCGGG-3'), were described by Baldwin (1992). The amplified region corresponded to the two internal spacers (ITS1 and ITS2), as well as the complete 5.8S ribosomal gene region between these. PCR conditions were similar to those used for the Tip100 PCR runs, except for the temperature cycles, which were as follows: 94 °C for 5 min, 35 cycles at 94 °C for 40 s, 55 °C for 30 s and 72 °C for 80 s, followed by a final step at 72 °C for 7 min. The resultant PCR fragments were purified with 13% PEG and 1.6 M NaCl, and directly sequenced in a MegaBACE 500 automatic sequencer. The dideoxy chain-termination reaction was carried out with a DYEnamic ET kit (GE Healthcare). An ITS sequence for Merremia tuberosa (AF110909), obtained from GenBank, was used as outgroup during analysis. The newly obtained sequences were deposited in GenBank (Accession No: HM014423-HM014437).
Analysis of ITS sequences
ITS sequence processing was the same as that for Tip100 sequences, except that sequence-distance calculations were performed using the Tamura-Nei model in Mega 4 (Tamura et al., 2007), and Bayesian analysis used a GTR+G model.
The Tip100 transposon
The molecular investigation of Tip100 homologous sequences in ten different Ipomoea species and five Ipomoea commercial varieties led to identification of the transposon in four species and four varieties, through positive PCR amplification of the expected 900 bp fragment (Table 1).
Sequence analysis showed the different cloned elements to be very similar, with levels of divergence varying from 0.0% to 2.8% (Table 2). The only exception was the Tip100 sequence in I. alba, which was more divergent (14.9%) from that in other species. The second highest divergence was 2.8% between I. nil 'Candy Pink' and the Tip100 sequence described by Habu et al. (1998) for I. purpurea (Tip100-AB004906). The lowest levels of divergence were found between I. nil and I. nil 'Candy Pink' (0.1%), among I. purpurea 'Kniolas Black Knight', I. purpurea and I. purpurea 'Split Personality' (0.1%), and between I. purpurea and I. purpurea 'Split Personality' (0.0%).
The complete Tip100 transposase CDS contains 2,426 bp that encode 808 amino acids. The region analyzed in this study covers the first 268 bp of the 5' end of the transposase CDS, corresponding to 73 amino acids. In this region, amino acid sequences are well conserved among the different Ipomoea. Although some nucleotide changes were found, amino acid sequences and physicochemical properties remained conserved in the analyzed region. The only exception was a nucleotide loss at position 30 of the transposase ORF in the I. nil and I. nil 'Candy Pink' sequences, which causes an amino acid deletion (Figure S1, Supplementary Material).
Bayesian analysis indicated three clusters in an unrooted tree. As expected, the most divergent clade was formed by I. alba Tip100 (Figure 1). The second clade included the two transposons in I. nil and I. nil 'Candy Pink'. Posterior probability (1.00) conferred strong support for this clade. The third clade, also well supported (0.98), was formed by the Tip100 sequences in I. indica and I. purpurea, Tip100-AB004906 and I. purpurea commercial varieties.
PCR amplification of ITS sequences was uniform and positive for all the species and varieties tested (Table 1). The obtained PCR fragments matched the expected fragment size of 550 bp for the ITS1 and ITS2 spacers, and the 5.8S sequence.
Comparison among sequences indicated the largest divergence to be between Merremia tuberosa and I. quamoclit (36.2%). No sequence difference was observed between I. purpurea and the I. purpurea varieties (I. purpurea 'Kniolas Black Knight', 'Light Blue Star' and 'Split Personality') (Table 3).
ITS sequences appeared to be good markers for reconstructing the phylogenetic history of Ipomoea, since all the clades received highly satisfactory statistical support (Figure 2). Miller et al. (2004) proposed that Ipomoea is formed by two principal clades. Our results are in partial agreement, since Clade I was identified as including I. quamoclit, I. coccinea and 'Ipomoea X Slotari' with strong statistical support, and Clade II as containing the remaining Ipomoea taxa, with I. alba as the basal member of the group. Within Clade II, three clusters were formed, one for I. indica, a second for I. nil and I. nil 'Candy Pink', and a third for I. purpurea and its varieties. All the branches are strongly supported, except for that joining I. nil and I. nil 'Candy Pink'. I. cairica is basal to these clades, and Miller et al. (2004) used this very species as outgroup of the genus. The other species that were studied here, but were not included in the phylogenetic analysis done by Miller et al. (2004), are I. batatas, I. triloba and I. carnea. These three species appear to be basal to Clades I and II. The basal clade of I. batatas and I. triloba is strongly supported.
Numerous transposable elements are known to be involved in the process of variegation in Ipomoea. By leading to wide diversification in flower pigmentation, they also represent an important evolutionary process. One of these elements is Tip100, which is inserted in the CHS-D gene. After extensive searches in the NCBI database with Blastn, Blastx and tBlastx, no similarities between the Tip100 sequence and other transposable elements came to light. Habu et al. (1998) classified Tip100 as a member of the Ac/Ds family (Kunze et al., 1997). However, according to more recent criteria for TE classification (e.g., Wicker et al., 2007), "two elements belong to the same family if they share at least 80% of sequence identity in their coding domain, or within their terminal repeat regions, or in both". Hence, Tip100 would not belong to the Ac/Ds family, since no close similarity was found between Tip100 and the other transposons of this family. Nevertheless, Tip100 sequences and structural characteristics clearly place this element in the hAT superfamily (Kempken and Windhofer, 2001; Rubin et al., 2001). Therefore, we propose that Tip100 belongs to a new TE family, which, to date, has only been observed in the genus Ipomoea.
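The family criterion quoted above can be expressed as a simple rule. The Python sketch below is a simplified illustration that encodes only the quoted 80% identity threshold. It ignores any further conditions of the full classification system, such as minimum alignment length, and the function names are hypothetical.

```python
def percent_identity(aligned1: str, aligned2: str) -> float:
    """Percent identity over aligned positions, ignoring columns with a gap ('-')."""
    if len(aligned1) != len(aligned2):
        raise ValueError("sequences must be aligned to the same length")
    matches = compared = 0
    for a, b in zip(aligned1.upper(), aligned2.upper()):
        if a == "-" or b == "-":
            continue
        compared += 1
        matches += a == b
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100 * matches / compared


def same_family(coding_identity=None, terminal_identity=None, threshold=80.0) -> bool:
    """True if either available identity value (in percent) reaches the threshold."""
    values = (coding_identity, terminal_identity)
    return any(v is not None and v >= threshold for v in values)
```

For example, `same_family(coding_identity=50.0, terminal_identity=60.0)` returns `False`, which corresponds to the situation described for Tip100 and the Ac/Ds elements, where no close similarity was found.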
Recently, Arensburger et al. (2011) undertook a rigorous phylogenetic analysis of the hAT superfamily. They found that this superfamily is formed by two large families, namely Buster and Ac, and also indicated the existence of a third clade, possibly a new family, which currently contains only three members, viz., Tip100 of Ipomoea and two Tip100-related sequences, one from a hydra (H. magnipapillata) and the other from zebrafish (Danio rerio). These findings suggest that this possibly new family may be widely distributed.
The species included in this study are representatives of three Ipomoea subgenera. The well-supported phylogenetic relationships established by ITS analysis are congruent with other studies based on morphological and molecular data (McDonald and Mabry, 1992; Austin and Huáman, 1996; Austin and Bianchini, 1998; Miller et al., 1999, 2002, 2004; Stefanović et al., 2003), in which I. batatas, I. triloba and I. carnea were identified as members of the subgenus Eriospermum, I. purpurea, I. nil and I. indica as members of the subgenus Ipomoea, and I. cairica, I. coccinea, I. quamoclit and I. alba as part of the subgenus Quamoclit. In the present analysis, we found that I. alba is more closely related to species of the subgenus Ipomoea than to those of Quamoclit, as previously proposed by Miller et al. (1999; 2004). Furthermore, I. cairica, formally in the subgenus Ipomoea, appears as outgroup to the clade formed by the subgenera Quamoclit and Ipomoea, although the statistical support for this branch (0.82) is lower than for the other branches.
There is significant consistency between the phylogeny built from the Tip100 sequences and the one constructed from ITS data, the latter including more species and representing the host phylogeny. As Tip100 was found only in a restricted clade of the ITS phylogeny (Figure 2, Clade II), we propose that this element was present in an ancestor of these related species, which implies that Tip100 was vertically transmitted during the evolution of the genus. It was thus more effectively maintained in the subgenus Ipomoea, where it apparently remains more conserved. Although Tip100 was more divergent in I. alba, this is consistent with the basal position of this species in relation to the subgenus Ipomoea, possibly because it has had more time to diverge from the other species of the subgenus. Nevertheless, why Tip100 is restricted to only one cluster in Ipomoea is unknown. A possible explanation for the emergence of this element in these species could be horizontal transfer of Tip100 from an unknown donor to an ancestor of I. alba, I. indica, I. nil and the I. purpurea cluster (Clade II, Figure 2). Horizontal transfer of TEs has been recognized as an important evolutionary force in eukaryotes (Keeling and Palmer, 2008), although few examples have been encountered in plants (Diao et al., 2006; Roulin et al., 2009). Alternatively, the element may have been present in all the other clusters of the genus Ipomoea and subsequently been stochastically lost.
A more plausible explanation for this peculiar TE occurrence could be the presence of Tip100 sequences in other species of the genus that have diverged throughout the evolution and expansion of these plants, since the evolutionary history of the genus Ipomoea is relatively recent, i.e., approximately 35 to 40 million years, as calculated by molecular clock inference (Clegg and Durbin, 2003). Hence, additional studies are required to determine whether TE arrival in this genus was through horizontal transfer, or whether it is an ancient genome component.
We are grateful to Dr Lizandra Jaqueline Robe for her help in the analysis, MSc Luiz Felipe Valter de Oliveira and MSc Gabriel da Luz Wallau for their assistance in experimental procedures and Dr Ronaldo Medeiros Golombieski and MSc Paloma Rubin for undertaking the sequencing reactions. This work was supported by the Brazilian agencies CNPq and FAPERGS.
Akaike H (1974) A new look at the statistical model identification. IEEE Trans Automat Control 19:716-723.
Altschul S, Gish W, Miller W, Myers E and Lipman D (1990) Basic local alignment search tool. J Mol Biol 215:403-410.
Arensburger P, Hice RH, Zhou L, Smith RC, Tom AC, Wright JA, Knapp J, O'Brochta DA, Craig NL and Atkinson PW (2011) Phylogenetic and functional characterization of the hAT transposon superfamily. Genetics 188:145-157.
Austin DF (1975) Typification of the New World subdivisions of Ipomoea L. (Convolvulaceae). Taxon 24:107-110.
Austin DF and Bianchini RS (1998) Additions and corrections in American Ipomoea (Convolvulaceae). Taxon 47:833-838.
Austin DF and Huáman Z (1996) A synopsis of Ipomoea (Convolvulaceae) in the Americas. Taxon 45:3-38.
Baldwin BG (1992) Phylogenetic utility of the internal transcribed spacers of nuclear ribosomal DNA in plants: An example from the Compositae. Mol Phylogenet Evol 1:3-16.
Biémont C and Vieira C (2006) Junk DNA as an evolutionary force. Nature 443:521-524.
Blumenstiel JP (2010) Evolutionary dynamics of transposable elements in a small RNA world. Trends Genet 855:1-9.
Clegg MT and Durbin ML (2003) Tracing floral adaptations from ecology to molecules. Nat Rev Genet 4:206-215.
Diao X, Freeling M and Lisch D (2006) Horizontal transfer of a plant transposon. PLoS Biol 4:119-128.
Feliner GN and Rosselló JA (2007) Better the devil you know? Guidelines for insightful utilization of nrDNA ITS in species-level evolutionary studies in plants. Mol Phylogenet Evol 44:911-919.
Habu Y, Hisatomi Y and Iida S (1998) Molecular characterization of the mutable flaked allele for flower variegation in the common morning glory. Plant J 16:371-376.
Huelsenbeck J and Ronquist F (2001) MrBayes: Bayesian inference of phylogenetic trees. Bioinformatics 17:754-755.
Iida S, Morita Y, Choi J, Park K and Hoshino A (2004) Genetics and epigenetics in flower pigmentation associated with transposable elements in morning glories. Adv Biophys 38:141-159.
Ishikawa N, Johzuka-Hisatomi Y, Sugita K, Ebinuma H and Iida S (2002) The transposon Tip100 from the common morning glory is an autonomous element that can transpose in tobacco plants. Mol Genet Genomics 5:732-739.
Keeling PJ and Palmer JD (2008) Horizontal transfer in eukaryotic evolution. Nat Rev Genet 9:605-618.
Kempken F and Windhofer F (2001) The hAT family: A versatile transposon group common to plants, fungi, animals and man. Chromosoma 110:1-9.
Kunze R, Saedler H and Lönnig W (1997) Plant transposable elements. Adv Bot Res 27:331-470.
Manos PS, Miller RE and Wilkin P (2001) Phylogenetic analysis of Ipomoea, Argyreia, Stictocardia, and Turbina suggests a generalized model of morphological evolution in Morning Glories. Syst Bot 26:585-602.
McDonald JA and Mabry TJ (1992) Phylogenetic systematics of New World Ipomoea (Convolvulaceae) based on chloroplast DNA restriction site variation. Plant Syst Evol 180:243-259.
Miller RE, Buckley TR and Manos PS (2002) An examination of the monophyly of morning glory taxa using Bayesian phylogenetic inference. Syst Biol 51:740-753.
Miller RE, McDonald JA and Manos PS (2004) Systematics of Ipomoea subgenus Quamoclit (Convolvulaceae) based on ITS sequence data and a Bayesian phylogenetic analysis. Am J Bot 91:1208-1218.
Miller RE, Rausher MD and Manos PS (1999) Phylogenetic systematics of Ipomoea (Convolvulaceae) based on ITS and waxy sequences. Syst Bot 24:209-227.
Naito K, Zhang F, Tsukiyama T, Saito H, Hancock NC, Richardson AO, Okumoto Y, Tanisaka T and Wessler SR (2009) Unexpected consequences of a sudden and massive transposon amplification on rice gene expression. Nature 461:1130-1134.
Nylander J (2004) MrModeltest ver. 2. Program distributed by the author. Evolutionary Biology Center, Uppsala University.
Oliveira LFV, Wallau GL and Loreto ELS (2009) Isolation of high quality DNA: A protocol combining "rennet" and glass milk. Electron J Biotech 12:2.
Park K-I, Choi J-D, Hoshino A, Morita Y and Iida S (2004) An intragenic tandem duplication in a transcriptional regulatory gene for anthocyanin biosynthesis confers pale-colored flowers and seeds with fine spots in Ipomoea tricolor. Plant J 38:840-849.
Rice P, Longden I and Bleasby A (2000) EMBOSS: The European Molecular Biology Open Software Suite. Trends Genet 16:276-277.
Roulin A, Piegu B, Fortune PM, Sabot F, D'Hont A, Manicacci D and Panaud O (2009) Whole genome surveys of rice, maize and sorghum reveal multiple horizontal transfers of the LTR-retrotransposon Route66 in Poaceae. BMC Evol Biol 9:e58.
Rubin E, Lithwick G and Levy AA (2001) Structure and evolution of the hAT transposon superfamily. Genetics 158:949-957.
Rychlik W (1992) Oligo ver. 4.1 Primers Analysis Software. National Biosciences Inc., Plymouth.
Sambrook J and Russell DW (2001) Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Laboratory Press, New York.
Slotkin RK and Martienssen R (2007) Transposable elements and the epigenetic regulation of the genome. Nat Rev Genet 8:272-285.
Staden R (1996) The Staden sequence analysis package. Mol Biotechnol 5:233-241.
Stefanović S, Austin DF and Olmstead RG (2003) Classification of Convolvulaceae: A phylogenetic approach. Syst Bot 28:791-806.
Swofford DL (2003) PAUP: Phylogenetic Analysis using Parsimony (and other methods), ver. 4. Sinauer Associates, Massachusetts.
Tamura K, Dudley J, Nei M and Kumar S (2007) MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software ver. 4.0. Mol Biol Evol 24:1596-1599.
Thompson J, Higgins D and Gibson T (1994) CLUSTAL W: Improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res 22:4673-4680.
Wicker T, Sabot F, Hua-Van A, Bennetzen JL, Capy P and Chalhoub B (2007) A unified classification system for eukaryotic transposable elements. Nat Rev Genet 8:973-982.
The following online material is available for this article:
Figure S1 - Nucleotide sequence alignment of the 5' end of the Tip100 transposase CDS.
This material is available as part of the online article from http://www.scielo.br/gmb.
Send correspondence to:
Lenira M.N. Sepel
Departamento de Biologia
Centro de Ciências Naturais e Exatas
Universidade Federal de Santa Maria
Caixa Postal 5050, Agência Campus, Camobi
97105-900 Santa Maria, RS, Brazil
Received: September 21, 2011
Accepted: January 30, 2012.
Associate Editor: Marcia Pinheiro Margis
License information: This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.