Acessibilidade / Reportar erro

Isolation of (GA)n microsatellite sequences and description of a predicted MADS-box sequence isolated from common bean (Phaseolus vulgaris L.)

Abstract

The isolation of (GA)n microsatellites using a highly microsatellite-enriched library is described for the first time in common bean (Phaseolus vulgaris L.). A relatively simple and effective method to isolate DNA repeats from microsatellite-enriched libraries based on hybridization-capture of repeat regions using biotin-conjugated oligonucleotids and non-radioactive colony hybridization was carried out. PCR products from 200 to 800 bp were obtained and cloned. Of the 60 clones sequenced, 21 yielded (GA)n microsatellites with n values equal or higher than six. These (GA)n microsatellite-containing loci could be useful for further genetic mapping studies. A (GA)n microsatellite linked to a putative MADS-box gene was identified. This sequence, which represents the first MADS-box locus described to date in common bean, showed a very high similarity with other known MADS-box sequences and was grouped within the AGL2 subfamily cluster of the Arabidopsis MADS-box genes. The vicinity of microsatellites to some genes is also discussed.

Phaseolus vulgaris L.; genetic markers; microsatellites; MADS-box; transcriptional factor


RESEARCH ARTICLE

Isolation of (GA)n microsatellite sequences and description of a predicted MADS-box sequence isolated from common bean (Phaseolus vulgaris L.)

M.W.F. Yaish; M. Pérez de la Vega

Departamento de Genética, Facultad de Biología, Universidad de León, 24071 León, Spain

Correspondence Correspondence to Marcelino Pérez de la Vega Universidad de León, Facultad de Biología, Area de Genética 24071 León, Spain E-mail: degmpv@unileon.es

ABSTRACT

The isolation of (GA)n microsatellites using a highly microsatellite-enriched library is described for the first time in common bean (Phaseolus vulgaris L.). A relatively simple and effective method to isolate DNA repeats from microsatellite-enriched libraries based on hybridization-capture of repeat regions using biotin-conjugated oligonucleotids and non-radioactive colony hybridization was carried out. PCR products from 200 to 800 bp were obtained and cloned. Of the 60 clones sequenced, 21 yielded (GA)n microsatellites with n values equal or higher than six. These (GA)n microsatellite-containing loci could be useful for further genetic mapping studies. A (GA)n microsatellite linked to a putative MADS-box gene was identified. This sequence, which represents the first MADS-box locus described to date in common bean, showed a very high similarity with other known MADS-box sequences and was grouped within the AGL2 subfamily cluster of the Arabidopsis MADS-box genes. The vicinity of microsatellites to some genes is also discussed.

Key words:Phaseolus vulgaris L., genetic markers, microsatellites, MADS-box, transcriptional factor.

Introduction

Repetitive DNA covers up to 90% of the plant genome (Nagl et al., 1983). Tandemly repetitive DNA is classified into three major classes (Tautz and Renz, 1984): (i) satellite DNA which shows repeat units with a length of up to 300 base pairs (bp), (ii) minisatellite monomers comprised of 9-100 bp, and (iii) microsatellites or simple sequence repeats (SSR) which exhibit repeat units of 1-6 bp in length.SSRs occur ubiquitously and abundantly in eukaryotic genomes. Each microsatellite is usually located at a single locus with large variation in the number of repeats among individuals. Thus, microsatellite loci are often multi-allelic (Saghai-Maroof et al., 1994). In addition to high levels of polymorphism, SSR sequences possess most of the desirable attributes of molecular markers, including information content, unambiguous designation of alleles, neutral selectively (although they can be subjected to hitch-hiking effects), high reproducibility, codominance, and fast and easy assaying of genotypes and therefore microsatellite markers or SSR have proved to be very useful for cultivar identification, pedigree analysis and the evaluation of genetic distance between organisms (Priolli et al., 2002) and genetic mapping (Yu et al., 2000).

The most abundant microsatellite in several well-known mammals is (AC)n (Beckmann and Weber, 1992), while in many plant species they are (AT)n or (AG)n (Wang et al., 1994). A high abundance of (GA)n microsatellites compared to other dinucleotid SSR has been observed in plant genomes such as Oryza, Aegilops, Arabidopsis or Brassica (Gupta and Varshney, 2000; Guyomarc et al., 2002; Suwabe et al., 2002; Uzunova and Ecke, 1999). Previous studies on plant (GA)n microsatellites also show that they are well-distributed throughout the genome, thus ensuring good genome coverage. In many cases, SSR-containing sequences are part (in exons or introns) of, or linked to, some important genes of agronomic interest (see Yu et al., 2000).

The MADS box is a highly conserved sequence motif found in a family of transcriptional factors which play important roles in developmental processes and have been found in species from all the eukaryotic kingdoms. In plants MADS-box genes are scattered throughout the entire genome and encode a family of transcriptional factors which control diverse developmental processes ranging from root to flower and fruit development (Sommer et al., 1990; Theissen et al., 2000). In fact, many of the genes which direct flower development contain a MADS-box domain (Schwarz-Sommer et al., 1992). The MADS-box proteins of plants usually contain other domains, but the MADS domain is by far the most conserved region of these proteins, it is the major determinant of DNA binding, but it also performs dimerization and accessory factor-binding functions (Theissen et al., 2000). The MADS-box is highly conserved, about 56 amino acids long, and is usually found at the N-terminus of the protein (Penueli et al., 1991). The MADS-box gene family is also an important source of plant evolutionary data. For instance, Winter et al. (1999) proved that the gnetophytes are more related to conifers than to flowering plants by constructing phylogenic trees based on MADS-box gene-families, and Theissen et al. (2000) demonstrated that the phylogeny of MADS-box genes was strongly correlated with the origin and evolution of plant productive structures, such as ovules and flowers, by reviewing current knowledge on MADS-box genes in ferns, gymnosperms and different types of angiosperms. Finally, the MADS-box plays a role in the plant-microbial interaction at least in the nodule cells where its expression has been localized in infected alfalfa (Medicago sativa) root nodule cells (Heard and Dunn, 1995). So far, of more than two hundred plant MADS genes described only four have been described in legume species (two in Pisum sativum and two in Medicago sativa). Here we show the first MADS sequence described in Phaseolus to date, a sequence which is linked to an upstream (GA)n repeat.

Materials and Methods

Microsatellite isolation

DNA was extracted from trifoliate leaves of 15-day-old Phaseolus vulgaris L. seedlings, using the method described by Vallejos et al. (1992) with minor modifications. Ten mg of common bean genomic DNA were digested with five U/mg RsaI restriction enzyme (GTAC target site). The size of the DNA fragment obtained was checked by agarose gel electrophoresis. Blunt-end DNA fragments were ligated by T4 DNA ligase (Promega, Madison, USA) to MluI self-complementary adaptors (10 mmol) RSA21 5-CTCTTGCTTACGCGTGGACTA-3 and RSA25 5-AGTCCACGCGTAAGCAAGAGCACA-3 according to Edwards et al. (1996). The ligation was checked by PCR amplification. Five ng of ligated DNA were amplified in a final volume of 25 mL with 1 mL of 10 mmol RSA21 primer in a buffer containing 10 mM TrisHCl, pH 8, 100 mM KCl, 0.05% w/v gelatin and 1.5 mM MgCl2, using the following PCR program: denaturation at 95 °C for 1 min and 28 cycles of 94 °C for 40 s, 60 °C for 60 s, 72 °C for 120 s. The size of the amplified ligated fragments was checked by agarose gel electrophoresis and the rest were purified in anion exchange micro columns (GIBCO-BRL). Microsatellite sequences were selected using biotin-labeled microsatellite oligoprobe and streptavidin-coated magnetic beads, following the hybridization based capture methodology adapted from Kijas et al. (1994) and Billote et al. (1999). Magnetic bead-based selection was carried out using the Magnetosphere Magnetic Separation Product Kit (Promega Madison, USA). The oligoprobe used in this experiment was 5-I*IIIITCTCTCTCTCTCTCTC-3 with the inosine at 5 biotynilated. Purified PCR products were denatured at 95 °C for 15 min before adding three mL of 50 mM biotynilated oligoprobe and 13 mL of 20XSSC. Hybridization was carried out for 20 min at room temperature. Six hundred mg of pre-washed streptavidin-coated magnetic beads were used following the manufacturer's instructions. Aliquots of the resulting solution after the last elution were used as templates for a second PCR round following the same above-mentioned conditions. The purified PCR products were cloned into pGEM-T plasmid (Promega, Madison, USA) following the manufacturer's instructions and the plasmid was used to transform a DH5a competent E. coli strain. About 1,000 colonies were blotted to positive charged nylon membranes (Boehringer Mannheim) for hybridization with a digoxigenin labeled (GA)10 probe. The colony hybridization and washing temperatures were carried out at 37 °C and at room temperature, respectively. Other conditions followed the standard protocols. Chemiluminescent detection was carried out using CSPD® (Boehringer Mannheim). Positive clones were cultured in a liquid LB medium, the plasmids were extracted, and the inserts sequenced by using the dideoxynucleotide chain termination method. Universal and reverse primers, the Thermo Sequenase Fluorescent Labelled Primer Cycle Sequence Kit (Amersham Pharmacia Biotech, N.J.) following the manufacturer's instructions, and automatic sequencing, ALFTM Manager v. 2.6 (Amersham Pharmacia Biotech, N.J.) were used for DNA sequencing. About 60 colonies were tested; the sequences which contained interesting information were repeated at least three times.

Amplification of microsatellite (SSR) sequences from genomic DNA

In order to check the amplification of SSR-containing sequences from common bean genomic DNA, specific primer pairs were designed from sequences flanking SSRs using the OLIGOTEST V 2.0 program (Beroud et al., 1990). Pairs of primers (length 18-24) were designated with annealing temperatures ranging from 50 °C to 60 °C, and a variance limited to 4 °C. PCRs were carried out using 50 ng of DNA in a final volume of 25 mL with 10 pmol of each primer in a buffer containing 10 mM TrisHCl, pH 8, 100 mM KCl, 0.05% w/v gelatin and 1.5 mM MgCl2, and 1.5 units of Taq DNA polymerase (Promega, Madison, USA). DNA was extracted from trifoliate leaves of two weeks old "Jules" plants. The following PCR program was used: denaturation at 95 °C for 1 min and 36 cycles of 94 °C for 40 s, the specific annealing temperature for 40 s, 72 °C for 60 s and a final extension temperature of 72 °C for 10 min. The annealing temperature used for each specific PCR is shown in Table 1.

MADS-box sequence analysis

MADS-box comparative sequence analysis was carried out with the CLUSTAL W method (Thompson et al., 1994) using the standard parameters suggested in the program. The DNA Maximum Likelihood (Dnaml) (Felsenstein, 1993) method was used to construct the phylogenetic tree. Database Searching was carried out using Blast web page http://www.ncbi.nlm.nih.gov/blastfrom the National Library of Science, USA. Some information was also obtained through the MADS-box home page: http://www. mpiz-koeln.mpg.de/madsfrom the "Max-Planck-Institut für Züchtungsforschung", Köln, Germany. SSRs were screened within the sequence using the Simple Sequence Repeat Identification Tool (SSRIT), which is available on the web page address http://ars-genome.cornell.edu/rice/tools.htmlfrom the Rice Genome Data Base of Cornell University, USA.

Results and Discussion

Characteristic of (GA)n microsatellites of common bean

Small DNA fragments (200-800 bp) were obtained by means of the PCR amplification. These fragments were cloned and the positive clones were easily identified using the colony hybridization method.A total of 21 out of 60 sequenced inserts contained (GA)n motifs of different sizes, arrangements and types (Table 1). Dimeric, trimeric, tetrameric tandem nucleotide repeat motifs were identified. A combination of several types of microsatellites (SSR) was noted within a sequence, which is a common feature in microsatellites isolated for other different plant species (Wang et al., 1994). This is probably due to the fact that microsatellites are highly mutable. Some sequences contained a relatively simple arrangement of microsatellites (e.g. 14M1 which contains a (GA)19 motif ). However, other sequences had motifs that were distributed in complex and multiple arrays (e.g. 210M3) (Table 1). Several of these complex motifs were imperfect repetitions of the basic motif and were most likely generated by microsatellite instability.

The majority of SSR-containing sequences (17 out of 21) were amplified from common bean genomic DNA (Figure 1) using primer combinations indicated in Table 1. For the other four sequences no product was obtained or non-specific products were observed at lower annealing temperatures.


Description of a MADS-box sequence

A DNA sequence 350 bp long, which included the self-complementary adapters on both sides, a GA microsatellite, and a MADS-box sequence, was isolated. The microsatellite included two perfect GA motifs, (GA)10 and (GA)7, which are part of an imperfect (GA)26 sequence. The current sequence differed from the (GA)26 by the addition of an A and three base substitutions (Figure 2). Upstream from this GA region was a sequence rich in A + T pairs (65%). The methionine codon of the MADS-box was immediately downstream from the GA microsatellite sequence (Figure 2). Analysis of the deduced amino acid sequence and a search for the available protein sequence in data bases revealed highly significant similarity with other MADS-box peptide sequences, i.e., it was identical to DEFH72 isolated from Antirrhinum majus, NSMADS3 from Nicotiana sylvestris, AGL9 from Arabidopsis thaliana, and MDMADS1 from Malus domestica (Figure 3).



The comparison of the DNA sequence of the Phaseolus MADS motif (PVMADS) with Arabidopsis known MADS sequence motifs showed that PVMADS is related to the AGL2 group of the MADS-box gene family (Figure 4). The Agl2 family of MADS-box genes are normally involved in floral growth and development (Theissen et al., 2000). The highest similarity of the MADS-box sequence and the linked region was with the DEFH72 locus of Antirrhinum majus, which is also linked to a shorter microsatellite motif in its upstream direction (i.e. an imperfect (GA)11 motif). The MADS sequence here is the first reported in Phaseolus, but it is very likely that there are additional MADS-box genes distributed throughout the bean genome since plant species have a large number of MADS-box genes, for instance, in Arabidopsis the MADS-box gene family consists of at least 28 different genes, and in maize at least 50 different MADS-box genes are dispersed in its genome (Fischer et al., 1995).


Vicinity of satellite elements to some important genes

This Phaseolus putative MADS-box gene is located downstream from an imperfect (GA)21 microsatellite (Figure 2). The proximity of microsatellite elements to some important genes in common bean has been reported (Yu et al., 2000) as well as in other plant species. For example, Li et al. (2000), reported a polymorphic microsatellite linked to the dextrinase gene of barley, and a satellite sequence is linked to the rDNA of lentil (Fernández, 2002) and common bean (Falquet et al., 1997). Furthermore, several important plant resistance genes are linked to SSR or Inter Simple Sequence Repeats (ISSR) (e.g., Yu et al., 1996). Thus, the linkage of microsatellite motifs to some important genes is relevant for several genetically applied studies. Microsatellites are ideal markers. However, the development of microsatellite markers is expensive and time consuming. In our study, we show a new microsatellite set in Phaseolus that can be useful for gene mapping and other purposes.

Acknowledgements

This work was supported by the FEDER project 1FD97-0308-CO3-03 and a personal Ph.D. grant to M.W.F. Yaish from the Spanish AECI.

Editor: Marcio de Castro Silva-Filho

Received: November 7, 2002

Accepted: May 12, 2003

  • Beckmann JS and Weber JL (1992) Survey of human and rat microsatellites. Genomics 12:627-631.
  • Beroud C, Antignac C, Jeanpierre C and Junien C (1990) Un programme informatique pour reserche d'amorces pour l'amplification par PRC. Medicine/Sciences 6:901-903.
  • Billote N, Lagoda PJL, Risterucci, A-M and Baurens F-C (1999) Microsatellite-enriched libraries: applied methodology for the development of SSR markers in tropical crops. Fruits 54:277-288.
  • Edwards KJ, Baker JHA, Daly A, Jones C and Karp A (1996) Microsatellite libraries enriched for several microsatellite sequences in plants. Biotechniques 20:758-760.
  • Falquet J, Creusot F and Dron M (1997) Molecular analysis of Phaseolus vulgaris rDNA unit and characterisation of a satellite DNA homologous to IGS subrepeats. Plant Physiol Biochem 35:611-622.
  • Felsenstein J (1993) PHYLIP (phylogeny inference package). Version 3.5c. Distributed by the author, Department of Genetics, University of Washington, Seattle, USA.
  • Fernández M (2002) Variabilidad genética en Lens (Miller): Análisis de los espaciadores intergénicos ribosomales. Ph. D. Dissertation, Univ. de León, Spain.
  • Fischer A, Baum N, Saedler H and TheiBen G (1995) Chromosomal mapping of the MADS-box multigene family in Zea mays reveals dispersed distribution of allelic genes as well as transposed copies. Nucl Acids Res 23:1901-1911.
  • Gupta, PK and Varshney RK (2000) The development and use of microsatellite markers for genetic analysis and plant breeding with emphasis on bread wheat. Euphytica 113:163-185.
  • Guyomarc H, Sourdille P, Charmet G, Edwards KJ and Bernard M (2002) Characterization of polymorphic microsatellite markers from Aegilops tauschii and transferability to the D-genome of bread wheat. Theor Appl Genet 104:1164-1172.
  • Heard J and Dunn K (1995) Symbiotic induction of a MADS-box gene during development of alfalfa root nodules. Proc Natl Acad. Sci USA 92:5273-5277.
  • Kijas JMH, Fowler JCS, Garbett CA and Thomas MR (1994) Enrichment of microsatellite from citrus genome using biotinylated oligonucleotid sequences bound to straptavidin-coated magnetic particles. Biotechniques 16:656-662.
  • Li CD, Zang XQ, Eckstein P, Rossnagel BG and Scoles GJ (2000) A polymorphic microsatellite in the limit dextrinase gene of barley (Hordeum vulgare L.). Mol Breed 5:569-577.
  • Nagl W, Jeanjour M, Kling H, Kühner S, Michels I, Müller T and Stein B (1983) Genome and chromatin organisation in higher plants. Biol Zentralbl 102:129-148.
  • Penueli L, Abu-abeid M, Zamir D, Nacken W, Schawarz-Sommer Z and Lifschitz E (1991) The MADS-box gene family in tomato: temporal expression during floral development, conserved secondary structures, and homology with homeotic genes from Antirrhinum and Arabidopsis Plant J 1:255-266.
  • Priolli RHG, Mendes-Junior CT, Arantes NE and Contel EPB (2002) Characterization of Brazilian soybean cultivars using microsatellite markers. Genet Mol Biol 25:185-193.
  • Saghai-Maroof MA, Biyashev RM, Yang GP, Zang Q and Allard RW (1994) Extraordinarily polymorphic microsatellites DNA in barley species diversity, chromosomal locations, and population dynamics. Proc Natl Acad Sci USA 91:5466-6470.
  • Schwarz-Sommer Z, Hue I, Huijser R, Flor RJ, Hansen R, Tetens E, Lönnig W, Saedler H and Sommer H (1992) Characterisation of the Antirrhinum floral homeotic MADS-box gene defeciens; evidence for DNA binding and autoregulation of its persistent expression throughout flower development. EMBO J 11:251-263.
  • Sommer H, Beltran JR, Huijser R, Lönnig WE, Saedler H, Sommer H and Schwarz-Sommer Z (1990) Defeciens; a homeotic gene involved in the control of flower morphogenesis in Antirrhinum majus: the protein shows homology to transcriptional factors. EMBO J 9:605-613.
  • Suwabe K, Iketani H, Numone T, Kage T and Hirai M (2002) Isolation and characterization of microsatellites in Bassica rapa L. Theor Appl Genet 104:1092-1098.
  • Tautz D and Renz M (1984) Simple sequence repeats are ubiquitous repetitive components of eukaryotic genomes. Nucl Acids Res 12:4127-4137.
  • Theissen G, Becker A, Di Rosa A, Kanno A, Kim JT, Münster T, Winter K and Saelder H (2000) A short history of MADS-box genes in plants. Plant Mol Biol 42:115-149.
  • Thompson JD, Higgins DG and Gibson TJ (1994) Clustal W improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucl Acids Res 22:4673-4680.
  • Uzunova MI and Ecke W (1999) Abundance, polymorphism and genetic mapping of microsatellites in oilseed rape (Brassica napus L.). Plant Breeding 118:323-326.
  • Vallejos CE, Sakiyama NS and Chas CD (1992) A molecular marker-based linkage map of Phaseolus vulgaris L. Genetics 131:731-720.
  • Wang Z, Weber JL, Zhong G and Tanksley SD (1994) Survey of plant short tandem DNA repeats. Theor Appl Genet 88:1-6.
  • Winter KU, Becker A, Münster T, Kim JT, Saeldler H and Theissen G (1999) MADS-box genes reveal that gnetophytes are more closely related to conifers than to flowering plants. Proc Natl Acad Sci. USA 96:7342-7347.
  • Yu K, Park J, Poysa V and Gepts P (2000) Integration of Simple Sequence Repeats (SSR) markers into a molecular linkage map of common bean (Phaseolus vulgaris). J Hered 91:429-434.
  • Yu YG, Saghai-Maroof MA and Buss GR (1996) Divergence and allelomorphic relationship of a soybean virus resistance gene based on tightly linked DNA microsatellite and RFLP markers. Theor Appl Genet 92:64-69.
  • Correspondence to
    Marcelino Pérez de la Vega
    Universidad de León, Facultad de Biología, Area de Genética
    24071 León, Spain
    E-mail:
  • Publication Dates

    • Publication in this collection
      29 Sept 2003
    • Date of issue
      2003

    History

    • Received
      07 Nov 2002
    • Accepted
      12 May 2003
    Sociedade Brasileira de Genética Rua Cap. Adelmio Norberto da Silva, 736, 14025-670 Ribeirão Preto SP Brazil, Tel.: (55 16) 3911-4130 / Fax.: (55 16) 3621-3552 - Ribeirão Preto - SP - Brazil
    E-mail: editor@gmb.org.br