Acessibilidade / Reportar erro

Evolution and diversification of the O-methyltransferase (OMT) gene family in Solanaceae

Abstract

O-methyltransferases (OMTs) are a group of enzymes involved in several fundamental biological processes in plants, including lignin biosynthesis, pigmentation, and aroma production. Despite the intensive investigation of the role of OMTs in plant secondary metabolism, the evolution and diversification of this gene family in Solanaceae remain poorly understood. Here, we conducted a genome-wide survey of OMT genes in six Solanaceae species, reconstructing gene phylogenetic trees, predicting the potential involvement in biological processes, and investigating the exon/intron structure and chromosomal location. We identified 57 caffeoyl-CoA OMTs (CCoAOMTs) and 196 caffeic acid OMTs (COMTs) in the studied species. We observed a conserved gene block on chromosome 2 that consisted of tandem duplicated copies of OMT genes. Our results suggest that the expansion of the OMT gene family in Solanaceae was driven by whole genome duplication, segmental duplication, and tandem duplication, with multiple genes being retained by neofunctionalization and subfunctionalization. This study represents an essential first step in unraveling the evolutionary history of OMTs in Solanaceae. Our findings deepen our understanding of the crucial role of OMTs in several biological processes and highlight their significance as potential biotechnological targets.

Keywords:
Anthocyanin; flavonoid; functional diversification; secondary metabolites; Solanum

Introduction

S-adenosyl-l-methionine (SAM)-dependent O-methyltransferases (OMTs) are a diverse group of multifunctional enzymes that catalyze the transfer of a methyl group from SAM to multiple acceptor molecules, producing the corresponding methyl ether derivatives (Lam et al., 2007Lam KC, Ibrahim RK, Behdad B and Dayanandan S (2007) Structure, function, and evolution of plant O-methyltransferases. Genome 50:1001-1013.). This gene family can be divided into two subfamilies according to the substrate they methylate, their protein characteristics, and conserved motifs: caffeoyl-CoA O-methyltransferases (CCoAOMTs) are mainly responsible for caffeoyl-CoA methylation, whereas caffeic acid O-methyltransferases (COMTs) uses caffeic acid and a myriad of other molecules as a substrate (Davin and Lewis, 1992Davin LB and Lewis NG (1992) Phenylpropanoid metabolism: Biosynthesis of monolignols, lignans and neolignans, lignins and suberins. In: Stafford HA and Ibrahim RK (eds) Phenolic metabolism in plants. Springer, Boston, pp 325-375.; Joshi and Chiang, 1998Joshi CP and Chiang VL (1998) Conserved sequence motifs in plant S-adenosyl-l-methionine-dependent methyltransferases. Plant Mol Biol 37:663-674.; Li et al., 2006Li HM, Rotter D, Hartman TG, Pak FE, Havkin-Frenkel D and Belanger FC (2006) Evolution of novel O-methyltransferases from the Vanilla planifolia caffeic acid O-methyltransferase. Plant Mol Biol 61:537-552.).

OMT proteins play significant roles in several fundamental biological processes in plants. These proteins are involved in the biosynthesis of diverse essential metabolites for plant growth, development, and defense-including alkaloids, flavonoids, lignin, and phenylpropanoids (Lam et al., 2007Lam KC, Ibrahim RK, Behdad B and Dayanandan S (2007) Structure, function, and evolution of plant O-methyltransferases. Genome 50:1001-1013.). Notably, OMT proteins are crucial for lignin biosynthesis, where CCoAOMT and COMT proteins contribute to the process (Bonawitz and Chapple, 2010Bonawitz ND and Chapple C (2010) The genetics of lignin biosynthesis: Connecting genotype to phenotype. Annu Rev Genet 44:337-363.). However, the scope of OMT protein functionality extends far beyond lignin biosynthesis, as OMT proteins are also involved in flower pigmentation (Akita et al., 2011Akita Y, Kitamura S, Hase Y, Narumi I, Ishizaka H, Kondo E, Kameari N, Nakayama M, Tanikawa N, Morita Y et al. (2011) Isolation and characterization of the fragrant cyclamen O-methyltransferase involved in flower coloration. Plant 234:1127-1136.) and aroma production (Wu et al., 2003Wu S, Watanabe N, Mita S, Ueda Y, Shibuya M and Ebizuka Y (2003) Two O-methyltransferases isolated from flower petals of Rosa chinensis var. spontanea involved in scent biosynthesis. J Biosci Bioeng 96:119-128.), stress response (Hafeez et al., 2021Hafeez A, Gě Q, Zhāng Q, Lǐ J, Gōng J, Liú R, Shí Y, Shāng H, Liú À, Iqbal MS et al. (2021) Multi-responses of O-methyltransferase genes to salt stress and fiber development of Gossypium species. BMC Plant Biol 21:37.), and melatonin biosynthesis (Byeon et al., 2014Byeon Y, Lee HY, Lee K and Back K (2014) Caffeic acid O-methyltransferase is involved in the synthesis of melatonin by methylating N-acetylserotonin in Arabidopsis. J Pineal Res 57:219-227.). These findings indicate that OMT proteins could be involved in various biological processes, many of which remain unknown. The broad range of protein functions makes them potentially crucial targets for genetic manipulation to enhance plant productivity, defense, and adaptation to environmental stresses (Struck et al., 2012Struck A-W, Thompson ML, Wong LS and Micklefield J (2012) S-adenosyl-methionine-dependent methyltransferases: Highly versatile enzymes in biocatalysis, biosynthesis and other biotechnological applications. Chem Biochem 13:2642-2655.).

Solanaceae, known as the nightshade family, is one of the largest and most diverse plant families, comprising over 90 genera and 3000 species (Särkinen et al., 2013Särkinen T, Bohs L, Olmstead RG and Knapp S (2013) A phylogenetic framework for evolutionary study of the nightshades (Solanaceae): A dated 1000-tip tree. BMC Evol Biol 13:214.; Gebhardt, 2016Gebhardt C (2016) The historical role of species from the Solanaceae plant family in genetic research. Theor Appl Genet 129:2281-2294.). Many species in this family have great economic significance, including tomato, potato, eggplant, tobacco, and chili peppers. With the recent advances in genomics and the availability of entire genomes, the evolutionary history of the OMT gene family has been explored in various crops, such as Citrus sinensis (Liu et al., 2016Liu X, Luo Y, Wu H, Xi W, Yu J, Zhang Q and Zhou Z (2016) Systematic analysis of O-methyltransferase gene family and identification of potential members involved in the formation of O-methylated flavonoids in Citrus. Gene 575:458-472.), Vitis vinifera (Lu et al., 2022Lu S, Zhuge Y, Hao T, Liu Z, Zhang M and Fang J (2022) Systematic analysis reveals O-methyltransferase gene family members involved in flavonoid biosynthesis in grape. Plant Physiol Biochem 173:33-45.), and Gossypium spp. (Hafeez et al., 2021Hafeez A, Gě Q, Zhāng Q, Lǐ J, Gōng J, Liú R, Shí Y, Shāng H, Liú À, Iqbal MS et al. (2021) Multi-responses of O-methyltransferase genes to salt stress and fiber development of Gossypium species. BMC Plant Biol 21:37.). The function and substrate of OMT enzymes have been extensively studied in Arabidopsis thaliana and Oryza sativa, resulting in 12 and 11 functionally annotated and curated entries in UniProt (2023UniProt (2023) Find your protein, UniProt (2023) Find your protein, https://www.uniprot.org/ (accessed 1 March 2023).
https://www.uniprot.org/...
), primarily associated with lignin production. However, the evolution, diversification, and potential functions of OMT genes are still unknown in Solanaceae.

In this study, we conducted a comprehensive survey of the OMT gene family in Solanaceae genomes to gain insights into their evolutionary history. Specifically, we focused on six species representing the most significant genera in Solanaceae, to achieve four main objectives: 1) reconstructing the phylogeny of CCoAOMT and COMT subfamilies; 2) predicting their potential involvement in biological and molecular processes; 3) identifying their potential substrates; and 4) investigating their exon/intron structure and chromosomal location. This study is the first step toward unraveling the evolutionary history of the OMT gene family in Solanaceae. Our findings pave the way for further research into the crucial role of this gene family in various essential biological processes.

Material and Methods

Identification and filtering of OMT genes

We downloaded the predicted protein sequences of 23 Solanaceae annotated genomes (Table S1 Table S1 - Gene count of raw and filtered datasets, as well as the reference for all 23 Solanaceae species. ) available on the Sol Genomics Network (2023Sol Genomics Network (2023) Welcome to the Solanaceae Genomics Network, Sol Genomics Network (2023) Welcome to the Solanaceae Genomics Network, https://solgenomics.net (accessed 23 February 2023).
https://solgenomics.net...
) (Fernandez-Pozo et al., 2015Fernandez-Pozo N, Menda N, Edwards JD, Saha S, Tecle IY, Strickler SR, Bombarely A, Fisher-York T, Pujar A, Foerster H et al. (2015) The Sol Genomics Network (SGN)-from genotype to phenotype to breeding. Nucleic Acids Res 43:D1036-D1041.) and the online database National Center for Biotechnology Information (NCBI, 2023NCBI (2023) National Center for Biotechnology Information, NCBI (2023) National Center for Biotechnology Information, https://www.ncbi.nlm.nih.gov/ (accessed 23 February 2023).
https://www.ncbi.nlm.nih.gov/...
). To retrieve the candidate OMT genes, we used the Pfam (2023Pfam (2023) Pfam (2023) http://pfam.xfam.org (accessed 24 February 2023).
http://pfam.xfam.org...
) domains PF00891 (COMT subfamily) and PF01596 (CCoAOMT subfamily) as queries in HMMER v.3.2.1 using the hmmsearch tool with a cutoff value of 0.01. In the final dataset we kept only the sequences longer than 200 and 180 amino acid residues (aa) for the COMT and CMCoAOMT subfamilies, respectively.

After observing the initial results, which revealed that congeneric sequences overall clustered together on the phylogenetic tree (see Results section for further details), we conducted downstream analyses using a single representative of each genus. These representatives included Capsicum annuum, Datura stramonium, Iochroma cyaneum, Nicotiana attenuata, Petunia axillaris, and Solanum lycopersicum (Table 1). When we selected the representatives of genera with multiple species, we prioritized diploid species and those with a genomic assembly at the chromosomal level to capture the segmental evolutionary process instead of whole duplication processes.

Table 1-
Gene count and reference for each species in the filtered dataset.

Phylogeny reconstruction

We selected four outgroup species for which the OMT gene family has been well-characterized to include in the phylogeny: Arabidopsis thaliana (Hafeez et al., 2021Hafeez A, Gě Q, Zhāng Q, Lǐ J, Gōng J, Liú R, Shí Y, Shāng H, Liú À, Iqbal MS et al. (2021) Multi-responses of O-methyltransferase genes to salt stress and fiber development of Gossypium species. BMC Plant Biol 21:37.), Citrus sinensis (Liu et al., 2016Liu X, Luo Y, Wu H, Xi W, Yu J, Zhang Q and Zhou Z (2016) Systematic analysis of O-methyltransferase gene family and identification of potential members involved in the formation of O-methylated flavonoids in Citrus. Gene 575:458-472.), Populus trichocarpa (Barakat et al., 2011Barakat A, Choi A, Yassin NBM, Park JS, Sun Z and Carlson JE (2011) Comparative genomics and evolutionary analyses of the O-methyltransferase gene family in Populus. Gene 479:37-46.; Lu et al., 2022Lu S, Zhuge Y, Hao T, Liu Z, Zhang M and Fang J (2022) Systematic analysis reveals O-methyltransferase gene family members involved in flavonoid biosynthesis in grape. Plant Physiol Biochem 173:33-45.), and Vitis vinifera (Lu et al., 2022Lu S, Zhuge Y, Hao T, Liu Z, Zhang M and Fang J (2022) Systematic analysis reveals O-methyltransferase gene family members involved in flavonoid biosynthesis in grape. Plant Physiol Biochem 173:33-45.). References for each dataset are listed in Table S1 Table S1 - Gene count of raw and filtered datasets, as well as the reference for all 23 Solanaceae species. . When genes showed splicing variation, we kept only the longest sequence. The dataset was aligned using MAFFT v.7 (Katoh et al., 2019Katoh K, Rozewicki J and Yamada KD (2019) MAFFT online service: Multiple sequence alignment, interactive sequence choice and visualization. Brief Bioinform 20:1160-1166.) and trimmed with trimAL (Capella-Gutiérrez et al., 2009Capella-Gutiérrez S, Silla-Martínez JM and Gabaldón T (2009) trimAl: A tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics 25:1972-1973.) using the -gt parameter of 0.25. We used ModelFinder (Kalyaanamoorthy et al., 2017Kalyaanamoorthy S, Minh BQ, Wong TKF, von Haeseler A and Jermiin LS (2017) ModelFinder: Fast model selection for accurate phylogenetic estimates. Nat Methods 14:587-589.) and the Bayesian information criterion (BIC) score to select the best-fit substitution model as implemented in the IQ-TREE webserver (Trifinopoulos et al., 2016Trifinopoulos J, Nguyen L-T, von Haeseler A and Minh BQ (2016) W-IQ-TREE: A fast online phylogenetic tool for maximum likelihood analysis. Nucleic Acids Res 44:W232-W235.). We used the IQ-TREE webserver to construct a maximum likelihood phylogenetic tree with 1000 ultrafast bootstrap replications (Hoang et al., 2018Hoang DT, Chernomor O, von Haeseler A, Minh BQ and Vinh LS (2018) UFBoot2: Improving the ultrafast bootstrap approximation. Mol Biol Evol 35:518-522.) for a dataset encompassing sequences from both subfamilies. Considering the reciprocal monophyly of each subfamily (Figure S1 Figure S1 - Maximum likelihood genealogy of the complete dataset shows the reciprocal monophyly of both CCoAOMT and COMT subfamilies. ), we proceeded to construct separate trees for each subfamily.

Function and substrate prediction

To investigate the OMT proteins’ putative functions and substrates, we used the principle that phylogenetically related genes often share similar functions and act on similar substrates (Lu et al., 2022Lu S, Zhuge Y, Hao T, Liu Z, Zhang M and Fang J (2022) Systematic analysis reveals O-methyltransferase gene family members involved in flavonoid biosynthesis in grape. Plant Physiol Biochem 173:33-45.). For this, we downloaded plant sequences for 35 functionally characterized CCoAOMTs and 75 COMTs (Table S2 Table S2 - Functionally characterized OMT proteins downloaded from UniProt. ) from UniProt (2023UniProt (2023) Find your protein, UniProt (2023) Find your protein, https://www.uniprot.org/ (accessed 1 March 2023).
https://www.uniprot.org/...
). We only included sequences that have been manually reviewed by Swiss-Prot. Based on these 110 sequences, we constructed phylogenetic trees using IQ-TREE to predict their functions and substrates through phylogenetic clustering.

Gene structure and chromosomal location

We calculated proteins’ molecular weight and isoelectric point with the ProtParam tool of Expasy (2023ProtParam tool of Expasy (2023) ProtParam tool of Expasy (2023) https://web.expasy.org/protparam/ (accessed 3 April 2023).
https://web.expasy.org/protparam/...
). To understand the intron-exon organization of OMT genes, we created a neighbor-joining tree from the six Solanaceae genera with MEGA11 (Tamura et al., 2021Tamura K, Stecher G and Kumar S (2021) MEGA11: Molecular Evolutionary Genetics Analysis version 11. Mol Biol Evol 38:3022-3027.) and 1000 bootstrap replicates. To visualize the gene structure, we retrieved the GFF file of each species and used the online Gene Structure Display Server (Guo et al., 2007Guo A-Y, Zhu Q-H, Chen X and Luo J-C (2007) GSDS: A gene structure display server. Hereditas 29:1023.; Hu et al., 2015Hu B, Jin J, Guo A-Y, Zhang H, Luo J and Gao G (2015) GSDS 2.0: An upgraded gene feature visualization server. Bioinformatics 31:1296-1297.). We searched for conserved motifs with MEME Suite (Bailey et al., 2015Bailey TL, Johnson J, Grant CE and Noble WS (2015) The MEME suite. Nucleic Acids Res 43:W39-W49.) up to a maximum number of six motifs for each subfamily. After successfully identifying the motifs, we used the PROSITE tool (2023PROSITE tool (2023) ScanProsite tool, PROSITE tool (2023) ScanProsite tool, http://prosite.expasy.org/scanprosite/ (accessed 24 February 2023).
http://prosite.expasy.org/scanprosite/...
) to predict the features of these motifs. We conducted a search with motif widths ranging from six to 60. We visualized the chromosomal location of identified OMT genes using TBtools (Chen et al., 2020Chen C, Chen H, Zhang Y, Thomas HR, Frank MH, He Y and Xia R (2020) TBtools: An integrative toolkit developed for interactive analyses of big biological data. Mol Plant 13:1194-1202.), except for D. stramonium and P. axillaris, for which a chromosomal-level assembly is unavailable.

Results

Identification of OMT genes and preliminary phylogenetic clustering

Our study focused on 23 species of Solanaceae, and we used HMMER to identify CCoAOMT and COMT proteins, of which we found 360 and 1001 sequences, respectively. We then applied stringent filters to exclude sequences that did not meet our criteria and included outgroup sequences, resulting in a final dataset of 273 CCoAOMT sequences and 974 COMT sequences (see Table S1 Table S1 - Gene count of raw and filtered datasets, as well as the reference for all 23 Solanaceae species. for a detailed view of the number of genes per species). Phylogenetic trees based on the CCoAOMT (Figure S2 Figure S2 - Maximum likelihood genealogy of CCoAOMT proteins from a comprehensive dataset including 23 Solanaceae species. ) and COMT (Figure S3 Figure S3 - Maximum likelihood genealogy of CCoAOMT proteins from a comprehensive dataset including 23 Solanaceae species. ) proteins revealed that sequences from the same genus tended to cluster together. Thus, to better visualize the evolutionary patterns of OMTs in Solanaceae, we selected only one representative species per genus.

Evolutionary patterns in representative species of Solanaceae

The HMMER (2023HMMER (2023) HMMER: biosequence analysis using profile hidden Markov models, HMMER (2023) HMMER: biosequence analysis using profile hidden Markov models, http://hmmer.org/ (accessed 24 February 2023).
http://hmmer.org/...
) search identified 340 OMT protein sequences belonging to the two subfamilies through the six Solanaceae genera (Table 1). Subsequently, we filtered out sequences that lacked a Pfam domain or were too short (Table S3 Table S3 - Structural and molecular characteristics of OMT proteins of six Solanaceae representative species. ), keeping 57 CCoAOMT and 196 COMT sequences. Upon addition of outgroup sequences, the CCoAOMT final dataset comprised 86 CCoAOMT and 331 COMT protein sequences. The number of filtered sequences varied between 34 to 55 per representative species, with I. cyaneum exhibiting the highest count.

Phylogenetic relationships and function prediction

To analyze the evolutionary relationships and duplication/loss patterns of the OMT genes, we constructed a maximum likelihood tree using protein sequences for each subfamily and divided the tree into monophyletic subgroups for results’ better visualization. The CCoAOMT subfamily tree (Figure 1) revealed that each group presented orthologs of all six species, except group IV, which lacks P. axillaris genes, and groups I and VI, which lack N. attenuata genes. Such absence of the sequences must be taken with caution as it could be the result of issues during genome assembly, gene annotation, or failure to meet our screening criteria (e.g., minimum sequence length and motif presence). Notably, our analysis also highlights gene duplication in particular clades. For instance, in group II, we observe several CCoAOMT copies in C. annuum, I. cyaneum, and S. lycopersicum, with all copies located in chromosome (chr) 2 in these three species.

Figure 1-
Maximum likelihood genealogies of CCoAOMT proteins from selected Solanaceae species. Outer curves indicate the identified groups. To improve visualization, clades including only outgroup sequences were collapsed. Branch tips are color-coded according to species. The tree is midpoint rooted. Gray squares represent collapsed nodes of the outgroups Arabidopsis thaliana (Ath), Citrus sinensis (Csi), Populus trichocarpa (Ptr), and Vitis vinifera (Vvi).

The COMT subfamily tree (Figure 2) showed a similar pattern to that observed in the CCoAOMT subfamily tree. Most groups had at least one representative of all genera, indicating relatively few loss events throughout evolutionary history. However, we also observed evidence of numerous recent gene duplications in species of different groups, such as P. axillaris in groups II and V, I. cyaneum in group III, and N. attenuata in group V.

Figure 2-
Maximum likelihood genealogies of COMT proteins from selected Solanaceae species. Outer curves indicate the identified groups. To improve visualization, clades including only outgroup sequences were collapsed. Branch tips are color-coded according to species. The tree is midpoint rooted. Gray squares represent collapsed nodes of the outgroups Arabidopsis thaliana (Ath), Citrus sinensis (Csi), Populus trichocarpa (Ptr), and Vitis vinifera (Vvi).

The phylogenetic tree containing the Solanaceae CCoAOMT proteins and the functionally characterized proteins (Figure S4 Figure S4 - Maximum likelihood genealogy of CCoAOMT proteins from the function dataset, including the six Solanaceae species and the 35 functionally characterized proteins from UniProt. ) revealed that all identified groups clustered with proteins involved in methylation and lignin synthesis (Table S4 Table S4 - Putative substrate, biological and molecular processes associated with the phylogenetic groups of the OMT family of Solanaceae species. ). The clades showed that these proteins are involved in metal ion binding and SAM-dependent methyltransferase activity in addition to the caffeoyl-CoA O-methyltransferase activity, as the name of the subfamily suggests. Regarding the biological function of CCoAOMTs in Solanaceae, groups II and III would be potentially involved in circadian rhythm, leaf volatile biosynthesis, and phenylpropanoid metabolism. In turn, group VI encompassed the proteins associated with the highest diversity of biological processes, including seed development, pigmentation, and biosynthesis of cyanidin, delphinidin, and spermidine hydroxycinnamate.

The COMT subfamily is expected to have a broader range of biological and molecular functions and act on a greater variety of substrates due to its larger number of genes (Figure S5 Figure S5 - Maximum likelihood genealogy of COMT proteins from the function dataset, including the six Solanaceae species and the 75 functionally characterized proteins from UniProt. ; Table S4 Table S4 - Putative substrate, biological and molecular processes associated with the phylogenetic groups of the OMT family of Solanaceae species. ). The phylogenetic clustering with functionally characterized proteins showed that all COMT groups of Solanaceae would be involved in methylation and alkaloid metabolism and biosynthesis. However, lignin biosynthesis-one of the most extensively studied processes of the OMT family-was only associated with groups IV and V of the COMT subfamily. Moreover, our findings showed that, in Solanaceae, these proteins might respond to adverse conditions such as wounds, cold, and high light intensity, as well as metabolic processes involving phenylpropanoids and isoflavonoids. Group V, which can act on several substrates, had the highest number of biological functions, including response to phytohormones such as salicylic acid, ethylene, and jasmonic acid.

Gene structure and chromosomal location

After removing the short sequences, the CCoAOMT length of the six Solanaceae species varied from 185 to 685 aa, whereas COMT length ranged from 204 to 888 aa (Table S3 Table S3 - Structural and molecular characteristics of OMT proteins of six Solanaceae representative species. ). The number of introns also varied greatly within the subfamilies: zero to 16 in CCoAOMT (Figure S6 Figure S6 - Gene structure and protein motif composition of CCoAOMT proteins from selected Solanaceae species. Introns were rescaled to the same length. ) and zero to 12 in COMT (Figure S7 Figure S7 - Gene structure and protein motif composition of COMT proteins from selected Solanaceae species. ). The MEME (2023MEME Suite (2023) Multiple Em for Motif Elicitation, MEME Suite (2023) Multiple Em for Motif Elicitation, https://meme-suite.org/meme/tools/meme (accessed 30 March 2023).
https://meme-suite.org/meme/tools/meme...
) search for conserved motifs revealed that most CCoAOMT genes have the six conserved motifs (Figure S6 Figure S6 - Gene structure and protein motif composition of CCoAOMT proteins from selected Solanaceae species. Introns were rescaled to the same length. ), except for some proteins that seem to have undergone duplication (i.e., motifs are present twice in the protein) and four sequences that have low or no probability of having a motif. The motifs in the COMT subfamily follow a similar pattern (Figure S7 Figure S7 - Gene structure and protein motif composition of COMT proteins from selected Solanaceae species. ), where most motifs were conserved in all proteins, except for a clade where these motifs were not found. In the CCoAOMT proteins, PROSITE tool (2023PROSITE tool (2023) ScanProsite tool, PROSITE tool (2023) ScanProsite tool, http://prosite.expasy.org/scanprosite/ (accessed 24 February 2023).
http://prosite.expasy.org/scanprosite/...
) identified binding sites for SAM ligands in motifs 1, 2, and 4. Additionally, motif 2 exhibited binding sites for divalent metal cations. Similarly, the COMT proteins displayed a similar pattern, with motifs 1, 2, and 5 showing binding sites for SAM, and motif 1 exhibiting one active site as a proton acceptor.

To gain insights into duplication events, we examined the chromosomal location of CCoAOMT and COMT genes in I. cyaneum, C. annuum, N. attenuata, and S. lycopersicum (Figure 3), in which genome assemblies at the chromosomal level are available. However, even with highly advanced genome assembly techniques, some genomic regions may not be confidently assigned to a specific chromosome and instead are placed in scaffolds. As a result, we could not assign a chromosomal location for 23 genes from I. cyaneum, three genes from C. annuum, and 23 genes from N. attenuata. The CCoAOMT genes of C. annuum were distributed among four chromosomes, with chr2 containing the highest copy number. On the other hand, COMT genes were found more widely distributed throughout the genome, and chr3 seems to have undergone multiple duplications, with 10 COMT genes located near each other. In S. lycopersicum, CCoAOMT and COMT were both present in eight of the 12 chromosomes. Notably, chr2 has duplicated in the CCoAOMT and COMT genes. Inferences about the chromosomal location of I. cyaneum and N. attenuata should be made cautiously as 23 genes were not assembled at the chromosome level for both species. However, we observed that chr2 of I. cyaneum underwent tandem duplication for CCoAOMT and COMT genes.

Figure 3-
Chromosomal distribution of OMT genes in Capsicum annuum, Iochroma cyaneum, Nicotiana attenuata, and Solanum lycopersicum. Chromosomes are ordered by their corresponding number in ascending order from left to right. Scale bar = 50 megabases (Mb).

Discussion

The OMT gene family plays a crucial role in plants, as it is involved in the biosynthesis of lignin and other secondary metabolites important for many biological processes (Ibrahim et al., 1998Ibrahim RK, Bruneau A and Bantignies B (1998) Plant O-methyltransferases: Molecular analysis, common signature and classification. Plant Mol Biol 36:1-10.; Lam et al., 2007Lam KC, Ibrahim RK, Behdad B and Dayanandan S (2007) Structure, function, and evolution of plant O-methyltransferases. Genome 50:1001-1013.; Balasubramani et al., 2021Balasubramani S, Lv S, Chen Q, Zhou Z, Moorthy MDS, Sathish D and Moola AK (2021) A systematic review of the O-methyltransferase gene expression. Plant Gene 27:100295.). In this study, we investigated the evolution and diversification of this gene family in Solanaceae. Based on a preliminary phylogenetic analysis of OMT proteins for 23 Solanaceae species (Figure S2 Figure S2 - Maximum likelihood genealogy of CCoAOMT proteins from a comprehensive dataset including 23 Solanaceae species. and S3 Figure S3 - Maximum likelihood genealogy of CCoAOMT proteins from a comprehensive dataset including 23 Solanaceae species. ), we found that sequences from congeneric species overall clustered together, suggesting that the phylogenetic signal is generally constrained at the genus level. Recent studies analyzing the evolutive stories of other gene families in Solanaceae found a similar pattern (Pereira et al., 2022Pereira AG, Guzmán-Rodriguez S and Freitas LB (2022) Phylogenetic analyses of some key genes provide information on pollinator attraction in Solanaceae. Genes 13:2278.; Thompson et al., 2023Thompson CE, Brisolara-Corrêa L, Thompson HN, Stassen H and Freitas LB (2023) Evolutionary and structural aspects of Solanaceae RNases T2. Genet Mol Biol 46:e20220115.). Thus, we only kept one representative species of each genus to better visualize the evolutionary history of the OMT gene family in Solanaceae.

OMT gene count showed wide variation among Solanaceae species, with no clear trend of higher gene counts in a particular genus. Considering genera with multiple sampled species, Nicotiana displayed 36 to 58 genes, whereas Solanum exhibited 27 to 71 genes, indicating a notable fluctuation in gene count per plant species in the same genus. Additionally, the number of COMT genes exceeded that of CCoAOMT genes in all Solanaceae studied species, similar to what was found in other plant species (Barakat et al., 2011Barakat A, Choi A, Yassin NBM, Park JS, Sun Z and Carlson JE (2011) Comparative genomics and evolutionary analyses of the O-methyltransferase gene family in Populus. Gene 479:37-46.; Liu et al., 2016Liu X, Luo Y, Wu H, Xi W, Yu J, Zhang Q and Zhou Z (2016) Systematic analysis of O-methyltransferase gene family and identification of potential members involved in the formation of O-methylated flavonoids in Citrus. Gene 575:458-472.; Hafeez et al., 2021Hafeez A, Gě Q, Zhāng Q, Lǐ J, Gōng J, Liú R, Shí Y, Shāng H, Liú À, Iqbal MS et al. (2021) Multi-responses of O-methyltransferase genes to salt stress and fiber development of Gossypium species. BMC Plant Biol 21:37.; Lu et al., 2022Lu S, Zhuge Y, Hao T, Liu Z, Zhang M and Fang J (2022) Systematic analysis reveals O-methyltransferase gene family members involved in flavonoid biosynthesis in grape. Plant Physiol Biochem 173:33-45.). We did not observe any association between the number of genes and polyploidy, as exemplified by N. tabacum, an allotetraploid plant resulting from hybridization between N. sylvestris and N. tomentosiformis (Leitch et al., 2008Leitch IJ, Hanson L, Lim KY, Kovarik A, Chase MW, Clarkson JJ and Leitch AR (2008) The ups and downs of genome size evolution in polyploid species of Nicotiana (Solanaceae). Ann Bot 101:805-814.). Despite having a larger genome than its parental species (Edwards et al., 2017Edwards KD, Fernandez-Pozo N, Drake-Stowe K, Humphry M, Evans AD, Bombarely A, Allen F, Hurst R, White B, Kernodle SP et al. (2017) A reference genome for Nicotiana tabacum enables map-based cloning of homeologous loci implicated in nitrogen utilization efficiency. BMC Genomics 18:448.), the number of OMT genes in N. tabacum is higher but not equal to the sum of its parental species (Table S1 Table S1 - Gene count of raw and filtered datasets, as well as the reference for all 23 Solanaceae species. ). The higher copy number can be explained by the relaxation of purifying selection and the putative functional redundancy of these genes on duplicated genomes, which can lead to gene loss (Langham et al., 2004Langham RJ, Walsh J, Dunn M, Ko C, Goff SA and Freeling M (2004) Genomic duplication, fractionation and the origin of regulatory novelty. Genetics 166:935-945.; Cheng et al., 2018Cheng F, Wu J, Cai X, Liang J, Freeling M and Wang X (2018) Gene retention, fractionation and subgenome differences in polyploid plants. Nat Plants 4:258-268.).

Whereas the number of introns varied within both subfamilies, the genes’ structure exhibited a remarkable degree of motif conservation, except for one cluster in each subfamily that lacked these conserved motifs. This loss suggests that the proteins encoded by these clusters may have been selected to perform a different function or changed the original function via pseudogenization (Garewal et al., 2021Garewal N, Goyal N, Pathania S, Kaur J and Singh K (2021) Gauging the trends of pseudogenes in plants. Crit Rev Biotechnol 41:1114-1129.). Three conserved motifs of CCoAOMT exhibited affinity towards SAM and one specific motif contained a binding site dedicated to metal cations. The affinity to metal ions is characteristic of CCoAOMT, because their role as a cofactor in transferring the methyl group to substrates (e.g., Hugueney et al., 2009Hugueney P, Provenzano S, Verries C, Ferrandino A, Meudec E, Batelli G, Merdinoglu D, Cheynier V, Schubert S and Ageorges, A (2009) A novel cation-dependent O-methyltransferase involved in anthocyanin methylation in grapevine. Plant Physiol 150:2057-2070.; Xu et al., 2015Xu RX, Zhao Y, Gao S, Zhang YY, Li DD, Lou HX and Cheng AX (2015) Functional characterization of a plastidal cation-dependent O-methyltransferase from the liverwort Plagiochasma appendiculatum. Phytochemistry 118:33-41.). Similarly, three COMT motifs showed affinity toward SAM, and one motif was predicted as a proton acceptor. The proton acceptor plays a critical role in COMT-mediated methylation by deprotonating the target substrate and enhancing its reactivity toward the methyl group of SAM (Abdelraheem et al., 2022Abdelraheem E, Thair B, Varela RF, Jockmann E, Popadić D, Hailes HC, Ward JM, Iribarren AM, Lewkowicz ES, Andexer JN et al. (2022). Methyltransferases: Functions and applications. Chembiochem 23:e202200212.). The molecular weights of the proteins in each subfamily were broadly consistent with their expected ranges, which were reported as 26 to 30 kDa for CCoAOMT and 40 to 43 kDa for COMT (Kim et al., 2010Kim B-G, Sung SH, Chong Y, Lim Y and Ahn J-H (2010) Plant flavonoid O-methyltransferases: Substrate specificity and application. J Plant Biol 53:321-329.), with some exceptions to this general trend.

The diversification of OMT genes in plants can be traced back to the ancestor of all land plants in which an early duplication event occurred, giving rise to two major OMT groups characterized by their distinct substrate affinities (Lam et al., 2007Lam KC, Ibrahim RK, Behdad B and Dayanandan S (2007) Structure, function, and evolution of plant O-methyltransferases. Genome 50:1001-1013.; Barakat et al., 2011Barakat A, Choi A, Yassin NBM, Park JS, Sun Z and Carlson JE (2011) Comparative genomics and evolutionary analyses of the O-methyltransferase gene family in Populus. Gene 479:37-46.). Subsequent duplications have contributed to the expansion and diversification of OMTs in plants, involving three primary duplication mechanisms: whole genome duplication, segmental duplication, and tandem duplication.

Whole genome duplication with subsequent genome rearrangement is essential in gene family diversification (Clark and Donoghue, 2018Clark JW and Donoghue PCJ (2018) Whole-genome duplication and plant macroevolution. Trends Plant Sci 23:933-945.). In Solanaceae, two whole genome triplications might be involved in expanding the OMT gene family: the first is shared with all eudicots, and the second is more recent and occurred in the ancestor of Solanaceae (The Tomato Genome Consortium, 2012The Tomato Genome Consortium (2012) The tomato genome sequence provides insights into fleshy fruit evolution. Nature 485:635-641.; Huang et al., 2022Huang Y, Zhang L, Zhang K, Chen S, Hu J and Cheng F (2022) The impact of tandem duplication on gene evolution in Solanaceae species. J Integr Agric 21:1004-1014.). Besides whole genome duplication, segmental duplicantion - in which gene copies are created across the genome because regions of the genome are duplicated to the same or a different chromosome (Bailey et al., 2002Bailey JA, Gu Z, Clark RA, Reinert K, Samonte RV, Schwartz S, Adams MD, Myers EW, Li PW and Eichler EE (2002) Recent segmental duplications in the human genome. Science 297:1003-1007.; Panchy et al., 2016Panchy N, Lehti-Shiu M and Shiu S-H (2016) Evolution of gene duplication in plants. Plant Physiol 171:2294-2316.) - is another main mechanism of OMT gene family expansion, as seen here in Solanaceae, Citrus (Liu et al., 2016Liu X, Luo Y, Wu H, Xi W, Yu J, Zhang Q and Zhou Z (2016) Systematic analysis of O-methyltransferase gene family and identification of potential members involved in the formation of O-methylated flavonoids in Citrus. Gene 575:458-472.), and Vitis (Lu et al., 2022Lu S, Zhuge Y, Hao T, Liu Z, Zhang M and Fang J (2022) Systematic analysis reveals O-methyltransferase gene family members involved in flavonoid biosynthesis in grape. Plant Physiol Biochem 173:33-45.). Segmental duplications are usually associated with repetitive sequences and transposable elements (Panchy et al., 2016Panchy N, Lehti-Shiu M and Shiu S-H (2016) Evolution of gene duplication in plants. Plant Physiol 171:2294-2316.), but the underlying mechanisms of such processes remain poorly understood. Further research is needed to elucidate these mechanisms and their potential functional implications.

Another important mechanism involved in gene family expansion is tandem duplications - when the resulting copies are adjacent to each other on the same chromosome (Jander and Barth, 2007Jander G and Barth C (2007) Tandem gene arrays: A challenge for functional genomics. Trends Plant Sci 12:203-210.). Here, we observed that the tandem duplication magnitude varied among species (Figure 3), with N. attenuata showing the lowest level of tandem duplications. Still, we should look at this result carefully as many genes are not assigned to a specific chromosome and thus are not displayed in Figure 3. Three species showed a tandem-duplicated CCoAOMT region on chr2 and, due to the species phylogenetic relatedness and the chr2 synteny (The Tomato Genome Consortium, 2012The Tomato Genome Consortium (2012) The tomato genome sequence provides insights into fleshy fruit evolution. Nature 485:635-641.), such duplication could represent an ancestral event that has been maintained in these species. The number of tandem duplication events in Solanaceae seems to be ancient, not as abundant as in other genera [e.g., Populus (Barakat et al., 2011Barakat A, Choi A, Yassin NBM, Park JS, Sun Z and Carlson JE (2011) Comparative genomics and evolutionary analyses of the O-methyltransferase gene family in Populus. Gene 479:37-46.), Arabidopsis (Barakat et al., 2011Barakat A, Choi A, Yassin NBM, Park JS, Sun Z and Carlson JE (2011) Comparative genomics and evolutionary analyses of the O-methyltransferase gene family in Populus. Gene 479:37-46.), and Citrus (Liu et al., 2016Liu X, Luo Y, Wu H, Xi W, Yu J, Zhang Q and Zhou Z (2016) Systematic analysis of O-methyltransferase gene family and identification of potential members involved in the formation of O-methylated flavonoids in Citrus. Gene 575:458-472.)], and agrees with previous ideas that Solanaceae tend to have a lower number of tandem duplicated regions than species that have not gone through multiple and recent whole genome duplications (Huang et al., 2022Huang Y, Zhang L, Zhang K, Chen S, Hu J and Cheng F (2022) The impact of tandem duplication on gene evolution in Solanaceae species. J Integr Agric 21:1004-1014.). The observed expansion and diversification of the OMT gene family might be a combination of different processes at different time points that is a typical pattern in Solanaceae (The Tomato Genome Consortium, 2012The Tomato Genome Consortium (2012) The tomato genome sequence provides insights into fleshy fruit evolution. Nature 485:635-641.), as well as in distantly-related species (e.g., Hofberger et al., 2015Hofberger JA, Nsibo DL, Govers F, Bouwmeester K and Schranz ME (2015) A Complex interplay of tandem- and whole-genome duplication drives expansion of the L-type lectin receptor kinase gene family in the Brassicaceae. Genome Biol Evol 7:720-734.; Kuo et al., 2019Kuo Y-T, Chao Y-T, Chen W-C, Shih M-C and Chang S-B (2019) Segmental and tandem chromosome duplications led to divergent evolution of the chalcone synthase gene family in Phalaenopsis orchids. Ann Bot 123:69-77.; Zafar et al., 2022Zafar MM, Rehman A, Razzaq A, Parvaiz A, Mustafa G, Sharif F, Mo H, Youlu Y, Shakeel A and Ren M (2022) Genome-wide characterization and expression analysis of Erf gene family in cotton. BMC Plant Biol 22:134.).

Subfunctionalization and neofunctionalization play critical roles in the retention and diversification of genes that have undergone duplication, leading to functional divergence between copies (Pichersky and Gang, 2000Pichersky E and Gang DR (2000) Genetics and biochemistry of secondary metabolites in plants: An evolutionary perspective. Trends Plant Sci 5:439-445.; Han et al., 2007Han Y, Gasic K and Korban SS (2007) Multiple-copy cluster-type organization and evolution of genes encoding O-Methyltransferases in the apple. Genetics 176:2625-2635.). This evolutionary phenomenon can be illustrated with OMTs regarding anthocyanin biosynthesis, a well-studied metabolic pathway involving these genes. Anthocyanins are pigments that confer red and purple colors to flowers, fruits, and leaves. OMT enzymes catalyze the transfer of a methyl group to the hydroxyl group of anthocyanidins, resulting in the formation of methylated anthocyanins such as petunidin and peonidin, which are stable and water-soluble pigments (Ma et al., 2021Ma Y, Ma X, Gao X, Wu W and Zhou B (2021) Light induced regulation pathway of anthocyanin biosynthesis in plants. Int J Mol Sci 22:11116.). Our functional phylogenetic clustering showed that CCoAOMT group VI proteins are associated with anthocyanin biosynthesis, and their phylogenetic position implies neofunctionalization from an ancestral gene duplication event. Nicotiana attenuata and C. annuum are absent in group VI, indicating gene loss in those species. Moreover, the independent origin of floral volatile production in CCoAOMT groups II and III underscores the high plasticity of this gene family. Such plasticity illustrates how OMT genes may affect pollinator interaction and contribute to rapid speciation events.

There are more genes in the COMT than in the CCoAOMT subfamily. This larger gene set is expected to translate into a broader range of substrates that COMT proteins can methylate and, consequently, into a more diverse array of biological and molecular processes in which they could be involved, highlighting the versatility and adaptability of the COMT subfamily (Lam et al., 2007Lam KC, Ibrahim RK, Behdad B and Dayanandan S (2007) Structure, function, and evolution of plant O-methyltransferases. Genome 50:1001-1013.). Notably, some of these genes were identified as putatively involved in stress response mechanisms, including those triggered by cold (groups I, II, and III), wounding (groups I and V), and high light intensity (groups I, II, and III). In other plants, these genes have been reported to be involved in response to abiotic and biotic factors (e.g., Hafeez et al., 2021Hafeez A, Gě Q, Zhāng Q, Lǐ J, Gōng J, Liú R, Shí Y, Shāng H, Liú À, Iqbal MS et al. (2021) Multi-responses of O-methyltransferase genes to salt stress and fiber development of Gossypium species. BMC Plant Biol 21:37.; Zhao et al., 2022Zhao K, Yu K, Fu X, Zhao X, Xia N, Zhan Y, Zhao X and Han Y (2022) Genome-wide identification and expression profile analysis of the OMT gene family in response to cyst nematodes and multi-abiotic stresses in soybean. Crop Pasture Sci 73:1279-1290.). Whereas these findings do not definitively establish the function and substrate of these proteins, we emphasize that they offer valuable insights into the evolutionary mechanisms underlying the diverse functions observed in Solanaceae. Further functional verifications are necessary to confirm the precise role of each protein, although our results serve as an initial step toward unraveling the potential of COMT proteins in adaptation to stressful conditions in Solanaceae.

In our study, we thoroughly examined the evolution and diversification of OMT genes in Solanaceae. Our findings shed light on the intricate mechanisms that drive the evolutionary process of this gene family. The results also corroborate the significant impact of OMT genes on plant interaction with biotic and abiotic factors, making them ecologically significant and potential targets for genetic engineering to improve agronomic traits. Finally, further research is necessary to expand our knowledge about OMT functions and potential applications in agriculture and biotechnology.

Acknowledgements

This work was supported by the Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq), Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES), and Programa de Pós-Graduação em Genética e Biologia Molecular da Universidade Federal do Rio Grande do Sul (PPGBM-UFRGS).

References

  • Abdelraheem E, Thair B, Varela RF, Jockmann E, Popadić D, Hailes HC, Ward JM, Iribarren AM, Lewkowicz ES, Andexer JN et al (2022). Methyltransferases: Functions and applications. Chembiochem 23:e202200212.
  • Akita Y, Kitamura S, Hase Y, Narumi I, Ishizaka H, Kondo E, Kameari N, Nakayama M, Tanikawa N, Morita Y et al (2011) Isolation and characterization of the fragrant cyclamen O-methyltransferase involved in flower coloration. Plant 234:1127-1136.
  • Bailey JA, Gu Z, Clark RA, Reinert K, Samonte RV, Schwartz S, Adams MD, Myers EW, Li PW and Eichler EE (2002) Recent segmental duplications in the human genome. Science 297:1003-1007.
  • Bailey TL, Johnson J, Grant CE and Noble WS (2015) The MEME suite. Nucleic Acids Res 43:W39-W49.
  • Balasubramani S, Lv S, Chen Q, Zhou Z, Moorthy MDS, Sathish D and Moola AK (2021) A systematic review of the O-methyltransferase gene expression. Plant Gene 27:100295.
  • Barakat A, Choi A, Yassin NBM, Park JS, Sun Z and Carlson JE (2011) Comparative genomics and evolutionary analyses of the O-methyltransferase gene family in Populus Gene 479:37-46.
  • Bombarely A, Moser M, Amrad A, Bao M, Bapaume L, Barry CS, Bliek M, Boersma MR, Borghi L, Bruggmann R et al (2016) Insight into the evolution of the Solanaceae from the parental genomes of Petunia hybrida Nat Plants 2:16074.
  • Bonawitz ND and Chapple C (2010) The genetics of lignin biosynthesis: Connecting genotype to phenotype. Annu Rev Genet 44:337-363.
  • Byeon Y, Lee HY, Lee K and Back K (2014) Caffeic acid O-methyltransferase is involved in the synthesis of melatonin by methylating N-acetylserotonin in Arabidopsis J Pineal Res 57:219-227.
  • Capella-Gutiérrez S, Silla-Martínez JM and Gabaldón T (2009) trimAl: A tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics 25:1972-1973.
  • Chen C, Chen H, Zhang Y, Thomas HR, Frank MH, He Y and Xia R (2020) TBtools: An integrative toolkit developed for interactive analyses of big biological data. Mol Plant 13:1194-1202.
  • Cheng F, Wu J, Cai X, Liang J, Freeling M and Wang X (2018) Gene retention, fractionation and subgenome differences in polyploid plants. Nat Plants 4:258-268.
  • Clark JW and Donoghue PCJ (2018) Whole-genome duplication and plant macroevolution. Trends Plant Sci 23:933-945.
  • Davin LB and Lewis NG (1992) Phenylpropanoid metabolism: Biosynthesis of monolignols, lignans and neolignans, lignins and suberins. In: Stafford HA and Ibrahim RK (eds) Phenolic metabolism in plants. Springer, Boston, pp 325-375.
  • Edwards KD, Fernandez-Pozo N, Drake-Stowe K, Humphry M, Evans AD, Bombarely A, Allen F, Hurst R, White B, Kernodle SP et al (2017) A reference genome for Nicotiana tabacum enables map-based cloning of homeologous loci implicated in nitrogen utilization efficiency. BMC Genomics 18:448.
  • Fernandez-Pozo N, Menda N, Edwards JD, Saha S, Tecle IY, Strickler SR, Bombarely A, Fisher-York T, Pujar A, Foerster H et al (2015) The Sol Genomics Network (SGN)-from genotype to phenotype to breeding. Nucleic Acids Res 43:D1036-D1041.
  • Garewal N, Goyal N, Pathania S, Kaur J and Singh K (2021) Gauging the trends of pseudogenes in plants. Crit Rev Biotechnol 41:1114-1129.
  • Gebhardt C (2016) The historical role of species from the Solanaceae plant family in genetic research. Theor Appl Genet 129:2281-2294.
  • Guo A-Y, Zhu Q-H, Chen X and Luo J-C (2007) GSDS: A gene structure display server. Hereditas 29:1023.
  • Hafeez A, Gě Q, Zhāng Q, Lǐ J, Gōng J, Liú R, Shí Y, Shāng H, Liú À, Iqbal MS et al (2021) Multi-responses of O-methyltransferase genes to salt stress and fiber development of Gossypium species. BMC Plant Biol 21:37.
  • Han Y, Gasic K and Korban SS (2007) Multiple-copy cluster-type organization and evolution of genes encoding O-Methyltransferases in the apple. Genetics 176:2625-2635.
  • Hoang DT, Chernomor O, von Haeseler A, Minh BQ and Vinh LS (2018) UFBoot2: Improving the ultrafast bootstrap approximation. Mol Biol Evol 35:518-522.
  • Hofberger JA, Nsibo DL, Govers F, Bouwmeester K and Schranz ME (2015) A Complex interplay of tandem- and whole-genome duplication drives expansion of the L-type lectin receptor kinase gene family in the Brassicaceae. Genome Biol Evol 7:720-734.
  • Hu B, Jin J, Guo A-Y, Zhang H, Luo J and Gao G (2015) GSDS 2.0: An upgraded gene feature visualization server. Bioinformatics 31:1296-1297.
  • Huang Y, Zhang L, Zhang K, Chen S, Hu J and Cheng F (2022) The impact of tandem duplication on gene evolution in Solanaceae species. J Integr Agric 21:1004-1014.
  • Hugueney P, Provenzano S, Verries C, Ferrandino A, Meudec E, Batelli G, Merdinoglu D, Cheynier V, Schubert S and Ageorges, A (2009) A novel cation-dependent O-methyltransferase involved in anthocyanin methylation in grapevine. Plant Physiol 150:2057-2070.
  • Ibrahim RK, Bruneau A and Bantignies B (1998) Plant O-methyltransferases: Molecular analysis, common signature and classification. Plant Mol Biol 36:1-10.
  • Jander G and Barth C (2007) Tandem gene arrays: A challenge for functional genomics. Trends Plant Sci 12:203-210.
  • Joshi CP and Chiang VL (1998) Conserved sequence motifs in plant S-adenosyl-l-methionine-dependent methyltransferases. Plant Mol Biol 37:663-674.
  • Kalyaanamoorthy S, Minh BQ, Wong TKF, von Haeseler A and Jermiin LS (2017) ModelFinder: Fast model selection for accurate phylogenetic estimates. Nat Methods 14:587-589.
  • Katoh K, Rozewicki J and Yamada KD (2019) MAFFT online service: Multiple sequence alignment, interactive sequence choice and visualization. Brief Bioinform 20:1160-1166.
  • Kim B-G, Sung SH, Chong Y, Lim Y and Ahn J-H (2010) Plant flavonoid O-methyltransferases: Substrate specificity and application. J Plant Biol 53:321-329.
  • Kim S, Park M, Yeom S-I, Kim Y-M, Lee JM, Lee H-A, Seo E, Choi J, Cheong K, Kim K-T et al (2014) Genome sequence of the hot pepper provides insights into the evolution of pungency in Capsicum species. Nat Genet 46:270-278.
  • Kuo Y-T, Chao Y-T, Chen W-C, Shih M-C and Chang S-B (2019) Segmental and tandem chromosome duplications led to divergent evolution of the chalcone synthase gene family in Phalaenopsis orchids. Ann Bot 123:69-77.
  • Lam KC, Ibrahim RK, Behdad B and Dayanandan S (2007) Structure, function, and evolution of plant O-methyltransferases. Genome 50:1001-1013.
  • Langham RJ, Walsh J, Dunn M, Ko C, Goff SA and Freeling M (2004) Genomic duplication, fractionation and the origin of regulatory novelty. Genetics 166:935-945.
  • Leitch IJ, Hanson L, Lim KY, Kovarik A, Chase MW, Clarkson JJ and Leitch AR (2008) The ups and downs of genome size evolution in polyploid species of Nicotiana (Solanaceae). Ann Bot 101:805-814.
  • Li HM, Rotter D, Hartman TG, Pak FE, Havkin-Frenkel D and Belanger FC (2006) Evolution of novel O-methyltransferases from the Vanilla planifolia caffeic acid O-methyltransferase. Plant Mol Biol 61:537-552.
  • Liu X, Luo Y, Wu H, Xi W, Yu J, Zhang Q and Zhou Z (2016) Systematic analysis of O-methyltransferase gene family and identification of potential members involved in the formation of O-methylated flavonoids in Citrus Gene 575:458-472.
  • Lu S, Zhuge Y, Hao T, Liu Z, Zhang M and Fang J (2022) Systematic analysis reveals O-methyltransferase gene family members involved in flavonoid biosynthesis in grape. Plant Physiol Biochem 173:33-45.
  • Ma Y, Ma X, Gao X, Wu W and Zhou B (2021) Light induced regulation pathway of anthocyanin biosynthesis in plants. Int J Mol Sci 22:11116.
  • Panchy N, Lehti-Shiu M and Shiu S-H (2016) Evolution of gene duplication in plants. Plant Physiol 171:2294-2316.
  • Pereira AG, Guzmán-Rodriguez S and Freitas LB (2022) Phylogenetic analyses of some key genes provide information on pollinator attraction in Solanaceae. Genes 13:2278.
  • Pichersky E and Gang DR (2000) Genetics and biochemistry of secondary metabolites in plants: An evolutionary perspective. Trends Plant Sci 5:439-445.
  • Powell AF, Zhang J, Hauser D, Vilela JA, Hu A, Gates DJ, Mueller LA, Li F, Strickler SR and Smith SD (2022) Genome sequence for the blue‐flowered Andean shrub Iochroma cyaneum reveals extensive discordance across the berry clade of Solanaceae. Plant Genome 15:e20223.
  • Rajewski A, Carter-House D, Stajich J and Litt A (2021) Datura genome reveals duplications of psychoactive alkaloid biosynthetic genes and high mutation rate following tissue culture. BMC Genomics 22:201.
  • Särkinen T, Bohs L, Olmstead RG and Knapp S (2013) A phylogenetic framework for evolutionary study of the nightshades (Solanaceae): A dated 1000-tip tree. BMC Evol Biol 13:214.
  • Struck A-W, Thompson ML, Wong LS and Micklefield J (2012) S-adenosyl-methionine-dependent methyltransferases: Highly versatile enzymes in biocatalysis, biosynthesis and other biotechnological applications. Chem Biochem 13:2642-2655.
  • Tamura K, Stecher G and Kumar S (2021) MEGA11: Molecular Evolutionary Genetics Analysis version 11. Mol Biol Evol 38:3022-3027.
  • The French-Italian Public Consortium for Grapevine Genome Characterization (2007) The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla. Nature 449:463-467.
  • The Tomato Genome Consortium (2012) The tomato genome sequence provides insights into fleshy fruit evolution. Nature 485:635-641.
  • Thompson CE, Brisolara-Corrêa L, Thompson HN, Stassen H and Freitas LB (2023) Evolutionary and structural aspects of Solanaceae RNases T2. Genet Mol Biol 46:e20220115.
  • Trifinopoulos J, Nguyen L-T, von Haeseler A and Minh BQ (2016) W-IQ-TREE: A fast online phylogenetic tool for maximum likelihood analysis. Nucleic Acids Res 44:W232-W235.
  • Tuskan GA, DiFazio S, Jansson S, Bohlmann J, Grigoriev I, Hellsten U, Putnam N, Ralph S, Rombauts S, Salamov A et al (2006) The genome of black cottonwood, Populus trichocarpa (Torr. & Gray). Science 313:1596-1604.
  • Wu S, Watanabe N, Mita S, Ueda Y, Shibuya M and Ebizuka Y (2003) Two O-methyltransferases isolated from flower petals of Rosa chinensis var. spontanea involved in scent biosynthesis. J Biosci Bioeng 96:119-128.
  • Xu Q, Chen L-L, Ruan X, Chen D, Zhu A, Chen C, Bertrand D, Jiao W-B, Hao B-H, Lyon MP et al (2013) The draft genome of sweet orange (Citrus sinensis). Nat Genet 45:59-66.
  • Xu RX, Zhao Y, Gao S, Zhang YY, Li DD, Lou HX and Cheng AX (2015) Functional characterization of a plastidal cation-dependent O-methyltransferase from the liverwort Plagiochasma appendiculatum Phytochemistry 118:33-41.
  • Xu S, Brockmöller T, Navarro-Quezada A, Kuhl H, Gase K, Ling Z, Zhou W, Kreitzer C, Stanke M, Tang H et al (2017) Wild tobacco genomes reveal the evolution of nicotine biosynthesis. Proc Natl Acad Sci 114:6133-6138.
  • Zafar MM, Rehman A, Razzaq A, Parvaiz A, Mustafa G, Sharif F, Mo H, Youlu Y, Shakeel A and Ren M (2022) Genome-wide characterization and expression analysis of Erf gene family in cotton. BMC Plant Biol 22:134.
  • Zhao K, Yu K, Fu X, Zhao X, Xia N, Zhan Y, Zhao X and Han Y (2022) Genome-wide identification and expression profile analysis of the OMT gene family in response to cyst nematodes and multi-abiotic stresses in soybean. Crop Pasture Sci 73:1279-1290.

Internet Resources

  • HMMER (2023) HMMER: biosequence analysis using profile hidden Markov models, HMMER (2023) HMMER: biosequence analysis using profile hidden Markov models, http://hmmer.org/ (accessed 24 February 2023).
    » http://hmmer.org/
  • MEME Suite (2023) Multiple Em for Motif Elicitation, MEME Suite (2023) Multiple Em for Motif Elicitation, https://meme-suite.org/meme/tools/meme (accessed 30 March 2023).
    » https://meme-suite.org/meme/tools/meme
  • NCBI (2023) National Center for Biotechnology Information, NCBI (2023) National Center for Biotechnology Information, https://www.ncbi.nlm.nih.gov/ (accessed 23 February 2023).
    » https://www.ncbi.nlm.nih.gov/
  • Pfam (2023) Pfam (2023) http://pfam.xfam.org (accessed 24 February 2023).
    » http://pfam.xfam.org
  • PROSITE tool (2023) ScanProsite tool, PROSITE tool (2023) ScanProsite tool, http://prosite.expasy.org/scanprosite/ (accessed 24 February 2023).
    » http://prosite.expasy.org/scanprosite/
  • ProtParam tool of Expasy (2023) ProtParam tool of Expasy (2023) https://web.expasy.org/protparam/ (accessed 3 April 2023).
    » https://web.expasy.org/protparam/
  • Sol Genomics Network (2023) Welcome to the Solanaceae Genomics Network, Sol Genomics Network (2023) Welcome to the Solanaceae Genomics Network, https://solgenomics.net (accessed 23 February 2023).
    » https://solgenomics.net
  • The Arabidopsis Information Resource (2023) The Arabidopsis Information Resource (2023) https://www.arabidopsis.org/ (accessed 23 February 2023).
    » https://www.arabidopsis.org/
  • UniProt (2023) Find your protein, UniProt (2023) Find your protein, https://www.uniprot.org/ (accessed 1 March 2023).
    » https://www.uniprot.org/
  • *
    These authors have contributed equally to this work.
  • Author Contributions

    PHP, LTG, and LBF conceptualized the study; PHP and LTG analyzed the data; PHP and LTG wrote the original draft; PHP, LTG, MD, and LBF reviewed and edited the manuscript; LBF and MD supervised the project. All authors have commented and approved the final manuscript.

Supplementary material

The following online material is available for this article:

Table S1 - Gene count of raw and filtered datasets, as well as the reference for all 23 Solanaceae species.

Table S2 - Functionally characterized OMT proteins downloaded from UniProt.

Table S3 - Structural and molecular characteristics of OMT proteins of six Solanaceae representative species.

Table S4 - Putative substrate, biological and molecular processes associated with the phylogenetic groups of the OMT family of Solanaceae species.

Figure S1 - Maximum likelihood genealogy of the complete dataset shows the reciprocal monophyly of both CCoAOMT and COMT subfamilies.

Figure S2 - Maximum likelihood genealogy of CCoAOMT proteins from a comprehensive dataset including 23 Solanaceae species.

Figure S3 - Maximum likelihood genealogy of CCoAOMT proteins from a comprehensive dataset including 23 Solanaceae species.

Figure S4 - Maximum likelihood genealogy of CCoAOMT proteins from the function dataset, including the six Solanaceae species and the 35 functionally characterized proteins from UniProt.

Figure S5 - Maximum likelihood genealogy of COMT proteins from the function dataset, including the six Solanaceae species and the 75 functionally characterized proteins from UniProt.

Figure S6 - Gene structure and protein motif composition of CCoAOMT proteins from selected Solanaceae species. Introns were rescaled to the same length.

Figure S7 - Gene structure and protein motif composition of COMT proteins from selected Solanaceae species.

Edited by

Associate Editor:

Lavínia Schüler-Faccini

Publication Dates

  • Publication in this collection
    10 Nov 2023
  • Date of issue
    2023

History

  • Received
    25 Apr 2023
  • Accepted
    30 Aug 2023
Sociedade Brasileira de Genética Rua Cap. Adelmio Norberto da Silva, 736, 14025-670 Ribeirão Preto SP Brazil, Tel.: (55 16) 3911-4130 / Fax.: (55 16) 3621-3552 - Ribeirão Preto - SP - Brazil
E-mail: editor@gmb.org.br