Acessibilidade / Reportar erro

Quantifying individual variation in reaction norms using random regression models fitted through Legendre polynomials: application in eucalyptus breeding

ABSTRACT

An accurate, efficient and informative statistical method for analyses of genotype × environment (G × E) interactions is a key requirement for progress in any breeding program. Thus, the objective of this study was to quantify individual variation in reaction norms using random regression models fitted through Legendre polynomials in eucalyptus (Eucalyptus spp.) breeding. To this end, a data set with 215 eucalyptus clones of different species and hybrids evaluated in four environments for diameter at breast height (DBH) and Pilodyn penetration (PP) was used. Variance components were estimated by restricted maximum likelihood, and genetic values were predicted by best linear unbiased prediction. The best-fitted model for DBH and PP was indicated by the Akaike information criterion, and the significance of the genotype effects was tested using the likelihood ratio test. Genetic variability between eucalyptus clones and very high accuracies (rg^g  0.90) were detected for both traits. Reaction norms and eigenfunctions generated genetic insights into G × E interactions. This is the first study that quantified individual variation in reaction norms using random regression models fitted through Legendre polynomials in eucalyptus breeding and demonstrated the great potential of this technique.

Key words
mixed model methodology; covariance functions; genotype × environment interactions; genotypic plasticity; forest tree breeding

INTRODUCTION

Eucalyptus (Eucalyptus spp.) is widely cultivated in tropical and subtropical regions. Its cultivation is mainly intended to produce pulp, bio-oil, firewood, and charcoal. The international pulp trade and the intense search for alternative energy sources have increasingly motivated the establishment of eucalyptus plantations in several countries worldwide (Fonseca et al. 2010Fonseca, S. M., Resende, M. D. V., Alfenas, A. C., Guimarães, L. M. S., Assis, T. F., and Grattapaglia, D. (2010). Manual Prático de Melhoramento Genético do Eucalipto. Viçosa: UFV.). In this sense, eucalyptus breeding programs have sought to identify more efficient selection techniques to increase yield and quality of traits of industrial interest.

The genotype × environment (G × E) interactions are characterized by the differential behavior of genotypes in relation to environmental variations (Resende 2015Resende, M. D. V. (2015). Genética quantitativa e de populações. Visconde do Rio Branco: Suprema.). These interactions can be expressed in various ways and with different intensities and can generate significant obstacles for genetic selection (Li et al. 2017Li, Y., Suontoma, M., Burdon, R. D., and Dungey, H. S. (2017). Genotype by environment interactions in forest tree breeding: review of methodology and perspectives on research and application. Tree Genetics & Genomes, 13, 60. https://doi.org/10.1007/s11295-017-1144-x
https://doi.org/10.1007/s11295-017-1144-...
; van Eeuwijk et al. 2016van Eeuwijk, F. A., Bustos-Korts, D. V., and Malosetti, M. (2016). What should students in plant breeding know about the statistical aspects of genotype × environment interactions? Crop Science, 56, 2119-2140. https://doi.org/10.2135/cropsci2015.06.0375
https://doi.org/10.2135/cropsci2015.06.0...
), including eucalyptus (Nunes et al. 2017Nunes, A. C. P., Resende, M. D. V., Santos, G. A., and Alves, R. S. (2017). Evaluation of different selection indices combining Pilodyn penetration and growth performance in Eucalyptus clones. Crop Breeding and Applied Biotechnology, 17, 206-213. https://doi.org/10.1590/1984-70332017v17n3a32
https://doi.org/10.1590/1984-70332017v17...
; Resende et al. 2017Resende, R. T., Resende, M. D. V., Silva, F. F., and Takahashi, E. K. (2017). Acurácia preditiva de testes clonais de Eucalyptus spp. utilizando efeitos aditivos do parentesco e validação cruzada. Scientia Forestalis, 45, 39-47. https://doi.org/10.18671/scifor.v45n113.04
https://doi.org/10.18671/scifor.v45n113....
; 2018Resende, R. T., Soares, A. A. V., Forrester, D. I., Marcatti, G. E., Santos, A. R., Takahashi, E. K., Silva, F. F., Grattapaglia, D., Resende, M. D. V., and Leite, H. G. (2018). Environmental uniformity, site quality and tree competition interact to determine stand productivity of clonal Eucalyptus. Forest Ecology and Management, 410, 76-83. https://doi.org/10.1016/j.foreco.2017.12.038
https://doi.org/10.1016/j.foreco.2017.12...
). Thus, the use of accurate, efficient and informative statistical methods that capture the information present in this source of variation and advantageously exploit its effects is fundamental in any breeding program.

Recently, mixed model methodology has become very popular for analyses of G × E interactions (Li et al. 2017Li, Y., Suontoma, M., Burdon, R. D., and Dungey, H. S. (2017). Genotype by environment interactions in forest tree breeding: review of methodology and perspectives on research and application. Tree Genetics & Genomes, 13, 60. https://doi.org/10.1007/s11295-017-1144-x
https://doi.org/10.1007/s11295-017-1144-...
; van Eeuwijk et al. 2016van Eeuwijk, F. A., Bustos-Korts, D. V., and Malosetti, M. (2016). What should students in plant breeding know about the statistical aspects of genotype × environment interactions? Crop Science, 56, 2119-2140. https://doi.org/10.2135/cropsci2015.06.0375
https://doi.org/10.2135/cropsci2015.06.0...
). There are numerous reasons for their use, including the fact that they allow estimation of variance components and prediction of genetic values simultaneously, deal with unbalanced data, describe heterogeneity of genetic covariances and residual variances across environments, and modeling spatial trends (Resende 2016Resende, M. D. V. (2016). Software Selegen-REML/BLUP: A useful tool for plant breeding. Crop Breeding and Applied Biotechnology, 16, 330-339. https://doi.org/10.1590/1984-70332016v16n4a49
https://doi.org/10.1590/1984-70332016v16...
; van Eeuwijk et al. 2016van Eeuwijk, F. A., Bustos-Korts, D. V., and Malosetti, M. (2016). What should students in plant breeding know about the statistical aspects of genotype × environment interactions? Crop Science, 56, 2119-2140. https://doi.org/10.2135/cropsci2015.06.0375
https://doi.org/10.2135/cropsci2015.06.0...
).

Within the context of mixed model methodology, individual variation in reaction norms can be quantified using random regression models fitted through Legendre polynomials (Resende et al. 2014Resende, M. D. V., Silva, F. F., and Azevedo, C. F. (2014). Estatística matemática, biométrica e computacional. Viçosa: UFV.). Random regression models involve a parsimonious covariance structure that provides predicted genetic values in specific environments or as a trend over environments (Alves et al. 2020Alves, R. S., Resende, M. D. V., Azevedo, C. F., Silva, F. F., Rocha, J. R. A. S. C., Nunes, A. C. P., Carneiro, A. P. S., and Santos, G. A. (2020). Optimization of Eucalyptus breeding through random regression models allowing for reaction norms in response to environmental gradients. Tree Genetics & Genomes, 16, 38. https://doi.org/10.1007/s11295-020-01431-5
https://doi.org/10.1007/s11295-020-01431...
); and Legendre polynomials have been considered to have better convergence properties as the regression are orthogonal (Mrode 2014Mrode, R. A. (2014). Linear models for the prediction of animal breeding values. Wallingford: CAB International.) and are the easiest to apply (Schaeffer 2016Schaeffer, L. R. (2016). Random regression models. Guelph: University of Guelph.).

Random regression models are widely used in longitudinal data analyses (Mrode 2014Mrode, R. A. (2014). Linear models for the prediction of animal breeding values. Wallingford: CAB International.; Resende et al. 2014Resende, M. D. V., Silva, F. F., and Azevedo, C. F. (2014). Estatística matemática, biométrica e computacional. Viçosa: UFV.). In perennials breeding, random regression models have been used to model growth over time (Resende et al. 2001Resende, M. D. V., Rezende, G. D. S. P., and Fernandes, J. S. C. (2001). Regressão aleatória e funções de covariância na análise de medidas repetidas. Revista de Matemática e Estatística, 19, 21-40.; Rocha et al. 2018Rocha, J. R. A. S. C., Marçal, T. S., Salvador, F. V., Silva, A. C., Machado, J. C., and Carneiro, P. C. S. (2018). Genetic insights into elephantgrass persistence for bioenergy purpose. Plos One, 13, e0203818. https://doi.org/10.1371/journal.pone.0203818
https://doi.org/10.1371/journal.pone.020...
). Besides that, reaction norms have been commonly used for genomic and phenotypic selection in multi-environment trials (Jarquín et al. 2014Jarquín, D., Crossa, J., Lacaze, X., Du Cheyron, P., Daucourt, J., Lorgeou, J., Piraux, F., Guerreiro, L., Pérez, P., Calus, M., Burgueño, J., and Campos, G. (2014). A reaction norm model for genomic selection using high-dimensional genomic and environmental data. Theoretical and Applied Genetics, 127, 595-607. https://doi.org/10.1007/s00122-013-2243-1
https://doi.org/10.1007/s00122-013-2243-...
; Morais Júnior et al. 2018Morais Júnior, O. P., Duarte, J. B., Breseghello, F., Coelho, A. S., Morais, O. P., and Magalhães Júnior, A. M. (2018). Single-step reaction norm models for genomic prediction in multienvironment recurrent selection trials. Crop Science, 58, 592-607. https://doi.org/10.2135/cropsci2017.06.0366
https://doi.org/10.2135/cropsci2017.06.0...
; Alves et al. 2020Alves, R. S., Resende, M. D. V., Azevedo, C. F., Silva, F. F., Rocha, J. R. A. S. C., Nunes, A. C. P., Carneiro, A. P. S., and Santos, G. A. (2020). Optimization of Eucalyptus breeding through random regression models allowing for reaction norms in response to environmental gradients. Tree Genetics & Genomes, 16, 38. https://doi.org/10.1007/s11295-020-01431-5
https://doi.org/10.1007/s11295-020-01431...
).

However, no study considering random regression models fitted through Legendre polynomials has been described in forest tree breeding for analyses of G × E interactions. Thus, the objective of this study was to quantify individual variation in reaction norms using random regression models fitted through Legendre polynomials in eucalyptus breeding.

MATERIAL AND METHODS

Experimental data

The data used in this work come from evaluation of a clonal field test of different eucalyptus species (E. grandis, E. urophylla, E. saligna, and E. pellita) and hybrids [E. grandis × E. urophylla, E. urophylla × E. maidenii, E. grandis × E. saligna, E. urophylla × E. saligna, E. urophylla × E. globulus, E. pellita × E. grandis, E. grandis × E. maidenii, E. grandis × E. dunnii, E. grandis × E. globulus, E. globulus × E. tereticornis, E. urophylla × E. deanei, E. urophylla × E. tereticornis, E. urophylla × (E. grandis × E. urophylla), E. globulus × (E. grandis × E. urophylla), E. grandis × (E. grandis × E. urophylla), E. urophylla × (E. camaldulensis × E. grandis), E. saligna × (E. grandis × E. urophylla), E. robusta × (E. grandis × E. urophylla), E. grandis × (E. dunnii × E. grandis), E. maidenii × (E. grandis × E. urophylla), E. saligna × (E. urophylla × E. grandis), E. urophylla × (E. grandis × E. globulus), E. urophylla × (E. tereticornis × E. saligna), E. urophylla × (E. urophylla × E. grandis), (E. grandis × E. kirtoniana) × (E. robusta × E. tereticornis), and (E. grandis × E. urophylla) × (E. urophylla × E. globulus)], implemented in September 2007, in four experimental areas of the CMPC Company, which are located in the state of Rio Grande do Sul, Brazil. The geographic location and annual climatic conditions of each environment are presented in Table 1. In each environment, a field trial in a randomized block design was established, with 215 clones in single tree plots and 30 replications. Trees were planted at a spacing of 3.5 m between rows and 2.6 m between plants.

Table 1
Geographic location (GL) and annual climatic conditions (ACC) of each environment (E).

This work used data from the assessment of all surviving trees at three-years of age in the field tests (22,295 trees in total) for diameter at breast height (DBH) (cm) and Pilodyn penetration (PP) (mm). The DBH was measured using a diameter tape and the PP using a Pilodyn. According to Greaves et al. (1996)Greaves, B. L., Borralho, N. M. G., Raymond, C. A., and Farrington, A. (1996). Use of a Pilodyn for the indirect selection of basic density in Eucalyptus nitens. Canadian Journal of Forest Research, 26, 1643-1650. https://doi.org/10.1139/x26-185
https://doi.org/10.1139/x26-185...
, PP, which is an indirect method to determine the basic density of wood, has been effective to evaluate eucalyptus.

Statistical analyses

In order to use Legendre polynomials, phenotypic mean of each environment (μi) must be scaled to range from -1 to +1. The environmental gradient values (Ei) were obtained as follows (Eq. 1) (Schaeffer 2016Schaeffer, L. R. (2016). Random regression models. Guelph: University of Guelph.):

E i = - 1 + 2 μ i - μ m i n / μ m a x - μ m i n (1)

Variance components were estimated by restricted maximum likelihood (REML) (Patterson and Thompson 1971Patterson, H. D., and Thompson, R. (1971). Recovery of inter-block information when block sizes are unequal. Biometrika, 58, 545-554. https://doi.org/10.2307/2334389
https://doi.org/10.2307/2334389...
) and genetic values were predicted by best linear unbiased prediction (BLUP) (Henderson 1975Henderson, C. R. (1975). Best linear unbiased estimation and prediction under a selection model. Biometrics, 31, 423-447. https://doi.org/10.2307/2529430
https://doi.org/10.2307/2529430...
). Random regression models were fitted through Legendre polynomials for DBH and PP as follows (Eq. 2):

Y i j k = μ + S j + R / S j k + Σ d = 0 D   α i d Φ i j d + e i j k (2)

where Yijk is the ith genotype (i = 1, 2, …, 215) in the jth environment (j = 1, 2, 3, 4) in the kth replication (k = 1, 2, …, 30); μ is the overall mean; Sj is the fixed effect of environment j; R/Sjk is the fixed effect of replication k nested in environment j; d is the polynomial degree, ranging from 0 to D (D = number of environments - 1); αid is the random regression coefficient for the Legendre polynomial for the genotype effect; Φijd is the dth Legendre polynomial for the jth environment for the ith genotype; and eijk is the residual random effect associated with Yijk.

In the matrix notation, the above model is described as follows (Eq. 3):

Y = X β + Z α + e (3)

where y is the vector of phenotypic data; β is the vector of the replication-environment combination that comprises the fixed effects of environment and replication within the environment, added to the overall mean; α is the vector of genotype effects (random); and e is the vector of residuals (random). Uppercase letters represent the incidence matrices for these effects. In addition, α~N(0,KgI215) and e~N(0,R), where I215 is an identity matrix of order 215, ⊗ denotes the Kronecker product, Kg is the covariance matrix for the coefficients of genetic effects, and R represents the matrix of residual variances.

The polynomial order in random regression models were selected using the Akaike Information Criterion (AIC) (Akaike 1974Akaike, H. (1974). A new look at the statistical model identification. IEEE Transactions on Automatic Control, 19, 716-723. https://doi.org/10.1109/TAC.1974.1100705
https://doi.org/10.1109/TAC.1974.1100705...
) as follows (Eq. 4):

A I C = - 2 L o g L + 2 p (4)

where LogL is the logarithm of the maximum (L) of the restricted likelihood function, and p is the number of estimated parameters. Besides that, different residual variance structures (homogeneous and heterogeneous) were tested.

The significance of the genotype effects was tested using the likelihood ratio test (LRT) (Wilks 1938Wilks, S. S. (1938). The large-sample distribution of the likelihood ratio for testing composite hypothesis. Annals of Mathematical Statistics, 9, 60-62. https://doi.org/10.1214/aoms/1177732360
https://doi.org/10.1214/aoms/1177732360...
) as follows (Eq. 5):

L R T = - 2 L o g L - L o g L R (5)

where LogLR is the logarithm of the maximum (LR) of the restricted likelihood function of the reduced model (without genotype effects).

Estimates of genetic variance (σg2) and predicted genetic values (g~ij), in the original scale, were obtained by Eqs. 6 and 7 (Kirkpatrick et al. 1990Kirkpatrick, M., Lofsvold, D., and Bulmer, M. (1990). Analyses of the Inheritance, Selection and Evolution of Growth Trajectories. Genetics, 124, 979-993.):

σ g 2 = Φ i j d 1 K ^ g Φ i j d ' (6)
g ~ i j = Σ d = 0 D   α i d Φ i j d (7)

Phenotypic variance (σp2), broad-sense individual heritability (hg2), and accuracy (rg^g) were estimated by Eqs. 810 (Resende et al. 2014Resende, M. D. V., Silva, F. F., and Azevedo, C. F. (2014). Estatística matemática, biométrica e computacional. Viçosa: UFV.):

σ ^ p 2 = σ ^ g 2 + σ ^ e 2 (8)
h g 2 = σ ^ g 2 / σ ^ p 2   and (9)
r g ^ g = 1 - Φ i j d   P E V   Φ i j d ' σ ^ g 2 (10)

where σ^e2 is the residual variance and PEV is the prediction error variance, obtained by the diagonal elements of the inverse of the coefficient matrix (information matrix) of the mixed model equations.

The eigenfunctions (ψf) of the genetic coefficient covariance matrix, aiming to study the G × E interactions, were obtained by Eq. 11 (Kirkpatrick et al. 1990Kirkpatrick, M., Lofsvold, D., and Bulmer, M. (1990). Analyses of the Inheritance, Selection and Evolution of Growth Trajectories. Genetics, 124, 979-993.):

ψ f = Σ d = 0 D   c ψ f d   Φ d (11)

where (f)d is the dth element of the fth eigenvector of K^g, and Φd is the normalized value of the dth Legendre polynomial.

The areas under the reaction norms (A), aiming to rank the clones, were obtained by Eq. 12:

A = μ + - 1 1 Σ d = 0 D   α ^ d Φ d x d   d x (12)

where xd is the environmental gradient.

Statistical analyses were performed using the ASReml 4.1 (Gilmour et al. 2015Gilmour, A. R., Gogel, B. J., Cullis, B. R., Welham, S. J., and Thompson, R. (2015). ASReml User Guide. Hemel Hempstead: VSN International. [Accessed Jan. 20, 2020]. Available at: https://www.hpc.iastate.edu/sites/default/files/uploads/ASREML/UserGuideStructural.pdf
https://www.hpc.iastate.edu/sites/defaul...
) and R (R Core Team 2018R Core Team. (2018). R: A language and environment for statistical computing. Vienna: R Foundation for Statistical Computing. https://www.r-project.org/
https://www.r-project.org/...
) software. The ASReml program files are available in supplementary material.

RESULTS

Selection of models

According to the AIC (Akaike 1974Akaike, H. (1974). A new look at the statistical model identification. IEEE Transactions on Automatic Control, 19, 716-723. https://doi.org/10.1109/TAC.1974.1100705
https://doi.org/10.1109/TAC.1974.1100705...
), the best model for DBH and PP is denoted by Leg.4.Rhe (Legendre polynomial of the three degree for genotype effects, with a heterogeneous residual variance structure) (Table 2), since the lower AIC value reflect a better overall fit. Thus, this model was adopted to estimate the variance components and to predict the genetic values along the environmental gradients. According to the LRT, genetic variability (p < 0.01) was detected for DBH and PP (Table 2).

Table 2
Akaike information criterion (AIC) and likelihood ratio test (LRT) for the genetic effects for diameter at breast height (DBH) and Pilodyn penetration (PP) in eucalyptus.

Variance components and genetic parameters

For DBH, genetic variance was not stable over the environmental gradient, ranging from 0.9073 (E2) to 2.5426 (E4); and residual and phenotypic variances increased over the environmental gradient (Table 3). Heritability estimates were not stable over the environmental gradient, ranging from 0.22 (E1) to 0.39 (E3); and mean accuracies were higher than 0.90 in all environments (Table 3).

For PP, genetic, residual, and phenotypic variances were not stable over the environmental gradient (Table 3). Genetic variance ranged from 3.6123 (E1) to 4.6763 (E2); and residual and phenotypic variances reached a peak in E2 (Table 3). Heritability estimates were not stable over the environmental gradient, ranging from 0.45 (E2) to 0.63 (E3); and mean accuracies were higher than 0.95 in all environments (Table 3).

Table 3
Estimates of variance components and genetic parameters for diameter at breast height (DBH) and Pilodyn penetration (PP) in eucalyptus, in each environment (E).

Reaction norms

Genotypic plasticity and G × E interactions (nonlinear) were observed for DBH (Fig. 1a) and PP (Fig. 1b), since the reaction norms intersected, diverged, or converged (van Eeuwijk et al. 2016van Eeuwijk, F. A., Bustos-Korts, D. V., and Malosetti, M. (2016). What should students in plant breeding know about the statistical aspects of genotype × environment interactions? Crop Science, 56, 2119-2140. https://doi.org/10.2135/cropsci2015.06.0375
https://doi.org/10.2135/cropsci2015.06.0...
).

Figure 1
Reaction norms for diameter at breast height (DBH) (a) and Pilodyn penetration (PP) (b) over environmental gradient. Each curve represents one eucalyptus clone.

Eigenfunctions

For DBH, first, second, third and fourth eigenfunctions explained, respectively, 78.26, 15.54, 4.68 and 1.51% of the genetic variation (Fig. 2a). For PP, first, second, third, and fourth eigenfunctions explained, respectively, 66.80, 32.48, 0.56, and 0.15% of the genetic variation (Fig. 2b).

Figure 2
Estimates of the eigenfunctions for diameter at breast height (DBH) (a) and Pilodyn penetration (PP) (b) over environmental gradient. Their proportional eigenvalues for the genetic covariance function are in parentheses.

Area under the reaction norms

The areas under the reaction norms ranged from 6.14 (clone 204) to 18.12 (clone 24) cm2 for DBH (supplementary material – Table S1), and from 9.95 (clone 91) to 29.79 (clone 43) mm2 for PP (Supplementary material – Table S1).

DISCUSSION

Random regression and model selection

The covariance functions developed by Kirkpatrick and Heckman (1989)Kirkpatrick, M., and Heckman, N. (1989). A quantitative genetic model for growth, shape and other infinite-dimensional characters. Journal of Mathematical Biology, 27, 429-450. https://doi.org/10.1007/BF00290638
https://doi.org/10.1007/BF00290638...
that uses orthogonal base functions, such as Legendre polynomials, allows the fit of virtually any shape of growth curves or reaction norms (Calus et al. 2004Calus, M. P. L., Bijma, P., and Veerkamp, R. F. (2004). Effects of data structure on the estimation of covariance functions to describe genotype by environment interactions in a reaction norm model. Genetics Selection Evolution, 36, 489-507. https://doi.org/10.1186/1297-9686-36-5-489
https://doi.org/10.1186/1297-9686-36-5-4...
). Reaction norms model the trajectory of genetic values along the environmental gradient, i.e., as a deviation from other fixed and random effects included in the model (Resende et al. 2014Resende, M. D. V., Silva, F. F., and Azevedo, C. F. (2014). Estatística matemática, biométrica e computacional. Viçosa: UFV.). Kirkpatrick et al. (1990)Kirkpatrick, M., Lofsvold, D., and Bulmer, M. (1990). Analyses of the Inheritance, Selection and Evolution of Growth Trajectories. Genetics, 124, 979-993. demonstrated the equivalence between random regression and covariance functions.

Among the various criteria for selection of models, the AIC is prominent (Cavanaugh and Neath 2019Cavanaugh, J. E., and Neath, A. A. (2019). The Akaike information criterion: Background, derivation, properties, application, interpretation and refinements. WIREs Computational Statistics, 11, e1460. https://doi.org/10.1002/wics.1460
https://doi.org/10.1002/wics.1460...
). The selected model fits heterogeneous residuals (i.e., one residual variance for each environment), and the genetic effects was modeled by Legendre polynomials of degree three for DBH and PP. This implies the estimation of 14 parameters of covariance. Heterogeneous residuals were also reported by Resende et al. (2017)Resende, R. T., Resende, M. D. V., Silva, F. F., and Takahashi, E. K. (2017). Acurácia preditiva de testes clonais de Eucalyptus spp. utilizando efeitos aditivos do parentesco e validação cruzada. Scientia Forestalis, 45, 39-47. https://doi.org/10.18671/scifor.v45n113.04
https://doi.org/10.18671/scifor.v45n113....
evaluating clonal trials of eucalyptus.

The random regression model can be considered a reduced and simplified multiple-trait model, which allows the same parameters of interest (heritability and genetic correlation among all pairs of environments) to be estimated, but with lower parameterization and with less computational effort (Resende et al. 2001Resende, M. D. V., Rezende, G. D. S. P., and Fernandes, J. S. C. (2001). Regressão aleatória e funções de covariância na análise de medidas repetidas. Revista de Matemática e Estatística, 19, 21-40.). These models directly define covariance functions, and since there are reliable estimates of variance components, they allow the prediction of genetic values of a genotype in different environments, based on evaluation in only one environment (Alves et al. 2020Alves, R. S., Resende, M. D. V., Azevedo, C. F., Silva, F. F., Rocha, J. R. A. S. C., Nunes, A. C. P., Carneiro, A. P. S., and Santos, G. A. (2020). Optimization of Eucalyptus breeding through random regression models allowing for reaction norms in response to environmental gradients. Tree Genetics & Genomes, 16, 38. https://doi.org/10.1007/s11295-020-01431-5
https://doi.org/10.1007/s11295-020-01431...
).

Variance components and genetic parameters

The estimation of variance components and prediction of genetic values are essential procedures in any breeding program. Currently, REML/BLUP is the standard procedure for estimation of variance components and optimal selection in plant breeding (Resende 2016Resende, M. D. V. (2016). Software Selegen-REML/BLUP: A useful tool for plant breeding. Crop Breeding and Applied Biotechnology, 16, 330-339. https://doi.org/10.1590/1984-70332016v16n4a49
https://doi.org/10.1590/1984-70332016v16...
). Knowledge of genetic parameters is of great importance in plant breeding, since the breeding strategy to be used depends on the information obtained from these parameters (Resende 2002Resende, M. D. V. (2002). Genética biométrica e estatística no melhoramento de plantas perenes. Brasília: Embrapa.).

According to Resende (2015)Resende, M. D. V. (2015). Genética quantitativa e de populações. Visconde do Rio Branco: Suprema., DBH has heritabilities of moderate magnitude (0.15 < hg2 < 0.50), and PP has heritabilities of moderate (0.15 < hg2 < 0.50) to high (h2g >0.50) magnitudes. Moderate heritability (h2g = 0.18, for DBH) were also reported by Nunes et al. (2017)Nunes, A. C. P., Resende, M. D. V., Santos, G. A., and Alves, R. S. (2017). Evaluation of different selection indices combining Pilodyn penetration and growth performance in Eucalyptus clones. Crop Breeding and Applied Biotechnology, 17, 206-213. https://doi.org/10.1590/1984-70332017v17n3a32
https://doi.org/10.1590/1984-70332017v17...
, who used the compound symmetry model for genetic evaluation of eucalyptus clones.

One of the most relevant parameters for evaluation of the effectiveness of the inference about the predicted genetic value of a genotype is selective accuracy (Resende and Duarte 2007Resende, M. D. V., and Duarte, J. B. (2007). Precisão e controle de qualidade em experimentos de avaliação de cultivares. Pesquisa Agropecuária Tropical, 37, 182-194.). This parameter indicates the correct arrangement of the genotypes for selection and recommendation purposes. This parameter does not only depend on the magnitude of the residual variation and the number of replications, but also on the ratio between the genetic and residual variations associated with the traits under evaluation (Resende and Duarte 2007Resende, M. D. V., and Duarte, J. B. (2007). Precisão e controle de qualidade em experimentos de avaliação de cultivares. Pesquisa Agropecuária Tropical, 37, 182-194.). In this study, very high accuracies (rg^g  0.90) were estimated for DBH and PP in all environments, indicating high reliability and a favorable scenario for recommendation of superior clones since high accuracy allows correct ranking of the genotypes.

Reaction norms

The presence of G × E interactions is very clear since the reaction norms are non-constant, the genotypes show genotypic plasticity; and when the reaction norms intersect, a complex G × E interaction occurs (van Eeuwijk et al. 2016van Eeuwijk, F. A., Bustos-Korts, D. V., and Malosetti, M. (2016). What should students in plant breeding know about the statistical aspects of genotype × environment interactions? Crop Science, 56, 2119-2140. https://doi.org/10.2135/cropsci2015.06.0375
https://doi.org/10.2135/cropsci2015.06.0...
). This type of G × E interactions has more severe consequences for breeders as it changes the rank of genotypes in accordance with environmental conditions, i.e., it indicates that the superior genotype in one environment will not normally perform as well in another environment (Resende 2015Resende, M. D. V. (2015). Genética quantitativa e de populações. Visconde do Rio Branco: Suprema.).

Genotypic plasticity is essential for genotype performance in changing environments (Rocha et al. 2018Rocha, J. R. A. S. C., Marçal, T. S., Salvador, F. V., Silva, A. C., Machado, J. C., and Carneiro, P. C. S. (2018). Genetic insights into elephantgrass persistence for bioenergy purpose. Plos One, 13, e0203818. https://doi.org/10.1371/journal.pone.0203818
https://doi.org/10.1371/journal.pone.020...
). Reaction norms shows that the evaluated clones present various forms of genotypic plasticity. In this context, genotypic plasticity can be considered as favorable or unfavorable changes for genotype adaptedness (van Eeuwijk et al. 2016van Eeuwijk, F. A., Bustos-Korts, D. V., and Malosetti, M. (2016). What should students in plant breeding know about the statistical aspects of genotype × environment interactions? Crop Science, 56, 2119-2140. https://doi.org/10.2135/cropsci2015.06.0375
https://doi.org/10.2135/cropsci2015.06.0...
).

Resende et al. (2018)Resende, R. T., Soares, A. A. V., Forrester, D. I., Marcatti, G. E., Santos, A. R., Takahashi, E. K., Silva, F. F., Grattapaglia, D., Resende, M. D. V., and Leite, H. G. (2018). Environmental uniformity, site quality and tree competition interact to determine stand productivity of clonal Eucalyptus. Forest Ecology and Management, 410, 76-83. https://doi.org/10.1016/j.foreco.2017.12.038
https://doi.org/10.1016/j.foreco.2017.12...
investigated the environmental uniformity, site quality and tree competition interact to determine stand productivity of clonal eucalyptus and showed the importance of adopting environmental gradient-based approaches in tree genetic testing and clone recommendation as a way to more accurately match genotypes to specific sites. Marchal et al. (2019)Marchal, A., Schlichting, C. D., Gobin, R., Balandier, P., Millier, F., Muñoz, F., Pâques, L. E., and Sánchez, L. (2019). Deciphering hybrid larch reaction norms using random regression. Gene, Genomes, Genetics, 9, 21-32. https://doi.org/10.1534/g3.118.200697
https://doi.org/10.1534/g3.118.200697...
investigated the role of genotypic plasticity on construction of hybrid larch (Larix decidua × Larix kaempferi) heterosis and on expression of its quantitative genetic parameters. They used random regression models fitted through Legendre polynomials to model reaction norms of ring width and wood density with respect to water availability and concluded that hybrid larch appeared to be the most plastic taxon as its superiority over its parental species increased with increasing water availability.

Eigenfunctions

The estimation of covariances between the random regression coefficients produces estimates of covariance functions (Kirkpatrick et al. 1990Kirkpatrick, M., Lofsvold, D., and Bulmer, M. (1990). Analyses of the Inheritance, Selection and Evolution of Growth Trajectories. Genetics, 124, 979-993.), which refer to a continuous description of the covariance structure of the trait along the environmental gradient. The analyses of the eigenfunctions are given by the total variance decomposition considering the principal components analyses (Arnal et al. 2019Arnal, M., Larroque, H., Leclerc, H., Ducrocq, V., and Robert-Granié, C. (2019). Genetic parameters for first lactation dairy traits in the Alpine and Saanen goat breeds using a random regression test-day model. Genetics Selection Evolution, 51, 43. https://doi.org/10.1186/s12711-019-0485-3
https://doi.org/10.1186/s12711-019-0485-...
). This approach is similar to genetic correlations among the environments (Van der Werf et al. 1998Van der Werf, J. H. J., Goddard, M. E., and Meyer, K. (1998). The use of covariance functions and random regressions for genetic evaluation of milk production based on test day records. Journal of Dairy Science, 81, 3300-3308. https://doi.org/10.3168/jds.S0022-0302(98)75895-3
https://doi.org/10.3168/jds.S0022-0302(9...
).

According to Kirkpatrick et al. (1990)Kirkpatrick, M., Lofsvold, D., and Bulmer, M. (1990). Analyses of the Inheritance, Selection and Evolution of Growth Trajectories. Genetics, 124, 979-993., the first eigenfunction clustered general adaptability genes that was equally expressed in all environments. This can be interpreted as the genetic correlation that exists among the environments. The second eigenfunction clustered specific adaptability genes that expressed themselves depending on environmental differences. This can be interpreted as a lack of genetic correlation among the environments. The third and fourth eigenfunctions showed small eigenvalues and represent deformations for which there is little (or no) genetic variation.

Area under the reaction norms

In plant breeding, G × E interactions can reduce heritability and genetic gain with the selection. Li et al. (2017)Li, Y., Suontoma, M., Burdon, R. D., and Dungey, H. S. (2017). Genotype by environment interactions in forest tree breeding: review of methodology and perspectives on research and application. Tree Genetics & Genomes, 13, 60. https://doi.org/10.1007/s11295-017-1144-x
https://doi.org/10.1007/s11295-017-1144-...
comment that breeders have been adopting two selection strategies in the presence of significant G × E interactions: selecting stable genotypes that are not sensitive to environmental changes or selecting genotypes for specific environments in order to maximize genetic gain with the selection at that environment. In context of the present work, both selection strategies can be applied.

The genotype ranking was performed based on the areas under the reaction norms. The advantage of this strategy is that selection response can be predicted not only in genotypic expression in any environment but also in quantifying the environmental sensitivity of the trait through the reaction norm (robustness or responsiveness to changes in the environment), and it can be used for any number of environments. It is important to highlight that in this study, random regression models were used to fit realistic reaction norms, allowing investigation of changes in genetic covariances along the environmental gradient, as suggested recently by several authors (Marcatti et al. 2017Marcatti, G. E., Resende, R. T., Resende, M. D. V., Ribeiro, C. A. A. S., Santos, A. R., Cruz, J. P., and Leite, H. G. (2017). GIS-based approach applied to optimizing recommendations of Eucalyptus genotypes. Forest Ecology and Management, 392, 144-153. https://doi.org/10.1016/j.foreco.2017.03.006
https://doi.org/10.1016/j.foreco.2017.03...
; Li et al. 2017Li, Y., Suontoma, M., Burdon, R. D., and Dungey, H. S. (2017). Genotype by environment interactions in forest tree breeding: review of methodology and perspectives on research and application. Tree Genetics & Genomes, 13, 60. https://doi.org/10.1007/s11295-017-1144-x
https://doi.org/10.1007/s11295-017-1144-...
; Marchal et al. 2019Marchal, A., Schlichting, C. D., Gobin, R., Balandier, P., Millier, F., Muñoz, F., Pâques, L. E., and Sánchez, L. (2019). Deciphering hybrid larch reaction norms using random regression. Gene, Genomes, Genetics, 9, 21-32. https://doi.org/10.1534/g3.118.200697
https://doi.org/10.1534/g3.118.200697...
).

Spatial genotypic plasticity is generally what interests’ breeders most because of the operational implications for deployment of varieties (Marchal et al. 2019Marchal, A., Schlichting, C. D., Gobin, R., Balandier, P., Millier, F., Muñoz, F., Pâques, L. E., and Sánchez, L. (2019). Deciphering hybrid larch reaction norms using random regression. Gene, Genomes, Genetics, 9, 21-32. https://doi.org/10.1534/g3.118.200697
https://doi.org/10.1534/g3.118.200697...
). For DBH, clones with larger area under the reaction norms are desirable as they relate directly to the volume of the tree (Alves et al. 2018Alves, R. S., Rocha, J. R. A. S. C., Teodoro, P. E., Resende, M. D. V., Henriques, E. P., Silva, L. A., Carneiro, P. C. S., and Bhering, L. L. (2018). Multiple-trait BLUP: a suitable strategy for genetic selection of Eucalyptus. Tree Genetics & Genomes, 14, 77. https://doi.org/10.1007/s11295-018-1292-7
https://doi.org/10.1007/s11295-018-1292-...
). However, for PP, which is related to the basic density of wood, the direction of the selection depends of the purpose of raw material (pulp, bio-oil, firewood, charcoal, among others) (Fonseca et al. 2010Fonseca, S. M., Resende, M. D. V., Alfenas, A. C., Guimarães, L. M. S., Assis, T. F., and Grattapaglia, D. (2010). Manual Prático de Melhoramento Genético do Eucalipto. Viçosa: UFV.).

CONCLUSION

The results showed that random regression models fitted through Legendre polynomials are a powerful technique to quantifying individual variation in reaction norms and therefore can be efficiently applied in the genetic evaluation of eucalyptus. Besides that, for untested environments, the genetic selection can be made based on the areas under the reaction norms.

ERRATA

  • In the article Quantifying individual variation in reaction norms using random regression models fitted through Legendre polynomials: application in eucalyptus breeding with DOI: https://doi.org/10.1590/1678-4499.20200125, published in Bragantia vol.79 no.4 Campinas Oct./Dec. 2020:
    • In the footline where is read Bragantia, Campinas, v. 79, n. 4, p.360-376, 2020

    • Should be read Bragantia, Campinas, v. 79, n. 4, p.485-501, 2020.

ACKNOWLEDGMENTS

We wish to thank the CMPC Company for the partnership.

  • FUNDERS

    Conselho Nacional de Desenvolvimento Científico e Tecnológico
    Coordenação de Aperfeiçoamento de Pessoal de Nível Superior – Financing Code 001

REFERENCES

  • Akaike, H. (1974). A new look at the statistical model identification. IEEE Transactions on Automatic Control, 19, 716-723. https://doi.org/10.1109/TAC.1974.1100705
    » https://doi.org/10.1109/TAC.1974.1100705
  • Alves, R. S., Rocha, J. R. A. S. C., Teodoro, P. E., Resende, M. D. V., Henriques, E. P., Silva, L. A., Carneiro, P. C. S., and Bhering, L. L. (2018). Multiple-trait BLUP: a suitable strategy for genetic selection of Eucalyptus Tree Genetics & Genomes, 14, 77. https://doi.org/10.1007/s11295-018-1292-7
    » https://doi.org/10.1007/s11295-018-1292-7
  • Alves, R. S., Resende, M. D. V., Azevedo, C. F., Silva, F. F., Rocha, J. R. A. S. C., Nunes, A. C. P., Carneiro, A. P. S., and Santos, G. A. (2020). Optimization of Eucalyptus breeding through random regression models allowing for reaction norms in response to environmental gradients. Tree Genetics & Genomes, 16, 38. https://doi.org/10.1007/s11295-020-01431-5
    » https://doi.org/10.1007/s11295-020-01431-5
  • Arnal, M., Larroque, H., Leclerc, H., Ducrocq, V., and Robert-Granié, C. (2019). Genetic parameters for first lactation dairy traits in the Alpine and Saanen goat breeds using a random regression test-day model. Genetics Selection Evolution, 51, 43. https://doi.org/10.1186/s12711-019-0485-3
    » https://doi.org/10.1186/s12711-019-0485-3
  • Calus, M. P. L., Bijma, P., and Veerkamp, R. F. (2004). Effects of data structure on the estimation of covariance functions to describe genotype by environment interactions in a reaction norm model. Genetics Selection Evolution, 36, 489-507. https://doi.org/10.1186/1297-9686-36-5-489
    » https://doi.org/10.1186/1297-9686-36-5-489
  • Cavanaugh, J. E., and Neath, A. A. (2019). The Akaike information criterion: Background, derivation, properties, application, interpretation and refinements. WIREs Computational Statistics, 11, e1460. https://doi.org/10.1002/wics.1460
    » https://doi.org/10.1002/wics.1460
  • Fonseca, S. M., Resende, M. D. V., Alfenas, A. C., Guimarães, L. M. S., Assis, T. F., and Grattapaglia, D. (2010). Manual Prático de Melhoramento Genético do Eucalipto. Viçosa: UFV.
  • Gilmour, A. R., Gogel, B. J., Cullis, B. R., Welham, S. J., and Thompson, R. (2015). ASReml User Guide. Hemel Hempstead: VSN International. [Accessed Jan. 20, 2020]. Available at: https://www.hpc.iastate.edu/sites/default/files/uploads/ASREML/UserGuideStructural.pdf
    » https://www.hpc.iastate.edu/sites/default/files/uploads/ASREML/UserGuideStructural.pdf
  • Greaves, B. L., Borralho, N. M. G., Raymond, C. A., and Farrington, A. (1996). Use of a Pilodyn for the indirect selection of basic density in Eucalyptus nitens Canadian Journal of Forest Research, 26, 1643-1650. https://doi.org/10.1139/x26-185
    » https://doi.org/10.1139/x26-185
  • Henderson, C. R. (1975). Best linear unbiased estimation and prediction under a selection model. Biometrics, 31, 423-447. https://doi.org/10.2307/2529430
    » https://doi.org/10.2307/2529430
  • Jarquín, D., Crossa, J., Lacaze, X., Du Cheyron, P., Daucourt, J., Lorgeou, J., Piraux, F., Guerreiro, L., Pérez, P., Calus, M., Burgueño, J., and Campos, G. (2014). A reaction norm model for genomic selection using high-dimensional genomic and environmental data. Theoretical and Applied Genetics, 127, 595-607. https://doi.org/10.1007/s00122-013-2243-1
    » https://doi.org/10.1007/s00122-013-2243-1
  • Kirkpatrick, M., and Heckman, N. (1989). A quantitative genetic model for growth, shape and other infinite-dimensional characters. Journal of Mathematical Biology, 27, 429-450. https://doi.org/10.1007/BF00290638
    » https://doi.org/10.1007/BF00290638
  • Kirkpatrick, M., Lofsvold, D., and Bulmer, M. (1990). Analyses of the Inheritance, Selection and Evolution of Growth Trajectories. Genetics, 124, 979-993.
  • Li, Y., Suontoma, M., Burdon, R. D., and Dungey, H. S. (2017). Genotype by environment interactions in forest tree breeding: review of methodology and perspectives on research and application. Tree Genetics & Genomes, 13, 60. https://doi.org/10.1007/s11295-017-1144-x
    » https://doi.org/10.1007/s11295-017-1144-x
  • Marcatti, G. E., Resende, R. T., Resende, M. D. V., Ribeiro, C. A. A. S., Santos, A. R., Cruz, J. P., and Leite, H. G. (2017). GIS-based approach applied to optimizing recommendations of Eucalyptus genotypes. Forest Ecology and Management, 392, 144-153. https://doi.org/10.1016/j.foreco.2017.03.006
    » https://doi.org/10.1016/j.foreco.2017.03.006
  • Marchal, A., Schlichting, C. D., Gobin, R., Balandier, P., Millier, F., Muñoz, F., Pâques, L. E., and Sánchez, L. (2019). Deciphering hybrid larch reaction norms using random regression. Gene, Genomes, Genetics, 9, 21-32. https://doi.org/10.1534/g3.118.200697
    » https://doi.org/10.1534/g3.118.200697
  • Morais Júnior, O. P., Duarte, J. B., Breseghello, F., Coelho, A. S., Morais, O. P., and Magalhães Júnior, A. M. (2018). Single-step reaction norm models for genomic prediction in multienvironment recurrent selection trials. Crop Science, 58, 592-607. https://doi.org/10.2135/cropsci2017.06.0366
    » https://doi.org/10.2135/cropsci2017.06.0366
  • Mrode, R. A. (2014). Linear models for the prediction of animal breeding values. Wallingford: CAB International.
  • Nunes, A. C. P., Resende, M. D. V., Santos, G. A., and Alves, R. S. (2017). Evaluation of different selection indices combining Pilodyn penetration and growth performance in Eucalyptus clones. Crop Breeding and Applied Biotechnology, 17, 206-213. https://doi.org/10.1590/1984-70332017v17n3a32
    » https://doi.org/10.1590/1984-70332017v17n3a32
  • Patterson, H. D., and Thompson, R. (1971). Recovery of inter-block information when block sizes are unequal. Biometrika, 58, 545-554. https://doi.org/10.2307/2334389
    » https://doi.org/10.2307/2334389
  • R Core Team. (2018). R: A language and environment for statistical computing. Vienna: R Foundation for Statistical Computing. https://www.r-project.org/
    » https://www.r-project.org/
  • Resende, M. D. V., Rezende, G. D. S. P., and Fernandes, J. S. C. (2001). Regressão aleatória e funções de covariância na análise de medidas repetidas. Revista de Matemática e Estatística, 19, 21-40.
  • Resende, M. D. V. (2002). Genética biométrica e estatística no melhoramento de plantas perenes. Brasília: Embrapa.
  • Resende, M. D. V., and Duarte, J. B. (2007). Precisão e controle de qualidade em experimentos de avaliação de cultivares. Pesquisa Agropecuária Tropical, 37, 182-194.
  • Resende, M. D. V., Silva, F. F., and Azevedo, C. F. (2014). Estatística matemática, biométrica e computacional. Viçosa: UFV.
  • Resende, M. D. V. (2015). Genética quantitativa e de populações. Visconde do Rio Branco: Suprema.
  • Resende, M. D. V. (2016). Software Selegen-REML/BLUP: A useful tool for plant breeding. Crop Breeding and Applied Biotechnology, 16, 330-339. https://doi.org/10.1590/1984-70332016v16n4a49
    » https://doi.org/10.1590/1984-70332016v16n4a49
  • Resende, R. T., Resende, M. D. V., Silva, F. F., and Takahashi, E. K. (2017). Acurácia preditiva de testes clonais de Eucalyptus spp. utilizando efeitos aditivos do parentesco e validação cruzada. Scientia Forestalis, 45, 39-47. https://doi.org/10.18671/scifor.v45n113.04
    » https://doi.org/10.18671/scifor.v45n113.04
  • Resende, R. T., Soares, A. A. V., Forrester, D. I., Marcatti, G. E., Santos, A. R., Takahashi, E. K., Silva, F. F., Grattapaglia, D., Resende, M. D. V., and Leite, H. G. (2018). Environmental uniformity, site quality and tree competition interact to determine stand productivity of clonal Eucalyptus Forest Ecology and Management, 410, 76-83. https://doi.org/10.1016/j.foreco.2017.12.038
    » https://doi.org/10.1016/j.foreco.2017.12.038
  • Rocha, J. R. A. S. C., Marçal, T. S., Salvador, F. V., Silva, A. C., Machado, J. C., and Carneiro, P. C. S. (2018). Genetic insights into elephantgrass persistence for bioenergy purpose. Plos One, 13, e0203818. https://doi.org/10.1371/journal.pone.0203818
    » https://doi.org/10.1371/journal.pone.0203818
  • Schaeffer, L. R. (2016). Random regression models. Guelph: University of Guelph.
  • Van der Werf, J. H. J., Goddard, M. E., and Meyer, K. (1998). The use of covariance functions and random regressions for genetic evaluation of milk production based on test day records. Journal of Dairy Science, 81, 3300-3308. https://doi.org/10.3168/jds.S0022-0302(98)75895-3
    » https://doi.org/10.3168/jds.S0022-0302(98)75895-3
  • van Eeuwijk, F. A., Bustos-Korts, D. V., and Malosetti, M. (2016). What should students in plant breeding know about the statistical aspects of genotype × environment interactions? Crop Science, 56, 2119-2140. https://doi.org/10.2135/cropsci2015.06.0375
    » https://doi.org/10.2135/cropsci2015.06.0375
  • Wilks, S. S. (1938). The large-sample distribution of the likelihood ratio for testing composite hypothesis. Annals of Mathematical Statistics, 9, 60-62. https://doi.org/10.1214/aoms/1177732360
    » https://doi.org/10.1214/aoms/1177732360

Edited by

Section Editor: Alberto Cargnelutti Filho

Publication Dates

  • Publication in this collection
    19 Oct 2020
  • Date of issue
    Oct-Dec 2020

History

  • Received
    26 Mar 2020
  • Accepted
    01 Sept 2020
Instituto Agronômico de Campinas Avenida Barão de Itapura, 1481, 13020-902, Tel.: +55 19 2137-0653, Fax: +55 19 2137-0666 - Campinas - SP - Brazil
E-mail: bragantia@iac.sp.gov.br