Acessibilidade / Reportar erro

Bayesian approach, traditional method, and mixed models for multienvironment trials of soybean

Abordagem bayesiana, método tradicional e modelos mistos para experimentos multiambientes na cultura da soja

Abstract:

The objective of this work was to compare the Bayesian approach and the frequentist methods to estimate means and genetic parameters in soybean multienvironment trials. Fifty-one soybean lines and four controls were evaluated in a randomized complete block design, in six environments, with three replicates, and soybean grain yield was determined. The half-normal prior and uniform distributions were used in combination with parameters obtained from data of 18 genotypes collected in previous and related experiments. The genotypic values of the genotypes of high- and low-grain yield, clustered by the Bayesian approach, differed from the means obtained by the frequentist inference. Soybean assessed through the Bayesian approach showed genetic parameter values of the mixed model (REML/Blup) close to those of the following variables: mean heritability (h2mg), accuracy of genotype selection (Acgen), coefficient of genetic variation (CVgi%), and coefficient of environmental variation (CVe%). Therefore, the mixed model methodology and the Bayesian approach lead to similar results for genetic parameters in multienvironment trials.

Index terms:
Glycine max; mathematical modeling; prior distribution in plant breeding

Resumo:

O objetivo deste trabalho foi comparar a abordagem bayesiana e os métodos frequentistas para estimar as médias e os parâmetros genéticos em experimentos multiambientes de soja. Cinquenta e uma linhagens de soja e quatro testemunhas foram avaliadas em delineamento de blocos ao acaso, em seis ambientes, com três repetições, e a produtividade de grãos foi determinada. As distribuições “half-normal” a priori e uniformes foram utilizadas em combinação com parâmetros obtidos de dados de 18 genótipos coletados em experimentos anteriores e relacionados. Os valores genotípicos de genótipos com alta e baixa produção de grãos, agrupados pela abordagem bayesiana, diferiram das médias obtidas pela inferência frequentista. A soja avaliada pela abordagem bayesiana apresentou valores de parâmetros genéticos de modelos mistos (REML/Blup) próximos daqueles das seguintes variáveis: herdabilidade média (h2mg), acurácia da seleção dos genótipos (Acgen), coeficiente de variação genético (CVgi%) e coeficiente de variação ambiental (CVe%). Portanto, em experimentos multiambientes, a metodologia de modelos mistos e a abordagem bayesiana produzem resultados similares de parâmetros genéticos.

Termos para indexação:
Glycine max; modelagem matemática; distribuição a priori no melhoramento genético

Introduction

Multienvironmental statistical analyses have been the frequentist methodology adopted in soybean breeding programs because it loses previous experimental data information (Omer et al., 2014aOMER, S.O.; ABDALLA, A.W.H.; CECCARELLI, S.; GRANDO, S.; SINGH, M. Bayesian estimation of heritability and genetic gain for subsets of genotypes evaluated in a larger set of genotypes in a block design. European Journal of Experimental Biology, v.4, p.566-575, 2014a.). Therefore, analyses based on the frequentist approach and on variance components are treated as constants that ignore any prior information (Gelman, 2006GELMAN, A. Prior distributions for variance parameters in hierarchical models (comment on article by Browne and Draper). Bayesian Analysis, v.1, p.515-534, 2006. DOI: 10.1214/06-BA117A.
https://doi.org/10.1214/06-BA117A...
). Additionally, based on previous studies, the Bayesian approach improves the data accuracy and provides statistic-inference information that is more realistic (Singh et al., 2015SINGH, M.; AL-YASSIN, A.; OMER, S.O. Bayesian estimation of genotypes means, precision, and genetic gain due to selection from routinely used barley trials. Crop Science, v.55, p.501-513, 2015. DOI: 10.2135/cropsci2014.02.0111.
https://doi.org/10.2135/cropsci2014.02.0...
).

The Bayesian approach can be applied to several fields such as cluster definition analyses (Priolli et al., 2013PRIOLLI, R.H.G.; WYSMIERSKI, P.T.; CUNHA, C.P. da; PINHEIRO, J.B.; VELLO, N.A. Genetic structure and a selected core set of Brazilian soybean cultivars. Genetics and Molecular Biology, v.36, p.382-390, 2013. DOI: 10.1590/S1415-47572013005000034.
https://doi.org/10.1590/S1415-4757201300...
), phenological calculation changes (Shen & Liu, 2015SHEN, Y.; LIU, X. Phenological changes of corn and soybeans over US by Bayesian change-point model. Sustainability, v.7, p.6781-6803, 2015. DOI: 10.3390/su7066781.
https://doi.org/10.3390/su7066781...
), and Bayesian network to study the relation between traits (Valentim et al., 2007VALENTIM, F.L.; SILVA, R.M.D.A.; ALVES, M.D.C. Modelos de redes bayesiana para incidência da ferrugem asiática na soja, cultivar Suprema, em diferentes condições de temperatura e molhamento foliar. In: SIMPÓSIO BRASILEIRO DE PESQUISA OPERACIONAL, 39., 2007, Fortaleza. A pesquisa operacional e o desenvolvimento sustentável: anais. Fortaleza: [s.n.], 2007. p. 2677-2678.). Moreover, to our knowledge, there are no Bayesian methods available to find the genotypic values in soybean breeding programs, so far.

Genotype value prediction for superior materials is the core problem in plant breeding programs because an accurate knowledge of the true variance component value is necessary, which is only found through adequate methods (Borges et al., 2010BORGES, V.; FERREIRA, P.V.; SOARES, L.; SANTOS, G.M.; SANTOS, A.M.M. Seleção de clones de batata-doce pelo procedimento REML/BLUP. Acta Scientiarum. Agronomy, v.32, p.643-649, 2010. DOI: 10.4025/actasciagron.v32i4.4837.
https://doi.org/10.4025/actasciagron.v32...
). Accordingly, the Bayesian approach emerged as an alternative method to estimate the genotype value in the plant breeding field.

Prior information on phenotypic data is often available in ongoing crop improvement programs, and it can be used to estimate the variance component and the genetic parameters through the application of the Bayesian methodology. There is no elucidating Bayesian study based on the use of prior information to estimate variance components and genetic parameters to be used in soybean breeding programs.

The objective of this work was to compare the Bayesian approach and the frequentist methods to estimate means and genetic parameters in soybean multienvironment trials.

Materials and Methods

Grain yield (kg ha-1) of soybean (Glycine max L.) advanced lines was assessed. The experiment followed a randomized complete block design with three replicates. The plots consisted of 4 m long rows, spaced at 0.45 m from each other. For the analyses, only the two rows in the middle of the plot were taken into account. A total of 51 soybean lines resulting from simple, double, quadruple, and octuple crosses, by using different genitors CD-216 and Conquista (MG-BR46) cultivars, were tested. Checks were commercial cultivars Potência and V-Max.

The genotypes were assessed in three municipalities of São Paulo state, in different crop seasons: Pindorama (2013/2014), Jaboticabal (2013/2014, 2014/2015, and 2015/2016), and Piracicaba (2013/2014 and 2014/2015). The crop seasons and locations were combined and taken as environments for all the inferences, as recommended by Omer et al. (2015)OMER, S.O.; ABDALLA, A.W.H.; MOHAMMED, M.H.; SINGH, M. Bayesian estimation of genotype-by-environment interaction in sorghum variety trials. Communications in Biometry and Crop Science, v.10, p.82-95, 2015..

As to the frequentist approach, the grain yield and multiple environment model, including the environment block, the block within environment effect, genotypes and the genotypes by environment interaction, can be described as, Yijk=μ+Ej+Rk(j)+Gi+(GEij)+εijk, in which: Yijk is the observed yield data vector of the ith genotype, in the kth block of the jth environment; μ is the general mean; Ej is the jth environment effect; Rk(j) is the kth block effect from the jth environment; Gi is the ith genotype effect; GEij is the ith genotype and the jth environment interaction; and εijk is the error. The environments were fixed; the replicate within environments, genotypes, GE interaction, and error were the random effect. In general, the following assumption was taken into consideration:

R k ( j ) ~ N ( 0, σ R 2 ) , G i ~ N ( 0, σ G 2 ) , G E i j ~ N ( 0, σ G E 2 ) , and ε i j k ~ N ( 0, σ e 2 ) ,

in which: N(0, σ2) is the normal distribution with zero mean and σ2 variance.

The genotype variance components, genotypes x environment and rep/environment effects, and the interaction in the Bayesian approach were random variables with known parameters in the prior distribution. These half-normal, uniform, prior distributions have been studied and recommended to calculate the variance components, the corresponding standard deviation components, or the scalar parameters (Gelman, 2006GELMAN, A. Prior distributions for variance parameters in hierarchical models (comment on article by Browne and Draper). Bayesian Analysis, v.1, p.515-534, 2006. DOI: 10.1214/06-BA117A.
https://doi.org/10.1214/06-BA117A...
).

Monte Carlo via Markov chain (MCMC) was used to find the a posteriori distribution and the studied parameters. Computational analyses were composed of three chains, each one comprising 10,000 iterations (the first 5,000 ones were discarded) spaced by 15 point samples (thinning), and 1,002 iteration number simulations. The Gibbs sampler is a necessary iterative algorithm to verify the convergence through the Gelman-Rubin (Ȓ) statistics, which is used to assess the variance ratio between the chain and the variance within the chain. Ȓ values below 1.1 confirmed the iteration convergence (Gelman & Rubin, 1992GELMAN, A.; RUBIN, D.B. Inference from iterative simulation using multiple sequences. Statistical Science, v.7, p.457-472, 1992. DOI: 10.1214/ss/1177011136.
https://doi.org/10.1214/ss/1177011136...
).

Two prior distributions were tested. Type 1 was the positive half-normal to the σ2E, σ2G, σ2R, and σ2GE variances; its prior distribution values were calculated based on the inversed variance of previous trials by using 18 lines from the total of 51 evaluated ones, namely: Jab 4, Jab 5, Jab 6, Jab 7, Jab 8, Jab 10, Jab 12, Jab 15, Jab 16, Jab 17, Jab 19, Jab 20, Jab 22, Jab 23, Jab 24, Jab 39, Jab 40, and Jab 41. These genotypes were previously assessed in tree experiments conducted in Jaboticabal (2011/2012 and 2012/2013), and in Piracicaba (2012/2013) (Di Mauro et al., 2014DI MAURO, A.O.; GOMEZ, G.M.; UNÊDA-TREVISOLI, S.H.; PINHEIRO, J.B. Adaptive and agronomic performances of soybean genotypes derived from different genealogies through the use of several analytical strategies. African Journal of Agricultural Research, v.9, p.2146-2157, 2014. DOI: 10.5897/AJAR2014.8700.
https://doi.org/10.5897/AJAR2014.8700...
).

Type 2 prior distribution was tested through uniform distribution [0, and 5,000,000]. These values were used according to related studies (Di Mauro et al., 2014DI MAURO, A.O.; GOMEZ, G.M.; UNÊDA-TREVISOLI, S.H.; PINHEIRO, J.B. Adaptive and agronomic performances of soybean genotypes derived from different genealogies through the use of several analytical strategies. African Journal of Agricultural Research, v.9, p.2146-2157, 2014. DOI: 10.5897/AJAR2014.8700.
https://doi.org/10.5897/AJAR2014.8700...
). The selection of the best prior distribution to be used in the Bayesian analyses was achieved through -2 x log-likelihood to the mean of the posterior parameters, the posterior mean of -2 x log-likelihood, the effective number of parameters and the deviance information criteria (DIC) (Spiegelhalter et al., 2002SPIEGELHALTER, D.J.; BEST, N.G.; CARLIN, B.P.; VAN DER LINDE, A. Bayesian measures of model complexity and fit. Journal of the Royal Statistical Society: Series B (Statistical Methodology), v.64, p.583-639, 2002. DOI: 10.1111/1467-9868.00353.
https://doi.org/10.1111/1467-9868.00353...
).

The mean genotype was calculated through the Bayesian approach, mixed models, adjusted mean (lsmeans: least-squares means) and arithmetic mean. The correlation coefficient between mean and mean rank of the evaluated methods, and the genotype means were estimated. The Spearman correlation between ranks was estimated ranks.

The random effect on the model used in unbalance experiments (data) was assessed through the deviance analyses according to Resende (2007)RESENDE, M.D.V. de. SELEGEN-REML/BLUP: sistema estatístico e seleção genética computadorizada via modelos lineares mistos. Colombo: Embrapa Florestas, 2007. 359p., and through the analysis of variance.

All the Bayesian analyses were conducted in the R software (R Core Team, 2016R CORE TEAM. R: the R project for statistical computing. Vienna: The R Foundation, 2016. Available at: <Available at: http://www.R-project.org/ >. Accessed on: Oct. 17 2016.
http://www.R-project.org/...
) package R2WinBUGS (Sturtz et al., 2005STURTZ, S.; LIGGES, U.; GELMAN, A. R2WinBUGS: a package for running WinBUGS from R. Journal of Statistical Software, v.12, p.1-16, 2005. DOI: 10.18637/jss.v012.i03.
https://doi.org/10.18637/jss.v012.i03...
). Blups (genotypic value) and variances were analyzed through REML by using the lme4 package. Deviance analyses were assessed via lmtest package (Zeileis & Hothorn, 2002ZEILEIS, A.; HOTHORN, T. Diagnostic checking in regression relationships. R News, v.2/3, p.7-10, 2002. Available at: <Available at: http://CRAN.R-project.org/doc/Rnews/ >. Accessed on: Oct. 19 2016.
http://CRAN.R-project.org/doc/Rnews/...
).

Results and Discussion

All the variation sources were significant for F and for the likelihood ratio test (LRT) (Table 1).

Table 1.
Deviance analyses and F-test to measure the random and fixed effects, respectively, on soybean grain yield (kg ha-1).

As to the assessed environments, two of them were favorable and 4 were unfavorable; the checked environmental index signal was positive and negative (Ij). Besides, the coefficient of variation varied from 21.04% up to 51.47% (Table 2).

Table 2.
Soybean grain yield, overall mean, and the environmental index of the assessed locations and crop seasons.

The half-normal was the most appropriate prior distribution to find the genetic value through MCMC method. This prior distribution showed the smallest information deviance criterion (DIC), deviance of posterior mean parameters (D̂), and deviance of posterior mean (D), but the effective number of parameters (PD) in positive half-normal was higher than in the uniform distribution. The larger the effective number of parameters, the easier the data fit by the model (Table 3). The half-normal distribution has been used to set the number of replicates in plant breeding experimental designs because it has been showing the smallest deviance criteria (DIC) (Omer et al., 2014bOMER, S.O.; ABDALLA, A.W.H.; SARKER, A.; SINGH, M. Bayesian determination of the number of replications in crop trials. European Journal of Experimental Biology, v.4, p.129-133, 2014b.), as well as heritability and genetic gain estimates through the Bayesian approach (Omer et al., 2014aOMER, S.O.; ABDALLA, A.W.H.; CECCARELLI, S.; GRANDO, S.; SINGH, M. Bayesian estimation of heritability and genetic gain for subsets of genotypes evaluated in a larger set of genotypes in a block design. European Journal of Experimental Biology, v.4, p.566-575, 2014a.). The DIC is an estimated error prediction; the lower value indicates a better adjusted model. The DIC from half-normal distribution was lower than the distribution uniform value. It indicates that the half-normal prior distribution should be taken into consideration rather than the uniform distribution. Therefore, all the variance components were found through the half-normal prior distribution. Previous experimental data information in genetic experiments lead to better variance estimates and decrease the residual variance (Carneiro Júnior et al., 2005CARNEIRO JÚNIOR, J.M.; ASSIS, G.M.L. de; EUCLYDES, R.F.; LOPES, P.S. Influência da informação a priori na avaliação genética animal utilizando dados simulados. Revista Brasileira de Zootecnia, v.34, p.1905-1913, 2005. DOI: 10.1590/S1516-35982005000600014.
https://doi.org/10.1590/S1516-3598200500...
). Moreover, the chosen of priors to calculate posteriors, conducted through the Bayesian approach, was generated according to the inverse variance of 18 genotypes, collected from 55 genotypes in the present study.

Table 3.
Discrepancy statistics to the prior selection in a soybean trial for grain yield (kg ha-1) conducted in six environments.

The genotype behavior was estimated in percentiles of the previous values, and in their ranks, according to the posterior mean by using the Bayesian approach (Table 4). Moreover, the mean national soybean grain yield is 2,870 kg ha-1 (Conab, 2017CONAB (Brasil). Séries históricas de área plantada, produtividade e produção. 2017. Available at: <Available at: https://portaldeinformacoes.conab.gov.br/index.php/safra-serie-historica-dashboard >. Accessed on: Mar. 5 2017
https://portaldeinformacoes.conab.gov.br...
). Thus, taking into account the credibility interval, the genotypes Jab 42, Jab 44, Jab 32, Jab 31, Conquista, Jab 27, Jab 41, and Jab 5 showed higher values of the limit credibility interval that were above the mean national grain yield; therefore, they were considered the most productive genotypes. Besides, the Jab 42 line remains in the breeding process, and it was herein highlighted as a potential genotype that shows grain yield values above the national mean (Table 4).

Table 4.
Genotype mean predicted were ordered by posterior means, and its ranks were measured by the Bayesian approach and the frequentist analyses based on predicted genotypic values (BLUPs), on adjusted mean, and on the arithmetic mean of 55 soybean lines for grain yield (kg ha-1).

Genotypes Jab 48, Jab 15, Jab 26, Jab 24, Jab 6, and Jab 25 did not reach a high rank, taking into consideration the percentiles 2.5 and 5%; therefore, they should not be selected. According to the ranks, the percentiles 95 and 97.5% would show the predictions of the most unproductive genotypes, if it was selected based on the mean. Genotype Jab 42 may decrease to the 2nd position (percentile 97.5%), or remain in the 1st one (percentile 95%, and others).

Means obtained through the Bayesian approach, lsmeans, and genotypic mean were the least discrepant ones for magnitude and rank (Tables 4 and 5) because they are methods based on experimental designs used to adjust the means whenever necessary. The posterior distribution in the Bayesian method is generated for the studied parameters (genotype, heritability etc.) as a random effect. Genotypes are treated as fixed effect in lsmeans or in marginal mean predictions, and mean adjustment is done according to the factor found in the linear model (genotype, replicate/environment, and environment). Genotypes are treated as random effects in the mixed model method, and the same genotype mean in multi-environment experiments is equal to the general mean + Blup of the genotype + Blup in GxE interaction (Resende, 2007RESENDE, M.D.V. de. SELEGEN-REML/BLUP: sistema estatístico e seleção genética computadorizada via modelos lineares mistos. Colombo: Embrapa Florestas, 2007. 359p.).

Table 5.
Correlation coefficient between means and ranks of all the assessed methods. Superior diagonal, Pearson correlation between mean genotypes and the inferior diagonal, Spearman correlation coefficient between genotype ranks for soybean grain yield (kg ha-1).

The Bayesian method and Blups were similar for correlation means; they recorded r = 0.9974. The lsmeans method used to calculate the mean genotype showed the second highest correlation values through the Bayesian method, r = 0.9678. And the simple arithmetic mean method showed correlation r = 0.9663 (Table 5). The REML/Blup method is the standard one to calculate the random effects on genotype selection in plant breeding (Oliveira et al., 2016OLIVEIRA, G.H.F.; BUZINARO, R.; REVOLTI, L.T.M.; GIORGENON, C.H.B.; CHARNAI, K.; RESENDE, D.; MORO, G.V. An accurate prediction of maize crosses using diallel analysis and best linear unbiased predictor (BLUP). Chilean Journal of Agricultural Research, v.76, p.294-299, 2016. DOI: 10.4067/S0718-58392016000300005.
https://doi.org/10.4067/S0718-5839201600...
). This method showed a high correlation with the Bayesian approach in partially unbalanced experimental situations.

In the present study, there were 5% missed plots, and the data set was restricted to six plots only. However, these missed plots did not significantly influence the genotype rank effect. Thus, it is necessary to assess the Bayesian approach in a larger number of missed plots, environments, and heterogeneous variance situations, in order to verify its efficiency. The Bayesian approach was used to model the heterogeneous variances in an experiment conducted to estimate the error and the genotype x environment interaction variances. The Bayesian approach was able to provide a more parsimonious answer to the question about whether the use of weighted, or unweighted means, in multi-environment trial modeling, properly sets the heterogeneous variances in the assessed experiments (Edwards & Jannink, 2006EDWARDS, J.W.; JANNINK, J.-L. Bayesian modeling of heterogeneous error and genotype x environment interaction variances. Crop Science, v.46, p.820-833, 2006. DOI: 10.2135/cropsci2005.0164.
https://doi.org/10.2135/cropsci2005.0164...
).

The genetic parameters estimated through the analysis of variance and through the mixed model were similar to all the assessed genetic parameters. The genetic parameters estimated through the Bayesian approach and the mixed model were similar to the heritability at mean level (h2mg), genotype selection accuracy (Acgen), and genetic variation coefficient (CVgi%) (Table 6). Besides, the heritability and genotypic correlation values between genotype behaviors corroborate the published literature if one takes into consideration the different grain yield (kg ha-1) environments (Di Mauro et al., 2014DI MAURO, A.O.; GOMEZ, G.M.; UNÊDA-TREVISOLI, S.H.; PINHEIRO, J.B. Adaptive and agronomic performances of soybean genotypes derived from different genealogies through the use of several analytical strategies. African Journal of Agricultural Research, v.9, p.2146-2157, 2014. DOI: 10.5897/AJAR2014.8700.
https://doi.org/10.5897/AJAR2014.8700...
). Accuracy values were classified as high in both the Bayesian and mixed model methodologies, as these methods showed values above 0.7 (Resende & Duarte, 2007RESENDE, M.D.V. de; DUARTE, J.B. Precisão e controle de qualidade em experimentos de avaliação de cultivares. Pesquisa Agropecuária Tropical, v.37, p.182-194, 2007.).

Table 6.
Genetic parameters estimated through the analysis of variance, likelihood method, and Bayesian approach for soybean grain yield (kg ha-1).

The Bayesian approach showed the smallest GE interaction variance, showing the lowest determination coefficient of the GE interaction (c2int). Consequently, the genotypic correlation coefficient between genotypes in different environments (rgloc) increased up to the unit value. A rgloc value close to the unit in genotype selection implies a genotype-selection confidence increase in some tested environments (Resende, 2007RESENDE, M.D.V. de. SELEGEN-REML/BLUP: sistema estatístico e seleção genética computadorizada via modelos lineares mistos. Colombo: Embrapa Florestas, 2007. 359p.).

The GE interaction defined that rgloc ≥ 0.80 in simulation studies, using the analysis of variance, indicates a simple interaction. A rgloc ≤ 0.20 evidences a complex interaction (Cruz & Castoldi, 1991CRUZ, C.D.; CASTOLDI, F.L. Decomposição da interação genótipos x ambientes em partes simples e complexa. Revista Ceres, v.38, p.422-430, 1991. Available at: <Available at: http://www.ceres.ufv.br/ojs/index.php/ceres/article/view/2165/203 >. Accessed on: Oct. 12 2016.
http://www.ceres.ufv.br/ojs/index.php/ce...
). Thus, rgloc close to one indicates lesser complex GE interaction, adaptability, and stability because of the variation decrease of the environment (Rosado et al., 2012ROSADO, A.M.; ROSADO, T.B.; ALVES, A.A.; LAVIOLA, B.G.; BHERING, L.L. Seleção simultânea de clones de eucalipto de acordo com produtividade, estabilidade e adaptabilidade. Pesquisa Agropecuária Brasileira, v.47, p.964-971, 2012. DOI: 10.1590/S0100-204X2012000700013.
https://doi.org/10.1590/S0100-204X201200...
). The interaction effect is associated with two factors: the first simple one, which is the variability difference between genotypes in the environments; and the second one (the complex factor), which is indicated by genotype superiority inconsistency with the environment variation. In other words, there are genotypes that shows a superior behavior in a certain environment, but the same genotype is not observed in another environment; so, it increases intricacy in the suggested selection (Cruz et al., 2012CRUZ, C.D.; REGAZZI, A.J.; CARNEIRO, P.C.S. Modelos biométricos aplicados ao melhoramento genético. 4.ed. Viçosa: Ed. da UFV, 2012. v.1, 514p.).

The predicted error variance (PEV) in genotype values and the standard deviation in genotypic variance values (SEP) in the Bayesian approach were smaller than those in the mixed model, indicating a better adjustment in the mean soybean genotype calculated a posteriori. The smaller PEC and SEP values are desirable for plant breeding programs because they are directly linked to precision and accuracy maximization (Resende & Suarte, 2007RESENDE, M.D.V. de; DUARTE, J.B. Precisão e controle de qualidade em experimentos de avaliação de cultivares. Pesquisa Agropecuária Tropical, v.37, p.182-194, 2007.).

The advantages of using the Bayesian method in experimental data lies on its methodology, which allows of the use of previous data information, presents a lesser sensitive modeling to outliers than the frequentist methods, and works well with lesser assumptions (large sample numbers, balance experiments, etc.) in analysis processes (Singh et al., 2015SINGH, M.; AL-YASSIN, A.; OMER, S.O. Bayesian estimation of genotypes means, precision, and genetic gain due to selection from routinely used barley trials. Crop Science, v.55, p.501-513, 2015. DOI: 10.2135/cropsci2014.02.0111.
https://doi.org/10.2135/cropsci2014.02.0...
).

It is worth highlighting that the Bayesian approach applied to complex agriculture experimental data, such as years, crops, location, and interaction in the model, takes a bit longer than the mixed model in computer processes. Besides, with regard to specific cases based on prior information, the Bayesian analysis could assemble a posterior distribution, which would be highly influenced by the prior information. In this case, it is recommended to use the prior information as parsimony (Gelman, 2006GELMAN, A. Prior distributions for variance parameters in hierarchical models (comment on article by Browne and Draper). Bayesian Analysis, v.1, p.515-534, 2006. DOI: 10.1214/06-BA117A.
https://doi.org/10.1214/06-BA117A...
).

Conclusions

  1. The mixed models and the Bayesian methodology show similar genetic parameters.

  2. The mean genotypic values obtained through the Bayesian approach differ from those obtained through the frequentist method.

  3. The Bayesian approach uses previous soybean experimental data information that can be used as tools in soybean breeding programs.

Acknowledgments

To Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (Capes), for a doctoral scholarship to the first author.

References

  • BORGES, V.; FERREIRA, P.V.; SOARES, L.; SANTOS, G.M.; SANTOS, A.M.M. Seleção de clones de batata-doce pelo procedimento REML/BLUP. Acta Scientiarum. Agronomy, v.32, p.643-649, 2010. DOI: 10.4025/actasciagron.v32i4.4837.
    » https://doi.org/10.4025/actasciagron.v32i4.4837
  • CARNEIRO JÚNIOR, J.M.; ASSIS, G.M.L. de; EUCLYDES, R.F.; LOPES, P.S. Influência da informação a priori na avaliação genética animal utilizando dados simulados. Revista Brasileira de Zootecnia, v.34, p.1905-1913, 2005. DOI: 10.1590/S1516-35982005000600014.
    » https://doi.org/10.1590/S1516-35982005000600014
  • CONAB (Brasil). Séries históricas de área plantada, produtividade e produção. 2017. Available at: <Available at: https://portaldeinformacoes.conab.gov.br/index.php/safra-serie-historica-dashboard >. Accessed on: Mar. 5 2017
    » https://portaldeinformacoes.conab.gov.br/index.php/safra-serie-historica-dashboard
  • CRUZ, C.D.; CASTOLDI, F.L. Decomposição da interação genótipos x ambientes em partes simples e complexa. Revista Ceres, v.38, p.422-430, 1991. Available at: <Available at: http://www.ceres.ufv.br/ojs/index.php/ceres/article/view/2165/203 >. Accessed on: Oct. 12 2016.
    » http://www.ceres.ufv.br/ojs/index.php/ceres/article/view/2165/203
  • CRUZ, C.D.; REGAZZI, A.J.; CARNEIRO, P.C.S. Modelos biométricos aplicados ao melhoramento genético. 4.ed. Viçosa: Ed. da UFV, 2012. v.1, 514p.
  • DI MAURO, A.O.; GOMEZ, G.M.; UNÊDA-TREVISOLI, S.H.; PINHEIRO, J.B. Adaptive and agronomic performances of soybean genotypes derived from different genealogies through the use of several analytical strategies. African Journal of Agricultural Research, v.9, p.2146-2157, 2014. DOI: 10.5897/AJAR2014.8700.
    » https://doi.org/10.5897/AJAR2014.8700
  • EDWARDS, J.W.; JANNINK, J.-L. Bayesian modeling of heterogeneous error and genotype x environment interaction variances. Crop Science, v.46, p.820-833, 2006. DOI: 10.2135/cropsci2005.0164.
    » https://doi.org/10.2135/cropsci2005.0164
  • GELMAN, A. Prior distributions for variance parameters in hierarchical models (comment on article by Browne and Draper). Bayesian Analysis, v.1, p.515-534, 2006. DOI: 10.1214/06-BA117A.
    » https://doi.org/10.1214/06-BA117A
  • GELMAN, A.; RUBIN, D.B. Inference from iterative simulation using multiple sequences. Statistical Science, v.7, p.457-472, 1992. DOI: 10.1214/ss/1177011136.
    » https://doi.org/10.1214/ss/1177011136
  • OLIVEIRA, G.H.F.; BUZINARO, R.; REVOLTI, L.T.M.; GIORGENON, C.H.B.; CHARNAI, K.; RESENDE, D.; MORO, G.V. An accurate prediction of maize crosses using diallel analysis and best linear unbiased predictor (BLUP). Chilean Journal of Agricultural Research, v.76, p.294-299, 2016. DOI: 10.4067/S0718-58392016000300005.
    » https://doi.org/10.4067/S0718-58392016000300005
  • OMER, S.O.; ABDALLA, A.W.H.; CECCARELLI, S.; GRANDO, S.; SINGH, M. Bayesian estimation of heritability and genetic gain for subsets of genotypes evaluated in a larger set of genotypes in a block design. European Journal of Experimental Biology, v.4, p.566-575, 2014a.
  • OMER, S.O.; ABDALLA, A.W.H.; MOHAMMED, M.H.; SINGH, M. Bayesian estimation of genotype-by-environment interaction in sorghum variety trials. Communications in Biometry and Crop Science, v.10, p.82-95, 2015.
  • OMER, S.O.; ABDALLA, A.W.H.; SARKER, A.; SINGH, M. Bayesian determination of the number of replications in crop trials. European Journal of Experimental Biology, v.4, p.129-133, 2014b.
  • PRIOLLI, R.H.G.; WYSMIERSKI, P.T.; CUNHA, C.P. da; PINHEIRO, J.B.; VELLO, N.A. Genetic structure and a selected core set of Brazilian soybean cultivars. Genetics and Molecular Biology, v.36, p.382-390, 2013. DOI: 10.1590/S1415-47572013005000034.
    » https://doi.org/10.1590/S1415-47572013005000034
  • R CORE TEAM. R: the R project for statistical computing. Vienna: The R Foundation, 2016. Available at: <Available at: http://www.R-project.org/ >. Accessed on: Oct. 17 2016.
    » http://www.R-project.org/
  • RESENDE, M.D.V. de. SELEGEN-REML/BLUP: sistema estatístico e seleção genética computadorizada via modelos lineares mistos. Colombo: Embrapa Florestas, 2007. 359p.
  • RESENDE, M.D.V. de; DUARTE, J.B. Precisão e controle de qualidade em experimentos de avaliação de cultivares. Pesquisa Agropecuária Tropical, v.37, p.182-194, 2007.
  • ROSADO, A.M.; ROSADO, T.B.; ALVES, A.A.; LAVIOLA, B.G.; BHERING, L.L. Seleção simultânea de clones de eucalipto de acordo com produtividade, estabilidade e adaptabilidade. Pesquisa Agropecuária Brasileira, v.47, p.964-971, 2012. DOI: 10.1590/S0100-204X2012000700013.
    » https://doi.org/10.1590/S0100-204X2012000700013
  • SHEN, Y.; LIU, X. Phenological changes of corn and soybeans over US by Bayesian change-point model. Sustainability, v.7, p.6781-6803, 2015. DOI: 10.3390/su7066781.
    » https://doi.org/10.3390/su7066781
  • SINGH, M.; AL-YASSIN, A.; OMER, S.O. Bayesian estimation of genotypes means, precision, and genetic gain due to selection from routinely used barley trials. Crop Science, v.55, p.501-513, 2015. DOI: 10.2135/cropsci2014.02.0111.
    » https://doi.org/10.2135/cropsci2014.02.0111
  • SPIEGELHALTER, D.J.; BEST, N.G.; CARLIN, B.P.; VAN DER LINDE, A. Bayesian measures of model complexity and fit. Journal of the Royal Statistical Society: Series B (Statistical Methodology), v.64, p.583-639, 2002. DOI: 10.1111/1467-9868.00353.
    » https://doi.org/10.1111/1467-9868.00353
  • STURTZ, S.; LIGGES, U.; GELMAN, A. R2WinBUGS: a package for running WinBUGS from R. Journal of Statistical Software, v.12, p.1-16, 2005. DOI: 10.18637/jss.v012.i03.
    » https://doi.org/10.18637/jss.v012.i03
  • VALENTIM, F.L.; SILVA, R.M.D.A.; ALVES, M.D.C. Modelos de redes bayesiana para incidência da ferrugem asiática na soja, cultivar Suprema, em diferentes condições de temperatura e molhamento foliar. In: SIMPÓSIO BRASILEIRO DE PESQUISA OPERACIONAL, 39., 2007, Fortaleza. A pesquisa operacional e o desenvolvimento sustentável: anais. Fortaleza: [s.n.], 2007. p. 2677-2678.
  • ZEILEIS, A.; HOTHORN, T. Diagnostic checking in regression relationships. R News, v.2/3, p.7-10, 2002. Available at: <Available at: http://CRAN.R-project.org/doc/Rnews/ >. Accessed on: Oct. 19 2016.
    » http://CRAN.R-project.org/doc/Rnews/

Publication Dates

  • Publication in this collection
    Oct 2018

History

  • Received
    04 June 2017
  • Accepted
    21 Dec 2017
Embrapa Secretaria de Pesquisa e Desenvolvimento; Pesquisa Agropecuária Brasileira Caixa Postal 040315, 70770-901 Brasília DF Brazil, Tel. +55 61 3448-1813, Fax +55 61 3340-5483 - Brasília - DF - Brazil
E-mail: pab@embrapa.br