Acessibilidade / Reportar erro

The parametric restrictions of the Griffing diallel analysis model: combining ability analysis

Abstracts

It was studied the parametric restrictions of the diallel analysis model of Griffing, method 2 (parents and F1 generations) and model 1 (fixed), in order to address the questions: i) does the statistical model need to be restricted? ii) do the restrictions satisfy the genetic parameter values? and iii) do they make the analysis and interpretation easier? Objectively, these questions can be answered as: i) yes, ii) not all of them, and iii) the analysis is easier, but the interpretation is the same as in the model with restrictions that satisfy the parameter values. The main conclusions were that: the statistical models for combining ability analysis are necessarily restricted; in the Griffing model (method 2, model 1), the restrictions relative to the specific combining ability (SCA) effects, <img src="http:/img/fbpe/gmb/v23n4/6246s1.gif" align="absmiddle"> and <img src="http:/img/fbpe/gmb/v23n4/6246s2.gif" align="absmiddle"> for all j, do not satisfy the parametric values, and the same inferences should be established from the analyses using the model with restrictions that satisfy the parametric values of SCA effects and that suggested by Griffing. A consequence of the restrictions of the Griffing model is to allow the definition of formulas for estimating the effects, their variances and the variances of contrasts of effects, as well as for calculating orthogonal sums of squares.


Com o propósito de estender os conhecimentos teóricos sobre o modelo de análise dialélica mais comumente empregado pelos melhoristas, o de Griffing, proposto em 1956, discute-se neste trabalho as restrições paramétricas associadas ao método 2 (pais e gerações F1), modelo 1 (fixo). As questões que conduziram à elaboração deste artigo foram: i) o modelo estatístico tem que ser restrito?; ii) as restrições satisfazem os valores dos parâmetros genéticos? e iii) elas tornam a análise e a interpretação mais fáceis? De forma objetiva, estas questões podem ser assim respondidas: i) sim; ii) nem todas; e iii) a análise sim, mas a interpretação é a mesma do modelo restrito definido neste trabalho. As principais conclusões são: os modelos estatísticos de análise de capacidade de combinação de um grupo de populações são obrigatoriamente restritos; as restrições <img src="http:/img/fbpe/gmb/v23n4/6246fr15.gif" align="absmiddle"> para todo j, do modelo proposto por Griffing, não satisfazem os valores paramétricos dos efeitos de capacidade específica de combinação; embora haja diferenças entre a análise segundo o modelo definido neste trabalho e a proposta por Griffing, as inferências a serem estabelecidas devem ser exatamente as mesmas; uma conseqüência das restrições do modelo de Griffing é possibilitar a definição de fórmulas para a estimação dos efeitos, das suas variâncias e das variâncias de contrastes entre efeitos, e para o cálculo das somas de quadrados, o que certamente foi fundamental para garantir o uso generalizado desta metodologia.


The parametric restrictions of the Griffing diallel analysis model: combining ability analysis

José Marcelo Soriano Viana

Departamento de Biologia Geral, Universidade Federal de Viçosa, 36571-000 Viçosa, MG, Brasil.

ABSTRACT

It was studied the parametric restrictions of the diallel analysis model of Griffing, method 2 (parents and F1 generations) and model 1 (fixed), in order to address the questions: i) does the statistical model need to be restricted? ii) do the restrictions satisfy the genetic parameter values? and iii) do they make the analysis and interpretation easier? Objectively, these questions can be answered as: i) yes, ii) not all of them, and iii) the analysis is easier, but the interpretation is the same as in the model with restrictions that satisfy the parameter values. The main conclusions were that: the statistical models for combining ability analysis are necessarily restricted; in the Griffing model (method 2, model 1), the restrictions relative to the specific combining ability (SCA) effects, and for all j, do not satisfy the parametric values, and the same inferences should be established from the analyses using the model with restrictions that satisfy the parametric values of SCA effects and that suggested by Griffing. A consequence of the restrictions of the Griffing model is to allow the definition of formulas for estimating the effects, their variances and the variances of contrasts of effects, as well as for calculating orthogonal sums of squares.

INTRODUCTION

The expressions general combining ability (GCA) and specific combining ability (SCA) have been used since the 1940s (Sprague and Tatum, 1942) to designate properties of endogamic families and inbred lines under selection in hybrid production programs. Since then, maize breeders have been aware of the need to evaluate endogamic families involved in the selfing process, normally S3 progenies, in order to reduce the number of inbred lines. For this, the families are normally crossed with a tester (a single- or double-cross hybrid or an open-pollinated population) and the progenies are then evaluated experimentally. Outstanding endogamic families, capable of generating superior progenies, i.e., with an elevated GCA, would continue to be selfed. The inbred lines produced would also be evaluated based on crossing between themselves, a system called diallel. The aim would be to identify pairs of inbred lines that produce the best hybrids. This second stage of evaluation is known as the SCA test.

Specific methods for estimating the effects of GCA and SCA or the variances of these effects have been described. These methodologies generally consist of an analysis of variance of data from progenies obtained by diallel, whether complete (Griffing, 1956a,b), partial (Geraldi and Miranda Filho, 1988), circulant (Kempthorne and Curnow, 1961), or otherwise. The main characteristics of these methods are their generality, since they can be used for any species, and the easiness of analysis and interpretation.

Among several methods of combining ability analysis, that described by Griffing (1956b) is probably the most used. Easy computer handling, guaranteed by the availability of appropriate formulas, and the care the author and others took to discuss in detail the value of the effect variance estimates or the effect estimates (Griffing, 1956a; Cruz and Vencovsky, 1989) in breeding programs, have contributed to the widespread use of this model. Neverthless, not every aspect of this method has been evaluated in detail. If the diallel's parents are not a sample from a population, i.e., when the model is fixed, then the parametric restrictions associated with the statistical model must be addressed. Does the model really have to be restricted? Do the imposed restrictions satisfy the genetic parameters? Do the restrictions make analysis and interpretation easier? The objective of this study was to answer these and other questions.

MATERIAL AND METHODS

The data used here are those reported by Gardner and Eberhart (1966) (Table I).

Parametric values of the components of the Griffing (1956b) model

We consider a polygenic system with k genes, each with two allelic forms and no epistasis, that are responsible for determining a quantitative character in a diploid species with sexual reproduction. Independent of the reproduction system of the species (cross-pollination or self-pollination), the genotypic mean of a population can be expressed as

where mi is the mean of the genotypic values of the homozygotes relative to locus i, pij is the frequency in population j of the locus i gene that increases the trait expression, ai is the difference between the genotypic value of the homozygote with highest expression and mi, and di is the deviation due to dominance relative to locus i.

Note that pij is equal to 1 or to 0 if the species is autogamous. In the case of allogamous species, the population is in Hardy-Weinberg equilibrium and is not endogamic. If population j is one of the N parents of a diallel, the genotypic mean of the hybrid produced by crossing populations j and j' is

The mean of the hybrids in which parent j participates, including population j, is

where is the average frequency in the parents of the diallel of the locus i gene that increases character expression.

The diallel mean is

The effect of GCA of population j corresponds to

If the parents are open-pollinated populations, the greater the value of the GCA effect of a population, the greater the frequencies of the genes that increase the trait expression and the greater the differences between the gene frequencies of the population and the average frequencies in the diallel's parents. If the parents are inbred lines or pure lines, the greater the value of the GCA effect of a population, the greater the number of genes that increase the trait expression and, consequently, the greater the number of positive differences between the gene frequency of the population and the average frequency in the diallel's parents. Therefore, the effect of GCA is an indicator of the superiority of the population and of its divergence relative to the diallel's parents, thus providing the same information as the parameters `population effect' (vj) and `variety heterosis' (Hj) of the Gardner and Eberhart (1966) model. The correlation between the GCA effect and vj approaches close to 1 as the degree of dominance approaches zero. For one gene and populations with pij values equal to 0, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9 and 1, the correlation values are 0.87 and 1 when |d/a| = 2 and d/a = 0, respectively.

The genotypic means of population j and the hybrid of parents j and j' can be expressed as

where sjj is the effect of SCA of a population with itself, and sjj' is the effect of SCA of populations j and j'.

When there is negative unidirectional dominance, the sjj values are positive. If the deviations due to dominance are positive, the sjj values are negative. When the SCA effect of a population with itself is null, the population has the same gene frequencies as the average frequencies in the group of the diallel's parents. Furthermore, the higher the absolute value of sjj, the greater the differences between the gene frequencies in the population and the average frequencies in the diallel's parents. Therefore, sjj is also an indicator of the population's divergence relative to the parental group. For one gene and populations with pij values equal to 0, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9 and 1, the correlation between the absolute values of gj and sjj is 0.965 for any degree of dominance (d/a ¹ 0). The sjj values thus provide the same inferences as the estimates of the parameters `average heterosis' and `variety heterosis' of the Gardner and Eberhart (1966) model. Although sjj is a measure of the population divergence relative to the diallel's parents and `variety heterosis' is an indicator of the differences between the gene frequencies in the population and the average frequencies in the other genitors, the correlation between the absolute values of these parameters for one gene is 1, independent of the populations and of the degree of dominance (d/a ¹ 0).

When there is negative unidirectional dominance, the lowest values of sjj' identify the populations with the greatest differences in gene frequencies between themselves and in relation to the average frequencies in the diallel's parents. The highest values identify the populations with the smallest differences in gene frequencies, but with different gene frequencies relative to the average frequencies in the diallel's parents. When there is positive unidirectional dominance, the lowest sjj' values are associated with populations with the smallest differences in gene frequencies, but which have differences in gene frequencies relative to the average frequencies in the diallel's parents. The highest values indicate populations with the highest differences in gene frequencies between themselves and in relation to the average frequencies in the genitor group. Independent of the direction of the dominance effects, sjj' values close to the average value indicate populations with small differences in gene frequencies between themselves and relative to the average frequencies in the parental group. If the sjj' value equals zero, the gene frequencies in one of the populations are equal to the average frequencies in the diallel's parents. The average value of the SCA effects of different populations is

Therefore the SCA of populations j and j' is an indicator of the divergence between them and of their divergence from the diallel's parents, in a manner similar to the parameter `specific heterosis' (Sjj') of the Gardner and Eberhart (1966) model. For one gene and populations with pij values equal to 0, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9 and 1, the correlation between the effect of SCA of two populations and specific heterosis is 0.998, independent of the degree of dominance (d/a ¹ 0). This and the other results given above show that the methods of Griffing (1956b) and of Gardner and Eberhart (1966) are not complementary but indeed absolutely equivalent in terms of inference.

The phenotypic means in a diallel table can be defined as

where and , are the average values of the residues associated with the observations for population j and the hybrid of parents j and j', respectively.

The equations above define a statistical model that is necessarily restricted, since E(gj) = 0 and E(sjj') = 0 (j fixed), in contrast to statistical models with fixed effects, where the researcher decides whether to impose parametric restrictions or not. Thus, the restrictions associated with the previously defined model for combining ability analysis are

for all j, giving N + 1 linearly independent restrictions.

This model is therefore not the same as that defined by Griffing (1956b) (method 2, model 1), whose restrictions are

for all j, giving N + 1 linearly independent restrictions.

Note that restrictions (ii) and (iii) in the latter case do not satisfy the parametric values of the SCA effects, since

and, therefore, Why, then, were they considered by Griffing (1956b)? The answer is simply because they allow one to obtain formulas for the estimation of the effects and the effect variances and for the calculation of the sums of squares of the GCA and SCA. The availability of formulas is an indispensable condition for the widespread use of methods in quantitative genetics, and allows the elaboration of software for data processing by professionals who are not specialized in this field and in the theory of linear models. The model described here is of full column rank and, therefore, it is possible to develop formulas for estimating the effects and their variances and for computing the sums of squares. As will be shown, when there is dominance, the sum of squares attributable to the null hypothesis for GCA effects is not orthogonal to the sum of squares due to the hypothesis of nullity of the SCA effects.

Despite the differences between the two models, the hypotheses that can be tested coincide and include:

1. H0(1) - equality of the treatment means (parents and hybrids): to test this hypothesis is the same as testing that there are no gene frequency differences between the genitors (pij = pi for all i and j).

2. H0(2) - nullity of the GCA effects (gj = for all j): in the case of rejection of the hypothesis H0(1), testing this hypothesis is the same as testing that the GCA effects of populations with different genetic structures are null. Of course, the rejection of H0(2) means that there are differences in the gene frequencies between the parents; acceptance of the hypothesis does not imply the contrary.

3. H0(3) - nullity of the SCA effects (sjj' for all j and j'): if there are gene frequency differences between the parents, testing this hypothesis is the same as testing that there is no dominance (di for all i).

RESULTS AND DISCUSSION

Although there is a difference between the analyses of variance for grain yield of six corn populations and their hybrids using the model described here and that proposed by Griffing (1956b), in relation to the sum of squares attributable to the null hypothesis of the GCA effects (Table II), the inferences remain the same, namely, there are differences in the gene frequencies between the populations and there is dominance in the polygenic system under analysis.

The differences between the estimates of the GCA effects and the SCA effects of each population with itself in the two models did not change the inferences since the correlations between the estimated values of gj and of sjj are, respectively, 0.9955 and 1. Nevertheless, the variances of the estimates of the effects and the contrasts between the effects of the model defined in this study are greater (Table III). Population 6 has the genes that increase yield at the highest frequency, and is the most divergent relative to the parental group. This population ought, therefore, to be selected for an intrapopulational improvement program. Population 4 is the second best, but not the second most divergent, and can also be chosen for intrapopulational improvement. The negative sjj values indicate unidirectional positive dominance.

Although the estimates of the SCA effects of different populations in the two models are different, the inferences that can be established are the same, since the correlation between the estimated values is 0.985. Differences between the variance values of the effects and the contrasts were also observed (Table IV). The variance estimates of sjj' effects and of the contrasts between these effects are lowest for the model developed above. Populations 3 and 4, 2 and 6, 1 and 2, and 3 and 6 showed the greatest differences in gene frequencies between themselves and in relation to the average frequencies in the diallel's parents. Thus, for interpopulational improvement programs, the second pair ought to be selected, because of the superiority of population 6. There was little divergence between parents 1 and 6, and 1 and 5, and from them relative to the group of parents, since their SCA effects come close to the average value (0.865). There were only slight differences in gene frequency between populations 1 and 3, and 2 and 3, although they were divergent in relation to the diallel's parents. The frequencies in these populations ought to be lower than the average frequencies.

CONCLUSIONS

The statistical models for combining ability analysis of a population group are obligatorily restricted. The restrictions for all j, of the model proposed by Griffing (1956b) (method 2, model 1) do not satisfy the parametric values of SCA effects. Although there are differences between the analysis according to the model described here and that suggested by Griffing (1956b), the inferences should be the same. A consequence of the restrictions of the Griffing (1956b) model is to allow the definition of formulas for estimating the effects, their variances and the variances of contrasts of effects, as well as for the calculation of orthogonal sums of squares. In conclusion, it is generally quite safe to use the Griffing model.

RESUMO

Com o propósito de estender os conhecimentos teóricos sobre o modelo de análise dialélica mais comumente empregado pelos melhoristas, o de Griffing, proposto em 1956, discute-se neste trabalho as restrições paramétricas associadas ao método 2 (pais e gerações F1), modelo 1 (fixo). As questões que conduziram à elaboração deste artigo foram: i) o modelo estatístico tem que ser restrito?; ii) as restrições satisfazem os valores dos parâmetros genéticos? e iii) elas tornam a análise e a interpretação mais fáceis? De forma objetiva, estas questões podem ser assim respondidas: i) sim; ii) nem todas; e iii) a análise sim, mas a interpretação é a mesma do modelo restrito definido neste trabalho. As principais conclusões são: os modelos estatísticos de análise de capacidade de combinação de um grupo de populações são obrigatoriamente restritos; as restrições para todo j, do modelo proposto por Griffing, não satisfazem os valores paramétricos dos efeitos de capacidade específica de combinação; embora haja diferenças entre a análise segundo o modelo definido neste trabalho e a proposta por Griffing, as inferências a serem estabelecidas devem ser exatamente as mesmas; uma conseqüência das restrições do modelo de Griffing é possibilitar a definição de fórmulas para a estimação dos efeitos, das suas variâncias e das variâncias de contrastes entre efeitos, e para o cálculo das somas de quadrados, o que certamente foi fundamental para garantir o uso generalizado desta metodologia.

(Received February 21, 2000)

  • Cruz, C.D. and Vencovsky, R. (1989). Comparaçăo de alguns métodos de análise dialélica. Rev. Bras. Genet. 12: 425-438.
  • Gardner, C.O. and Eberhart, S.A. (1966). Analysis and interpretation of the variety cross diallel and related populations. Biometrics 22: 439-452.
  • Geraldi, I.O. and Miranda Filho, J.B. de (1988). Adapted models for the analysis of combining ability of varieties in partial diallel crosses. Rev. Bras. Genet. 11: 419-430.
  • Griffing, B. (1956a). A generalized treatment of the use of diallel crosses in quantitative inheritance. Heredity 10: 31-50.
  • Griffing, B. (1956b). Concept of general and specific combining ability in relation to diallel crossing system. Aust. J. Biol. Sci. 9: 463-493.
  • Kempthorne, O. and Curnow, R.N. (1961). The partial diallel cross. Biometrics 17: 229-250.
  • Sprague, G.F. and Tatum, L.A. (1942). General vs. specific combining ability in single crosses of corn. J. Am. Soc. Agron. 34: 923-932.

Publication Dates

  • Publication in this collection
    14 Nov 2001
  • Date of issue
    Dec 2000

History

  • Received
    21 Feb 2000
Sociedade Brasileira de Genética Rua Cap. Adelmio Norberto da Silva, 736, 14025-670 Ribeirão Preto SP Brazil, Tel.: (55 16) 3911-4130 / Fax.: (55 16) 3621-3552 - Ribeirão Preto - SP - Brazil
E-mail: editor@gmb.org.br