Acessibilidade / Reportar erro

Estimation of variances of the effects of incomplete diallels using a matrix approach

Abstract

A matrix approach is described for assessing the variance of effects in incomplete diallels designs. The method is illustrated by reference to simulated complete and incomplete diallels using different combinations of constraints, average degree of dominance and, for the incomplete diallel, number of hybrids. Our results showed that caution should be taken in working with incomplete diallels under conditions of overdominance because there were changes in the rank of the genotypes when the excluded hybrid had parents with a low frequency of the favorable allele (i.e. the allele which increases expression of a character). The expression described in this paper is a rapid and safe approach to estimate variances and covariances of the effects of contrasts of incomplete diallels.

incomplete diallel; general combining ability; specific combining ability


PLANT GENETICS

RESEARCH ARTICLE

Estimation of variances of the effects of incomplete diallels using a matrix approach

Giselle Mariano Lessa de AssisI; Ricardo da FonsecaII; Cosme Damião CruzIII; José Marques Carneiro JúniorI

IUniversidade Federal de Viçosa, Departamento de Zootecnia, Viçosa, MG, Brazil

IIUniversidade Estadual Paulista "Júlio de Mesquita Filho", Unidade Diferenciada de Dracena, Faculdade de Zootecnia, Dracena, SP, Brazil

IIIUniversidade Federal de Viçosa, Departamento de Biologia Geral, Viçosa, MG, Brazil

Correspondence Correspondence to Giselle Mariano Lessa de Assis Universidade Federal de Viçosa Departamento de Zootecnia Secretaria de Genética e Melhoramento Av. P.H. Rolphs, 36571-000 Campus Universitário Viçosa, MG, Brazil E-mail: giselle@alunos.ufv.br

ABSTRACT

A matrix approach is described for assessing the variance of effects in incomplete diallels designs. The method is illustrated by reference to simulated complete and incomplete diallels using different combinations of constraints, average degree of dominance and, for the incomplete diallel, number of hybrids. Our results showed that caution should be taken in working with incomplete diallels under conditions of overdominance because there were changes in the rank of the genotypes when the excluded hybrid had parents with a low frequency of the favorable allele (i.e. the allele which increases expression of a character). The expression described in this paper is a rapid and safe approach to estimate variances and covariances of the effects of contrasts of incomplete diallels.

Key words: incomplete diallel, general combining ability, specific combining ability.

Introduction

Diallel crosses are an efficient method for the evaluation of the genetic and heterotic potential of parents used in breeding programs. One of the methodologies most widely used to analyze diallel data is that of Griffing (1956) in which the effects of the general combining ability (GCA) and the specific combining ability (SCA) are estimated and then used to find the best parent and hybrid combinations.

In genetic studies using diallel crosses, two analyses of variance (ANOVAR) are performed. The first ANOVAR is directed at the experimental design with the principal objective of testing the significance of the treatments and obtaining the residual mean square which will be used in the next ANOVAR. The second ANOVAR decomposes the sum of squares due to the treatments in CGA and SCA sum of squares using observed means, the variance components and quadratic components also being estimated to provide information on the relative importance of the GCA and SCA. The significance of the effects is verified and any interesting contrasts detected from the estimated variances.

This methodology was initially developed to analyze complete diallels, where all the crosses are present for a group of parents but, in practice, it is often not possible to obtain all the possible hybrid combinations because of various factors including differences in fertility, low seed-production, fertilization problems and adaptation divergence as well as hybrid losses caused by lack of germination, climatic factors, pests and disease, the result being an incomplete diallel cross. In incomplete diallels the parents are represented by a variable number of crosses and the methodology of analysis has to be modified to take into account unbalanced data.

In both complete and incomplete diallels, constraints are often applied to both the parameters and solutions used to solve the equations which calculate the GCA and SCA. The constraints on the solutions do not modify the parametric space and therefore, linear combinations, although of interest, remain non-estimable. Parametric constraints, which are part of the model, are also used in order to enlarge the set of estimable functions and provide more appropriate interpretations of the formulated hypothesis. It is also the case that the solutions arrived at with constraints on the solutions and on the parameters are the same (Cruz and Regazzi, 1997). In incomplete diallels analysis, however, differences in the literature are found regarding the constraints used to estimate the GCA and SCA effects (Machado, 1986 and Cavalcanti et al., 1997).

With the unbalanced data which characterizes incomplete diallels the estimates of GCA and SCA effects are not obtained as easily as in balanced diallels and a matrix approach is required to produce reliable estimates of the variance and contrasts of the effects (Vencovsky and Barriga, 1992). Although the information provided by tests of effects and contrasts is important, few published papers on incomplete diallels show formulae to calculate these estimators (Keuls and Garretsen, 1997, Garretsen and Keuls, 1978 and Shi et al., 1997).

Our paper describes suitable expressions, based on matrices, for the estimation of the covariance of the GCA and SCA effects of complete and incomplete diallels and the calculation of the variance of the contrasts. We also compare the results obtained using several different constraints and check whether or not the analysis of incomplete diallels generates inadequate results considering some degrees of dominance.

Material and Methods

Genotypic values

We considered five genetic populations (Pi, i = 1, 2,..., 5) in Hardy-Weinberg equilibrium, the average genotypic values of the hybrids being simulated by setting the favorable allele (i.e. the allele which increases the expression of a character) frequencies between the parents to the values shown in Table 1. Only one locus was assumed with genotypes of AA, Aa and aa and genotypic values of u + a for the one homozygote, u + d for the heterozygote and u - a for the other homozygote, where u is a constant inherent to each observation (Falconer, 1987). The average degree of dominance was also considered in the simulation and it was assumed that u = 10, a = 10 and d had values of 0, 5, 10 and 15 with the average degree of dominance being equal to 0, 0.5, 1.0 and 1.5 respectively. The estimated mean for each combination (Table 2) was given by:

Yij = u + (pi + pj - 1)a + (pi + pj - 2pi pj)d

Complete diallel analysis

Preliminary ANOVAR

The preliminary ANOVAR would normally be performed using a specific data set which would result in residual mean square values, but we used simulated means for the hybrids and assumed a mean squared error for the preliminary analysis of variance of 1.00 for all cited cases. If this type of analysis were to be carried out in practice it would use on an appropriate experiment design such as a randomized block design with 10 treatments and r repetitions according to the fixed model:

Yijk = m+tij + bk + eijk

where: Yijk: observation of the ij cross in block k; m: constant associated with the data; tij: effect of the cross ij; bk: effect of block k; eijk: effect of the random error associated with the ij cross in block k.

Complementary analysis

For the complementary analysis we used Griffing's method 4, in which the phenotypic values were represented by the mean of the p(p - 1)/2 hybrid combinations, where p is the number of parents (Table 3). The fixed statistical model was:

Yij = m+gi + gj + sij +

where: Yij: mean value of the hybrid resulting from the cross between parents i and j; m: constant associated with the data; gi, gj: effects of the general combining ability of the parents i and j respectively; sij: effect of the specific combining ability of the cross between the parents i and j; : random error associated with each observation.

Estimation of parameters

The least squares method was used to estimate the parameters using the normal equations X'X = X'y.

The X matrix was:

Two distinct sets of constraints were imposed on the X matrix:

a) Type 1 constraints: = 0 and , = 0 for all i.

b) Type 2 constraints: = 0, where ci = number of crosses where the parent i is present and , for all i.

Variances of effects and contrasts

The interesting variances of effects and contrasts were calculated by use of the estimators showed in Table 4 (Cruz and Regazzi, 1997). The same variances were also obtained by the matrix approach.

Tests of effects and contrasts

In order to test the effects and contrasts we used the Student's t test:

where, : estimate of the effect or contrast to be tested and (): estimate of the variance of .

Incomplete diallel analysis

Three analyses involving incomplete diallels were carried out using the same data set described previously. The incomplete data set was made by excluding hybrid Y15 for one analysis (Case 1), hybrids Y12 and Y13 for another analysis (Case 2) and hybrids Y35 and Y45 for the last analysis (Case 3).

Preliminary ANOVAR

The primary ANOVAR was conducted using a randomized block design, Case 1 with 9 treatments and r repetitions and Cases 2 and 3 with 8 treatments and r repetitions. The mean square error was assumed to be 1 with (t - 1)(r - 1) degrees of freedom, where t is the number of treatments and r the number of repetitions (i.e. 3).

Complementary analysis and estimation of parameters

The same methodology and the same two constraints outlined previously were used for all of the three cases.

Variance of effects and contrasts

The matrix method was used to calculate the covariances of the estimates of the parameters in the model:

Cov () = ( X'X - A'A)-1 X'X(X'X - A'A)-1,

where is the parametric vector to be estimated with dimension x1; X is the incidence matrix with dimension; A is the constraints matrix with dimension ; is the residual mean square error divided by the number of repetitions which produced the means of the hybrids included in the diallelic table.

The matrix dimensions shown above refer to the complete diallel model; for the incomplete diallel appropriate changes need to be made to take into account the missing hybrids. For example, matrix A for the complete diallel is:

while for the incomplete diallel (Case 1), the matrix is:

Tests of effects and contrasts

To test the effects and contrasts we used the Student's t test cited previously.

Results and Discussion

Derivation of an expression to estimate variances and covariances of parameter estimates of incomplete and complete diallels

According to definition, the matrix of covariances of the vector of the estimators of the parameters of the model ,(Cov()) is given by:

But = (X'X - A'A) -1 X'y and y = Xb + e, and it follows that:

Cov() = E{[X'X'- A'A)-1 X'Xb + (X'X - A'A)-1 X' e - b] x [( X'X - A'A )-1 X' Xb + (X'X - A'A)-1 X' e - b)]

But,

(X'X - A'A)-1X'Xb - b = f )

Thus:

Cov() = E[ f + (X'X - A'A)-1 X' e)

( f + (X'X - A'A)-1 X' e)

Cov() = E[(X'X - A'A)-1 X' e)

((X'X - A'A)-1 X' e)

Because (X'X - A'A)-1 is symmetric, it follows that:

Cov() = E[(X'X - A'A)-1 X' ee' X(X'X - A'A)-1]

Cov() = (X'X - A'A)-1 X' E (ee')X(X'X - A'A)-1

Because E(ee) = , it follows that:

Cov() = E[(X'X - A'A)-1 X' X(X'X - A'A)-1

Finally,

Cov() = E[(X'X - A'A)-1 X'X(X'X - A'A)-1

Complete and Incomplete diallel analysis

Complementary analysis

Table 5 summarizes the complementary ANOVAR of the complete and incomplete diallels. When two hybrids were excluded it was not possible to detect significance in the SCA values for any of the average degrees of dominance, which prejudices the biological interpretation of the data because it was not possible to detect dominance in a situation where it was supposed to occur. The effect of GCA was significant (p < 0.01) for every case.

Estimation of parameters

The estimates of the parameters of the complete and incomplete diallel model for different average degrees of dominance are given in Table 6, for type 1 constraints and Table 7, for type 2 constraints.

Variances of effects and contrasts

The variances of the effects and of the contrasts for the complete diallel estimated, using expressions taken from the published literature are shown in Table 8, while those obtained by our matrix approach, are given in Table 9. The variance of the effects can be extracted from the matrix data given in Table 9 and the contrasts calculated, the results being the same as those shown in Table 8. An example of how the variance of the contrast was calculated from the matrix is:

The variance estimates of effects and contrasts of the incomplete diallel obtained by the matrix method for Case 1 are given in Table 10 (for type 1 constraints) and in Table 11 (for type 2 constraints); for Case 2, in Table 12 (for type 1 constraints) and in Table 13 (for type 2 constraints); and for Case 3, in Table 14 (for type 1 constraints) and in Table 15 (for type 2 constraints).

Based on estimates of the mean, the effects of GCA and SCA for different degrees of balance (i.e. balanced and unbalanced data), the average degrees of dominance and the type (1 or 2) of constraints, we can make some conclusions which we will discuss below.

When the diallel is unbalanced, imposing different constraints with regard to the effects of GCA results in different estimates for these parameters, although this is not a major problem because the tests are based on the contrasts of these effects. An interesting contrast is Y = gi - gj, where the hypothesis test shows that this contrast equals zero. Rejecting this hypothesis means that there is significant difference between the effects of genotype i and genotype j in respect to GCA. The estimates of these contrasts are the same for both type 1 and 2 constraints.

Analyzing the different average degrees of dominance, our results indicate that:

a) In the absence of dominance loss of hybrids does not alter the estimates of the parameters of the model and thus does not alter the order of the genotypes according to the favorable frequencies of the allele when the constraint is type 1, but when the constraint is type 2 the estimates vary depending on the degree of balance and differ from the estimates of the complete diallel. However, for both constraints the ranking from the best to worst genotypes (i.e. P5, P4, P3, P2, P1) does not change. When only one hybrid (Y15) is lost the estimates are the same as for the complete diallel.

b) For partial dominance and complete dominance the estimates vary according to the degree of balance, although the rank of the genotype continues unaltered according to the allele frequencies.

c) When overdominance occurs, the loss of the Y12 and Y13 hybrids causes alteration in the ranking of the genotypes, which does not occur in the remaining unbalanced designs. In our example, the order of the best to the worst genotype was P5, P4, P1, P3, P2, which indicates that the loss of the hybrid whose parents have a low frequency of the favorable allele can favor these inferior hybrids when overdominance is present. The loss of three hybrids, in which the common parent has a high frequency of the favorable allele, does not produce changes in the rank of the genotype.

There are some interesting points in respect to the variance and covariances matrix. According to the different constraints the estimates of the variance and covariance of the estimates of the SCA effects were the same, which was not the case for the other parameters in the model. However, when we used data from the two matrices to calculate the variance of the contrast (Y = gi - gj) we obtained the same estimate, i.e., estimates of the variance of the contrast does not change with the constraints. Because of this statistical tests such as the Student's t test will produce the same results using either type I or type II constraints. In other words, when the data is unbalanced the estimate of the covariance between the GCA and SCA effects will always be equal to zero. It is therefore possible to test for the degree to which the data is unbalanced using the different estimates of variance and covariance of the GCA and SCA effects by identifying specific variance and covariance values in the matrix, e.g. to obtain the estimate of the variance of the contrast Y1 = g1 - g2 when hybrid Y15 (Case 1) is excluded we can use:

Conclusions

The fact that different estimates of the GCA effects occur when different constraints are imposed is not important because the estimates of the contrasts usually tested are invariants.

Care should be taken in working with incomplete diallels in which overdominance is present because we found one instance where the rank order of the genotypes changed when the excluded hybrid had parents with a low frequency of the favorable allele.

The expression described in this paper is a rapid and safe approach to estimate variances and covariances of the effects of contrasts of incomplete diallels.

Acknowledgments

The authors thank the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES) for financial support.

Editor: Fabio de Melo Sene

Received: March 25, 2003;

Accepted: February 16, 2004.

  • Cavalcanti JJV, Pinto CABP, Crisostomo JR and Ferreira DF (1997) Capacidade de combinação do cajueiro (Anacardium occidentale L.) em relação à produção precoce de castanha. Revista Ceres 44:466-472.
  • Cruz CD and Regazzi AJ (1997) Modelos Biométricos Aplicados ao Melhoramento Genético. 2nd edition. UFV, Viçosa, 390 pp.
  • Garretsen F and Keuls M (1978) A general method for the analysis of genetic variation in complete and incomplete diallels and North Caroline II (NC II) designs. Part II: Procedures and general formulas for the fixed model. Euphytica 27:49-68.
  • Griffing B (1956) Concept of general and specific combining ability in relation to diallel crossing systems. Austr J Biol Sci 9:463-493.
  • Keuls M and Garretsen F (1977) A general method for the analysis of genetic variation in complete and incomplete diallels and North Caroline II designs. Part I: Procedures and general formulas for the random model. Euphytica 26:537-551.
  • Machado TM (1986) Avaliação de cruzamentos intervarietais de milho (Zea mays L.) utilizando o esquema dialélico parcial incompleto. Dissertation, Escola Superior de Agricultura "Luiz de Queiroz", Piracicaba.
  • Shi CH, Zhu J, Zang RC and Chen GL (1977) Genetic and heterosis analysis for cooking quality traits of Indica rice in different environments. Theor Appl Genet 95:294-300.
  • Vencovsky R and Barriga P (1992) Genética Biométrica no Fitomelhoramento. Sociedade Brasileira de Genética, Ribeirão Preto, 496 pp.
  • Correspondence to

    Giselle Mariano Lessa de Assis
    Universidade Federal de Viçosa
    Departamento de Zootecnia
    Secretaria de Genética e Melhoramento
    Av. P.H. Rolphs, 36571-000
    Campus Universitário Viçosa, MG, Brazil
    E-mail:
  • Publication Dates

    • Publication in this collection
      01 Sept 2004
    • Date of issue
      2004

    History

    • Received
      25 Mar 2003
    • Accepted
      16 Feb 2004
    Sociedade Brasileira de Genética Rua Cap. Adelmio Norberto da Silva, 736, 14025-670 Ribeirão Preto SP Brazil, Tel.: (55 16) 3911-4130 / Fax.: (55 16) 3621-3552 - Ribeirão Preto - SP - Brazil
    E-mail: editor@gmb.org.br