## Genetics and Molecular Biology

*On-line version* ISSN 1678-4685

### Genet. Mol. Biol. vol.30 no.3 São Paulo 2007

#### http://dx.doi.org/10.1590/S1415-47572007000400026

**EVOLUTIONARY GENETICS RESEARCH ARTICLE**

**On extending the Hardy-Weinberg law**

**Alan E. Stark**

Balgowlah NSW, Australia

**ABSTRACT**

This paper gives a general mating system for an autosomal locus with two alleles. The population reproduces in discrete and non-overlapping generations. The parental population, the same in both sexes, is arbitrary as is that of the offspring and the gene frequencies of the parents are maintained in the offspring. The system encompasses a number of special cases including the random mating model of Weinberg and Hardy. Thus it demonstrates, in the most general way possible, how genetic variation can be conserved in an indefinitely large population without invoking random mating or balancing selection. An important feature is that it provides a mating system which identifies when mating does and does not produce Hardy-Weinberg proportions among offspring.

**Key words:** Hardy-Weinberg law, non-random mating, general offspring distribution.

**Introduction**

This paper gives a general mating system for an autosomal locus with two alleles. The population reproduces in discrete and non-overlapping generations. The system encompasses a number of special cases including the random mating model of Weinberg (1908) and Hardy (1908). It also covers the formulations of Li (1988) and Stark (2005), who showed that Hardy-Weinberg (H-W) frequencies can be maintained in large populations with non-random mating. Furthermore, it subsumes the system of Stark (2006a), which demonstrates that Hardy-Weinberg proportions (HWP) can be attained in one round of non-random mating. It is more general than the last of these in that it produces an arbitrary distribution of genotypes in the offspring from an arbitrary distribution in the parents while maintaining the gene frequencies of the parents.

The next section defines the mating system. The following section demonstrates how it encompasses a number of special cases. The last section discusses the canonical representation of the model and includes some numerical examples.

**The general mating system**

Consider a population with respect to a single locus having alleles *A* and *B* with respective frequencies *q* and *p*, the same in males and females. Denote frequencies of genotypes *AA*, *AB* and *BB* among parents by *f*_{0}, *f*_{1} and *f*_{2} and among offspring by *g*_{0}, *g*_{1} and *g*_{2}. Table 1 gives a mating system in which reciprocal crosses have the same frequency so that the roles of males and females can be reversed without changing the model. The 3 × 3 matrix of cell frequencies will be denoted by [*f*_{ij}], *i* = 0, 1, 2; *j* = 0, 1, 2. Without loss of generality *q* is taken in the interval 0 < *q* ≤ 1/2. Since the elements of [*f*_{ij}] are non-negative there are constraints on the values of *F*, *G*, *s* and *t*.

Summing the elements of Table 1 by rows and columns shows that the parental genotypic frequencies are: *f*_{0} = *q*^{2} + *Fpq*, *f*_{1} = 2*pq* - 2*Fpq*, *f*_{2} = *p*^{2} + *Fpq*, *F* being Sewall Wright's *fixation index*. Thus the parental frequencies are in the most general form, defined by values of *q* and *F*. Making the usual assumptions (Mendelian segregation) it can be seen that the distribution of genotypes among offspring is *g*_{0} = *q*^{2} + *Gpq*, *g*_{1} = 2*pq* - 2*Gpq*, and *g*_{2} = *p*^{2} + *Gpq*. Because matrix [*f*_{ij}] is symmetric the distribution of genotypes among offspring can be calculated from:

$$g_0 = f_{00} + f_{01} + \tfrac{1}{4} f_{11}, \qquad g_1 = f_{01} + 2 f_{02} + \tfrac{1}{2} f_{11} + f_{12}, \qquad g_2 = \tfrac{1}{4} f_{11} + f_{12} + f_{22}.$$

Note that the gene frequencies among the offspring are identical to those of the parents. However the genotypic distribution among the offspring is arbitrary being determined by *G* which plays the same role in offspring as *F* does in parents. In particular, taking *G* = 0 gives Hardy-Weinberg proportions (HWP) among the offspring. A numerical illustration is given in Table 2 which is discussed in the final section. It is specified by *q* = 0.4, *F* = 1/6, *G* = 0, *s* = 0.05, *t* = 0.02 and *f*_{0} = 0.2, *f*_{1} = 0.4, *f*_{2} = 0.4. The distribution among offspring is *g*_{0} = 0.16, *g*_{1} = 0.48, and *g*_{2} = 0.36. Clearly mating is not random yet the offspring proportions are Hardy-Weinberg.
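The numerical illustration can be checked with a short script. Table 1 itself is not reproduced in this text, so the cell formulas in `mating_matrix` are an assumption: a reconstruction inferred from the relations stated elsewhere in the paper (*s* = 1/4 *f*_{11}, *t* = *f*_{01} - *Gpq*, the stated row sums and the stated offspring frequencies). The function names are hypothetical. This is a sketch under those assumptions, not the author's tabulation.

```python
from fractions import Fraction as Fr

def mating_matrix(q, F, G, s, t):
    """Symmetric 3x3 matrix [f_ij], genotype order AA, AB, BB.

    ASSUMPTION: cell formulas reconstructed from relations in the text,
    not copied from the paper's Table 1.
    """
    p = 1 - q
    pq = p * q
    f00 = q * q - s - t
    f01 = t + G * pq
    f02 = s + (F - G) * pq
    f11 = 4 * s
    f12 = 2 * pq * (1 - F) - f01 - f11      # completes the AB row sum
    f22 = p * p + F * pq - f02 - f12        # completes the BB row sum
    return [[f00, f01, f02], [f01, f11, f12], [f02, f12, f22]]

def offspring(m):
    """Offspring genotype frequencies under Mendelian segregation."""
    g0 = m[0][0] + m[0][1] + m[1][1] / 4
    g1 = m[0][1] + 2 * m[0][2] + m[1][1] / 2 + m[1][2]
    g2 = m[1][1] / 4 + m[1][2] + m[2][2]
    return g0, g1, g2

# Parameter values quoted in the text for Table 2
m = mating_matrix(Fr(2, 5), Fr(1, 6), Fr(0), Fr(1, 20), Fr(1, 50))
parents = [sum(row) for row in m]
print(parents)        # parental frequencies f0, f1, f2
print(offspring(m))   # offspring frequencies g0, g1, g2
```

With exact fractions, the parental frequencies come out as 1/5, 2/5, 2/5 and the offspring frequencies as 4/25, 12/25, 9/25, matching the values 0.2, 0.4, 0.4 and 0.16, 0.48, 0.36 quoted above.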

**Special cases**

Random mating is defined in Table 1 by putting *s* = 1/4 *f*_{1}^{2} and *t* = *f*_{0}*f*_{1}. The offspring are distributed in HWP so that *G* = 0 completes the specification.
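A quick way to see that these choices of *s* and *t* do give random mating is to confirm that every cell equals the product of its margins, *f*_{ij} = *f*_{i}*f*_{j}. The sketch below does this for one illustrative pair (*q*, *F*) chosen here. The cell formulas are an assumed reconstruction of Table 1 (which is not reproduced in this text), inferred from relations stated in the paper, and the function names are hypothetical.

```python
from fractions import Fraction as Fr

def mating_matrix(q, F, G, s, t):
    """ASSUMED reconstruction of Table 1 (order AA, AB, BB), symmetric."""
    p = 1 - q
    pq = p * q
    f00 = q * q - s - t
    f01 = t + G * pq
    f02 = s + (F - G) * pq
    f11 = 4 * s
    f12 = 2 * pq * (1 - F) - f01 - f11
    f22 = p * p + F * pq - f02 - f12
    return [[f00, f01, f02], [f01, f11, f12], [f02, f12, f22]]

def random_mating_params(q, F):
    """s = f1^2 / 4 and t = f0 * f1, as stated in the text."""
    p = 1 - q
    f0 = q * q + F * p * q
    f1 = 2 * p * q * (1 - F)
    return f1 * f1 / 4, f0 * f1

q, F = Fr(2, 5), Fr(1, 6)            # illustrative values, not from a table
s, t = random_mating_params(q, F)
m = mating_matrix(q, F, Fr(0), s, t)  # G = 0 completes the specification
f = [sum(row) for row in m]

# Random mating: each cell is the product of the marginal frequencies
is_product = all(m[i][j] == f[i] * f[j] for i in range(3) for j in range(3))
print(is_product)
```

Exact rational arithmetic is used so that the equality test is not affected by rounding.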

The mating system given by Li (1988) is reproduced in Table 3. Since both parents and offspring are distributed in HWP both *F* = 0 and *G* = 0. Li's parameters and those of Table 1 are related by *a* = *pq*^{2}(1 + *q*) - (*s* + *t*) and *b* = *s* - *p*^{2}*q*^{2}. In Li's model random mating is defined by the pair of conditions *a* = 0 and *b* = 0 so that *s* and *t* in Table 1 are then *s* = *p*^{2}*q*^{2} and *t* = 2*pq*^{3}.
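The step from *a* = 0, *b* = 0 to the stated values of *s* and *t* can be written out as follows (a short derivation added for illustration, using only *p* + *q* = 1 and the two relations just given):

```latex
\begin{aligned}
b = 0 &\;\Rightarrow\; s = p^{2}q^{2},\\
a = 0 &\;\Rightarrow\; t = pq^{2}(1+q) - s
        = pq^{2}(1 + q - p)
        = pq^{2}\,(2q)
        = 2pq^{3}.
\end{aligned}
```

Since *F* = 0 here, *f*_{0} = *q*^{2} and *f*_{1} = 2*pq*, so these values agree with the random mating conditions *s* = 1/4 *f*_{1}^{2} and *t* = *f*_{0}*f*_{1} given above.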

The model given by Stark (2006a) is obtained by taking *G* = 0. A particular case is obtained by taking *F* = 1/2 (*p* - *q*)/*p* and forcing *f*_{00} = 0 and *f*_{11} = 0. This case is given by Table 4 and considered further in the next section.
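For this particular case the parental frequencies simplify considerably. Substituting *F* = 1/2 (*p* - *q*)/*p* into the general expressions for *f*_{0}, *f*_{1} and *f*_{2} gives the following (a worked step added for illustration, using only *p* + *q* = 1):

```latex
\begin{aligned}
f_0 &= q^{2} + Fpq = q^{2} + \tfrac{1}{2}q(p-q) = \tfrac{1}{2}q(p+q) = \tfrac{1}{2}q,\\
f_1 &= 2pq(1-F) = 2pq - q(p-q) = q(p+q) = q,\\
f_2 &= 1 - f_0 - f_1 = \tfrac{1}{2}(2-3q).
\end{aligned}
```

The factor 2 - 3*q* is the same one that appears in the denominators of the canonical terms quoted for Table 4.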

**The canonical representation of Table 1**

It is instructive to examine [*f*_{ij}] through its canonical form

$$f_{ij} = f_i f_j\,(1 + \rho\, x_i x_j + \sigma\, y_i y_j). \qquad (1)$$

Formula (1) is a particular example of the representation of a discrete bivariate probability distribution which Lancaster (1969, p. 90) refers to as "Fisher's Identity". Denote the vector of values {*x*_{0}, *x*_{1}, *x*_{2}} by ***x*** and {*y*_{0}, *y*_{1}, *y*_{2}} by ***y***. Vectors ***x*** and ***y*** attribute two sets of values to the genotypes of the parents, the same for males and females. To simplify the exposition it helps to define some expressions involving the elements of [*f*_{ij}] and the parental genotypic frequencies *f*_{0}, *f*_{1} and *f*_{2}:

Next form the following quadratic in ν:

Solve the quadratic and designate the two solutions for ν as ρ and σ. Then ρ is the correlation of ***x*** in female parents with ***x*** in male parents and σ is the correlation of ***y*** in females with ***y*** in males.

Finally the vector ***x*** can be calculated by solving the set of equations

and the vector ***y*** from

Some modification of the solution to Eqs. (5) - (7) is necessary for special cases. For example the formulation given by Table 1 can include the cases *f*_{0} = 0 and *f*_{0} = *f*_{2} = 0.
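For numerical work, the two canonical correlations can also be obtained without the intermediate expressions. The sketch below is not the paper's Eqs. (5) - (7). It uses an equivalent linear-algebra fact: the standardized matrix with entries *f*_{ij}/√(*f*_{i}*f*_{j}) has eigenvalues 1 and the two canonical correlations, so their sum is its trace minus 1 and their product is its determinant, det[*f*_{ij}]/(*f*_{0}*f*_{1}*f*_{2}). The function name is hypothetical, and all three parental frequencies are assumed to be positive.

```python
import math

def canonical_correlations(m):
    """Two non-trivial canonical correlations of a symmetric 3x3 [f_ij].

    Assumes every marginal frequency f_i is positive.
    """
    f = [sum(row) for row in m]                      # marginal frequencies
    # Sum of the two roots: trace of standardized matrix minus trivial root 1
    total = sum(m[i][i] / f[i] for i in range(3)) - 1
    # Product of the two roots: det[f_ij] / (f0 * f1 * f2)
    det = (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
           - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
           + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))
    prod = det / (f[0] * f[1] * f[2])
    # Roots of v^2 - total*v + prod = 0; guard against tiny negative rounding
    disc = math.sqrt(max(0.0, total * total - 4 * prod))
    return (total + disc) / 2, (total - disc) / 2

# Random mating: cells are products of margins, so both correlations vanish
fr = [0.25, 0.5, 0.25]
hw = [[fr[i] * fr[j] for j in range(3)] for i in range(3)]
print(canonical_correlations(hw))

# Complete assortative mating: like mates only with like
assortative = [[0.25, 0, 0], [0, 0.5, 0], [0, 0, 0.25]]
print(canonical_correlations(assortative))
```

The function returns the larger root first; which root is labelled ρ and which σ is a matter of convention here.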

The solution of Eqs. (5) - (7) involves rather unwieldy algebraic expressions although solutions can be obtained for particular numerical examples. One root of (5) is zero if *W* = 0. Suppose this is ρ; then (1) reduces to

$$f_{ij} = f_i f_j\,(1 + \sigma\, y_i y_j). \qquad (8)$$

However, even this may not yield simple expressions. One case is that given by Stark (2006a) where the entries in (8) are defined by

and

Then, in Table 1, *G* = 0, *s* = 1/4 *f*_{1}^{2}(1 + σ*T*^{-1}) and *t* = *f*_{0}*f*_{1}(1 + σ*T*^{-1}*p*(*F* - 1)/(*q* + *Fp*)).

A special case of the preceding example is given in Table 4. The canonical form is expressed by ρ = 0, σ = -*q*(3 - 4*q*)/(2 - 3*q*), *y*_{0} = -τ, *y*_{1} = τ, *y*_{2} = -τ*q*/(2 - 3*q*), where τ = 1/√(-σ). These terms satisfy Eqs. (5) and (7).

Simplifying Li's model (Table 3) by putting *b* = *a* yields another example: then *F* = 0, *G* = 0; also *W* = 0, *X* = -2*apq*, *Y* = 2*pq*(*a* + *p*^{2}*q*^{2}), *s* = *p*^{2}*q*^{2} + *a*, *t* = 2*pq*^{3} - 2*a* and

Note that in this case the vector ***x*** is a set of *additive* values, that is with the property *x*_{2} - *x*_{1} = *x*_{1} - *x*_{0}, as pointed out by Stark (2006b), and the set ***y*** is that given by Stark (2005). Since ρ = 0 the elements in [*f*_{ij}] are obtained from Eq. (8).

Another system is defined by *f*_{ij} = *f*_{i}*f*_{j}(1 + ρ*x*_{i}*x*_{j}), where *x*_{0} = -2*pV*^{-1/2}, *x*_{1} = (*q* - *p*)*V*^{-1/2}, *x*_{2} = 2*qV*^{-1/2} and *V* = 2*pq*(1 + *F*). This model was given by Stark (1976a, 1976b). It has the property that if ρ is fixed at value 2*F*/(1 + *F*) then the parental distribution characterized by *q* and *F* is reproduced in the offspring, that is *G* = *F*. Again ***x*** is additive and the correlation between mates based on ***x*** is ρ = 2*F*/(1 + *F*). In the notation of Table 1, *s* = 1/4 *f*_{11} and *t* = *f*_{01} - *Gpq* = *f*_{01} - *Fpq*.
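The self-reproducing property of this system (*G* = *F*) can be checked numerically. The sketch below builds the matrix directly from the formula quoted in the text and applies Mendelian segregation. The values of *q* and *F* are illustrative choices made here, and the function names are hypothetical. Because only the products *x*_{i}*x*_{j} enter, the square root of *V* never has to be taken and exact fractions can be used.

```python
from fractions import Fraction as Fr

def stark_1976_matrix(q, F, rho):
    """f_ij = f_i f_j (1 + rho x_i x_j) with the additive values x from the text."""
    p = 1 - q
    f = [q * q + F * p * q, 2 * p * q * (1 - F), p * p + F * p * q]
    V = 2 * p * q * (1 + F)
    a = [-2 * p, q - p, 2 * q]          # a_i = x_i * sqrt(V), so x_i x_j = a_i a_j / V
    return [[f[i] * f[j] * (1 + rho * a[i] * a[j] / V) for j in range(3)]
            for i in range(3)]

def offspring(m):
    """Offspring genotype frequencies under Mendelian segregation."""
    g0 = m[0][0] + m[0][1] + m[1][1] / 4
    g1 = m[0][1] + 2 * m[0][2] + m[1][1] / 2 + m[1][2]
    g2 = m[1][1] / 4 + m[1][2] + m[2][2]
    return g0, g1, g2

q, F = Fr(2, 5), Fr(1, 6)               # illustrative values
rho = 2 * F / (1 + F)                   # correlation between mates, 2F/(1+F)
m = stark_1976_matrix(q, F, rho)
parents = [sum(row) for row in m]
children = list(offspring(m))
print(parents == children)              # parental distribution is reproduced
```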

Table 2 was introduced earlier. Its canonical form is:

Table 5 contains the numerical example defined by *q* = 1/3, *F* = 1/4, *G* = -1/4, *s* = 1/18 and *t* = 1/18. The distribution of parental types is *f*_{0} = 3/18, *f*_{1} = 6/18, *f*_{2} = 9/18 and the distribution among offspring is *g*_{0} = 1/18, *g*_{1} = 10/18, and *g*_{2} = 7/18. The terms to be substituted in formula (1) are as follows:

where *u* = √(73 + 3√73)/√146 and ν = √(73 - 3√73)/√146.
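The frequencies quoted for Table 5 can be checked in the same spirit. As before, Table 1 is not reproduced in this text, so the cell formulas below are an assumed reconstruction inferred from relations stated in the paper. The canonical correlations are computed from the trace and determinant of the standardized matrix rather than from the paper's own equations, and the function names are hypothetical.

```python
from fractions import Fraction as Fr
import math

def mating_matrix(q, F, G, s, t):
    """ASSUMED reconstruction of Table 1 (order AA, AB, BB), symmetric."""
    p = 1 - q
    pq = p * q
    f00 = q * q - s - t
    f01 = t + G * pq
    f02 = s + (F - G) * pq
    f11 = 4 * s
    f12 = 2 * pq * (1 - F) - f01 - f11
    f22 = p * p + F * pq - f02 - f12
    return [[f00, f01, f02], [f01, f11, f12], [f02, f12, f22]]

def offspring(m):
    """Offspring genotype frequencies under Mendelian segregation."""
    return (m[0][0] + m[0][1] + m[1][1] / 4,
            m[0][1] + 2 * m[0][2] + m[1][1] / 2 + m[1][2],
            m[1][1] / 4 + m[1][2] + m[2][2])

# Parameter values quoted in the text for Table 5
m = mating_matrix(Fr(1, 3), Fr(1, 4), Fr(-1, 4), Fr(1, 18), Fr(1, 18))
f = [sum(row) for row in m]
g = offspring(m)

# Sum and product of the two canonical correlations (exact fractions)
root_sum = sum(m[i][i] / f[i] for i in range(3)) - 1
det = (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
       - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
       + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))
root_prod = det / (f[0] * f[1] * f[2])
disc = root_sum ** 2 - 4 * root_prod
roots = [(float(root_sum) + sign * math.sqrt(disc)) / 2 for sign in (1, -1)]
print(f, g, root_sum, root_prod, roots)
```

Under this reconstruction the parental and offspring frequencies agree with those quoted, and the two correlations satisfy 9ν^{2} - ν - 2 = 0, that is ν = (1 ± √73)/18. The appearance of √73 is consistent with the expressions quoted above for *u* and ν.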

The preceding examples show that the mating system given in Table 1 is a general model which conserves genetic variation but allows genotypic distributions which are not exclusively in Hardy-Weinberg form. In fact it provides a mating system which identifies when mating does and does not produce Hardy-Weinberg proportions among offspring.

**References**

Hardy GH (1908) Mendelian proportions in a mixed population. Science 28:49-50.

Lancaster HO (1969) The Chi-Squared Distribution. John Wiley & Sons, Inc., New York, 356 pp.

Li CC (1988) Pseudo-random mating populations. In celebration of the 80th anniversary of the Hardy-Weinberg law. Genetics 119:731-737.

Stark AE (1976a) Generalisation of the Hardy-Weinberg law. Nature 259:44-44.

Stark AE (1976b) Hardy-Weinberg law: Asymptotic approach to a generalized form. Science 193:1141-1142.

Stark AE (2005) The Hardy-Weinberg principle. Genet Mol Biol 28:485-485.

Stark AE (2006a) A clarification of the Hardy-Weinberg law. Genetics 174:1695-1697.

Stark AE (2006b) Stages in the evolution of the Hardy-Weinberg law. Genet Mol Biol 29:589-594.

Weinberg W (1908) Über den Nachweis der Vererbung beim Menschen. Jahresh Verein f vaterl Naturk Württem 64:368-382. English version: On the demonstration of heredity in Man. In: Boyer SH (ed) Papers on Human Genetics. Prentice-Hall, Englewood Cliffs, 1963, pp 4-15.

**Send correspondence to:**

Alan E. Stark

3/20 Seaview Street

Balgowlah NSW, Australia 2093

E-mail: alans@exemail.com.au

Received: December 15, 2006; Accepted: April 23, 2007.

Associate Editor: Paulo A. Otto