SciELO - Scientific Electronic Library Online

vol.30 issue3Clastogenicity of Piper cubeba (Piperaceae) seed extract in an in vivo mammalian cell systemGenetic divergence between populations of the stingless bee uruçu amarela (Melipona rufiventris group, Hymenoptera, Meliponini): is there a new Melipona species in the Brazilian state of Minas Gerais? author indexsubject indexarticles search
Home Pagealphabetic serial listing  

Genetics and Molecular Biology

On-line version ISSN 1678-4685

Genet. Mol. Biol. vol.30 no.3 São Paulo  2007 



On extending the Hardy-Weinberg law



Alan E. Stark

Balgowlah NSW, Australia

Send correspondence to




This paper gives a general mating system for an autosomal locus with two alleles. The population reproduces in discrete and non-overlapping generations. The parental population, the same in both sexes, is arbitrary as is that of the offspring and the gene frequencies of the parents are maintained in the offspring. The system encompasses a number of special cases including the random mating model of Weinberg and Hardy. Thus it demonstrates, in the most general way possible, how genetic variation can be conserved in an indefinitely large population without invoking random mating or balancing selection. An important feature is that it provides a mating system which identifies when mating does and does not produce Hardy-Weinberg proportions among offspring.

Key words: Hardy-Weinberg law, non-random mating, general offspring distribution.




This paper gives a general mating system for an autosomal locus with two alleles. The population reproduces in discrete and non-overlapping generations. The system encompasses a number of special cases including the random mating model of Weinberg (1908) and Hardy (1908). It covers also the formulation of Li (1988) and Stark (2005) who showed that Hardy-Weinberg (H-W) frequencies can be maintained in large populations with non-random mating. Furthermore it subsumes the system of Stark (2006a) which demonstrates that Hardy-Weinberg proportions (HWP) can be attained in one round of non-random mating. It is more general than the last of these in that it produces an arbitrary distribution of genotypes in the offspring from an arbitrary distribution in the parents while maintaining the gene frequencies of the parents.

The next section defines the mating system. The following section demonstrates how it encompasses a number of special cases. The last section discusses the canonical representation of the model and includes some numerical examples.

The general mating system

Consider a population with respect to a single locus having alleles A and B with respective frequencies q and p, the same in males and females. Denote frequencies of genotypes AA, AB and BB among parents by f0 , f1 and f2 and among offspring by g0 , g1 and g2. Table 1 gives a mating system in which reciprocal crosses have the same frequency so that the roles of males and females can be reversed without changing the model. The 3 ´ 3 matrix of cell frequencies will be denoted by [fij] , i = 0, 1, 2; j = 0, 1, 2. Without loss of generality q is taken in the interval 0 < q < 1/2. Since the elements of [fij] are non-negative there are constraints on the values of F, G, s and t.



Summing the elements of Table 1 by rows and columns shows that the parental genotypic frequencies are: f0 = q2 + Fpq, f1 = 2pq - 2Fpq, f2 = p2 + Fpq, F being Sewall Wright's fixation index Thus the parental frequencies are in the most general form, defined by values of q and F. Making the usual assumptions it can be seen that the distribution of genotypes among offspring is g0 = q2 + Gpq, g1 = 2pq - 2Gpq, and g2 = p2 + Gpq. Because matrix [fij] is symmetric the distribution of genotypes among offspring can be calculated from:

Note that the gene frequencies among the offspring are identical to those of the parents. However the genotypic distribution among the offspring is arbitrary being determined by G which plays the same role in offspring as F does in parents. In particular, taking G = 0 gives Hardy-Weinberg proportions (HWP) among the offspring. A numerical illustration is given in Table 2 which is discussed in the final section. It is specified by q = 0.4, F = 1/6, G = 0, s = 0.05, t = 0.02 and f0 = 0.2, f1 = 0.4, f2 = 0.4. The distribution among offspring is g0 = 0.16, g1 = 0.48, and g2 = 0.36. Clearly mating is not random yet the offspring proportions are Hardy-Weinberg.



Special cases

Random mating is defined in Table 1 by putting s = 1/4 f12 and t = f0f1. The offspring are distributed in HWP so that G = 0 completes the specification.

The mating system given by Li (1988) is reproduced in Table 3. Since both parents and offspring are distributed in HWP both F = 0 and G = 0. Li's parameters and those of Table 1 are related by a = pq2(1 + q) - (s + t) and b = s - p2q2. In Li's model random mating is defined by the pair of conditions a = 0 and b = 0 so that s and t in Table 1 are then s = p2q2 and t = 2pq3.



The model given by Stark (2006a) is obtained by taking G = 0. A particular case is obtained by taking F = 1/2 (p - q)/p and forcing f00 = 0 and f11 = 0. This case is given by Table 4 and considered further in the next section.



The canonical representation of Table 1

It is instructive to examine [fij] through its canonical form

Formula (1) is a particular example of the representation of a discrete bivariate probability distribution which Lancaster (1969, p. 90) refers to as "Fisher's Identity". Denote the vector of values {x0, x1, x2} by x and {y0, y1, y2} by y. Vectors x and y attribute two sets of values to the genotypes of the parents, the same for males and females. To simplify the exposition it helps to define some expressions involving the elements of [fij] and the parental genotypic frequencies f0, f1 and f2:

Next form the following quadratic in n:

Solve the quadratic and designate the two solutions of n as r and s. Then r is the correlation of x in female parents with x in male parents and s is the correlation of y in females with y in males.

Finally the vector x can be calculated by solving the set of equations

and the vector y from

Some modification of the solution to Eqs. (5) - (7) is necessary for special cases. For example the formulation given by Table 1 can include the cases f0 = 0 and f0 = f2 = 0.

The solution of Eqs. (5) - (7) involves rather unwieldy algebraic expressions although solutions can be obtained for particular numerical examples. One root of (5) is zero if W = 0. Suppose this is r, then (1) reduces to

However, even this may not yield simple expressions. One case is that given by Stark (2006a) where the entries in (8) are defined by


Then, in Table 1, G = 0, s = 1/4 f12(1 + sT-1) and t = f0f1(1 + sT-1p(F - 1)/(q + Fp)).

A special case of the preceding example is given in Table 4. The canonical form is expressed by r = 0, s = -q(3-4q)/(2-3q), y0 = - t, y1 = t , y2 = -tq/(2-3q), where t = 1/Ö(-s). These terms satisfy Eqs. (5) and (7).

Simplifying Li's model (Table 3) by putting b = a yields another example: then F = 0, G = 0; also W = 0, X = -2apq, Y = 2pq(a + p2q2), s = p2q2 + a, t = 2pq3 - 2a and

Note that in this case the vector x is a set of additive values, that is with the property x2 - x1 = x1 - x0, as pointed out by Stark (2006b), and the set y is that given by Stark (2005). Since r = 0 the elements in [fij] are obtained from Eq. (8).

Another system is defined by fij = fifj(1 + rxixj), where x0 = -2pV -1/2, x1 = (q - p)V -1/2, x2 = 2qV -1/2 and V = 2pq(1 + F). This model was given by Stark (1976a, 1976b). It has the property that if r is fixed at value 2F/(1 + F) then the parental distribution characterized by q and F is reproduced in the offspring, that is G = F. Again x is additive and the correlation between mates based on x is r = 2F/(1 + F). In the notation of Table 1, s = 1/4f11 and t = f01 - Gpq = f01 - Fpq.

Table 2 was introduced earlier. Its canonical form is:

Table 5 contains the numerical example defined by q = 1/3, F = 1/4, G = -1/4, s = 1/18 and t = 1/18. The distribution of parental types is f0 = 3/18, f1 = 6/18, f2 = 9/18 and the distribution among offspring is g0 = 1/18, g1 = 10/18, and g2 = 7/18. The terms to be substituted in formula (1) are as follows:

where u = Ö(73+3Ö73)/Ö146 and n = Ö(73-3Ö73)Ö146.



The preceding examples show that the mating system given in Table 1 is a general model which conserves genetic variation but allows genotypic distributions which are not exclusively in Hardy-Weinberg form. In fact it provides a mating system which identifies when mating does and does not produce Hardy-Weinberg proportions among offspring.



Hardy GH (1908) Mendelian proportions in a mixed population. Science 28:49-50.        [ Links ]

Lancaster HO (1969) The Chi-Squared Distribution. John Wiley & Sons, Inc., New York, 356 pp.        [ Links ]

Li CC (1988) Pseudo-random mating populations. In celebration of the 80th anniversary of the Hardy-Weinberg law. Genetics 119:731-737.        [ Links ]

Stark AE (1976a) Generalisation of the Hardy-Weinberg law. Nature 259:44-44.        [ Links ]

Stark AE (1976b) Hardy-Weinberg law: Asymptotic approach to a generalized form. Science 193:1141-1142.        [ Links ]

Stark AE (2005) The Hardy-Weinberg principle. Genet Mol Biol 28:485-485.        [ Links ]

Stark AE (2006a) A clarification of the Hardy-Weinberg law. Genetics 174:1695-1697.        [ Links ]

Stark AE (2006b) Stages in the evolution of the Hardy-Weinberg law. Genet Mol Biol 29:589-594.        [ Links ]

Weinberg W (1908) Über den Nachweis der Vererbung beim Menschen. Jahresh Verein f vaterl Naturk Württem 64:368-382.         [ Links ]English version: On the demonstration of heredity in Man. In: Boyer SH (ed) Papers on Human Genetics. Prentice-Hall, Englewood Cliffs, 1963, pp 4-15.        [ Links ]



Send correspondence to:
Alan E. Stark
3/20 Seaview Street
Balgowlah NSW, Australia 2093



Received: December 15, 2006; Accepted: April 23, 2007.



Associate Editor: Paulo A. Otto

All the contents of this site, except where otherwise noted, is licensed under a Creative Commons Attribution License.