Hardy–Weinberg Theorem

Secondary article Article Contents

Alan Hastings, University of California, Davis, California, USA

. What it States

The Hardy–Weinberg theorem states what the genotype frequencies are, in terms of the gene frequencies, at a single locus, under the simplest assumptions for the genetic processes in a diploid organism. It is essentially the cornerstone on which much of the theory of population genetics has been built.

. Example of Population in Hardy–Weinberg Equilibrium

What it States

. Interest and Importance of the Theorem

The Hardy–Weinberg theorem is named after Hardy and Weinberg who independently discovered it in 1908. The theorem states what the genotype frequencies are, in terms of the gene frequencies, at a single locus, under the simplest assumptions for the genetic processes in a diploid organism. These genotype frequencies are attained after one generation of random mating, under the assumption of no selection, no mutation, no random changes in frequencies and no migration. Moreover, the gene frequencies remain constant from generation to generation. The formula for the genotype frequencies is easiest to present in the two-allele case, although the theorem also holds with more alleles. Denote the frequency of allele A by p, and the frequency of allele a by q 5 1 2 p. Then, after a single generation of random mating, the genotype frequencies are as given in Table 1. A similar relationship will hold if there are three alleles at a single locus, which we could designate as A1, A2, A3. In this case, there would be six different genotypes, so the situation is a bit more complicated algebraically, although the underlying principles would be exactly the same. In fact, the theorem extends to an arbitrary number of alleles which we could designate as Ai, where the subscript denotes the allele. Assume that the frequency of the Ai allele is given by pi. Then the frequency of the AiAi homozygote genotype will be p2i , and the frequency of the AiAj heterozygote (with i and j different) will be given by 2pipj. One can also extend the Hardy–Weinberg equilibrium to a case of multiple loci. This is easiest to explain in the case of two loci and two alleles. Denote the alleles at the A locus by A and a, and the alleles at the B locus by B and b. Then there are four haplotypes, AB, Ab, aB and ab, and consequently ten genotypes, AB/AB, AB/Ab, AB/aB, AB/ ab, Ab/Ab, Ab/aB. Ab/ab, aB/aB, aB/ab and ab/ab. Another issue arises which we will not deal with here, namely linkage disequilibrium, or the nonrandom assortment of alleles at the different loci. However, the Hardy– Weinberg proportions still hold, and after one generation, the genotype frequencies can be determined from the haplotype frequencies. Since the Hardy–Weinberg law depends on knowing the allele frequencies, it is important to state that in any

. How You Can Tell Whether a Population is in Hardy– Weinberg Equilibrium . Assumptions, and How Much and In What Sense They Matter

generation, independent of whether the population is at Hardy–Weinberg equilibrium, the allele frequencies can be derived from the genotype frequencies. For the case of two alleles at a single locus, the frequency of the allele A is given by the sum of the frequency of the AA homozygotes plus one half the frequency of the Aa heterozygotes: p 5 pAA 1 pAa/2 For more than two alleles, the frequency of the allele A is given by the sum of the frequency of the AA homozygotes plus one half the frequency of all the heterozygotes that have an A allele.

Example of Population in Hardy– Weinberg Equilibrium If there are data providing genotype frequencies at a single locus, one can tell if the population is in Hardy–Weinberg equilibrium. Obviously, no natural population will be exactly in Hardy–Weinberg proportions, but in many instances the population will be close to Hardy–Weinberg equilibrium. An example of a hypothetical population in Hardy–Weinberg equilibrium is given in Table 2. A natural population will never be in perfect Hardy–Weinberg equilibrium, but in many cases the genotype frequencies are very close to the Hardy–Weinberg proportions.

How You Can Tell Whether a Population is in Hardy–Weinberg Equilibrium Deciding whether a population is in Hardy–Weinberg equilibrium involves a statistical test to determine the likelihood of a population with the observed allele frequencies having the observed genotype frequencies. Following the usual statistical procedures, one accepts the null hypothesis that the population is in Hardy–Weinberg equilibrium if the chance of a deviation from the Hardy– Weinberg proportions that is less than or equal to that

ENCYCLOPEDIA OF LIFE SCIENCES © 2001, John Wiley & Sons, Ltd. www.els.net

1

Hardy–Weinberg Theorem

Table 1 Genotype frequencies after one generation of random mating as given by the Hardy–Weinberg law Genotype Frequency

AA p2

Aa 2pq

aa q2

observed is greater than 0.05. If the probability is less than 0.05, then one would say that the population is not in Hardy–Weinberg equilibrium. In discussing these tests, it is useful to define the Hardy– Weinberg disequilibrium for the allele A, DA, as the frequency of the AA genotype minus the square of the frequency of the A allele: DA 5 pAA 2 (pA)2 Then the statistical question is to test the null hypothesis that DA 5 0. Written this way, we can use the test even if there are more than two alleles, but lumping together all the alleles other than A. The appropriate statistical test is outlined in Weir (1996). Two different approaches can be used. The simplest test begins by using the observed allele frequencies to compute the genotype frequencies, as given in Table 3. Then, the expected genotype numbers are calculated as the sample size times the expected frequencies. Then the w2 goodness of fit statistic is: 2A ¼

X genotypes

¼

ðnDA Þ2 ð2nDA Þ2 ðnDA Þ2 nD2A þ þ ¼ 2 2 np2 2npq nq2 p q

Table 2 Numbers of a hypothetical population in perfect Hardy–Weinberg equilibrium; the population size is 1000, and the frequency of the A allele is 0.6

2

AA p2 5 0.36 360

Aa 2pq 5 0.48 480

Genotype Observed number Expected number Observed 2 Expected

AA nAA np2 nDA

Aa nAa 2npq 2 2nDA

aa naa nq2 nDA

of summing either the probability of observing fewer homozygotes than were actually observed, or the probability of observing more homozygotes than were actually observed. We will give the formula in the case of two alleles, A and a, in a population of size n (so there are 2n alleles). For a given number of A alleles, nA, the conditional probability of observing x heterozygotes can be shown to be: PrðxjnA Þ ¼

n!nA !ð2n  nA Þ!2x ½ðnA  xÞ=2!x!½n  ðnA þ xÞ=2!ð2nÞ!

For the observed value of the frequency of the A allele and the given population size, this formula can be evaluated numerically for all possible values of the heterozygote frequency, x. The least likely outcomes that sum to a given rejection level form a rejection level of that size.

Assumptions, and How Much and In What Sense They Matter

ðObserved  ExpectedÞ2 Expected

This statistic is distributed as a w2 with one degree of freedom, and thus significance can be determined using standard tables. The reason that the test has one degree of freedom and not two, is that there are two constraints: the allele frequencies are given and the genotype frequencies must sum to one. There are problems with this approach, especially if numbers are small. Since the situation is quite simple, exact tests are available, based on computing exactly the probability of an observed set of genotype frequencies. This approach, which goes back to Fisher (1935) is summarized by Weir (1996). Conceptually the test consists

Genotype Frequency Numbers

Table 3 Test for Hardy–Weinberg

aa q2 5 0.16 160

The Hardy–Weinberg theorem rests upon a series of assumptions, namely that: . . . . . .

Generations are nonoverlapping. There is random mating. The population size is very large – effectively infinite. There is no migration. There is no mutation. There is no natural selection.

In addition, the model is phrased for diploid sexual organisms. Violation of any of these assumptions would mean that the theorem is not strictly true. It is recognized that the Hardy–Weinberg theorem applies only to idealized populations and that the assumptions will never be met exactly for any real population. Of more interest is how large the deviation from the Hardy–Weinberg proportions is if any particular assumption is not met. If generations are overlapping, then Hardy–Weinberg proportions are approached, but only asymptotically, rather than after a single generation, as is the case with nonoverlapping generations. The rate of approach is geometric, so that the theorem holds approximately after several generations, even if it is not strictly true. Similarly,

Hardy–Weinberg Theorem

for sex-linked loci, the Hardy–Weinberg proportions are only approached asymptotically and not in a single generation. Lack of random mating is perhaps the most important reason for deviations from Hardy–Weinberg proportions in natural populations. The most common reason for an observed population showing a deficiency of heterozygotes relative to the Hardy–Weinberg proportions is if two populations that do not interbreed freely are sampled together, a result known as the Wahlund effect. How this arises is easiest to understand in the case where it is most dramatic: where two populations that are essentially monomorphic for different alleles are sampled together. In this case the deficiency of heterozygotes can be very large, and the Wahlund effect is the most likely explanation for most observed deviations of natural populations from Hardy–Weinberg proportions. Consider the case where one population is entirely AA individuals and the other is all aa individuals. Then a sample of these two populations together will have no Aa individuals and will appear not to be in Hardy–Weinberg equilibrium. Thus it would be fallacious to attribute deviations from Hardy–Weinberg equilibrium to the action of selection without extensive further investigation. In a similar vein, other deviations from random mating, such as selfing, would also lead to deviations from Hardy– Weinberg equilibrium. The size of the deviation from Hardy–Weinberg proportions would depend on the degree of deviation from random mating. If the population size is small enough, then random effects will lead to deviations from the Hardy–Weinberg proportions. These effects will be very small, however, unless the population size is extremely small (much less than 50). If there is migration, input of alleles from an outside population, then there can be deviations from Hardy– Weinberg. The size of the deviation will depend on the rate of migration, but will typically be quite small. However, the Wahlund effect described above could be viewed as an effect of migration. Similarly, mutation is only likely to cause very small deviations from Hardy–Weinberg proportions. In theory, one way to detect natural selection would be to look for deviations from Hardy–Weinberg proportions. In practice, however, any deviation from Hardy–Weinberg proportions resulting from selection would be relatively small unless selection is very strong. Moreover, the deviations from Hardy–Wienberg proportions depend on the form of selection. Clearly, very strong selection could lead to large deviations from Hardy–Weinberg proportions, such as if all heterozygotes die. However, strong directional selection (favouring a single allele) may lead to no deviations from Hardy–Weinberg proportions. In summary, although all the assumptions listed are required for the strict truth of the Hardy–Weinberg theorem, the effect of nonrandom mating is the most likely

cause of large deviations from Hardy–Weinberg proportions. Very strong selection can also produce sizeable deviations from Hardy–Weinberg proportions, but these will only be large enough to be detected if selection is very strong and sample sizes are very large.

Interest and Importance of the Theorem The Hardy–Weinberg theorem is essentially the cornerstone on which much of the theory of population genetics has been built. It is thus of great historical importance. It also has a number of direct consequences of great import in population genetics. One of the most important questions in population genetics is understanding what maintains variability. The Hardy–Weinberg theorem shows, under the assumptions of the theorem, that variability will be maintained. Before the Hardy–Weinberg theorem was demonstrated, this result was not known, even though it seems so obvious today. Further work on understanding the dynamics of populations depends critically on the Hardy–Weinberg theorem. Since the Hardy–Weinberg proportions are obtained in one generation, the theorem has the consequence that population genetic questions can be described by the frequencies of alleles rather than the frequency of genotypes. Thus, if there are two alleles, only one variable, a single allele frequency, is needed to describe the genetic state of the population. (The other allele frequency can be obtained because the frequencies must sum to one.) If one followed genotypes, two variables (three frequencies minus one since the frequencies must sum to one) would be needed. In general, in a system with n alleles, use of the Hardy–Weinberg theorem would suggest that n 2 1 variables are needed, while there would be n(n 1 1)/2 2 1 variables needed to follow the genotype frequencies. This is a much larger number. Similarly, as we noted, the theorem applies even to systems with more than one locus. Once again, this allows for a great simplification in the description of these systems, so only frequencies of haplotypes, rather than genotypes, need be followed. This observation is essential for organizing large data sets in population genetics.

References Fisher RA (1935) The logic of inductive inference. Journal of the Royal Statistical Society 98: 39–54. Weir BS (1996) Genetic Data Analysis II. Sunderland, MA: Sinauer.

Further Reading Hardy GH (1908) Mendelian proportions in a mixed population. Science 28: 41–50.

3

Hardy–Weinberg Theorem

Hartl DL and Clark AG (1989) Principles of Population Genetics, 2nd edn. Sunderland, MA: Sinauer. Vithayasai C (1973) Exact critical values of the Hardy–Weinberg test statistic for two alleles. Community Statistics 1: 229–242.

4

Weinberg W (1908) On the demonstration of heredity in man. Translated by Boyer SH IV (1963). In: Papers on Human Genetics, pp. 4–15. Englewood Cliffs, NJ: Prentice-Hall.

Hardy–Weinberg Theorem

allele frequencies, it is important to state that in any ... If there are data providing genotype frequencies at a single .... Weir BS (1996) Genetic Data Analysis II.

252KB Sizes 2 Downloads 73 Views

Recommend Documents

Comparison Theorem
part), l'Hôpital's rule, and some other tools and the geometric intuition can be illustrated .... Ui(t) := 1. √fi(t0). Ji(t). (3.9) for i = 1, 2. Therefore, by the Jacobi equations and the ... Then define a map ϕ from the set of vector fields alo

Maximum Power Transfer Theorem
from the power source to the load is when the resistance ... Function generator on the Velleman oscilloscope that is used in a ... Th . http://www.vellemanusa.com ...

Squeeze Theorem Exercises.pdf
There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. Squeeze ...

WEDDERBURN'S FACTORIZATION THEOREM ... - Semantic Scholar
May 25, 2001 - Nrd|N : N −→ Z(N) is well defined, where Z(N) = F .... Let D1 = C((x1)) and define σ1 : D1 → D1 by the rule σ1(x1) = −x1. Now ... dia Math. Sci.

WEDDERBURN'S FACTORIZATION THEOREM ... - Semantic Scholar
May 25, 2001 - V. P. Platonov who developed a so-called reduced K-theory to compute SK1(D) .... be observed that v is a tame valuation and D = C and F = C.

Theorem of math.pdf
ሻ − ૚ࡼሺ. Page 3 of 11. Theorem of math.pdf. Theorem of math.pdf. Open. Extract. Open with. Sign In. Main menu. Displaying Theorem of math.pdf. Page 1 of 11.

First Welfare Theorem
S7. D8. 2. 8. S8. D9. 1. 9. S9. D10 0. 10 S10. Plus: Each person endowed with $10. 4 Conditions for Pareto Efficiency. Condition 1: Efficient Allocation of.

proof of bayes theorem pdf
Loading… Page 1. Whoops! There was a problem loading more pages. proof of bayes theorem pdf. proof of bayes theorem pdf. Open. Extract. Open with.

A STRUCTURE THEOREM FOR RATIONALIZABILITY ...
under which rti (ai) is a best reply for ti and margΘXT−i. (πti,rti (ai)) = κti . Define a mapping φti,rti (ai),m : Θ* → Θ* between the payoff functions by setting. (A.5).

COMPLEXITY OF BEZOUT'S THEOREM VII
References. [BFM96] Bernard Beauzamy, Jean-Louis Frot, and Christian Millour, Massively parallel com- putations on many-variable polynomials, Ann. Math.

Roth's theorem in the primes
I. Schrödinger equations, Geom. Funct. Anal. 3 (1993), 107–156. [6] ———, On triples in arithmetic progression, Geom. Funct. Anal. 9 (1999), 968–984. [7]. H. Davenport, Multiplicative Number Theory, Third edition, Grad. Texts Math. 74. Springer-Verlag

Harsanyi's Aggregation Theorem with Incomplete Preferences
... Investissements d'Ave- nir Program (ANR-10-LABX-93). .... Bi-utilitarianism aggregates two utility functions ui and vi for each individual i = 1, … , I, the former ...

A STRUCTURE THEOREM FOR RATIONALIZABILITY IN ... - STICERD
We show that in any game that is continuous at infinity, if a plan of action ai is rationalizable ... Following Chen, we will use the notation customary in incomplete ...

thevenin theorem examples pdf
There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. thevenin ...

The Fundamental Theorem of Calculus
Problem: Using the information above found in example 5, sketch a graph of ( ). g x . The solutions for example 5: SOLUTIONS: We use the modified diagram above. (a) To calculate. 0. 2. (0). ( ) g. f t dt. -. = ∫. , we note that A1 = A2 and that thi

Oates' Decentralization Theorem with Imperfect ...
Nov 26, 2013 - In our model, agents are heterogeneous so that their result does ...... Wildasin, D. E. (2006), “Global Competition for Mobile Resources: Impli-.

1. Fixed Point Theorem
Apr 11, 2016 - An affine combination of points ,…, in ℝ is. . = = + ⋯+ , which ,…, are coefficients satisfying ∑ = = . Definition 1.1 Affine Combination. 2. 3.

COMPLEXITY OF BEZOUT'S THEOREM VII
For any integer k ≥ 1 be denote by Ik the identity square matrix of size k. Lemma 1. Let k ≥ 1 and U ∈ Uk be a unitary matrix. Then, there exists a smooth path Ut ⊆ Uk, 0 ≤ t ≤ 1, such that U0 = Ik, U1 = U and. Length(Ut) ≤ π. √ k, w

LOGIC, GOEDEL'S THEOREM, RATIONALITY, AND ...
Colorado State University. Fort Collins, CO 80523- ... in one of the articles in the former book (p.77), the distinguished computer scientist J. Weizenbaum says “…

COMPLEXITY OF BEZOUT'S THEOREM VII
all systems f = (f1,...,fn) of homogeneous polynomials of respective degrees deg(fi) = di, 1 ≤ i ≤ n. .... The Theorem and corollaries above are a consequence of the two following tech- ...... Online First DOI 10.1007/s10208-005-0211-0. [BP06b].

COMPLEXITY OF BEZOUT'S THEOREM VI
gests using the condition number to define a Riemannian metric on the solution variety and to study the geodesics of this metric. Finding a geodesic is in itself ...

A STRUCTURE THEOREM FOR RATIONALIZABILITY IN ... - STICERD
particular, there, we have extensively discussed the meaning of perturbing interim ..... assumption that Bi (h), the set of moves each period, is finite restricts the ...

the harrington-kechris- louveau theorem
A global n-approximation p is (F, Y )-terminal if Ln+1(q, F, Y ) = ∅ .... Fix a decreasing sequence (Un)n∈ω of dense open subsets of Cc such that E ∩⋂ n∈ω.