Source: J. Appl. Probab. Volume 47, Number 3
(2010), 732-751.
Recurrence equations for the number of types and the frequency of each type in
a random sample drawn from a finite population undergoing discrete,
nonoverlapping generations and reproducing according to the Cannings
exchangeable model are deduced under the assumption of a mutation scheme with
infinitely many types. The case of overlapping generations in discrete time is
also considered. The equations are developed for the Wright-Fisher model and
the Moran model, and extended to the case of the limit coalescent with
nonrecurrent mutation as the population size goes to ∞ and the mutation
rate to 0. Computations of the total variation distance for the distribution of
the number of types in the sample suggest that the exact Moran model provides a
better approximation for the sampling formula under the exact Wright-Fisher
model than the Ewens sampling formula in the limit of the Kingman coalescent
with nonrecurrent mutation. On the other hand, this model seems to provide a
good approximation for a Λ-coalescent with nonrecurrent mutation as long
as the probability of multiple mergers and the mutation rate are small enough.
References
Abramowitz, M. and Stegun, I. A. (eds) (1965). Handbook of Mathematical Functions. Dover, New York.
Cannings, C. (1974). The latent roots of certain Markov chains arising in genetics: a new approach. I. Haploid models. Adv. Appl. Prob. 6, 260--290.
Mathematical Reviews (MathSciNet):
MR343949
Dong, R., Gnedin, A. and Pitman, J. (2007). Exchangeable partitions derived from Markovian coalescents. Ann. Appl. Prob. 17, 1172--1201.
Eldon, B. and Wakeley, J. (2006). Coalescent processes when the distribution of offspring number among individuals is highly skewed. Genetics 172, 2621--2633.
Ewens, W. J. (1972). The sampling theory of selectively neutral alleles. Theoret. Pop. Biol. 3, 87--112.
Mathematical Reviews (MathSciNet):
MR325177
Ewens, W. J. (1990). Population genetics theory---the past and the future. In Mathematical and Statistical Developments of Evolutionary Theory, ed. S. Lessard, Kluwer, Dordrecht, pp. 177--227.
Ewens, W. J. (2004). Mathematical Population Genetics. I. 2nd edn. Springer, New York.
Fisher, R. A. (1930). The Genetical Theory of Natural Selection. Clarendon Press, Oxford.
Freund, F. and Möhle, M. (2009). On the number of allelic types for samples taken from exchangeable coalescents with mutation. Adv. Appl. Prob. 41, 1082--1101.
Fu, Y. X. (2006). Exact coalescent for the Wright--Fisher model. Theoret. Pop. Biol. 69, 385--394.
Griffiths, R. C. and Lessard, S. (2005). Ewens' sampling formula and related formulae: combinatorial proofs, extensions to variable population size and applications to ages of alleles. Theoret. Pop. Biol. 68, 167--177.
Hoppe, F. M. (1984). Pólya-like urns and the Ewens' sampling formula. J. Math. Biol. 20, 91--94.
Mathematical Reviews (MathSciNet):
MR758915
Hoppe, F. M. (1987). The sampling theory of neutral alleles and an urn model in population genetics. J. Math. Biol. 25, 123--159.
Mathematical Reviews (MathSciNet):
MR896430
Joyce, P. and Tavaré, S. (1987). Cycles, permutations and the structures of the Yule process with immigration. Stoch. Process. Appl. 25, 309--314.
Mathematical Reviews (MathSciNet):
MR915145
Karlin, S. and McGregor, J. (1972). Addendum to a paper of W. Ewens. Theoret. Pop. Biol. 3, 113--116.
Mathematical Reviews (MathSciNet):
MR325178
Kingman, J. F. C. (1982). The coalescent. Stoch. Process. Appl. 13, 235--248.
Mathematical Reviews (MathSciNet):
MR671034
Lessard, S. (2007). An exact sampling formula for the Wright--Fisher model and a solution to a conjecture about the finite-island model. Genetics 177, 1249--1254.
Lessard, S. (2009). Diffusion approximations for one-locus multi-allele kin selection, mutation and random drift in group-structured populations: a unifying approach to selection models in population genetics. J. Math. Biol. 59, 659--696.
Möhle, M. (2000). Total variation distances and rates of convergence for ancestral coalescent processes in exchangeable population models. Adv. Appl. Prob. 32, 983--993.
Möhle, M. (2004). The time back to the most recent common ancestor in exchangeable population models. Adv. Appl. Prob. 36, 78--97.
Möhle, M. (2006). On sampling distributions for coalescent processes with simultaneous multiple collisions. Bernoulli 12, 35--53.
Möhle, M. and Sagitov, S. (2001). A classification of coalescent processes for haploid exchangeable population models. Ann. Prob. 29, 1547--1562.
Moran, P. A. P. (1958). Random processes in genetics. Proc. Camb. Phil. Soc. 54, 60--71.
Mathematical Reviews (MathSciNet):
MR127989
Moran, P. A. P. (1959). The theory of some genetical effects of population subdivision. Austral. J. Biol. Sci. 12, 109--116.
Moran, P. A. P. (1962). The Statistical Processes of Evolutionary Theory. Clarendon Press, Oxford.
Pitman, J. (1999). Coalescents with multiple collisions. Ann. Prob. 27, 1870--1902.
Sagitov, S. (1999). The general coalescent with asynchronous mergers of ancestral lines. J. Appl. Prob. 36, 1116--1125.
Tavaré, S. (1989). The genealogy of the birth, death, and immigration process. In Mathematical Evolutionary Theory, ed. M. W. Feldman, Princeton University Press, pp. 41--56.
Trajstman, A. C. (1974). On a conjecture of G. A. Watterson. Adv. Appl. Prob. 6, 489--493.
Mathematical Reviews (MathSciNet):
MR351505
Wakeley, J. (2003). Polymorphism and divergence for island-model species. Genetics 163, 411--420.
Wakeley, J. and Takahashi, T. (2004). The many-demes limit for selection and drift in a subdivided population. Theoret. Pop. Biol. 66, 83--91.
Wright, S. (1931). Evolution in Mendelian populations. Genetics 16, 97--159.