Abstract
Under the infinitely many sites mutation model, the mutational history of a sample of DNA sequences can be described by a unique gene tree. We show how to find the conditional distribution of the ages of the mutations and the time to the most recent common ancestor of the sample, given this gene tree. Explicit expressions for such distributions seem impossible to find for the sample sizes of interest in practice. We resort to a Monte Carlo method to approximate these distributions. We use this method to study the effects of variable population size and variable mutation rates, the distribution of the time to the most recent common ancestor of the population and the distribution of other functionals of the underlying coalescent process, conditional on the sample gene tree.
Citation
R. C. Griffiths, Simon Tavaré. "The ages of mutations in gene trees." Ann. Appl. Probab. 9 (3): 567-590, August 1999. https://doi.org/10.1214/aoap/1029962804
Information