Open Access
December 2012 Phylogenetic mixtures: Concentration of measure in the large-tree limit
Elchanan Mossel, Sebastien Roch
Ann. Appl. Probab. 22(6): 2429-2459 (December 2012). DOI: 10.1214/11-AAP837

Abstract

The reconstruction of phylogenies from DNA or protein sequences is a major task of computational evolutionary biology. Common phenomena, notably variations in mutation rates across genomes and incongruences between gene lineage histories, often make it necessary to model molecular data as originating from a mixture of phylogenies. Such mixed models play an increasingly important role in practice.

Using concentration of measure techniques, we show that mixtures of large trees are typically identifiable. We also derive sequence-length requirements for high-probability reconstruction.

Citation

Download Citation

Elchanan Mossel. Sebastien Roch. "Phylogenetic mixtures: Concentration of measure in the large-tree limit." Ann. Appl. Probab. 22 (6) 2429 - 2459, December 2012. https://doi.org/10.1214/11-AAP837

Information

Published: December 2012
First available in Project Euclid: 23 November 2012

zbMATH: 1257.92037
MathSciNet: MR3024973
Digital Object Identifier: 10.1214/11-AAP837

Subjects:
Primary: 60K35
Secondary: 92D15

Keywords: concentration of measure , phylogenetic reconstruction , Random trees

Rights: Copyright © 2012 Institute of Mathematical Statistics

Vol.22 • No. 6 • December 2012
Back to Top