Open Access
November 2006 The mean, variance and limiting distribution of two statistics sensitive to phylogenetic tree balance
Michael G. B. Blum, Olivier François, Svante Janson
Ann. Appl. Probab. 16(4): 2195-2214 (November 2006). DOI: 10.1214/105051606000000547

Abstract

For two decades, the Colless index has been the most frequently used statistic for assessing the balance of phylogenetic trees. In this article, this statistic is studied under the Yule and uniform model of phylogenetic trees. The main tool of analysis is a coupling argument with another well-known index called the Sackin statistic. Asymptotics for the mean, variance and covariance of these two statistics are obtained, as well as their limiting joint distribution for large phylogenies. Under the Yule model, the limiting distribution arises as a solution of a functional fixed point equation. Under the uniform model, the limiting distribution is the Airy distribution. The cornerstone of this study is the fact that the probabilistic models for phylogenetic trees are strongly related to the random permutation and the Catalan models for binary search trees.

Citation

Download Citation

Michael G. B. Blum. Olivier François. Svante Janson. "The mean, variance and limiting distribution of two statistics sensitive to phylogenetic tree balance." Ann. Appl. Probab. 16 (4) 2195 - 2214, November 2006. https://doi.org/10.1214/105051606000000547

Information

Published: November 2006
First available in Project Euclid: 17 January 2007

zbMATH: 1124.05025
MathSciNet: MR2288718
Digital Object Identifier: 10.1214/105051606000000547

Subjects:
Primary: 05C05
Secondary: 60C05 , 60F05 , 92D15

Keywords: Airy distribution , Catalan trees , central limit theorem , contraction method , Random phylogenetic trees , shape statistics , Yule process

Rights: Copyright © 2006 Institute of Mathematical Statistics

Vol.16 • No. 4 • November 2006
Back to Top