Statistical Science

Sequence Comparison Significance and Poisson Approximation

Michael S. Waterman and Martin Vingron

Full-text: Open access

Abstract

The Chen-Stein method of Poisson approximation has been used to establish theorems about comparison of two DNA or protein sequences. The most useful result for sequence alignment applies to alignment scoring with no gaps. However, there has not been a valid method to assign statistical significance to alignment scores with gaps. In this paper we extend Poisson approximation techniques using the Aldous clumping heuristic to a practical method of estimating statistical significance.

Article information

Source
Statist. Sci. Volume 9, Number 3 (1994), 367-381.

Dates
First available in Project Euclid: 19 April 2007

Permanent link to this document
http://projecteuclid.org/euclid.ss/1177010382

Digital Object Identifier
doi:10.1214/ss/1177010382

Mathematical Reviews number (MathSciNet)
MR1325433

Zentralblatt MATH identifier
0955.92501

Keywords
Poisson approximation; clumping; dynamic programming; sequence comparison; sequence alignment; DNA; protein

Citation

Waterman, Michael S.; Vingron, Martin. Sequence Comparison Significance and Poisson Approximation. Statist. Sci. 9 (1994), no. 3, 367-381. doi:10.1214/ss/1177010382. http://projecteuclid.org/euclid.ss/1177010382.

