Communications in Information & Systems
- Commun. Inf. Syst.
- Volume 10, Number 2 (2010), 69-82.
Methods for Allocating Ambiguous Short-reads
With the rise in prominence of biological research using new short-read DNA sequencing technologies comes the need for new techniques for aligning and assigning these reads to their genomic location of origin. Until now, methods for allocating reads which align with equal or similar fidelity to multiple genomic locations have not been model-based, and have tended to ignore potentially informative data. Here, we demonstrate that existing methods for assigning ambiguous reads can produce biased results. We then present new methods for allocating ambiguous reads to the genome, developed within a framework of statistical modeling, which show promise in alleviating these biases, both in simulated and real data.
Commun. Inf. Syst. Volume 10, Number 2 (2010), 69-82.
First available in Project Euclid: 9 March 2010
Permanent link to this document
Zentralblatt MATH identifier
Taub, Margaret; Lipson, Doron; Speed, Terence P. Methods for Allocating Ambiguous Short-reads. Commun. Inf. Syst. 10 (2010), no. 2, 69--82. http://projecteuclid.org/euclid.cis/1268143264.