June 2016 A stationary distribution associated to a set of laws whose initial states are grouped into classes. An application in genomics
Servet Martínez
Author Affiliations +
J. Appl. Probab. 53(2): 315-326 (June 2016).

Abstract

Let I be a finite set and S be a nonempty strict subset of I which is partitioned into classes, and let C(s) be the class containing sS. Let (Ps: sS) be a family of distributions on IN, where each Ps applies to sequences starting with the symbol s. To this family, we associate a class of distributions P(π) on IN which depends on a probability vector π. Our main results assume that, for each sS, Ps regenerates with distribution Ps' when it encounters s' ∈ SC(s). From semiregenerative theory, we determine a simple condition on π for P(π) to be time stationary. We give a similar result for the following more complex model. Once a symbol s' ∈ SC(s) has been encountered, there is a decision to be made: either a new region of type C(s') governed by Ps' starts or the region continues to be a C(s) region. This decision is modeled as a random event and its probability depends on s and s'. The aim in studying these kinds of models is to attain a deeper statistical understanding of bacterial DNA sequences. Here I is the set of codons and the classes (C(s): sS) identify codons that initiate similar genomic regions. In particular, there are two classes corresponding to the start and stop codons which delimit coding and noncoding regions in bacterial DNA sequences. In addition, the random decision to continue the current region or begin a new region of a different class reflects the well-known fact that not every appearance of a start codon marks the beginning of a new coding region.

Citation

Download Citation

Servet Martínez. "A stationary distribution associated to a set of laws whose initial states are grouped into classes. An application in genomics." J. Appl. Probab. 53 (2) 315 - 326, June 2016.

Information

Published: June 2016
First available in Project Euclid: 17 June 2016

zbMATH: 1344.60072
MathSciNet: MR3514280

Subjects:
Primary: 60J10 , 60J20 , 92D10 , 92D20

Keywords: genomics , Markov chain , Palm theory , Regenerative process , stationary distribution

Rights: Copyright © 2016 Applied Probability Trust

JOURNAL ARTICLE
12 PAGES

This article is only available to subscribers.
It is not available for individual sale.
+ SAVE TO MY LIBRARY

Vol.53 • No. 2 • June 2016
Back to Top