The Annals of Statistics
- Ann. Statist.
- Volume 30, Number 1 (2002), 202-219.
On the bias in estimating genetic length and other quantities in simplex constrained models
The genetic distance between two loci on a chromosome is defined as the mean number of crossovers between the loci. The parameters of the crossover distribution are constrained by the parameters of the distribution of chiasmata. Ott (1996) derived the maximum likelihood estimator (MLE) of the parameters of the crossover distribution and the MLE of the mean. We demonstrate that the MLE of the mean is pointwise less than or equal to the empirical mean number of crossovers. It follows that the MLE of the mean is negatively biased. For small sample sizes the bias can be nonnegligible. We recommend reduced-bias estimators.
Generalizations to many other problems involving linear constraints on parameters are made. Included in the generalizations are a variety of problems involving simplex constraints as studied recently by Liu (2000).
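The pointwise inequality stated in the abstract can be illustrated numerically. The following is a minimal sketch, not the derivation of Ott (1996) or of the paper. It assumes the usual no-chromatid-interference model, which the abstract does not spell out: each chiasma yields a crossover on a given gamete independently with probability 1/2, so the crossover distribution p is a mixture p = Aq of Binomial(k, 1/2) distributions with chiasma probabilities q on the simplex. The constrained MLE is approximated with a generic EM iteration for mixture weights; the function names and the toy counts are hypothetical.

```python
from math import comb


def thinning_matrix(m):
    # A[j][k] = P(j crossovers | k chiasmata) = C(k, j) / 2**k
    # (assumed model: each chiasma becomes a crossover with probability 1/2)
    return [[comb(k, j) / 2 ** k if j <= k else 0.0 for k in range(m + 1)]
            for j in range(m + 1)]


def crossover_probs(A, q):
    # p = A q: crossover distribution implied by chiasma distribution q
    size = len(q)
    return [sum(A[j][k] * q[k] for k in range(size)) for j in range(size)]


def constrained_mle(counts, iters=2000):
    # EM updates for the mixture weights q; counts[j] is the number of
    # gametes observed with j crossovers, j = 0..m.
    m = len(counts) - 1
    n = sum(counts)
    A = thinning_matrix(m)
    q = [1.0 / (m + 1)] * (m + 1)  # start from the uniform distribution
    for _ in range(iters):
        p = crossover_probs(A, q)
        q = [q[k] * sum(counts[j] / n * A[j][k] / p[j]
                        for j in range(m + 1) if counts[j] > 0)
             for k in range(m + 1)]
    return q, crossover_probs(A, q)


# Hypothetical data: 10 gametes with 0, 1, 2 crossovers observed 2, 1, 7 times.
counts = [2, 1, 7]
q_hat, p_hat = constrained_mle(counts)

emp_mean = sum(j * c for j, c in enumerate(counts)) / sum(counts)
mle_mean = sum(j * pj for j, pj in enumerate(p_hat))

print("empirical mean:", emp_mean)
print("constrained MLE of the mean:", mle_mean)
```

In this toy sample the empirical frequency of two crossovers (0.7) exceeds the largest value the assumed model allows (0.25), so the constrained estimate of the mean falls strictly below the empirical mean of 1.5. When the empirical frequencies already satisfy the constraints, the two means coincide.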
First available in Project Euclid: 5 March 2002
Cohen, Arthur; Kemperman, J.H.B.; Sackrowitz, Harold. On the bias in estimating genetic length and other quantities in simplex constrained models. Ann. Statist. 30 (2002), no. 1, 202-219. doi:10.1214/aos/1015362190. https://projecteuclid.org/euclid.aos/1015362190