## The Annals of Applied Statistics

### Extending the latent multinomial model with complex error processes and dynamic Markov bases

#### Abstract

The latent multinomial model (LMM) of Link et al. [Biometrics 66 (2010) 178–185] provides a framework for modelling mark-recapture data with potential identification errors. Key is a Markov chain Monte Carlo (MCMC) scheme for sampling configurations of the latent counts of the true capture histories that could have generated the observed data. Assuming a linear map between the observed and latent counts, the MCMC algorithm uses vectors from a basis of the kernel to move between configurations of the latent data. Schofield and Bonner [Biometrics 71 (2015) 1070–1080] shows that this is sufficient for some models within the framework but that a larger set called a Markov basis is required when errors are more complex. We address two further challenges: (1) that models with complex error mechanisms may not fit within the LMM framework and (2) that Markov bases can be difficult to compute for studies of even moderate size. We extend the framework to model the capture/demographic and error processes separately and develop a new MCMC algorithm using dynamic Markov bases. Our work is motivated by a study of queen snakes (Regina septemvittata) and we use simulation to compare estimates of survival rates when snakes are marked with PIT tags which have perfect identification versus brands which are prone to error.

#### Article information

Source
Ann. Appl. Stat., Volume 10, Number 1 (2016), 246-263.

Dates
Revised: September 2015
First available in Project Euclid: 25 March 2016

Permanent link to this document
https://projecteuclid.org/euclid.aoas/1458909915

Digital Object Identifier
doi:10.1214/15-AOAS889

Mathematical Reviews number (MathSciNet)
MR3480495

Zentralblatt MATH identifier
06586144

#### Citation

Bonner, Simon J.; Schofield, Matthew R.; Noren, Patrik; Price, Steven J. Extending the latent multinomial model with complex error processes and dynamic Markov bases. Ann. Appl. Stat. 10 (2016), no. 1, 246--263. doi:10.1214/15-AOAS889. https://projecteuclid.org/euclid.aoas/1458909915

#### References

• Allender, M. C., Dreslik, M. J., Wylie, D. B., Wylie, S. J., Scott, J. W. and Phillips, C. A. (2013). Ongoing health assessment and prevalence of Chrysosporium in the Eastern Massasauga (Sistrurus catenatus catenatus). Copeia 1 97–102.
• Barker, R. J., Schofield, M. R., Wright, J. A., Frantz, A. C. and Stevens, C. (2014). Closed-population capture–recapture modeling of samples drawn one at a time. Biometrics 70 775–782.
• Bonner, S. J. and Holmberg, J. (2013). Mark-recapture with multiple, non-invasive marks. Biometrics 69 766–775.
• Bonner, S. J., Schofield, M. R., Noren, P. and Price, S. (2016). Supplement to “Extending the latent multinomial model with complex error processes and dynamic Markov bases.” DOI:10.1214/15-AOAS889SUPP.
• Cowen, L. and Schwarz, C. J. (2006). The Jolly–Seber model with tag loss. Biometrics 62 699–705.
• Diaconis, P. and Sturmfels, B. (1998). Algebraic algorithms for sampling from conditional distributions. Ann. Statist. 26 363–397.
• Dobra, A. (2012). Dynamic Markov bases. J. Comput. Graph. Statist. 21 496–517.
• Hemmecke, R., Hemmecke, R., Koeppe, M., Malkin, P. and Walter, M. (2013). User’s guide for 4ti2 version 1.6.
• Lebreton, J.-D., Burnham, K. P., Clobert, J. and Anderson, D. R. (1992). Modelling survival and testing biological hypotheses using marked animals: A unified approach with case studies. Ecological Monographs 62 67–118.
• Link, W. A., Yoshizaki, J., Bailey, L. L. and Pollock, K. H. (2010). Uncovering a latent multinomial: Analysis of mark-recapture data with misidentification. Biometrics 66 178–185.
• Lukacs, P. M. and Burnham, K. P. (2005). Estimating population size from DNA-based closed capture–recapture data incorporating genotyping error. J. Wildl. Manag. 69 396–403.
• McClintock, B. T., Conn, P. B., Alonso, R. S. and Crooks, K. R. (2013). Integrated modeling of bilateral photo-identification data in mark-recapture analyses. Ecology 94 1464–1471.
• McClintock, B. T., Hill, J. M., Fritz, L., Chumbley, K., Luxa, K. and Diefenbach, D. R. (2014). Mark-resight abundance estimation under incomplete identification of marked individuals. Methods in Ecology and Evolution 5 1294–1304.
• Morrison, T. A., Yoshizaki, J., Nichols, J. D. and Bolger, D. T. (2011). Estimating survival in photographic capture–recapture studies: Overcoming misidentification error. Methods in Ecology and Evolution 2 454–463.
• Roark, A. W. and Dorcas, M. E. (2000). Regional body temperature variation in corn snakes measured using temperature-sensitive passive integrated transponders. J. Herpetol. 34 481–485.
• Schofield, M. and Bonner, S. (2016). Connecting the latent multinomial. Biometrics. 17 1070–1080.
• Seber, G. A. F. (2002). The Estimation of Animal Abundance and Related Parameters, 2nd ed. The Blackburn Press, New Jersey, USA.
• Sleeman, J. (2013). Snake fungal disease in the United States Bulletin No. 2013-02, USGS National Wildlife Health Center.
• White, G. C. and Burnham, K. P. (1999). Program MARK: Survival estimation from populations of marked animals. Bird Study 46 120–139.
• Williams, B. K., Nichols, J. D. and Conroy, M. J. (2002). Analysis and Management of Animal Populations. Academic Press, San Diego, CA.
• Winne, C. T., Willson, J. D., Andrews, K. M. and Reed, R. N. (2006). Efficacy of marking snakes with disposable medical cautery units. Herpetological Review 37 52–54.
• Wright, J. A., Barker, R. J., Schofield, M. R., Frantz, A. C., Byrom, A. E. and Gleeson, D. M. (2009). Incorporating genotype uncertainty into mark-recapture-type models for estimating abundance using DNA samples. Biometrics 65 833–840.
• Yoshizaki, J., Brownie, C., Pollock, K. H. and Link, W. A. (2011). Modeling misidentification errors that result from use of genetic tags in capture–recapture studies. Environ. Ecol. Stat. 18 27–55.
• Yoshizaki, J., Pollock, K. H., Brownie, C. and Webster, R. A. (2012). Modeling misidentification errors in capture–recapture studies using photographic identification of evolving marks. Ecology 90 3–9.

#### Supplemental materials

• Supplement to “Extending the latent multinomial model with complex error processes and dynamic Markov bases”. Supplement A: Proof of convergence. Proof that the chains generated by Algorithm 2 converge to the correct distribution. Supplement B: Model $M_{t\alpha}$. Application of the extended framework with dynamic Markov bases to model $M_{t\alpha}$.