Open Access
March 2017 Bayesian nonhomogeneous Markov models via Pólya-Gamma data augmentation with applications to rainfall modeling
Tracy Holsclaw, Arthur M. Greene, Andrew W. Robertson, Padhraic Smyth
Ann. Appl. Stat. 11(1): 393-426 (March 2017). DOI: 10.1214/16-AOAS1009

Abstract

Discrete-time hidden Markov models are a broadly useful class of latent-variable models with applications in areas such as speech recognition, bioinformatics, and climate data analysis. It is common in practice to introduce temporal nonhomogeneity into such models by making the transition probabilities dependent on time-varying exogenous input variables via a multinomial logistic parametrization. We extend such models to introduce additional nonhomogeneity into the emission distribution using a generalized linear model (GLM), with data augmentation for sampling-based inference. However, the presence of the logistic function in the state transition model significantly complicates parameter inference for the overall model, particularly in a Bayesian context. To address this, we extend the recently-proposed Pólya-Gamma data augmentation approach to handle nonhomogeneous hidden Markov models (NHMMs), allowing the development of an efficient Markov chain Monte Carlo (MCMC) sampling scheme. We apply our model and inference scheme to 30 years of daily rainfall in India, leading to a number of insights into rainfall-related phenomena in the region. Our proposed approach allows for fully Bayesian analysis of relatively complex NHMMs on a scale that was not possible with previous methods. Software implementing the methods described in the paper is available via the R package NHMM.

Citation

Download Citation

Tracy Holsclaw. Arthur M. Greene. Andrew W. Robertson. Padhraic Smyth. "Bayesian nonhomogeneous Markov models via Pólya-Gamma data augmentation with applications to rainfall modeling." Ann. Appl. Stat. 11 (1) 393 - 426, March 2017. https://doi.org/10.1214/16-AOAS1009

Information

Received: 1 June 2016; Revised: 1 December 2016; Published: March 2017
First available in Project Euclid: 8 April 2017

zbMATH: 1366.62255
MathSciNet: MR3634329
Digital Object Identifier: 10.1214/16-AOAS1009

Keywords: multivariate time series , Nonhomogenous hidden Markov model , Pólya-Gamma latent variables , probit and logit link

Rights: Copyright © 2017 Institute of Mathematical Statistics

Vol.11 • No. 1 • March 2017
Back to Top