Open Access
September 2017 Dynamic mixtures of factor analyzers to characterize multivariate air pollutant exposures
Antonello Maruotti, Jan Bulla, Francesco Lagona, Marco Picone, Francesca Martella
Ann. Appl. Stat. 11(3): 1617-1648 (September 2017). DOI: 10.1214/17-AOAS1049

Abstract

The assessment of pollution exposure is based on the analysis of a multivariate time series that include the concentrations of several pollutants as well as the measurements of multiple atmospheric variables. It typically requires methods of dimensionality reduction that are capable of identifying potentially dangerous combinations of pollutants and simultaneously segmenting exposure periods according to air quality conditions. When the data are high-dimensional, however, efficient methods of dimensionality reduction are challenging because of the formidable structure of cross-correlations that arise from the dynamic interaction between weather conditions and natural/anthropogenic pollution sources. In order to assess pollution exposure in an urban area while taking the above mentioned difficulties into account, we have developed a class of parsimonious hidden Markov models. In a multivariate time series setting, this approach simultaneously allows for the performance of temporal segmentation and dimensionality reduction. We specifically approximate the distribution of multiple pollutant concentrations by mixtures of factor analysis models, whose parameters evolve according to a latent Markov chain. Covariates are included as predictors of the chain transition probabilities. Parameter constraints on the factorial component of the model are exploited to tune the flexibility of dimensionality reduction. In order to estimate the model parameters efficiently, we have proposed a novel three-step Alternating Expected Conditional Maximization (AECM) algorithm, which is also assessed in a simulation study. In the case study, the proposed methods could (1) describe the exposure to pollution in terms of a few latent regimes, (2) associate these regimes with specific combinations of pollutant concentration levels as well as distinct correlation structures between concentrations, and (3) capture the influence of weather conditions on transitions between regimes.

Citation

Download Citation

Antonello Maruotti. Jan Bulla. Francesco Lagona. Marco Picone. Francesca Martella. "Dynamic mixtures of factor analyzers to characterize multivariate air pollutant exposures." Ann. Appl. Stat. 11 (3) 1617 - 1648, September 2017. https://doi.org/10.1214/17-AOAS1049

Information

Received: 1 November 2016; Revised: 1 March 2017; Published: September 2017
First available in Project Euclid: 5 October 2017

zbMATH: 1380.62265
MathSciNet: MR3709572
Digital Object Identifier: 10.1214/17-AOAS1049

Keywords: AECM algorithm , dimensionality reduction , Hidden Markov models , three-step algorithm

Rights: Copyright © 2017 Institute of Mathematical Statistics

Vol.11 • No. 3 • September 2017
Back to Top