Abstract
Low-cost air pollution sensors, offering hyperlocal characterization of pollutant concentrations, are becoming increasingly prevalent in environmental and public health research. However, low-cost air pollution data can be noisy, biased by environmental conditions, and usually need to be field-calibrated by collocating low-cost sensors with reference-grade instruments. We show, theoretically and empirically, that the common procedure of regression-based calibration, using collocated data, systematically underestimates high air pollution concentrations, which are critical to diagnose from a health perspective. Current calibration practices also often fail to utilize the spatial correlation in pollutant concentrations. We propose a novel spatial filtering approach to collocation-based calibration of low-cost networks that mitigates the underestimation issue by using an inverse regression. The inverse regression also allows for incorporating spatial correlations by a second-stage model for the true pollutant concentrations using a conditional Gaussian process. Our approach works with one or more collocated sites in the network and is dynamic, leveraging spatial correlation with the latest available reference data. Through extensive simulations, we demonstrate how the spatial filtering substantially improves estimation of pollutant concentrations and measures peak concentrations with greater accuracy. We apply the methodology for calibration of a low-cost PM network in Baltimore, Maryland, and diagnose air pollution peaks that are missed by the regression-calibration.
Funding Statement
CH was partially supported by the Fonds de recherche du Québec–Nature et Technologies bourse de maîtrise B1X and partially supported by the National Science Foundation Graduate Research Fellowship Program under Grant No. DGE2139757. AD, RP, and KK were partially supported by National Institute of Environmental Health Sciences (NIEHS) grant R01 ES033739. AD was partially supported by National Science Foundation (NSF) Division of Mathematical Sciences grant DMS-1915803. KK, DRG, RP, AD, and CH acknowledge support from the assistance agreement no. RD835871 awarded by the U.S. Environmental Protection Agency to Yale University. It has not been formally reviewed by the EPA. The views expressed in this document are solely those of the authors and do not necessarily reflect those of the agency. The EPA does not endorse any products or commercial services mentioned in this publication. DRG acknowledges HKF Technology (a Kindwell Company) for also supporting the sensor development.
Acknowledgments
The authors would like to thank Colby Buehler (Yale) and Misti Levy Zamora (U. Conn.) for their contributions to the SEARCH network deployment.
Citation
Claire Heffernan. Roger Peng. Drew R. Gentner. Kirsten Koehler. Abhirup Datta. "A dynamic spatial filtering approach to mitigate underestimation bias in field calibrated low-cost sensor air pollution data." Ann. Appl. Stat. 17 (4) 3056 - 3087, December 2023. https://doi.org/10.1214/23-AOAS1751
Information