The Annals of Applied Statistics

A bivariate space–time downscaler under space and time misalignment

Veronica J. Berrocal, Alan E. Gelfand, and David M. Holland

Full-text: Open access

Abstract

Ozone and fine particulate matter (PM2.5) are co-pollutants that have long been associated with increased public health risks. Information on concentration levels for both pollutants comes from two sources: monitoring sites and output from complex numerical models that produce concentration surfaces over large spatial regions. In this paper, we offer a fully model-based approach for fusing these two sources of information for the pair of co-pollutants that is computationally feasible over large spatial regions and long periods of time. Because concentration levels of the two contaminants are associated, information on one is expected to improve prediction of the other. Misalignment is an obvious issue: the monitoring networks for the two contaminants only partly intersect, and PM2.5 is typically collected less frequently than ozone.

Extending previous work in Berrocal, Gelfand and Holland (2010a), we introduce a bivariate downscaler that provides a flexible class of bivariate space–time assimilation models. We discuss computational issues for model fitting and analyze a dataset of ozone and PM2.5 for the 2002 ozone season. We show a modest improvement in predictive performance, which is not surprising in a setting where only a small gain can be anticipated.

Article information

Source
Ann. Appl. Stat., Volume 4, Number 4 (2010), 1942–1975.

Dates
First available in Project Euclid: 4 January 2011

Permanent link to this document
https://projecteuclid.org/euclid.aoas/1294167805

Digital Object Identifier
doi:10.1214/10-AOAS351

Mathematical Reviews number (MathSciNet)
MR2829942

Zentralblatt MATH identifier
1220.62148

Keywords
Co-kriging; coregionalization; dynamic model; kriging; multivariate spatial process; spatially varying coefficients

Citation

Berrocal, Veronica J.; Gelfand, Alan E.; Holland, David M. A bivariate space–time downscaler under space and time misalignment. Ann. Appl. Stat. 4 (2010), no. 4, 1942–1975. doi:10.1214/10-AOAS351. https://projecteuclid.org/euclid.aoas/1294167805


References

  • Apanasovich, T. and Genton, M. (2010). Cross-covariance functions for multivariate random fields based on latent dimensions. Biometrika 97 15–30.
  • Banerjee, S., Carlin, B. P. and Gelfand, A. E. (2004). Hierarchical Modeling and Analysis for Spatial Data. Chapman & Hall/CRC, Boca Raton.
  • Berrocal, V. J., Gelfand, A. E. and Holland, D. M. (2010a). A spatio-temporal downscaler for outputs from numerical models. J. Agric. Biol. Environ. Statist. 15 176–197.
  • Berrocal, V. J., Gelfand, A. E. and Holland, D. M. (2010b). Supplement to “A bivariate space–time downscaler under space and time misalignment.” DOI: 10.1214/10-AOAS351SUPP.
  • Braga, A. L. F., Zanobetti, A. and Schwartz, J. (2001). The lag structure between particulate air pollution and respiratory and cardiovascular deaths in ten US cities. J. Occup. Environ. Med. 43 927–933.
  • Brown, P. J., Le, N. D. and Zidek, J. V. (1994). Multivariate spatial interpolation and exposure to air pollutants. Canad. J. Statist. 22 489–509.
  • Byun, D. and Schere, K. L. (2006). Review of the governing equations, computational algorithms, and other components of the Models-3 Community Multiscale Air Quality (CMAQ) modeling system. Appl. Mech. Rev. 59 51–77.
  • Carroll, R. J., Chen, R., George, E. I., Li, T. H., Newton, H. J., Schmiediche, H. and Wang, N. (1997). Ozone exposure and population density in Harris County, Texas (with discussion). J. Amer. Statist. Assoc. 92 392–415.
  • Chilès, J.-P. and Delfiner, P. (1999). Geostatistics: Modeling Spatial Uncertainty. Wiley, New York.
  • Cressie, N. A. C. (1993). Statistics for Spatial Data. Wiley, New York.
  • Daley, R. (1993). Atmospheric Data Analysis. Cambridge Univ. Press, New York.
  • Davis, J. M. and Swall, J. L. (2006). An examination of the CMAQ simulations of the wet deposition of ammonium from a Bayesian perspective. Atmospheric Environment 40 4562–4573.
  • Dominici, F., Samet, J. M. and Zeger, S. L. (2000). Combining evidence on air pollution and daily mortality from the twenty largest US cities: A hierarchical modeling strategy (with discussion). J. Roy. Statist. Soc. Ser. A 163 263–302.
  • Dominici, F., Peng, R. D., Bell, M. L., Pham, L., McDermott, A., Zeger, S. L. and Samet, J. M. (2006). Fine particulate air pollution and hospital admission for cardiovascular and respiratory diseases. J. Amer. Med. Assoc. 295 1127–1134.
  • Foley, K. M. and Fuentes, M. (2008). A statistical framework to combine multivariate spatial data and physical models for hurricane wind prediction. J. Agric. Biol. Environ. Statist. 13 37–59.
  • Fuentes, M., Guttorp, P. and Challenor, P. (2003). Statistical assessment of numerical models. Int. Statist. Rev. 71 201–221.
  • Fuentes, M. and Raftery, A. E. (2005). Model evaluation and spatial interpolation by Bayesian combination of observations with outputs from numerical models. Biometrics 61 36–45.
  • Gelfand, A. E., Banerjee, S. and Gamerman, D. (2005). Spatial process modelling for univariate and multivariate dynamic spatial data. Environmetrics 16 465–479.
  • Gelfand, A. E. and Sahu, S. K. (2010). Combining monitoring data and computer model output in assessing environmental exposure. In Handbook of Applied Bayesian Analysis (A. O’Hagan and M. West, eds.) Chapter 19. Oxford Univ. Press.
  • Gelfand, A. E., Schmidt, A. M., Banerjee, S. and Sirmans, C. F. (2004). Nonstationary multivariate process modeling through spatially varying coregionalization. Test 13 263–312.
  • Gneiting, T., Kleiber, W. and Schlather, M. (2009). Matérn cross-covariance functions for multivariate random fields. Technical Report no. 549, Dept. Statistics, Univ. Washington.
  • Gneiting, T. and Raftery, A. E. (2007). Strictly proper scoring rules, prediction, and estimation. J. Amer. Statist. Assoc. 102 359–378.
  • Gotway, C. A. and Young, L. J. (2002). Combining incompatible spatial data. J. Amer. Statist. Assoc. 97 632–648.
  • Guillas, S., Bao, J., Choi, Y. and Wang, Y. (2008). Statistical correction and downscaling of chemical transport model ozone forecasts over Atlanta. Atmospheric Environment 42 1338–1348.
  • Harville, D. A. (1977). Maximum likelihood approaches to variance component estimation and to related problems. J. Amer. Statist. Assoc. 72 320–338.
  • Haslett, J. and Raftery, A. E. (1989). Space–time modelling with long-memory dependence: Assessing Ireland’s wind-power resource (with discussion). J. Roy. Statist. Soc. Ser. C 38 1–50.
  • Jun, M. and Stein, M. L. (2004). Statistical comparison of observed and CMAQ modeled daily sulfate levels. Atmospheric Environment 38 4427–4436.
  • Kalnay, E. (2002). Atmospheric Modeling, Data Assimilation and Predictability. Cambridge Univ. Press.
  • Kennedy, M. C. and O’Hagan, A. (2001). Bayesian calibration of computer models (with discussion). J. Roy. Statist. Soc. Ser. B 63 425–464.
  • Kibria, B. M. G., Sun, L., Zidek, J. V. and Le, N. D. (2002). Bayesian spatial prediction of random space–time fields with application to mapping PM2.5 exposure. J. Amer. Statist. Assoc. 97 112–124.
  • Le, N. D., Sun, W. and Zidek, J. V. (1997). Bayesian multivariate spatial interpolation with data missing by design. J. Roy. Statist. Soc. Ser. B 59 501–510.
  • Liu, Z., Le, N. D. and Zidek, J. V. (2007). An appraisal of Bayesian Melding for physical-statistical modeling. Technical Report no. 233, Dept. Statistics, Univ. British Columbia.
  • Liu, Z., Le, N. D. and Zidek, J. V. (2008). Combining measurements and physical model outputs for the spatial prediction of hourly ozone space–time fields. Technical Report no. 239, Dept. Statistics, Univ. British Columbia.
  • Majumdar, A. and Gelfand, A. E. (2007). Multivariate spatial modeling for geostatistical data using convolved covariance functions. Mathematical Geology 39 225–245.
  • McMillan, N. J., Holland, D. M., Morara, M. and Feng, J. (2010). Combining numerical model output and particulate data using Bayesian space–time modeling. Environmetrics 21 48–65.
  • Meiring, W., Sampson, P. D. and Guttorp, P. (1998). Space–time estimation of grid cell hourly ozone levels for assessment of a deterministic model. Environ. Ecol. Statist. 5 197–222.
  • Paciorek, C. and Liu, Y. (2009). Limitations of remotely sensed aerosol as a spatial proxy for fine particulate matter. Environmental Health Perspectives 117 904–909.
  • Patterson, H. D. and Thompson, R. (1971). Recovery of inter-block information when block sizes are unequal. Biometrika 58 545–554.
  • Poole, D. and Raftery, A. E. (2000). Inference for deterministic simulation models: The Bayesian melding approach. J. Amer. Statist. Assoc. 95 1244–1255.
  • Sahu, S. K., Gelfand, A. E. and Holland, D. M. (2006). Spatio-temporal modeling of fine particulate matter. J. Agric. Biol. Environ. Statist. 11 61–86.
  • Sahu, S. K., Gelfand, A. E. and Holland, D. M. (2007). High resolution space–time ozone modeling for assessing trends. J. Amer. Statist. Assoc. 102 1221–1234.
  • Sahu, S. K., Gelfand, A. E. and Holland, D. M. (2010). Fusing point and areal level space–time data with application to wet deposition. J. Roy. Statist. Soc. Ser. C 59 77–103.
  • Sahu, S. K. and Mardia, K. V. (2005). A Bayesian kriged-Kalman model for short-term forecasting of air pollution levels. J. Roy. Statist. Soc. Ser. C 54 223–244.
  • Schmidt, A. M. and Gelfand, A. E. (2003). A Bayesian coregionalization approach for multivariate pollutant data. J. Geophys. Res. 108 (D24) 8783, DOI: 10.1029/2002JD002905.
  • Schwartz, J. (1996). Air pollution and hospital admissions for respiratory diseases. Epidemiology 7 20–28.
  • Smith, B. J. and Cowles, M. K. (2007). Correlating point-referenced radon and areal uranium data arising from a common spatial process. J. Roy. Statist. Soc. Ser. C 56 313–326.
  • Smith, R. L., Kolenikov, S. and Cox, L. H. (2003). Spatio-temporal modeling of PM2.5 data with missing values. J. Geophys. Res. Atmospheres 108 (D24) 9004, DOI: 10.1029/2002JD002914.
  • Smith, R. L., Davis, J. M., Sacks, J., Speckman, P. and Styer, P. (2000). Regression models for air pollution and daily mortality: Analysis of data from Birmingham, Alabama. Environmetrics 11 719–743.
  • Swall, J. L. and Davis, J. M. (2006). A Bayesian statistical approach for the evaluation of CMAQ. Atmospheric Environment 40 4883–4893.
  • Swall, J. L. and Foley, K. M. (2009). The impact of spatial correlation and incommensurability on model evaluation. Atmospheric Environment 43 1204–1217.
  • Ver Hoef, J. M. and Barry, R. D. (1998). Constructing and fitting models for cokriging and multivariable spatial prediction. J. Statist. Plann. Inference 69 275–294.
  • Wackernagel, H. (2003). Multivariate Geostatistics: An Introduction With Applications, 3rd ed. Springer, Berlin.
  • West, M. and Harrison, J. (1999). Bayesian Forecasting and Dynamic Models, 2nd ed. Springer, New York.

Supplemental materials

  • Supplementary material: Fitting details. This section provides details for fitting the bivariate downscaler model. It first illustrates how to fit the general bivariate downscaler model in its static version, and then discusses how to adapt the model-fitting procedures from the static setting to the space–time setting.