Open Access
March 2020 A statistical analysis of noisy crowdsourced weather data
Arnab Chakraborty, Soumendra Nath Lahiri, Alyson Wilson
Ann. Appl. Stat. 14(1): 116-142 (March 2020). DOI: 10.1214/19-AOAS1290

Abstract

Spatial prediction of weather elements like temperature, precipitation, and barometric pressure are generally based on satellite imagery or data collected at ground stations. None of these data provide information at a more granular or “hyperlocal” resolution. On the other hand, crowdsourced weather data, which are captured by sensors installed on mobile devices and gathered by weather-related mobile apps like WeatherSignal and AccuWeather, can serve as potential data sources for analyzing environmental processes at a hyperlocal resolution. However, due to the low quality of the sensors and the nonlaboratory environment, the quality of the observations in crowdsourced data is compromised. This paper describes methods to improve hyperlocal spatial prediction using this varying-quality, noisy crowdsourced information. We introduce a reliability metric, namely Veracity Score (VS), to assess the quality of the crowdsourced observations using a coarser, but high-quality, reference data. A VS-based methodology to analyze noisy spatial data is proposed and evaluated through extensive simulations. The merits of the proposed approach are illustrated through case studies analyzing crowdsourced daily average ambient temperature readings for one day in the contiguous United States.

Citation

Download Citation

Arnab Chakraborty. Soumendra Nath Lahiri. Alyson Wilson. "A statistical analysis of noisy crowdsourced weather data." Ann. Appl. Stat. 14 (1) 116 - 142, March 2020. https://doi.org/10.1214/19-AOAS1290

Information

Received: 1 March 2019; Revised: 1 June 2019; Published: March 2020
First available in Project Euclid: 16 April 2020

zbMATH: 07200164
MathSciNet: MR4085086
Digital Object Identifier: 10.1214/19-AOAS1290

Keywords: Geostatistics , hyperlocal spatial prediction , robust kriging , Veracity score

Rights: Copyright © 2020 Institute of Mathematical Statistics

Vol.14 • No. 1 • March 2020
Back to Top