Abstract
Chimney fires constitute one of the most commonly occurring fire types. Precise prediction and prompt prevention are crucial in reducing the harm they cause. In this paper we develop a combined machine learning and statistical modelling process to predict fire risk. First, we use random forests and permutation importance techniques to identify the most informative explanatory variables. Second, we design a Poisson point process model and employ logistic regression estimation to estimate the parameters. Moreover, we validate the Poisson model assumption using second-order summary statistics and residuals. We implement the modelling process on data collected by the Twente Fire Brigade and obtain plausible predictions. Compared to similar studies, our approach has two advantages: (i) with random forests, we can select explanatory variables nonparametrically considering variable dependence; (ii) using logistic regression estimation, we can fit our statistical model efficiently by tuning it to focus on regions and times that are salient for fire risk.
Funding Statement
This work was funded by the Dutch Research Council (NWO) for the project “Data Driven Risk Management for Fire Services” (18004).
Acknowledgments
We thank the user committee for their valuable input. We also thank Emiel Borggreve, Niels Peters and Etienne Mulder from the Twente Fire Brigade for their help with data cleaning and preprocessing. Marie-Colette van Lieshout is also affiliated with University of Twente. Maurits de graaf is also affiliated with Centrum Winskunde & Informatica. Furthermore, when the project was conducted, Maurits de Graaf was affiliated with University of Twente as well.
Citation
Changqing Lu. Marie-Colette van Lieshout. Maurits de Graaf. Paul Visscher. "Data-driven chimney fire risk prediction using machine learning and point process tools." Ann. Appl. Stat. 17 (4) 3088 - 3111, December 2023. https://doi.org/10.1214/23-AOAS1752
Information