TY - UNPB
T1 - Data-driven chimney fire risk prediction using machine learning and point process tools
AU - Lu, Changqing
AU - van Lieshout, Marie-Colette
AU - de Graaf, Maurits
AU - Visscher, Paul
PY - 2021/12/14
Y1 - 2021/12/14
N2 - Chimney fires constitute one of the most commonly occurring fire types. Precise prediction and prompt prevention are crucial in reducing the harm they cause. In this paper, we develop a combined machine learning and statistical modeling process to predict chimney fires. Firstly, we use random forests and permutation importance techniques to identify the most informative explanatory variables. Secondly, we design a Poisson point process model and apply associated logistic regression estimation to estimate the parameters. Moreover, we validate the Poisson model assumption using second-order summary statistics and residuals. We implement the modeling process on data collected by the Twente Fire Brigade and obtain plausible predictions. Compared to similar studies, our approach has two advantages: i) with random forests, we can select explanatory variables non-parametrically considering variable dependence; ii) using logistic regression estimation, we can fit the statistical model efficiently by tuning it to focus on important regions and times of the fire data.
AB - Chimney fires constitute one of the most commonly occurring fire types. Precise prediction and prompt prevention are crucial in reducing the harm they cause. In this paper, we develop a combined machine learning and statistical modeling process to predict chimney fires. Firstly, we use random forests and permutation importance techniques to identify the most informative explanatory variables. Secondly, we design a Poisson point process model and apply associated logistic regression estimation to estimate the parameters. Moreover, we validate the Poisson model assumption using second-order summary statistics and residuals. We implement the modeling process on data collected by the Twente Fire Brigade and obtain plausible predictions. Compared to similar studies, our approach has two advantages: i) with random forests, we can select explanatory variables non-parametrically considering variable dependence; ii) using logistic regression estimation, we can fit the statistical model efficiently by tuning it to focus on important regions and times of the fire data.
KW - fire prediction
KW - spatio-temporal point pattern
KW - Poisson point process
KW - variable importance
KW - logistic regression
KW - pair correlation function
KW - K-function
U2 - 10.48550/arXiv.2112.07257
DO - 10.48550/arXiv.2112.07257
M3 - Preprint
BT - Data-driven chimney fire risk prediction using machine learning and point process tools
PB - ArXiv.org
ER -