When is variable importance estimation in species distribution modelling affected by spatial correlation?

Nivedita Harisena, T.A. Groen, A.G. Toxopeus, Babak Naimi

Research output: Contribution to journalArticleAcademicpeer-review

5 Downloads (Pure)

Abstract

Species distribution models are generic empirical techniques that have a number of applications. One of these applications is to determine which environmental conditions are most important for a species. The calculation of this variable importance depends on a number of assumptions, including that the observations that are used to estimate the models are independent of each other. Spatial autocorrelation, which is a common feature most environmental factors confounds this assumption. Besides, many species distribution models are trained using a number of explanatory variables that have different levels of spatial autocorrelation. In this study we quantified the effects of differences in spatial autocorrelation in explanatory variables and the type of species responses to environmental gradients on variable importance estimations in species distribution models. We simulated data for both environmental predictors and species, so that we were in control of the true contribution of every variable in the model and the importance that could be estimated after fitting the models. We found that spatial autocorrelation in the predictors inflated the variable importance estimates, but only when the response of species to the environmental gradients is linear. This inflation effect was larger when the environmental preferences of species coincided with the dominant environmental conditions in a study site. Additionally we find that unimodal responses to the predictors yield systematically a higher variable importance compared to linear responses. We conclude that the type of response to environmental conditions and the relative levels of spatial autocorrelation in the environmental variables cause most bias in relative variable importance estimations. In this way, this study helps to clarify in a systematic and controlled approach how to make proper inferences about variable importance in species distribution models.
Original languageEnglish
Pages (from-to)778-788
Number of pages11
JournalEcography
Volume44
Issue number5
Early online date12 Feb 2021
DOIs
Publication statusPublished - May 2021

Keywords

  • dominant niche conditions
  • generalised additive models
  • generalised linear models
  • geographic extent
  • machine learning
  • response curves
  • scale of analysis
  • ITC-ISI-JOURNAL-ARTICLE
  • ITC-GOLD

Fingerprint Dive into the research topics of 'When is variable importance estimation in species distribution modelling affected by spatial correlation?'. Together they form a unique fingerprint.

Cite this