Abstract
ABSTRACT Current circumstances — that the majority of species distribution records exist as presence‐only data (e.g. from museums and herbaria), and that there is an established need for predictions of species distributions — mean that scientists and conservation managers seek to develop robust methods for using these data. Such methods must, in particular, accommodate the difficulties caused by lack of reliable information about sites where species are absent. Here we test two approaches for overcoming these difficulties, analysing a range of data sets using the technique of multivariate adaptive regression splines (MARS). MARS is closely related to regression techniques such as generalized additive models (GAMs) that are commonly and successfully used in modelling species distributions, but has particular advantages in its analytical speed and the ease of transfer of analysis results to other computational environments such as a Geographic Information System. MARS also has the advantage that it can model multiple responses, meaning that it can combine information from a set of species to determine the dominant environmental drivers of variation in species composition. We use data from 226 species from six regions of the world, and demonstrate the use of MARS for distribution modelling using presence‐only data. We test whether (1) the type of data used to represent absence or background and (2) the signal from multiple species affect predictive performance, by evaluating predictions at completely independent sites where genuine presence–absence data were recorded. Models developed with absences inferred from the total set of presence‐only sites for a biological group, and using simultaneous analysis of multiple species to inform the choice of predictor variables, performed better than models in which species were analysed singly, or in which pseudo‐absences were drawn randomly from the study area. The methods are fast, relatively simple to understand, and useful for situations where data are limited. A tutorial is included.
Keywords
Affiliated Institutions
Related Publications
The uncertain nature of absences and their importance in species distribution modelling
Species distribution models (SDM) are commonly used to obtain hypotheses on either the realized or the potential distribution of species. The reliability and meaning of these hy...
Novel methods improve prediction of species’ distributions from occurrence data
Prediction of species’ distributions is central to diverse applications in ecology, evolution and conservation science. There is increasing electronic access to vast sets of occ...
Model‐based uncertainty in species range prediction
Abstract Aim Many attempts to predict the potential range of species rely on environmental niche (or ‘bioclimate envelope’) modelling, yet the effects of using different niche‐b...
Modelling distribution and abundance with presence‐only data
Summary Presence‐only data, for which there is no information on locations where the species is absent, are common in both animal and plant studies. In many situations, these ma...
Presence‐absence versus presence‐only modelling methods for predicting bird habitat suitability
Habitat suitability models can be generated using methods requiring information on species presence or species presence and absence. Knowledge of the predictive performance of s...
Publication Info
- Year
- 2007
- Type
- article
- Volume
- 13
- Issue
- 3
- Pages
- 265-275
- Citations
- 337
- Access
- Closed
External Links
Social Impact
Social media, news, blog, policy document mentions
Citation Metrics
Cite This
Identifiers
- DOI
- 10.1111/j.1472-4642.2007.00340.x