Abstract

ABSTRACT Current circumstances — that the majority of species distribution records exist as presence‐only data (e.g. from museums and herbaria), and that there is an established need for predictions of species distributions — mean that scientists and conservation managers seek to develop robust methods for using these data. Such methods must, in particular, accommodate the difficulties caused by lack of reliable information about sites where species are absent. Here we test two approaches for overcoming these difficulties, analysing a range of data sets using the technique of multivariate adaptive regression splines (MARS). MARS is closely related to regression techniques such as generalized additive models (GAMs) that are commonly and successfully used in modelling species distributions, but has particular advantages in its analytical speed and the ease of transfer of analysis results to other computational environments such as a Geographic Information System. MARS also has the advantage that it can model multiple responses, meaning that it can combine information from a set of species to determine the dominant environmental drivers of variation in species composition. We use data from 226 species from six regions of the world, and demonstrate the use of MARS for distribution modelling using presence‐only data. We test whether (1) the type of data used to represent absence or background and (2) the signal from multiple species affect predictive performance, by evaluating predictions at completely independent sites where genuine presence–absence data were recorded. Models developed with absences inferred from the total set of presence‐only sites for a biological group, and using simultaneous analysis of multiple species to inform the choice of predictor variables, performed better than models in which species were analysed singly, or in which pseudo‐absences were drawn randomly from the study area. The methods are fast, relatively simple to understand, and useful for situations where data are limited. A tutorial is included.

Keywords

Multivariate adaptive regression splinesHerbariumMultivariate statisticsMars Exploration ProgramSpecies distributionRange (aeronautics)Data setEnvironmental niche modellingSet (abstract data type)RegressionRegression analysisLinear regressionGeographyComputer scienceEcologyStatisticsData miningMathematicsBayesian multivariate linear regressionHabitatBiologyEcological nicheEngineering

Affiliated Institutions

Related Publications

Publication Info

Year
2007
Type
article
Volume
13
Issue
3
Pages
265-275
Citations
337
Access
Closed

External Links

Citation Metrics

337
OpenAlex

Cite This

Jane Elith, John R. Leathwick (2007). Predicting species distributions from museum and herbarium records using multiresponse models fitted with multivariate adaptive regression splines. Diversity and Distributions , 13 (3) , 265-275. https://doi.org/10.1111/j.1472-4642.2007.00340.x

Identifiers

DOI
10.1111/j.1472-4642.2007.00340.x