Methods to account for spatial autocorrelation in the analysis of species distributional data: a review

Abstract

Species distributional or trait data based on range map (extent‐of‐occurrence) or atlas survey data often display spatial autocorrelation, i.e. locations close to each other exhibit more similar values than those further apart. If this pattern remains present in the residuals of a statistical model based on such data, one of the key assumptions of standard statistical analyses, that residuals are independent and identically distributed (i.i.d), is violated. The violation of the assumption of i.i.d. residuals may bias parameter estimates and can increase type I error rates (falsely rejecting the null hypothesis of no effect). While this is increasingly recognised by researchers analysing species distribution data, there is, to our knowledge, no comprehensive overview of the many available spatial statistical methods to take spatial autocorrelation into account in tests of statistical significance. Here, we describe six different statistical approaches to infer correlates of species’ distributions, for both presence/absence (binary response) and species abundance data (poisson or normally distributed response), while accounting for spatial autocorrelation in model residuals: autocovariate regression; spatial eigenvector mapping; generalised least squares; (conditional and simultaneous) autoregressive models and generalised estimating equations. A comprehensive comparison of the relative merits of these methods is beyond the scope of this paper. To demonstrate each method's implementation, however, we undertook preliminary tests based on simulated data. These preliminary tests verified that most of the spatial modeling techniques we examined showed good type I error control and precise parameter estimates, at least when confronted with simplistic simulated data containing spatial autocorrelation in the errors. However, we found that for presence/absence data the results and conclusions were very variable between the different methods. This is likely due to the low information content of binary maps. Also, in contrast with previous studies, we found that autocovariate methods consistently underestimated the effects of environmental controls of species distributions. Given their widespread use, in particular for the modelling of species presence/absence data (e.g. climate envelope models), we argue that this warrants further study and caution in their use. To aid other ecologists in making use of the methods described, code to implement them in freely available software is provided in an electronic appendix.

Keywords

Spatial analysisStatisticsAutocorrelationStatistical hypothesis testingNull hypothesisAutoregressive modelEconometricsType I and type II errorsMathematicsComputer science

Affiliated Institutions

Related Publications

Distribution of Residual Autocorrelations in Autoregressive-Integrated Moving Average Time Series Models

George E. P. Box , David A. Pierce

Abstract Many statistical models, and in particular autoregressive—moving average time series models, can be regarded as means of transforming the data to white noise, that is, ...

1970 Journal of the American Statistical A... 2181 citations

Spatial data analysis: theory and practice

Robert Haining

Preface Readership Acknowledgements Introduction Part I. The Context for Spatial Data Analysis: 1. Spatial data analysis: scientific and policy context 2. The nature of spatial ...

2004 Choice Reviews Online 976 citations

Presence‐absence versus presence‐only modelling methods for predicting bird habitat suitability

Wilfried Thuiller , Miguel B. Araújo , Alexandre H. Hirzel

Habitat suitability models can be generated using methods requiring information on species presence or species presence and absence. Knowledge of the predictive performance of s...

2004 Ecography 825 citations

Permutation tests for univariate or multivariate analysis of variance and regression

Marti J. Anderson

The most appropriate strategy to be used to create a permutation distribution for tests of individual terms in complex experimental designs is currently unclear. There are often...

2001 Canadian Journal of Fisheries and Aqu... 1374 citations

An Autologistic Model for the Spatial Distribution of Wildlife

Nicole H. Augustin , M. A. Mugglestone , S. T. Buckland

1. A new method for estimating the geographical distribution of plant and animal species from incomplete field survey data is developed. 2. Wildlife surveys are often conducted ...

1996 Journal of Applied Ecology 657 citations

Publication Info

Year: 2007
Type: review
Volume: 30
Issue: 5
Pages: 609-628
Citations: 3238
Access: Closed

External Links

View on DOI.org

Social Impact

Altmetric

Methods to account for spatial autocorrelation in the analysis of species distributional data: a review

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

3238

OpenAlex

Cite This

APA Style

                            
                                
                                    Carsten F. Dormann, 
                                
                                    Jana McPherson, 
                                
                                    Miguel B. Araújo
                                
                                et al.
                            
                            (2007). 
                            Methods to account for spatial autocorrelation in the analysis of species distributional data: a review. 
                            Ecography
                            , 30
                            (5)
                            , 609-628.
                            https://doi.org/10.1111/j.2007.0906-7590.05171.x
                        

Identifiers

DOI: 10.1111/j.2007.0906-7590.05171.x