Geographical sampling bias in a large distributional database and its effects on species richness-environment models

Aim: Recent advances in the availability of species distributional and high-resolution environmental data have facilitated the investigation of species richness—environment relationships. However, even exhaustive distributional databases are prone to geographical sampling bias. We aim to quantify th...

Full description

Saved in:
Bibliographic Details
Published inJournal of biogeography Vol. 40; no. 8; pp. 1415 - 1426
Main Authors Yang, Wenjing, Ma, Keping, Kreft, Holger
Format Journal Article
LanguageEnglish
Published Oxford Blackwell Publishing Ltd 01.08.2013
Blackwell Publishing
Blackwell
Wiley Subscription Services, Inc
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Aim: Recent advances in the availability of species distributional and high-resolution environmental data have facilitated the investigation of species richness—environment relationships. However, even exhaustive distributional databases are prone to geographical sampling bias. We aim to quantify the inventory incompleteness of vascular plant data across 2377 Chinese counties and to test whether inventory incompleteness affects the analysis of richness—environment relationships and spatial predictions of species richness. Location: China. Methods: We used the most comprehensive database of Chinese vascular plants, which includes county-level occurrences for 29,012 native species derived from 4,236,768 specimen and literature records. For each county, we computed smoothed species accumulation curves and used the mean slope of the last 10% of the curves as a proxy for inventory incompleteness. We created a series of data subsets with different levels of inventory incompleteness by excluding successively more under-sampled counties from the full data set. We then applied spatial and non-spatial regression models to each of these subsets to investigate relationships between the species richness of subsets and environmental factors, and to predict spatial patterns of vascular plant species richness in China. Results: Log 10 -transformed numbers of records and documented species were strongly correlated (r = 0.97). In total, 91% of Chinese counties were identified as under-sampled. After controlling for inventory incompleteness, the overall explanatory power of environmental factors markedly increased, and the strongest predictor of species richness switched from elevational range to annual wet days. Environmental models calibrated with more complete inventories yielded better spatial predictions of species richness. Main conclusions: Our results indicate that inventory incompleteness strongly affects the explanatory power of environmental factors, the main determinants of species richness obtained from regression analyses, and the reliability of environment-based spatial predictions of species richness. We conclude that even large distributional databases are prone to geographical sampling bias, with far-reaching implications for the perception of and inferences about macroecological patterns.
Bibliography:ark:/67375/WNG-CD2ZGBQ1-F
ArticleID:JBI12108
istex:7C045F1C0DF2F697B57F606682EDBF89F1EB1E2B
Appendix S1 Detailed information of species distributional data, environmental variables and nature reserves used in this analysis.Appendix S2 Moran's I correlograms and the selection of lag distances for simultaneous autoregressive models.Appendix S3 R2 values and coefficients of regression models.
ObjectType-Article-1
SourceType-Scholarly Journals-1
content type line 14
ObjectType-Article-2
ObjectType-Feature-1
content type line 23
ISSN:0305-0270
1365-2699
DOI:10.1111/jbi.12108