improved nonparametric lower bound of species richness via a modified good–turing frequency formula
It is difficult to accurately estimate species richness if there are many almost undetectable species in a hyper‐diverse community. Practically, an accurate lower bound for species richness is preferable to an inaccurate point estimator. The traditional nonparametric lower bound developed by Chao (1...
Saved in:
Published in | Biometrics Vol. 70; no. 3; pp. 671 - 682 |
---|---|
Main Authors | , , , |
Format | Journal Article |
Language | English |
Published |
United States
Blackwell Publishers
01.09.2014
Blackwell Publishing Ltd International Biometric Society |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | It is difficult to accurately estimate species richness if there are many almost undetectable species in a hyper‐diverse community. Practically, an accurate lower bound for species richness is preferable to an inaccurate point estimator. The traditional nonparametric lower bound developed by Chao (1984, Scandinavian Journal of Statistics 11, 265–270) for individual‐based abundance data uses only the information on the rarest species (the numbers of singletons and doubletons) to estimate the number of undetected species in samples. Applying a modified Good–Turing frequency formula, we derive an approximate formula for the first‐order bias of this traditional lower bound. The approximate bias is estimated by using additional information (namely, the numbers of tripletons and quadrupletons). This approximate bias can be corrected, and an improved lower bound is thus obtained. The proposed lower bound is nonparametric in the sense that it is universally valid for any species abundance distribution. A similar type of improved lower bound can be derived for incidence data. We test our proposed lower bounds on simulated data sets generated from various species abundance models. Simulation results show that the proposed lower bounds always reduce bias over the traditional lower bounds and improve accuracy (as measured by mean squared error) when the heterogeneity of species abundances is relatively high. We also apply the proposed new lower bounds to real data for illustration and for comparisons with previously developed estimators. |
---|---|
Bibliography: | http://dx.doi.org/10.1111/biom.12200 ArticleID:BIOM12200 ark:/67375/WNG-9T8RTRJ8-J istex:2EB4EC4F04742F6D59A887E757AFEF6EE31C3D54 ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23 |
ISSN: | 0006-341X 1541-0420 1541-0420 |
DOI: | 10.1111/biom.12200 |