Handling covariates subject to limits of detection in regression

In the environmental health sciences, measurements of toxic exposures are often constrained by a lower limit called the limit of detection (LOD), with observations below this limit called non-detects. Although valid inference may be obtained by excluding non-detects in the estimation of exposure eff...

Full description

Saved in:

Bibliographic Details
Published in	Environmental and ecological statistics Vol. 19; no. 3; pp. 369 - 391
Main Authors	Arunajadai, Srikesh G, Rauh, Virginia A
Format	Journal Article
Language	English
Published	Boston Springer-Verlag 01.09.2012 Springer US Springer Nature B.V
Subjects	Biomedical and Life Sciences Chemical contaminants Chemistry and Earth Sciences Computer Science detection limit Ecology Environmental health Environmental monitoring Environmental science Estimates Expected values Exposure Health Sciences Human exposure Life Sciences Math. Appl. in Environmental Science Mathematical models Maximum likelihood method Medicine Methods Physics Random variables risk Simulation Specifications Statistical analysis Statistics for Engineering Statistics for Life Sciences Studies Theoretical Ecology/Statistics Toxicity Limit of detection Regression Multiple imputation Generalized gamma distribution
Online Access	Get full text

Cover

Loading…

More Information
Summary:	In the environmental health sciences, measurements of toxic exposures are often constrained by a lower limit called the limit of detection (LOD), with observations below this limit called non-detects. Although valid inference may be obtained by excluding non-detects in the estimation of exposure effects, this practice can lead to substantial reduction in power to detect a significant effect, depending on the proportion of censoring and the closeness of the effect size to the null value. Therefore, a variety of methods have been commonly used in the environmental science literature to substitute values for the non-detects for the purpose of estimating exposure effects, including ad hoc values such as [Formula: see text] and LOD. Another method substitutes the expected value of the non-detects, i.e., E[X\|X ≤ LOD] but this requires that the inference be robust to mild miss-specifications in the distribution of the exposure variable. In this paper, we demonstrate that the estimate of the exposure effect is extremely sensitive to ad-hoc substitutions and moderate distribution miss-specifications under the conditions of large sample sizes and moderate effect size, potentially leading to biased estimates. We propose instead the use of the generalized gamma distribution to estimate imputed values for the non-detects, and show that this method avoids the risk of distribution miss-specification among the class of distributions represented by the generalized gamma distribution. A multiple imputation-based procedure is employed to estimate the regression parameters. Compared to the method of excluding non-detects, the proposed method can substantially increase the power to detect a significant effect when the effect size is close to the null value in small samples with moderate levels of censoring ( ≤ 50%), without compromising the coverage and relative bias of the estimates.
Bibliography:	http://dx.doi.org/10.1007/s10651-012-0191-6 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 14 ObjectType-Article-1 ObjectType-Feature-2 content type line 23
ISSN:	1352-8505 1573-3009
DOI:	10.1007/s10651-012-0191-6