Using the bootstrap to improve estimation and confidence intervals for regression coefficients selected using backwards variable elimination
Applied researchers frequently use automated model selection methods, such as backwards variable elimination, to develop parsimonious regression models. Statisticians have criticized the use of these methods for several reasons, amongst them are the facts that the estimated regression coefficients a...
Saved in:
Published in | Statistics in medicine Vol. 27; no. 17; pp. 3286 - 3300 |
---|---|
Main Author | |
Format | Journal Article |
Language | English |
Published |
Chichester, UK
John Wiley & Sons, Ltd
30.07.2008
Wiley Subscription Services, Inc |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | Applied researchers frequently use automated model selection methods, such as backwards variable elimination, to develop parsimonious regression models. Statisticians have criticized the use of these methods for several reasons, amongst them are the facts that the estimated regression coefficients are biased and that the derived confidence intervals do not have the advertised coverage rates. We developed a method to improve estimation of regression coefficients and confidence intervals which employs backwards variable elimination in multiple bootstrap samples. In a given bootstrap sample, predictor variables that are not selected for inclusion in the final regression model have their regression coefficient set to zero. Regression coefficients are averaged across the bootstrap samples, and non‐parametric percentile bootstrap confidence intervals are then constructed for each regression coefficient. We conducted a series of Monte Carlo simulations to examine the performance of this method for estimating regression coefficients and constructing confidence intervals for variables selected using backwards variable elimination. We demonstrated that this method results in confidence intervals with superior coverage compared with those developed from conventional backwards variable elimination. We illustrate the utility of our method by applying it to a large sample of subjects hospitalized with a heart attack. Copyright © 2007 John Wiley & Sons, Ltd. |
---|---|
Bibliography: | Ontario Ministry of Health and Long Term Care ArticleID:SIM3104 istex:1FFB0CF22D7B0EEB18DF9119734A5A5A8AE66D27 ark:/67375/WNG-9MDD5V8M-6 CIHR (Institute of Health Services and Policy Research) Natural Sciences and Engineering Research Council (NSERC) SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 14 ObjectType-Article-1 ObjectType-Feature-2 content type line 23 |
ISSN: | 0277-6715 1097-0258 |
DOI: | 10.1002/sim.3104 |