Using multiple imputation to estimate missing data in meta‐regression
Summary There is a growing need for scientific synthesis in ecology and evolution. In many cases, meta‐analytic techniques can be used to complement such synthesis. However, missing data are a serious problem for any synthetic efforts and can compromise the integrity of meta‐analyses in these and ot...
Saved in:
Published in | Methods in ecology and evolution Vol. 6; no. 2; pp. 153 - 163 |
---|---|
Main Authors | , , , , , , , , |
Format | Journal Article |
Language | English |
Published |
London
John Wiley & Sons, Inc
01.02.2015
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | Summary
There is a growing need for scientific synthesis in ecology and evolution. In many cases, meta‐analytic techniques can be used to complement such synthesis. However, missing data are a serious problem for any synthetic efforts and can compromise the integrity of meta‐analyses in these and other disciplines. Currently, the prevalence of missing data in meta‐analytic data sets in ecology and the efficacy of different remedies for this problem have not been adequately quantified.
We generated meta‐analytic data sets based on literature reviews of experimental and observational data and found that missing data were prevalent in meta‐analytic ecological data sets. We then tested the performance of complete case removal (a widely used method when data are missing) and multiple imputation (an alternative method for data recovery) and assessed model bias, precision and multimodel rankings under a variety of simulated conditions using published meta‐regression data sets.
We found that complete case removal led to biased and imprecise coefficient estimates and yielded poorly specified models. In contrast, multiple imputation provided unbiased parameter estimates with only a small loss in precision. The performance of multiple imputation, however, was dependent on the type of data missing. It performed best when missing values were weighting variables, but performance was mixed when missing values were predictor variables. Multiple imputation performed poorly when imputing raw data which were then used to calculate effect size and the weighting variable.
We conclude that complete case removal should not be used in meta‐regression and that multiple imputation has the potential to be an indispensable tool for meta‐regression in ecology and evolution. However, we recommend that users assess the performance of multiple imputation by simulating missing data on a subset of their data before implementing it to recover actual missing data. |
---|---|
Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
ISSN: | 2041-210X 2041-210X |
DOI: | 10.1111/2041-210X.12322 |