An EM algorithm for regression analysis with incomplete covariate information

Regression analysis is often challenged by the fact that some covariates are not completely observed. Among other approaches is a newly developed semiparametric maximum likelihood (SML) method that requires no parametric specification of the selection mechanism or the covariate distribution and that...

Full description

Saved in:
Bibliographic Details
Published inJournal of statistical computation and simulation Vol. 77; no. 2; pp. 163 - 173
Main Authors Zhang, Zhiwei, Rockette, Howard E.
Format Journal Article
LanguageEnglish
Published Taylor & Francis 01.02.2007
Online AccessGet full text

Cover

Loading…
More Information
Summary:Regression analysis is often challenged by the fact that some covariates are not completely observed. Among other approaches is a newly developed semiparametric maximum likelihood (SML) method that requires no parametric specification of the selection mechanism or the covariate distribution and that yields efficient inference, at least in some specific models. In this paper, we propose an EM algorithm for finding the SML estimate and for variance estimation. Simulation results suggest that the SML method performs reasonably well in moderate-sized samples. In contrast, the analogous parametric maximum likelihood method is subject to severe bias under model mis-specification, even in large samples.
Bibliography:ObjectType-Article-2
SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 23
ISSN:0094-9655
1563-5163
DOI:10.1080/10629360600565202