Variable selection via the composite likelihood method for multilevel longitudinal data with missing responses and covariates

Bibliographic Details
Published in: Computational Statistics & Data Analysis, Vol. 135, pp. 25-34
Main Authors: Li, Haocheng; Shu, Di; He, Wenqing; Yi, Grace Y.
Format: Journal Article
Language: English
Published: Elsevier B.V., 01.07.2019

Summary: Longitudinal data with multilevel structures are commonly collected when subjects in clusters are followed over a period of time. Missing values and variable selection issues are common in such data. Ignoring the incompleteness of the data in the analysis may produce biased results. On the other hand, incorporating a large number of irrelevant covariates into inferential procedures may make computation and interpretation difficult. A unified penalized composite likelihood framework is developed to handle data with both missingness and variable selection issues. The framework is flexible enough to handle situations in which responses and covariates are not missing simultaneously, under a missing-not-at-random assumption. The method is justified rigorously with theoretical results and numerically with simulation studies. It is also applied to the Waterloo Smoking Prevention Project data.
ISSN: 0167-9473, 1872-7352
DOI: 10.1016/j.csda.2019.01.011