Model selection based on resampling approaches for cluster longitudinal data with missingness in outcomes

In medical and health studies, longitudinal and cluster longitudinal data are often collected, where the response variable of interest is observed repeatedly over time and along with a set of covariates. Model selection becomes an active research topic but has not been explored largely due to the co...

Full description

Saved in:
Bibliographic Details
Published inStatistics in medicine Vol. 37; no. 20; pp. 2982 - 2997
Main Authors Chen, Chun‐Shu, Shen, Chung‐Wei
Format Journal Article
LanguageEnglish
Published England Wiley Subscription Services, Inc 10.09.2018
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:In medical and health studies, longitudinal and cluster longitudinal data are often collected, where the response variable of interest is observed repeatedly over time and along with a set of covariates. Model selection becomes an active research topic but has not been explored largely due to the complex correlation structure of the data set. To address this important issue, in this paper, we concentrate on model selection of cluster longitudinal data especially when data are subject to missingness. Motivated from the expected weighted quadratic loss of a given model, data perturbation and bootstrapping methods are used to estimate the loss and then the model that has the smallest expected loss is selected as the best model. To justify the proposed model selection method, we provide various numerical assessments and a real application regarding the asthma data set is also analyzed for illustration.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ISSN:0277-6715
1097-0258
1097-0258
DOI:10.1002/sim.7801