Distribution‐free model selection for longitudinal zero‐inflated count data with missing responses and covariates

In many medical and social science studies, count responses with excess zeros are very common and often the primary outcome of interest. Such count responses are usually generated under some clustered correlation structures due to longitudinal observations of subjects. To model such longitudinal cou...

Full description

Saved in:
Bibliographic Details
Published inStatistics in medicine Vol. 41; no. 16; pp. 3180 - 3198
Main Authors Chen, Chun‐Shu, Shen, Chung‐Wei
Format Journal Article
LanguageEnglish
Published England Wiley Subscription Services, Inc 20.07.2022
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:In many medical and social science studies, count responses with excess zeros are very common and often the primary outcome of interest. Such count responses are usually generated under some clustered correlation structures due to longitudinal observations of subjects. To model such longitudinal count data with excess zeros, the zero‐inflated binomial (ZIB) models for bounded outcomes, and the zero‐inflated negative binomial (ZINB) and zero‐inflated poisson (ZIP) models for unbounded outcomes all are popular methods. To alleviate the effects of deviations from model assumptions, a semiparametric (or, distribution‐free) weighted generalized estimating equations has been proposed to estimate model parameters when data are subject to missingness. In this article, we further explore important covariates for the response variable. Without assumptions on the data distribution, a model selection criterion based on the expected weighted quadratic loss is proposed to select an appropriate subset of covariates, especially when count responses have excess zeros and data are subject to nonmonotone missingness in both responses and covariates. To understand the selection effects of the percentages of excess zeros and missingness, we design various scenarios for covariate selection in the mean model via simulation studies and a real data example regarding the study of cardiovascular disease is also presented for illustration.
Bibliography:Funding information
Ministry of Science and Technology, Taiwan, MOST 108‐2628‐M‐008‐005‐MY3; MOST 109‐2118‐M‐194‐001‐MY2
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ISSN:0277-6715
1097-0258
1097-0258
DOI:10.1002/sim.9411