A comparison of several approaches for choosing between working correlation structures in generalized estimating equation analysis of longitudinal binary data

The method of generalized estimating equations (GEE) models the association between the repeated observations on a subject with a patterned correlation matrix. Correct specification of the underlying structure is a potentially beneficial goal, in terms of improving efficiency and enhancing scientifi...

Full description

Saved in:
Bibliographic Details
Published inStatistics in medicine Vol. 28; no. 18; pp. 2338 - 2355
Main Authors Shults, Justine, Sun, Wenguang, Tu, Xin, Kim, Hanjoo, Amsterdam, Jay, Hilbe, Joseph M., Ten-Have, Thomas
Format Journal Article
LanguageEnglish
Published Chichester, UK John Wiley & Sons, Ltd 15.08.2009
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The method of generalized estimating equations (GEE) models the association between the repeated observations on a subject with a patterned correlation matrix. Correct specification of the underlying structure is a potentially beneficial goal, in terms of improving efficiency and enhancing scientific understanding. We consider two sets of criteria that have previously been suggested, respectively, for selecting an appropriate working correlation structure, and for ruling out a particular structure(s), in the GEE analysis of longitudinal studies with binary outcomes. The first selection criterion chooses the structure for which the model‐based and the sandwich‐based estimator of the covariance matrix of the regression parameter estimator are closest, while the second selection criterion chooses the structure that minimizes the weighted error sum of squares. The rule out criterion deselects structures for which the estimated correlation parameter violates standard constraints for binary data that depend on the marginal means. In addition, we remove structures from consideration if their estimated parameter values yield an estimated correlation structure that is not positive definite. We investigate the performance of the two sets of criteria using both simulated and real data, in the context of a longitudinal trial that compares two treatments for major depressive episode. Practical recommendations are also given on using these criteria to aid in the efficient selection of a working correlation structure in GEE analysis of longitudinal binary data. Copyright © 2009 John Wiley & Sons, Ltd.
Bibliography:ArticleID:SIM3622
NIH-R01CA - No. 096885
istex:9B181C9858EB3C394ADDB747255867D5A241826C
ark:/67375/WNG-ZL60BRLX-F
ObjectType-Article-2
SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 23
ObjectType-Undefined-3
ISSN:0277-6715
1097-0258
1097-0258
DOI:10.1002/sim.3622