Bayesian model selection for logistic regression models with random intercept

Data, collected to model risk of an interesting event, often have a multilevel structure as patients are clustered within larger units, e.g. clinical centers. Risk of the event is usually modeled using a logistic regression model, with a random intercept to control for heterogeneity among clusters....

Full description

Saved in:
Bibliographic Details
Published inComputational statistics & data analysis Vol. 56; no. 5; pp. 1256 - 1274
Main Authors Wagner, Helga, Duller, Christine
Format Journal Article
LanguageEnglish
Published Elsevier B.V 01.05.2012
Elsevier
SeriesComputational Statistics & Data Analysis
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Data, collected to model risk of an interesting event, often have a multilevel structure as patients are clustered within larger units, e.g. clinical centers. Risk of the event is usually modeled using a logistic regression model, with a random intercept to control for heterogeneity among clusters. Model specification requires to decide which regressors have a non-negligible effect, and hence, should be included in the final model and whether risk is actually heterogeneous among centers, i.e. whether the model should include a random intercept or not. In a Bayesian approach, these questions can be answered by combining variable selection with variance selection of the random intercept. Bayesian model selection is performed for a reparameterized version of the logistic random intercept model using spike and slab priors on the parameters subject to selection. Different specifications for these priors are compared on simulated data as well as on a data set where the goal is to identify risk factors for complications after endoscopic retrograde cholangiopancreatography (ERCP). ► Logistic random intercept models are used to model risk for clustered data, e.g. within clinical centers. ► Model specification search is performed by Bayesian variable and variance selection. ► Different spike and slab priors with either Dirac or continuous spikes are specified. ► MCMC algorithms for both spike types are compared in simulations and on real data. ► For different priors, model selection results are similar, but speed and MCMC efficiency differ.
Bibliography:ObjectType-Article-2
SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 23
ISSN:0167-9473
1872-7352
DOI:10.1016/j.csda.2011.06.033