Bayesian model selection for logistic regression models with random intercept

Data, collected to model risk of an interesting event, often have a multilevel structure as patients are clustered within larger units, e.g. clinical centers. Risk of the event is usually modeled using a logistic regression model, with a random intercept to control for heterogeneity among clusters....

Full description

Saved in:

Bibliographic Details
Published in	Computational statistics & data analysis Vol. 56; no. 5; pp. 1256 - 1274
Main Authors	Wagner, Helga, Duller, Christine
Format	Journal Article
Language	English
Published	Elsevier B.V 01.05.2012 Elsevier
Series	Computational Statistics & Data Analysis
Subjects	Auxiliary mixture sampling Bayesian analysis Heterogeneity Logistics Mathematical models MCMC Normal scale mixtures Regression Risk Specifications Spike and slab priors Statistics Variable selection Variable selection Variance selection MCMC Auxiliary mixture sampling Normal scale mixtures Spike and slab priors Variance selection MCMC Variance selection Auxiliary mixture sampling Normal scale mixtures Spike and slab priors Variable selection
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Data, collected to model risk of an interesting event, often have a multilevel structure as patients are clustered within larger units, e.g. clinical centers. Risk of the event is usually modeled using a logistic regression model, with a random intercept to control for heterogeneity among clusters. Model specification requires to decide which regressors have a non-negligible effect, and hence, should be included in the final model and whether risk is actually heterogeneous among centers, i.e. whether the model should include a random intercept or not. In a Bayesian approach, these questions can be answered by combining variable selection with variance selection of the random intercept. Bayesian model selection is performed for a reparameterized version of the logistic random intercept model using spike and slab priors on the parameters subject to selection. Different specifications for these priors are compared on simulated data as well as on a data set where the goal is to identify risk factors for complications after endoscopic retrograde cholangiopancreatography (ERCP). ► Logistic random intercept models are used to model risk for clustered data, e.g. within clinical centers. ► Model specification search is performed by Bayesian variable and variance selection. ► Different spike and slab priors with either Dirac or continuous spikes are specified. ► MCMC algorithms for both spike types are compared in simulations and on real data. ► For different priors, model selection results are similar, but speed and MCMC efficiency differ.
Bibliography:	ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 23
ISSN:	0167-9473 1872-7352
DOI:	10.1016/j.csda.2011.06.033