Development and validation of a prognostic model for survival time data: application to prognosis of HIV positive patients treated with antiretroviral therapy

The process of developing and validating a prognostic model for survival time data has been much discussed in the literature. Assessment of the performance of candidate prognostic models on data other than that used to fit the models is essential for choosing a model that will generalize well to ind...

Full description

Saved in:
Bibliographic Details
Published inStatistics in medicine Vol. 23; no. 15; pp. 2375 - 2398
Main Authors May, Margaret, Royston, Patrick, Egger, Matthias, Justice, Amy C., Sterne, Jonathan AC
Format Journal Article
LanguageEnglish
Published Chichester, UK John Wiley & Sons, Ltd 15.08.2004
Wiley Subscription Services, Inc
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The process of developing and validating a prognostic model for survival time data has been much discussed in the literature. Assessment of the performance of candidate prognostic models on data other than that used to fit the models is essential for choosing a model that will generalize well to independent data. However, there remain difficulties in current methods of measuring the accuracy of predictions of prognostic models for censored survival time data. In this paper, flexible parametric models based on the Weibull, loglogistic and lognormal distributions with spline smoothing of the baseline log cumulative hazard function are used to fit a set of candidate prognostic models across k data sets. The model that generalizes best to new data is chosen using a cross‐validation scheme which fits the model on k–1 data sets and tests the predictive accuracy on the omitted data set. The procedure is repeated, omitting each data set in turn. The quality of the predictions is measured using three different methods: two commonly proposed validation methods, Harrell's concordance statistic and the Brier statistic, and a novel method using deviance differences. The results show that the deviance statistic is able to discriminate between quite similar models and can be used to choose a prognostic model that generalizes well to new data. The methods are illustrated by using a model developed to predict progression to a new AIDS event or death in HIV‐1 positive patients starting antiretroviral therapy. Copyright © 2004 John Wiley & Sons, Ltd.
Bibliography:The Canadian Institutes of Health Research
GlaxoSmithKline, Roche and Boehringer-Ingelheim
French, Italian and Swiss Ministries of Health
ArticleID:SIM1825
The Michael Smith Foundation for Health Research
Institut National de la Santé et de la Récherche Medicale
ark:/67375/WNG-P42K8DDC-M
The Dutch Stichting HIV Monitoring
The European Commission
U.K. Medical Research Council - No. RD1564
Agence Nationale de Recherches sur le SIDA
The British Columbia and Alberta Governments
istex:8068ECEE88D08AA80AE7D3C36F6899DEB3C984AE
Members of study groups are listed at end of paper.
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-3
content type line 23
ObjectType-Undefined-2
ISSN:0277-6715
1097-0258
DOI:10.1002/sim.1825