Probabilistic forecasts, calibration and sharpness

Probabilistic forecasts of continuous variables take the form of predictive densities or predictive cumulative distribution functions. We propose a diagnostic approach to the evaluation of predictive performance that is based on the paradigm of maximizing the sharpness of the predictive distribution...

Full description

Saved in:
Bibliographic Details
Published inJournal of the Royal Statistical Society. Series B, Statistical methodology Vol. 69; no. 2; pp. 243 - 268
Main Authors Gneiting, Tilmann, Balabdaoui, Fadoua, Raftery, Adrian E
Format Journal Article
LanguageEnglish
Published Oxford, UK Oxford, UK : Blackwell Publishing Ltd 01.04.2007
Blackwell Publishing Ltd
Blackwell Publishers
Blackwell
Royal Statistical Society
Oxford University Press
SeriesJournal of the Royal Statistical Society Series B
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Probabilistic forecasts of continuous variables take the form of predictive densities or predictive cumulative distribution functions. We propose a diagnostic approach to the evaluation of predictive performance that is based on the paradigm of maximizing the sharpness of the predictive distributions subject to calibration. Calibration refers to the statistical consistency between the distributional forecasts and the observations and is a joint property of the predictions and the events that materialize. Sharpness refers to the concentration of the predictive distributions and is a property of the forecasts only. A simple theoretical framework allows us to distinguish between probabilistic calibration, exceedance calibration and marginal calibration. We propose and study tools for checking calibration and sharpness, among them the probability integral transform histogram, marginal calibration plots, the sharpness diagram and proper scoring rules. The diagnostic approach is illustrated by an assessment and ranking of probabilistic forecasts of wind speed at the Stateline wind energy centre in the US Pacific Northwest. In combination with cross-validation or in the time series context, our proposal provides very general, nonparametric alternatives to the use of information criteria for model diagnostics and model selection.
Bibliography:http://dx.doi.org/10.1111/j.1467-9868.2007.00587.x
ark:/67375/WNG-DVTBRD87-8
ArticleID:RSSB587
istex:4A85915CE3228C5C053DAA8317C5833DDE68D3F1
SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 14
ObjectType-Article-2
content type line 23
ObjectType-Article-1
ObjectType-Feature-2
ISSN:1369-7412
1467-9868
DOI:10.1111/j.1467-9868.2007.00587.x