Machine Learning for Precipitation Forecasts Postprocessing Multimodel Comparison and Experimental Investigation

Obtaining high-quality quantitative precipitation forecasts is a key precondition for hydrological forecast systems. Due to multisource uncertainties (e.g., initial conditions, model structures, and parameters), raw forecasts are subject to systematic biases; hence, statistical postprocessing is oft...

Full description

Saved in:

Bibliographic Details
Published in	Journal of hydrometeorology Vol. 22; no. 11; pp. 3065 - 3085
Main Authors	Zhang, Yuhang, Ye, Aizhong
Format	Journal Article
Language	English
Published	American Meteorological Society 01.11.2021
Online Access	Get full text
ISSN	1525-755X 1525-7541
DOI	10.1175/JHM-D-21-0096.1

Cover

More Information
Summary:	Obtaining high-quality quantitative precipitation forecasts is a key precondition for hydrological forecast systems. Due to multisource uncertainties (e.g., initial conditions, model structures, and parameters), raw forecasts are subject to systematic biases; hence, statistical postprocessing is often required to reduce these errors before the forecasts can proceed to hydrological applications. Machine learning (ML) algorithms are canonical statistical models, and they are diverse in type and variation. It is important to verify and compare their performance in the same scenario (e.g., precipitation postprocessing). In this paper, we conduct a large-scale comparison study for the major ML models with diverse model structures and regularization strategies as postprocessors for improving the quality of precipitation forecasts. Specifically, we compare the efficiency and effectiveness of 21ML algorithms on solving this task. Daily reforecast precipitation with lead times up to 8 days from the Global Ensemble Forecast System and corresponding observations are employed to determine the usability of different models in the Yalong River basin in China. The performance of each model is validated by a group of carefully designed experiments and statistical metrics. The results reveal that improvements in model structures are more effective than regularization strategies. Among these algorithms, the optimized extra-trees regressor exhibits the best performance, effectively reducing overestimation and achieving the best skill in forecasting precipitation. Eleven ensemble members and a 3-day forward-rolling time window can be used as predictors to obtain the best model performance. The systematic experiments and findings also offer useful guidelines for other related studies.
ISSN:	1525-755X 1525-7541
DOI:	10.1175/JHM-D-21-0096.1