Machine Learning for Precipitation Forecasts Postprocessing Multimodel Comparison and Experimental Investigation
Obtaining high-quality quantitative precipitation forecasts is a key precondition for hydrological forecast systems. Due to multisource uncertainties (e.g., initial conditions, model structures, and parameters), raw forecasts are subject to systematic biases; hence, statistical postprocessing is oft...
Saved in:
Published in | Journal of hydrometeorology Vol. 22; no. 11; pp. 3065 - 3085 |
---|---|
Main Authors | , |
Format | Journal Article |
Language | English |
Published |
American Meteorological Society
01.11.2021
|
Online Access | Get full text |
ISSN | 1525-755X 1525-7541 |
DOI | 10.1175/JHM-D-21-0096.1 |
Cover
Summary: | Obtaining high-quality quantitative precipitation forecasts is a key precondition for hydrological forecast systems. Due to multisource uncertainties (e.g., initial conditions, model structures, and parameters), raw forecasts are subject to systematic biases; hence, statistical postprocessing is often required to reduce these errors before the forecasts can proceed to hydrological applications. Machine learning (ML) algorithms are canonical statistical models, and they are diverse in type and variation. It is important to verify and compare their performance in the same scenario (e.g., precipitation postprocessing). In this paper, we conduct a large-scale comparison study for the major ML models with diverse model structures and regularization strategies as postprocessors for improving the quality of precipitation forecasts. Specifically, we compare the efficiency and effectiveness of 21ML algorithms on solving this task. Daily reforecast precipitation with lead times up to 8 days from the Global Ensemble Forecast System and corresponding observations are employed to determine the usability of different models in the Yalong River basin in China. The performance of each model is validated by a group of carefully designed experiments and statistical metrics. The results reveal that improvements in model structures are more effective than regularization strategies. Among these algorithms, the optimized extra-trees regressor exhibits the best performance, effectively reducing overestimation and achieving the best skill in forecasting precipitation. Eleven ensemble members and a 3-day forward-rolling time window can be used as predictors to obtain the best model performance. The systematic experiments and findings also offer useful guidelines for other related studies. |
---|---|
ISSN: | 1525-755X 1525-7541 |
DOI: | 10.1175/JHM-D-21-0096.1 |