Evaluating Apparent Temperature in the Contiguous United States From Four Reanalysis Products Using Artificial Neural Networks

Reanalysis data sets are the products of data assimilation systems and while they exhibit consistency with the observed climate, they tend to be biased, especially at a local scale. In this study, we present an innovative approach for the assessment of the accuracy of four reanalysis data sets—Europ...

Full description

Saved in:
Bibliographic Details
Published inJournal of geophysical research. Machine learning and computation Vol. 1; no. 2
Main Authors Ibebuchi, Chibuike Chiedozie, Lee, Cameron C., Silva, Alindomar, Sheridan, Scott C.
Format Journal Article
LanguageEnglish
Published 01.06.2024
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Reanalysis data sets are the products of data assimilation systems and while they exhibit consistency with the observed climate, they tend to be biased, especially at a local scale. In this study, we present an innovative approach for the assessment of the accuracy of four reanalysis data sets—European Center for Medium‐Range Weather Forecasts, Reanalysis 5 (ERA5), Modern‐Era Retrospective analysis for Research and Applications Version 2, North American Regional Reanalysis, and the Twentieth Century Reanalysis—in capturing daily and extreme observed apparent temperatures in over 300 stations in the contiguous United States (USA). We modeled the relationship between the reanalysis and observed data during a training period and subsequently used these models to predict daily observations during the test period. This process enabled potential adjustment of the biases in the reanalysis data sets (i.e., the predictors). For predictive modeling, the performance of a Feed‐Forward Neural Network (FFNN) and a linear regression (LR) were assessed. Our results show the data assimilations were closer to observations in the eastern parts compared to the western parts of the USA. Considering the number of stations with the lowest error and highest hit rate (for extreme events), FFNN outperformed LR except for extreme warm events where LR performed better. Among the reanalysis data sets, ERA5 consistently exhibited the highest performance in predicting both daily and extreme values of apparent temperature, with a larger margin of relative accuracy for cold extremes. Thus, ERA5 is more adept at accurately representing apparent temperatures in the USA. Plain Language Summary Robust assessment of reanalysis data sets is required before they are applied for a targeted study. We present a robust approach to evaluate assimilated apparent temperature (AT) against station observations in the contiguous United States (USA). Our results show that reanalysis data sets consistently underperform in assimilating AT in the western parts of the contiguous USA, characterized by mountainous complex terrain. Post‐processing the reanalysis data sets with neural networks improved their representation of daily station observations and enabled rigorous assessment of the capability of the reanalysis data sets to predict station data. By evaluating the different reanalysis data sets, we conclude that compared to North American Regional Reanalysis, Modern‐Era Retrospective analysis for Research and Applications Version 2, and Twentieth Century Reanalysis, ERA5 performs best in predicting daily and extreme values of station‐based AT in the contiguous USA. Key Points Reanalysis outputs relatively fall short in representing apparent temperature (AT) on the western coasts of the United States (USA) Post‐processing reanalysis outputs with the Feed‐Forward Neural networks reduced their biases relative to station observations ERA5 outperformed North American Regional Reanalysis, Modern‐Era Retrospective analysis for Research and Applications Version 2, and Twentieth Century Reanalysis in representing observed AT in the USA
ISSN:2993-5210
2993-5210
DOI:10.1029/2023JH000102