Determination of CERES TOA Fluxes Using Machine Learning Algorithms. Part I: Classification and Retrieval of CERES Cloudy and Clear Scenes

Continuous monitoring of the earth radiation budget (ERB) is critical to the understanding of Earth’s climate and its variability with time. The Clouds and the Earth’s Radiant Energy System (CERES) instrument is able to provide a long record of ERB for such scientific studies. This manuscript, which...

Full description

Saved in:
Bibliographic Details
Published inJournal of atmospheric and oceanic technology Vol. 34; no. 10; pp. 2329 - 2345
Main Authors Thampi, Bijoy Vengasseril, Wong, Takmeng, Lukashin, Constantin, Loeb, Norman G.
Format Journal Article
LanguageEnglish
Published United States 01.10.2017
Online AccessGet full text
ISSN0739-0572
1520-0426
DOI10.1175/JTECH-D-16-0183.1

Cover

More Information
Summary:Continuous monitoring of the earth radiation budget (ERB) is critical to the understanding of Earth’s climate and its variability with time. The Clouds and the Earth’s Radiant Energy System (CERES) instrument is able to provide a long record of ERB for such scientific studies. This manuscript, which is the first of a two-part paper, describes the new CERES algorithm for improving the clear/cloudy scene classification without the use of coincident cloud imager data. This new CERES algorithm is based on a subset of the modern artificial intelligence (AI) paradigm called machine learning (ML) algorithms. This paper describes the development and application of the ML algorithm known as random forests (RF), which is used to classify CERES broadband footprint measurements into clear and cloudy scenes. Results from the RF analysis carried using the CERES Single Scanner Footprint (SSF) data for January and July are presented in the manuscript. The daytime RF misclassification rate (MCR) shows relatively large values (>30%) for snow, sea ice, and bright desert surface types, while lower values (<10%) for the forest surface type. MCR values observed for the nighttime data in general show relatively larger values for most of the surface types compared to the daytime MCR values. The modified MCR values show lower values (<4%) for most surface types after thin cloud data are excluded from the analysis. Sensitivity analysis shows that the number of input variables and decision trees used in the RF analysis has a substantial influence on determining the classification error.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0739-0572
1520-0426
DOI:10.1175/JTECH-D-16-0183.1