MACHINE LEARNING WORKLOAD ORCHESTRATION IN HETEROGENEOUS CLUSTERS

Systems and methods are described herein to orchestrate the execution of an application, such as a machine learning or artificial intelligence application, using distributed compute clusters with heterogeneous compute resources. A discovery subsystem may identify the different compute resources of e...

Full description

Saved in:
Bibliographic Details
Main Authors Costa, Carlos Haas, Makaya, Christian, Athreya, Madhu
Format Patent
LanguageEnglish
Published 19.01.2023
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Systems and methods are described herein to orchestrate the execution of an application, such as a machine learning or artificial intelligence application, using distributed compute clusters with heterogeneous compute resources. A discovery subsystem may identify the different compute resources of each compute cluster. The application is divided into a plurality of workloads with each workload associated with resource demands corresponding to the compute resources of one of the compute clusters. Adaptive modeling allows for hyperparameters to be defined for each workload based on the compute resources associated with the compute cluster to which each respective workload is assigned and the associated dataset.
Bibliography:Application Number: US201917783311