COST-OPTIMAL CLUSTER CONFIGURATION ANALYTICS PACKAGE

Systems, methods, and computer-readable media for identifying an optimal cluster configuration for performing a job in a remote cluster computing system. In some examples, one or more applications and a sample of a production load as part of a job for a remote cluster computing system is received. D...

Full description

Saved in:
Bibliographic Details
Main Authors Truong, Alex V, Potipireddi, Prasad, Nucci, Antonio, Khattab, Ahmed, Wong, Athena, Milosavljevic, Dragan, Oberon, John, Bekti, Samudra Harapan, Tang, Ping Pamela, Stojanovic, Alexander Sasha
Format Patent
LanguageEnglish
Published 06.06.2019
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Systems, methods, and computer-readable media for identifying an optimal cluster configuration for performing a job in a remote cluster computing system. In some examples, one or more applications and a sample of a production load as part of a job for a remote cluster computing system is received. Different clusters of nodes are instantiated in the remote cluster computing system to form different cluster configurations. Multi-Linear regression models segmented into different load regions are trained by running at least a portion of the sample on the instantiated different clusters of nodes. Expected completion times of the production load across varying cluster configurations are identified using the multi-linear regression models. An optimal cluster configuration of the varying cluster configurations is determined for the job based on the identified expected completion times.
Bibliography:Application Number: US201715830490