Workload placement for virtual GPU enabled systems

Disclosed are aspects of workload selection and placement in systems that include graphics processing units (GPUs) that are virtual GPU (vGPU) enabled. In some aspects, workloads are assigned to virtual graphics processing unit (vGPU)-enabled graphics processing units (GPUs) based on a variety of vG...

Full description

Saved in:
Bibliographic Details
Main Authors Kurkure, Uday Pundalik, Sivaraman, Hari, Vu, Lan
Format Patent
LanguageEnglish
Published 14.11.2023
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Disclosed are aspects of workload selection and placement in systems that include graphics processing units (GPUs) that are virtual GPU (vGPU) enabled. In some aspects, workloads are assigned to virtual graphics processing unit (vGPU)-enabled graphics processing units (GPUs) based on a variety of vGPU placement models. A number of vGPU placement neural networks are trained to maximize a composite efficiency metric based on workload data and GPU data for the plurality of vGPU placement models. A combined neural network selector is generated using the vGPU placement neural networks, and utilized to assign a workload to a vGPU-enabled GPU.
Bibliography:Application Number: US202016742108