WORKLOAD PLACEMENT FOR VIRTUAL GPU ENABLED SYSTEMS
Format | Patent |
---|---|
Language | English |
Published | 01.02.2024 |
Summary: | Disclosed are aspects of workload selection and placement in systems that include virtual GPU (vGPU)-enabled graphics processing units (GPUs). In some aspects, workloads are assigned to vGPU-enabled GPUs. A number of vGPU placement neural networks are trained to maximize a composite efficiency metric based on workload data and GPU data for a plurality of vGPU placement models. A combined neural network selector is generated using the vGPU placement neural networks and utilized to assign a workload to a vGPU-enabled GPU. |
---|---|
Bibliography: | Application Number: US202318483100 |