Resource Allocation in Data Centers Using Fast Reinforcement Learning Algorithms

Dynamic resource allocation to satisfy varying, concurrent and unpredictable demands from multiple applications is a key need in cloud systems. A fundamental challenge is the need to find the right balance between over-allocation, which satisfies each application's varying needs without requiri...

Full description

Saved in:

Bibliographic Details
Published in	IEEE eTransactions on network and service management Vol. 18; no. 4; pp. 4576 - 4588
Main Authors	Jiang, Yuang, Kodialam, Murali, Lakshman, T. V., Mukherjee, Sarit, Tassiulas, Leandros
Format	Journal Article
Language	English
Published	New York IEEE 01.12.2021 The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects	Algorithms Alliances data center Data centers Machine learning Mathematical model Multiplexing Network function virtualization Reconfiguration Reinforcement learning Resource allocation Resource management Upper bound
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Dynamic resource allocation to satisfy varying, concurrent and unpredictable demands from multiple applications is a key need in cloud systems. A fundamental challenge is the need to find the right balance between over-allocation, which satisfies each application's varying needs without requiring frequent allocation changes, and system efficiency which requires that the allocation exactly matches the application needs. However, allocating resources close to current needs will result in frequent allocation changes. This can be detrimental to applications since there may be fixed costs (state replication, policy reconfiguration, etc.) that need to be incurred by applications for each allocation change. In this paper, we develop an MDP-based dynamic allocation scheme that uses reinforcement learning to satisfy unpredictable application demands. It minimizes the overall resource allocation needed to satisfy varying application demands while meeting application constraints on the rate of allocation changes. We prove convergence bounds and use real-world traces to study the performance.
ISSN:	1932-4537 1932-4537
DOI:	10.1109/TNSM.2021.3100460