Graph Attention Network-Based Multi-Agent Reinforcement Learning for Slicing Resource Management in Dense Cellular Network

Network slicing (NS) management devotes to providing various services to meet distinct requirements over the same physical communication infrastructure and allocating resources on demands. Considering a dense cellular network scenario that contains several NS over multiple base stations (BSs), it re...

Full description

Saved in:

Bibliographic Details
Published in	IEEE transactions on vehicular technology Vol. 70; no. 10; pp. 10792 - 10803
Main Authors	Shao, Yan, Li, Rongpeng, Hu, Bing, Wu, Yingxiao, Zhao, Zhifeng, Zhang, Honggang
Format	Journal Article
Language	English
Published	New York IEEE 01.10.2021 The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects	5G mobile communication Algorithms Cellular communication Deep learning graph attention network Heuristic algorithms Machine learning multi-agent reinforcement learning Multiagent systems Network slicing Quality of service Radio equipment Real time Real-time systems Reinforcement learning Resource management Strategy Ultra reliable low latency communication
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Network slicing (NS) management devotes to providing various services to meet distinct requirements over the same physical communication infrastructure and allocating resources on demands. Considering a dense cellular network scenario that contains several NS over multiple base stations (BSs), it remains challenging to design a proper real-time inter-slice resource management strategy, so as to cope with frequent BS handover and satisfy the fluctuations of distinct service requirements. In this paper, we propose to formulate this challenge as a multi-agent reinforcement learning (MARL) problem in which each BS represents an agent. Then, we leverage graph attention network (GAT) to strengthen the temporal and spatial cooperation between agents. Furthermore, we incorporate GAT into deep reinforcement learning (DRL) and correspondingly design an intelligent real-time inter-slice resource management strategy. More specially, we testify the universal effectiveness of GAT for advancing DRL in the multi-agent system, by applying GAT on the top of both the value-based method deep Q-network (DQN) and a combination of policy-based and value-based method advantage actor-critic (A2C). Finally, we verify the superiority of the GAT-based MARL algorithms through extensive simulations.
ISSN:	0018-9545 1939-9359
DOI:	10.1109/TVT.2021.3103416