Multiple Reconfigurable Intelligent Surfaces Aided Vehicular Edge Computing Networks: A MAPPO-Based Approach

Reconfigurable intelligent surface (RIS) is envisioned as a new technology to improve the quality-of-service in vehicular edge computing (VEC) networks due to its ability to change the wireless radio propagation environment. In this paper, we study multi-RIS-assisted VEC networks, where vehicle user...

Full description

Saved in:
Bibliographic Details
Published inIEEE transactions on vehicular technology Vol. 73; no. 11; pp. 17496 - 17509
Main Authors Ning, Xiangrui, Zeng, Ming, Hua, Meng, Fei, Zesong
Format Journal Article
LanguageEnglish
Published New York IEEE 01.11.2024
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Reconfigurable intelligent surface (RIS) is envisioned as a new technology to improve the quality-of-service in vehicular edge computing (VEC) networks due to its ability to change the wireless radio propagation environment. In this paper, we study multi-RIS-assisted VEC networks, where vehicle user equipments (VUEs) can offload tasks to nearby base stations (BSs) which offer efficient computation edge services (ESs). Meanwhile, the individual virtual machine (VM), which is defined as a software clone of the VUE's service environment containing the profile and application to run the VUE's tasks, need to be migrated to the same ES for offloaded task completion. Accordingly, we formulate a throughput maximization problem for multi-RIS-assisted VEC networks via jointly optimizing the selected ESs, the deployment locations of RISs, and reflection matrices of RISs, subject to the maximum tolerable delay. To solve the non-convex mixed-integer nonlinear optimization problem, we propose an efficient algorithm based on multi-agent proximal policy optimization (MAPPO) with the centralized training and decentralized execution (CTDE) framework, where two types of heterogeneous agents are considered. In particular, several tricks such as reward normalization, orthogonal initialization, and learning rate decay are adopted to accelerate the convergence of the proposed algorithm. Numerical simulation results show that the throughput obtained by the proposed MAPPO-based scheme outperforms that obtained by the scheme without multi-RIS 26.41% and that obtained by the scheme without service migration 23.65%, respectively. Moreover, compared to other conventional multi-agent reinforcement learning (MARL) algorithms, the proposed algorithm converges faster.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:0018-9545
1939-9359
DOI:10.1109/TVT.2024.3419554