Joint Trajectory Design and BS Association for Cellular-Connected UAV: An Imitation-Augmented Deep Reinforcement Learning Approach

This article concerns the problem of the trajectory design and base station (BS) association for cellular-connected unmanned aerial vehicles (UAVs). To support safety-critical functions, one primary requirement for UAVs is to maintain reliable cellular connectivity at every time instant during the f...

Full description

Saved in:

Bibliographic Details
Published in	IEEE internet of things journal Vol. 9; no. 4; pp. 2843 - 2858
Main Authors	Chen, Yu-Jia, Huang, Da-Yu
Format	Journal Article
Language	English
Published	Piscataway IEEE 15.02.2022 The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects	Antenna gain Antenna radiation patterns Artificial neural networks Cell association cellular networks Completion time Connectivity Deep learning deep reinforcement learning (DRL) Interference Machine learning Radio signals Safety critical Signal strength Signal to noise ratio Three-dimensional displays Trajectory trajectory design Trajectory optimization unmanned aerial vehicle (UAV) Unmanned aerial vehicles
Online Access	Get full text

Cover

Loading…

More Information
Summary:	This article concerns the problem of the trajectory design and base station (BS) association for cellular-connected unmanned aerial vehicles (UAVs). To support safety-critical functions, one primary requirement for UAVs is to maintain reliable cellular connectivity at every time instant during the flight mission. Since the antenna gain of a ground BS (GBS) changes with the position of the UAV, the UAV-GBS association strategy should be jointly considered with the trajectory design, which has not been studied in the prior arts. In this article, we first formulate the problem of joint BS association and trajectory design with the objective of minimizing the mission completion time under a connectivity outage constraint. Then, a deep learning framework is proposed to solve the formulated nonconvex optimization problem in a decoupled manner. For the UAV-GBS association strategy, the signal strength radio map of a given area is constructed, which is used to train a deep neural network (DNN) to approximate the nonlinear mapping from the UAV position to the optimal GBS. To tackle the high complexity due to the coupled decision variables of GBS association and UAV movement, a novel deep reinforcement learning (DRL) approach is developed to learn the optimal trajectory, in which the UAV can learn from its own past good experiences. Our simulation results confirm the superiority of the proposed DRL approach compared to the conventional DRL approaches in terms of the trajectory length. Additionally, it is demonstrated that the nearest association scheme fails to provide reliable cellular connections, whereas our proposed approach can ensure strong connectivity with the GBS during the whole trajectory.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	2327-4662 2327-4662
DOI:	10.1109/JIOT.2021.3093116