A novel reinforcement learning based grey wolf optimizer algorithm for unmanned aerial vehicles (UAVs) path planning

Unmanned aerial vehicles (UAVs) have been used in wide range of areas, and a high-quality path planning method is needed for UAVs to satisfy their applications. However, many algorithms reported in the literature may not feasible or efficient, especially in the face of three-dimensional complex flig...

Full description

Saved in:
Bibliographic Details
Published inApplied soft computing Vol. 89; p. 106099
Main Authors Qu, Chengzhi, Gai, Wendong, Zhong, Maiying, Zhang, Jing
Format Journal Article
LanguageEnglish
Published Elsevier B.V 01.04.2020
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Unmanned aerial vehicles (UAVs) have been used in wide range of areas, and a high-quality path planning method is needed for UAVs to satisfy their applications. However, many algorithms reported in the literature may not feasible or efficient, especially in the face of three-dimensional complex flight environment. In this paper, a novel reinforcement learning based grey wolf optimizer algorithm called RLGWO has been presented for solving this problem. In the proposed algorithm, the reinforcement learning is inserted that the individual is controlled to switch operations adaptively according to the accumulated performance. Considering that the proposed algorithm is designed to serve for UAVs path planning, four operations have been introduced for each individual: exploration, exploitation, geometric adjustment, and optimal adjustment. In addition, the cubic B-spline curve is used to smooth the generated flight route and make the planning path be suitable for the UAVs. The simulation experimental results show that the RLGWO algorithm can acquire a feasible and effective route successfully in complicated environment. •A novel reinforcement learning-based grey wolf optimizer algorithm called RLGWO is proposed to solve the UAVs three-dimensional path planning problem.•The RLGWO includes four operations: exploration, exploitation, geometric adjustment and optimal adjustment. Each individual in RLGWO perform their operations independently.
ISSN:1568-4946
1872-9681
DOI:10.1016/j.asoc.2020.106099