Situational continuity-based air combat autonomous maneuvering decision-making

In order to improve the performance of UAV’s autonomous maneuvering decision-making, this paper proposes a decision-making method based on situational continuity. The algorithm in this paper designs a situation evaluation function with strong guidance, then trains the Long Short-Term Memory (LSTM) u...

Full description

Saved in:

Bibliographic Details
Published in	Defence technology Vol. 29; pp. 66 - 79
Main Authors	Zhang, Jian-dong, Yu, Yi-fei, Zheng, Li-hui, Yang, Qi-ming, Shi, Guo-qing, Wu, Yong
Format	Journal Article
Language	English
Published	KeAi Communications Co., Ltd 01.11.2023
Subjects	Deep Q network (DQN) Fully neural network (FNN) Long short-term memory (LSTM) Maneuvering decision-making Situational continuity UAV
Online Access	Get full text

Cover

Loading…

More Information
Summary:	In order to improve the performance of UAV’s autonomous maneuvering decision-making, this paper proposes a decision-making method based on situational continuity. The algorithm in this paper designs a situation evaluation function with strong guidance, then trains the Long Short-Term Memory (LSTM) under the framework of Deep Q Network (DQN) for air combat maneuvering decision-making. Considering the continuity between adjacent situations, the method takes multiple consecutive situations as one input of the neural network. To reflect the difference between adjacent situations, the method takes the difference of situation evaluation value as the reward of reinforcement learning. In different scenarios, the algorithm proposed in this paper is compared with the algorithm based on the Fully Neural Network (FNN) and the algorithm based on statistical principles respectively. The results show that, compared with the FNN algorithm, the algorithm proposed in this paper is more accurate and forward-looking. Compared with the algorithm based on the statistical principles, the decision-making of the algorithm proposed in this paper is more efficient and its real-time performance is better.
ISSN:	2214-9147 2214-9147
DOI:	10.1016/j.dt.2022.08.010