Situational continuity-based air combat autonomous maneuvering decision-making

In order to improve the performance of UAV’s autonomous maneuvering decision-making, this paper proposes a decision-making method based on situational continuity. The algorithm in this paper designs a situation evaluation function with strong guidance, then trains the Long Short-Term Memory (LSTM) u...

Full description

Saved in:
Bibliographic Details
Published inDefence technology Vol. 29; pp. 66 - 79
Main Authors Zhang, Jian-dong, Yu, Yi-fei, Zheng, Li-hui, Yang, Qi-ming, Shi, Guo-qing, Wu, Yong
Format Journal Article
LanguageEnglish
Published KeAi Communications Co., Ltd 01.11.2023
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:In order to improve the performance of UAV’s autonomous maneuvering decision-making, this paper proposes a decision-making method based on situational continuity. The algorithm in this paper designs a situation evaluation function with strong guidance, then trains the Long Short-Term Memory (LSTM) under the framework of Deep Q Network (DQN) for air combat maneuvering decision-making. Considering the continuity between adjacent situations, the method takes multiple consecutive situations as one input of the neural network. To reflect the difference between adjacent situations, the method takes the difference of situation evaluation value as the reward of reinforcement learning. In different scenarios, the algorithm proposed in this paper is compared with the algorithm based on the Fully Neural Network (FNN) and the algorithm based on statistical principles respectively. The results show that, compared with the FNN algorithm, the algorithm proposed in this paper is more accurate and forward-looking. Compared with the algorithm based on the statistical principles, the decision-making of the algorithm proposed in this paper is more efficient and its real-time performance is better.
ISSN:2214-9147
2214-9147
DOI:10.1016/j.dt.2022.08.010