Deep Reinforcement Learning-Based Cooperative Survivability Maximization for a UAV Fleet on an Air-to-Ground Mission
This study focuses on the cooperative strategy development of a UAV team that operates in a hostile environment in which the radar and weapon systems try to track and eliminate them. To simulate the hostile defense system, we present Markov models that generate the detecting and tracking probabiliti...
Saved in:
Published in | Havacılık ve Uzay Teknolojileri Dergisi Vol. 15; no. 2; pp. 94 - 107 |
---|---|
Main Author | |
Format | Journal Article |
Language | English |
Published |
Turkish Air Force Academy
01.07.2022
|
Online Access | Get full text |
Cover
Loading…
Summary: | This study focuses on the cooperative strategy development of a UAV team that operates in a hostile environment in which the radar and weapon systems try to track and eliminate them. To simulate the hostile defense system, we present Markov models that generate the detecting and tracking probabilities of a radar system, and calculate the multiple-shot survivability of air vehicles that fly within the hostile environment. A cooperative strategy development procedure is presented based on proximal policy optimization algorithm, which is a deep reinforcement learning method. It is shown that the UAV team can develop cooperative strategies by exploiting enemy’s weakness to maximize team survivability in an air-to-ground mission after training with the proposed reinforcement learning scheme. |
---|---|
ISSN: | 1304-0448 |