Deep Reinforcement Learning Control for Radar Detection and Tracking in Congested Spectral Environments

This work addresses dynamic non-cooperative coexistence between a cognitive pulsed radar and nearby communications systems by applying nonlinear value function approximation via deep reinforcement learning (Deep RL) to develop a policy for optimal radar performance. The radar learns to vary the band...

Full description

Saved in:
Bibliographic Details
Published inIEEE transactions on cognitive communications and networking Vol. 6; no. 4; pp. 1335 - 1349
Main Authors Thornton, Charles E., Kozy, Mark A., Buehrer, R. Michael, Martone, Anthony F., Sherbondy, Kelly D.
Format Journal Article
LanguageEnglish
Published Piscataway IEEE 01.12.2020
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects
Online AccessGet full text
ISSN2332-7731
2332-7731
DOI10.1109/TCCN.2020.3019605

Cover

Loading…
More Information
Summary:This work addresses dynamic non-cooperative coexistence between a cognitive pulsed radar and nearby communications systems by applying nonlinear value function approximation via deep reinforcement learning (Deep RL) to develop a policy for optimal radar performance. The radar learns to vary the bandwidth and center frequency of its linear frequency modulated (LFM) waveforms to mitigate interference with other systems for improved target detection performance while also sufficiently utilizing available frequency bands to achieve a fine range resolution. We demonstrate that this approach, based on the Deep <inline-formula> <tex-math notation="LaTeX">{Q} </tex-math></inline-formula>-Learning (DQL) algorithm, enhances several radar performance metrics more effectively than policy iteration or sense-and-avoid (SAA) approaches in several realistic coexistence environments. The DQL-based approach is also extended to incorporate Double <inline-formula> <tex-math notation="LaTeX">{Q} </tex-math></inline-formula>-learning and a recurrent neural network to form a Double Deep Recurrent <inline-formula> <tex-math notation="LaTeX">{Q} </tex-math></inline-formula>-Network (DDRQN), which yields favorable performance and stability compared to DQL and policy iteration. The practicality of the proposed scheme is demonstrated through experiments performed on a software defined radar (SDRadar) prototype system. Experimental results indicate that the proposed Deep RL approach significantly improves radar detection performance in congested spectral environments compared to policy iteration and SAA.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:2332-7731
2332-7731
DOI:10.1109/TCCN.2020.3019605