On Practical Robust Reinforcement Learning: Adjacent Uncertainty Set and Double-Agent Algorithm

Robust reinforcement learning (RRL) aims to seek a robust policy by optimizing the worst case performance over an uncertainty set. This set contains some perturbed Markov decision processes (MDPs) from a nominal MDP (N-MDP) that generate samples for training, which reflects some potential mismatches...

Full description

Saved in:
Bibliographic Details
Published inIEEE transaction on neural networks and learning systems Vol. 36; no. 4; pp. 7696 - 7710
Main Authors Hwang, Ukjo, Hong, Songnam
Format Journal Article
LanguageEnglish
Published United States IEEE 01.04.2025
Subjects
Online AccessGet full text

Cover

Loading…