On Practical Robust Reinforcement Learning: Adjacent Uncertainty Set and Double-Agent Algorithm
Robust reinforcement learning (RRL) aims to seek a robust policy by optimizing the worst case performance over an uncertainty set. This set contains some perturbed Markov decision processes (MDPs) from a nominal MDP (N-MDP) that generate samples for training, which reflects some potential mismatches...
Saved in:
Published in | IEEE transaction on neural networks and learning systems Vol. 36; no. 4; pp. 7696 - 7710 |
---|---|
Main Authors | , |
Format | Journal Article |
Language | English |
Published |
United States
IEEE
01.04.2025
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Be the first to leave a comment!