Provable Representation with Efficient Planning for Partial Observable Reinforcement Learning

In most real-world reinforcement learning applications, state information is only partially observable, which breaks the Markov decision process assumption and leads to inferior performance for algorithms that conflate observations with state. Partially Observable Markov Decision Processes (POMDPs),...

Full description

Saved in:

Bibliographic Details
Main Authors	Zhang, Hongming, Ren, Tongzheng, Xiao, Chenjun, Schuurmans, Dale, Dai, Bo
Format	Journal Article
Language	English
Published	20.11.2023
Subjects	Computer Science - Artificial Intelligence Computer Science - Learning Statistics - Machine Learning
Online Access	Get full text

Cover

Loading…

Be the first to leave a comment!