STACKED CONVOLUTIONAL LONG SHORT-TERM MEMORY FOR MODEL-FREE REINFORCEMENT LEARNING
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for controlling an agent interacting with an environment. One of the methods includes obtaining a representation of an observation; processing the representation using a convolutional long short-term memo...
Saved in:
Main Authors | , , , |
---|---|
Format | Patent |
Language | English French German |
Published |
10.03.2021
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Be the first to leave a comment!