STACKED CONVOLUTIONAL LONG SHORT-TERM MEMORY FOR MODEL-FREE REINFORCEMENT LEARNING

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for controlling an agent interacting with an environment. One of the methods includes obtaining a representation of an observation; processing the representation using a convolutional long short-term memo...

Full description

Saved in:
Bibliographic Details
Main Authors GREGOR, Karol, GUEZ, Arthur Clement, KABRA, Rishabh, MIRZA MOHAMMADI, Mehdi
Format Patent
LanguageEnglish
French
German
Published 10.03.2021
Subjects
Online AccessGet full text

Cover

Loading…