ELSIM: End-to-End Learning of Reusable Skills Through Intrinsic Motivation

Bibliographic Details
Published in	Machine Learning and Knowledge Discovery in Databases, Vol. 12458, pp. 541-556
Main Authors	Aubret, Arthur; Matignon, Laetitia; Hassas, Salima
Format	Book Chapter
Language	English
Published	Switzerland: Springer International Publishing AG, 2021
Series	Lecture Notes in Computer Science
Subjects	Curriculum learning; Developmental learning; Intrinsic motivation; Reinforcement learning
ISBN	3030676609 9783030676605
ISSN	0302-9743 1611-3349
DOI	10.1007/978-3-030-67661-2_32

Summary:	Taking inspiration from developmental learning, we present a novel reinforcement learning architecture that hierarchically learns and represents self-generated skills in an end-to-end way. With this architecture, an agent focuses only on task-rewarded skills while keeping the skill-learning process bottom-up. This bottom-up approach allows the agent to learn skills that (1) are transferable across tasks and (2) improve exploration when rewards are sparse. To do so, we combine a previously defined mutual information objective with a novel curriculum learning algorithm, creating an unlimited and explorable tree of skills. We test our agent on simple gridworld environments to understand and visualize how it distinguishes between its skills. We then show that our approach scales to more difficult MuJoCo environments, in which our agent builds a representation of skills that improves both transfer learning and exploration over a baseline when rewards are sparse.
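
Illustration (not taken from the chapter): the summary mentions a mutual information objective and an expandable tree of skills without giving their exact form. The minimal Python sketch below shows one common way such ideas are expressed, under stated assumptions: a skill is the path of letters from the root of a tree, and the intrinsic reward is the variational term log q(g | s) - log p(g) with a uniform prior over skills. The names `SkillNode`, `expand`, and `intrinsic_reward`, the two-letter vocabulary, and the reward form are all hypothetical choices made for this example and may differ from the authors' formulation.

```python
import math
from dataclasses import dataclass, field


@dataclass
class SkillNode:
    """Hypothetical tree node; a skill is the string of letters on the path from the root."""
    name: str = ""
    children: dict = field(default_factory=dict)

    def expand(self, vocabulary=("0", "1")):
        """Refine this skill by adding one child per letter (assumed vocabulary)."""
        for letter in vocabulary:
            self.children.setdefault(letter, SkillNode(self.name + letter))
        return list(self.children.values())


def intrinsic_reward(discriminator_probs, skill_index):
    """Assumed reward: log q(g | s) - log p(g), with p(g) uniform over the candidate skills.

    discriminator_probs holds q(g | s) for each candidate skill at the current state.
    """
    uniform_prior = 1.0 / len(discriminator_probs)
    return math.log(discriminator_probs[skill_index]) - math.log(uniform_prior)


# Build a small tree: the root has skills "0" and "1"; skill "0" is refined further.
root = SkillNode()
left, right = root.expand()
left.expand()

# A skill that the discriminator recognizes well earns a positive reward.
reward_distinct = intrinsic_reward([0.9, 0.1], 0)
# A skill indistinguishable from its sibling earns zero reward.
reward_ambiguous = intrinsic_reward([0.5, 0.5], 0)
```

In this sketch, expanding only the nodes that prove useful is what keeps the tree "explorable" without fixing the number of skills in advance; how the chapter decides which nodes to expand is not stated in the summary and is left out here.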