A Real-Robot Dataset for Assessing Transferability of Learned Dynamics Models

In the context of model-based reinforcement learning and control, a large number of methods for learning system dynamics have been proposed in recent years. The purpose of these learned models is to synthesize new control policies. An important open question is how robust current dynamics-learning m...

Full description

Saved in:
Bibliographic Details
Published in2020 IEEE International Conference on Robotics and Automation (ICRA) pp. 8151 - 8157
Main Authors Agudelo-Espana, Diego, Zadaianchuk, Andrii, Wenk, Philippe, Garg, Aditya, Akpo, Joel, Grimminger, Felix, Viereck, Julian, Naveau, Maximilien, Righetti, Ludovic, Martius, Georg, Krause, Andreas, Scholkopf, Bernhard, Bauer, Stefan, Wuthrich, Manuel
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.05.2020
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:In the context of model-based reinforcement learning and control, a large number of methods for learning system dynamics have been proposed in recent years. The purpose of these learned models is to synthesize new control policies. An important open question is how robust current dynamics-learning methods are to shifts in the data distribution due to changes in the control policy. We present a real-robot dataset which allows to systematically investigate this question. This dataset contains trajectories of a 3 degrees-of-freedom (DOF) robot being controlled by a diverse set of policies. For comparison, we also provide a simulated version of the dataset. Finally, we benchmark a few widely-used dynamics-learning methods using the proposed dataset. Our results show that the iid test error of a learned model is not necessarily a good indicator of its accuracy under control policies different from the one which generated the training data. This suggests that it may be important to evaluate dynamics-learning methods in terms of their transfer performance, rather than only their iid error.
ISSN:2577-087X
DOI:10.1109/ICRA40945.2020.9197392