Learning motor primitives and training a machine learning system using a linear-feedback-stabilized policy

A computer-implemented method of training a student machine learning system comprises receiving data indicating execution of an expert, determining one or more actions performed by the expert during the execution and a corresponding state-action Jacobian, and training the student machine learning sy...

Full description

Saved in:
Bibliographic Details
Main Authors Merel, Joshua, Hasenclever, Leonard, Galashov, Alexandre, Pham, Vu
Format Patent
LanguageEnglish
Published 01.08.2023
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:A computer-implemented method of training a student machine learning system comprises receiving data indicating execution of an expert, determining one or more actions performed by the expert during the execution and a corresponding state-action Jacobian, and training the student machine learning system using a linear-feedback-stabilized policy. The linear-feedback-stabilized policy may be based on the state-action Jacobian. Also a neural network system for representing a space of probabilistic motor primitives, implemented by one or more computers. The neural network system comprises an encoder configured to generate latent variables based on a plurality of inputs, each input comprising a plurality of frames, and a decoder configured to generate an action based on one or more of the latent variables and a state.
Bibliography:Application Number: US202217872308