Meta-inverse Reinforcement Learning Method Based on Relative Entropy

To address the problem that traditional inverse reinforcement learning algorithms are slow, imprecise, or even unable to solve for the reward function owing to insufficient expert demonstration samples and unknown state transition probabilities, a meta-reinforcement learning method based on relative...

Bibliographic Details
Published in	Ji suan ji ke xue Vol. 48; no. 9; pp. 257 - 263
Main Authors	WU Shao-bo, FU Qi-ming, CHEN Jian-ping, WU Hong-jie, LU You
Format	Journal Article
Language	Chinese
Published	Editorial office of Computer Science 01.09.2021
Subjects	inverse reinforcement learning, meta-learning, reward function, relative entropy, gradient descent
