Meta-inverse Reinforcement Learning Method Based on Relative Entropy

Aiming at the problem that traditional inverse reinforcement learning algorithms are slow,imprecise,or even unsolvable when solving the reward function owing to insufficient expert demonstration samples and unknown state transition probabilitie,a meta-reinforcement learning method based on relative...

Full description

Saved in:
Bibliographic Details
Published inJi suan ji ke xue Vol. 48; no. 9; pp. 257 - 263
Main Author WU Shao-bo, FU Qi-ming, CHEN Jian-ping, WU Hong-jie, LU You
Format Journal Article
LanguageChinese
Published Editorial office of Computer Science 01.09.2021
Subjects
Online AccessGet full text

Cover

Loading…