State-space Model Based Inverse Reinforcement Learning for Reward Function Estimation in Brain-machine Interfaces

The use of reinforcement learning (RL) in brain machine interfaces (BMIs) is considered to be a promising method for neural decoding. One key component of RL-based BMIs is the reward signal, which is used to guide decoders to update the parameters. However, designing effective and efficient rewards...

Full description

Saved in:

Bibliographic Details
Published in	2023 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC) Vol. 2023; pp. 1 - 4
Main Authors	Tan, Jieyuan, Zhang, Xiang, Wu, Shenghui, Wang, Yiwen
Format	Conference Proceeding Journal Article
Language	English
Published	United States IEEE 01.01.2023
Subjects	Animals Biological system modeling Brain Brain machine interface Brain modeling Brain-Computer Interfaces Estimation Humans inverse reinforcement learning Learning Neural activity Q-learning Rats Reinforcement, Psychology Reward State-space methods state-space model
Online Access	Get full text
ISSN	2694-0604
DOI	10.1109/EMBC40787.2023.10340953

Cover

Loading…

More Information
Summary:	The use of reinforcement learning (RL) in brain machine interfaces (BMIs) is considered to be a promising method for neural decoding. One key component of RL-based BMIs is the reward signal, which is used to guide decoders to update the parameters. However, designing effective and efficient rewards can be challenging, especially for complex tasks. Inverse reinforcement learning (IRL) is a method that has been proposed to estimate the internal reward function from subjects' neural activity. However, multi-channel neural activity, which may encode many sources of information, builds a large dimensions of state-action space, making it difficult to directly apply IRL methods in BMI systems. In this paper, we propose a state-space model based inverse Q-learning (SSM-IQL) method to improve the performance of the existing IRL method. The state-space model is designed to extract hidden brain state from high-dimensional neural activity. We tested the proposed method on real data collected from rats during a two-lever discrimination task. Preliminary results show that SSM-IQL provides a more accurate and stable estimation of the internal reward function than the traditional IQL algorithm. This suggests that the use of state-space model in IRL method has potential to improve the design of RL-based BMIs.
ISSN:	2694-0604
DOI:	10.1109/EMBC40787.2023.10340953