State-space Model Based Inverse Reinforcement Learning for Reward Function Estimation in Brain-machine Interfaces

Bibliographic Details
Published in: 2023 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), Vol. 2023, pp. 1-4
Main Authors: Tan, Jieyuan; Zhang, Xiang; Wu, Shenghui; Wang, Yiwen
Format: Conference Proceeding; Journal Article
Language: English
Published: United States, IEEE, 01.01.2023
ISSN: 2694-0604
DOI: 10.1109/EMBC40787.2023.10340953

Summary: Reinforcement learning (RL) is considered a promising method for neural decoding in brain-machine interfaces (BMIs). One key component of RL-based BMIs is the reward signal, which guides the decoder in updating its parameters. However, designing effective and efficient rewards can be challenging, especially for complex tasks. Inverse reinforcement learning (IRL) has been proposed as a way to estimate the internal reward function from subjects' neural activity. However, multi-channel neural activity, which may encode many sources of information, creates a high-dimensional state-action space, making it difficult to apply IRL methods directly in BMI systems. In this paper, we propose a state-space model based inverse Q-learning (SSM-IQL) method to improve the performance of the existing IRL method. The state-space model is designed to extract the hidden brain state from high-dimensional neural activity. We tested the proposed method on real data collected from rats during a two-lever discrimination task. Preliminary results show that SSM-IQL provides a more accurate and stable estimate of the internal reward function than the traditional IQL algorithm. This suggests that using a state-space model in IRL methods has the potential to improve the design of RL-based BMIs.