Clustering Neural Patterns in Kernel Reinforcement Learning Assists Fast Brain Control in Brain-Machine Interfaces

Bibliographic Details
Published in	IEEE Transactions on Neural Systems and Rehabilitation Engineering, Vol. 27, no. 9, pp. 1684-1694
Main Authors	Zhang, Xiang; Libedinsky, Camilo; So, Rosa; Principe, Jose C.; Wang, Yiwen
Format	Journal Article
Language	English
Published	United States: The Institute of Electrical and Electronics Engineers, Inc. (IEEE), 01.09.2019
Subjects	Algorithms; Animals; Attention; Brain; Brain-Computer Interfaces; Brain-machine interfaces (BMIs); Cluster Analysis; Clustering; Clustering algorithms; Computational neuroscience; Computer Simulation; Decoders; Decoding; Electrodes, Implanted; Haplorhini; Hilbert space; Interfaces; Jaccard distance; Kernel; Kernels; Learning; Machine Learning; Man-machine interfaces; Mapping; Motor Cortex - physiology; Movement - physiology; Neural Prostheses; Prosthetics; Reinforcement; Reinforcement learning; Reinforcement, Psychology; Task analysis; Training; Trajectory

More Information
Summary:	Neuroprostheses enable paralyzed people to control external devices purely through neural activity. Supervised learning decoders recalibrate or re-fit to reduce the discrepancy between the desired target and the decoder's output, but this correction may override the user's intention. Reinforcement learning (RL) decoders instead allow users to actively adjust their brain patterns through trial and error, which better reflects the subject's intent. The computational challenge is to establish a new state-action mapping quickly, before the subject becomes frustrated. The recently proposed quantized attention-gated kernel reinforcement learning (QAGKRL) explores the optimal nonlinear neural-action mapping in a Reproducing Kernel Hilbert Space (RKHS). However, considering all past data in the RKHS is inefficient and makes the decoder less sensitive to new neural patterns that emerge during brain control. In this paper, we propose a clustering-based kernel RL algorithm. New neural patterns are clustered as they emerge to represent novel knowledge in brain control. The current neural data activate only the nearest subspace in the RKHS for more efficient decoding. The dynamic clustering makes our algorithm more sensitive to new brain patterns. We test our algorithm on both synthetic and real-world spike data. Compared with QAGKRL, our algorithm achieves quicker knowledge adaptation in brain control with lower computational complexity.
ISSN:	1534-4320; 1558-0210
DOI:	10.1109/TNSRE.2019.2934176
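
The summary's central idea, that the current neural data activate only the nearest cluster of kernel units rather than all past data, can be illustrated with a toy sketch. This is not the paper's algorithm and not QAGKRL: it omits quantization and attention gating, and the Euclidean distance, the new_cluster_dist threshold, the learning rate, the running-mean centroid, and the error-driven weight rule are all assumptions chosen for illustration. The class and parameter names are hypothetical.

```python
import math


def gaussian_kernel(x, y, width=1.0):
    """Gaussian kernel between two equal-length feature vectors."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-sq_dist / (2.0 * width ** 2))


class ClusteredKernelDecoder:
    """Toy decoder: kernel units are grouped into clusters, and only the
    cluster nearest to the current input contributes to the action values."""

    def __init__(self, n_actions, new_cluster_dist=1.0, lr=0.5, width=1.0):
        self.n_actions = n_actions
        self.new_cluster_dist = new_cluster_dist  # assumed novelty threshold
        self.lr = lr                              # assumed learning rate
        self.width = width                        # assumed kernel width
        self.clusters = []  # each: {"centroid": [...], "units": [(x, weights)]}

    def _nearest(self, x):
        """Return (index, distance) of the closest centroid, or (None, inf)."""
        best, best_d = None, math.inf
        for i, cluster in enumerate(self.clusters):
            d = math.dist(x, cluster["centroid"])  # Euclidean, an assumption
            if d < best_d:
                best, best_d = i, d
        return best, best_d

    def action_values(self, x):
        """Sum kernel responses from the nearest cluster only."""
        values = [0.0] * self.n_actions
        idx, _ = self._nearest(x)
        if idx is None:
            return values
        for center, weights in self.clusters[idx]["units"]:
            k = gaussian_kernel(x, center, self.width)
            for a in range(self.n_actions):
                values[a] += weights[a] * k
        return values

    def update(self, x, action, reward):
        """Store x as a new kernel unit, weighted by the prediction error."""
        idx, d = self._nearest(x)
        novel = idx is None or d > self.new_cluster_dist
        predicted = 0.0 if novel else self.action_values(x)[action]
        weights = [0.0] * self.n_actions
        weights[action] = self.lr * (reward - predicted)
        if novel:
            # A pattern far from every centroid starts its own cluster.
            self.clusters.append({"centroid": list(x),
                                  "units": [(list(x), weights)]})
        else:
            cluster = self.clusters[idx]
            n = len(cluster["units"])
            # Running mean keeps the centroid at the average of its members.
            cluster["centroid"] = [(m * n + xi) / (n + 1)
                                   for m, xi in zip(cluster["centroid"], x)]
            cluster["units"].append((list(x), weights))


# Two well-separated synthetic "neural patterns", each rewarded for one action.
decoder = ClusteredKernelDecoder(n_actions=2)
decoder.update([0.0, 0.0], action=0, reward=1.0)
decoder.update([5.0, 5.0], action=1, reward=1.0)
print(len(decoder.clusters), decoder.action_values([0.1, 0.0]))
```

Because each query touches only the units in one cluster, the cost per decoding step in this sketch grows with the size of that cluster rather than with the full history, which is the kind of saving the summary attributes to activating only the nearest subspace.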