Temporal-spatial cross attention network for recognizing imagined characters

Previous research has primarily employed deep learning models such as Convolutional Neural Networks (CNNs), and Recurrent Neural Networks (RNNs) for decoding imagined character signals. These approaches have treated the temporal and spatial features of the signals in a sequential, parallel, or singl...

Full description

Saved in:

Bibliographic Details
Published in	Scientific reports Vol. 14; no. 1; pp. 15432 - 11
Main Authors	Xu, Mingyue, Zhou, Wenhui, Shen, Xingfa, Qiu, Junping, Li, Dingrui
Format	Journal Article
Language	English
Published	London Nature Publishing Group UK 04.07.2024 Nature Publishing Group Nature Portfolio
Subjects	631/1647 631/1647/48 Attention Attention - physiology Biochips Brain Brain - physiology Brain-Computer Interfaces Computer applications Deep Learning EEG Electroencephalography - methods Environmental law Firing pattern Humanities and Social Sciences Humans Imagination - physiology Implants multidisciplinary Neural networks Neural Networks, Computer Science Science (multidisciplinary) Signal Processing, Computer-Assisted Spatial discrimination learning Temporal variations
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Previous research has primarily employed deep learning models such as Convolutional Neural Networks (CNNs), and Recurrent Neural Networks (RNNs) for decoding imagined character signals. These approaches have treated the temporal and spatial features of the signals in a sequential, parallel, or single-feature manner. However, there has been limited research on the cross-relationships between temporal and spatial features, despite the inherent association between channels and sampling points in Brain-Computer Interface (BCI) signal acquisition, which holds significant information about brain activity. To address the limited research on the relationships between temporal and spatial features, we proposed a Temporal-Spatial Cross-Attention Network model, named TSCA-Net. The TSCA-Net is comprised of four modules: the Temporal Feature (TF), the Spatial Feature (SF), the Temporal-Spatial Cross (TSCross), and the Classifier. The TF combines LSTM and Transformer to extract temporal features from BCI signals, while the SF captures spatial features. The TSCross is introduced to learn the correlations between the temporal and spatial features. The Classifier predicts the label of BCI data based on its characteristics. We validated the TSCA-Net model using publicly available datasets of handwritten characters, which recorded the spiking activity from two micro-electrode arrays (MEAs). The results showed that our proposed TSCA-Net outperformed other comparison models (EEG-Net, EEG-TCNet, S3T, GRU, LSTM, R-Transformer, and ViT) in terms of accuracy, precision, recall, and F1 score, achieving 92.66 % , 92.77 % , 92.70 % , and 92.58 % , respectively. The TSCA-Net model demonstrated a 3.65 % to 7.49 % improvement in accuracy over the comparison models.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23
ISSN:	2045-2322 2045-2322
DOI:	10.1038/s41598-024-59263-5