EGAN: an ensemble adversarial network for topology-preserving EEG data generation for predicting cognitive load levels

Learning representations and extracting meaningful patterns from electroencephalogram (EEG) recordings is critical for analyzing cognitive events (e.g., predicting cognitive load). The primary challenges include individual variability, technical noise from unreliable sensor-skin contacts, and rapid...

Full description

Saved in:
Bibliographic Details
Published inNeural computing & applications Vol. 37; no. 20; pp. 15923 - 15940
Main Authors Havugimana, Felix, Moinuddin, Kazi Ashraf, Yeasin, Mohammed
Format Journal Article
LanguageEnglish
Published London Springer London 01.07.2025
Springer Nature B.V
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Learning representations and extracting meaningful patterns from electroencephalogram (EEG) recordings is critical for analyzing cognitive events (e.g., predicting cognitive load). The primary challenges include individual variability, technical noise from unreliable sensor-skin contacts, and rapid temporal changes in the EEG recordings. Given the multi-factorial nature of the problems, deep learning models are natural choices for learning representations from the data. However, the extensive time required for data collection limits the number of subjects (samples) available, which is essential for building robust deep learning models. We introduce an ensemble generative adversarial network (EGAN) to generate high-fidelity EEG data. The EGAN generates multichannel EEG recordings and their spatial-spectral representation. Key design constraints were preserving topological structure and maintaining proper bias-variance trade-offs for building robust models. To ensure the quality of the synthetic data, we visually inspected the data generated by EGAN. We conducted spectral analyses to confirm that the quality and spectral similarity were comparable to EEG recordings. To illustrate the efficacy of data generated by EGAN, we developed a convolutional neural network (CNN) model to predict four levels of cognitive load. We used spatial-spectral representations (Topomap) from three frequency bands (i.e., θ , α , and β ). We trained our model on real data and a mixture of original and EGAN-generated data with varying proportions (e.g., 50%, 70%, 100%) of real data for training. We compared the performance of models trained solely on original data to those trained on mixed data. Our results indicate that CNN models trained on EGAN-supplemented data significantly outperformed those trained using only real data across all three frequency bands and all training set proportions. For example, the model trained on the mixed data with 50% real data achieved classification F1 and accuracy scores of approximately 71%, 69%, and  72% for θ , α , and β bands, respectively. The same model trained on only real data achieved approximately 52%, 57%, and 59% accuracy and F1 scores for the respective bands. These results demonstrate the utility of EGAN in generating high-fidelity EEG data, significantly advancing cognitive load analysis and modeling other cognitive events.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:0941-0643
1433-3058
DOI:10.1007/s00521-025-11346-8