EEGPT: Unleashing the Potential of EEG Generalist Foundation Model by Autoregressive Pre-training
Electroencephalogram (EEG) signals are pivotal in providing insights into spontaneous brain activity, highlighting their significant importance in neuroscience research. However, the exploration of versatile EEG models is constrained by diverse data formats, outdated pre-training paradigms, and limi...
Saved in:
Main Authors | , , , , , , |
---|---|
Format | Journal Article |
Language | English |
Published |
14.10.2024
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | Electroencephalogram (EEG) signals are pivotal in providing insights into
spontaneous brain activity, highlighting their significant importance in
neuroscience research. However, the exploration of versatile EEG models is
constrained by diverse data formats, outdated pre-training paradigms, and
limited transfer learning methods, only leading to specialist models on single
dataset. In this paper, we introduce EEGPT, the first generalist EEG foundation
model designed to address these challenges. First, we propose an electrode-wise
modeling strategy that treats each electrode as a fundamental unit, enabling
the integration of diverse EEG datasets collected from up to 138 electrodes,
amassing 37.5M pre-training samples. Second, we develop the first
autoregressive EEG pre-trained model, moving away from traditional masked
autoencoder approaches to a next signal prediction task that better captures
the sequential and temporal dependencies of EEG data. We also explore scaling
laws with model up to 1.1B parameters: the largest in EEG research to date.
Third, we introduce a multi-task transfer learning paradigm using a learnable
electrode graph network shared across tasks, which for the first time confirms
multi-task compatibility and synergy. As the first generalist EEG foundation
model, EEGPT shows broad compatibility with various signal acquisition devices,
subjects, and tasks. It supports up to 138 electrodes and any combination
thereof as input. Furthermore, we simultaneously evaluate it on 5 distinct
tasks across 12 benchmarks. EEGPT consistently outperforms existing specialist
models across all downstream tasks, with its effectiveness further validated
through extensive ablation studies. This work sets a new direction for
generalist EEG modeling, offering improved scalability, transferability, and
adaptability for a wide range of EEG applications. The code and models will be
released. |
---|---|
DOI: | 10.48550/arxiv.2410.19779 |