Optimization of data pre-processing methods for time-series classification of electroencephalography data

The performance of time-series classification of electroencephalographic data varies strongly across experimental paradigms and study participants. Reasons are task-dependent differences in neuronal processing and seemingly random variations between subjects, amongst others. The effect of data pre-p...

Full description

Saved in:
Bibliographic Details
Published inNetwork (Bristol) Vol. 34; no. 4; p. 374
Main Authors Anders, Christoph, Curio, Gabriel, Arnrich, Bert, Waterstraat, Gunnar
Format Journal Article
LanguageEnglish
Published England 02.10.2023
Subjects
Online AccessGet more information

Cover

Loading…
More Information
Summary:The performance of time-series classification of electroencephalographic data varies strongly across experimental paradigms and study participants. Reasons are task-dependent differences in neuronal processing and seemingly random variations between subjects, amongst others. The effect of data pre-processing techniques to ameliorate these challenges is relatively little studied. Here, the influence of spatial filter optimization methods and non-linear data transformation on time-series classification performance is analyzed by the example of high-frequency somatosensory evoked responses. This is a model paradigm for the analysis of high-frequency electroencephalography data at a very low signal-to-noise ratio, which emphasizes the differences of the explored methods. For the utilized data, it was found that the individual signal-to-noise ratio explained up to 74% of the performance differences between subjects. While data pre-processing was shown to increase average time-series classification performance, it could not fully compensate the signal-to-noise ratio differences between the subjects. This study proposes an algorithm to prototype and benchmark pre-processing pipelines for a paradigm and data set at hand. Extreme learning machines, Random Forest, and Logistic Regression can be used quickly to compare a set of potentially suitable pipelines. For subsequent classification, however, machine learning models were shown to provide better accuracy.
ISSN:1361-6536
DOI:10.1080/0954898X.2023.2263083