Biomedical time series clustering based on non-negative sparse coding and probabilistic topic model

Abstract Biomedical time series clustering that groups a set of unlabelled temporal signals according to their underlying similarity is very useful for biomedical records management and analysis such as biosignals archiving and diagnosis. In this paper, a new framework for clustering of long-term bi...

Full description

Saved in:
Bibliographic Details
Published inComputer methods and programs in biomedicine Vol. 111; no. 3; pp. 629 - 641
Main Authors Wang, Jin, Liu, Ping, F.H.She, Mary, Nahavandi, Saeid, Kouzani, Abbas
Format Journal Article
LanguageEnglish
Published Kidlington Elsevier Ireland Ltd 01.09.2013
Elsevier
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Abstract Biomedical time series clustering that groups a set of unlabelled temporal signals according to their underlying similarity is very useful for biomedical records management and analysis such as biosignals archiving and diagnosis. In this paper, a new framework for clustering of long-term biomedical time series such as electrocardiography (ECG) and electroencephalography (EEG) signals is proposed. Specifically, local segments extracted from the time series are projected as a combination of a small number of basis elements in a trained dictionary by non-negative sparse coding. A Bag-of-Words (BoW) representation is then constructed by summing up all the sparse coefficients of local segments in a time series. Based on the BoW representation, a probabilistic topic model that was originally developed for text document analysis is extended to discover the underlying similarity of a collection of time series. The underlying similarity of biomedical time series is well captured attributing to the statistic nature of the probabilistic topic model. Experiments on three datasets constructed from publicly available EEG and ECG signals demonstrates that the proposed approach achieves better accuracy than existing state-of-the-art methods, and is insensitive to model parameters such as length of local segments and dictionary size.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0169-2607
1872-7565
DOI:10.1016/j.cmpb.2013.05.022