A diffusion model for inertial based time series generation on scarce data availability to improve human activity recognition

The domain of human activity recognition is able to differentiate between human movements based on sensory driven systems, e.g. in the form of an IMU. Though, in order to perform those differentiation tasks, a measurement setup has to be established and subjects have to be recorded. As this is a tim...

Full description

Saved in:
Bibliographic Details
Published inScientific reports Vol. 15; no. 1; pp. 16841 - 15
Main Authors Oppel, Heiko, Munz, Michael
Format Journal Article
LanguageEnglish
Published London Nature Publishing Group UK 15.05.2025
Nature Publishing Group
Nature Portfolio
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The domain of human activity recognition is able to differentiate between human movements based on sensory driven systems, e.g. in the form of an IMU. Though, in order to perform those differentiation tasks, a measurement setup has to be established and subjects have to be recorded. As this is a time and cost consuming process, research groups are focused to synthetically generate data resembling human movements to improve the underlying recognition task. So far, work groups are able to generate univariate and multivariate synthetic sequences on basis of an accelerometer or six axis IMU. Yet, they lack in generalizing on unseen subjects and are not able to expand further than a single six axis IMU. In this paper, we aim to fill this gap by using the backbone of a denoising diffusion probabilistic model from the vision domain to synthetically generate multiple IMUs which are able to generalize on unseen participants. The model was adapted to fulfill the criteria of generating meaningful human motion sequences. We then evaluated the quality of the data in two ways: (1) by a subjective visual analysis with the help of a clustering approach new to this domain and (2) by the classifier improvement when adding synthetic samples. The results show a significant improvement in the classification task when synthetic samples were added to the pool of training data. One of the key findings is the benefit of improvement, even in a scarce data set of only 2 samples per subject. This is a huge advantage in the domain of HAR as it reduces the time of a subject to perform a task.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ISSN:2045-2322
2045-2322
DOI:10.1038/s41598-025-01614-x