Timestamp-Supervised Action Segmentation with Graph Convolutional Networks

We introduce a novel approach for temporal activity segmentation with timestamp supervision. Our main contribution is a graph convolutional network, which is learned in an end-to-end manner to exploit both frame features and connections between neighboring frames to generate dense framewise labels f...

Full description

Saved in:
Bibliographic Details
Published in2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) pp. 10619 - 10626
Main Authors Khan, Hamza, Haresh, Sanjay, Ahmed, Awais, Siddiqui, Shakeeb, Konin, Andrey, Zia, M. Zeeshan, Tran, Quoc-Huy
Format Conference Proceeding
LanguageEnglish
Published IEEE 23.10.2022
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:We introduce a novel approach for temporal activity segmentation with timestamp supervision. Our main contribution is a graph convolutional network, which is learned in an end-to-end manner to exploit both frame features and connections between neighboring frames to generate dense framewise labels from sparse timestamp labels. The gener-ated dense framewise labels can then be used to train the segmentation model. In addition, we propose a framework for alternating learning of both the segmentation model and the graph convolutional model, which first initializes and then iteratively refines the learned models. Detailed experiments on four public datasets, including 50 Salads, GTEA, Breakfast, and Desktop Assembly, show that our method is superior to the multi-layer perceptron baseline, while performing on par with or better than the state of the art in temporal activity segmentation with timestamp supervision.
ISSN:2153-0866
DOI:10.1109/IROS47612.2022.9981351