VideoLSTM convolves, attends and flows for action recognition
• To exploit both the spatial and temporal correlations in a video, we hardwire convolutions in the soft-Attention LSTM architecture.
• We introduce motion-based attention, which better guides the attention towards the relevant spatial-temporal locations of the actions.
• We demonstrate how the attention...
Published in | Computer vision and image understanding Vol. 166; pp. 41 - 50 |
---|---|
Main Authors | , , , , |
Format | Journal Article |
Language | English |
Published | Elsevier Inc, 01.01.2018 |