AC-VRNN: Attentive Conditional-VRNN for multi-future trajectory prediction

Anticipating human motion in crowded scenarios is essential for developing intelligent transportation systems, social-aware robots and advanced video surveillance applications. A key component of this task is represented by the inherently multi-modal nature of human paths which makes socially accept...

Full description

Saved in:
Bibliographic Details
Published inComputer vision and image understanding Vol. 210; p. 103245
Main Authors Bertugli, Alessia, Calderara, Simone, Coscia, Pasquale, Ballan, Lamberto, Cucchiara, Rita
Format Journal Article
LanguageEnglish
Published Elsevier Inc 01.09.2021
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Anticipating human motion in crowded scenarios is essential for developing intelligent transportation systems, social-aware robots and advanced video surveillance applications. A key component of this task is represented by the inherently multi-modal nature of human paths which makes socially acceptable multiple futures when human interactions are involved. To this end, we propose a generative architecture for multi-future trajectory predictions based on Conditional Variational Recurrent Neural Networks (C-VRNNs). Conditioning mainly relies on prior belief maps, representing most likely moving directions and forcing the model to consider past observed dynamics in generating future positions. Human interactions are modelled with a graph-based attention mechanism enabling an online attentive hidden state refinement of the recurrent estimation. To corroborate our model, we perform extensive experiments on publicly-available datasets (e.g., ETH/UCY, Stanford Drone Dataset, STATS SportVU NBA, Intersection Drone Dataset and TrajNet++) and demonstrate its effectiveness in crowded scenes compared to several state-of-the-art methods. •Multi-future trajectory predictions in crowded scenarios are considered.•We propose a model based on Conditional Variational Recurrent Neural Networks.•Prior belief maps steer predictions mimicking human behaviours.•An attentive-based graph neural network models interactions among pedestrians.•We outperform state-of-the-art methods on several trajectory prediction benchmarks.
ISSN:1077-3142
1090-235X
DOI:10.1016/j.cviu.2021.103245