View and scale invariant action recognition using multiview shape-flow models

Actions in real world applications typically take place in cluttered environments with large variations in the orientation and scale of the actor. We present an approach to simultaneously track and recognize known actions that is robust to such variations, starting from a person detection in the sta...

Full description

Saved in:

Bibliographic Details
Published in	2008 IEEE Conference on Computer Vision and Pattern Recognition pp. 1 - 8
Main Authors	Natarajan, P., Nevatia, R.
Format	Conference Proceeding
Language	English Japanese
Published	IEEE 01.06.2008
Subjects	Hidden Markov models Humans Image motion analysis Image recognition Intelligent robots Optical computing Pattern recognition Robustness Shape Videos
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Actions in real world applications typically take place in cluttered environments with large variations in the orientation and scale of the actor. We present an approach to simultaneously track and recognize known actions that is robust to such variations, starting from a person detection in the standing pose. In our approach we first render synthetic poses from multiple viewpoints using Mocap data for known actions and represent them in a conditional random field (CRF) whose observation potentials are computed using shape similarity and the transition potentials are computed using optical flow. We enhance these basic potentials with terms to represent spatial and temporal constraints and call our enhanced model the shape, flow, duration-conditional random field (SFD-CRF). We find the best sequence of actions using Viterbi search in the SFD-CRF. We demonstrate our approach on videos from multiple viewpoints and in the presence of background clutter.
ISBN:	9781424422425 1424422426
ISSN:	1063-6919
DOI:	10.1109/CVPR.2008.4587716