Target-Specific Action Classification for Automated Assessment of Human Motor Behavior from Video

Objective monitoring and assessment of human motor behavior can improve the diagnosis and management of several medical conditions. Over the past decade, significant advances have been made in the use of wearable technology for continuously monitoring human motor behavior in free-living conditions....

Full description

Saved in:

Bibliographic Details
Published in	Sensors (Basel, Switzerland) Vol. 19; no. 19; p. 4266
Main Authors	Rezaei, Behnaz, Christakis, Yiorgos, Ho, Bryan, Thomas, Kevin, Erb, Kelley, Ostadabbas, Sarah, Patel, Shyamal
Format	Journal Article
Language	English
Published	Switzerland MDPI AG 01.10.2019 MDPI
Subjects	action classification Algorithms Artificial intelligence Automation Behavior Changing environments Classification Computer vision deep learning Human influences human motor behavior Humans Image Processing, Computer-Assisted Monitoring Monitoring, Physiologic Motor Activity - physiology Neural networks Neural Networks, Computer Parkinson's disease pose tracking Video data Video Recording - methods Wearable technology action classification deep learning pose tracking human motor behavior computer vision
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Objective monitoring and assessment of human motor behavior can improve the diagnosis and management of several medical conditions. Over the past decade, significant advances have been made in the use of wearable technology for continuously monitoring human motor behavior in free-living conditions. However, wearable technology remains ill-suited for applications which require monitoring and interpretation of complex motor behaviors (e.g., involving interactions with the environment). Recent advances in computer vision and deep learning have opened up new possibilities for extracting information from video recordings. In this paper, we present a hierarchical vision-based behavior phenotyping method for classification of basic human actions in video recordings performed using a single RGB camera. Our method addresses challenges associated with tracking multiple human actors and classification of actions in videos recorded in changing environments with different fields of view. We implement a cascaded pose tracker that uses temporal relationships between detections for short-term tracking and appearance based tracklet fusion for long-term tracking. Furthermore, for action classification, we use pose evolution maps derived from the cascaded pose tracker as low-dimensional and interpretable representations of the movement sequences for training a convolutional neural network. The cascaded pose tracker achieves an average accuracy of 88% in tracking the target human actor in our video recordings, and overall system achieves average test accuracy of 84% for target-specific action classification in untrimmed video recordings.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23
ISSN:	1424-8220 1424-8220
DOI:	10.3390/s19194266