Exploiting visual quasi-periodicity for real-time chewing event detection using active appearance models and support vector machines

Steady increases in healthcare costs and obesity have inspired recent studies into cost-effective, assistive systems capable of monitoring dietary habits. Few researchers, though, have investigated the use of video as a means of monitoring dietary activities. Video possesses several inherent qualiti...

Full description

Saved in:

Bibliographic Details
Published in	Personal and ubiquitous computing Vol. 16; no. 6; pp. 729 - 739
Main Authors	Cadavid, Steven, Abdel-Mottaleb, Mohamed, Helal, Abdelsalam
Format	Journal Article
Language	English
Published	London Springer-Verlag 01.08.2012 Springer Nature B.V
Subjects	Behavior Chewing Computer Science Diet Mathematical models Mobile Computing Monitoring Monitoring systems Original Article Personal Computing Power spectra Reduction Software Support vector machines Surveillance Talking Trains User Interfaces and Human Computer Interaction Support vector machines Manifold learning Behavior detection Dietary monitoring Active appearance models
Online Access	Get full text
ISSN	1617-4909 1617-4917
DOI	10.1007/s00779-011-0425-x

Cover

Loading…

More Information
Summary:	Steady increases in healthcare costs and obesity have inspired recent studies into cost-effective, assistive systems capable of monitoring dietary habits. Few researchers, though, have investigated the use of video as a means of monitoring dietary activities. Video possesses several inherent qualities, such as passive acquisition, that merits its analysis as an input modality for such an application. To this end, we propose a method to automatically detect chewing events in surveillance video of a subject. Firstly, an Active Appearance Model (AAM) is used to track a subject’s face across the video sequence. It is observed that the variations in the AAM parameters across chewing events demonstrate a distinct periodicity. We utilize this property to discriminate between chewing and non-chewing facial actions such as talking. A feature representation is constructed by applying spectral analysis to a temporal window of model parameter values. The estimated power spectra subsequently undergo non-linear dimensionality reduction. The low-dimensional embedding of the power spectra are employed to train a binary Support Vector Machine classifier to detect chewing events. To emulate the gradual onset and offset of chewing, smoothness is imposed over the class predictions of neighboring video frames in order to deter abrupt changes in the class labels. Experiments are conducted on a dataset consisting of 37 subjects performing each of five actions, namely, open- and closed-mouth chewing, clutter faces, talking, and still face. Experimental results yielded a cross-validated percentage agreement of 93.0%, indicating that the proposed system provides an efficient approach to automated chewing detection.
Bibliography:	SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 14 ObjectType-Article-1 ObjectType-Feature-2 content type line 23
ISSN:	1617-4909 1617-4917
DOI:	10.1007/s00779-011-0425-x