Co-regularized deep representations for video summarization

Compact keyframe-based video summaries are a popular way of generating viewership on video sharing platforms. Yet, creating relevant and compelling summaries for arbitrarily long videos with a small number of keyframes is a challenging task. We propose a comprehensive keyframe-based summarization fr...

Full description

Saved in:

Bibliographic Details
Published in	2015 IEEE International Conference on Image Processing (ICIP) pp. 3165 - 3169
Main Authors	Morere, Olivier, Goh, Hanlin, Veillard, Antoine, Chandrasekhar, Vijay, Jie Lin
Format	Conference Proceeding
Language	English
Published	IEEE 01.09.2015
Subjects	co-regularized restricted Boltzmann machines deep convolutional neural networks Mathematical model Neural networks Oceans Planets Training Video summarization Visualization
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Compact keyframe-based video summaries are a popular way of generating viewership on video sharing platforms. Yet, creating relevant and compelling summaries for arbitrarily long videos with a small number of keyframes is a challenging task. We propose a comprehensive keyframe-based summarization framework combining deep convolutional neural networks and restricted Boltzmann machines. An original co-regularization scheme is used to discover meaningful subject-scene associations. The resulting multimodal representations are then used to select highly-relevant keyframes. A comprehensive user study is conducted comparing our proposed method to a variety of schemes, including the summarization currently in use by one of the most popular video sharing websites. The results show that our method consistently outperforms the baseline schemes for any given amount of keyframes both in terms of attractiveness and in-formativeness. The lead is even more significant for smaller summaries.
DOI:	10.1109/ICIP.2015.7351387