Deformable part models are convolutional neural networks

Deformable part models (DPMs) and convolutional neural networks (CNNs) are two widely used tools for visual recognition. They are typically viewed as distinct approaches: DPMs are graphical models (Markov random fields), while CNNs are "black-box" non-linear classifiers. In this paper, we...

Full description

Saved in:

Bibliographic Details
Published in	2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) pp. 437 - 446
Main Authors	Girshick, Ross, Iandola, Forrest, Darrell, Trevor, Malik, Jitendra
Format	Conference Proceeding Journal Article
Language	English
Published	IEEE 01.06.2015
Subjects	Algorithms Computer vision Convolution Deformation Detectors Feature extraction Formability Geometry Inference Inference algorithms Neural networks Object detection Pattern recognition Transforms
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Deformable part models (DPMs) and convolutional neural networks (CNNs) are two widely used tools for visual recognition. They are typically viewed as distinct approaches: DPMs are graphical models (Markov random fields), while CNNs are "black-box" non-linear classifiers. In this paper, we show that a DPM can be formulated as a CNN, thus providing a synthesis of the two ideas. Our construction involves unrolling the DPM inference algorithm and mapping each step to an equivalent CNN layer. From this perspective, it is natural to replace the standard image features used in DPMs with a learned feature extractor. We call the resulting model a DeepPyramid DPM and experimentally validate it on PASCAL VOC object detection. We find that DeepPyramid DPMs significantly outperform DPMs based on histograms of oriented gradients features (HOG) and slightly outperforms a comparable version of the recently introduced R-CNN detection system, while running significantly faster.
Bibliography:	ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Conference-1 ObjectType-Feature-3 content type line 23 SourceType-Conference Papers & Proceedings-2
ISSN:	1063-6919 1063-6919 2575-7075
DOI:	10.1109/CVPR.2015.7298641