Robust Semantic Template Matching Using a Superpixel Region Binary Descriptor

Almost all conventional template-matching methods employ low-level image features to measure the similarity between a template image and a scene image using similarity measures, such as pixel intensity and pixel gradient. Although these methods have been widely used in many applications, they cannot...

Full description

Saved in:
Bibliographic Details
Published inIEEE transactions on image processing Vol. 28; no. 6; pp. 3061 - 3074
Main Authors Yang, Hua, Huang, Chenghui, Wang, Feiyue, Song, Kaiyou, Yin, Zhouping
Format Journal Article
LanguageEnglish
Published United States IEEE 01.06.2019
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Almost all conventional template-matching methods employ low-level image features to measure the similarity between a template image and a scene image using similarity measures, such as pixel intensity and pixel gradient. Although these methods have been widely used in many applications, they cannot simultaneously address all types of robustness challenges. In this paper, with the goal of simultaneously addressing the various challenges, we present a robust semantic template-matching (RSTM) approach. Inspired by the local binary descriptor, we propose a novel superpixel region binary descriptor (SRBD) to construct a multilevel semantic fusion feature vector for RSTM. SRBD uses a new kernel-distance-based simple linear iterative clustering method to extract the stable superpixels from the template image. Then, based on the average intensity difference between each superpixel region and its neighbors, the dominant gradient orientation of each superpixel can be obtained, and the semantic features of each superpixel can be described as the dominant orientation difference vector, which is coded as the rotation-invariant SRBD. In the offline matching phase, the fusion semantic feature vector of RSTM combines the multilevel SRBD features with different numbers of superpixels. In the online matching phase, to cope with rotation invariance, a marginal probability model is proposed and applied to locate the positions of template images in the scene image. Moreover, to accelerate computation, an image pyramid is employed. We conduct a series of experiments on a large dataset randomly selected from the MS COCO dataset to fully analyze the robustness of this approach. The experimental results show that RSTM simultaneously addresses rotation changes, scale changes, noise, occlusions, blur, nonlinear illumination changes, and deformation with high time efficiency while also outperforming the previous state-of-the-art template-matching methods.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1057-7149
1941-0042
DOI:10.1109/TIP.2019.2893743