Compensated Attention Feature Fusion and Hierarchical Multiplication Decoder Network for RGB-D Salient Object Detection

Multi-modal feature fusion and effectively exploiting high-level semantic information are critical in salient object detection (SOD). However, the depth maps complementing RGB image fusion strategies cannot supply effective semantic information when the object is not salient in the depth maps. Furth...

Full description

Saved in:

Bibliographic Details
Published in	Remote sensing (Basel, Switzerland) Vol. 15; no. 9; p. 2393
Main Authors	Zeng, Zhihong, Liu, Haijun, Chen, Fenglei, Tan, Xiaoheng
Format	Journal Article
Language	English
Published	Basel MDPI AG 01.05.2023
Subjects	Alliances Complementarity Computer vision Contours Design hierarchical multiplication decoder Image retrieval Modules multi-modal feature fusion Object recognition Remote sensing RGB-D saliency detection Salience Semantics
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Multi-modal feature fusion and effectively exploiting high-level semantic information are critical in salient object detection (SOD). However, the depth maps complementing RGB image fusion strategies cannot supply effective semantic information when the object is not salient in the depth maps. Furthermore, most existing (UNet-based) methods cannot fully exploit high-level abstract features to guide low-level features in a coarse-to-fine fashion. In this paper, we propose a compensated attention feature fusion and hierarchical multiplication decoder network (CAF-HMNet) for RGB-D SOD. Specifically, we first propose a compensated attention feature fusion module to fuse multi-modal features based on the complementarity between depth and RGB features. Then, we propose a hierarchical multiplication decoder to refine the multi-level features from top down. Additionally, a contour-aware module is applied to enhance object contour. Experimental results show that our model achieves satisfactory performance on five challenging SOD datasets, including NJU2K, NLPR, STERE, DES, and SIP, which verifies the effectiveness of the proposed CAF-HMNet.
ISSN:	2072-4292 2072-4292
DOI:	10.3390/rs15092393