IMSFNet: integrated multi-source feature network for salient object detection

Multi-scale context features are conducive to image understanding, so it plays an important role in salient object detection (SOD) tasks, and contextual information-based SOD methods have achieved fine performance. However, the context information obtained through parallel independent convolutions w...

Full description

Saved in:
Bibliographic Details
Published inApplied intelligence (Dordrecht, Netherlands) Vol. 53; no. 19; pp. 22228 - 22248
Main Authors Xia, Chenxing, Sun, Yanguang, Fang, Xianjin, Ge, Bin, Gao, Xiuju, Li, Kuan-Ching
Format Journal Article
LanguageEnglish
Published New York Springer US 01.10.2023
Springer Nature B.V
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Multi-scale context features are conducive to image understanding, so it plays an important role in salient object detection (SOD) tasks, and contextual information-based SOD methods have achieved fine performance. However, the context information obtained through parallel independent convolutions with large kernels or dilated convolutions with different dilation rates is lack relevance and dependence at different scales, which limits the expressive ability of context information. In this article, we propose a novel Integrated Multi-Source Feature Network (IMSFNet) for accurate SOD task, which mainly consists of three components. Specifically, we first develop a multi-scale feature aggregation module (MSFAM) to adequately capture and utilize multi-scale context features through a series of well-designed dilated convolutions and short hierarchical connections, and then aggregate these information to improve the performance of input initial features. Subsequently, based on the extracted high-level features, we introduce a global feature extractor (GFE) to further excavate higher-level global semantic information to help locate salient objects from cluttered real-world scenes. Finally, a correlation feature interaction module (CFIM) is designed to interact the diverse information from different level features, reducing the interference of complex backgrounds and highlighting salient objects. Extensive experimental results on six public SOD benchmark datasets convincingly demonstrate the effectiveness and superiority of the proposed IMSFNet method against the 18 state-of-the-art SOD methods under different evaluation metrics.
ISSN:0924-669X
1573-7497
DOI:10.1007/s10489-023-04636-y