EFRNet: Edge feature refinement network for real-time semantic segmentation of driving scenes

In the semantic segmentation field, the dual-branch structure is a highly effective segmentation model. However, the frequent downsampling in the semantic branch reduces the accuracy of features expression with increasing network depth, resulting in suboptimal segmentation performance. To address th...

Full description

Saved in:

Bibliographic Details
Published in	Digital signal processing Vol. 156; p. 104791
Main Authors	Hou, Zhiqiang, Qu, Minjie, Cheng, Minjie, Ma, Sugang, Wang, Yunchen, Yang, Xiaobao
Format	Journal Article
Language	English
Published	Elsevier Inc 01.01.2025
Subjects	Detailed information Dual-branch structure Edge refinement Multi-scale context information Real-time semantic segmentation Detailed information Dual-branch structure Multi-scale context information Edge refinement Real-time semantic segmentation
Online Access	Get full text

Cover

Loading…

More Information
Summary:	In the semantic segmentation field, the dual-branch structure is a highly effective segmentation model. However, the frequent downsampling in the semantic branch reduces the accuracy of features expression with increasing network depth, resulting in suboptimal segmentation performance. To address the above issues, this paper proposes a real-time semantic segmentation network based on Edge Feature Refinement (Edge Feature Refinement Network, EFRNet). A dual-branch structure is used in the encoder. To enhance the accuracy of deep features expression in the network, an edge refinement module (ERM) is designed in the dual-branch interaction stage to refine the features of the two branches and improve segmentation accuracy. In the decoder, a Bilateral Channel Attention (BCA) module is designed, which is used to extract detailed information and semantic information of features at different levels of the network, and gradually restore small target features. To capture multi-scale context information, we introduce a Multi-scale Context Aggregation Module (MCAM), which efficiently integrates multi-scale information in a parallel manner. The proposed algorithm has experimented on Cityscapes and CamVid datasets, and reaches 78.8% mIoU and 79.6% mIoU, with speeds of 81FPS and 115FPS, respectively. Experimental results show that the proposed algorithm effectively improves segmentation performance while maintaining a high segmentation speed. •This paper aims to balance accuracy and speed in semantic segmentation for autonomous driving.•An edge feature refinement network is proposed for real-time semantic segmentation, which is an encoder-decoder structure.•In the network, we designed Edge Refinement Module, Multi-scale Context Aggregation Module and Bilateral Channel Attention module for image processing.•We conducted experiments on two competitive datasets of driving scenarios, Cityscapes and CamVid.•The proposed algorithm achieves superior real-time segmentation performance.
ISSN:	1051-2004
DOI:	10.1016/j.dsp.2024.104791