Based on cross-scale fusion attention mechanism network for semantic segmentation for street scenes

Semantic segmentation, which is a fundamental task in computer vision. Every pixel will have a specific semantic class assigned to it through semantic segmentation methods. Embedded systems and mobile devices are difficult to deploy high-accuracy segmentation algorithms. Despite the rapid developmen...

Full description

Saved in:

Bibliographic Details
Published in	Frontiers in neurorobotics Vol. 17; p. 1204418
Main Authors	Ye, Xin, Gao, Lang, Chen, Jichen, Lei, Mingyue
Format	Journal Article
Language	English
Published	Lausanne Frontiers Research Foundation 31.08.2023 Frontiers Media S.A
Subjects	Accuracy channel attention mechanism computer vision Datasets Deep learning dilation convolution Efficiency factorized convolution Neural networks Neuroscience Performance evaluation residual block semantic segmentation Semantics
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Semantic segmentation, which is a fundamental task in computer vision. Every pixel will have a specific semantic class assigned to it through semantic segmentation methods. Embedded systems and mobile devices are difficult to deploy high-accuracy segmentation algorithms. Despite the rapid development of semantic segmentation, the balance between speed and accuracy must be improved. As a solution to the above problems, we created a cross-scale fusion attention mechanism network called CFANet, which fuses feature maps from different scales. We first design a novel efficient residual module (ERM), which applies both dilation convolution and factorized convolution. Our CFANet is mainly constructed from ERM. Subsequently, we designed a new multi-branch channel attention mechanism (MCAM) to refine the feature maps at different levels. Experiment results show that CFANet achieved 70.6% mean intersection over union (mIoU) and 67.7% mIoU on Cityscapes and CamVid datasets, respectively, with inference speeds of 118 FPS and 105 FPS on NVIDIA RTX2080Ti GPU cards with 0.84M parameters.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 Reviewed by: Dechao Chen, Hangzhou Dianzi University, China; Rohit Shukla, University of Wisconsin-Madison, United States Edited by: Long Jin, Lanzhou University, China
ISSN:	1662-5218 1662-5218
DOI:	10.3389/fnbot.2023.1204418