An improved DeepLabv3+ lightweight network for remote-sensing image semantic segmentation

To improve the accuracy of remote-sensing image semantic segmentation in complex scenario, an improved DeepLabv3+ lightweight neural network is proposed. Specifically, the lightweight network MobileNetv2 is used as the backbone network. In atrous spatial pyramid pooling (ASPP), to alleviate the grid...

Full description

Saved in:

Bibliographic Details
Published in	Complex & intelligent systems Vol. 10; no. 2; pp. 2839 - 2849
Main Authors	Chen, Hui, Qin, Yuanshou, Liu, Xinyuan, Wang, Haitao, Zhao, Jinling
Format	Journal Article
Language	English
Published	Cham Springer International Publishing 01.04.2024 Springer Nature B.V Springer
Subjects	Accuracy Complexity Computational Intelligence Computer networks Convolution Data Structures and Information Theory Deep learning DeepLabv3 Engineering Image segmentation Lightweight Lightweight network Modules Neural networks Original Article Parameters Remote sensing Remote-sensing image Semantic segmentation Semantics Deep learning DeepLabv3 Semantic segmentation Remote-sensing image Lightweight network
Online Access	Get full text

Cover

Loading…

More Information
Summary:	To improve the accuracy of remote-sensing image semantic segmentation in complex scenario, an improved DeepLabv3+ lightweight neural network is proposed. Specifically, the lightweight network MobileNetv2 is used as the backbone network. In atrous spatial pyramid pooling (ASPP), to alleviate the gridding effect, the Dilated Convolution in original DeepLabv3+ network is replaced with the Hybrid Dilated Convolution (HDC) module. In addition, the traditional spatial mean pooling is replaced by the strip pooling module (SPN) to improve the local segmentation effect. In the decoder, to obtain the rich low-level target edge information, the ResNet50 residual network is added after the low-level feature fusion. To enhance the shallow semantic information, the efficient and lightweight Normalization-based Attention Module (NAM) is added to capture the feature information of small target objects. The results show that, under the INRIA Aerial Image Dataset and same parameter setting, the Mean Pixel Accuracy (MPA) and Mean Intersection over Union (MIoU) are generally best than DeepLabv3+ , U-Net, and PSP-Net, which are respectively improved by 1.22%, − 0.22%, and 2.22% and 2.17%, 1.35%, and 3.42%. Our proposed method has also a good performance on the small object segmentation and multi-object segmentation. What’s more, it significantly converges faster with fewer model parameters and stronger computing power while ensuring the segmentation effect. It is proved to be robust and can provide a methodological reference for high-precision remote-sensing image semantic segmentation.
ISSN:	2199-4536 2198-6053
DOI:	10.1007/s40747-023-01304-z