Multi-stage Attention ResU-Net for Semantic Segmentation of Fine-Resolution Remote Sensing Images

Bibliographic Details
Published in	arXiv.org
Main Authors	Li, Rui; Zheng, Shunyi; Duan, Chenxi; Su, Jianlin; Zhang, Ce
Format	Paper Journal Article
Language	English
Published	Ithaca: Cornell University Library, arXiv.org, 01.12.2020
Subjects	Computer Science - Computer Vision and Pattern Recognition; Computer vision; Computing costs; Feature extraction; Feature maps; Image segmentation; Natural language processing; Remote sensing; Semantic segmentation; Semantics; Source code

Summary:	Attention mechanisms refine extracted feature maps and boost the classification performance of deep networks, and have become an essential technique in computer vision and natural language processing. However, the memory and computational costs of dot-product attention increase quadratically with the spatio-temporal size of the input, which considerably hinders its use in application scenarios with large-scale inputs. In this Letter, we propose a Linear Attention Mechanism (LAM) to address this issue; it is approximately equivalent to dot-product attention but computationally efficient. This design makes integrating attention mechanisms into deep networks much more flexible and versatile. Based on the proposed LAM, we refactor the skip connections of the original U-Net and design a Multi-stage Attention ResU-Net (MAResU-Net) for semantic segmentation of fine-resolution remote sensing images. Experiments on the Vaihingen dataset demonstrate the effectiveness and efficiency of MAResU-Net. Open-source code is available at https://github.com/lironui/Multistage-Attention-ResU-Net.
ISSN:	2331-8422
DOI:	10.48550/arxiv.2011.14302
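
The summary contrasts quadratic-cost dot-product attention with a linear-cost approximation. The sketch below illustrates the general idea and is not taken from the paper's code: the abstract does not give the LAM formula, so the similarity used here, 1 + (q/‖q‖)·(k/‖k‖), is an assumption made for illustration, and the function names are hypothetical. With that similarity, the sums over keys can be computed once and shared across all queries, so no N×N matrix is ever built.

```python
import numpy as np


def l2_normalize(x, eps=1e-12):
    """Scale each row to unit L2 norm (eps guards against zero rows)."""
    return x / (np.linalg.norm(x, axis=-1, keepdims=True) + eps)


def quadratic_attention(q, k, v):
    """Reference version: builds the full (N, N) similarity matrix.

    Assumed similarity: sim(q_i, k_j) = 1 + q_i_hat . k_j_hat, which lies
    in [0, 2] because both vectors are unit-normalized.
    """
    qn, kn = l2_normalize(q), l2_normalize(k)
    sim = 1.0 + qn @ kn.T                       # (N, N): quadratic in N
    return (sim @ v) / sim.sum(axis=1, keepdims=True)


def linear_attention(q, k, v):
    """Same result, computed without the (N, N) matrix.

    sum_j (1 + q_i.k_j) v_j = sum_j v_j + q_i . (sum_j k_j v_j^T)
    sum_j (1 + q_i.k_j)     = N         + q_i . (sum_j k_j)
    """
    qn, kn = l2_normalize(q), l2_normalize(k)
    n = k.shape[0]
    kv = kn.T @ v                               # (d, d_v), computed once
    k_sum = kn.sum(axis=0)                      # (d,), computed once
    numer = v.sum(axis=0, keepdims=True) + qn @ kv   # (N, d_v)
    denom = n + qn @ k_sum                      # (N,)
    return numer / denom[:, None]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    q = rng.normal(size=(1024, 16))  # N = 1024 flattened pixels, d = 16
    k = rng.normal(size=(1024, 16))
    v = rng.normal(size=(1024, 8))
    same = np.allclose(quadratic_attention(q, k, v), linear_attention(q, k, v))
    print("linear and quadratic forms agree:", same)
```

Here the cost of the second function grows linearly with the number of positions N, because the only N-sized operations are products with fixed-size (d × d_v) and (d,) summaries. The denominator can approach zero only in the degenerate case where every key points exactly opposite to a query. For the authors' actual formulation and network, see the repository linked in the summary.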