Accurate Bridge Detection in Aerial Images With an Auxiliary Waterbody Extraction Task

Bridge detection in aerial images is to determine whether a given aerial image contains one or more bridges and locate them. However, the arbitrary orientations, extreme aspect ratios, and variable backgrounds pose great challenges for bridge detection and positioning. In this article, we tackle the...

Full description

Saved in:
Bibliographic Details
Published inIEEE journal of selected topics in applied earth observations and remote sensing Vol. 14; pp. 9651 - 9666
Main Authors Guo, Haowen, Zhang, Ruixiang, Wang, Yuxuan, Yang, Wen, Li, Heng-Chao, Xia, Gui-Song
Format Journal Article
LanguageEnglish
Published Piscataway IEEE 2021
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Bridge detection in aerial images is to determine whether a given aerial image contains one or more bridges and locate them. However, the arbitrary orientations, extreme aspect ratios, and variable backgrounds pose great challenges for bridge detection and positioning. In this article, we tackle these problems by combining the strengths of semantic-segmentation-based auxiliary supervision, waterbody constraint, and instance-switching-based data augmentation. More precisely, we make three main contributions. First, we propose an oriented bridge detection model with an auxiliary task of waterbody segmentation, which performs as guidance for bridge localization. The network is specifically designed in cascade style to handle the bridge detection and waterbody segmentation task end-to-end. Second, we make use of the semantic features of waterbody as spatial attention to distinguish bridges from cluttered backgrounds and then generate the waterbody segmentation map as the waterbody constraint, which introduces the prior knowledge of bridge distribution to refine the network predictions. Third, we propose a background consistent instance switching method for online data augmentation to further improve the robustness of bridge detection. To verify the effectiveness of the proposed method, we introduce a dataset named BridgeDetV1 containing 5000 well-annotated images with two kinds of bridge representations, i.e., the horizontal bounding box and the oriented bounding box. Extensive experiments demonstrate that our approach outperforms the state-of-the-art methods on this challenging benchmark. Dataset and code are available at https://github.com/whughw/BridgeDet .
ISSN:1939-1404
2151-1535
DOI:10.1109/JSTARS.2021.3112705