A Sparse Sharing Multitask Framework for Building Footprint Extraction From Remote Sensing Imagery Following the Dual Lottery Ticket Hypothesis

Building footprint extraction from high-resolution remote sensing imagery is significant for urban planning, change detection, disaster management, and other applications. Recently, researchers have found that the edge features of buildings are crucial in extracting building footprints, and multitas...

Full description

Saved in:
Bibliographic Details
Published inIEEE transactions on geoscience and remote sensing Vol. 62; pp. 1 - 16
Main Authors Xing, Huaqiao, Xiang, Junwu, Xiong, Li, Wen, Qi, Liu, Qingjie, Wang, Yunhong
Format Journal Article
LanguageEnglish
Published New York IEEE 2024
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Building footprint extraction from high-resolution remote sensing imagery is significant for urban planning, change detection, disaster management, and other applications. Recently, researchers have found that the edge features of buildings are crucial in extracting building footprints, and multitask deep learning is used to share edge feature information. However, these multitask deep learning frameworks adopt a hard sharing approach, which cannot avoid the adverse effects caused by the differences between different tasks, resulting in the problem of blurred edges and building boundaries. To address this issue, this article proposes a dual lottery ticket hypothesis (DLTH) and sparse sharing-based multitask deep learning framework, dual sparse sharing architecture (DSSA), to transmit the edge information in the edge detection to the building footprint extraction by sharing partial parameters. First, the subnetworks of building footprint extraction and edge detection are constructed according to the sparse rate and parameter sharing rate to control the dependencies between the subnetworks. Second, given the difference in the importance of the two tasks, a cosine unequal-scaled alternating training strategy is proposed to strengthen and weaken the transmission of edge information periodically. Third, following the DLTH, the loss function with <inline-formula> <tex-math notation="LaTeX">{L}3 </tex-math></inline-formula>/2 regularization constraint is used to promote the information transmission and parameter conversion of the subnetwork by using the global information. Finally, aiming at the edge of building footprint extraction results, a pixel-based evaluation index, edge extraction accuracy (<inline-formula> <tex-math notation="LaTeX">{\mathrm {EEA}}^{(n)}) </tex-math></inline-formula>, is designed by morphological erosion to better evaluate the integrity of the edge of building footprint extraction results. The experiments conducted on a self-annotated dataset and two public datasets (i.e., WHU Aerial Imagery dataset and Massachusetts Building dataset) show that DSSA can achieve better edge effects than the baseline and show excellent generalization ability.
ISSN:0196-2892
1558-0644
DOI:10.1109/TGRS.2024.3418369