A hybrid deep convolutional neural network for accurate land cover classification

•Spatial attention mechanism improves delineation of various-sized objects in CNNs.•Cascading residual dilated convolution reinforces feature multi-scale inference.•Deep supervision through intermediary loss enhances the network training process.•Different components in the hybrid network can improv...

Full description

Saved in:
Bibliographic Details
Published inInternational journal of applied earth observation and geoinformation Vol. 103; p. 102515
Main Authors Wambugu, Naftaly, Chen, Yiping, Xiao, Zhenlong, Wei, Mingqiang, Aminu Bello, Saifullahi, Marcato Junior, José, Li, Jonathan
Format Journal Article
LanguageEnglish
Published Elsevier B.V 01.12.2021
Elsevier
Subjects
Online AccessGet full text
ISSN1569-8432
1872-826X
DOI10.1016/j.jag.2021.102515

Cover

More Information
Summary:•Spatial attention mechanism improves delineation of various-sized objects in CNNs.•Cascading residual dilated convolution reinforces feature multi-scale inference.•Deep supervision through intermediary loss enhances the network training process.•Different components in the hybrid network can improve land cover classification. Land cover classification provides updated information regarding the Earth's resources, which is vital for agricultural investigation, urban management, and disaster monitoring. Current advances in sensor technology on satellite and aerial remote sensing (RS) devices have improved the spatial-spectral, radiometric, and temporal resolutions of images over time. These improvements offer invaluable chances of understanding land cover information. However, land cover classification from RS images is an intricate task because of the high intra-class disparities, low inter-class similarities, and image variation types. We propose a cascaded residual dilated network (CRD-Net) for land cover classification using very high spatial resolution (VHSR) images to address these challenges. The proposed hybrid network follows the encoder-decoder concept with a spatial attention block to guide the network on learnable discriminate features coupled with an intermediary loss to enhance the training process. Moreover, a cascaded residual dilated module increases the network's receptive field to enrich multi-contextual features further, thus boosting the resultant feature descriptor. Extensive experimental results demonstrate that the proposed CRD-Net outperformed state-of-the-art methods, achieving an overall accuracy (OA) of 90.73% and 90.51% on the ISPRS Potsdam land cover dataset and ISPRS Vaihingen dataset, respectively.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1569-8432
1872-826X
DOI:10.1016/j.jag.2021.102515