Cross-Attention-Guided Feature Alignment Network for Road Crack Detection

Bibliographic Details
Published in	ISPRS International Journal of Geo-Information, Vol. 12, no. 9, p. 382
Main Authors	Xu, Chuan; Zhang, Qi; Mei, Liye; Chang, Xiufeng; Ye, Zhaoyi; Wang, Junjian; Ye, Lang; Yang, Wei
Format	Journal Article
Language	English
Published	Basel: MDPI AG, 01.09.2023
Subjects	Accuracy; Algorithms; Alignment; Coders; Cracks; cross-layer interaction; Damage; Deep learning; Detection; feature alignment; Feature extraction; Feature maps; Machine learning; multi-scale features; Multilayers; road crack detection; Roads; Roads & highways; Segmentation; Semantics; Support vector machines; Traffic accidents & safety; Urban planning; Wavelet transforms
Summary:	Road crack detection is an important problem in traffic safety and urban planning. Road damage varies in type, scale, size, and depth, which makes detection challenging. To address this problem, we propose a Cross-Attention-guided Feature Alignment Network (CAFANet) that extracts and integrates multi-scale features of road damage. First, a dual-branch visual encoder, whose two branches share the same structure but use different patch sizes (one large, one small), extracts multi-level damage features. A Cross-Layer Interaction (CLI) module establishes interaction between the corresponding layers of the two branches, combining their distinct feature extraction capabilities and contextual understanding. Second, a Feature Alignment Block (FAB) aligns features from different levels or branches both semantically and spatially, which significantly improves CAFANet's perception of damage regions, reduces background interference, and yields more precise detection and segmentation of damage. Finally, multi-layer convolutional segmentation heads produce high-resolution feature maps. To validate the approach, we conduct experiments on the public CRACK500 dataset and compare CAFANet with other mainstream methods. The experimental results show that CAFANet performs well on road crack detection, with significant improvements in F1 score and accuracy: an F1 score of 73.22% and an accuracy of 96.78%.
ISSN:	2220-9964
DOI:	10.3390/ijgi12090382
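
The summary reports an F1 score and an accuracy but does not say how they are computed. The sketch below assumes the common pixel-wise definition on binary crack masks; the function name `pixel_metrics` and the toy masks are hypothetical and are not taken from the article. It also illustrates why accuracy can sit well above F1 on crack imagery: most pixels are background, so correctly labelled background inflates accuracy while F1 only counts crack pixels.

```python
def pixel_metrics(pred, truth):
    """Pixel-wise precision, recall, F1 and accuracy for two binary masks.

    Both masks are nested lists of 0/1 with the same shape (assumed layout).
    """
    tp = fp = fn = tn = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if p and t:
                tp += 1          # crack pixel correctly detected
            elif p and not t:
                fp += 1          # background labelled as crack
            elif t:
                fn += 1          # crack pixel missed
            else:
                tn += 1          # background correctly ignored
    total = tp + fp + fn + tn
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    accuracy = (tp + tn) / total if total else 0.0
    return precision, recall, f1, accuracy


# Toy 4x5 example: 3 true crack pixels, 3 predicted, 2 of them overlapping.
truth = [[0, 0, 0, 0, 0],
         [0, 1, 1, 1, 0],
         [0, 0, 0, 0, 0],
         [0, 0, 0, 0, 0]]
pred = [[0, 0, 0, 0, 0],
        [0, 1, 1, 0, 0],
        [0, 0, 1, 0, 0],
        [0, 0, 0, 0, 0]]

precision, recall, f1, accuracy = pixel_metrics(pred, truth)
print(f"F1={f1:.4f} accuracy={accuracy:.4f}")
```

On this toy input, 16 of the 20 pixels are correctly labelled background, so accuracy is 0.90 while F1 is about 0.67.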