Multi-scale attention networks for pavement defect detection
Pavement defects such as cracks, net cracks, and pit slots can cause potential traffic safety problems. The timely detection and identification play a key role in reducing the harm of various pavement defects. Particularly, the recent development in deep learning-based CNNs has shown competitive per...
Saved in:
Published in | IEEE transactions on instrumentation and measurement p. 1 |
---|---|
Main Authors | , , , , |
Format | Journal Article |
Language | English |
Published |
IEEE
22.07.2023
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | Pavement defects such as cracks, net cracks, and pit slots can cause potential traffic safety problems. The timely detection and identification play a key role in reducing the harm of various pavement defects. Particularly, the recent development in deep learning-based CNNs has shown competitive performance in image detection and classification. To detect pavement defects automatically and improve effects, a multi-scale mobile attention-based network, which we termed MANet, is proposed to perform the detection of pavement defects. The architecture of the encoder-decoder is used in MANet, where the encoder adopts the MobileNet as the backbone network to extract pavement defect features. Instead of the original 3×3 convolution, the multi-scale convolution kernels are utilized in depth-wise separable convolution layers of the network. Further, the hybrid attention mechanism is separately incorporated into the encoder and decoder modules to infer the significance of spatial points and inter-channel relationship features for the input intermediate feature maps. The proposed approach achieves state-of-the-art performance on two publicly-available benchmark datasets, i.e., the Crack500 (500 crack images with 2,000×1,500 pixels) and CFD (118 crack images with 480×320 pixels) datasets. The mean intersection over union ( MIoU ) of the proposed approach on these two datasets reaches 0.7219 and 0.7788, respectively. Ablation experiments show that the multi-scale convolution and hybrid attention modules can effectively help the model extract high-level feature representations and generate more accurate pavement crack segmentation results. We further test the model on locally collected pavement crack images (131 images with 1024×768 pixels) and it achieves a satisfactory result. The proposed approach realizes the MIoU of 0.6514 on the local dataset and outperforms other compared baseline methods. Experimental findings demonstrate the validity and feasibility of the proposed approach and it provides a viable solution for pavement crack detection in practical application scenarios. Our code is available at https://github.com/xtu502/pavement-defects. |
---|---|
AbstractList | Pavement defects such as cracks, net cracks, and pit slots can cause potential traffic safety problems. The timely detection and identification play a key role in reducing the harm of various pavement defects. Particularly, the recent development in deep learning-based CNNs has shown competitive performance in image detection and classification. To detect pavement defects automatically and improve effects, a multi-scale mobile attention-based network, which we termed MANet, is proposed to perform the detection of pavement defects. The architecture of the encoder-decoder is used in MANet, where the encoder adopts the MobileNet as the backbone network to extract pavement defect features. Instead of the original 3×3 convolution, the multi-scale convolution kernels are utilized in depth-wise separable convolution layers of the network. Further, the hybrid attention mechanism is separately incorporated into the encoder and decoder modules to infer the significance of spatial points and inter-channel relationship features for the input intermediate feature maps. The proposed approach achieves state-of-the-art performance on two publicly-available benchmark datasets, i.e., the Crack500 (500 crack images with 2,000×1,500 pixels) and CFD (118 crack images with 480×320 pixels) datasets. The mean intersection over union ( MIoU ) of the proposed approach on these two datasets reaches 0.7219 and 0.7788, respectively. Ablation experiments show that the multi-scale convolution and hybrid attention modules can effectively help the model extract high-level feature representations and generate more accurate pavement crack segmentation results. We further test the model on locally collected pavement crack images (131 images with 1024×768 pixels) and it achieves a satisfactory result. The proposed approach realizes the MIoU of 0.6514 on the local dataset and outperforms other compared baseline methods. Experimental findings demonstrate the validity and feasibility of the proposed approach and it provides a viable solution for pavement crack detection in practical application scenarios. Our code is available at https://github.com/xtu502/pavement-defects. |
Author | Nanehkaran, Yaser Ahangari Zeb, Adan Zhang, Defu Chen, Junde Wen, Yuxin |
Author_xml | – sequence: 1 givenname: Junde orcidid: 0000-0003-1748-4374 surname: Chen fullname: Chen, Junde organization: Dale E. and Sarah Ann Fowler School of Engineering, Chapman University, CA, USA – sequence: 2 givenname: Yuxin orcidid: 0000-0002-2352-5622 surname: Wen fullname: Wen, Yuxin organization: Dale E. and Sarah Ann Fowler School of Engineering, Chapman University, CA, USA – sequence: 3 givenname: Yaser Ahangari orcidid: 0000-0002-8055-3195 surname: Nanehkaran fullname: Nanehkaran, Yaser Ahangari organization: School of Information Engineering, Yancheng Teachers University, Yancheng, China – sequence: 4 givenname: Defu orcidid: 0000-0002-2396-1205 surname: Zhang fullname: Zhang, Defu organization: School of Informatics, Xiamen University, Xiamen, China – sequence: 5 givenname: Adan orcidid: 0000-0001-6105-3796 surname: Zeb fullname: Zeb, Adan organization: Southern university of science and Technology, Shenzhen, China |
BookMark | eNqFyrEOgjAUQNEOmAjq7uDQHwBfWwSauBmNDmzspMFHgkJL2qrx78XE3ekM90Yk0EYjIWsGCWMgt9WlTDhwkQguCyFZQEIAVsQy3WVzEjl3A4A8S_OQ7MtH77vYNapHqrxH7TujqUb_MvbuaGssHdUThynQK7bYfPET07Yks1b1Dlc_F2RzOlaHc9whYj3ablD2XTNgkqeiEH_yB-5JOPU |
CODEN | IEIMAO |
ContentType | Journal Article |
DBID | 97E RIA RIE |
DOI | 10.1109/TIM.2023.3298391 |
DatabaseName | IEEE All-Society Periodicals Package (ASPP) 2005–Present IEEE All-Society Periodicals Package (ASPP) 1998-Present IEEE Electronic Library Online |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library Online url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Engineering Physics |
EndPage | 1 |
ExternalDocumentID | 10192438 |
Genre | orig-research |
GrantInformation_xml | – fundername: Fundamental Research Funds for the Central Universities grantid: 20720181004 funderid: 10.13039/501100012226 |
GroupedDBID | -~X 0R~ 29I 4.4 5GY 6IK 85S 97E AAJGR AASAJ ABQJQ ACGFO ACIWK ACNCT AENEX AKJIK ALMA_UNASSIGNED_HOLDINGS ATWAV BEFXN BFFAM BGNUA BKEBE BPEOZ CS3 DU5 EBS F5P HZ~ IFIPE IPLJI JAVBF LAI M43 O9- OCL P2P RIA RIE RIG RNS TN5 TWZ |
ID | FETCH-ieee_primary_101924383 |
IEDL.DBID | RIE |
ISSN | 0018-9456 |
IngestDate | Mon Nov 04 11:50:24 EST 2024 |
IsPeerReviewed | true |
IsScholarly | true |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-ieee_primary_101924383 |
ORCID | 0000-0002-2396-1205 0000-0002-2352-5622 0000-0001-6105-3796 0000-0002-8055-3195 0000-0003-1748-4374 |
ParticipantIDs | ieee_primary_10192438 |
PublicationCentury | 2000 |
PublicationDate | 20230722 |
PublicationDateYYYYMMDD | 2023-07-22 |
PublicationDate_xml | – month: 7 year: 2023 text: 20230722 day: 22 |
PublicationDecade | 2020 |
PublicationTitle | IEEE transactions on instrumentation and measurement |
PublicationTitleAbbrev | TIM |
PublicationYear | 2023 |
Publisher | IEEE |
Publisher_xml | – name: IEEE |
SSID | ssj0007647 |
Score | 4.85228 |
Snippet | Pavement defects such as cracks, net cracks, and pit slots can cause potential traffic safety problems. The timely detection and identification play a key role... |
SourceID | ieee |
SourceType | Publisher |
StartPage | 1 |
SubjectTerms | attention module Convolution Convolutional neural networks deep neural network Feature extraction image identification Kernel multi-scale convolution Pavement defect detection Roads Task analysis Training |
Title | Multi-scale attention networks for pavement defect detection |
URI | https://ieeexplore.ieee.org/document/10192438 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PS8MwFH64gaAHf8yJzik5eE1t09c2BS8ijilspwm7jbRNL0I3bHvxr_clWWWKgqeEHMojr0m-JN_3BeAWRaH8EmMudOZzjLHkCnPJUYk0CpXyw8QIhWfzePqKL8touRWrWy2M1tqSz7RnqvYuv1jnrTkqoxFutguh7EEvSVMn1vqadpMYnUFmQCOYYEF3J-mnd4vnmWeeCfdCkRIgCL69pGIXkskxzLsQHH_kzWubzMs_frgz_jvGEzjaQkr24P6BU9jT1QAOd4wGB7BviZ55fQb3VnHLa0qNZsZb07IdWeXY4DUjDMs2ypqIN6zQhuxBRWMJW9UQxpOnxeOUm4BWG2dUsepiCc-hX60rfQFMRhiXEnOCVSVSQqQgvFAWhKsKlFoFlzD89ROjP9qv4MD0rDntFGIM_ea91de0TDfZjU3PJxQ4k70 |
link.rule.ids | 315,783,787,799,27936,27937,55086 |
linkProvider | IEEE |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PT4MwGP2iM0Y9-GPO-GNqD15BKKUriRdjXJgOTpjsRgqUiwlbBC7-9X4tw6jRxFObHsiXfrR9bd97BbhhtJBOybhFVeZYjLPSkiwXFpM08D0pHW-ihcJRzMMX9rTwF2uxutHCKKUM-UzZumru8otl3uqjMhzhervgiU3YQmAteCfX-px4J5x1FpkujmEEBv2tpBPcJrPI1g-F2x4NEBK4395SMUvJ9ADiPoiOQfJqt01m5-8__Bn_HeUh7K9BJbnv_oIj2FDVEPa-WA0OYdtQPfP6GO6M5taqMTmKaHdNw3ckVccHrwmiWLKSxka8IYXSdA8sGkPZqkYwnj4mD6GlA0pXnVVF2sfincCgWlbqFIjwGS8FyxFYlQxTIigihrJAZFUwoaR7BqNfP3H-R_s17IRJNE_ns_j5AnZ1L-uzT0rHMGjeWnWJi3aTXZlUfQAAUJcI |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Multi-scale+attention+networks+for+pavement+defect+detection&rft.jtitle=IEEE+transactions+on+instrumentation+and+measurement&rft.au=Chen%2C+Junde&rft.au=Wen%2C+Yuxin&rft.au=Nanehkaran%2C+Yaser+Ahangari&rft.au=Zhang%2C+Defu&rft.date=2023-07-22&rft.pub=IEEE&rft.issn=0018-9456&rft.spage=1&rft.epage=1&rft_id=info:doi/10.1109%2FTIM.2023.3298391&rft.externalDocID=10192438 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0018-9456&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0018-9456&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0018-9456&client=summon |