Multi-scale attention networks for pavement defect detection

Bibliographic Details
Published in IEEE Transactions on Instrumentation and Measurement p. 1
Main Authors Chen, Junde; Wen, Yuxin; Nanehkaran, Yaser Ahangari; Zhang, Defu; Zeb, Adan
Format Journal Article
Language English
Published IEEE 22.07.2023
Abstract Pavement defects such as cracks, net cracks, and pit slots can cause traffic safety problems, and timely detection and identification play a key role in reducing the harm they cause. Recent deep learning-based CNNs have shown competitive performance in image detection and classification. To detect pavement defects automatically and more accurately, we propose a multi-scale mobile attention-based network, termed MANet. MANet uses an encoder-decoder architecture in which the encoder adopts MobileNet as the backbone network to extract pavement defect features. Instead of the original 3×3 convolution, multi-scale convolution kernels are used in the depth-wise separable convolution layers of the network. In addition, a hybrid attention mechanism is incorporated separately into the encoder and decoder modules to infer the significance of spatial points and inter-channel relationships in the intermediate feature maps. The proposed approach achieves state-of-the-art performance on two publicly available benchmark datasets: Crack500 (500 crack images of 2,000×1,500 pixels) and CFD (118 crack images of 480×320 pixels). The mean intersection over union (MIoU) on these two datasets reaches 0.7219 and 0.7788, respectively. Ablation experiments show that the multi-scale convolution and hybrid attention modules help the model extract high-level feature representations and generate more accurate pavement crack segmentation results. On locally collected pavement crack images (131 images of 1,024×768 pixels), the model achieves an MIoU of 0.6514 and outperforms the compared baseline methods. These experimental findings demonstrate the validity and feasibility of the proposed approach, which provides a viable solution for pavement crack detection in practical application scenarios. Our code is available at https://github.com/xtu502/pavement-defects.
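The abstract reports results as mean intersection over union (MIoU). As a minimal sketch of how that metric is commonly computed for segmentation masks, the pure-Python function below averages per-class IoU over a pair of label maps. The function name `mean_iou`, the two-class labeling (0 = background, 1 = crack), and the choice to skip classes absent from both maps are assumptions made for illustration; the abstract does not state the authors' exact evaluation protocol, and this is not taken from their repository.

```python
def mean_iou(pred, target, num_classes=2):
    """Mean intersection over union between two label maps of equal shape.

    pred and target are nested lists of integer class labels.
    Assumption: 0 = background, 1 = crack (hypothetical labeling).
    """
    ious = []
    for c in range(num_classes):
        inter = 0
        union = 0
        for pred_row, target_row in zip(pred, target):
            for p, t in zip(pred_row, target_row):
                in_pred = p == c
                in_target = t == c
                if in_pred and in_target:
                    inter += 1
                if in_pred or in_target:
                    union += 1
        if union > 0:  # skip classes that appear in neither map
            ious.append(inter / union)
    return sum(ious) / len(ious) if ious else 0.0


# Toy 2x3 masks: class 0 has IoU 3/4 and class 1 has IoU 2/3.
pred = [[0, 1, 1],
        [0, 0, 1]]
target = [[0, 1, 0],
          [0, 0, 1]]
score = mean_iou(pred, target)
```

For the toy masks, `score` is the average of 3/4 and 2/3. Whether IoU is averaged per image or accumulated over a whole dataset changes the reported number, so a comparison against the published values would need to follow the authors' protocol.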
Author_xml – sequence: 1
  givenname: Junde
  orcidid: 0000-0003-1748-4374
  surname: Chen
  fullname: Chen, Junde
  organization: Dale E. and Sarah Ann Fowler School of Engineering, Chapman University, CA, USA
– sequence: 2
  givenname: Yuxin
  orcidid: 0000-0002-2352-5622
  surname: Wen
  fullname: Wen, Yuxin
  organization: Dale E. and Sarah Ann Fowler School of Engineering, Chapman University, CA, USA
– sequence: 3
  givenname: Yaser Ahangari
  orcidid: 0000-0002-8055-3195
  surname: Nanehkaran
  fullname: Nanehkaran, Yaser Ahangari
  organization: School of Information Engineering, Yancheng Teachers University, Yancheng, China
– sequence: 4
  givenname: Defu
  orcidid: 0000-0002-2396-1205
  surname: Zhang
  fullname: Zhang, Defu
  organization: School of Informatics, Xiamen University, Xiamen, China
– sequence: 5
  givenname: Adan
  orcidid: 0000-0001-6105-3796
  surname: Zeb
  fullname: Zeb, Adan
  organization: Southern University of Science and Technology, Shenzhen, China
CODEN IEIMAO
ContentType Journal Article
DOI 10.1109/TIM.2023.3298391
Discipline Engineering
Physics
EndPage 1
ExternalDocumentID 10192438
Genre orig-research
GrantInformation_xml – fundername: Fundamental Research Funds for the Central Universities
  grantid: 20720181004
  funderid: 10.13039/501100012226
ISSN 0018-9456
IsPeerReviewed true
IsScholarly true
Language English
PublicationDateYYYYMMDD 2023-07-22
PublicationTitle IEEE transactions on instrumentation and measurement
PublicationTitleAbbrev TIM
PublicationYear 2023
Publisher IEEE
StartPage 1
SubjectTerms attention module
Convolution
Convolutional neural networks
deep neural network
Feature extraction
image identification
Kernel
multi-scale convolution
Pavement defect detection
Roads
Task analysis
Training
Title Multi-scale attention networks for pavement defect detection
URI https://ieeexplore.ieee.org/document/10192438