Weighted residuals for very deep networks

Deep residual networks have recently shown appealing performance on many challenging computer vision tasks. However, the original residual structure still has some defects making it difficult to converge on very deep networks. In this paper, we introduce a weighted residual network to address the in...

Full description

Saved in:
Bibliographic Details
Published in2016 3rd International Conference on Systems and Informatics (ICSAI) pp. 936 - 941
Main Authors Falong Shen, Rui Gan, Gang Zeng
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.11.2016
Subjects
Online AccessGet full text

Cover

Loading…
Abstract Deep residual networks have recently shown appealing performance on many challenging computer vision tasks. However, the original residual structure still has some defects making it difficult to converge on very deep networks. In this paper, we introduce a weighted residual network to address the incompatibility between ReLU and element-wise addition and the deep network initialization problem. The weighted residual network is able to learn to combine residuals from different layers effectively and efficiently. The proposed models enjoy a consistent improvement over accuracy and convergence with increasing depths from 100+ layers to 1000+ layers. Besides, the weighted residual networks have little more computation and GPU memory burden than the original residual networks. The networks are optimized by projected stochastic gradient descent. Experiments on CIFAR-10 have shown that our algorithm has a faster convergence speed than the original residual networks and reaches a high accuracy at 95.3% with a 1192-layer model. Experiments on CIFAR-100 and ImageNet-1k have also verified the effectiveness of our proposed design.
AbstractList Deep residual networks have recently shown appealing performance on many challenging computer vision tasks. However, the original residual structure still has some defects making it difficult to converge on very deep networks. In this paper, we introduce a weighted residual network to address the incompatibility between ReLU and element-wise addition and the deep network initialization problem. The weighted residual network is able to learn to combine residuals from different layers effectively and efficiently. The proposed models enjoy a consistent improvement over accuracy and convergence with increasing depths from 100+ layers to 1000+ layers. Besides, the weighted residual networks have little more computation and GPU memory burden than the original residual networks. The networks are optimized by projected stochastic gradient descent. Experiments on CIFAR-10 have shown that our algorithm has a faster convergence speed than the original residual networks and reaches a high accuracy at 95.3% with a 1192-layer model. Experiments on CIFAR-100 and ImageNet-1k have also verified the effectiveness of our proposed design.
Author Rui Gan
Gang Zeng
Falong Shen
Author_xml – sequence: 1
  surname: Falong Shen
  fullname: Falong Shen
– sequence: 2
  surname: Rui Gan
  fullname: Rui Gan
– sequence: 3
  surname: Gang Zeng
  fullname: Gang Zeng
BookMark eNotjktPwzAQhI0EB1r4A3DxlUPCrh_Z-FhFPCJV4kAljpVTr8ECksoJoP57guhpNPNJM7MQp_3QsxBXCCUiuNu2eV61pQKsSqrnpLYnYoEWHFir0J6LmxdOr28TB5l5TOHLf4wyDll-cz7IwLyXPU8_Q34fL8RZnClfHnUpNvd3m-axWD89tM1qXSQHU8HIITqKyu8UMIDbzbt_hjrXRTDeB6-58obIABGoinStuxgIDRqOeimu_2sTM2_3OX36fNgez-tfY4o9vw
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/ICSAI.2016.7811085
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 1509055215
9781509055210
EndPage 941
ExternalDocumentID 7811085
Genre orig-research
GroupedDBID 6IE
6IL
CBEJK
RIE
RIL
ID FETCH-LOGICAL-i90t-e1edf97f2ac20e009c7812ac27b9bf04aada3e6a47740770267383bfd71414ef3
IEDL.DBID RIE
IngestDate Thu Jun 29 18:38:27 EDT 2023
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i90t-e1edf97f2ac20e009c7812ac27b9bf04aada3e6a47740770267383bfd71414ef3
PageCount 6
ParticipantIDs ieee_primary_7811085
PublicationCentury 2000
PublicationDate 2016-Nov.
PublicationDateYYYYMMDD 2016-11-01
PublicationDate_xml – month: 11
  year: 2016
  text: 2016-Nov.
PublicationDecade 2010
PublicationTitle 2016 3rd International Conference on Systems and Informatics (ICSAI)
PublicationTitleAbbrev ICSAI
PublicationYear 2016
Publisher IEEE
Publisher_xml – name: IEEE
Score 1.8112712
Snippet Deep residual networks have recently shown appealing performance on many challenging computer vision tasks. However, the original residual structure still has...
SourceID ieee
SourceType Publisher
StartPage 936
SubjectTerms Computational modeling
Convergence
Convolution
Graphics processing units
Road transportation
Stochastic processes
Training
Title Weighted residuals for very deep networks
URI https://ieeexplore.ieee.org/document/7811085
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1NS8NAEB3anjyptOI3e_AimDRpdrPsUYqlFSqCFXsr2ewsiJAWmxz01zuTtBXFg7ckhGz2A96b3fdmAK4I4mmiFS1e4vaBtC4PMhZTmSxzOpEKE89G4elDOn6W93M1b8HNzguDiLX4DEO-rM_y3TKveKusz65IoghtaFPg1ni1tj6YyPQnw6fbCYu10nDz4o-KKTVgjPZhum2q0Ym8hVVpw_zzVxbG__7LAfS-rXnicQc6h9DCogvXL_UGJzpBwXPtrloLIqOClumHcIgrUTRq73UPZqO72XAcbGogBK8mKgOM0Xmj_YAzKSLxoZza5RttjfWRpBHNEkwzSSwu0prLSVHIab3TsYwl-uQIOsWywGMQRikih1ninOLqvAT9LlV2oPhDNvbmBLrcy8WqyXKx2HTw9O_HZ7DHI9248s6hU75XeEHwXNrLel6-AEWYkJk
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwzV1LS8NAEB5qPehJpRXf7kEPHpLmtQk5eJBqaewDwYq9hWx2AiKkpUmR-lf8K_44Z5O0ongteEtyGGZ3dplvN983A3BBKZ4CzWnxErbXHCFjLVJkKj-KpGc7HO1ECYUHQ7f75NyP-bgGHystDCIW5DPU1WPxL19O4rm6KmspVSRBhIpC2cPFGx3QsuvglqJ5aVmdu1G7q1U9BLQX38g1NFEmvpdYqhIhEp6IyYZ68YQvEsMhjyIb3cghFGR4nmrHREc2kUjPdEwHE5vMbsAmwQxuleKwpfDG8FtB-_EmUOwwV688-9GipchQnR34XI6tJKa86vNc6PH7r7KP_3Twu9D8lh6yh1VS3YMapg24ei4ucFGyGWaFeixjBLYZbcMFk4hTlpZs9qwJo3W4uA_1dJLiATCfcwK_kS0lV92HCdpIlwuLK0PCTPxDaKhJDadlFY-wms-jvz-fw1Z3NOiH_WDYO4ZtFeRSgXgC9Xw2x1OCIrk4K5YEg3DNUfgCeVPtpg
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2016+3rd+International+Conference+on+Systems+and+Informatics+%28ICSAI%29&rft.atitle=Weighted+residuals+for+very+deep+networks&rft.au=Falong+Shen&rft.au=Rui+Gan&rft.au=Gang+Zeng&rft.date=2016-11-01&rft.pub=IEEE&rft.spage=936&rft.epage=941&rft_id=info:doi/10.1109%2FICSAI.2016.7811085&rft.externalDocID=7811085