Weighted residuals for very deep networks

Deep residual networks have recently shown appealing performance on many challenging computer vision tasks. However, the original residual structure still has some defects making it difficult to converge on very deep networks. In this paper, we introduce a weighted residual network to address the in...

Full description

Saved in:

Bibliographic Details
Published in	2016 3rd International Conference on Systems and Informatics (ICSAI) pp. 936 - 941
Main Authors	Falong Shen, Rui Gan, Gang Zeng
Format	Conference Proceeding
Language	English
Published	IEEE 01.11.2016
Subjects	Computational modeling Convergence Convolution Graphics processing units Road transportation Stochastic processes Training
Online Access	Get full text

Cover

Loading…

Abstract	Deep residual networks have recently shown appealing performance on many challenging computer vision tasks. However, the original residual structure still has some defects making it difficult to converge on very deep networks. In this paper, we introduce a weighted residual network to address the incompatibility between ReLU and element-wise addition and the deep network initialization problem. The weighted residual network is able to learn to combine residuals from different layers effectively and efficiently. The proposed models enjoy a consistent improvement over accuracy and convergence with increasing depths from 100+ layers to 1000+ layers. Besides, the weighted residual networks have little more computation and GPU memory burden than the original residual networks. The networks are optimized by projected stochastic gradient descent. Experiments on CIFAR-10 have shown that our algorithm has a faster convergence speed than the original residual networks and reaches a high accuracy at 95.3% with a 1192-layer model. Experiments on CIFAR-100 and ImageNet-1k have also verified the effectiveness of our proposed design.
AbstractList	Deep residual networks have recently shown appealing performance on many challenging computer vision tasks. However, the original residual structure still has some defects making it difficult to converge on very deep networks. In this paper, we introduce a weighted residual network to address the incompatibility between ReLU and element-wise addition and the deep network initialization problem. The weighted residual network is able to learn to combine residuals from different layers effectively and efficiently. The proposed models enjoy a consistent improvement over accuracy and convergence with increasing depths from 100+ layers to 1000+ layers. Besides, the weighted residual networks have little more computation and GPU memory burden than the original residual networks. The networks are optimized by projected stochastic gradient descent. Experiments on CIFAR-10 have shown that our algorithm has a faster convergence speed than the original residual networks and reaches a high accuracy at 95.3% with a 1192-layer model. Experiments on CIFAR-100 and ImageNet-1k have also verified the effectiveness of our proposed design.
Author	Rui Gan Gang Zeng Falong Shen
Author_xml	– sequence: 1 surname: Falong Shen fullname: Falong Shen – sequence: 2 surname: Rui Gan fullname: Rui Gan – sequence: 3 surname: Gang Zeng fullname: Gang Zeng
BookMark	eNotjktPwzAQhI0EB1r4A3DxlUPCrh_Z-FhFPCJV4kAljpVTr8ECksoJoP57guhpNPNJM7MQp_3QsxBXCCUiuNu2eV61pQKsSqrnpLYnYoEWHFir0J6LmxdOr28TB5l5TOHLf4wyDll-cz7IwLyXPU8_Q34fL8RZnClfHnUpNvd3m-axWD89tM1qXSQHU8HIITqKyu8UMIDbzbt_hjrXRTDeB6-58obIABGoinStuxgIDRqOeimu_2sTM2_3OX36fNgez-tfY4o9vw
ContentType	Conference Proceeding
DBID	6IE 6IL CBEJK RIE RIL
DOI	10.1109/ICSAI.2016.7811085
DatabaseName	IEEE Electronic Library (IEL) Conference Proceedings IEEE Xplore POP ALL IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml	– sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher
DeliveryMethod	fulltext_linktorsrc
EISBN	1509055215 9781509055210
EndPage	941
ExternalDocumentID	7811085
Genre	orig-research
GroupedDBID	6IE 6IL CBEJK RIE RIL
ID	FETCH-LOGICAL-i90t-e1edf97f2ac20e009c7812ac27b9bf04aada3e6a47740770267383bfd71414ef3
IEDL.DBID	RIE
IngestDate	Thu Jun 29 18:38:27 EDT 2023
IsPeerReviewed	false
IsScholarly	false
Language	English
LinkModel	DirectLink
MergedId	FETCHMERGED-LOGICAL-i90t-e1edf97f2ac20e009c7812ac27b9bf04aada3e6a47740770267383bfd71414ef3
PageCount	6
ParticipantIDs	ieee_primary_7811085
PublicationCentury	2000
PublicationDate	2016-Nov.
PublicationDateYYYYMMDD	2016-11-01
PublicationDate_xml	– month: 11 year: 2016 text: 2016-Nov.
PublicationDecade	2010
PublicationTitle	2016 3rd International Conference on Systems and Informatics (ICSAI)
PublicationTitleAbbrev	ICSAI
PublicationYear	2016
Publisher	IEEE
Publisher_xml	– name: IEEE
Score	1.8112712
Snippet	Deep residual networks have recently shown appealing performance on many challenging computer vision tasks. However, the original residual structure still has...
SourceID	ieee
SourceType	Publisher
StartPage	936
SubjectTerms	Computational modeling Convergence Convolution Graphics processing units Road transportation Stochastic processes Training
Title	Weighted residuals for very deep networks
URI	https://ieeexplore.ieee.org/document/7811085
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1NS8NAEB3anjyptOI3e_AimDRpdrPsUYqlFSqCFXsr2ewsiJAWmxz01zuTtBXFg7ckhGz2A96b3fdmAK4I4mmiFS1e4vaBtC4PMhZTmSxzOpEKE89G4elDOn6W93M1b8HNzguDiLX4DEO-rM_y3TKveKusz65IoghtaFPg1ni1tj6YyPQnw6fbCYu10nDz4o-KKTVgjPZhum2q0Ym8hVVpw_zzVxbG__7LAfS-rXnicQc6h9DCogvXL_UGJzpBwXPtrloLIqOClumHcIgrUTRq73UPZqO72XAcbGogBK8mKgOM0Xmj_YAzKSLxoZza5RttjfWRpBHNEkwzSSwu0prLSVHIab3TsYwl-uQIOsWywGMQRikih1ninOLqvAT9LlV2oPhDNvbmBLrcy8WqyXKx2HTw9O_HZ7DHI9248s6hU75XeEHwXNrLel6-AEWYkJk
linkProvider	IEEE
linkToHtml	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwzV1LS8NAEB5qPehJpRXf7kEPHpLmtQk5eJBqaewDwYq9hWx2AiKkpUmR-lf8K_44Z5O0ongteEtyGGZ3dplvN983A3BBKZ4CzWnxErbXHCFjLVJkKj-KpGc7HO1ECYUHQ7f75NyP-bgGHystDCIW5DPU1WPxL19O4rm6KmspVSRBhIpC2cPFGx3QsuvglqJ5aVmdu1G7q1U9BLQX38g1NFEmvpdYqhIhEp6IyYZ68YQvEsMhjyIb3cghFGR4nmrHREc2kUjPdEwHE5vMbsAmwQxuleKwpfDG8FtB-_EmUOwwV688-9GipchQnR34XI6tJKa86vNc6PH7r7KP_3Twu9D8lh6yh1VS3YMapg24ei4ucFGyGWaFeixjBLYZbcMFk4hTlpZs9qwJo3W4uA_1dJLiATCfcwK_kS0lV92HCdpIlwuLK0PCTPxDaKhJDadlFY-wms-jvz-fw1Z3NOiH_WDYO4ZtFeRSgXgC9Xw2x1OCIrk4K5YEg3DNUfgCeVPtpg
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2016+3rd+International+Conference+on+Systems+and+Informatics+%28ICSAI%29&rft.atitle=Weighted+residuals+for+very+deep+networks&rft.au=Falong+Shen&rft.au=Rui+Gan&rft.au=Gang+Zeng&rft.date=2016-11-01&rft.pub=IEEE&rft.spage=936&rft.epage=941&rft_id=info:doi/10.1109%2FICSAI.2016.7811085&rft.externalDocID=7811085