Weighted residuals for very deep networks
Deep residual networks have recently shown appealing performance on many challenging computer vision tasks. However, the original residual structure still has some defects making it difficult to converge on very deep networks. In this paper, we introduce a weighted residual network to address the in...
Saved in:
Published in | 2016 3rd International Conference on Systems and Informatics (ICSAI) pp. 936 - 941 |
---|---|
Main Authors | , , |
Format | Conference Proceeding |
Language | English |
Published |
IEEE
01.11.2016
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | Deep residual networks have recently shown appealing performance on many challenging computer vision tasks. However, the original residual structure still has some defects making it difficult to converge on very deep networks. In this paper, we introduce a weighted residual network to address the incompatibility between ReLU and element-wise addition and the deep network initialization problem. The weighted residual network is able to learn to combine residuals from different layers effectively and efficiently. The proposed models enjoy a consistent improvement over accuracy and convergence with increasing depths from 100+ layers to 1000+ layers. Besides, the weighted residual networks have little more computation and GPU memory burden than the original residual networks. The networks are optimized by projected stochastic gradient descent. Experiments on CIFAR-10 have shown that our algorithm has a faster convergence speed than the original residual networks and reaches a high accuracy at 95.3% with a 1192-layer model. Experiments on CIFAR-100 and ImageNet-1k have also verified the effectiveness of our proposed design. |
---|---|
AbstractList | Deep residual networks have recently shown appealing performance on many challenging computer vision tasks. However, the original residual structure still has some defects making it difficult to converge on very deep networks. In this paper, we introduce a weighted residual network to address the incompatibility between ReLU and element-wise addition and the deep network initialization problem. The weighted residual network is able to learn to combine residuals from different layers effectively and efficiently. The proposed models enjoy a consistent improvement over accuracy and convergence with increasing depths from 100+ layers to 1000+ layers. Besides, the weighted residual networks have little more computation and GPU memory burden than the original residual networks. The networks are optimized by projected stochastic gradient descent. Experiments on CIFAR-10 have shown that our algorithm has a faster convergence speed than the original residual networks and reaches a high accuracy at 95.3% with a 1192-layer model. Experiments on CIFAR-100 and ImageNet-1k have also verified the effectiveness of our proposed design. |
Author | Rui Gan Gang Zeng Falong Shen |
Author_xml | – sequence: 1 surname: Falong Shen fullname: Falong Shen – sequence: 2 surname: Rui Gan fullname: Rui Gan – sequence: 3 surname: Gang Zeng fullname: Gang Zeng |
BookMark | eNotjktPwzAQhI0EB1r4A3DxlUPCrh_Z-FhFPCJV4kAljpVTr8ECksoJoP57guhpNPNJM7MQp_3QsxBXCCUiuNu2eV61pQKsSqrnpLYnYoEWHFir0J6LmxdOr28TB5l5TOHLf4wyDll-cz7IwLyXPU8_Q34fL8RZnClfHnUpNvd3m-axWD89tM1qXSQHU8HIITqKyu8UMIDbzbt_hjrXRTDeB6-58obIABGoinStuxgIDRqOeimu_2sTM2_3OX36fNgez-tfY4o9vw |
ContentType | Conference Proceeding |
DBID | 6IE 6IL CBEJK RIE RIL |
DOI | 10.1109/ICSAI.2016.7811085 |
DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Xplore POP ALL IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP All) 1998-Present |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
EISBN | 1509055215 9781509055210 |
EndPage | 941 |
ExternalDocumentID | 7811085 |
Genre | orig-research |
GroupedDBID | 6IE 6IL CBEJK RIE RIL |
ID | FETCH-LOGICAL-i90t-e1edf97f2ac20e009c7812ac27b9bf04aada3e6a47740770267383bfd71414ef3 |
IEDL.DBID | RIE |
IngestDate | Thu Jun 29 18:38:27 EDT 2023 |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-i90t-e1edf97f2ac20e009c7812ac27b9bf04aada3e6a47740770267383bfd71414ef3 |
PageCount | 6 |
ParticipantIDs | ieee_primary_7811085 |
PublicationCentury | 2000 |
PublicationDate | 2016-Nov. |
PublicationDateYYYYMMDD | 2016-11-01 |
PublicationDate_xml | – month: 11 year: 2016 text: 2016-Nov. |
PublicationDecade | 2010 |
PublicationTitle | 2016 3rd International Conference on Systems and Informatics (ICSAI) |
PublicationTitleAbbrev | ICSAI |
PublicationYear | 2016 |
Publisher | IEEE |
Publisher_xml | – name: IEEE |
Score | 1.8112712 |
Snippet | Deep residual networks have recently shown appealing performance on many challenging computer vision tasks. However, the original residual structure still has... |
SourceID | ieee |
SourceType | Publisher |
StartPage | 936 |
SubjectTerms | Computational modeling Convergence Convolution Graphics processing units Road transportation Stochastic processes Training |
Title | Weighted residuals for very deep networks |
URI | https://ieeexplore.ieee.org/document/7811085 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1NS8NAEB3anjyptOI3e_AimDRpdrPsUYqlFSqCFXsr2ewsiJAWmxz01zuTtBXFg7ckhGz2A96b3fdmAK4I4mmiFS1e4vaBtC4PMhZTmSxzOpEKE89G4elDOn6W93M1b8HNzguDiLX4DEO-rM_y3TKveKusz65IoghtaFPg1ni1tj6YyPQnw6fbCYu10nDz4o-KKTVgjPZhum2q0Ym8hVVpw_zzVxbG__7LAfS-rXnicQc6h9DCogvXL_UGJzpBwXPtrloLIqOClumHcIgrUTRq73UPZqO72XAcbGogBK8mKgOM0Xmj_YAzKSLxoZza5RttjfWRpBHNEkwzSSwu0prLSVHIab3TsYwl-uQIOsWywGMQRikih1ninOLqvAT9LlV2oPhDNvbmBLrcy8WqyXKx2HTw9O_HZ7DHI9248s6hU75XeEHwXNrLel6-AEWYkJk |
linkProvider | IEEE |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwzV1LS8NAEB5qPehJpRXf7kEPHpLmtQk5eJBqaewDwYq9hWx2AiKkpUmR-lf8K_44Z5O0ongteEtyGGZ3dplvN983A3BBKZ4CzWnxErbXHCFjLVJkKj-KpGc7HO1ECYUHQ7f75NyP-bgGHystDCIW5DPU1WPxL19O4rm6KmspVSRBhIpC2cPFGx3QsuvglqJ5aVmdu1G7q1U9BLQX38g1NFEmvpdYqhIhEp6IyYZ68YQvEsMhjyIb3cghFGR4nmrHREc2kUjPdEwHE5vMbsAmwQxuleKwpfDG8FtB-_EmUOwwV688-9GipchQnR34XI6tJKa86vNc6PH7r7KP_3Twu9D8lh6yh1VS3YMapg24ei4ucFGyGWaFeixjBLYZbcMFk4hTlpZs9qwJo3W4uA_1dJLiATCfcwK_kS0lV92HCdpIlwuLK0PCTPxDaKhJDadlFY-wms-jvz-fw1Z3NOiH_WDYO4ZtFeRSgXgC9Xw2x1OCIrk4K5YEg3DNUfgCeVPtpg |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2016+3rd+International+Conference+on+Systems+and+Informatics+%28ICSAI%29&rft.atitle=Weighted+residuals+for+very+deep+networks&rft.au=Falong+Shen&rft.au=Rui+Gan&rft.au=Gang+Zeng&rft.date=2016-11-01&rft.pub=IEEE&rft.spage=936&rft.epage=941&rft_id=info:doi/10.1109%2FICSAI.2016.7811085&rft.externalDocID=7811085 |