Connectivity Learning in Multi-Branch Networks
While much of the work in the design of convolutional networks over the last five years has revolved around the empirical investigation of the importance of depth, filter sizes, and number of feature channels, recent studies have shown that branching, i.e., splitting the computation along parallel b...
Saved in:
Published in | arXiv.org |
---|---|
Main Authors | , |
Format | Paper |
Language | English |
Published |
Ithaca
Cornell University Library, arXiv.org
07.12.2017
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | While much of the work in the design of convolutional networks over the last five years has revolved around the empirical investigation of the importance of depth, filter sizes, and number of feature channels, recent studies have shown that branching, i.e., splitting the computation along parallel but distinct threads and then aggregating their outputs, represents a new promising dimension for significant improvements in performance. To combat the complexity of design choices in multi-branch architectures, prior work has adopted simple strategies, such as a fixed branching factor, the same input being fed to all parallel branches, and an additive combination of the outputs produced by all branches at aggregation points. In this work we remove these predefined choices and propose an algorithm to learn the connections between branches in the network. Instead of being chosen a priori by the human designer, the multi-branch connectivity is learned simultaneously with the weights of the network by optimizing a single loss function defined with respect to the end task. We demonstrate our approach on the problem of multi-class image classification using three different datasets where it yields consistently higher accuracy compared to the state-of-the-art "ResNeXt" multi-branch network given the same learning capacity. |
---|---|
AbstractList | While much of the work in the design of convolutional networks over the last five years has revolved around the empirical investigation of the importance of depth, filter sizes, and number of feature channels, recent studies have shown that branching, i.e., splitting the computation along parallel but distinct threads and then aggregating their outputs, represents a new promising dimension for significant improvements in performance. To combat the complexity of design choices in multi-branch architectures, prior work has adopted simple strategies, such as a fixed branching factor, the same input being fed to all parallel branches, and an additive combination of the outputs produced by all branches at aggregation points. In this work we remove these predefined choices and propose an algorithm to learn the connections between branches in the network. Instead of being chosen a priori by the human designer, the multi-branch connectivity is learned simultaneously with the weights of the network by optimizing a single loss function defined with respect to the end task. We demonstrate our approach on the problem of multi-class image classification using three different datasets where it yields consistently higher accuracy compared to the state-of-the-art "ResNeXt" multi-branch network given the same learning capacity. |
Author | Torresani, Lorenzo Karim, Ahmed |
Author_xml | – sequence: 1 givenname: Ahmed surname: Karim fullname: Karim, Ahmed – sequence: 2 givenname: Lorenzo surname: Torresani fullname: Torresani, Lorenzo |
BookMark | eNrjYmDJy89LZWLgNDI2NtS1MDEy4mDgLS7OMjAwMDIzNzI1NeZk0HPOz8tLTS7JLMssqVTwSU0sysvMS1fIzFPwLc0pydR1KkrMS85Q8EstKc8vyi7mYWBNS8wpTuWF0twMym6uIc4eugVF-YWlqcUl8Vn5pUV5QKl4IwNzM1MDQ0sDC2PiVAEALSk0Jg |
ContentType | Paper |
Copyright | 2017. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. |
Copyright_xml | – notice: 2017. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. |
DBID | 8FE 8FG ABJCF ABUWG AFKRA AZQEC BENPR BGLVJ CCPQU DWQXO HCIFZ L6V M7S PIMPY PQEST PQQKQ PQUKI PRINS PTHSS |
DatabaseName | ProQuest SciTech Collection ProQuest Technology Collection Materials Science & Engineering Collection ProQuest Central (Alumni) ProQuest Central UK/Ireland ProQuest Central Essentials AUTh Library subscriptions: ProQuest Central Technology Collection ProQuest One Community College ProQuest Central SciTech Premium Collection (Proquest) (PQ_SDU_P3) ProQuest Engineering Collection Engineering Database Access via ProQuest (Open Access) ProQuest One Academic Eastern Edition (DO NOT USE) ProQuest One Academic ProQuest One Academic UKI Edition ProQuest Central China Engineering Collection |
DatabaseTitle | Publicly Available Content Database Engineering Database Technology Collection ProQuest Central Essentials ProQuest One Academic Eastern Edition ProQuest Central (Alumni Edition) SciTech Premium Collection ProQuest One Community College ProQuest Technology Collection ProQuest SciTech Collection ProQuest Central China ProQuest Central ProQuest Engineering Collection ProQuest One Academic UKI Edition ProQuest Central Korea Materials Science & Engineering Collection ProQuest One Academic Engineering Collection |
DatabaseTitleList | Publicly Available Content Database |
Database_xml | – sequence: 1 dbid: 8FG name: ProQuest Technology Collection url: https://search.proquest.com/technologycollection1 sourceTypes: Aggregation Database |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Physics |
EISSN | 2331-8422 |
Genre | Working Paper/Pre-Print |
GroupedDBID | 8FE 8FG ABJCF ABUWG AFKRA ALMA_UNASSIGNED_HOLDINGS AZQEC BENPR BGLVJ CCPQU DWQXO FRJ HCIFZ L6V M7S M~E PIMPY PQEST PQQKQ PQUKI PRINS PTHSS |
ID | FETCH-proquest_journals_20765019083 |
IEDL.DBID | 8FG |
IngestDate | Thu Oct 10 15:09:51 EDT 2024 |
IsOpenAccess | true |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-proquest_journals_20765019083 |
OpenAccessLink | https://www.proquest.com/docview/2076501908?pq-origsite=%requestingapplication% |
PQID | 2076501908 |
PQPubID | 2050157 |
ParticipantIDs | proquest_journals_2076501908 |
PublicationCentury | 2000 |
PublicationDate | 20171207 |
PublicationDateYYYYMMDD | 2017-12-07 |
PublicationDate_xml | – month: 12 year: 2017 text: 20171207 day: 07 |
PublicationDecade | 2010 |
PublicationPlace | Ithaca |
PublicationPlace_xml | – name: Ithaca |
PublicationTitle | arXiv.org |
PublicationYear | 2017 |
Publisher | Cornell University Library, arXiv.org |
Publisher_xml | – name: Cornell University Library, arXiv.org |
SSID | ssj0002672553 |
Score | 3.1254175 |
SecondaryResourceType | preprint |
Snippet | While much of the work in the design of convolutional networks over the last five years has revolved around the empirical investigation of the importance of... |
SourceID | proquest |
SourceType | Aggregation Database |
SubjectTerms | Algorithms Artificial neural networks Design Image classification |
Title | Connectivity Learning in Multi-Branch Networks |
URI | https://www.proquest.com/docview/2076501908 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV1NSwMxEB20i-CtfqG2loBeI2k2u0lPQmXXInQpotBbmWRT9VJrt1797SZpqgehx5CQhDDMm3l5yQDc1GrAapXl1GjFqJDMUmRcU4FznlqeoQ6_7Y-rfPQiHqfZNBJuTZRVbn1icNT1h_EcuWdCXDDh4EvdLT-prxrlb1djCY19SPpcSm_Vqnz45Vh4Ll3EnP5zswE7yjYkE1za1RHs2cUxHATJpWlO4DZoTMymegOJ_5y-kvcFCY9i6dCXvHgj1Uan3ZzCdVk834_odpFZNINm9rfp9AxaLp-350AyNDhQLs7BORPIDfbRurRFCnTjaqYvoLtrpsvd3R045B55vOJCdqG1Xn3ZK4eba90Lh9ODZFhUkyfXGn8XPxEId5Y |
link.rule.ids | 783,787,12779,21402,33387,33758,43614,43819 |
linkProvider | ProQuest |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV1NSwMxEB20i-jNT9RWDeg1ErPZ3fRUqLSs2i5FKvS2TLKp9lJrt_5_kzTVg9BzQhLCMG9m8jIP4K6SbVbJJKVaSUZFxgxFxhUVOOWx4Qkq321_WKT5m3ieJJNQcKsDrXLjE72jrj61q5G7SogNJix8yc7iizrVKPe6GiQ0diFyraqsVUfdXjF6_a2y8DSzMXP8z9F69OgfQjTChVkewY6ZH8OeJ13q-gTuPctEr_UbSOh0-k5mc-K_xdKuE734IMWaqV2fwm2_N37M6WaTMhhCXf4dOz6Dhs3ozTmQBDW2pY10cMoEco0PaGzikgm08yqmLqC1baXL7cM3sJ-Ph4Ny8FS8NOGAOxxy_IusBY3V8ttcWRRdqetwVT9D2nkc |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Connectivity+Learning+in+Multi-Branch+Networks&rft.jtitle=arXiv.org&rft.au=Karim%2C+Ahmed&rft.au=Torresani%2C+Lorenzo&rft.date=2017-12-07&rft.pub=Cornell+University+Library%2C+arXiv.org&rft.eissn=2331-8422 |