Efficient Global Optimization of Two-layer ReLU Networks: Quadratic-time Algorithms and Adversarial Training

The non-convexity of the artificial neural network (ANN) training landscape brings inherent optimization difficulties. While the traditional back-propagation stochastic gradient descent (SGD) algorithm and its variants are effective in certain cases, they can become stuck at spurious local minima an...

Full description

Saved in:
Bibliographic Details
Published inarXiv.org
Main Authors Bai, Yatong, Gautam, Tanmay, Sojoudi, Somayeh
Format Paper
LanguageEnglish
Published Ithaca Cornell University Library, arXiv.org 06.01.2022
Subjects
Online AccessGet full text

Cover

Loading…