On a continuous time model of gradient descent dynamics and instability in deep learning

The recipe behind the success of deep learning has been the combination of neural networks and gradient-based optimization. Understanding the behavior of gradient descent however, and particularly its instability, has lagged behind its empirical success. To add to the theoretical tools available to...

Full description

Saved in:
Bibliographic Details
Main Authors Rosca, Mihaela, Wu, Yan, Qin, Chongli, Dherin, Benoit
Format Journal Article
LanguageEnglish
Published 03.02.2023
Subjects
Online AccessGet full text

Cover

Loading…