Lower bounds for finding stationary points I

We prove lower bounds on the complexity of finding ϵ -stationary points (points x such that ‖ ∇ f ( x ) ‖ ≤ ϵ ) of smooth, high-dimensional, and potentially non-convex functions f . We consider oracle-based complexity measures, where an algorithm is given access to the value and all derivatives of f...

Full description

Saved in:
Bibliographic Details
Published inMathematical programming Vol. 184; no. 1-2; pp. 71 - 120
Main Authors Carmon, Yair, Duchi, John C., Hinder, Oliver, Sidford, Aaron
Format Journal Article
LanguageEnglish
Published Berlin/Heidelberg Springer Berlin Heidelberg 01.11.2020
Springer Nature B.V
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:We prove lower bounds on the complexity of finding ϵ -stationary points (points x such that ‖ ∇ f ( x ) ‖ ≤ ϵ ) of smooth, high-dimensional, and potentially non-convex functions f . We consider oracle-based complexity measures, where an algorithm is given access to the value and all derivatives of f at a query point x . We show that for any (potentially randomized) algorithm A , there exists a function f with Lipschitz p th order derivatives such that A requires at least ϵ - ( p + 1 ) / p queries to find an ϵ -stationary point. Our lower bounds are sharp to within constants, and they show that gradient descent, cubic-regularized Newton’s method, and generalized p th order regularization are worst-case optimal within their natural function classes.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:0025-5610
1436-4646
DOI:10.1007/s10107-019-01406-y