Lower bounds for finding stationary points I
We prove lower bounds on the complexity of finding ϵ -stationary points (points x such that ‖ ∇ f ( x ) ‖ ≤ ϵ ) of smooth, high-dimensional, and potentially non-convex functions f . We consider oracle-based complexity measures, where an algorithm is given access to the value and all derivatives of f...
Saved in:
Published in | Mathematical programming Vol. 184; no. 1-2; pp. 71 - 120 |
---|---|
Main Authors | , , , |
Format | Journal Article |
Language | English |
Published |
Berlin/Heidelberg
Springer Berlin Heidelberg
01.11.2020
Springer Nature B.V |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | We prove lower bounds on the complexity of finding
ϵ
-stationary points (points
x
such that
‖
∇
f
(
x
)
‖
≤
ϵ
) of smooth, high-dimensional, and potentially non-convex functions
f
. We consider oracle-based complexity measures, where an algorithm is given access to the value and all derivatives of
f
at a query point
x
. We show that for any (potentially randomized) algorithm
A
, there exists a function
f
with Lipschitz
p
th order derivatives such that
A
requires at least
ϵ
-
(
p
+
1
)
/
p
queries to find an
ϵ
-stationary point. Our lower bounds are sharp to within constants, and they show that gradient descent, cubic-regularized Newton’s method, and generalized
p
th order regularization are worst-case optimal within their natural function classes. |
---|---|
Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 |
ISSN: | 0025-5610 1436-4646 |
DOI: | 10.1007/s10107-019-01406-y |