Accelerated Bregman proximal gradient methods for relatively smooth convex optimization

We consider the problem of minimizing the sum of two convex functions: one is differentiable and relatively smooth with respect to a reference convex function, and the other can be nondifferentiable but simple to optimize. We investigate a triangle scaling property of the Bregman distance generated...

Full description

Saved in:
Bibliographic Details
Published inComputational optimization and applications Vol. 79; no. 2; pp. 405 - 440
Main Authors Hanzely, Filip, Richtárik, Peter, Xiao, Lin
Format Journal Article
LanguageEnglish
Published New York Springer US 01.06.2021
Springer Nature B.V
Subjects
Online AccessGet full text
ISSN0926-6003
1573-2894
DOI10.1007/s10589-021-00273-8

Cover

Loading…
More Information
Summary:We consider the problem of minimizing the sum of two convex functions: one is differentiable and relatively smooth with respect to a reference convex function, and the other can be nondifferentiable but simple to optimize. We investigate a triangle scaling property of the Bregman distance generated by the reference convex function and present accelerated Bregman proximal gradient (ABPG) methods that attain an O ( k - γ ) convergence rate, where γ ∈ ( 0 , 2 ] is the triangle scaling exponent (TSE) of the Bregman distance. For the Euclidean distance, we have γ = 2 and recover the convergence rate of Nesterov’s accelerated gradient methods. For non-Euclidean Bregman distances, the TSE can be much smaller (say γ ≤ 1 ), but we show that a relaxed definition of intrinsic TSE is always equal to 2. We exploit the intrinsic TSE to develop adaptive ABPG methods that converge much faster in practice. Although theoretical guarantees on a fast convergence rate seem to be out of reach in general, our methods obtain empirical O ( k - 2 ) rates in numerical experiments on several applications and provide posterior numerical certificates for the fast rates.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:0926-6003
1573-2894
DOI:10.1007/s10589-021-00273-8