TY - JOUR
T1 - Adaptive regularization with cubics on manifolds
AU - Agarwal, Naman
AU - Boumal, Nicolas
AU - Bullins, Brian
AU - Cartis, Coralia
N1 - Publisher Copyright:
© 2020, Springer-Verlag GmbH Germany, part of Springer Nature and Mathematical Optimization Society.
PY - 2021/7
Y1 - 2021/7
N2 - Adaptive regularization with cubics (ARC) is an algorithm for unconstrained, non-convex optimization. Akin to the trust-region method, its iterations can be thought of as approximate, safe-guarded Newton steps. For cost functions with Lipschitz continuous Hessian, ARC has optimal iteration complexity, in the sense that it produces an iterate with gradient smaller than ε in O(1 / ε1.5) iterations. For the same price, it can also guarantee a Hessian with smallest eigenvalue larger than -ε. In this paper, we study a generalization of ARC to optimization on Riemannian manifolds. In particular, we generalize the iteration complexity results to this richer framework. Our central contribution lies in the identification of appropriate manifold-specific assumptions that allow us to secure these complexity guarantees both when using the exponential map and when using a general retraction. A substantial part of the paper is devoted to studying these assumptions—relevant beyond ARC—and providing user-friendly sufficient conditions for them. Numerical experiments are encouraging.
AB - Adaptive regularization with cubics (ARC) is an algorithm for unconstrained, non-convex optimization. Akin to the trust-region method, its iterations can be thought of as approximate, safe-guarded Newton steps. For cost functions with Lipschitz continuous Hessian, ARC has optimal iteration complexity, in the sense that it produces an iterate with gradient smaller than ε in O(1 / ε1.5) iterations. For the same price, it can also guarantee a Hessian with smallest eigenvalue larger than -ε. In this paper, we study a generalization of ARC to optimization on Riemannian manifolds. In particular, we generalize the iteration complexity results to this richer framework. Our central contribution lies in the identification of appropriate manifold-specific assumptions that allow us to secure these complexity guarantees both when using the exponential map and when using a general retraction. A substantial part of the paper is devoted to studying these assumptions—relevant beyond ARC—and providing user-friendly sufficient conditions for them. Numerical experiments are encouraging.
KW - Complexity
KW - Cubic regularization
KW - Lipschitz regularity
KW - Newton’s method
KW - Optimization on manifolds
UR - http://www.scopus.com/inward/record.url?scp=85084702034&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85084702034&partnerID=8YFLogxK
U2 - 10.1007/s10107-020-01505-1
DO - 10.1007/s10107-020-01505-1
M3 - Article
AN - SCOPUS:85084702034
SN - 0025-5610
VL - 188
SP - 85
EP - 134
JO - Mathematical Programming
JF - Mathematical Programming
IS - 1
ER -