Name: ESE Fall Seminar – “Acceleration by Stepsize Hedging”
Start: 2023-10-24T11:00:00-04:00
End: 2023-10-24T12:00:00-04:00
Location: Glandt Forum, Singh Center for Nanotechnology

ESE Fall Seminar – “Acceleration by Stepsize Hedging”

October 24, 2023 at 11:00 AM - 12:00 PM

Share this event

Add to Calendar

Details

Date: October 24, 2023

Time: 11:00 AM - 12:00 PM

Event Category: Colloquium

Event Tags:ESE

Organizer

Electrical and Systems Engineering

Phone: 215-898-6823

Email: eseevents@seas.upenn.edu

Website: View Organizer Website

Venue

Glandt Forum, Singh Center for Nanotechnology 3205 Walnut Street
Philadelphia
PA 19104 Google Map

View Venue Website

Can we accelerate convergence of gradient descent without changing the algorithm — just by optimizing stepsizes? Surprisingly, we show that the answer is yes. Our proposed Silver Stepsize Schedule optimizes strongly convex functions in $k^{\log_p 2} = k^{0.7864}$ iterations, where $p=1+\sqrt{2}$ is the silver ratio and $k$ is the condition number. This is intermediate between the textbook unaccelerated rate $k$ and the accelerated rate $\sqrt{k}$ due to Nesterov in 1983. The non-strongly convex setting is conceptually identical, and we obtain an analogous accelerated rate $\eps^{-\log_p 2} = \eps^{-0.7864}$. We conjecture and provide partial evidence that these rates are optimal among all possible stepsize schedules.

The Silver Stepsize Schedule is an explicit non-monotonic fractal. Why should such stepsizes help? The core intuition is “hedging” between individually suboptimal strategies — short steps and long steps — since bad cases for the former are good cases for the latter, and vice versa. Properly combining these stepsizes yields faster convergence due to the misalignment of worst-case functions. This talk is based on joint work with Pablo Parrilo that publishes and extends my 2018 Master’s Thesis — which established for the first time that judiciously chosen stepsizes can enable accelerated convex optimization. Prior to this thesis, the only such result was for the special case of quadratics, due to Young in 1953.

ESE Fall Seminar – “Acceleration by Stepsize Hedging”

October 24, 2023 at 11:00 AM - 12:00 PM

Details

Organizer

Venue

Read More