NeurIPS 2020

Workshop: OPT2020: Optimization for Machine Learning

Courtney Paquette, Mark Schmidt, Sebastian Stich, Quanquan Gu, Martin Takac

2020-12-11T03:15:00-08:00 - 2020-12-11T16:30:00-08:00

Abstract: Optimization lies at the heart of many machine learning algorithms and enjoys great interest in our community. Indeed, this intimate relation of optimization with ML is the key motivation for the OPT series of workshops.

Looking back over the past decade, a strong trend is apparent: The intersection of OPT and ML has grown to the point that now cutting-edge advances in optimization often arise from the ML community. The distinctive feature of optimization within ML is its departure from textbook approaches, in particular, its focus on a different set of goals driven by "big-data, nonconvexity, and high-dimensions," where both theory and implementation are crucial.

We wish to use OPT 2020 as a platform to foster discussion, discovery, and dissemination of the state-of-the-art in optimization as relevant to machine learning. And well beyond that: as a platform to identify new directions and challenges that will drive future research, and continue to build the OPT+ML joint research community.

**Invited Speakers**
Volkan Cevher (EPFL)
Michael Friedlander (UBC)
Donald Goldfarb (Columbia)
Andreas Krause (ETH, Zurich)
Suvrit Sra (MIT)
Rachel Ward (UT Austin)
Ashia Wilson (MSR)
Tong Zhang (HKUST)

Please join us in gather.town for all breaks and poster sessions (for link see any abstract for a break or poster session, opens on December 11).

Video

Chat

Chat is not available.

Schedule

2020-12-11T03:15:00-08:00 - 2020-12-11T03:50:00-08:00

Welcome event (gather.town)

Quanquan Gu, Courtney Paquette, Mark Schmidt, Sebastian Stich, Martin Takac

Workshop: OPT2020: Optimization for Machine Learning

Courtney Paquette, Mark Schmidt, Sebastian Stich, Quanquan Gu, Martin Takac

Video

Chat

Chat is not available.

Schedule

Welcome event (gather.town)

Welcome remarks to Session 1

Invited speaker: The Convexity of Learning Infinite-width Deep Neural Networks, Tong Zhang

Live Q&A with Tong Zhang (Zoom)

Invited speaker: Adaptation and universality in first-order methods, Volkan Cevher

Live Q&A with Volkan Cevher (Zoom)

Contributed Video: Can We Find Near-Approximately-Stationary Points of Nonsmooth Nonconvex Functions?, Ohad Shamir

Contributed Video: Distributed Proximal Splitting Algorithms with Rates and Acceleration, Laurent Condat

Contributed Video: Employing No Regret Learners for Pure Exploration in Linear Bandits, Mohammadi Zaki

Contributed Video: Constraint-Based Regularization of Neural Networks, Tiffany Vlaar

Contributed Video: PAGE: A Simple and Optimal Probabilistic Gradient Estimator for Nonconvex Optimization, Zhize Li

Contributed talks in Session 1 (Zoom)

Break (gather.town)

Poster Session 1 (gather.town)

Welcome remarks to Session 2

Invited speaker: Adaptive Sampling for Stochastic Risk-Averse Learning, Andreas Krause

Live Q&A with Andreas Krause (Zoom)

Invited speaker: Practical Kronecker-factored BFGS and L-BFGS methods for training deep neural networks, Donald Goldfarb

Live Q&A with Donald Goldfarb (Zoom)

Contributed Video: Adaptive Gradient Methods Converge Faster with Over-Parameterization (and you can do a line-search), Sharan Vaswani

Contributed Video: How to make your optimizer generalize better, Sharan Vaswani

Contributed talks in Session 2 (Zoom)

Contributed Video: Stochastic Polyak Step-size for SGD: An Adaptive Learning Rate for Fast Convergence, Nicolas Loizou

Contributed Video: DDPNOpt: Differential Dynamic Programming Neural Optimizer, Guan-Horng Liu

Contributed Video: Adaptivity of Stochastic Gradient Methods for Nonconvex Optimization, Samuel Horvath

Break (gather.town)

Intro to Invited Speaker 5

Invited speaker: SGD without replacement: optimal rate analysis and more, Suvrit Sra

Live Q&A with Suvrit Sra (Zoom)

Poster Session 2 (gather.town)

Welcome remarks to Session 3

Invited speaker: Stochastic Geodesic Optimization, Ashia Wilson

Live Q&A with Ashia Wilson (Zoom)

Invited speaker: Concentration for matrix products, and convergence of Oja’s algorithm for streaming PCA, Rachel Ward

Live Q&A with Rachel Ward (Zoom)

Contributed Video: Learning Rate Annealing Can Provably Help Generalization, Even for Convex Problems, Preetum Nakkiran

Contributed Video: TenIPS: Inverse Propensity Sampling for Tensor Completion, Chengrun Yang

Contributed Video: Variance Reduction on Adaptive Stochastic Mirror Descent, Wenjie Li

Contributed Video: Incremental Greedy BFGS: An Incremental Quasi-Newton Method with Explicit Superlinear Rate, Zhan Gao

Contributed talks in Session 3 (Zoom)

Contributed Video: When Does Preconditioning Help or Hurt Generalization?, Denny Wu

Break (gather.town)

Invited speaker: Fast convergence of stochastic subgradient method under interpolation, Michael Friedlander

Intro to Invited Speaker 8

Live Q&A with Michael Friedlander (Zoom)

Poster Session 3 (gather.town)

Welcome remarks to Session 4

Invited speaker: Online nonnegative matrix factorization for Markovian and other real data, Deanna Needell and Hanbaek Lyu

Live Q&A with Deanna Needell and Hanbake Lyu (Zoom)

Contributed Video: A Study of Condition Numbers for First-Order Optimization, Charles Guille-Escuret

Contributed talks in Session 4 (Zoom)

Contributed Video: Stochastic Damped L-BFGS with controlled norm of the Hessian approximation, Sanae Lotfi

Contributed Video: On the Convergence of Adaptive Gradient Methods for Nonconvex Optimization, Dongruo Zhou

Contributed Video: Convex Programs for Global Optimization of Convolutional Neural Networks in Polynomial-Time, Tolga Ergen

Contributed Video: Affine-Invariant Analysis of Frank-Wolfe on Strongly Convex Sets, Lewis Liu

Closing remarks