Workshop
OPT 2022: Optimization for Machine Learning
Courtney Paquette · Sebastian Stich · Quanquan Gu · Cristóbal Guzmán · John Duchi
Room 295–296
OPT 2022 will bring together experts in optimization to share their perspectives, alongside crossover experts in ML sharing their views and recent advances. OPT 2022 honors the tradition of bringing together people from optimization and from ML in order to promote and generate new interactions between the two communities.
To foster the spirit of innovation and collaboration that is a goal of this workshop, OPT 2022 will focus the contributed talks on research in Reliable Optimization Methods for ML. Many optimization algorithms for ML were originally developed to handle computational constraints (e.g., stochastic gradient-based algorithms), and their analyses followed the classical optimization approach of measuring performance by (i) computational cost and (ii) convergence for any input to the algorithm. As engineering capabilities increase and ML is widely adopted in real-world applications, practitioners are seeking optimization algorithms that go beyond finding the minimizer as fast as possible: they want reliable methods that handle the complications that arise in the real world. For example, bad actors increasingly attempt to fool models with deceptive data. This raises questions such as: which algorithms are more robust to adversarial attacks, and can one design new algorithms that thwart such attacks? The latter question motivates a new area of optimization focused on game-theoretic environments, that is, environments where competing forces are at play, and on devising guarantees for them. Beyond this, a main reason for the success of ML is that optimization algorithms seemingly generate points that learn from training data; that is, we want minimizers of the training objective to remain meaningful on new data (generalization), yet we do not understand which features (e.g., the loss function, the algorithm, the depth of the architecture in deep learning, and/or the training samples) yield better generalization properties. These new areas of solving practical ML problems, and their deep ties to the optimization community, warrant a discussion between the two communities.
Specifically, we aim to discuss the meaning of generalization as well as the challenges facing real-world applications of ML and the new paradigms for optimizers seeking to solve them.
Plenary Speakers: All invited speakers have agreed to attend the workshop in person.
* Niao He (ETH, Zurich, assistant professor)
* Zico Kolter (Carnegie Mellon University, associate professor)
* Lorenzo Rosasco (U Genova/MIT, assistant professor)
* Katya Scheinberg (Cornell, full professor)
* Aaron Sidford (Stanford, assistant professor)
Schedule
Sat 6:55 a.m. – 7:00 a.m.

Welcome Remarks (Intro)
Courtney Paquette
Sat 7:00 a.m. – 7:30 a.m.

Katya Scheinberg, Stochastic Oracles and Where to Find Them (Plenary Speaker)
Title: Stochastic Oracles and Where to Find Them
Abstract: Continuous optimization is a mature field, which has recently undergone major expansion and change. One of the key new directions is the development of methods that do not require exact information about the objective function. Nevertheless, the majority of these methods, from stochastic gradient descent to "zeroth-order" methods, use some kind of approximate first-order information. We will introduce a general definition of a stochastic oracle and show how this definition applies in a variety of familiar settings, including simple stochastic gradients via sampling and traditional and randomized finite-difference methods, but also more specialized settings, such as robust gradient estimation. We will also overview several stochastic methods and how the general definition extends to the oracles used by these methods.
Katya Scheinberg
Sat 7:30 a.m. – 8:00 a.m.

Contributed Talks 1 (Contributed talks)
Two (15 min) contributed talks in this session.
Courtney Paquette · Tian Li · Guy Kornowski
Sat 8:00 a.m. – 9:00 a.m.

Poster Session 1 (Poster Session)
Andrew Lowy · Thomas Bonnier · Yiling Xie · Guy Kornowski · Simon Schug · Seungyub Han · Nicolas Loizou · Xinwei Zhang · Laurent Condat · Tabea E. Röber · Si Yi Meng · Marco Mondelli · Runlong Zhou · Eshaan Nichani · Adrian Goldwaser · Rudrajit Das · Kayhan Behdin · Atish Agarwala · Mukul Gagrani · Gary Cheng · Tian Li · Haoran Sun · Hossein Taheri · Allen Liu · Siqi Zhang · Dmitrii Avdiukhin · Bradley Brown · Miaolan Xie · Junhyung Lyle Kim · Sharan Vaswani · Xinmeng Huang · Ganesh Ramachandra Kini · Angela Yuan · Weiqiang Zheng · Jiajin Li

Sat 9:00 a.m. – 9:30 a.m.

Contributed Talks 2 (Contributed talks)
Two (15 min) contributed talks in this session.
Quanquan Gu · Aaron Defazio · Jiajin Li
Sat 9:30 a.m. – 10:00 a.m.

Niao He, Simple Fixes for Adaptive Gradient Methods for Nonconvex Min-Max Optimization (Plenary Speaker)
Title: Simple Fixes for Adaptive Gradient Methods for Nonconvex Min-Max Optimization
Abstract: Adaptive gradient methods such as AdaGrad and Adam have shown their ability to adjust the stepsizes on the fly in a parameter-agnostic manner and are successful in nonconvex minimization. When it comes to nonconvex minimax optimization, direct extensions of such adaptive optimizers without proper timescale separation may fail to work in practice. In fact, even for a quadratic example, the naive combination of gradient descent ascent with any existing adaptive stepsizes is proven to diverge if the initial primal-dual stepsize ratio is not carefully chosen. We introduce two simple fixes for these adaptive methods, allowing automatic adaptation to the timescale separation necessary for fast convergence. The resulting algorithms are fully parameter-agnostic and achieve near-optimal complexities in deterministic and stochastic settings of nonconvex-strongly-concave minimax problems, without a priori knowledge about problem-specific parameters. This is based on joint work with Junchi Yang and Xiang Li.
Niao He
Sat 10:00 a.m. – 12:00 p.m.

Lunch (Break)
Sat 12:00 p.m. – 12:30 p.m.

Zico Kolter, Adapt like you train: How optimization at training time affects model fine-tuning and adaptation (Plenary Speaker)
Title: Adapt like you train: How optimization at training time affects model fine-tuning and adaptation
Abstract: With the growing use of large-scale machine learning models pretrained on massive datasets, it is becoming increasingly important to understand how we can efficiently adapt these models to downstream tasks at test time. In this talk, I will discuss our recent work that highlights an important but often overlooked factor in this process: specifically, we have found in several cases that the loss function used to train the model has important implications as to the best way to fine-tune or adapt the model. I will highlight two specific examples of this phenomenon: 1) illustrating that using a contrastive loss outperforms alternatives for fine-tuning contrastively-pretrained vision-language models; and 2) showing how we can leverage the convex conjugate of the training loss to perform label-free test-time adaptation. I will end by highlighting open questions and directions for this work.
J. Zico Kolter
Sat 12:30 p.m. – 1:15 p.m.

Contributed Talks 3 (Contributed talks)
Three (15 min) contributed talks in this session.
Cristóbal Guzmán · Fangshuo Liao · Vishwak Srinivasan · Zhiyuan Li
Sat 1:15 p.m. – 1:45 p.m.

Aaron Sidford, Efficiently Minimizing the Maximum Loss (Plenary Speaker)
Title: Efficiently Minimizing the Maximum Loss
Abstract: In this talk I will discuss recent advances in the fundamental robust optimization problem of minimizing the maximum of a finite number of convex loss functions. In particular, I will show how to develop stochastic methods for approximately solving this problem with a near-optimal number of gradient queries. Along the way, I will cover several optimization techniques of broader utility, including accelerated methods for using ball-optimization oracles and stochastic bias-reduced gradient methods. This talk will include joint work with Hilal Asi, Yair Carmon, Arun Jambulapati, and Yujia Jin, including https://arxiv.org/abs/2105.01778 and https://arxiv.org/abs/2106.09481.
Aaron Sidford
Sat 1:45 p.m. – 1:50 p.m.

Closing Remarks (Closing)
Courtney Paquette
Sat 1:50 p.m. – 2:50 p.m.

Poster Session 2 (Poster Session)
Jinwuk Seok · Bo Liu · Ryotaro Mitsuboshi · David Martinez-Rubio · Weiqiang Zheng · Ilgee Hong · Chen Fan · Kazusato Oko · Bo Tang · Miao Cheng · Aaron Defazio · Tim G. J. Rudner · Gabriele Farina · Vishwak Srinivasan · Ruichen Jiang · Peng Wang · Jane Lee · Nathan Wycoff · Nikhil Ghosh · Yinbin Han · David Mueller · Liu Yang · Amrutha Varshini Ramesh · Siqi Zhang · Kaifeng Lyu · David Yunis · Kumar Kshitij Patel · Fangshuo Liao · Dmitrii Avdiukhin · Xiang Li · Sattar Vakili · Jiaxin Shi



A Finite-Particle Convergence Rate for Stein Variational Gradient Descent (Poster)
link
We provide the first finite-particle convergence rate for Stein variational gradient descent (SVGD). Specifically, with certain choices of step-size sequences, SVGD with $n$ particles drives the kernel Stein discrepancy to zero at the rate $O\left(\frac{1}{\sqrt{\log\log n}}\right)$. We suspect that the dependence on $n$ can be improved, and we hope that our explicit, non-asymptotic proof strategy will serve as a template for future refinements.
Jiaxin Shi · Lester Mackey
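To make the dynamics being analyzed concrete, here is a minimal NumPy sketch of one SVGD update with an RBF kernel; the bandwidth, step size, and standard-normal target below are illustrative assumptions, not the paper's setup:

```python
import numpy as np

def svgd_update(x, grad_logp, stepsize=0.1, h=1.0):
    """One SVGD step: each particle moves along a kernel-weighted average of the
    scores plus a repulsive term that keeps the particles spread apart."""
    n = x.shape[0]
    diff = x[:, None, :] - x[None, :, :]                 # pairwise differences x_i - x_j
    K = np.exp(-(diff ** 2).sum(-1) / (2 * h))           # RBF kernel matrix k(x_j, x_i)
    drive = K @ grad_logp(x)                             # kernel-smoothed score term
    repulse = ((diff / h) * K[:, :, None]).sum(axis=1)   # sum_j grad_{x_j} k(x_j, x_i)
    return x + stepsize * (drive + repulse) / n

# Illustration: push 20 particles toward a standard normal target (score = -x).
rng = np.random.default_rng(0)
x = rng.normal(loc=2.0, size=(20, 1))
for _ in range(300):
    x = svgd_update(x, lambda z: -z)
```

The repulsive term is what distinguishes SVGD from running n independent gradient ascents on log p: it prevents the particles from collapsing onto the mode.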


Data-heterogeneity-aware Mixing for Decentralized Learning (Poster)
link
Decentralized learning provides an effective framework to train machine learning models with data distributed over arbitrary communication graphs. However, most existing approaches to decentralized learning disregard the interaction between data heterogeneity and graph topology. In this paper, we characterize the dependence of convergence on the relationship between the mixing weights of the graph and the data heterogeneity across nodes. We propose a metric that quantifies the ability of a graph to mix the current gradients. We further prove that the metric controls the convergence rate, particularly in settings where the heterogeneity across nodes dominates the stochasticity between updates for a given node. Motivated by our analysis, we propose an approach that periodically and efficiently optimizes the metric using standard convex constrained optimization and sketching techniques.
Yatin Dandi · Anastasiia Koloskova · Martin Jaggi · Sebastian Stich


Rieoptax: Riemannian Optimization in JAX (Poster)
link
We present Rieoptax, an open-source Python library for Riemannian optimization in JAX. We show that many differential-geometric primitives, such as Riemannian exponential and logarithm maps, are usually faster in Rieoptax than in existing frameworks in Python, both on CPU and GPU. We support a wide range of basic and advanced stochastic optimization solvers, such as Riemannian stochastic gradient, stochastic variance reduction, and adaptive gradient methods. A distinguishing feature of the proposed toolbox is that we also support differentially private optimization on Riemannian manifolds.
Saiteja Utpala · Andi Han · Pratik Kumar Jawanpuria · Bamdev Mishra
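As a plain-NumPy illustration of the kind of primitive such a toolbox provides (this is not Rieoptax's API), a Riemannian gradient step on the unit sphere projects the Euclidean gradient onto the tangent space and then retracts back onto the manifold:

```python
import numpy as np

def sphere_rgrad(egrad, x):
    # Riemannian gradient on the unit sphere: remove the radial component of egrad.
    return egrad - np.dot(egrad, x) * x

def sphere_retract(x, v):
    # Retraction: take the tangent step, then renormalize back onto the sphere.
    y = x + v
    return y / np.linalg.norm(y)

def rsgd_step(grad_fn, x, lr=0.1):
    # One Riemannian (stochastic) gradient descent step.
    return sphere_retract(x, -lr * sphere_rgrad(grad_fn(x), x))

# Illustration: minimizing -x^T A x on the sphere recovers the top eigenvector of A.
A = np.diag([2.0, 1.0])
x = np.array([0.6, 0.8])
for _ in range(200):
    x = rsgd_step(lambda z: -2.0 * A @ z, x)
```

The same project-then-retract pattern underlies Riemannian SGD on other matrix manifolds; only the projection and retraction formulas change.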


TorchOpt: An Efficient Library for Differentiable Optimization (Poster)
link
Recent years have witnessed the booming of various differentiable optimization algorithms. These algorithms exhibit different execution patterns, and their execution needs massive computational resources that go beyond a single CPU and GPU. Existing differentiable optimization libraries, however, cannot support efficient algorithm development and multi-CPU/GPU execution, making the development of differentiable optimization algorithms often cumbersome and expensive. This paper introduces TorchOpt, a PyTorch-based efficient library for differentiable optimization. TorchOpt provides a unified and expressive bilevel optimization programming abstraction. This abstraction allows users to efficiently declare and analyze various differentiable optimization programs with explicit gradients, implicit gradients, and zero-order gradients. TorchOpt further provides a high-performance distributed execution runtime. This runtime can fully parallelize computation-intensive differentiation operations (e.g., tensor tree flattening) on CPUs/GPUs and automatically distribute computation to distributed devices. Experimental results show that TorchOpt outperforms state-of-the-art libraries by $7\times$ on an 8-GPU server. TorchOpt will be open-sourced under the permissive Apache-2.0 License.
Jie Ren · Xidong Feng · Bo Liu · Xuehai Pan · Yao Fu · Luo Mai · Yaodong Yang


On the Implicit Geometry of Cross-Entropy Parameterizations for Label-Imbalanced Data (Poster)
link
It has been empirically observed that training large models with weighted cross-entropy (CE) beyond the zero-training-error regime is not a satisfactory remedy for label-imbalanced data. Instead, researchers have proposed the vector-scaling (VS) loss, a parameterization of the CE loss that is tailored to this modern training regime. The driving force behind understanding the impact of such parameterizations on the gradient-descent path has been the theory of implicit bias. Specifically for linear(ized) models, this theory allows one to explain why weighted CE fails and how the VS loss biases the optimization path towards solutions that favor minorities. However, beyond linear models the description of implicit bias is more obscure. In order to gain insight into the impact of different CE parameterizations in nonlinear models, we investigate the implicit geometry of the learnt classifiers and embeddings. Our main result characterizes the global minimizers of a nonconvex cost-sensitive SVM classifier for the so-called unconstrained-features model, which serves as an abstraction of deep models. We also study empirically the convergence of SGD to this global minimizer, observing slowdowns with increasing imbalance ratios and scalings of the loss hyperparameters.
Tina Behnia · Ganesh Ramachandra Kini · Vala Vakilian · Christos Thrampoulidis


Rethinking Sharpness-Aware Minimization as Variational Inference (Poster)
link
Sharpness-aware minimisation (SAM) aims to improve the generalisation of gradient-based learning by seeking out flat minima. In this work, we establish connections between SAM and mean-field variational inference (MFVI) of neural network parameters. We show that both of these methods have interpretations as optimizing notions of flatness, and that when using the reparametrisation trick, they both boil down to calculating the gradient at a perturbed version of the current mean parameter. This thinking motivates our study of algorithms that combine or interpolate between SAM and MFVI. We evaluate the proposed variational algorithms on several benchmark datasets, and compare their performance to variants of SAM. Taking a broader perspective, our work suggests that SAM-like updates can be used as a drop-in replacement for the reparametrisation trick.
Szilvia Ujváry · Zsigmond Telek · Anna Kerekes · Anna Mészáros · Ferenc Huszar
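For reference, the "gradient at a perturbed version of the current parameter" structure of SAM can be sketched in a few lines; the quadratic toy loss and the hyperparameters here are illustrative assumptions, not from the paper:

```python
import numpy as np

def sam_step(w, grad_fn, lr=0.1, rho=0.05):
    """One SAM update (in the style of Foret et al.): evaluate the gradient at the
    adversarially perturbed point w + rho * g / ||g||, then step from w with it."""
    g = grad_fn(w)
    eps = rho * g / (np.linalg.norm(g) + 1e-12)  # ascent direction toward higher loss
    g_sam = grad_fn(w + eps)                     # gradient at the perturbed parameters
    return w - lr * g_sam

# Toy example: quadratic loss f(w) = 0.5 * ||w||^2, so grad f(w) = w.
w = np.array([1.0, -2.0])
for _ in range(50):
    w = sam_step(w, lambda z: z)
```

The MFVI connection drawn in the abstract replaces the worst-case perturbation `eps` with a random (reparametrised) one around the mean parameter.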


TiAda: A Timescale Adaptive Algorithm for Nonconvex Minimax Optimization (Poster)
link
Adaptive gradient methods have shown their ability to adjust the stepsizes on the fly in a parameter-agnostic manner, and empirically achieve faster convergence for solving minimization problems. When it comes to nonconvex minimax optimization, however, current convergence analyses of gradient descent ascent (GDA) combined with adaptive stepsizes require careful tuning of hyperparameters and knowledge of problem-dependent parameters. Such a discrepancy arises from the primal-dual nature of minimax problems and the necessity of delicate timescale separation between the primal and dual updates in attaining convergence. In this work, we propose a single-loop adaptive GDA algorithm called TiAda for nonconvex minimax optimization that automatically adapts to the timescale separation. Our algorithm is fully parameter-agnostic and can achieve near-optimal complexities simultaneously in deterministic and stochastic settings of nonconvex-strongly-concave minimax problems. The effectiveness of the proposed method is further justified numerically for a number of machine learning applications.
Xiang Li · Junchi Yang · Niao He
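The timescale-separation issue can be illustrated with a generic AdaGrad-style GDA skeleton. This is not TiAda itself (TiAda removes the need for the hand-picked ratio below); the toy objective f(x, y) = xy - y²/2 and the stepsizes are assumptions for illustration:

```python
import numpy as np

def adagrad_gda(grad_x, grad_y, x, y, steps=5000, eta_x=0.05, eta_y=0.5):
    """AdaGrad-style gradient descent ascent with a hand-tuned faster dual
    timescale (eta_y >> eta_x) -- the kind of primal-dual stepsize ratio that,
    per the abstract, must otherwise be chosen carefully to avoid divergence."""
    vx, vy = 1e-8, 1e-8
    for _ in range(steps):
        gx, gy = grad_x(x, y), grad_y(x, y)
        vx, vy = vx + gx ** 2, vy + gy ** 2
        x = x - eta_x * gx / np.sqrt(vx)   # slow primal (descent) update
        y = y + eta_y * gy / np.sqrt(vy)   # fast dual (ascent) update
    return x, y

# Toy strongly-concave-in-y saddle: f(x, y) = x*y - 0.5*y**2, saddle point (0, 0).
x, y = adagrad_gda(lambda x, y: y, lambda x, y: x - y, 1.0, 1.0)
```

With the dual running faster, y tracks its best response y = x while x slowly descends; shrinking eta_y toward eta_x degrades the tracking and can stall or destabilize the primal iterate.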


TiAda: A Timescale Adaptive Algorithm for Nonconvex Minimax Optimization (Oral)
link
Adaptive gradient methods have shown their ability to adjust the stepsizes on the fly in a parameter-agnostic manner, and empirically achieve faster convergence for solving minimization problems. When it comes to nonconvex minimax optimization, however, current convergence analyses of gradient descent ascent (GDA) combined with adaptive stepsizes require careful tuning of hyperparameters and knowledge of problem-dependent parameters. Such a discrepancy arises from the primal-dual nature of minimax problems and the necessity of delicate timescale separation between the primal and dual updates in attaining convergence. In this work, we propose a single-loop adaptive GDA algorithm called TiAda for nonconvex minimax optimization that automatically adapts to the timescale separation. Our algorithm is fully parameter-agnostic and can achieve near-optimal complexities simultaneously in deterministic and stochastic settings of nonconvex-strongly-concave minimax problems. The effectiveness of the proposed method is further justified numerically for a number of machine learning applications.
Xiang Li · Junchi Yang · Niao He


Toward Understanding Why Adam Converges Faster Than SGD for Transformers (Poster)
link
While stochastic gradient descent (SGD) is still the most popular optimization algorithm in deep learning, adaptive algorithms such as Adam have established empirical advantages over SGD in some deep learning applications such as training transformers. However, it remains a question why Adam converges significantly faster than SGD in these scenarios. In this paper, we explore one explanation of why Adam converges faster than SGD using a new concept, directional sharpness. We argue that the performance of optimization algorithms is closely related to the directional sharpness of the update steps, and show that SGD has much worse directional sharpness than adaptive algorithms. We further observe that only a small fraction of the coordinates cause the bad sharpness and slow convergence of SGD, and propose to use coordinate-wise clipping as a solution for SGD and other optimization algorithms. We demonstrate the effect of coordinate-wise clipping in reducing sharpness and speeding up the convergence of optimization algorithms under various settings, and conclude that the sharpness-reduction effect of adaptive coordinate-wise scaling is the reason for Adam's success in practice.
Yan Pan · Yuanzhi Li
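A minimal sketch of the coordinate-wise clipping idea; the threshold `tau` and the toy gradient are illustrative choices, not the paper's settings:

```python
import numpy as np

def clipped_sgd_step(w, g, lr=0.1, tau=1.0):
    # Coordinate-wise clipping: cap each coordinate of the gradient at tau,
    # limiting the influence of the few coordinates with extreme magnitudes.
    return w - lr * np.clip(g, -tau, tau)

# One badly scaled coordinate no longer dominates the update direction.
w = np.zeros(2)
g = np.array([100.0, 0.5])   # one coordinate is 200x larger than the other
w = clipped_sgd_step(w, g)   # -> [-0.1, -0.05]
```

Like Adam's per-coordinate normalization, this rescales only the offending coordinates instead of shrinking the whole update as global norm clipping would.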


Bidirectional Adaptive Communication for Heterogeneous Distributed Learning (Poster)
link
Communication is a key bottleneck in distributed optimization, and, in particular, bandwidth and latency can be limiting factors when devices are connected over commodity networks, such as in federated learning. State-of-the-art techniques tackle these challenges with advanced compression techniques or by delaying communication rounds according to predefined schedules. We present a new scheme that adaptively skips communication (broadcast and client uploads) by detecting slow-varying updates. The scheme automatically adjusts the communication frequency independently for each worker and the server. By utilizing an error-feedback mechanism borrowed from the compression literature, we prove that the convergence rate is the same as for batch gradient descent in the convex and nonconvex smooth cases. We show that the total number of communication rounds between server and clients needed to achieve a targeted accuracy is reduced, even when the data distribution is highly non-IID.
Dmitrii Avdiukhin · Vladimir Braverman · Nikita Ivkin · Sebastian Stich
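The error-feedback mechanism referenced above can be sketched as follows; the top-1 sparsifier and quadratic example are illustrative assumptions from the compression literature, not the paper's skipping scheme:

```python
import numpy as np

def top1_compress(v):
    # Keep only the largest-magnitude coordinate (an aggressive sparsifier).
    out = np.zeros_like(v)
    i = np.argmax(np.abs(v))
    out[i] = v[i]
    return out

def ef_sgd(grad_fn, w, steps=200, lr=0.1):
    """Error-feedback SGD: the compression error is stored locally and added
    back to the next update, so dropped information is eventually transmitted."""
    e = np.zeros_like(w)
    for _ in range(steps):
        u = lr * grad_fn(w) + e   # proposed update plus accumulated error
        c = top1_compress(u)      # what actually gets communicated
        e = u - c                 # remember what was dropped
        w = w - c
    return w

# Toy quadratic f(w) = 0.5 * ||w||^2, so grad f(w) = w.
w = ef_sgd(lambda z: z, np.array([1.0, 0.5]))
```

Without the residual `e`, the smaller coordinate would be starved indefinitely; with it, the method matches plain gradient descent up to a delayed tail.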


How Sharpness-Aware Minimization Minimizes Sharpness? (Poster)
link
Sharpness-Aware Minimization (SAM) is a highly effective regularization technique for improving the generalization of deep neural networks in various settings. However, the underlying workings of SAM remain elusive because of various intriguing approximations in its theoretical characterizations. SAM intends to penalize a notion of sharpness of the model but implements a computationally efficient variant; moreover, a third notion of sharpness was used for proving generalization guarantees. The subtle differences between these notions of sharpness can indeed lead to significantly different empirical results. This paper rigorously nails down the exact sharpness notion that SAM regularizes and clarifies the underlying mechanism. We also show that the two steps of approximation in the original motivation of SAM individually lead to inaccurate local conclusions, but their combination accidentally reveals the correct effect when full-batch gradients are applied. Furthermore, we prove that the stochastic version of SAM in fact regularizes another notion of sharpness, which is most likely to be the preferred notion for practical performance. The key mechanism behind this intriguing phenomenon is the implicit alignment between the gradient and the top eigenvector of the Hessian when running SAM.
Kaiyue Wen · Tengyu Ma · Zhiyuan Li


How Sharpness-Aware Minimization Minimizes Sharpness? (Oral)
link
Sharpness-Aware Minimization (SAM) is a highly effective regularization technique for improving the generalization of deep neural networks in various settings. However, the underlying workings of SAM remain elusive because of various intriguing approximations in its theoretical characterizations. SAM intends to penalize a notion of sharpness of the model but implements a computationally efficient variant; moreover, a third notion of sharpness was used for proving generalization guarantees. The subtle differences between these notions of sharpness can indeed lead to significantly different empirical results. This paper rigorously nails down the exact sharpness notion that SAM regularizes and clarifies the underlying mechanism. We also show that the two steps of approximation in the original motivation of SAM individually lead to inaccurate local conclusions, but their combination accidentally reveals the correct effect when full-batch gradients are applied. Furthermore, we prove that the stochastic version of SAM in fact regularizes another notion of sharpness, which is most likely to be the preferred notion for practical performance. The key mechanism behind this intriguing phenomenon is the implicit alignment between the gradient and the top eigenvector of the Hessian when running SAM.
Kaiyue Wen · Tengyu Ma · Zhiyuan Li


Strong Lottery Ticket Hypothesis with $\epsilon$-perturbation (Poster)
link
The strong Lottery Ticket Hypothesis (LTH) claims that there exists a subnetwork in a sufficiently large, randomly initialized neural network that approximates a target neural network without the need for training. This work extends the theoretical guarantees of the strong LTH literature to a scenario more similar to the original LTH, by generalizing the weight change achieved in the pretraining step to a perturbation around the initialization. In particular, we focus on the following open questions: By allowing an $\varepsilon$-scale perturbation on the random initial weights, can we reduce the overparameterization requirement for the candidate network in the strong LTH? Furthermore, does the weight change induced by SGD coincide with a good set of such perturbations?
Fangshuo Liao · Zheyang Xiong · Anastasios Kyrillidis


Strong Lottery Ticket Hypothesis with $\epsilon$-perturbation (Oral)
link
The strong Lottery Ticket Hypothesis (LTH) claims that there exists a subnetwork in a sufficiently large, randomly initialized neural network that approximates a target neural network without the need for training. This work extends the theoretical guarantees of the strong LTH literature to a scenario more similar to the original LTH, by generalizing the weight change achieved in the pretraining step to a perturbation around the initialization. In particular, we focus on the following open questions: By allowing an $\varepsilon$-scale perturbation on the random initial weights, can we reduce the overparameterization requirement for the candidate network in the strong LTH? Furthermore, does the weight change induced by SGD coincide with a good set of such perturbations?
Fangshuo Liao · Zheyang Xiong · Anastasios Kyrillidis


Optimization for Robustness Evaluation beyond ℓp Metrics (Poster)
link
Empirical evaluations of neural network models against adversarial attacks entail solving nontrivial constrained optimization problems. Popular algorithms for solving these constrained problems rely on projected gradient descent (PGD) and require careful tuning of multiple hyperparameters. Moreover, PGD can only handle $\ell_1$, $\ell_2$, and $\ell_\infty$ attacks due to the use of analytical projectors. In this paper, we introduce an alternative algorithmic framework that blends the general-purpose constrained-optimization solver PyGRANSO With Constraint-Folding (PWCF) to add reliability and generality to the existing adversarial evaluations. PWCF 1) finds good-quality solutions without delicate tuning of multiple hyperparameters; and 2) can handle general attack models that are inaccessible to the existing algorithms, e.g., $\ell_{p > 0}$ and perceptual attacks.
Hengyue Liang · Buyun Liang · Ying Cui · Tim Mitchell · Ju Sun


Neural Networks Efficiently Learn Low-Dimensional Representations with SGD (Poster)
link
We study the problem of training a two-layer neural network (NN) of arbitrary width using stochastic gradient descent (SGD), where the input $\boldsymbol{x}\in \mathbb{R}^d$ is Gaussian and the target $y \in \mathbb{R}$ follows a multiple-index model, i.e., $y=g(\langle\boldsymbol{u_1},\boldsymbol{x}\rangle,...,\langle\boldsymbol{u_k},\boldsymbol{x}\rangle)$ with a noisy link function $g$. We prove that the first-layer weights of the NN converge to the $k$-dimensional principal subspace spanned by the vectors $\boldsymbol{u_1},...,\boldsymbol{u_k}$ of the true model, when online SGD with weight decay is used for training. This phenomenon has several important consequences when $k \ll d$. First, by employing uniform convergence on this smaller subspace, we establish a generalization error bound of $\mathcal{O}(\sqrt{{kd}/{T}})$ after $T$ iterations of SGD, which is independent of the width of the NN. We further demonstrate that SGD-trained ReLU NNs can learn a single-index target of the form $y=f(\langle\boldsymbol{u},\boldsymbol{x}\rangle) + \epsilon$ by recovering the principal direction, with a sample complexity linear in $d$ (up to log factors), where $f$ is a monotonic function with at most polynomial growth and $\epsilon$ is the noise. This is in contrast to the known $d^{\Omega(p)}$ sample requirement to learn any degree-$p$ polynomial in the kernel regime, and it shows that NNs trained with SGD can outperform the neural tangent kernel at initialization.
Alireza Mousavi-Hosseini · Sejun Park · Manuela Girotti · Ioannis Mitliagkas · Murat Erdogdu


Nesterov Meets Optimism: Rate-Optimal Optimistic-Gradient-Based Method for Stochastic Bilinearly-Coupled Minimax Optimization (Poster)
link
We provide a novel first-order optimization method for bilinearly-coupled strongly-convex-concave minimax optimization, which we call Accelerated Optimistic Gradient (AG-OG). The main idea of our algorithm is to leverage the structure of the considered minimax problem by using Nesterov's acceleration on the individual parts and optimism on the coupling term of the objective. We first motivate our method by showing that the dynamics of its continuous version correspond to a linear combination of the ODE dynamics of Optimistic Gradient and of Nesterov's acceleration. This continuous-time approach allows us to showcase the main properties of our method that will eventually guide our analysis in the discrete case. Furthermore, by properly restarting AG-OG, we show that we can achieve optimal (up to a constant) convergence rates with respect to the conditioning of the coupling and individual parts in the stochastic and deterministic cases.
Chris Junchi Li · Angela Yuan · Gauthier Gidel · Michael Jordan


Distributed Online and Bandit Convex Optimization (Poster)
link
We study the problems of distributed online and bandit convex optimization against an adaptive adversary. We want to minimize the average regret on $M$ machines that communicate $R$ times intermittently. We show that collaboration is not beneficial if the machines can access gradients of the cost functions at each time step, i.e., they have first-order feedback. In this setting, simple non-collaborative algorithms are minimax optimal. This contrasts with the provable benefit of collaboration when optimizing against a stochastic adversary, which samples the cost functions from fixed distributions. To identify the benefit of collaboration, we consider the harder setting where the machines can only access values of their cost functions, i.e., they have bandit feedback. Here, we identify the high-dimensional regime where collaboration is beneficial and may even lead to a linear speedup in the number of machines. Our results bridge the gap between distributed online optimization against stochastic and adaptive adversaries.
Kumar Kshitij Patel · Aadirupa Saha · Nati Srebro · Lingxiao Wang


On Convexity and Linear Mode Connectivity in Neural Networks (Poster)
link
In many cases, neural networks trained with stochastic gradient descent (SGD) that share an early and often small portion of the training trajectory have solutions connected by a linear path of low loss. This phenomenon, called linear mode connectivity (LMC), has been leveraged for pruning and model averaging in large neural network models, but it is not well understood how broadly or why it occurs. LMC suggests that SGD trajectories somehow end up in a "convex" region of the loss landscape and stay there. In this work, we confirm that this eventually does happen by finding a high-dimensional convex hull of low loss between the endpoints of several SGD trajectories. But to our surprise, simple measures of convexity do not show any obvious transition at the point when SGD will converge into this region. To understand this convex hull better, we investigate the functional behaviors of its endpoints. We find that only a small number of correct predictions are shared between all endpoints of a hull, and an even smaller number of correct predictions are shared between hulls, even when the final accuracy is high for every endpoint. Thus, we tie LMC more tightly to convexity, and raise several new questions about the source of this convexity in neural network optimization.
David Yunis · Kumar Kshitij Patel · Pedro Savarese · Gal Vardi · Jonathan Frankle · Matthew Walter · Karen Livescu · Michael Maire


Optimal Complexity in Non-Convex Decentralized Learning over Time-Varying Networks (Poster)
link
Decentralized optimization with time-varying networks is an emerging paradigm in machine learning. It saves remarkable communication overhead in large-scale deep training and is more robust in wireless scenarios, especially when nodes are moving. Federated learning can also be regarded as decentralized optimization with time-varying communication patterns alternating between global averaging and local updates. While numerous studies exist to clarify its theoretical limits and develop efficient algorithms, it remains unclear what the optimal complexity is for nonconvex decentralized stochastic optimization over time-varying networks. The main difficulties lie in how to gauge the effectiveness of transmitting messages between two nodes via time-varying communications, and how to establish the lower bound when the network size is fixed (which is a prerequisite in stochastic optimization). This paper resolves these challenges and establishes the first complexity lower bound. We also develop a new decentralized algorithm to nearly attain the lower bound, showing the tightness of the lower bound and the optimality of our algorithm.
Xinmeng Huang · Kun Yuan


Target-based Surrogates for Stochastic Optimization
(
Poster
)
link
SlidesLive Video
We consider minimizing functions for which it is computationally expensive to query the (stochastic) gradient. Such functions are prevalent in applications like reinforcement learning, online imitation learning and bilevel optimization. We exploit the composite structure in these functions and propose a target optimization framework. Our framework leverages the smoothness of the loss with respect to an intermediate target space (e.g. the output of a neural network model), and uses gradient information to construct surrogate functions. In the full-batch setting, we prove that the surrogate function is a global upper bound on the overall loss, and can be (locally) minimized using any black-box optimization algorithm. We prove that the resulting majorization-minimization algorithm ensures convergence to a stationary point at an $O\left(\frac{1}{T}\right)$ rate, thus matching gradient descent. In the stochastic setting, we propose a stochastic surrogate optimization (SSO) algorithm that can be viewed as projected stochastic gradient descent in the target space. We leverage this connection in order to prove that SSO can match the SGD rate for strongly-convex functions. Experimentally, we evaluate the SSO algorithm on convex supervised learning losses and show competitive performance compared to SGD and its variants.

Jonathan Lavington · Sharan Vaswani · Reza Babanezhad Harikandeh · Mark Schmidt · Nicolas Le Roux 🔗 


Why (and When) does Local SGD Generalize Better than SGD?
(
Poster
)
link
SlidesLive Video Local SGD is a communication-efficient variant of SGD for large-scale training, where multiple GPUs perform SGD independently and average the model parameters periodically. It has been recently observed that Local SGD can not only achieve the design goal of reducing the communication overhead but also lead to higher test accuracy than the corresponding SGD baseline (Lin et al., 2020b), though the training regimes for this to happen are still in debate (Ortiz et al., 2021). This paper aims to understand why (and when) Local SGD generalizes better based on Stochastic Differential Equation (SDE) approximation. The main contributions of this paper include (i) the derivation of an SDE that captures the long-term behavior of Local SGD with a small learning rate, after approaching the manifold of minima, (ii) a comparison between the SDEs of Local SGD and SGD, showing that Local SGD induces a stronger drift term that can result in a stronger effect of regularization, e.g., a faster reduction of sharpness, and (iii) empirical evidence validating that having a small learning rate and long enough training time enables the generalization improvement over SGD, but removing either of the two conditions leads to no improvement. 
Xinran Gu · Kaifeng Lyu · Longbo Huang · Sanjeev Arora 🔗 
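The periodic parameter averaging that defines Local SGD can be sketched in a few lines. This is a minimal illustration only; the toy quadratic objective, noise level, worker count, and step size below are invented for the example, not taken from the paper:

```python
import numpy as np

def local_sgd(grad, workers, steps, sync_every, lr, rng):
    # Each worker runs SGD independently; every `sync_every` steps all
    # workers average their parameters (one communication round).
    params = [np.zeros(2) for _ in range(workers)]
    for t in range(steps):
        for i in range(workers):
            params[i] = params[i] - lr * grad(params[i], rng)
        if (t + 1) % sync_every == 0:
            avg = sum(params) / workers
            params = [avg.copy() for _ in range(workers)]
    return sum(params) / workers

# Toy objective f(x) = 0.5 * ||x - 1||^2 with Gaussian gradient noise.
def noisy_grad(x, rng):
    return (x - 1.0) + 0.1 * rng.standard_normal(x.shape)

rng = np.random.default_rng(0)
x_hat = local_sgd(noisy_grad, workers=4, steps=200, sync_every=8, lr=0.1, rng=rng)
```

Averaging only every few steps keeps communication cheap, while the final iterate still lands close to the minimizer of the toy objective.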


Momentum Extragradient is Optimal for Games with Cross-Shaped Spectrum
(
Poster
)
link
SlidesLive Video
The extragradient method has recently gained a lot of attention, due to its convergence behavior on smooth games. In games, the eigenvalues of the Jacobian of the vector field are distributed on the complex plane, exhibiting more convoluted dynamics compared to minimization. In this work, we take a polynomial-based analysis of the extragradient with momentum for optimizing games with \emph{cross-shaped} spectrum on the complex plane. We show two results: first, the extragradient with momentum exhibits three different modes of convergence based on the hyperparameter setup: when the eigenvalues are distributed $(i)$ on the real line, $(ii)$ both on the real line along with complex conjugates, and $(iii)$ only as complex conjugates. Then, we focus on the case $(ii)$, i.e., when the spectrum of the Jacobian has a \emph{cross-shaped} structure, as observed in training generative adversarial networks. For this problem class, we derive the optimal parameters and show that the extragradient with momentum achieves an accelerated convergence rate.

Junhyung Lyle Kim · Gauthier Gidel · Anastasios Kyrillidis · Fabian Pedregosa 🔗 
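For reference, the base (momentum-free) extragradient update that the abstract builds on is easy to state: extrapolate along the vector field, then update using the field evaluated at the extrapolated point. The bilinear toy game, step size, and iteration count below are invented for illustration; the paper's momentum variant adds a heavy-ball term on top of this scheme:

```python
import numpy as np

def extragradient(F, z0, lr, steps):
    # Plain extragradient (no momentum): an extrapolation step followed
    # by an update step, both querying the vector field F.
    z = np.asarray(z0, dtype=float)
    for _ in range(steps):
        z_half = z - lr * F(z)   # extrapolation
        z = z - lr * F(z_half)   # update at the extrapolated point
    return z

# Bilinear game min_x max_y x*y: F(x, y) = (y, -x), equilibrium at the
# origin. The Jacobian's eigenvalues are purely imaginary, a regime where
# plain gradient descent-ascent diverges but extragradient converges.
F = lambda z: np.array([z[1], -z[0]])
z_star = extragradient(F, z0=[1.0, 1.0], lr=0.2, steps=200)
```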


Stochastic Adaptive Regularization Method with Cubics: A High Probability Complexity Bound
(
Poster
)
link
SlidesLive Video We present a high probability complexity bound for a stochastic adaptive regularization method with cubics, also known as the regularized Newton method. The method makes use of stochastic zeroth-, first- and second-order oracles that satisfy certain accuracy and reliability assumptions. Such oracles have been used in the literature by other adaptive stochastic methods, such as trust region and line search. These oracles capture many settings, such as expected risk minimization, stochastic zeroth-order optimization, and others. In this paper, we give the first high-probability iteration bound for stochastic cubic regularization and show that, just as in the deterministic case, it is superior to other adaptive methods. 
Katya Scheinberg · Miaolan Xie 🔗 


ProxSkip for Stochastic Variational Inequalities: A Federated Learning Algorithm for Provable Communication Acceleration
(
Poster
)
link
SlidesLive Video
Recently Mishchenko et al. proposed and analyzed ProxSkip, a provably efficient method for minimizing the sum of a smooth $(f)$ and an expensive nonsmooth proximable $(R)$ function (i.e. $\min_{x \in \mathbb{R}^d} f(x) + R(x)$). The main advantage of ProxSkip is that, in the federated learning (FL) setting, it provably offers an effective acceleration of communication complexity. This work extends this approach to the more general regularized variational inequality problems (VIP). In particular, we propose the ProxSkip-VIP algorithm, which generalizes the original ProxSkip framework to VIP, and we provide convergence guarantees for a class of structured nonmonotone problems. In the federated learning setting, we explain how our approach achieves acceleration in terms of the communication complexity over existing state-of-the-art FL algorithms.

Siqi Zhang · Nicolas Loizou 🔗 


DIMENSION-REDUCED ADAPTIVE GRADIENT METHOD
(
Poster
)
link
SlidesLive Video
Adaptive gradient methods, such as Adam, have shown faster convergence speed than SGD across various kinds of network models, at the expense of inferior generalization performance. In this work, we propose a Dimension-Reduced Adaptive Gradient Method (DRAG) to eliminate the generalization gap. DRAG makes an elegant combination of SGD and Adam by adopting a trust-region-like framework. We observe that 1) Adam adjusts stepsizes for each gradient coordinate according to some loss curvature, and indeed decomposes the $n$-dimensional gradient into $n$ standard basis directions to search; 2) SGD uniformly scales the gradient for all gradient coordinates and actually has only one descent direction to minimize. Accordingly, DRAG reduces the high degree of freedom of Adam and also improves the flexibility of SGD via optimizing the loss along $k\ (\ll \! n)$ descent directions, e.g. the gradient direction and momentum direction used in this work. Then per iteration, DRAG finds the best stepsizes for the $k$ descent directions by solving a trust-region subproblem whose computational overhead is negligible, since the trust-region subproblem is low-dimensional, e.g. $k=2$ in this work. DRAG is compatible with the common deep learning training pipeline without introducing extra hyperparameters and with negligible extra computation. Experimental results on representative benchmarks testify to the fast convergence speed and also the superior generalization of DRAG.

Jingyang Li · Pan Zhou · Kuangyu Ding · KimChuan Toh · Yinyu Ye 🔗 


Fast Convergence of Greedy 2-Coordinate Updates for Optimizing with an Equality Constraint
(
Poster
)
link
SlidesLive Video In this work, we study the Block Coordinate Descent (BCD) algorithm with a greedy block selection rule for minimizing functions with one linear equality constraint. We show a faster linear convergence rate for BCD with block size 2 (2-BCD) on functions satisfying the Polyak-Lojasiewicz inequality. Our analysis exploits the similarity between the solutions of 2-BCD and equality-constrained steepest descent (SD) in the $\ell_1$-norm. This yields a simple proof. 
Amrutha Varshini Ramesh · Aaron Mishkin · Mark Schmidt 🔗 
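To make the setting concrete: a greedy 2-coordinate step under the constraint $\sum_i x_i = c$ can pick the coordinates with the largest and smallest partial derivatives and move mass between them, which leaves the constraint intact. The sketch below is a simplified variant of this idea on an invented toy quadratic (the step rule and problem data are for illustration, not the paper's exact algorithm):

```python
import numpy as np

def greedy_2bcd(grad, x0, lr, steps):
    # Greedy 2-coordinate descent for min f(x) s.t. sum(x) fixed: move
    # mass from the coordinate with the largest partial derivative to
    # the one with the smallest, so sum(x) never changes.
    x = np.asarray(x0, dtype=float)
    for _ in range(steps):
        g = grad(x)
        i, j = int(np.argmax(g)), int(np.argmin(g))
        delta = lr * (g[i] - g[j])
        x[i] -= delta
        x[j] += delta
    return x

# Toy problem: min 0.5*||x - t||^2 s.t. sum(x) = 6. Since sum(t) = 6,
# the constrained minimizer is t itself.
t = np.array([3.0, 1.0, 2.0])
x_hat = greedy_2bcd(lambda x: x - t, x0=np.array([2.0, 2.0, 2.0]),
                    lr=0.4, steps=100)
```

Because each update subtracts and adds the same amount on two coordinates, every iterate stays exactly on the constraint hyperplane.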


Quadratic minimization: from conjugate gradients to an adaptive heavyball method with Polyak stepsizes
(
Poster
)
link
SlidesLive Video In this work, we propose an adaptive variation on the classical heavy-ball method for convex quadratic minimization. The adaptivity crucially relies on so-called ``Polyak stepsizes'', which consists in using the knowledge of the optimal value of the optimization problem at hand instead of problem parameters such as a few eigenvalues of the Hessian of the problem. This method happens to also be equivalent to a variation of the classical conjugate gradient method, and thereby inherits many of its attractive features, including its finite-time convergence, instance optimality, and its worst-case convergence rates. The classical gradient method with Polyak stepsizes is known to behave very well in situations in which it can be used, and the question of whether incorporating momentum in this method is possible and can improve the method itself appeared to be open. We provide a definitive answer to this question for minimizing convex quadratic functions, an arguably necessary first step for developing such methods in more general setups. 
Baptiste Goujaud · Adrien Taylor · Aymeric Dieuleveut 🔗 
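The momentum-free baseline the abstract refers to, gradient descent with Polyak stepsizes, uses the step length $(f(x_k) - f_\star)/\|\nabla f(x_k)\|^2$, so it needs the optimal value $f_\star$ rather than eigenvalue information. Below is a minimal sketch of that classical baseline on an invented diagonal quadratic (not the paper's adaptive heavy-ball method):

```python
import numpy as np

def polyak_gd(f, grad, x0, f_star, steps):
    # Gradient descent with the Polyak stepsize
    # (f(x) - f_star) / ||grad f(x)||^2: only the optimal value
    # f_star is required, not the Hessian spectrum.
    x = np.asarray(x0, dtype=float)
    for _ in range(steps):
        g = grad(x)
        gnorm2 = float(g @ g)
        if gnorm2 == 0.0:   # already at the minimizer
            break
        x = x - ((f(x) - f_star) / gnorm2) * g
    return x

# Convex quadratic f(x) = 0.5 x^T A x with A = diag(1, 2); f_star = 0.
A = np.diag([1.0, 2.0])
f = lambda x: 0.5 * x @ A @ x
x_hat = polyak_gd(f, lambda x: A @ x, x0=np.array([1.0, 1.0]),
                  f_star=0.0, steps=100)
```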


NCVX: A General-Purpose Optimization Solver for Constrained Machine and Deep Learning
(
Poster
)
link
SlidesLive Video Imposing explicit constraints is new and increasingly trendy in deep learning, stimulated by, e.g., trustworthy AI that performs robust optimization over complicated perturbation sets and scientific applications that need to respect physical laws and constraints. However, it can be hard to reliably solve constrained deep learning problems without optimization expertise. Existing deep learning frameworks do not admit constraints. General-purpose optimization packages can handle constraints but do not perform auto-differentiation and have trouble dealing with nonsmoothness. In this paper, we introduce a new software package called NCVX, whose initial release contains the solver PyGRANSO, a PyTorch-enabled general-purpose optimization package for constrained machine and deep learning problems, the first of its kind. NCVX inherits auto-differentiation, GPU acceleration, and tensor variables from PyTorch, and is built on freely available and widely used open-source frameworks. NCVX is available at https://ncvx.org, with detailed documentation and numerous examples from machine/deep learning and other fields. 
Buyun Liang · Tim Mitchell · Ju Sun 🔗 


A Better Way to Decay: Proximal Gradient Training Algorithms for Neural Nets
(
Poster
)
link
SlidesLive Video
Weight decay is one of the most widely used forms of regularization in deep learning, and has been shown to improve generalization and robustness. The optimization objective driving weight decay is a sum of losses plus a term proportional to the sum of squared weights. This paper argues that stochastic gradient descent (SGD) may be an inefficient algorithm for this objective. For neural networks with ReLU activations, solutions to the weight decay objective are equivalent to those of a different objective in which the regularization term is instead a sum of products of $\ell_2$ (not squared) norms of the input and output weights associated with each ReLU. This alternative \emph{(and effectively equivalent)} regularization suggests a novel proximal gradient algorithm for network training. Theory and experiments support the new training approach, showing that it can converge much faster to the \emph{sparse} solutions it shares with standard weight decay training.

Liu Yang · Jifan Zhang · Joseph Shenouda · Dimitris Papailiopoulos · Kangwook Lee · Robert Nowak 🔗 
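To make the proximal-gradient idea concrete: the proximal operator of a non-squared $\ell_2$ norm shrinks a whole weight block toward zero and sets it exactly to zero once its norm falls below the threshold, which is the mechanism that produces sparse solutions. The sketch below shows this generic operator and one proximal-gradient step on an invented toy objective; it is not the authors' exact training algorithm:

```python
import numpy as np

def prox_group_l2(w, lam):
    # Proximal operator of lam * ||w||_2 (not squared): block-wise
    # soft-thresholding, which zeroes the block when ||w|| <= lam.
    norm = np.linalg.norm(w)
    if norm <= lam:
        return np.zeros_like(w)
    return (1.0 - lam / norm) * w

# Shrinkage on a block with norm 5 and threshold 1: scaled by 0.8.
shrunk = prox_group_l2(np.array([3.0, 4.0]), 1.0)

# One proximal-gradient step on 0.5*||w - t||^2 + lam*||w||_2, starting
# from w = 0; since ||t|| = 0.5 < lam, the block stays exactly at zero.
t = np.array([0.3, -0.4])
lr, lam = 1.0, 0.6
w = prox_group_l2(np.zeros(2) - lr * (np.zeros(2) - t), lr * lam)
```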


Policy gradient finds global optimum of nearly linear-quadratic control systems
(
Poster
)
link
SlidesLive Video We explore reinforcement learning methods for finding the optimal policy in nearly linear-quadratic control systems. In particular, we consider a dynamic system composed of the summation of a linear and a nonlinear component, which is governed by a policy with the same structure. Assuming that the nonlinear part consists of kernels with small Lipschitz coefficients, we characterize the optimization landscape of the cost function. While the resulting landscape is generally nonconvex, we show local strong convexity and smoothness of the cost function around the global optimizer. In addition, we design a policy gradient algorithm with a carefully chosen initialization and prove that the algorithm is guaranteed to converge to the globally optimal policy at a linear rate. 
Yinbin Han · Meisam Razaviyayn · Renyuan Xu 🔗 


Relating Regularization and Generalization through the Intrinsic Dimension of Activations
(
Poster
)
link
SlidesLive Video Given a pair of models with similar training set performance, it is natural to assume that the model that possesses simpler internal representations would exhibit better generalization. In this work, we provide empirical evidence for this intuition through an analysis of the intrinsic dimension (ID) of model activations, which can be thought of as the minimal number of factors of variation in the model's representation of the data. First, we show that common regularization techniques uniformly decrease the last-layer ID (LLID) of validation set activations for image classification models and show how this strongly affects generalization performance. We also investigate how excessive regularization decreases a model's ability to extract features from data in earlier layers, leading to a negative effect on validation accuracy even while LLID continues to decrease and training accuracy remains near-perfect. Finally, we examine the LLID over the course of training of models that exhibit grokking. We observe that well after training accuracy saturates, when models "grok" and validation accuracy suddenly improves from random to perfect, there is a co-occurring sudden drop in LLID, thus providing more insight into the dynamics of sudden generalization. 
Bradley Brown · Jordan Juravsky · Anthony Caterini · Gabriel LoaizaGanem 🔗 


The Importance of Temperature in Multi-Task Optimization
(
Poster
)
link
SlidesLive Video The promise of multi-task learning is that optimizing a single model on multiple related tasks will lead to a better solution for all tasks than independently trained models. In practice, optimization difficulties, such as conflicting gradients, can result in negative transfer, where multi-task models perform worse than single-task models. In this work, we identify the optimization temperature, the ratio of learning rate to batch size, as a key factor in negative transfer. Temperature controls the level of noise in each optimization step, which prior work has shown to have a strong correlation with generalization. We demonstrate that, in some multi-task settings, negative transfer may arise due to a poorly set optimization temperature, rather than inherently high task conflict. The implication of this finding is that in some settings, SGD with a carefully controlled temperature achieves comparable, and in some cases superior, performance to that of specialized optimization procedures such as PCGrad, MGDA, and GradNorm. In particular, our results suggest that the significant additional computational burden of these specialized methods may not always be necessary. Finally, we observe a conflict between the optimal temperatures of different tasks in a multi-task objective, with different levels of noise promoting better generalization for different tasks. Our work suggests the need for novel multi-task optimization methods which consider individual task noise levels, and their impact on generalization. 
David Mueller · Mark Dredze · Nicholas Andrews 🔗 


Network Pruning at Scale: A Discrete Optimization Approach
(
Poster
)
link
SlidesLive Video
Due to the ever-growing size of neural network models, there has been an emerging interest in compressing (i.e., pruning) neural networks by sparsifying the weights in a pretrained neural network, while maintaining the performance of the dense model as much as possible. In this work, we focus on a neural network pruning framework based on local quadratic models of the loss function. We present an optimization-based approach with an $\ell_0$-regression formulation, and propose novel algorithms to obtain good solutions to the combinatorial optimization problem. In practice, our basic (single-stage) approach, based on one local quadratic model approximation, is up to $10^3$ times faster than existing methods while achieving similar accuracy. We also propose a multi-stage method that outperforms other methods in terms of accuracy for a given sparsity constraint while remaining computationally efficient. In particular, our approach results in a 98\% sparse (i.e., 98\% of the weights in the dense model are set to zero) MLPNet with 90\% test accuracy (i.e., a 3\% reduction in test accuracy compared to the dense model), which is an improvement over the previous best accuracy (55\%).

Wenyu Chen · Riade Benbaki · Xiang Meng · Rahul Mazumder 🔗 


A Novel Stochastic Gradient Descent Algorithm for Learning Principal Subspaces
(
Poster
)
link
In this paper, we derive an algorithm that learns a principal subspace from sample entries, can be applied when the approximate subspace is represented by a neural network, and hence can be scaled to datasets with an effectively infinite number of rows and columns. Our method consists in defining a loss function whose minimizer is the desired principal subspace, and constructing a gradient estimate of this loss whose bias can be controlled. 
Charline Le Lan · Joshua Greaves · Jesse Farebrother · Mark Rowland · Fabian Pedregosa · Rishabh Agarwal · Marc Bellemare 🔗 


Nonsmooth Composite Nonconvex-Concave Minimax Optimization
(
Poster
)
SlidesLive Video
Nonconvex-concave minimax optimization has received intense interest in machine learning, including learning with robustness to data distribution, learning with non-decomposable loss, and adversarial learning, to name a few. Nevertheless, most existing works focus on the gradient descent-ascent (GDA) variants that can only be applied in smooth settings. In this paper, we consider a family of minimax problems whose objective function enjoys the nonsmooth composite structure in the variable of minimization and is concave in the variables of maximization. By fully exploiting the composite structure, we propose a smoothed proximal linear descent ascent (\textit{smoothed} PLDA) algorithm and further establish its $\mathcal{O}(\epsilon^{-4})$ iteration complexity, which matches that of smoothed GDA~\cite{zhang2020single} under smooth settings. Moreover, under the mild assumption that the objective function satisfies the one-sided Kurdyka-Lojasiewicz condition with exponent $\theta \in (0,1)$, we can further improve the iteration complexity to $\mathcal{O}(\epsilon^{-2 \max\{2 \theta, 1\}})$. To the best of our knowledge, this is the first provably efficient algorithm for nonsmooth nonconvex-concave problems that can achieve the optimal iteration complexity $\mathcal{O}(\epsilon^{-2})$ if $\theta \in (0, 1/2]$.

Jiajin Li · Linglingzhi Zhu · Anthony ManCho So 🔗 


Decentralized Stochastic Optimization with Client Sampling
(
Poster
)
link
SlidesLive Video
Decentralized optimization is a key setting toward enabling data privacy and on-device learning over networks. Existing research primarily focuses on distributing the objective function across $n$ nodes/clients, lagging behind the real-world challenges such as i) node availability (not all $n$ nodes are always available during the optimization) and ii) slow information propagation (caused by a large number of nodes $n$). In this work, we study Decentralized Stochastic Gradient Descent (D-SGD) with node subsampling, i.e. when only $s~(s \leq n)$ nodes are randomly sampled out of $n$ nodes per iteration. We provide the theoretical convergence rates in smooth (convex and nonconvex) problems with heterogeneous (non-identically distributed data) functions. Our theoretical results capture the effect of node subsampling and the choice of the topology on the sampled nodes, through a metric termed \emph{the expected consensus rate}. On a number of common topologies, including ring and torus, we theoretically and empirically demonstrate the effectiveness of such a metric.

Ziwei Liu · Anastasiia Koloskova · Martin Jaggi · Tao Lin 🔗 


Escaping from Moderately Constrained Saddles
(
Poster
)
link
We give polynomial time algorithms for escaping from high-dimensional saddle points under a moderate number of constraints. Given gradient access to a smooth function $f \colon \mathbb R^d \to \mathbb R$, we show that (noisy) gradient descent methods can escape from saddle points under a logarithmic number of inequality constraints. This constitutes progress (without reliance on NP-oracles or altering the definitions to only account for certain constraints) on the main open question of the breakthrough work of Ge et al., who showed an analogous result for unconstrained and equality-constrained problems. Our results hold for both regular and stochastic gradient descent.

Dmitrii Avdiukhin · Grigory Yaroslavtsev 🔗 


Uniform Convergence and Generalization for Nonconvex Stochastic Minimax Problems
(
Poster
)
link
SlidesLive Video
This paper studies the uniform convergence and generalization bounds for nonconvex-(strongly)-concave (NC-SC/NC-C) stochastic minimax optimization. We first establish the uniform convergence between the empirical minimax problem and the population minimax problem, and show the $\tilde{\mathcal{O}}(d\kappa^2\epsilon^{-2})$ and $\tilde{\mathcal{O}}(d\epsilon^{-4})$ sample complexities respectively for the NC-SC and NC-C settings, where $d$ is the dimension and $\kappa$ is the condition number. To the best of our knowledge, this is the first uniform convergence result measured by the first-order stationarity in the stochastic minimax optimization literature.

Siqi Zhang · Yifan Hu · Liang Zhang · Niao He 🔗 


Semi-Random Sparse Recovery in Nearly-Linear Time
(
Poster
)
link
SlidesLive Video
Sparse recovery is one of the most fundamental and well-studied inverse problems. Standard statistical formulations of the problem are provably solved by general convex programming techniques and more practical, fast (nearly-linear time) iterative methods. However, these latter "fast algorithms" have previously been observed to be brittle in various real-world settings. We investigate the brittleness of fast sparse recovery algorithms to generative model changes through the lens of studying their robustness to a "helpful" semi-random adversary, a framework which tests whether an algorithm overfits to input assumptions. We consider the following basic model: let $\mathbf{A} \in \mathbb{R}^{n \times d}$ be a measurement matrix which contains an unknown subset of rows $\mathbf{G} \in \mathbb{R}^{m \times d}$ which are bounded and satisfy the restricted isometry property (RIP), but is otherwise arbitrary. Letting $x^\star \in \mathbb{R}^d$ be $s$-sparse, and given either exact measurements $b = \mathbf{A} x^\star$ or noisy measurements $b = \mathbf{A} x^\star + \xi$, we design algorithms recovering $x^\star$ information-theoretically optimally in nearly-linear time. We extend our algorithm to hold for weaker generative models, relaxing our planted RIP row subset assumption to a natural weighted variant, and show that our method's guarantees naturally interpolate the quality of the measurement matrix to, in some parameter regimes, run in sublinear time. Our approach differs from that of prior fast iterative methods with provable guarantees under semi-random generative models (Cheng-Ge '18, Li et al. '20), which typically separate the problem of learning the planted instance from the estimation problem, i.e. they attempt to first learn the planted "good" instance (in our case, $\mathbf{G}$). However, natural conditions which make sparse recovery tractable, such as RIP, are NP-hard to verify, and hence first learning a sufficient row reweighting appears challenging. 
We eschew this approach and design a new iterative method, tailored to the geometry of sparse recovery, which is provably robust to our semirandom model. We hope our approach opens the door to new robust, efficient algorithms for other natural statistical inverse problems.

Jonathan Kelner · Jerry Li · Allen Liu · Aaron Sidford · Kevin Tian 🔗 


Generalization of Decentralized Gradient Descent with Separable Data
(
Poster
)
link
Decentralized learning offers privacy and communication efficiency when data are naturally distributed among agents communicating over an underlying graph. Motivated by overparameterized learning settings, in which models are trained to zero training loss, we study algorithmic and generalization properties of decentralized learning with gradient descent on separable data. Specifically, for decentralized gradient descent (DGD) and a variety of loss functions that asymptote to zero at infinity (including exponential and logistic losses), we derive novel finite-time generalization bounds. This complements a long line of recent work that studies the generalization performance and the implicit bias of gradient descent over separable data, but has thus far been limited to centralized learning scenarios. Notably, our generalization bounds match in order their centralized counterparts. Critical behind this, and of independent interest, is establishing novel bounds on the training loss and the rate-of-consensus of DGD for a class of self-bounded losses. Finally, we conduct numerical experiments which corroborate our theoretical results. 
Hossein Taheri · Christos Thrampoulidis 🔗 


Gradient dynamics of single-neuron autoencoders on orthogonal data
(
Poster
)
link
SlidesLive Video In this work we investigate the dynamics of (stochastic) gradient descent when training a single-neuron ReLU autoencoder on orthogonal inputs. We show that for this nonconvex problem there exists a manifold of global minima, all with the same maximum Hessian eigenvalue, and that gradient descent reaches a particular global minimum when initialized randomly. Interestingly, which minimum is reached depends heavily on the batch size. For full-batch gradient descent, the directions of the neuron that are initially positively correlated with the data are merely rescaled uniformly, hence in high dimensions the learned neuron is a near uniform mixture of these directions. On the other hand, with batch size one the neuron exactly aligns with a single such direction, showing that when using a small batch size a qualitatively different type of ``feature selection'' occurs. 
Nikhil Ghosh · Spencer Frei · Wooseok Ha · Bin Yu 🔗 


A Variable-Coefficient Nuclear Norm Penalty for Low Rank Inference
(
Poster
)
link
SlidesLive Video
Low rank structure is expected in many applications, so it is often desirable to be able to specify cost functions that induce low rank. A common approach is to augment the cost with a penalty function approximating the rank function, such as the nuclear norm, which is given by the $\ell_1$ norm of the matrix's singular values. This has the advantage of being a convex function, but it biases matrix entries towards zero. On the other hand, nonconvex approximations to the rank function can make better surrogates but invariably introduce additional hyperparameters. In this article, we instead study a weighted nuclear norm approach with learnable weights, which provides the behavior of nonconvex penalties without introducing any additional hyperparameters. This approach can also benefit from the fast proximal methods which make nuclear norm approaches scalable. We demonstrate the potential of this technique by comparing it against the standard nuclear norm approach on synthetic and realistic matrix denoising and completion problems. We also outline the future work necessary to deploy this algorithm to large scale problems.

Nathan Wycoff · Ali Arab · Lisa Singh 🔗 
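The fast proximal methods mentioned in the abstract rest on the fact that the proximal operator of the (unweighted) nuclear norm is singular value soft-thresholding. Below is a minimal sketch of that standard building block on an invented rank-1 denoising example; the paper's variable-coefficient penalty would instead apply learned per-singular-value weights:

```python
import numpy as np

def svt(X, tau):
    # Singular value thresholding: the prox of tau * nuclear norm.
    # Soft-threshold the singular values and reassemble the matrix.
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    return U @ np.diag(np.maximum(s - tau, 0.0)) @ Vt

# Denoise a noisy rank-1 matrix with a single prox step: the threshold
# removes the small singular values contributed by the noise, leaving a
# rank-1 estimate.
rng = np.random.default_rng(0)
low_rank = np.outer(rng.standard_normal(20), rng.standard_normal(15))
noisy = low_rank + 0.1 * rng.standard_normal((20, 15))
denoised = svt(noisy, tau=2.0)
```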


Exact Gradient Computation for Spiking Neural Networks
(
Poster
)
link
SlidesLive Video Spiking neural networks (SNNs) have recently emerged as an alternative to traditional neural networks, holding promise for energy efficiency benefits. However, the classic backpropagation algorithm for training traditional networks has been notoriously difficult to apply to SNNs due to the hard-thresholding and discontinuities at spike times. Therefore, a large majority of prior work believes that exact gradients for SNNs w.r.t. their weights do not exist and has focused on approximation methods to produce surrogate gradients. In this paper, (1) by applying the implicit function theorem to SNNs at the discrete spike times, we prove that, albeit being non-differentiable in time, SNNs have well-defined gradients w.r.t. their weights, and (2) we propose a novel training algorithm, called \emph{forward propagation} (FP), that computes exact gradients for SNNs. Our derivation of FP in this paper provides insights on why other related algorithms, such as Hebbian learning and the recently proposed surrogate gradient methods, may perform well. 
Jane Lee · Saeid Haghighatshoar · Amin Karbasi 🔗 


Linear Convergence Analysis of Neural Collapse with Unconstrained Features
(
Poster
)
link
SlidesLive Video In this work, we study the recently discovered neural collapse (NC) phenomenon, which is prevalent in training overparameterized deep neural networks for classification tasks. Existing work has shown that any optimal solution of the trained problem for classification tasks is an NC solution and has a benign landscape under the unconstrained feature model. However, these results do not provide an answer to the question of how quickly gradient descent can find an NC solution. To answer this question, under the unconstrained feature model we prove an error bound property of the trained loss, which refers to the inequality that bounds the distance of a point in the optimal solution set by the norm of its gradient. Using this error bound, we can show linear convergence of gradient descent for finding an NC solution. 
Peng Wang · Huikang Liu · Can Yaras · Laura Balzano · Qing Qu 🔗 


The Solution Path of the Group Lasso
(
Poster
)
link
SlidesLive Video We prove continuity of the solution path for the group lasso, a popular method of computing group-sparse models. Unlike the more classical lasso method, the group lasso solution path is nonlinear and cannot be determined in closed form. To circumvent this, we first characterize the group lasso solution set and then show how to construct an implicit function for the min-norm path. We prove our implicit representation is continuous almost everywhere, and extend this to continuity everywhere when the group lasso solution is unique. Our work can be viewed as extending solution path analyses from the lasso setting to the group lasso, and implies that grid search is a sensible approach to hyperparameter selection. Our results also have applications to convex reformulations of neural networks and so are deeply connected to solution paths for shallow neural networks. 
Aaron Mishkin · Mert Pilanci 🔗 


Conditional gradient-based method for bilevel optimization with convex lower-level problem
(
Poster
)
link
SlidesLive Video
In this paper, we study simple bilevel optimization problems, where we minimize a smooth objective function over the optimal solution set of another convex constrained optimization problem. Several iterative methods have been developed for tackling this class of problems. Alas, their convergence guarantees are not satisfactory as they are either asymptotic for the upper-level objective, or the convergence rates are slow and suboptimal. To address this issue, in this paper, we introduce a conditional gradient-based (CG-based) method to solve the considered problem. The main idea is to locally approximate the solution set of the lower-level problem via a cutting plane, and then run a CG-type update to decrease the upper-level objective. When the upper-level objective is convex, we show that our method requires ${\mathcal{O}}(\max\{1/\epsilon_f,1/\epsilon_g\})$ iterations to find a solution that is $\epsilon_f$-optimal for the upper-level objective and $\epsilon_g$-optimal for the lower-level objective. Moreover, when the upper-level objective is nonconvex, our method requires ${\mathcal{O}}(\max\{1/\epsilon_f^2,1/(\epsilon_f\epsilon_g)\})$ iterations to find an $(\epsilon_f,\epsilon_g)$-optimal solution. To the best of our knowledge, our method achieves the best-known iteration complexity for the considered bilevel problem.

Ruichen Jiang · Nazanin Abolfazli · Aryan Mokhtari · Erfan Yazdandoost Hamedani 🔗 
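For reference, the CG-type update at the heart of the method is the classical Frank-Wolfe iteration; the minimal sketch below runs it over the probability simplex and omits the paper's cutting-plane approximation of the lower-level solution set.

```python
import numpy as np

def frank_wolfe_simplex(grad_f, x0, steps=500):
    """Vanilla conditional-gradient (Frank-Wolfe) iteration over the probability
    simplex. Each step calls a linear minimization oracle (here: pick the vertex
    with the smallest gradient coordinate) and moves toward it."""
    x = x0.copy()
    for t in range(steps):
        g = grad_f(x)
        s = np.zeros_like(x)
        s[np.argmin(g)] = 1.0            # linear minimization oracle over the simplex
        x += 2.0 / (t + 2.0) * (s - x)   # standard diminishing step size
    return x
```

Because each update is a convex combination of simplex vertices, feasibility is maintained without any projection step.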


Sufficient conditions for non-asymptotic convergence of Riemannian optimization methods
(
Poster
)
link
SlidesLive Video Motivated by energy-based analyses for descent methods in the Euclidean setting, we investigate a generalisation of such analyses for descent methods over Riemannian manifolds. In doing so, we find that it is possible to derive curvature-free guarantees for such descent methods. This also enables us to give the first known guarantees for a Riemannian cubic-regularised Newton algorithm over g-convex functions, which extends the guarantees by Agarwal et al. for an adaptive Riemannian cubic-regularised Newton algorithm over general nonconvex functions. This analysis motivates us to study acceleration of Riemannian gradient descent in the g-convex setting, and we improve on an existing result by Alimisis et al., albeit with a curvature-dependent rate. Finally, extending the analysis by Ahn and Sra, we attempt to provide some sufficient conditions for the acceleration of Riemannian descent methods in the strongly geodesically convex setting. 
Vishwak Srinivasan · Ashia Wilson 🔗 


A Neural Tangent Kernel Perspective on Function-Space Regularization in Neural Networks
(
Poster
)
link
SlidesLive Video Loss regularization can help reduce the gap between training and test error by systematically limiting model complexity. Popular regularization techniques such as L2 weight regularization act directly on the network parameters, but do not explicitly take into account how the interplay between the parameters and the network architecture may affect the induced predictive functions. To address this shortcoming, we propose a simple technique for effective function-space regularization. Drawing on the result that fully-trained wide multilayer perceptrons are equivalent to kernel regression under the Neural Tangent Kernel (NTK), we propose to approximate the norm of neural network functions by the reproducing kernel Hilbert space norm under the NTK and use it as a function-space regularizer. We prove that neural networks trained using this regularizer are arbitrarily close to kernel ridge regression solutions under the NTK. Furthermore, we provide a generalization error bound under the proposed regularizer and empirically demonstrate improved generalization and state-of-the-art performance on downstream tasks where effective regularization on the induced space of functions is essential. 
Zonghao Chen · Xupeng Shi · Tim G. J. Rudner · Qixuan Feng · Weizhong Zhang · Tong Zhang 🔗 


Annealed Training for Combinatorial Optimization on Graphs
(
Poster
)
link
SlidesLive Video Learning neural networks for combinatorial optimization (CO) problems is notoriously difficult: the hardness of CO problems hinders collecting solutions for supervised learning, and training gets trapped easily at local optima. We propose a simple but effective unsupervised annealed training framework for CO problems in this work. In particular, we transform CO problems into unbiased energy-based models (EBMs). We carefully select the penalty terms to make the EBMs as smooth as possible. Then we train graph neural networks to approximate the EBMs, and we introduce an annealed loss function to prevent the training from being stuck at local optima near the initialization. An experimental evaluation demonstrates that our annealed training framework obtains substantial improvements. On four types of CO problems, our method achieves performance substantially better than other unsupervised neural methods on both synthetic and real-world graphs. 
Haoran Sun · Etash Guha · Hanjun Dai 🔗 


Clairvoyant Regret Minimization: Equivalence with Nemirovski’s Conceptual Prox Method and Extension to General Convex Games
(
Poster
)
link
SlidesLive Video A recent paper by Piliouras et al. introduces an uncoupled learning algorithm for normal-form games called Clairvoyant MWU (CMWU). In this paper we show that CMWU is equivalent to the conceptual prox method described by Nemirovski. This connection immediately shows that it is possible to extend the CMWU algorithm to any convex game, a question left open by Piliouras et al. We call the resulting algorithm, again equivalent to the conceptual prox method, Clairvoyant OMD. At the same time, we show that our analysis yields an improved regret bound compared to the original bound by Piliouras et al., in that the regret of CMWU scales only with the square root of the number of players, rather than with the number of players themselves. 
Gabriele Farina · Christian Kroer · ChungWei Lee · Haipeng Luo 🔗 


Parameter-Free Dual Averaging: Optimizing Lipschitz Functions in a Single Pass
(
Poster
)
link
SlidesLive Video
Both gradient descent and dual averaging for convex Lipschitz functions have convergence rates that are highly dependent on the choice of learning rate. Even when the Lipschitz constant is known, setting the learning rate to achieve the optimal convergence rate requires knowing a bound $D$ on the distance from the initial point to the solution set. A number of approaches are known that relax this requirement, but they either require line searches, restarting (hyperparameter grid search), or do not derive from the gradient descent or dual averaging frameworks (coin betting). In this work we describe a single-pass method, with no backtracking or line searches, derived from dual averaging, which does not require knowledge of $D$ yet asymptotically achieves the optimal rate of convergence for the complexity class of convex Lipschitz functions.

Aaron Defazio · Konstantin Mishchenko 🔗 
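For contrast, a textbook dual-averaging loop with the classical step-size schedule is sketched below; note how the schedule needs the distance bound $D$, which is exactly the knowledge the paper's method dispenses with. Names and constants are illustrative.

```python
import numpy as np

def dual_averaging(grad, x0, G, D, T):
    """Classical dual averaging for a G-Lipschitz convex function: accumulate
    (sub)gradients and step from the initial point with eta_t = D / (G * sqrt(t)).
    Returns the running average of the iterates."""
    x = x0.copy()
    s = np.zeros_like(x0)    # accumulated subgradients
    avg = np.zeros_like(x0)  # running average of the iterates
    for t in range(1, T + 1):
        s += grad(x)
        eta = D / (G * np.sqrt(t))
        x = x0 - eta * s
        avg += (x - avg) / t
    return avg
```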


Differentially Private Adaptive Optimization with Delayed Preconditioners
(
Poster
)
link
SlidesLive Video Privacy costs may negate the benefits of using adaptive optimizers in differentially private model training. Prior works typically address this issue by using auxiliary information (e.g., public data) to boost the effectiveness of adaptive optimization. In this work, we explore techniques to estimate and efficiently adapt to gradient geometry in private adaptive optimization without auxiliary data. Motivated by the observation that adaptive methods can tolerate stale preconditioners, we propose differentially private adaptive training with delayed preconditioners (DP^2), a simple method that constructs delayed but less noisy preconditioners to better realize the benefits of adaptivity. Theoretically, we provide convergence guarantees for our method for both convex and nonconvex problems, and analyze trade-offs between delay and privacy noise reduction. Empirically, we explore DP^2 across several real-world datasets, demonstrating that it can improve convergence speed by as much as 4× relative to non-adaptive baselines and match the performance of state-of-the-art optimization methods that require auxiliary data. 
Tian Li · Manzil Zaheer · Ken Liu · Sashank Reddi · H. Brendan McMahan · Virginia Smith 🔗 
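A toy sketch of the delayed-preconditioner idea (not the paper's exact DP^2 algorithm; the clipping threshold, noise scale, and delay below are illustrative): clipped, noised gradients are accumulated, and the Adagrad-style preconditioner is refreshed only every few steps, so it is built from averaged and hence less noisy statistics.

```python
import numpy as np

rng = np.random.default_rng(0)

def dp2_sketch(grad, x0, steps=300, lr=0.1, clip=1.0, sigma=0.1, delay=20):
    """Private adaptive training with a delayed preconditioner (illustrative)."""
    x = x0.copy()
    acc = np.zeros_like(x0)        # running sum of squared noisy gradients
    precond = np.ones_like(x0)     # stale preconditioner, refreshed every `delay`
    for t in range(1, steps + 1):
        g = grad(x)
        g = g / max(1.0, np.linalg.norm(g) / clip)    # clip to norm <= clip
        g = g + sigma * rng.standard_normal(g.shape)  # privatize with Gaussian noise
        acc += g ** 2
        if t % delay == 0:                            # delayed refresh from averages
            precond = np.sqrt(acc / t) + 1e-8
        x -= lr * g / precond
    return x
```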


Differentially Private Adaptive Optimization with Delayed Preconditioners
(
Oral
)
link
Privacy costs may negate the benefits of using adaptive optimizers in differentially private model training. Prior works typically address this issue by using auxiliary information (e.g., public data) to boost the effectiveness of adaptive optimization. In this work, we explore techniques to estimate and efficiently adapt to gradient geometry in private adaptive optimization without auxiliary data. Motivated by the observation that adaptive methods can tolerate stale preconditioners, we propose differentially private adaptive training with delayed preconditioners (DP^2), a simple method that constructs delayed but less noisy preconditioners to better realize the benefits of adaptivity. Theoretically, we provide convergence guarantees for our method for both convex and nonconvex problems, and analyze trade-offs between delay and privacy noise reduction. Empirically, we explore DP^2 across several real-world datasets, demonstrating that it can improve convergence speed by as much as 4× relative to non-adaptive baselines and match the performance of state-of-the-art optimization methods that require auxiliary data. 
Tian Li · Manzil Zaheer · Ken Liu · Sashank Reddi · H. Brendan McMahan · Virginia Smith 🔗 


A Lightspeed Linear Program Solver for Personalized Recommendation with Diversity Constraints
(
Poster
)
link
SlidesLive Video We study a structured linear program (LP) that emerges from the need to rank candidates or items in personalized recommender systems. Since the candidate set is only known in real time, the LP also needs to be solved in real time. Latency and user experience are major considerations, requiring the LP to be solved within just a few milliseconds. Although typical instances of the problem are not very large in size, this stringent time limit is beyond the capability of most existing commercial LP solvers, which can take 20 milliseconds or more to find a solution. Thus, reliable methods that address the real-world complication of latency become necessary. In this paper, we propose a fast specialized LP solver for a structured problem with diversity constraints. Our method solves the dual problem, making use of the piecewise affine structure of the dual objective function, with an additional screening technique that helps reduce the dimensionality of the problem as the algorithm progresses. Experiments reveal that our method can solve the problem within roughly 1 millisecond, yielding a 20x improvement in speed over the most performant standard LP solvers. This speedup can help improve the quality of recommendations without affecting user experience, highlighting how optimization can provide solid orthogonal value to machine-learned recommender systems. 
Miao Cheng · Haoyue Wang · Aman Gupta · Rahul Mazumder · Sathiya Selvaraj · Kinjal Basu 🔗 


adaStar: A Method for Adapting to Interpolation
(
Poster
)
link
Stochastic convex optimization methods are much faster at minimizing interpolation problems—problems where all sample losses share a common minimizer—than noninterpolating problems. However, stochastic gradient methods need to use step sizes tailored for the interpolation setting, which are suboptimal for noninterpolating problems, to attain these fast rates. This is problematic because verifying whether a problem is interpolating, without minimizing it, is difficult. Moreover, because interpolation is not a stable property—the addition of a single datapoint can transform an interpolating dataset into a noninterpolating one—we would like our methods to get the fast interpolation rate when they can while being robust to these perturbations. We address these problems by presenting an adaptive stochastic gradient method, termed adaStar, which attains the optimal, fast rate on smooth interpolation problems (up to log factors) and gracefully degrades with the minimal objective value for noninterpolating problems. We use this method as a building block to construct another stochastic gradient method, termed adaStarG, which adapts to interpolation and growth conditions, getting even faster rates. 
Gary Cheng · John Duchi 🔗 


PyEPO: A PyTorch-based End-to-End Predict-then-Optimize Library with Linear Objective Function
(
Poster
)
link
SlidesLive Video In many practical settings, some parameters of an optimization problem may be a priori unknown but can be estimated from historical data. Recently, end-to-end predict-then-optimize has emerged as an attractive alternative to the two-stage approach of separately fitting a predictive model for the unknown parameters, then optimizing. In this work, we present the PyEPO package, a PyTorch-based end-to-end predict-then-optimize library in Python for linear and integer programming. It provides two base algorithms: the first is based on the convex surrogate loss function from the seminal work of Elmachtoub & Grigas (2021), and the second is based on the differentiable black-box solver approach of Vlastelica et al. (2019). PyEPO provides a simple interface for the definition of new optimization problems, the implementation of state-of-the-art predict-then-optimize training algorithms, the use of custom neural network architectures, and the comparison of end-to-end approaches with the two-stage approach. 
Bo Tang · Elias Khalil 🔗 


Accelerating Perturbed Stochastic Iterates in Asynchronous Lock-Free Optimization
(
Poster
)
link
SlidesLive Video We show that stochastic acceleration can be achieved under the perturbed iterate framework (Mania et al., 2017) in asynchronous lock-free optimization, which leads to the optimal incremental gradient complexity for finite-sum objectives. We prove that our new accelerated method requires the same linear speedup condition as existing non-accelerated methods. Our key algorithmic discovery is a new accelerated SVRG variant with sparse updates. Empirical results are presented to verify our theoretical findings. 
Kaiwen Zhou · Anthony ManCho So · James Cheng 🔗 


Neural DAG Scheduling via One-Shot Priority Sampling
(
Poster
)
link
We consider the problem of scheduling operations/nodes, the dependency among which is characterized by a Directed Acyclic Graph (DAG). Due to its NP-hard nature, heuristic algorithms were traditionally used to acquire reasonably good solutions, and more recent works have proposed Machine Learning (ML) heuristics that can generalize to unseen graphs and outperform the non-ML heuristics. However, it is computationally costly to generate solutions using existing ML schedulers since they adopt the episodic reinforcement learning framework that necessitates multi-round neural network processing. We propose a novel ML scheduler that uses a one-shot neural network encoder to sample node priorities which are converted by list scheduling to the final schedules. Since the one-shot encoder can efficiently sample the priorities in parallel, our algorithm runs significantly faster than existing ML baselines and has comparable run time with the fast traditional heuristics. We empirically show that our algorithm generates better schedules than both non-neural and neural baselines across various real-world and synthetic scheduling tasks. 
Wonseok Jeon · Mukul Gagrani · Burak Bartan · Weiliang Zeng · Harris Teague · Piero Zappi · Christopher Lott 🔗 


Stochastic Gradient Estimator for Differentiable NAS
(
Poster
)
link
Neural architecture search (NAS) has recently attracted more attention due to its ability to design deep neural networks automatically. Differentiable NAS methods have predominated due to their search efficiency. However, differentiable NAS methods consistently adopt approximate gradient-based methods to solve bilevel optimization problems. While second-derivative approximation optimizes Jacobian and/or Hessian vector computation, it is imprecise and time-consuming in practice. In this paper, we revisit the hypergradient of bilevel optimization problems in NAS, then propose a new optimizer based on a stochastic gradient estimator (SGE) for the computation of the Jacobian matrix in the hypergradient. The SGE is adaptable to previous differentiable NAS methods and eliminates the second-order computation in the optimization process. In experiments on common differentiable NAS benchmarks, the proposed SGE-NAS algorithm outperforms the baseline algorithm. The test results demonstrate that the proposed SGE-NAS can effectively reduce search time and find models with higher classification performance. 
Libin Hou · Linyuan Wang · Qi Peng · Bin Yan 🔗 


Near-optimal decentralized algorithms for network dynamic optimization
(
Poster
)
link
SlidesLive Video
We study dynamic decision-making problems in networks under stochastic uncertainty about future payoffs. The network has a bounded degree, and each node takes a discrete decision at each period, leading to a per-period payoff that is a sum of three parts: node rewards for individual node decisions, temporal interactions between individual node decisions from the current and previous periods, and spatial interactions between decisions from pairs of neighboring nodes. The objective is to maximize the expected total payoffs over a finite horizon. We propose a decentralized algorithm whose computational requirement is linear in the graph size and planning horizon, and characterize sufficient conditions under which our decentralized algorithm achieves near optimality compared to the centralized global optimum. The class of decentralized algorithms is parameterized by a locality parameter $L$. An $L$-local algorithm makes its decision at each node $v$ based on current and (simulated) future payoffs only up to $L$ periods ahead, and only in an $L$-radius neighborhood around $v$. Given any permitted error $\epsilon > 0$, with $L = O(\log(1/\epsilon))$, we show that the $L$-local algorithm has an average per-node per-period optimality loss of up to $\epsilon$ when temporal and spatial interactions are relatively small compared to the randomness in the node rewards and the graph degree.

Judy Gan · Yashodhan Kanoria · Xuan Zhang 🔗 


A Second-order Regression Model Shows Edge of Stability Behavior
(
Poster
)
link
SlidesLive Video Recent studies of learning algorithms have shown that there is a regime with an initial increase in the largest eigenvalue of the loss Hessian (progressive sharpening), followed by a stabilization of the eigenvalue near the maximum value which allows convergence (edge of stability). We consider a class of predictive models that are quadratic in the parameters, which we call second-order regression models. This is in contrast with the neural tangent kernel regime, where the predictive function is linear in the parameters. For quadratic objectives in two dimensions, we prove that this second-order regression model exhibits both progressive sharpening and edge of stability behavior. We then show that in higher dimensions, the model shows this behavior generically without the structure of a neural network, due to a nonlinearity induced in the learning dynamics. Finally, we show that edge of stability behavior in neural networks is correlated with the behavior in quadratic regression models. 
Fabian Pedregosa · Atish Agarwala · Jeffrey Pennington 🔗 
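The two-dimensional setting in the abstract is easy to simulate: the snippet below runs gradient descent on f(a, b) = (ab - y)^2 / 2, a model quadratic in its two parameters, and records the sharpness (largest Hessian eigenvalue) at each step. With the small, illustrative learning rate used here, one observes progressive sharpening toward the limiting value (about 2 for y = 1) while staying below the 2/η cutoff.

```python
import numpy as np

def run_gd(eta=0.02, steps=2000):
    """GD on f(a, b) = 0.5 * (a*b - y)^2, tracking the sharpness over time."""
    a, b, y = 0.3, 0.1, 1.0
    sharps = []
    for _ in range(steps):
        r = a * b - y
        a, b = a - eta * r * b, b - eta * r * a   # simultaneous GD update
        # Hessian of f: [[b^2, 2ab - y], [2ab - y, a^2]]
        H = np.array([[b * b, 2 * a * b - y], [2 * a * b - y, a * a]])
        sharps.append(np.linalg.eigvalsh(H).max())
    return np.array(sharps)
```

With a larger learning rate (so that 2/η falls below the limiting sharpness), this same model is the kind of example where the sharpness instead hovers at the cutoff.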


Online Min-max Optimization: Nonconvexity, Nonstationarity, and Dynamic Regret
(
Poster
)
link
SlidesLive Video Online min-max optimization has recently gained considerable interest due to its rich applications in game theory, multi-agent reinforcement learning, online robust learning, etc. Theoretical understanding in this field has mainly focused on convex-concave settings. Online min-max optimization with nonconvex geometries, which captures various online deep learning problems, has yet to be studied. In this paper, we make the first effort and investigate online nonconvex-strongly-concave min-max optimization in the nonstationary environment. We first introduce a natural notion of dynamic Nash equilibrium (NE) regret, and then propose a novel algorithm, coined SODA, to achieve the optimal regret. We further generalize our study to the setting with stochastic first-order feedback, and show that a variation of SODA can also achieve the same optimal regret in expectation. Our theoretical results and the superior performance of the proposed method are further validated by empirical experiments. To our best knowledge, this is the first exploration of efficient online nonconvex min-max optimization. 
Yu Huang · Yuan Cheng · Yingbin Liang · Longbo Huang 🔗 


Improved Deep Neural Network Generalization Using m-Sharpness-Aware Minimization
(
Poster
)
link
SlidesLive Video Modern deep learning models are overparameterized, where the optimization setup strongly affects the generalization performance. A key element of reliable optimization for these systems is the modification of the loss function. Sharpness-Aware Minimization (SAM) modifies the underlying loss function to guide descent methods towards flatter minima, which arguably have better generalization abilities. In this paper, we focus on a variant of SAM known as mSAM, which, during training, averages the updates generated by adversarial perturbations across several disjoint shards of a mini-batch. Recent work suggests that mSAM can outperform SAM in terms of test accuracy. However, a comprehensive empirical study of mSAM is missing from the literature; previous results have mostly been limited to specific architectures and datasets. To that end, this paper presents a thorough empirical evaluation of mSAM on various tasks and datasets. We provide a flexible implementation of mSAM and compare the generalization performance of mSAM to the performance of SAM and vanilla training on different image classification and natural language processing tasks. We also conduct careful experiments to understand the computational cost of training with mSAM, its sensitivity to hyperparameters and its correlation with the flatness of the loss landscape. Our analysis reveals that mSAM yields superior generalization performance and flatter minima, compared to SAM, across a wide range of tasks without significantly increasing computational costs. 
Kayhan Behdin · Qingquan Song · Aman Gupta · Sathiya Selvaraj · David Durfee · Ayan Acharya · Rahul Mazumder 🔗 
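For readers unfamiliar with SAM, one update looks as follows; mSAM would average this update over m disjoint shards of the mini-batch, while this single-shard sketch with illustrative constants omits that averaging.

```python
import numpy as np

def sam_step(w, grad, lr=0.1, rho=0.05):
    """One SAM update: perturb the weights along the normalized gradient
    (the adversarial ascent step), then descend using the gradient
    evaluated at the perturbed point."""
    g = grad(w)
    eps = rho * g / (np.linalg.norm(g) + 1e-12)  # adversarial perturbation
    return w - lr * grad(w + eps)                # descend from the perturbed point
```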


Reducing Communication in Nonconvex Federated Learning with a Novel Single-Loop Variance Reduction Method
(
Poster
)
link
SlidesLive Video
In Federated Learning (FL), inter-client heterogeneity causes two types of errors: (i) \emph{client drift error}, which is induced by multiple local updates, and (ii) \emph{client sampling error}, due to partial participation of clients at each communication round. While several solutions have been offered for the former, there is still much room for improvement on the latter. We provide a fundamental solution to this client sampling error. The key is a novel single-loop variance reduction algorithm, SLEDGE (Single-Loop mEthoD for Gradient Estimator), which does not require periodic computation of a full gradient but achieves optimal gradient complexity in the nonconvex finite-sum setting. While sampling a small number of clients at each communication round, the proposed FL algorithm, FLEDGE, requires provably fewer or at least equivalent communication rounds compared to any existing method, for finding first- and even second-order stationary points in the general nonconvex setting, and under the PL condition. Moreover, under less Hessian heterogeneity between clients, the required number of communication rounds approaches $\tilde{\Theta}(1)$.

Kazusato Oko · Shunta Akiyama · Tomoya Murata · Taiji Suzuki 🔗 


Fast Convergence of Random Reshuffling under Interpolation and the Polyak-Łojasiewicz Condition
(
Poster
)
link
SlidesLive Video Modern machine learning models are often overparameterized and as a result they can interpolate the training data. Under such a scenario, we study the convergence properties of a without-replacement sampling variant of Stochastic Gradient Descent (SGD), known as Random Reshuffling (RR). Unlike SGD, which samples data with replacement at every iteration, RR chooses a random permutation of the data at the beginning of each epoch. For underparameterized models, it has been recently shown that RR converges faster than SGD when the number of epochs is larger than the condition number (κ) of the problem, under standard assumptions like strong convexity. However, previous works do not show that RR outperforms SGD under interpolation for strongly convex objectives. Here, we show that for the class of Polyak-Łojasiewicz (PL) functions that generalizes strong convexity, RR can outperform SGD as long as the number of samples (n) is less than the parameter (ρ) of a strong growth condition (SGC). 
Chen Fan · Christos Thrampoulidis · Mark Schmidt 🔗 
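The difference between RR and with-replacement SGD is just where the randomness enters the loop, as in this minimal sketch (the step size and epoch count are illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)

def random_reshuffling(grads, w0, lr=0.1, epochs=50):
    """Random Reshuffling: one pass per epoch over a fresh permutation of the
    n sample gradients, in contrast with with-replacement SGD sampling."""
    w = w0
    n = len(grads)
    for _ in range(epochs):
        for i in rng.permutation(n):  # fresh permutation each epoch
            w = w - lr * grads[i](w)
    return w
```

Under interpolation every sample loss shares the same minimizer, so each without-replacement step still contracts toward it.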


ZerO Initialization: Initializing Neural Networks with only Zeros and Ones
(
Poster
)
link
Deep neural networks are usually initialized with random weights, with adequately selected initial variance to ensure stable signal propagation during training. However, selecting the appropriate variance becomes challenging, especially as the number of layers grows. In this work, we replace random weight initialization with a fully deterministic initialization scheme, viz., ZerO, which initializes the weights of networks with only zeros and ones (up to a normalization factor), based on identity and Hadamard transforms. Through both theoretical and empirical studies, we demonstrate that ZerO is able to train networks without damaging their expressivity. Applying ZerO on ResNet achieves state-of-the-art performance on various datasets, including ImageNet, which suggests random weights may be unnecessary for network initialization. In addition, ZerO has many benefits, such as training ultra-deep networks (without batch normalization), exhibiting low-rank learning trajectories that result in low-rank and sparse solutions, and improving training reproducibility. 
Jiawei Zhao · Florian Schaefer · Anima Anandkumar 🔗 
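A minimal sketch of a ZerO-flavored deterministic initializer, using the identity for matching shapes and a normalized Hadamard transform otherwise; this is illustrative and not the paper's exact per-layer recipe.

```python
import numpy as np

def hadamard(n):
    """Sylvester construction of an n x n Hadamard matrix (n a power of two)."""
    H = np.array([[1.0]])
    while H.shape[0] < n:
        H = np.block([[H, H], [H, -H]])
    return H

def zero_style_init(fan_in, fan_out):
    """Deterministic weights built only from 0s and +/-1s (up to normalization):
    identity when shapes match, a normalized Hadamard slice otherwise."""
    if fan_in == fan_out:
        return np.eye(fan_out, fan_in)
    n = 1 << (max(fan_in, fan_out) - 1).bit_length()  # next power of two
    H = hadamard(n) / np.sqrt(n)                      # orthonormal rows
    return H[:fan_out, :fan_in]
```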


Differentially Private Federated Learning with Normalized Updates
(
Poster
)
link
SlidesLive Video The customary approach for client-level differentially private federated learning (FL) is to add Gaussian noise to the average of the clipped client updates. Clipping is associated with the following issue: as the client updates fall below the clipping threshold, they get drowned out by the added noise, inhibiting convergence. To mitigate this issue, we propose replacing clipping with normalization, where we use only a scaled version of the unit vector along the client updates. Normalization ensures that the noise does not drown out the client updates even when the original updates are small. We theoretically show that the resulting normalization-based private FL algorithm attains better convergence than its clipping-based counterpart on convex objectives in overparameterized settings. 
Rudrajit Das · Abolfazl Hashemi · Sujay Sanghavi · Inderjit Dhillon 🔗 
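The clipping-versus-normalization distinction can be seen in a few lines; this sketch of the private aggregation step uses illustrative names and omits the surrounding federated training loop.

```python
import numpy as np

rng = np.random.default_rng(0)

def private_aggregate(updates, c, sigma, normalize=True):
    """Clip vs. normalize client updates before Gaussian-noised averaging.
    Clipping leaves small updates small (so noise can drown them out);
    normalization rescales every update to norm c, keeping signal
    comparable to the noise."""
    out = []
    for u in updates:
        norm = np.linalg.norm(u)
        if normalize:
            out.append(c * u / (norm + 1e-12))   # scaled unit vector
        else:
            out.append(u / max(1.0, norm / c))   # standard clipping
    noise = sigma * c * rng.standard_normal(updates[0].shape)
    return np.mean(out, axis=0) + noise / len(updates)
```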


Adaptive Inexact Sequential Quadratic Programming via Iterative Randomized Sketching
(
Poster
)
link
SlidesLive Video We consider solving nonlinear optimization problems with equality constraints. We propose a randomized algorithm based on sequential quadratic programming (SQP) with a differentiable exact augmented Lagrangian as the merit function. In each SQP iteration, we solve the Newton system inexactly via iterative randomized sketching. The accuracy of the inexact solution and the penalty parameter of the augmented Lagrangian are adaptively controlled in the algorithm to ensure that the inexact random search direction is a descent direction of the augmented Lagrangian. This allows us to establish global convergence almost surely. Moreover, we show that a unit stepsize is admissible for the inexact search direction provided the iterate lies in a neighborhood of the solution. Based on this result, we show that the proposed algorithm exhibits local linear convergence. We apply the algorithm to benchmark nonlinear problems in the CUTEst test set and to constrained logistic regression with datasets from LIBSVM to demonstrate its superior performance. 
Ilgee Hong · Sen Na · Mladen Kolar 🔗 


Enhanced Index Tracking via Differentiable Assets Sorting
(
Poster
)
link
Enhanced index tracking (EIT) aims to achieve better performance over a target equity index while maintaining a relatively low tracking error. It can be formulated as a quadratic programming problem, but remains challenging when several practical constraints exist, especially the fixed number of assets in the portfolio. In this work, we propose a new method for enhanced index tracking, subject to common practical constraints, including cardinality, which is based on a novel reparametrisation of portfolio weights integrated with a stochastic optimisation. It can simultaneously tackle asset selection and capital allocation, while being optimised by vanilla gradient descent effectively and efficiently. The proposed method is backtested with S&P 500 and Russell 1000 indices data for over a decade. Empirical results demonstrate its superiority over widely used alternatives. 
Yuanyuan Liu · Yongxin Yang 🔗 
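A minimal sketch of the weight-reparametrisation idea: portfolio weights are softmax(theta), so nonnegativity and the budget constraint hold by construction, and plain gradient descent minimizes the tracking error. The differentiable sorting step that enforces cardinality in the paper is omitted, and all names and constants are illustrative.

```python
import numpy as np

def track_index(R, index_returns, steps=500, lr=0.5):
    """Minimize mean squared tracking error over T periods with w = softmax(theta),
    so the weights are nonnegative and sum to one by construction."""
    T, n = R.shape
    theta = np.zeros(n)
    for _ in range(steps):
        e = np.exp(theta - theta.max())
        w = e / e.sum()
        resid = R @ w - index_returns
        grad_w = R.T @ resid / T
        # chain rule through softmax: J @ g = w * (g - w.g)
        theta -= lr * (w * (grad_w - w @ grad_w))
    e = np.exp(theta - theta.max())
    return e / e.sum()
```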


Learning deep neural networks by iterative linearisation
(
Poster
)
link
SlidesLive Video The excellent real-world performance of deep neural networks has received increasing attention. Despite the capacity to overfit significantly, such large models work better than smaller ones. This phenomenon is often referred to as the scaling law by practitioners. It is of fundamental interest to study why the scaling law exists and how it avoids/controls overfitting. One approach has been looking at infinite width limits of neural networks (e.g., Neural Tangent Kernels, Gaussian Processes); however, in practice, these do not fully explain finite networks, as their infinite counterparts do not learn features. Furthermore, the empirical kernel for finite networks (i.e., the inner product of feature vectors) changes significantly during training, in contrast to infinite width networks. In this work we derive an iterative linearised training method. We justify iterative linearisation as an interpolation between finite analogs of the infinite width regime, which do not learn features, and standard gradient descent training, which does. We show some preliminary results where iterative linearised training works well, noting in particular how much feature learning is required to achieve comparable performance. We also provide novel insights into the training behaviour of neural networks. 
Adrian Goldwaser · Hong Ge 🔗 


Self-Stabilization: The Implicit Bias of Gradient Descent at the Edge of Stability
(
Poster
)
link
Traditional analyses of gradient descent show that when the largest eigenvalue of the Hessian, also known as the sharpness $S(\theta)$, is bounded by $2/\eta$, training is "stable" and the training loss decreases monotonically. However, Cohen et al. (2021) recently observed two important phenomena. The first, \emph{progressive sharpening}, is that the sharpness steadily increases throughout training until it reaches the instability cutoff $2/\eta$. The second, \emph{edge of stability}, is that the sharpness hovers at $2/\eta$ for the remainder of training while the loss non-monotonically decreases. We demonstrate that, far from being chaotic, the dynamics of gradient descent at the edge of stability can be captured by a cubic Taylor expansion: as the iterates diverge in the direction of the top eigenvector of the Hessian due to instability, the cubic term in the local Taylor expansion of the loss function causes the curvature to decrease until stability is restored. This property, which we call \emph{self-stabilization}, is a general property of gradient descent and explains its behavior at the edge of stability. A key consequence of self-stabilization is that gradient descent at the edge of stability implicitly follows \emph{projected} gradient descent (PGD) under the constraint $S(\theta) \le 2/\eta$. Our analysis provides precise predictions for the loss, sharpness, and deviation from the PGD trajectory throughout training, which we verify both empirically in a number of standard settings and theoretically under mild conditions. Our analysis uncovers the mechanism for gradient descent's implicit bias towards stability.

Alex Damian · Eshaan Nichani · Jason Lee 🔗 


Understanding Curriculum Learning in Policy Optimization for Online Combinatorial Optimization
(
Poster
)
link
SlidesLive Video In recent years, reinforcement learning (RL) has started to show promising results in tackling combinatorial optimization (CO) problems, in particular when coupled with curriculum learning to facilitate training. Despite emerging empirical evidence, theoretical study of why RL helps is still at an early stage. This paper presents the first systematic study of policy optimization methods for online CO problems. We show that online CO problems can be naturally formulated as latent Markov Decision Processes (LMDPs), and prove convergence bounds on natural policy gradient (NPG) for solving LMDPs. Furthermore, our theory explains the benefit of curriculum learning: it can find a strong sampling policy and reduce the distribution shift, a critical quantity that governs the convergence rate in our theorem. For a canonical online CO problem, the Secretary Problem, we formally prove that distribution shift is reduced exponentially with curriculum learning, even if the curriculum is randomly generated. Our theory also shows we can simplify the curriculum learning scheme used in prior work from multi-step to single-step. Lastly, we provide extensive experiments on the Secretary Problem and Online Knapsack to verify our findings. 
Runlong Zhou · Yuandong Tian · YI WU · Simon Du 🔗 
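For readers unfamiliar with the Secretary Problem mentioned above, here is a minimal NumPy simulation of the classical $1/e$ stopping rule — a standard baseline, not the paper's curriculum-trained policy.

```python
import numpy as np

def secretary_success_rate(n=50, trials=10000, seed=0):
    """Classic 1/e rule: skip the first n/e candidates, then accept the
    first candidate better than everything seen so far."""
    rng = np.random.default_rng(seed)
    cutoff = int(n / np.e)
    wins = 0
    for _ in range(trials):
        vals = rng.permutation(n)       # candidate qualities in arrival order
        best_seen = vals[:cutoff].max()
        pick = None
        for v in vals[cutoff:]:
            if v > best_seen:
                pick = v
                break
        wins += (pick == n - 1)         # success iff we picked the overall best
    return wins / trials

rate = secretary_success_rate()
print(rate)   # close to the asymptotic optimum 1/e ≈ 0.368
```

An RL policy for the online problem has to learn a comparable stopping threshold from experience, which is where the curriculum discussed in the abstract comes in.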


Solving Constrained Variational Inequalities via a First-order Interior Point-based Method
(Poster)
link
SlidesLive Video
We focus on the open problem of developing a first-order method that can solve constrained Variational Inequality (cVI) problems with general constraints. We generalize the \textit{alternating direction method of multipliers} (ADMM) method and combine it with interior-point approaches, yielding a first-order method that we refer to as the ADMM-based interior-point method for cVIs (ACVI). We provide convergence guarantees for ACVI in two general classes of problems: (i) when the operator is $\xi$-monotone, and (ii) when it is monotone, some constraints are active, and the game is not purely rotational. When the operator is, in addition, $L$-Lipschitz in the latter case, we match the known lower bounds on rates for the gap function of $\mathcal{O}(1/\sqrt{K})$ and $\mathcal{O}(1/K)$ for the last and average iterate, respectively. To our knowledge, this is the first \emph{first-order} interior-point method for the general cVI problem that has a global convergence guarantee. Empirical analyses demonstrate clear advantages of ACVI over common first-order methods. In particular, (i) cyclical behavior is notably reduced as our method approaches the solution from the analytic center, and (ii) unlike projection-based methods that zigzag when near a constraint, ACVI efficiently handles the constraints.

Tong Yang · Michael Jordan · Tatjana Chavdarova 🔗 


Mean-field analysis for heavy ball methods: Dropout-stability, connectivity, and global convergence
(Poster)
link
SlidesLive Video
The stochastic heavy ball method (SHB), also known as stochastic gradient descent (SGD) with Polyak's momentum, is widely used in training neural networks. However, despite the remarkable success of this algorithm in practice, its theoretical characterization remains limited. In this paper, we focus on neural networks with two and three layers and provide a rigorous understanding of the properties of the solutions found by SHB: \emph{(i)} stability after dropping out part of the neurons, \emph{(ii)} connectivity along a low-loss path, and \emph{(iii)} convergence to the global optimum. To achieve this goal, we take a mean-field view and relate the SHB dynamics to a certain partial differential equation in the limit of large network widths. This mean-field perspective has inspired a recent line of work focusing on SGD while, in contrast, our paper considers an algorithm with momentum. More specifically, after proving existence and uniqueness of the limit differential equations, we show convergence to the global optimum and give a quantitative bound between the mean-field limit and the SHB dynamics of a finite-width network. Armed with this last bound, we are able to establish the dropout-stability and connectivity of SHB solutions.
Diyuan Wu · Vyacheslav Kungurtsev · Marco Mondelli 🔗 
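The SHB update analyzed above is, concretely, gradient descent with Polyak momentum. A minimal sketch on a toy least-squares problem (deterministic gradients and illustrative constants; the paper's mean-field analysis concerns wide neural networks, not this toy):

```python
import numpy as np

def shb(grad_fn, w0, eta=0.1, beta=0.9, steps=500):
    """Heavy ball / Polyak momentum:
       v_{t+1} = beta * v_t + grad(w_t);  w_{t+1} = w_t - eta * v_{t+1}."""
    w, v = w0.copy(), np.zeros_like(w0)
    for _ in range(steps):
        v = beta * v + grad_fn(w)
        w = w - eta * v
    return w

# Toy objective 0.5 * ||A w - b||^2 with a known minimizer
A = np.diag([1.0, 2.0])
b = np.array([1.0, -1.0])
grad_fn = lambda w: A.T @ (A @ w - b)
w_star = np.linalg.solve(A.T @ A, A.T @ b)   # [1.0, -0.5]

w = shb(grad_fn, np.zeros(2))
print(w, w_star)   # the iterate reaches the minimizer
```

With stochastic minibatch gradients in place of `grad_fn`, this is exactly the SHB recursion whose wide-network limit the paper studies.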


A Stochastic Prox-Linear Method for CVaR Minimization
(Poster)
link
SlidesLive Video
We develop an instance of the stochastic prox-linear method for minimizing the Conditional Value-at-Risk (CVaR) objective. CVaR is a risk measure focused on minimizing worst-case performance, defined as the average of the top quantile of the losses. In machine learning, such a risk measure is useful to train more robust models. Although the stochastic subgradient method (SGM) is a natural choice for minimizing the CVaR objective, we show that the prox-linear algorithm can be used to better exploit the structure of the objective, while still providing a convenient closed-form update. We then specialize a general convergence theorem for the prox-linear method to our setting, and show that it allows for a wider selection of step sizes compared to SGM. We support this theoretical finding experimentally, by showing that the performance of stochastic prox-linear is more robust to the choice of step size compared to SGM.
Si Yi Meng · Vasileios Charisopoulos · Robert Gower 🔗 
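The CVaR objective described here — the average of the top quantile of losses — can equivalently be written in the Rockafellar-Uryasev form, whose composite structure prox-linear-type methods exploit. A small sketch (our own, with illustrative data) checking that the two expressions agree:

```python
import numpy as np

def cvar_topk(losses, alpha):
    """CVaR_alpha as the average of the worst (1 - alpha) fraction of losses."""
    k = max(1, int(np.ceil(len(losses) * (1 - alpha))))
    return np.sort(losses)[-k:].mean()

def cvar_ru(losses, alpha):
    """Rockafellar-Uryasev form: min_t  t + E[max(loss - t, 0)] / (1 - alpha),
    minimized here by brute force over the observed loss values."""
    vals = [t + np.maximum(losses - t, 0.0).mean() / (1 - alpha) for t in losses]
    return min(vals)

losses = np.arange(1.0, 11.0)          # losses 1..10
print(cvar_topk(losses, 0.8))          # mean of top 20%: (9 + 10) / 2 = 9.5
print(cvar_ru(losses, 0.8))            # matches the top-quantile average: 9.5
```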


Counterfactual Explanations Using Optimization With Constraint Learning
(Poster)
link
SlidesLive Video
Counterfactual explanations have invaluable potential to make model predictions more sensible to users. To increase their adoption in practice, several criteria that counterfactual explanations should adhere to have been put forward in the literature. We propose counterfactual explanations using optimization with constraint learning (CE-OCL), a generic and flexible approach that addresses all these criteria and allows room for further extensions. Specifically, we discuss how we can leverage an optimization with constraint learning framework for the generation of counterfactual explanations, and how components of this framework readily map to the criteria. We also propose two novel modeling approaches to address data manifold closeness and diversity, which are two key criteria for practical counterfactual explanations. We test CE-OCL on several datasets and present our results in a case study. Compared against current state-of-the-art methods, CE-OCL allows for more flexibility and has an overall superior performance in terms of several evaluation metrics proposed in related work.
Donato Maragno · Tabea E. Röber · Ilker Birbil 🔗 


Accelerated Single-Call Methods for Constrained Min-Max Optimization
(Poster)
link
SlidesLive Video
We study first-order methods for constrained min-max optimization. Existing methods either require two gradient calls or two projections in each iteration, which may be costly in applications. In this paper, we first show that the \emph{Optimistic Gradient (OG)} method, a \emph{single-call single-projection} algorithm, has an $O(\frac{1}{\sqrt{T}})$ convergence rate for inclusion problems with operators that satisfy the weak Minty variational inequality (MVI). Our second result is the first single-call single-projection algorithm, the \emph{Accelerated Reflected Gradient (ARG)} method, that achieves the \emph{optimal $O(\frac{1}{T})$} convergence rate for inclusion problems that satisfy negative comonotonicity. Both the weak MVI and negative comonotonicity are well-studied assumptions and capture a rich set of nonconvex-nonconcave min-max optimization problems. Finally, we show that the \emph{Reflected Gradient (RG)} method, another \emph{single-call single-projection} algorithm, has an $O(\frac{1}{\sqrt{T}})$ last-iterate convergence rate for constrained convex-concave min-max optimization, answering an open problem of Hsieh et al. (2019).

Yang Cai · Weiqiang Zheng 🔗 
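The "single-call" property means one fresh gradient evaluation per iteration, with the previous gradient reused. A minimal sketch of the Optimistic Gradient update on a toy bilinear game — a monotone instance, far simpler than the weak-MVI problems the paper targets, and unconstrained so no projection is needed:

```python
import numpy as np

# min_x max_y f(x, y) = x * y   (toy bilinear game; equilibrium at the origin)
grad = lambda x, y: (y, x)              # (df/dx, df/dy)

def optimistic_gd(x, y, eta=0.1, steps=2000):
    """Optimistic Gradient: one gradient call per step,
       z_{t+1} = z_t -/+ eta * (2 g_t - g_{t-1})."""
    gx_prev, gy_prev = grad(x, y)
    for _ in range(steps):
        gx, gy = grad(x, y)             # the single fresh gradient call
        x = x - eta * (2 * gx - gx_prev)   # descent in x
        y = y + eta * (2 * gy - gy_prev)   # ascent in y
        gx_prev, gy_prev = gx, gy
    return x, y

x, y = optimistic_gd(1.0, 1.0)
print(np.hypot(x, y))   # shrinks toward the equilibrium (0, 0)
```

On this game, plain simultaneous gradient descent-ascent spirals outward; the extrapolation term `2 g_t - g_{t-1}` is what restores convergence.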


Accelerated Algorithms for Monotone Inclusion and Constrained Nonconvex-Nonconcave Min-Max Optimization
(Poster)
link
SlidesLive Video
We study monotone inclusions and monotone variational inequalities, as well as their generalizations to nonmonotone settings. We first show that the \emph{Extra Anchored Gradient (EAG)} algorithm, originally proposed by [Yoon and Ryu, 2021] for unconstrained convex-concave min-max optimization, can be applied to solve the more general problem of Lipschitz monotone inclusion. More specifically, we prove that EAG solves Lipschitz monotone inclusion problems with an \emph{accelerated convergence rate} of $O(\frac{1}{T})$, which is \emph{optimal among all first-order methods} [Diakonikolas, 2020; Yoon and Ryu, 2021]. Our second result is an accelerated forward-backward splitting algorithm (AS), which not only achieves the accelerated $O(\frac{1}{T})$ convergence rate for all monotone inclusion problems, but also exhibits the same accelerated rate for a family of general (nonmonotone) inclusion problems that concern negative comonotone operators. As a special case of our second result, AS enjoys the $O(\frac{1}{T})$ convergence rate for solving a nontrivial class of nonconvex-nonconcave min-max optimization problems. Our analyses are based on simple potential function arguments, which might be useful for analysing other accelerated algorithms.

Yang Cai · Argyris Oikonomou · Weiqiang Zheng 🔗 


Accelerated Riemannian Optimization: Handling Constraints to Bound Geometric Penalties
(Poster)
link
SlidesLive Video
We propose a globally accelerated, first-order method for the optimization of smooth and (strongly or not) geodesically convex functions in Hadamard manifolds. Our algorithm enjoys the same convergence rates as Nesterov's accelerated gradient descent, up to a multiplicative geometric penalty and log factors. Crucially, we can enforce our method to stay within a compact set that we define. Prior fully accelerated works resort to assuming that the iterates of their algorithms stay in some pre-specified compact set, except for two previous methods, whose applicability is limited to local optimization and to spaces of constant curvature, respectively. Achieving global and general Riemannian acceleration without assuming that the iterates stay in the feasible set was posed as an open question in (Kim & Yang, 2022), which we solve for Hadamard manifolds. In our solution, we show that we can use a linearly convergent algorithm for constrained strongly g-convex smooth problems to implement a Riemannian inexact proximal point operator that we use as a subroutine, which is of independent interest.
David MartinezRubio · Sebastian Pokutta 🔗 


Gradient Descent: Robustness to Adversarial Corruption
(Poster)
link
SlidesLive Video
Optimization using gradient descent (GD) is a ubiquitous practice in various machine learning problems, including training large neural networks. Noise-free GD and stochastic GD, corrupted by random noise, have been extensively studied in the literature, but less attention has been paid to the adversarial setting, in which the gradient values are subject to adversarial corruptions. In this work, we analyze the performance of GD under a proposed general adversarial framework. For the class of functions satisfying the Polyak-Łojasiewicz condition, we derive finite-time bounds on the minimax optimization error. Based on this bound, we provide a guideline on the choice of learning rate sequence, with theoretical guarantees on the robustness of GD against adversarial corruption.
FuChieh Chang · Farhang Nabiei · PeiYuan Wu · Alexandru Cioba · Sattar Vakili · Alberto Bernacchia 🔗 


Boosting as Frank-Wolfe
(Poster)
link
SlidesLive Video
Some boosting algorithms, such as LPBoost, ERLPBoost, and C-ERLPBoost, aim to solve the soft margin optimization problem with $\ell_1$-norm regularization. LPBoost rapidly converges to an $\epsilon$-approximate solution in practice, but it is known to take $\Omega(m)$ iterations in the worst case, where $m$ is the sample size. On the other hand, ERLPBoost and C-ERLPBoost are guaranteed to converge to an $\epsilon$-approximate solution in $O(\frac{1}{\epsilon^2} \ln \frac{m}{\nu})$ iterations. However, the computation per iteration is very high compared to LPBoost. To address this issue, we propose a generic boosting scheme that combines the Frank-Wolfe algorithm with any secondary algorithm and switches between them iteratively. We show that the scheme retains the same convergence guarantee as ERLPBoost and C-ERLPBoost. One can incorporate any secondary algorithm to improve performance in practice. This scheme comes from a unified view of boosting algorithms for soft margin optimization. More specifically, we show that LPBoost, ERLPBoost, and C-ERLPBoost are instances of the Frank-Wolfe algorithm. In experiments on real datasets, one of the instances of our scheme exploits the better updates of the secondary algorithm and performs comparably with LPBoost.

Ryotaro Mitsuboshi · Kohei Hatano · Eiji Takimoto 🔗 
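In the Frank-Wolfe view of boosting sketched above, the weak learner plays the role of the linear minimization oracle (LMO). A minimal vanilla Frank-Wolfe loop over the probability simplex, with an illustrative quadratic objective standing in for the soft-margin objective:

```python
import numpy as np

def frank_wolfe_simplex(grad_fn, n, steps=500):
    """Vanilla Frank-Wolfe over the probability simplex.
    The LMO argmin_{s in simplex} <grad, s> is always a vertex: the
    coordinate with the smallest partial derivative (in boosting, this
    oracle call corresponds to asking for the best single hypothesis)."""
    p = np.full(n, 1.0 / n)
    for t in range(steps):
        g = grad_fn(p)
        s = np.zeros(n)
        s[np.argmin(g)] = 1.0          # LMO: best vertex of the simplex
        gamma = 2.0 / (t + 2.0)        # standard open-loop step size
        p = (1 - gamma) * p + gamma * s
    return p

# Illustrative objective: ||p - c||^2 with c inside the simplex, so p* = c
c = np.array([0.2, 0.3, 0.5])
p = frank_wolfe_simplex(lambda p: 2 * (p - c), len(c))
print(p)   # approaches c = [0.2, 0.3, 0.5] at the O(1/t) Frank-Wolfe rate
```

The iterate stays feasible by construction (a convex combination of vertices), which is the reason Frank-Wolfe needs no projection step.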


RandProx: Primal-Dual Optimization Algorithms with Randomized Proximal Updates
(Poster)
link
SlidesLive Video
Proximal splitting algorithms are well suited for large-scale nonsmooth optimization problems. We propose a primal-dual algorithm in which the dual update is randomized, with the proximity operator of one of the functions replaced by a stochastic oracle. A nonsmooth variance-reduction technique is implemented so that the algorithm finds an exact minimizer of the general problem. We derive linear convergence results in the presence of strong convexity. Several existing randomized algorithms, like Point-SAGA, are recovered as particular cases. Randomness helps to obtain faster algorithms; this has long been known for stochastic-gradient-type algorithms, and our work shows that it fully applies in the more general primal-dual setting as well.
Laurent Condat · Peter Richtarik 🔗 


A Unified Framework to Understand Decentralized and Federated Optimization Algorithms: A Multi-Rate Feedback Control Perspective
(Poster)
link
SlidesLive Video
Distributed algorithms have been playing an increasingly important role in many applications, such as machine learning, signal processing, and control. In this work, we provide a fresh perspective to understand, analyze, and design distributed optimization algorithms. Through the lens of multi-rate feedback control, we show that a wide class of distributed algorithms, including popular decentralized/federated schemes such as decentralized gradient descent, gradient tracking, and federated averaging, can be viewed as discretizing a certain continuous-time feedback control system, possibly with multiple sampling rates. This key observation not only allows us to develop a generic framework to analyze the convergence of the entire algorithm class; more importantly, it also leads to an interesting way of designing new distributed algorithms. We develop the theory behind our framework and provide examples to highlight how the framework can be used in practice.
xinwei zhang · Nicola Elia · Mingyi Hong 🔗 


Stochastic Gradient Descent-Ascent: Unified Theory and New Efficient Methods
(Poster)
link
SlidesLive Video
Stochastic Gradient Descent-Ascent (SGDA) is one of the most prominent algorithms for solving min-max optimization and variational inequality problems (VIPs) appearing in various machine learning tasks. The success of the method has led to several advanced extensions of the classical SGDA, including variants with arbitrary sampling, variance reduction, coordinate randomization, and distributed variants with compression, which were extensively studied in the literature, especially during the last few years. In this paper, we propose a unified convergence analysis that covers a large variety of stochastic gradient descent-ascent methods, which so far have required different intuitions, have different applications, and have been developed separately in various communities. A key to our unified framework is a parametric assumption on the stochastic estimates. Via our general theoretical framework, we either recover the sharpest known rates for the known special cases or tighten them. Moreover, to illustrate the flexibility of our approach, we develop several new variants of SGDA, such as a new variance-reduced method (L-SVRGDA), new distributed methods with compression (QSGDA, DIANA-SGDA, VR-DIANA-SGDA), and a new method with coordinate randomization (SEGA-SGDA). Although variants of these methods were known for minimization problems, they were never considered for solving min-max problems and VIPs. We also demonstrate the most important properties of the new methods through extensive numerical experiments.
Aleksandr Beznosikov · Eduard Gorbunov · Hugo Berard · Nicolas Loizou 🔗 
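For reference, the base SGDA update that the unified analysis covers, sketched on a toy strongly-convex-strongly-concave saddle problem with additive gradient noise — one simple instance of a parametric assumption on the stochastic estimates; all constants are illustrative.

```python
import numpy as np

# Toy saddle problem: min_x max_y f(x, y) = 0.5*x^2 + x*y - 0.5*y^2
# (strongly convex in x, strongly concave in y; saddle point at the origin)
def sgda(x, y, eta=0.05, sigma=0.01, steps=2000, seed=0):
    """Simultaneous stochastic gradient descent-ascent with additive
    zero-mean noise of standard deviation sigma on each gradient."""
    rng = np.random.default_rng(seed)
    for _ in range(steps):
        gx = x + y + sigma * rng.standard_normal()   # noisy df/dx
        gy = x - y + sigma * rng.standard_normal()   # noisy df/dy
        x, y = x - eta * gx, y + eta * gy            # simultaneous update
    return x, y

x, y = sgda(1.0, 1.0)
print(np.hypot(x, y))   # small: iterates settle in an O(sigma)-neighborhood of 0
```

With a constant step size the iterates reach a noise-dominated neighborhood of the saddle point; the unified analysis in the paper makes such neighborhood/rate trade-offs precise across many SGDA variants.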


Optimization using Parallel Gradient Evaluations on Multiple Parameters
(Poster)
link
SlidesLive Video
We propose a first-order method for convex optimization, where instead of being restricted to the gradient from a single parameter, gradients from multiple parameters can be used during each step of gradient descent. This setup is particularly useful when a few processors are available that can be used in parallel for optimization. Our method uses gradients from multiple parameters in synergy to update these parameters together towards the optimum. While doing so, it is ensured that the computational and memory complexity is of the same order as that of gradient descent. Empirical results demonstrate that even using gradients from as few as \textit{two} parameters, our method can often obtain significant acceleration and provide robustness to hyperparameter settings. We remark that the primary goal of this work is less theoretical, and is instead aimed at exploring the understudied case of using multiple gradients during each step of optimization.
Yash Chandak · Shiv Shankar · Venkata Gandikota · Philip Thomas · Arya Mazumdar 🔗 


An Accuracy Guaranteed Online Solver for Learning in Dynamic Feature Space
(Poster)
link
SlidesLive Video
We study the problem of adding or deleting features of data from machine learning models trained using empirical risk minimization. We focus on online algorithms that can handle more general regularization terms, and provide practical guidance for two classical regularizers, i.e., the group Lasso and the $\ell_p$-norm regularizer. Across a variety of benchmark datasets, our algorithm improves upon the runtime of prior methods while maintaining the \emph{same} generalization accuracy.

Diyang Li · Bin Gu 🔗 


Adaptive Methods for Nonconvex Continual Learning
(Poster)
link
SlidesLive Video
One of the objectives of continual learning is to prevent catastrophic forgetting when learning multiple tasks sequentially, and the existing solutions have been driven by the conceptualization of the plasticity-stability dilemma. However, the convergence of continual learning for each sequential task has been less studied so far. In this paper, we provide a convergence analysis of memory-based continual learning with stochastic gradient descent and empirical evidence that training on the current task causes the cumulative degradation of previous tasks. We propose an adaptive method for nonconvex continual learning (NCCL), which adjusts the step sizes of both previous and current tasks with the gradients. The proposed method can achieve the same convergence rate as SGD when the catastrophic forgetting term, which we define in the paper, is suppressed at each iteration. Further, we demonstrate that the proposed algorithm improves the performance of continual learning over existing methods for several image classification tasks.
Seungyub Han · Yeongmo Kim · Taehyun Cho · Jungwoo Lee 🔗 


Random initialisations performing above chance and how to find them
(Poster)
link
Neural networks trained with stochastic gradient descent (SGD) starting from different random initialisations typically find functionally very similar solutions, raising the question of whether there are meaningful differences between different SGD solutions. Entezari et al. recently conjectured that despite different initialisations, the solutions found by SGD lie in the same loss valley after taking into account the permutation invariance of neural networks. Concretely, they hypothesise that any two solutions found by SGD can be permuted such that the linear interpolation between their parameters forms a path without significant increases in loss. Here, we use a simple but powerful algorithm to find such permutations that allows us to obtain direct empirical evidence that the hypothesis is true in fully connected networks. Strikingly, we find that two networks already live in the same loss valley at the time of initialisation and averaging their random, but suitably permuted initialisation performs significantly above chance. In contrast, for convolutional architectures, our evidence suggests that the hypothesis does not hold. Especially in a large learning rate regime, SGD seems to discover diverse modes. 
Frederik Benzing · Simon Schug · Robert Meier · Johannes von Oswald · Yassir Akram · Nicolas Zucchet · Laurence Aitchison · Angelika Steger 🔗 
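A hedged sketch of the kind of unit matching described above: align the hidden units of two networks by greedily pairing rows of their (normalized) weight matrices by cosine similarity. The paper's actual algorithm may differ; this only demonstrates that a planted permutation is recoverable.

```python
import numpy as np

def match_units(W_a, W_b):
    """Greedy unit matching: pair each row of W_a with its most similar
    unused row of W_b under cosine similarity."""
    A = W_a / np.linalg.norm(W_a, axis=1, keepdims=True)
    B = W_b / np.linalg.norm(W_b, axis=1, keepdims=True)
    sim = A @ B.T                      # sim[i, j] = cosine(row i of A, row j of B)
    perm = np.zeros(len(W_a), dtype=int)
    used = set()
    for i in range(len(W_a)):
        j = max((j for j in range(len(W_b)) if j not in used),
                key=lambda j: sim[i, j])
        perm[i] = j
        used.add(j)
    return perm

# Sanity check: recover a planted row permutation of a random weight matrix
rng = np.random.default_rng(0)
W_a = rng.standard_normal((8, 16))
shuffle = rng.permutation(8)
W_b = W_a[shuffle]
perm = match_units(W_a, W_b)
print(np.array_equal(W_b[perm], W_a))   # True: permutation recovered
```

Once such a permutation is found, one can permute one network's units accordingly and then interpolate or average parameters, as in the experiments the abstract describes.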


On the Complexity of Finding Small Subgradients in Nonsmooth Optimization
(Poster)
link
SlidesLive Video
We study the oracle complexity of producing $(\delta,\epsilon)$-stationary points of Lipschitz functions, in the sense proposed by Zhang et al. [2020]. While there exist dimension-free randomized algorithms for producing such points within $\widetilde{O}(1/\delta\epsilon^3)$ first-order oracle calls, we show that no dimension-free rate can be achieved by a deterministic algorithm. On the other hand, we point out that this rate can be derandomized for smooth functions with merely a logarithmic dependence on the smoothness parameter. Moreover, we establish several lower bounds for this task which hold for any randomized algorithm, with or without convexity. Finally, we show how the convergence rate of finding $(\delta,\epsilon)$-stationary points can be improved when the function is convex, a setting which we motivate by proving that in general no finite-time algorithm can produce points with small subgradients even for convex functions.

Guy Kornowski · Ohad Shamir 🔗 


On the Complexity of Finding Small Subgradients in Nonsmooth Optimization
(Oral)
link
We study the oracle complexity of producing $(\delta,\epsilon)$-stationary points of Lipschitz functions, in the sense proposed by Zhang et al. [2020]. While there exist dimension-free randomized algorithms for producing such points within $\widetilde{O}(1/\delta\epsilon^3)$ first-order oracle calls, we show that no dimension-free rate can be achieved by a deterministic algorithm. On the other hand, we point out that this rate can be derandomized for smooth functions with merely a logarithmic dependence on the smoothness parameter. Moreover, we establish several lower bounds for this task which hold for any randomized algorithm, with or without convexity. Finally, we show how the convergence rate of finding $(\delta,\epsilon)$-stationary points can be improved when the function is convex, a setting which we motivate by proving that in general no finite-time algorithm can produce points with small subgradients even for convex functions.

Guy Kornowski · Ohad Shamir 🔗 


Solving a Special Type of Optimal Transport Problem by a Modified Hungarian Algorithm
(Poster)
link
SlidesLive Video
We observe that computing the empirical Wasserstein distance in the independence test is an optimal transport (OT) problem with a special structure. This observation inspires us to study this special type of OT problem and propose a modified Hungarian algorithm to solve it exactly. For an OT problem between marginals with $m$ and $n$ atoms ($m\geq n$), the computational complexity of the proposed algorithm is $\mathcal{O}(m^2n)$. Computing the empirical Wasserstein distance in the independence test requires solving this special type of OT problem, where we have $m=n^2$. The associated computational complexity of our algorithm is $\mathcal{O}(n^5)$, while that of the classic Hungarian algorithm is $\mathcal{O}(n^6)$. Numerical experiments validate our theoretical analysis. Broader applications of the proposed algorithm are discussed at the end.

Yiling Xie · Yiling Luo · Xiaoming Huo 🔗 


BOME! Bilevel Optimization Made Easy: A Simple First-Order Approach
(Poster)
link
SlidesLive Video
Bilevel optimization (BO) is useful for solving a variety of important machine learning problems, including but not limited to hyperparameter optimization, meta-learning, continual learning, and reinforcement learning. Conventional BO methods need to differentiate through the lower-level optimization process with implicit differentiation, which requires expensive calculations related to the Hessian matrix. There has been a recent quest for first-order methods for BO, but the methods proposed to date tend to be complicated and impractical for large-scale deep learning applications. In this work, we propose a simple first-order BO algorithm that depends only on first-order gradient information, requires no implicit differentiation, and is practical and efficient for large-scale nonconvex functions in deep learning. We provide a non-asymptotic convergence analysis of the proposed method to stationary points for nonconvex objectives and present empirical results that show its superior practical performance.
Mao Ye · Bo Liu · Stephen Wright · Peter Stone · Qiang Liu 🔗 


On Convergence of Average-Reward Off-Policy Control Algorithms in Weakly-Communicating MDPs
(Poster)
link
We show that two average-reward off-policy control algorithms, Differential Q-learning (Wan, Naik, \& Sutton, 2021a) and RVI Q-learning (Abounadi, Bertsekas, \& Borkar, 2001), converge in weakly-communicating MDPs. Weakly-communicating MDPs are the most general class of MDPs for which a learning algorithm with a single stream of experience can guarantee obtaining a policy achieving the optimal reward rate. The original convergence proofs of the two algorithms require that all optimal policies induce unichains, which is not necessarily true for weakly-communicating MDPs. To the best of our knowledge, our results are the first showing that average-reward off-policy control algorithms converge in weakly-communicating MDPs. As a direct extension, we show that the average-reward options algorithms introduced by Wan, Naik, \& Sutton (2021b) converge if the Semi-MDP induced by the options is weakly communicating.
Yi Wan · Richard Sutton 🔗 


Optimizing the Performative Risk under Weak Convexity Assumptions
(Poster)
link
SlidesLive Video
In performative prediction, a predictive model impacts the distribution that generates future data, a phenomenon that is ignored in classical supervised learning. In this closed-loop setting, the natural measure of performance, named the performative risk ($\mathrm{PR}$), captures the expected loss incurred by a predictive model \emph{after} deployment. The core difficulty of using the performative risk as an optimization objective is that the data distribution itself depends on the model parameters. This dependence is governed by the environment and not under the control of the learner. As a consequence, even the choice of a convex loss function can result in a highly non-convex $\mathrm{PR}$ minimization problem. Prior work has identified a pair of general conditions on the loss and the mapping from model parameters to distributions that implies convexity of the performative risk. In this paper, we relax these assumptions and focus on obtaining weaker notions of convexity, without sacrificing the amenability of the $\mathrm{PR}$ minimization problem to iterative optimization methods.

Yulai Zhao 🔗 


Quantization-based Optimization: Alternative Stochastic Approximation of Global Optimization
(Poster)
link
SlidesLive Video
In this study, we propose a global optimization algorithm based on quantizing the energy level of an objective function in an NP-hard problem. Under the white noise hypothesis for a quantization error with a dense and uniform distribution, we can regard the quantization error as i.i.d. white noise. By stochastic analysis, the proposed algorithm converges weakly only under conditions satisfying Lipschitz continuity, instead of local convergence properties such as the Hessian constraint of the objective function. This shows that the proposed algorithm ensures global optimization by Laplace's condition. Numerical experiments show that the proposed algorithm outperforms conventional learning methods in solving NP-hard optimization problems such as the traveling salesman problem.
Jinwuk Seok · Changsik Cho 🔗 


Completing the Model Optimization Process by Correcting Patterns of Failure in Regression Tasks
(Poster)
link
SlidesLive Video
Model selection and hyperparameter optimization sometimes prove to be complex and costly processes with unfinished outcomes. In fact, a so-called optimized model can still suffer from patterns of failure when predicting on new data, affecting the generalization error. In this paper, we focus on regression tasks and introduce an additional stage to the model optimization process in order to render it more reliable. This new step aims to correct error patterns when the model makes predictions on unlabeled data. To that end, our method includes two techniques. AutoCorrect Rules leverage the model's under/over-estimation bias and apply simple rules to adjust predictions. AutoCorrect Model is a supervised approach which exploits different representations to predict residuals in order to revise model predictions. We empirically prove the relevance of our method on the outcome of an AutoML tool using different time budgets, and on a specific optimization case leveraging a pre-trained model for an image regression task.
Thomas Bonnier 🔗 
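One way to read the AutoCorrect Rules idea is as a per-bin bias correction fitted on held-out data. The sketch below is our own illustration; the binning scheme, function names, and constants are assumptions, not the paper's exact rules.

```python
import numpy as np

def fit_bias_rules(preds, targets, n_bins=4):
    """Estimate the mean residual (pred - target) within quantile bins of
    the predictions, on held-out labeled data."""
    edges = np.quantile(preds, np.linspace(0, 1, n_bins + 1))
    which = np.clip(np.searchsorted(edges, preds, side="right") - 1, 0, n_bins - 1)
    bias = np.array([(preds[which == b] - targets[which == b]).mean()
                     if np.any(which == b) else 0.0 for b in range(n_bins)])
    return edges, bias

def apply_rules(preds, edges, bias):
    """Subtract the per-bin bias from new predictions."""
    which = np.clip(np.searchsorted(edges, preds, side="right") - 1,
                    0, len(bias) - 1)
    return preds - bias[which]

# Toy model that systematically overestimates by 2.0
targets = np.linspace(0.0, 10.0, 200)
preds = targets + 2.0
edges, bias = fit_bias_rules(preds, targets)
corrected = apply_rules(preds, edges, bias)
print(np.abs(corrected - targets).mean())   # near zero: the bias is removed
```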


Private Stochastic Optimization With Large Worst-Case Lipschitz Parameter: Optimal Rates for (Non-Smooth) Convex Losses & Extension to Non-Convex Losses
(Poster)
link
SlidesLive Video
We study differentially private (DP) stochastic optimization (SO) with data containing outliers and loss functions that are (possibly) not Lipschitz continuous. To date, the vast majority of work on DP SO assumes that the loss is uniformly Lipschitz over data (i.e., stochastic gradients are uniformly bounded over all data points). While this assumption is convenient, it is often unrealistic: in many practical problems, the loss function may not be uniformly Lipschitz. Even when the loss function is Lipschitz continuous, the worst-case Lipschitz parameter of the loss over all data points may be extremely large due to outliers. In such cases, the error bounds for DP SO, which scale with the worst-case Lipschitz parameter of the loss, are vacuous. To address these limitations, this work does not require the loss function to be uniformly Lipschitz. Instead, building on a recent line of work (Wang et al., 2020; Kamath et al., 2022), we make the weaker assumption that stochastic gradients have bounded $k$-th order moments for some $k \geq 2$. Compared with works on DP Lipschitz SO, our excess risk scales with the $k$-th moment bound instead of the Lipschitz parameter of the loss, allowing for significantly faster rates in the presence of outliers. For convex and strongly convex loss functions, we provide the first asymptotically optimal excess risk bounds (up to a logarithmic factor). In contrast to prior works, our bounds do not require the loss function to be differentiable/smooth. We also devise an accelerated algorithm for smooth losses that runs in linear time and has excess risk that is tight in certain practical parameter regimes. Additionally, our work is the first to address non-convex non-Lipschitz loss functions satisfying the Proximal-PL inequality; this covers some practical machine learning models. Our Proximal-PL algorithm has near-optimal excess risk.

Andrew Lowy · Meisam Razaviyayn 🔗 


Sufficient Conditions for Nonasymptotic Convergence of Riemannian Optimization Methods
(Oral)
link
Motivated by energy-based analyses for descent methods in the Euclidean setting, we investigate a generalisation of such energy-based analyses for descent methods over Riemannian manifolds. In doing so, we find that it is possible to derive curvature-free guarantees for such descent methods, improving on work by Zhang and Sra [2016]. This analysis allows us to study the acceleration of Riemannian gradient descent in the geodesically-convex setting, and improve on an existing result by Alimisis et al. [2021]. Finally, extending the analysis of Ahn and Sra [2020], we attempt to provide some sufficient conditions for the acceleration of Riemannian descent methods in the strongly geodesically convex setting.
Vishwak Srinivasan · Ashia Wilson 🔗 