Timezone: »
The loss landscape of deep learning seems hopelessly complex. Most structures discovered in this landscape have been empirical observations. The loss landscape is determined by the model architecture and the dataset. Yet, the interplay between architecture and the structure of local minima is not well understood. We uncover a key part of this relation. First, we show that even nonlinear neural networks admit a large group of continuous symmetries which keep the loss invariant. These symmetries show that many local minima are valleys, having directions in the parameter space where the loss remains invariant. Additionally, we show that these symmetries imply certain quantities are conserved during gradient flow. We derive the explicit form of these conserved quantities using a Noether's theorem for gradient flow. These conserved quantities allow us to define coordinates along the valley of a local minimum. These symmetries can be used to create ensembles of trained models from a single trained model.
Author Information
Bo Zhao (University of California, San Diego)
Iordan Ganev (Radboud University)
Robin Walters (Northeastern University)
Rose Yu (UC San Diego)
Nima Dehmamy (IBM Research)
More from the Same Authors
-
2022 : Charting Flat Minima Using the Conserved Quantities of Gradient Flow »
Bo Zhao · Iordan Ganev · Robin Walters · Rose Yu · Nima Dehmamy -
2022 : Understanding Optimization Challenges when Encoding to Geometric Structures »
Babak Esmaeili · Robin Walters · Heiko Zimmermann · Jan-Willem van de Meent -
2022 : Image to Icosahedral Projection for $\mathrm{SO}(3)$ Object Reasoning from Single-View Images »
David Klee · Ondrej Biza · Robert Platt · Robin Walters -
2022 : Rose Yu: "Physics-Guided Deep Learning for Climate Science" »
Rose Yu -
2022 Poster: Meta-Learning Dynamics Forecasting Using Task Inference »
Rui Wang · Robin Walters · Rose Yu -
2022 Poster: Symmetry Teleportation for Accelerated Optimization »
Bo Zhao · Nima Dehmamy · Robin Walters · Rose Yu -
2021 Poster: Automatic Symmetry Discovery with Lie Algebra Convolutional Network »
Nima Dehmamy · Robin Walters · Yanchen Liu · Dashun Wang · Rose Yu