NeurIPS Poster Explaining Landscape Connectivity of Low-cost Solutions for Multilayer Nets

Rohith Kuditipudi · Xiang Wang · Holden Lee · Yi Zhang · Zhiyuan Li · Wei Hu · Rong Ge · Sanjeev Arora

East Exhibition Hall B, C #165

Keywords: [ Theory ] [ Non-Convex Optimization ] [ Deep Learning -> Optimization for Deep Networks; Optimization ]

Abstract:

Mode connectivity is a surprising phenomenon in the loss landscape of deep nets. Optima, at least those discovered by gradient-based optimization, turn out to be connected by simple paths on which the loss function is almost constant. Often, these paths can be chosen to be piecewise linear, with as few as two segments.
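To make the notion of a two-segment piecewise linear path concrete, here is a minimal sketch. It is not the paper's construction: the toy loss, the hand-picked bend point `theta_mid`, and all function names are hypothetical, chosen only to show how the loss along a bent path can stay lower than along the straight line between two optima.

```python
def lerp(u, v, s):
    """Linear interpolation between parameter vectors u and v, with s in [0, 1]."""
    return [(1.0 - s) * a + s * b for a, b in zip(u, v)]


def two_segment_path(theta_a, theta_mid, theta_b, t):
    """Point at t in [0, 1] on the path theta_a -> theta_mid -> theta_b."""
    if t <= 0.5:
        return lerp(theta_a, theta_mid, 2.0 * t)
    return lerp(theta_mid, theta_b, 2.0 * t - 1.0)


def max_loss_on_path(loss_fn, theta_a, theta_mid, theta_b, num_points=21):
    """Largest loss over evenly spaced points on the two-segment path."""
    ts = [i / (num_points - 1) for i in range(num_points)]
    return max(loss_fn(two_segment_path(theta_a, theta_mid, theta_b, t)) for t in ts)


def toy_loss(theta):
    """Hypothetical toy loss (an assumption, not from the paper): zero on the unit circle."""
    return (sum(x * x for x in theta) - 1.0) ** 2


theta_a = [1.0, 0.0]    # first optimum of the toy loss
theta_b = [-1.0, 0.0]   # second optimum of the toy loss
theta_mid = [0.0, 1.0]  # hand-picked bend point, also an optimum

# Straight line: use the average of the endpoints as a degenerate bend point.
straight_mid = lerp(theta_a, theta_b, 0.5)
barrier_straight = max_loss_on_path(toy_loss, theta_a, straight_mid, theta_b)
barrier_bent = max_loss_on_path(toy_loss, theta_a, theta_mid, theta_b)

print("max loss on straight path:", barrier_straight)
print("max loss on two-segment path:", barrier_bent)
```

In this toy setting the straight line passes through the origin, where the loss peaks, while the bent path stays closer to the ring of minima. For a real network, `theta_a` and `theta_b` would be flattened weight vectors and the bend point would typically be found by optimization rather than chosen by hand.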

We give mathematical explanations for this phenomenon, assuming generic properties (such as dropout stability and noise stability) of well-trained deep nets, which have previously been identified as part of understanding the generalization properties of deep nets. Our explanation holds for realistic multilayer nets, and experiments are presented to verify the theory.
