Poster
Thu Dec 12 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #165
Explaining Landscape Connectivity of Low-cost Solutions for Multilayer Nets
Poster
Thu Dec 12 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #166
Leader Stochastic Gradient Descent for Distributed Training of Deep Learning Models
Poster
Thu Dec 12 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #167
Learning Neural Networks with Adaptive Regularization
Poster
Thu Dec 12 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #168
Memory Efficient Adaptive Optimization
Poster
Thu Dec 12 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #169
On the Convergence Rate of Training Recurrent Neural Networks
Poster
Thu Dec 12 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #170
SGD on Neural Networks Learns Functions of Increasing Complexity
Poster
Thu Dec 12 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #171
Towards Understanding the Importance of Shortcut Connections in Residual Networks
Poster
Thu Dec 12 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #172
Trivializations for Gradient-Based Optimization on Manifolds
Poster
Thu Dec 12 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #173
Using Statistics to Automate Stochastic Optimization
Poster
Thu Dec 12 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #174
Which Algorithmic Choices Matter at Which Batch Sizes? Insights From a Noisy Quadratic Model
Poster
Thu Dec 12 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #175
Wide Neural Networks of Any Depth Evolve as Linear Models Under Gradient Descent
Poster
Thu Dec 12 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #195
Algorithm-Dependent Generalization Bounds for Overparameterized Deep Residual Networks
Poster
Thu Dec 12 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #196
Are deep ResNets provably better than linear predictors?
Poster
Thu Dec 12 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #197
Efficient Rematerialization for Deep Networks
Poster
Thu Dec 12 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #198
Fast Convergence of Natural Gradient Descent for Over-Parameterized Neural Networks
Poster
Thu Dec 12 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #199
How to Initialize your Network? Robust Initialization for WeightNorm & ResNets
Poster
Thu Dec 12 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #200
Lookahead Optimizer: k steps forward, 1 step back
Poster
Thu Dec 12 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #201
Global Convergence of Gradient Descent for Deep Linear Residual Networks
Poster
Thu Dec 12 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #202
Piecewise Strong Convexity of Neural Networks
[Paper] [3 min Video]
Poster
Thu Dec 12 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #203
PowerSGD: Practical Low-Rank Gradient Compression for Distributed Optimization
Poster
Thu Dec 12 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #204
A Primal Dual Formulation For Deep Learning With Constraints
Poster
Thu Dec 12 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #205
Surfing: Iterative Optimization Over Incrementally Trained Deep Networks
Poster
Thu Dec 12 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #206
Theoretical Limits of Pipeline Parallel Optimization and Application to Distributed Deep Learning