Poster
Thu Dec 12th 10:45 AM -- 12:45 PM @ East Exhibition Hall B + C #165
Explaining Landscape Connectivity of Low-cost Solutions for Multilayer Nets
Rohith Kuditipudi · Xiang Wang · Holden Lee · Yi Zhang · Zhiyuan Li · Wei Hu · Rong Ge · Sanjeev Arora
Poster
Thu Dec 12th 10:45 AM -- 12:45 PM @ East Exhibition Hall B + C #166
Leader Stochastic Gradient Descent for Distributed Training of Deep Learning Models
Yunfei Teng · Wenbo Gao · François Chalus · Anna Choromanska · Donald Goldfarb · Adrian Weller
Poster
Thu Dec 12th 10:45 AM -- 12:45 PM @ East Exhibition Hall B + C #167
Learning Neural Networks with Adaptive Regularization
Han Zhao · Yao-Hung Hubert Tsai · Russ Salakhutdinov · Geoffrey Gordon
Poster
Thu Dec 12th 10:45 AM -- 12:45 PM @ East Exhibition Hall B + C #168
Memory Efficient Adaptive Optimization
Rohan Anil · Vineet Gupta · Tomer Koren · Yoram Singer
Poster
Thu Dec 12th 10:45 AM -- 12:45 PM @ East Exhibition Hall B + C #169
On the Convergence Rate of Training Recurrent Neural Networks
Zeyuan Allen-Zhu · Yuanzhi Li · Zhao Song
Poster
Thu Dec 12th 10:45 AM -- 12:45 PM @ East Exhibition Hall B + C #170
SGD on Neural Networks Learns Functions of Increasing Complexity
Dimitris Kalimeris · Gal Kaplun · Preetum Nakkiran · Benjamin Edelman · Tristan Yang · Boaz Barak · Haofeng Zhang
Poster
Thu Dec 12th 10:45 AM -- 12:45 PM @ East Exhibition Hall B + C #171
Towards Understanding the Importance of Shortcut Connections in Residual Networks
Tianyi Liu · Minshuo Chen · Mo Zhou · Simon Du · Enlu Zhou · Tuo Zhao
Poster
Thu Dec 12th 10:45 AM -- 12:45 PM @ East Exhibition Hall B + C #172
Trivializations for Gradient-Based Optimization on Manifolds
Mario Lezcano Casado
Poster
Thu Dec 12th 10:45 AM -- 12:45 PM @ East Exhibition Hall B + C #173
Using Statistics to Automate Stochastic Optimization
Hunter Lang · Lin Xiao · Pengchuan Zhang
Poster
Thu Dec 12th 10:45 AM -- 12:45 PM @ East Exhibition Hall B + C #174
Which Algorithmic Choices Matter at Which Batch Sizes? Insights From a Noisy Quadratic Model
Guodong Zhang · Lala Li · Zachary Nado · James Martens · Sushant Sachdeva · George Dahl · Chris Shallue · Roger Grosse
Poster
Thu Dec 12th 10:45 AM -- 12:45 PM @ East Exhibition Hall B + C #175
Wide Neural Networks of Any Depth Evolve as Linear Models Under Gradient Descent
Jaehoon Lee · Lechao Xiao · Samuel Schoenholz · Yasaman Bahri · Roman Novak · Jascha Sohl-Dickstein · Jeffrey Pennington
Poster
Thu Dec 12th 05:00 -- 07:00 PM @ East Exhibition Hall B + C #195
Algorithm-Dependent Generalization Bounds for Overparameterized Deep Residual Networks
Spencer Frei · Yuan Cao · Quanquan Gu
Poster
Thu Dec 12th 05:00 -- 07:00 PM @ East Exhibition Hall B + C #196
Are deep ResNets provably better than linear predictors?
Chulhee Yun · Suvrit Sra · Ali Jadbabaie
Poster
Thu Dec 12th 05:00 -- 07:00 PM @ East Exhibition Hall B + C #197
Efficient Rematerialization for Deep Networks
Ravi Kumar · Manish Purohit · Zoya Svitkina · Erik Vee · Joshua Wang
Poster
Thu Dec 12th 05:00 -- 07:00 PM @ East Exhibition Hall B + C #198
Fast Convergence of Natural Gradient Descent for Over-Parameterized Neural Networks
Guodong Zhang · James Martens · Roger Grosse
Poster
Thu Dec 12th 05:00 -- 07:00 PM @ East Exhibition Hall B + C #199
How to Initialize your Network? Robust Initialization for WeightNorm & ResNets
Devansh Arpit · Víctor Campos · Yoshua Bengio
Poster
Thu Dec 12th 05:00 -- 07:00 PM @ East Exhibition Hall B + C #200
Lookahead Optimizer: k steps forward, 1 step back
Michael Zhang · James Lucas · Jimmy Ba · Geoffrey E Hinton
Poster
Thu Dec 12th 05:00 -- 07:00 PM @ East Exhibition Hall B + C #201
Global Convergence of Gradient Descent for Deep Linear Residual Networks
Lei Wu · Qingcan Wang · Chao Ma
Poster
Thu Dec 12th 05:00 -- 07:00 PM @ East Exhibition Hall B + C #202
Piecewise Strong Convexity of Neural Networks
Tristan Milne
Poster
Thu Dec 12th 05:00 -- 07:00 PM @ East Exhibition Hall B + C #203
PowerSGD: Practical Low-Rank Gradient Compression for Distributed Optimization
Thijs Vogels · Sai Praneeth Karimireddy · Martin Jaggi
Poster
Thu Dec 12th 05:00 -- 07:00 PM @ East Exhibition Hall B + C #204
A Primal Dual Formulation For Deep Learning With Constraints
Yatin Nandwani · Abhishek Pathak · Mausam · Parag Singla
Poster
Thu Dec 12th 05:00 -- 07:00 PM @ East Exhibition Hall B + C #205
Surfing: Iterative Optimization Over Incrementally Trained Deep Networks
Ganlin Song · Zhou Fan · John Lafferty
Poster
Thu Dec 12th 05:00 -- 07:00 PM @ East Exhibition Hall B + C #206
Theoretical Limits of Pipeline Parallel Optimization and Application to Distributed Deep Learning
Igor Colin · Ludovic Dos Santos · Kevin Scaman