Workshop
|
|
Improved Stein Variational Gradient Descent with Importance Weights
Lukang Sun · Peter Richtarik
|
|
Workshop
|
|
On the Parallel Complexity of Multilevel Monte Carlo in Stocahstic Gradient Descent
Kei Ishikawa
|
|
Poster
|
Wed 8:45
|
Implicit Bias of (Stochastic) Gradient Descent for Rank-1 Linear Neural Network
Bochen Lyu · Zhanxing Zhu
|
|
Poster
|
Tue 15:15
|
Universal Gradient Descent Ascent Method for Nonconvex-Nonconcave Minimax Optimization
Taoli Zheng · Linglingzhi Zhu · Anthony Man-Cho So · Jose Blanchet · Jiajin Li
|
|
Poster
|
Wed 8:45
|
Implicit Bias of Gradient Descent for Two-layer ReLU and Leaky ReLU Networks on Nearly-orthogonal Data
Yiwen Kou · Zixiang Chen · Quanquan Gu
|
|
Poster
|
Wed 8:45
|
Transformers learn to implement preconditioned gradient descent for in-context learning
Kwangjun Ahn · Xiang Cheng · Hadi Daneshmand · Suvrit Sra
|
|
Workshop
|
|
Revisiting the noise Model of SGD
Barak Battash · Ofir Lindenbaum
|
|
Poster
|
Thu 8:45
|
Complex-valued Neurons Can Learn More but Slower than Real-valued Neurons via Gradient Descent
Jin-Hui Wu · Shao-Qun Zhang · Yuan Jiang · Zhi-Hua Zhou
|
|
Workshop
|
|
Accelerated gradient descent: A guaranteed bound for a heuristic restart strategy
Walaa Moursi · Stephen Vavasis · Viktor Pavlovic
|
|
Workshop
|
|
GradTree: Learning Axis-Aligned Decision Trees with Gradient Descent
Sascha Marton · Stefan Lüdtke · Christian Bartelt · Heiner Stuckenschmidt
|
|
Workshop
|
|
Benign Oscillation of Stochastic Gradient Descent with Large Learning Rate
Miao Lu · Beining Wu · Xiaodong Yang · Difan Zou
|
|
Workshop
|
|
Large Catapults in Momentum Gradient Descent with Warmup: An Empirical Study
Prin Phunyaphibarn · Junghyun Lee · Bohan Wang · Huishuai Zhang · Chulhee Yun
|
|