Skip to yearly menu bar Skip to main content


Search All 2023 Events
 

43 Results

<<   <   Page 1 of 4   >   >>
Workshop
Improved Stein Variational Gradient Descent with Importance Weights
Lukang Sun · Peter Richtarik
Workshop
On the Parallel Complexity of Multilevel Monte Carlo in Stocahstic Gradient Descent
Kei Ishikawa
Poster
Wed 8:45 Implicit Bias of (Stochastic) Gradient Descent for Rank-1 Linear Neural Network
Bochen Lyu · Zhanxing Zhu
Poster
Tue 15:15 Universal Gradient Descent Ascent Method for Nonconvex-Nonconcave Minimax Optimization
Taoli Zheng · Linglingzhi Zhu · Anthony Man-Cho So · Jose Blanchet · Jiajin Li
Poster
Wed 8:45 Implicit Bias of Gradient Descent for Two-layer ReLU and Leaky ReLU Networks on Nearly-orthogonal Data
Yiwen Kou · Zixiang Chen · Quanquan Gu
Poster
Wed 8:45 Transformers learn to implement preconditioned gradient descent for in-context learning
Kwangjun Ahn · Xiang Cheng · Hadi Daneshmand · Suvrit Sra
Workshop
Revisiting the noise Model of SGD
Barak Battash · Ofir Lindenbaum
Poster
Thu 8:45 Complex-valued Neurons Can Learn More but Slower than Real-valued Neurons via Gradient Descent
Jin-Hui Wu · Shao-Qun Zhang · Yuan Jiang · Zhi-Hua Zhou
Workshop
Accelerated gradient descent: A guaranteed bound for a heuristic restart strategy
Walaa Moursi · Stephen Vavasis · Viktor Pavlovic
Workshop
GradTree: Learning Axis-Aligned Decision Trees with Gradient Descent
Sascha Marton · Stefan Lüdtke · Christian Bartelt · Heiner Stuckenschmidt
Workshop
Benign Oscillation of Stochastic Gradient Descent with Large Learning Rate
Miao Lu · Beining Wu · Xiaodong Yang · Difan Zou
Workshop
Large Catapults in Momentum Gradient Descent with Warmup: An Empirical Study
Prin Phunyaphibarn · Junghyun Lee · Bohan Wang · Huishuai Zhang · Chulhee Yun