

Search All 2024 Events
 

76 Results

Workshop
An Information Theory of Compute-Optimal Size Scaling, Emergence, and Plateaus in Language Models
Anuj Keshava Nayak · Lav Varshney
Workshop
Scaling Collapse Reveals Universal Dynamics in Compute-Optimally Trained Neural Networks
Shikai Qiu · Atish Agarwala · Lechao Xiao · Jeffrey Pennington
Poster
Wed 16:30 4+3 Phases of Compute-Optimal Neural Scaling Laws
Elliot Paquette · Courtney Paquette · Lechao Xiao · Jeffrey Pennington
Poster
Fri 11:00 Resolving Discrepancies in Compute-Optimal Scaling of Language Models
Tomer Porian · Mitchell Wortsman · Jenia Jitsev · Ludwig Schmidt · Yair Carmon
Poster
Thu 16:30 Repurposing Language Models into Embedding Models: Finding the Compute-Optimal Recipe
Albert Q. Jiang · Alicja Ziarko · Bartosz Piotrowski · Wenda Li · Mateja Jamnik · Piotr Miłoś
Poster
Wed 11:00 Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations
Alex Hägele · Elie Bakouch · Atli Kosson · Loubna Ben Allal · Leandro Von Werra · Martin Jaggi
Poster
Fri 11:00 Training Compute-Optimal Protein Language Models
Xingyi Cheng · Bo Chen · Pan Li · Jing Gong · Jie Tang · Le Song
Workshop
Inference Scaling Laws: An Empirical Analysis of Compute-Optimal Inference for LLM Problem-Solving
Yangzhen Wu · Zhiqing Sun · Shanda Li · Sean Welleck · Yiming Yang
Workshop
Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling
Hritik Bansal · Arian Hosseini · Rishabh Agarwal · Vinh Tran · Mehran Kazemi
Poster
Thu 11:00 OPEL: Optimal Transport Guided ProcedurE Learning
Sayeed Shafayet Chowdhury · Soumyadeep Chandra · Kaushik Roy
Competition
Sat 14:20 #1st place (MMGP variant): Optimal Morphing Strategies for Efficient Computations
Abbas Kabalan
Workshop
Sun 9:01 Optimizing Optimization Methods with Computer Assistance, Ben Grimmer
Benjamin Grimmer