firstbacksecondback
76 Results
Workshop
|
An Information Theory of Compute-Optimal Size Scaling, Emergence, and Plateaus in Language Models Anuj Keshava Nayak · Lav Varshney |
||
Workshop
|
Scaling Collapse Reveals Universal Dynamics in Compute-Optimally Trained Neural Networks Shikai Qiu · Atish Agarwala · Lechao Xiao · Jeffrey Pennington |
||
Poster
|
Wed 16:30 |
4+3 Phases of Compute-Optimal Neural Scaling Laws Elliot Paquette · Courtney Paquette · Lechao Xiao · Jeffrey Pennington |
|
Poster
|
Fri 11:00 |
Resolving Discrepancies in Compute-Optimal Scaling of Language Models Tomer Porian · Mitchell Wortsman · Jenia Jitsev · Ludwig Schmidt · Yair Carmon |
|
Poster
|
Thu 16:30 |
Repurposing Language Models into Embedding Models: Finding the Compute-Optimal Recipe Albert Q. Jiang · Alicja Ziarko · Bartosz Piotrowski · Wenda Li · Mateja Jamnik · Piotr Miłoś |
|
Poster
|
Wed 11:00 |
Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations Alex Hägele · Elie Bakouch · Atli Kosson · Loubna Ben allal · Leandro Von Werra · Martin Jaggi |
|
Poster
|
Fri 11:00 |
Training Compute-Optimal Protein Language Models Xingyi Cheng · Bo Chen · Pan Li · Jing Gong · Jie Tang · Le Song |
|
Workshop
|
Inference Scaling Laws: An Empirical Analysis of Compute-Optimal Inference for LLM Problem-Solving Yangzhen Wu · Zhiqing Sun · Shanda Li · Sean Welleck · Yiming Yang |
||
Workshop
|
Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling Hritik Bansal · Arian Hosseini · Rishabh Agarwal · Vinh Tran · Mehran Kazemi |
||
Poster
|
Thu 11:00 |
OPEL: Optimal Transport Guided ProcedurE Learning Sayeed Shafayet Chowdhury · Soumyadeep Chandra · Kaushik Roy |
|
Competition
|
Sat 14:20 |
#1st place (MMGP variant) : Optimal Morphing Strategies for Efficient Computations Abbas Kabalan |
|
Workshop
|
Sun 9:01 |
Optimizing Optimization Methods with Computer Assistance, Ben Grimmer Benjamin Grimmer |