Skip to yearly menu bar Skip to main content


Spotlight Poster

Communication-Efficient Language Model Training Scales Reliably and Robustly: Scaling Laws for DiLoCo

Zachary Charles ⋅ Gabriel Teston ⋅ Lucio Dery ⋅ John Rush ⋅ Nova Fallen ⋅ Zachary Garrett ⋅ Arthur Szlam ⋅ Arthur Douillard
2025 Spotlight Poster

Abstract

Video

Chat is not available.