Skip to yearly menu bar Skip to main content


Loss-to-Loss Prediction: Language model scaling laws across datasets

David Brandfonbrener ⋅ Nikhil Anand ⋅ Nikhil Vyas ⋅ Eran Malach ⋅ Sham Kakade

Abstract

Chat is not available.