Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling

Hritik Bansal ⋅ Arian Hosseini ⋅ Rishabh Agarwal ⋅ Vinh Tran ⋅ Mehran Kazemi

Abstract
