Skip to yearly menu bar Skip to main content


Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling

Hritik Bansal · Arian Hosseini · Rishabh Agarwal · Vinh Tran · Mehran Kazemi

Abstract

Chat is not available.