Skip to yearly menu bar Skip to main content


Poster

Act Only When It Pays: Efficient Reinforcement Learning for LLM Reasoning via Selective Rollouts

Haizhong Zheng ⋅ Yang Zhou ⋅ Brian Bartoldson ⋅ Bhavya Kailkhura ⋅ Fan Lai ⋅ Jiawei Zhao ⋅ Beidi Chen
2025 Poster

Abstract

Video

Chat is not available.