Skip to yearly menu bar Skip to main content


Poster Wed, Dec 3, 2025 • 11:00 AM – 2:00 PM PST

Accelerating RL for LLM Reasoning with Optimal Advantage Regression

Kianté Brantley ⋅ Mingyu Chen ⋅ Zhaolin Gao ⋅ Jason Lee ⋅ Wen Sun ⋅ Wenhao Zhan ⋅ Xuezhou Zhang

Abstract

Video

Chat is not available.