

Poster Fri, Dec 5, 2025 • 11:00 AM – 2:00 PM PST

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Mingjie Liu ⋅ Shizhe Diao ⋅ Ximing Lu ⋅ Jian Hu ⋅ Xin Dong ⋅ Yejin Choi ⋅ Jan Kautz ⋅ Yi Dong


