Poster

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Mingjie Liu ⋅ Shizhe Diao ⋅ Ximing Lu ⋅ Jian Hu ⋅ Xin Dong ⋅ Yejin Choi ⋅ Jan Kautz ⋅ Yi Dong
2025 Poster
