Skip to yearly menu bar Skip to main content


Poster

CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models

Zhihang Lin ⋅ Mingbao Lin ⋅ Yuan Xie ⋅ Rongrong Ji
2025 Poster

Abstract

Video

Chat is not available.