Skip to yearly menu bar Skip to main content


Poster Wed, Dec 3, 2025 • 11:00 AM – 2:00 PM PST

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Qiying Yu ⋅ Zheng Zhang ⋅ Ruofei Zhu ⋅ Yufeng Yuan ⋅ Xiaochen Zuo ⋅ Yu Yue ⋅ Weinan Dai ⋅ Tiantian Fan ⋅ Gaohong Liu ⋅ juncai liu ⋅ LingJun Liu ⋅ Xin Liu ⋅ Haibin Lin ⋅ Zhiqi Lin ⋅ Bole Ma ⋅ Guangming Sheng ⋅ Yuxuan Tong ⋅ Chi Zhang ⋅ Mofan Zhang ⋅ Ru Zhang ⋅ Wang Zhang ⋅ Hang Zhu ⋅ Jinhua Zhu ⋅ Jiaze Chen ⋅ Jiangjie Chen ⋅ Chengyi Wang ⋅ Hongli Yu ⋅ Yuxuan Song ⋅ Xiangpeng Wei ⋅ Hao Zhou ⋅ Jingjing Liu ⋅ Wei-Ying Ma ⋅ Ya-Qin Zhang ⋅ Lin Yan ⋅ Yonghui Wu ⋅ Mingxuan Wang

Abstract

Video

Chat is not available.