Skip to yearly menu bar Skip to main content


Poster Wed, Dec 3, 2025 • 4:30 PM – 7:30 PM PST

ReMA: Learning to Meta-Think for LLMs with Multi-agent Reinforcement Learning

Ziyu Wan ⋅ Yunxiang Li ⋅ Xiaoyu Wen ⋅ Yan Song ⋅ Hanjing Wang ⋅ Linyi Yang ⋅ Mark Schmidt ⋅ Jun Wang ⋅ Weinan Zhang ⋅ Shuyue Hu ⋅ Ying Wen

Abstract

Video

Chat is not available.