Poster

ReMA: Learning to Meta-Think for LLMs with Multi-agent Reinforcement Learning

Ziyu Wan ⋅ Yunxiang Li ⋅ Xiaoyu Wen ⋅ Yan Song ⋅ Hanjing Wang ⋅ Linyi Yang ⋅ Mark Schmidt ⋅ Jun Wang ⋅ Weinan Zhang ⋅ Shuyue Hu ⋅ Ying Wen
2025 Poster

Abstract
