Skip to yearly menu bar Skip to main content


DEBATE: A Large-Scale Benchmark for Role-Playing LLM Agents in Multi-Agent, Long-Form Debates

Yun-Shiuan Chuang ⋅ Ruixuan Tu ⋅ Chengtao Dai ⋅ Smit Vasani ⋅ Binwei Yao ⋅ Michael Tessler ⋅ Sijia Yang ⋅ Dhavan Shah ⋅ Robert Hawkins ⋅ Junjie Hu ⋅ Timothy T Rogers

Abstract

Chat is not available.