Skip to yearly menu bar Skip to main content


Poster Fri, Dec 5, 2025 • 4:30 PM – 7:30 PM PST

Establishing Best Practices in Building Rigorous Agentic Benchmarks

Yuxuan Zhu ⋅ Tengjun Jin ⋅ Yada Pruksachatkun ⋅ Andy Zhang ⋅ Shu Liu ⋅ Sasha Cui ⋅ Sayash Kapoor ⋅ Shayne Longpre ⋅ Kevin Meng ⋅ Rebecca Weiss ⋅ Fazl Barez ⋅ Rahul Gupta ⋅ Jwala Dhamala ⋅ Jacob Merizian ⋅ Mario Giulianelli ⋅ Harry Coppock ⋅ Cozmin Ududec ⋅ Antony Kellermann ⋅ Jasjeet Sekhon ⋅ Jacob Steinhardt ⋅ Sarah Schwettmann ⋅ Arvind Narayanan ⋅ Matei A Zaharia ⋅ Ion Stoica ⋅ Percy Liang ⋅ Daniel Kang

Abstract

Video

Chat is not available.