Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Workshop on Scaling Environments for Agents
Sun, Dec 7, 2025 • 12:30 PM – 1:30 PM PST

MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers

Zhenting Wang · Qi Chang · Hemani Patel · Shashank Biju · Cheng-En Wu · Quan Liu · Aolin Ding · Alireza Rezazadeh · ANKIT PARAG SHAH · Yujia Bao · Eugene Siow

Abstract

Chat is not available.