MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers

Zhenting Wang ⋅ Qi Chang ⋅ Hemani Patel ⋅ Shashank Biju ⋅ Cheng-En Wu ⋅ Quan Liu ⋅ Aolin Ding ⋅ Alireza Rezazadeh ⋅ Ankit Parag Shah ⋅ Yujia Bao ⋅ Eugene Siow

Abstract
