Skip to yearly menu bar Skip to main content


AgentChangeBench: A Multi-Dimensional Evaluation Framework for Goal-Shift Robustness in Conversational AI

Manik Rana ⋅ Calissa Man ⋅ Jeffrey Paine ⋅ Anotida Expected Msiiwa ⋅ Ahan M R ⋅ Kevin Zhu ⋅ Vasu Sharma ⋅ Sunishchal Dev

Abstract

Chat is not available.