

Recovery-Bench: Evaluating Agentic Recovery from Mistakes

Shangyin Tan ⋅ Kevin Lin ⋅ Koushik Sen ⋅ Matei A Zaharia

Abstract
