Skip to yearly menu bar Skip to main content


RefactorBench: Evaluating Stateful Reasoning In Language Agents Through Code

Dhruv Gautam ⋅ Spandan Garg ⋅ Jinu Jang ⋅ Neel Sundaresan ⋅ Roshanak Zilouchian Moghaddam

Abstract

Chat is not available.