Skip to yearly menu bar Skip to main content


ARB: Advanced Reasoning Benchmark for Large Language Models

Tom Sawada · Daniel Paleka · Alexander Havrilla · Pranav Tadepalli · Paula Vidas · Alexander Kranias · John Nay · Kshitij Gupta · Aran Komatsuzaki

Abstract

Chat is not available.