BrokenMath: A Benchmark for Sycophancy in Theorem Proving with LLMs

Ivo Petrov · Jasper Dekoninck · Martin Vechev

Abstract
