Skip to yearly menu bar Skip to main content


BrokenMath: A Benchmark for Sycophancy in Theorem Proving with LLMs

Ivo Petrov ⋅ Jasper Dekoninck ⋅ Martin Vechev

Abstract

Chat is not available.