Skip to yearly menu bar Skip to main content


Spotlight Poster

Inference-Time Reward Hacking in Large Language Models

Hadi Khalaf ⋅ Claudio Mayrink Verdun ⋅ Alex Oesterling ⋅ Himabindu Lakkaraju ⋅ Flavio Calmon
2025 Spotlight Poster

Abstract

Video

Chat is not available.