Skip to yearly menu bar Skip to main content


Spotlight Poster Wed, Dec 3, 2025 • 11:00 AM – 2:00 PM PST

Inference-Time Reward Hacking in Large Language Models

Hadi Khalaf ⋅ Claudio Mayrink Verdun ⋅ Alex Oesterling ⋅ Himabindu Lakkaraju ⋅ Flavio Calmon

Abstract

Video

Chat is not available.