Skip to yearly menu bar Skip to main content


Spotlight Poster

What Makes a Reward Model a Good Teacher? An Optimization Perspective

Noam Razin ⋅ Zixuan Wang ⋅ Hubert Strauss ⋅ Stanley Wei ⋅ Jason Lee ⋅ Sanjeev Arora
2025 Spotlight Poster

Abstract

Video

Chat is not available.