Skip to yearly menu bar Skip to main content


Poster

Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms

Rafael Rafailov · Yaswanth Chittepu · Ryan Park · Harshit Sushil Sikchi · Joey Hejna · Brad Knox · Chelsea Finn · Scott Niekum
2024 Poster

Abstract

Video

Chat is not available.