Exploring and Addressing Reward Confusion in Offline Preference Learning

Xin Chen, Cynthia · Sam Toyer · Florian Shkurti

Abstract
