Skip to yearly menu bar Skip to main content


Exploring and Addressing Reward Confusion in Offline Preference Learning

Xin Chen, Cynthia ⋅ Sam Toyer ⋅ Florian Shkurti

Abstract

Chat is not available.