Skip to yearly menu bar Skip to main content


Poster

Group Robust Preference Optimization in Reward-free RLHF

Shyam Sundhar Ramesh ⋅ Yifan Hu ⋅ Iason Chaimalas ⋅ Viraj Mehta ⋅ Pier Giuseppe Sessa ⋅ Haitham Bou Ammar ⋅ Ilija Bogunovic
2024 Poster

Abstract

Video

Chat is not available.