

Poster

Group Robust Preference Optimization in Reward-free RLHF

Shyam Sundhar Ramesh · Yifan Hu · Iason Chaimalas · Viraj Mehta · Pier Giuseppe Sessa · Haitham Bou Ammar · Ilija Bogunovic
2024 Poster

Abstract
