Search All 2024 Events
 

14 Results

Poster
Fri 11:00 Group Robust Preference Optimization in Reward-free RLHF
Shyam Sundhar Ramesh · Yifan Hu · Iason Chaimalas · Viraj Mehta · Pier Giuseppe Sessa · Haitham Bou Ammar · Ilija Bogunovic
Workshop
Generative Verifiers: Reward Modeling as Next-Token Prediction
Lunjun Zhang · Arian Hosseini · Hritik Bansal · Mehran Kazemi · Aviral Kumar · Rishabh Agarwal