Spotlight Poster

Personalizing Reinforcement Learning from Human Feedback with Variational Preference Learning

Sriyash Poddar ⋅ Yanming Wan ⋅ Hamish Ivison ⋅ Abhishek Gupta ⋅ Natasha Jaques
2024 Spotlight Poster
