Skip to yearly menu bar Skip to main content


Poster

Adaptive Preference Scaling for Reinforcement Learning with Human Feedback

Ilgee Hong ⋅ Zichong Li ⋅ Alexander Bukharin ⋅ Yixiao Li ⋅ Haoming Jiang ⋅ Tianbao Yang ⋅ Tuo Zhao
2024 Poster

Abstract

Video

Chat is not available.