

Sequential Preference Ranking for Efficient Reinforcement Learning from Human Feedback

Minyoung Hwang ⋅ Gunmin Lee ⋅ Hogun Kee ⋅ Chan Woo Kim ⋅ Kyungjae Lee ⋅ Songhwai Oh
2023 Poster
