Skip to yearly menu bar Skip to main content


Poster

Direct Preference-based Policy Optimization without Reward Modeling

Gaon An ⋅ Junhyeok Lee ⋅ Xingdong Zuo ⋅ Norio Kosaka ⋅ Kyung-Min Kim ⋅ Hyun Oh Song
2023 Poster

Abstract

Video

Chat is not available.