Skip to yearly menu bar Skip to main content


Poster

Direct Preference-based Policy Optimization without Reward Modeling

Gaon An · Junhyeok Lee · Xingdong Zuo · Norio Kosaka · Kyung-Min Kim · Hyun Oh Song
2023 Poster

Abstract

Video

Chat is not available.