Skip to yearly menu bar Skip to main content


Poster

$\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$

Junkang Wu · Yuexiang Xie · Zhengyi Yang · Jiancan Wu · Jinyang Gao · Bolin Ding · Xiang Wang · Xiangnan He
2024 Poster
[ Paper [ Slides [ Poster [ OpenReview

Abstract

Video

Chat is not available.