Skip to yearly menu bar Skip to main content


Common Pitfalls of Margin-based Preference Optimization in Language Model Alignment

Hui Yuan · Yifan Zeng · Yue Wu · Huazheng Wang · Mengdi Wang · Liu Leqi

Abstract

Chat is not available.