Skip to yearly menu bar Skip to main content


Common Pitfalls of Margin-based Preference Optimization in Language Model Alignment

Hui Yuan ⋅ Yifan Zeng ⋅ Yue Wu ⋅ Huazheng Wang ⋅ Mengdi Wang ⋅ Liu Leqi

Abstract

Chat is not available.