Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Reliable ML from Unreliable Data

DPO-PRO: Direct Preference Optimization with Preference Robustness

Cheol Woo Kim ⋅ Shresth Verma ⋅ Mauricio Tec ⋅ Milind Tambe
2025 Poster
in
Workshop: Reliable ML from Unreliable Data

Abstract

Chat is not available.