Poster in Workshop: Reliable ML from Unreliable Data
Sat, Dec 6, 2025 • 1:15 PM – 2:15 PM PST

DPO-PRO: Direct Preference Optimization with Preference Robustness

Cheol Woo Kim · Shresth Verma · Mauricio Tec · Milind Tambe

Abstract
