Skip to yearly menu bar Skip to main content


Oral Wed, Dec 3, 2025 • 3:30 PM – 3:50 PM PST

PRIMT: Preference-based Reinforcement Learning with Multimodal Feedback and Trajectory Synthesis from Foundation Models

Ruiqi Wang ⋅ Dezhong Zhao ⋅ Ziqin Yuan ⋅ Tianyu Shao ⋅ Guohua Chen ⋅ Dominic Kao ⋅ Sungeun Hong ⋅ Byung-Cheol Min

Abstract

Video

Chat is not available.