Skip to yearly menu bar Skip to main content


Oral Poster Wed, Dec 3, 2025 • 4:30 PM – 7:30 PM PST

PRIMT: Preference-based Reinforcement Learning with Multimodal Feedback and Trajectory Synthesis from Foundation Models

Ruiqi Wang ⋅ Dezhong Zhao ⋅ Ziqin Yuan ⋅ Tianyu Shao ⋅ Guohua Chen ⋅ Dominic Kao ⋅ Sungeun Hong ⋅ Byung-Cheol Min

Abstract

Video

Chat is not available.