Skip to yearly menu bar Skip to main content


Poster Thu, Dec 4, 2025 • 4:30 PM – 7:30 PM PST

Safe RLHF-V: Safe Reinforcement Learning from Multi-modal Human Feedback

Jiaming Ji ⋅ Xinyu Chen ⋅ Rui Pan ⋅ Han Zhu ⋅ Jiahao Li ⋅ Donghai Hong ⋅ Boyuan Chen ⋅ Jiayi Zhou ⋅ Kaile Wang ⋅ Juntao Dai ⋅ Chi-Min Chan ⋅ Sirui Han ⋅ Yike Guo ⋅ Yaodong Yang

Abstract

Video

Chat is not available.