Skip to yearly menu bar Skip to main content


Poster

Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning

Yibin Wang ⋅ li zhimin ⋅ Yuhang Zang ⋅ Chunyu Wang ⋅ Qinglin Lu ⋅ Cheng Jin ⋅ Jiaqi Wang
2025 Poster

Abstract

Video

Chat is not available.