Skip to yearly menu bar Skip to main content


Poster Wed, Dec 3, 2025 • 11:00 AM – 2:00 PM PST

First SFT, Second RL, Third UPT: Continual Improving Multi-Modal LLM Reasoning via Unsupervised Post-Training

Lai Wei ⋅ Yuting Li ⋅ Chen Wang ⋅ Yue Wang ⋅ Linghe Kong ⋅ Weiran Huang ⋅ Lichao Sun

Abstract

Video

Chat is not available.