Skip to yearly menu bar Skip to main content


Poster Wed, Dec 3, 2025 • 4:30 PM – 7:30 PM PST

OpenOmni: Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Alignment and Real-time Emotional Speech Synthesis

Run Luo ⋅ Ting-En Lin ⋅ Haonan Zhang ⋅ Yuchuan Wu ⋅ Xiong Liu ⋅ Yongbin Li ⋅ Longze Chen ⋅ Jiaming Li ⋅ Lei Zhang ⋅ Xiaobo Xia ⋅ Hamid Alinejad-Rokny ⋅ Fei Huang ⋅ Min Yang

Abstract

Video

Chat is not available.