Skip to yearly menu bar Skip to main content


Poster

OpenOmni: Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Alignment and Real-time Emotional Speech Synthesis

Run Luo ⋅ Ting-En Lin ⋅ Haonan Zhang ⋅ Yuchuan Wu ⋅ Xiong Liu ⋅ Yongbin Li ⋅ Longze Chen ⋅ Jiaming Li ⋅ Lei Zhang ⋅ Xiaobo Xia ⋅ Hamid Alinejad-Rokny ⋅ Fei Huang ⋅ Min Yang
2025 Poster

Abstract

Video

Chat is not available.