Skip to yearly menu bar Skip to main content


Poster Thu, Dec 4, 2025 • 11:00 AM – 2:00 PM PST

Watch and Listen: Understanding Audio-Visual-Speech Moments with Multimodal LLM

Zinuo Li ⋅ Xian Zhang ⋅ Yongxin Guo ⋅ Mohammed Bennamoun ⋅ Farid Boussaid ⋅ Girish Dwivedi ⋅ Luqi Gong ⋅ Qiuhong Ke

Abstract

Video

Chat is not available.