Poster Wed, Dec 3, 2025 • 11:00 AM – 2:00 PM PST

VITA-Audio: Fast Interleaved Audio-Text Token Generation for Efficient Large Speech-Language Model

Zuwei Long ⋅ Yunhang Shen ⋅ Chaoyou Fu ⋅ Heting Gao ⋅ Lijiang Li ⋅ Peixian Chen ⋅ Mengdan Zhang ⋅ Hang Shao ⋅ Jian Li ⋅ Jinlong Peng ⋅ Haoyu Cao ⋅ Ke Li ⋅ Rongrong Ji ⋅ Xing Sun
