What do MLLMs hear? Examining the interaction between LLM and audio encoder components in Multimodal Large Language Models

Enis Çoban · Michael Mandel · Johanna Devaney
