Blinded by Language: Multimodal LLMs Underuse Their Vision Backbone

Haider Al-Tahan · Randall Balestriero · Mark Ibrahim

Abstract
