Skip to yearly menu bar Skip to main content


Blinded by Language: Multimodal LLMs Underuse Their Vision Backbone

Haider Al-Tahan ⋅ Randall Balestriero ⋅ Mark Ibrahim

Abstract

Chat is not available.