Skip to yearly menu bar Skip to main content


Pixelated Instructions: Can Multimodal Large Language Models Follow Printed Instructions in Images?

Xiujun Li ⋅ Yujie Lu ⋅ William Yang Wang ⋅ Yejin Choi

Abstract

Chat is not available.