Skip to yearly menu bar Skip to main content


Poster Thu, Dec 4, 2025 • 4:30 PM – 7:30 PM PST

MME-VideoOCR: Evaluating OCR-Based Capabilities of Multimodal LLMs in Video Scenarios

Yang Shi ⋅ Huanqian Wang ⋅ Xie ⋅ Huanyao Zhang ⋅ Lijie Zhao ⋅ yifan zhang ⋅ Xinfeng Li ⋅ Chaoyou Fu ⋅ Zhuoer Wen ⋅ Wenting Liu ⋅ Zhuoran Zhang ⋅ Xinlong Chen ⋅ Bohan Zeng ⋅ Sihan Yang ⋅ Yushuo Guan ⋅ Zhang Zhang ⋅ Liang Wang ⋅ Haoxuan Li ⋅ Zhouchen Lin ⋅ Yuanxing Zhang ⋅ Pengfei Wan ⋅ Haotian Wang ⋅ Wenjing Yang

Abstract

Video

Chat is not available.