Skip to yearly menu bar Skip to main content


Poster Thu, Dec 4, 2025 • 11:00 AM – 2:00 PM PST

When Semantics Mislead Vision: Mitigating Large Multimodal Models Hallucinations in Scene Text Spotting and Understanding

Yan Shu ⋅ Hangui Lin ⋅ Yexin Liu ⋅ Yan Zhang ⋅ Gangyan Zeng ⋅ Yan Li ⋅ Yu Zhou ⋅ Ser Nam Lim ⋅ Harry Yang ⋅ Nicu Sebe

Abstract

Video

Chat is not available.