


When Semantics Mislead Vision: Mitigating Large Multimodal Models Hallucinations in Scene Text Spotting and Understanding

Yan Shu ⋅ Hangui Lin ⋅ Yexin Liu ⋅ Yan Zhang ⋅ Gangyan Zeng ⋅ Yan Li ⋅ Yu Zhou ⋅ Ser Nam Lim ⋅ Harry Yang ⋅ Nicu Sebe
2025 Poster

Abstract

