Skip to yearly menu bar Skip to main content


Contributed Oral Presentation
in
Workshop: 7th International Workshop on Large Scale Holistic Video Understanding: Toward Video Foundation Models
Mon, Dec 1, 2025 • 9:45 AM – 10:00 AM PST

Enhancing Temporal Understanding in Video-LLMs through Stacked Temporal Attention in Vision Encoders

Leibniz University Hannover, L3S Research Center Ali Rasekh · Erfan Soula · Omid Daliran · Simon Gottschalk · Mohsen Fayyaz

Abstract

Video

Chat is not available.