Spotlight: MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding
Fuwen Luo
Abstract
5-min Spotlight Presentations of the following three papers.
Fuwen Luo et al., MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding.
Kibum Kim et al., Data Scaling Isn't Enough: Towards Improving Compositional Reasoning in Video-Language Models.
Zihao Lin et al., MLPEdit-Bench: Benchmarking Reasoning-Based Layer-wise Poster Editing.
Video
Chat is not available.
Successful Page Load