Poster
in
Affinity Event: Women in Machine Learning

See No Evil: Adversarial Attacks on Referring Multi-Object Tracking Systems

Halima Bouzidi ⋅ Mohammad Al Faruque

Project Page [ OpenReview]

Abstract

Language-vision understanding has driven the development of Referring Multi-Object Tracking (RMOT). However, their security remains underexplored. We examine adversarial vulnerabilities in Transformer-based RMOT, showing that crafted perturbations disrupt both linguistic-visual referring and object-matching components. We introduce VEIL, an adversarial framework that exposes persistent errors in FIFO-based temporal memory and compromises tracking reliability.

Chat is not available.