Skip to yearly menu bar Skip to main content


Poster

An eye for an ear: zero-shot audio description leveraging an image captioner with audio-visual token distribution matching

Hugo Malard · Michel Olvera · Stéphane Lathuilière · Slim Essid
2024 Poster
[ Paper [ Poster [ OpenReview

Abstract

Video

Chat is not available.