Skip to yearly menu bar Skip to main content


Poster

An eye for an ear: zero-shot audio description leveraging an image captioner with audio-visual token distribution matching

Hugo Malard ⋅ Michel Olvera ⋅ Stéphane Lathuilière ⋅ Slim Essid
2024 Poster
[ Paper [ Poster [ OpenReview

Abstract

Video

Chat is not available.