firstbacksecondback
16 Results
Poster
|
Tue 14:00 |
AVLEN: Audio-Visual-Language Embodied Navigation in 3D Environments Sudipta Paul · Amit Roy-Chowdhury · Anoop Cherian |
|
Poster
|
Tue 14:00 |
Multi-modal Grouping Network for Weakly-Supervised Audio-Visual Video Parsing Shentong Mo · Yapeng Tian |
|
Poster
|
Audio-Driven Co-Speech Gesture Video Generation Xian Liu · Qianyi Wu · Hang Zhou · Yuanqi Du · Wayne Wu · Dahua Lin · Ziwei Liu |
||
Poster
|
Tue 9:00 |
A Closer Look at Weakly-Supervised Audio-Visual Source Localization Shentong Mo · Pedro Morgado |
|
Poster
|
Tue 14:00 |
Learning State-Aware Visual Representations from Audible Interactions Himangi Mittal · Pedro Morgado · Unnat Jain · Abhinav Gupta |
|
Poster
|
Wed 14:00 |
u-HuBERT: Unified Mixed-Modal Speech Pretraining And Zero-Shot Transfer to Unlabeled Modality Wei-Ning Hsu · Bowen Shi |
|
Poster
|
Thu 9:00 |
SoundSpaces 2.0: A Simulation Platform for Visual-Acoustic Learning Changan Chen · Carl Schissler · Sanchit Garg · Philip Kobernik · Alexander Clegg · Paul Calamia · Dhruv Batra · Philip Robinson · Kristen Grauman |
|
Poster
|
Wed 9:00 |
Few-Shot Audio-Visual Learning of Environment Acoustics Sagnik Majumder · Changan Chen · Ziad Al-Halah · Kristen Grauman |
|
Poster
|
Wed 14:00 |
Learning Audio-Visual Dynamics Using Scene Graphs for Audio Source Separation Moitreya Chatterjee · Narendra Ahuja · Anoop Cherian |
|
Poster
|
Text-Adaptive Multiple Visual Prototype Matching for Video-Text Retrieval Chengzhi Lin · Ancong Wu · Junwei Liang · Jun Zhang · Wenhang Ge · Wei-Shi Zheng · Chunhua Shen |
||
Poster
|
Wed 9:00 |
Robustness Analysis of Video-Language Models Against Visual and Language Perturbations Madeline Chantry · Shruti Vyas · Hamid Palangi · Yogesh Rawat · Vibhav Vineet |
|
Poster
|
Thu 9:00 |
NUWA-Infinity: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis Jian Liang · Chenfei Wu · Xiaowei Hu · Zhe Gan · Jianfeng Wang · Lijuan Wang · Zicheng Liu · Yuejian Fang · Nan Duan |