11 Results
Poster
|
Wed 9:00 |
Robustness Analysis of Video-Language Models Against Visual and Language Perturbations Madeline Chantry · Shruti Vyas · Hamid Palangi · Yogesh Rawat · Vibhav Vineet |
|
Poster
|
Thu 14:00 |
Video Diffusion Models Jonathan Ho · Tim Salimans · Alexey Gritsenko · William Chan · Mohammad Norouzi · David Fleet |
|
Poster
|
LGDN: Language-Guided Denoising Network for Video-Language Modeling Haoyu Lu · Mingyu Ding · Nanyi Fei · Yuqi Huo · Zhiwu Lu |
||
Poster
|
Text-Adaptive Multiple Visual Prototype Matching for Video-Text Retrieval Chengzhi Lin · Ancong Wu · Junwei Liang · Jun Zhang · Wenhang Ge · Wei-Shi Zheng · Chunhua Shen |
||
Poster
|
Tue 14:00 |
Fine-Grained Semantically Aligned Vision-Language Pre-Training Juncheng Li · Xin He · Longhui Wei · Long Qian · Linchao Zhu · Lingxi Xie · Yueting Zhuang · Qi Tian · Siliang Tang |
|
Affinity Workshop
|
Erased text retrieval from historical palimpsest manuscripts using deep autoregressive priors Anna Starynska · David Messinger |
||
Poster
|
Thu 9:00 |
Retrieve, Reason, and Refine: Generating Accurate and Faithful Patient Instructions Fenglin Liu · Bang Yang · Chenyu You · Xian Wu · Shen Ge · Zhangdaihong Liu · Xu Sun · Yang Yang · David Clifton |
|
Poster
|
Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations Peng Jin · Jinfa Huang · Fenglin Liu · Xian Wu · Shen Ge · Guoli Song · David Clifton · Jie Chen |
||
Workshop
|
Zero-shot Video Moment Retrieval With Off-the-Shelf Models Anuj Diwan · Puyuan Peng · Raymond Mooney |
||
Poster
|
Towards Video Text Visual Question Answering: Benchmark and Baseline Minyi Zhao · Bingjia Li · Jie Wang · Wanqing Li · Wenjing Zhou · Lan Zhang · Shijie Xuyang · Zhihang Yu · Xinkun Yu · Guangze Li · Aobotao Dai · Shuigeng Zhou |
||
Poster
|
A Differentiable Semantic Metric Approximation in Probabilistic Embedding for Cross-Modal Retrieval Hao Li · Jingkuan Song · Lianli Gao · Pengpeng Zeng · Haonan Zhang · Gongfu Li |