Workshop
|
Sat 8:35
|
Compositional Generalization in Vision-Language Models uses the Language Modality only
|
|
Workshop
|
|
A2Nav: Action-Aware Zero-Shot Robot Navigation Using Vision-Language Ability of Foundation Models
Peihao Chen · Xinyu Sun · Hongyan Zhi · Runhao Zeng · Thomas Li · Mingkui Tan · Chuang Gan
|
|
Workshop
|
|
Analyzing Zero-Shot Abilities of Vision-Language Models on Video Understanding Tasks
Avinash Madasu · Anahita Bhiwandiwalla · VASUDEV LAL
|
|
Workshop
|
|
Selective Prediction For Open-Ended Question Answering in Black-Box Vision-Language Models
Zaid Khan · Yun Fu
|
|
Poster
|
Tue 8:45
|
Kiki or Bouba? Sound Symbolism in Vision-and-Language Models
Morris Alper · Hadar Averbuch-Elor
|
|
Workshop
|
|
Vision-and-Language Navigation in Real World using Foundation Models
Chengguang Xu · Hieu T. Nguyen · Christopher Amato · Lawson Wong
|
|
Workshop
|
|
Vision-and-Language Navigation in Real World using Foundation Models
Chengguang Xu · Hieu T. Nguyen · Christopher Amato · Lawson Wong
|
|
Workshop
|
|
How to Recycle: General Vision-Language Model without Task Tuning for Predicting Object Recyclability
Eliot Park · Eddy Pan · Shreya Johri · Pranav Rajpurkar
|
|
Workshop
|
|
Vision-Language Models are Zero-Shot Reward Models for Reinforcement Learning
Juan Rocamonde · Victoriano Montesinos · Elvis Nava · Ethan Perez · David Lindner
|
|
Workshop
|
|
Vision-Language Models are Zero-Shot Reward Models for Reinforcement Learning
Juan Rocamonde · Victoriano Montesinos · Elvis Nava · Ethan Perez · David Lindner
|
|
Poster
|
Thu 8:45
|
Distilling Out-of-Distribution Robustness from Vision-Language Foundation Models
Andy Zhou · Jindong Wang · Yu-Xiong Wang · Haohan Wang
|
|
Poster
|
Thu 8:45
|
VLATTACK: Multimodal Adversarial Attacks on Vision-Language Tasks via Pre-trained Models
Ziyi Yin · Muchao Ye · Tianrong Zhang · Tianyu Du · Tianyu Du · Jinguo Zhu · Han Liu · Jinghui Chen · Ting Wang · Fenglong Ma
|
|