Workshop
|
|
A2Nav: Action-Aware Zero-Shot Robot Navigation Using Vision-Language Ability of Foundation Models
Peihao Chen · Xinyu Sun · Hongyan Zhi · Runhao Zeng · Thomas Li · Mingkui Tan · Chuang Gan
|
|
Workshop
|
Sat 8:35
|
Compositional Generalization in Vision-Language Models uses the Language Modality only
|
|
Workshop
|
|
Analyzing Zero-Shot Abilities of Vision-Language Models on Video Understanding Tasks
Avinash Madasu · Anahita Bhiwandiwalla · VASUDEV LAL
|
|
Workshop
|
|
Learning Inner Monologue and Its Utilization in Vision-Language Challenges
Diji Yang · Kezhen Chen · Jinmeng Rao · Xiaoyuan Guo · Yawen Zhang · Jie Yang · Yi Zhang
|
|
Workshop
|
|
Selective Prediction For Open-Ended Question Answering in Black-Box Vision-Language Models
Zaid Khan · Yun Fu
|
|
Workshop
|
|
Compositional Generalization in Vision-Language Models uses the Language Modality only
Chenwei Wu · Patrick Haffner · Erran Li Li · Stefano Ermon · Rong Ge
|
|
Poster
|
Thu 15:00
|
SugarCrepe: Fixing Hackable Benchmarks for Vision-Language Compositionality
Cheng-Yu Hsieh · Jieyu Zhang · Zixian Ma · Aniruddha Kembhavi · Ranjay Krishna
|
|
Workshop
|
|
Temporal Fine-tuning of Medical Vision-Language Representation
Haoxu Huang · Kyunghyun Cho · Sumit Chopra · Divyam Madaan
|
|
Workshop
|
|
Probing Intersectional Biases in Vision-Language Models with Counterfactual Examples
Phillip Howard · Avinash Madasu · Tiep Le · Gustavo Lujan-Moreno · VASUDEV LAL
|
|
Workshop
|
|
Vision-Language Models Provide Promptable Representations for Reinforcement Learning
William Chen · Oier Mees · Aviral Kumar · Sergey Levine
|
|
Workshop
|
|
Vision-Language Models Provide Promptable Representations for Reinforcement Learning
William Chen · Oier Mees · Aviral Kumar · Sergey Levine
|
|
Workshop
|
|
How to Recycle: General Vision-Language Model without Task Tuning for Predicting Object Recyclability
Eliot Park · Eddy Pan · Shreya Johri · Pranav Rajpurkar
|
|