firstbacksecondback
344 Results
Workshop
|
Sun 14:15 |
Illusory VQA: Benchmarking and Enhancing Multimodal Models on Visual Illusions Mohammadmostafa Rostamkhani · Baktash Ansariogholbake · Hoorieh Sabzevari · Farzan Rahmani · Sauleh Eetemadi |
|
Workshop
|
Sun 11:50 |
Illusory VQA: Benchmarking and Enhancing Multimodal Models on Visual Illusions Mohammadmostafa Rostamkhani · Baktash Ansariogholbake · Hoorieh Sabzevari · Farzan Rahmani · Sauleh Eetemadi |
|
Workshop
|
Seeing Inside Buildings: Leveraging Generative AI and Multimodal Data to Automate Building Material Audits Nikita Klimenko · James Stoddart · Lorenzo Villaggi · Dale Zhao |
||
Workshop
|
Sun 11:55 |
HAMMR : HierArchical MultiModal React agents for generic VQA Lluis Castrejon · Thomas Mensink · Howard Zhou · Vittorio Ferrari · Andre Araujo · Jasper Uijlings |
|
Workshop
|
MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models Peng Xia · Kangyu Zhu · Haoran Li · Tianze Wang · Weijia Shi · Linjun Zhang · James Zou · Huaxiu Yao |
||
Workshop
|
MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models Peng Xia · Kangyu Zhu · Haoran Li · Tianze Wang · Weijia Shi · Sheng Wang · Linjun Zhang · James Zou · Huaxiu Yao |
||
Poster
|
Thu 16:30 |
Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models Byung-Kwan Lee · Chae Won Kim · Beomchan Park · Yong Man Ro |
|
Workshop
|
How Easy is It to Fool Your Multimodal LLMs? An Empirical Analysis on Deceptive Prompt Yusu Qian · Haotian Zhang · Yinfei Yang · Zhe Gan |