Skip to yearly menu bar Skip to main content


Search All 2024 Events
 

344 Results

<<   <   Page 29 of 29   >>   >
Workshop
Sun 14:15 Illusory VQA: Benchmarking and Enhancing Multimodal Models on Visual Illusions
Mohammadmostafa Rostamkhani · Baktash Ansariogholbake · Hoorieh Sabzevari · Farzan Rahmani · Sauleh Eetemadi
Workshop
Sun 11:50 Illusory VQA: Benchmarking and Enhancing Multimodal Models on Visual Illusions
Mohammadmostafa Rostamkhani · Baktash Ansariogholbake · Hoorieh Sabzevari · Farzan Rahmani · Sauleh Eetemadi
Workshop
Seeing Inside Buildings: Leveraging Generative AI and Multimodal Data to Automate Building Material Audits
Nikita Klimenko · James Stoddart · Lorenzo Villaggi · Dale Zhao
Workshop
Sun 11:55 HAMMR : HierArchical MultiModal React agents for generic VQA
Lluis Castrejon · Thomas Mensink · Howard Zhou · Vittorio Ferrari · Andre Araujo · Jasper Uijlings
Workshop
MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models
Peng Xia · Kangyu Zhu · Haoran Li · Tianze Wang · Weijia Shi · Linjun Zhang · James Zou · Huaxiu Yao
Workshop
MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models
Peng Xia · Kangyu Zhu · Haoran Li · Tianze Wang · Weijia Shi · Sheng Wang · Linjun Zhang · James Zou · Huaxiu Yao
Poster
Thu 16:30 Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models
Byung-Kwan Lee · Chae Won Kim · Beomchan Park · Yong Man Ro
Workshop
How Easy is It to Fool Your Multimodal LLMs? An Empirical Analysis on Deceptive Prompt
Yusu Qian · Haotian Zhang · Yinfei Yang · Zhe Gan