Skip to yearly menu bar Skip to main content


Search All 2024 Events
 

34 Results

<<   <   Page 3 of 3   >>   >
Workshop
Integrating Object Detection Modality into Visual Language Model for Enhanced Autonomous Driving Agent
Linfeng He · Yiming Sun · Sihao Wu · Jiaxu Liu · Xiaowei Huang
Workshop
Enhancing Multi-Agent Multi-Modal Collaboration with Fine-Grained Reward Modeling
Qian Yang · Weixiang Yan · Aishwarya Agrawal
Workshop
MAMORX: Multi-agent Multi-Modal Scientific Review Generation with External Knowledge
Guanchao Wang · Pawin Taechoyotin · Tong Zeng · Bradley Sides · Daniel Acuna
Workshop
MuMA-ToM: Multi-modal Multi-Agent Theory of Mind
Haojun Shi · Suyu Ye · Xinyu Fang · Chuanyang Jin · Leyla Isik · Yen-Ling Kuo · Tianmin Shu
Workshop
MuMA-ToM: Multi-modal Multi-Agent Theory of Mind
Haojun Shi · Suyu Ye · Xinyu Fang · Chuanyang Jin · Leyla Isik · Yen-Ling Kuo · Tianmin Shu
Workshop
CROSS-JEM: Accurate and Efficient Cross-encoders for Short-text Ranking Tasks
Bhawna Paliwal · Deepak Saini · Mudit Dhawan · Siddarth Asokan · Nagarajan Natarajan · Surbhi Aggarwal · Pankaj Malhotra · Jian Jiao · Manik Varma
Workshop
Multimodal Auto Validation For Self-Refinement in Web Agents
Ruhana Azam · Tamer Abuelsaad · Aditya Vempaty · Ashish Jagmohan
Workshop
OmniPredict: GPT-4o Enhanced Multi-modal Pedestrian Crossing Intention Prediction
Je-Seok Ham · Jia Huang · Peng Jiang · Jinyoung Moon · Yongjin Kwon · Srikanth Saripalli · Changick Kim
Workshop
Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale
Rogerio Bonatti · Dan Zhao · Sara Abdali · Yinheng Li · Yadong Lu · Justin Wagle · Kazuhito Koishida · Arthur Bucker · Lawrence Jang · Dillon Dupont · Zheng Hui
Workshop
Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale
Rogerio Bonatti · Dan Zhao · Dillon Dupont · Sara Abdali · Yinheng Li · Yadong Lu · Justin Wagle · Kazuhito Koishida · Arthur Bucker · Lawrence Jang · Zheng Hui