firstbacksecondback
34 Results
Workshop
|
Integrating Object Detection Modality into Visual Language Model for Enhanced Autonomous Driving Agent Linfeng He · Yiming Sun · Sihao Wu · Jiaxu Liu · Xiaowei Huang |
||
Workshop
|
Enhancing Multi-Agent Multi-Modal Collaboration with Fine-Grained Reward Modeling Qian Yang · Weixiang Yan · Aishwarya Agrawal |
||
Workshop
|
MAMORX: Multi-agent Multi-Modal Scientific Review Generation with External Knowledge Guanchao Wang · Pawin Taechoyotin · Tong Zeng · Bradley Sides · Daniel Acuna |
||
Workshop
|
MuMA-ToM: Multi-modal Multi-Agent Theory of Mind Haojun Shi · Suyu Ye · Xinyu Fang · Chuanyang Jin · Leyla Isik · Yen-Ling Kuo · Tianmin Shu |
||
Workshop
|
MuMA-ToM: Multi-modal Multi-Agent Theory of Mind Haojun Shi · Suyu Ye · Xinyu Fang · Chuanyang Jin · Leyla Isik · Yen-Ling Kuo · Tianmin Shu |
||
Workshop
|
CROSS-JEM: Accurate and Efficient Cross-encoders for Short-text Ranking Tasks Bhawna Paliwal · Deepak Saini · Mudit Dhawan · Siddarth Asokan · Nagarajan Natarajan · Surbhi Aggarwal · Pankaj Malhotra · Jian Jiao · Manik Varma |
||
Workshop
|
Multimodal Auto Validation For Self-Refinement in Web Agents Ruhana Azam · Tamer Abuelsaad · Aditya Vempaty · Ashish Jagmohan |
||
Workshop
|
OmniPredict: GPT-4o Enhanced Multi-modal Pedestrian Crossing Intention Prediction Je-Seok Ham · Jia Huang · Peng Jiang · Jinyoung Moon · Yongjin Kwon · Srikanth Saripalli · Changick Kim |
||
Workshop
|
Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale Rogerio Bonatti · Dan Zhao · Sara Abdali · Yinheng Li · Yadong Lu · Justin Wagle · Kazuhito Koishida · Arthur Bucker · Lawrence Jang · Dillon Dupont · Zheng Hui |
||
Workshop
|
Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale Rogerio Bonatti · Dan Zhao · Dillon Dupont · Sara Abdali · Yinheng Li · Yadong Lu · Justin Wagle · Kazuhito Koishida · Arthur Bucker · Lawrence Jang · Zheng Hui |