706 Results
Poster
|
Wed 16:30 |
Can Graph Learning Improve Planning in LLM-based Agents? Xixi Wu · Yifei Shen · Caihua Shan · Kaitao Song · Siwei Wang · Bohang Zhang · Jiarui Feng · Hong Cheng · Wei Chen · Yun Xiong · Dongsheng Li |
|
Workshop
|
SELFGOAL: Your Language Agents Already Know How to Achieve High-level Goals 睿涵 杨 · Jiangjie Chen · yikai zhang · Siyu Yuan · Chen · Kyle Richardson · Yanghua Xiao · Deqing Yang |
||
Oral
|
Wed 16:10 |
AgentBoard: An Analytical Evaluation Board of Multi-turn LLM Agents Ma Chang · Junlei Zhang · Zhihao Zhu · Cheng Yang · Yujiu Yang · Yaohui Jin · Zhenzhong Lan · Lingpeng Kong · Junxian He |
|
Poster
|
Wed 16:30 |
AgentBoard: An Analytical Evaluation Board of Multi-turn LLM Agents Ma Chang · Junlei Zhang · Zhihao Zhu · Cheng Yang · Yujiu Yang · Yaohui Jin · Zhenzhong Lan · Lingpeng Kong · Junxian He |
|
Workshop
|
Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale Rogerio Bonatti · Dan Zhao · Sara Abdali · Yinheng Li · Yadong Lu · Justin Wagle · Kazuhito Koishida · Arthur Bucker · Lawrence Jang · Dillon Dupont · Zheng Hui |
||
Poster
|
Thu 11:00 |
MO-DDN: A Coarse-to-Fine Attribute-based Exploration Agent for Multi-Object Demand-driven Navigation Hongcheng Wang · Peiqi Liu · Wenzhe Cai · Mingdong Wu · Zhengyu Qian · Hao Dong |
|
Workshop
|
Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale Rogerio Bonatti · Dan Zhao · Dillon Dupont · Sara Abdali · Yinheng Li · Yadong Lu · Justin Wagle · Kazuhito Koishida · Arthur Bucker · Lawrence Jang · Zheng Hui |
||
Poster
|
Wed 11:00 |
AutoGuide: Automated Generation and Selection of Context-Aware Guidelines for Large Language Model Agents Yao Fu · Dong-Ki Kim · Jaekyeom Kim · Sungryull Sohn · Lajanugen Logeswaran · Kyunghoon Bae · Honglak Lee |
|
Workshop
|
From Context to Action: Analysis of the Impact of State Representation and Context on the Generalization of Multi-Turn Web Navigation Agents Nalin Tiwary · Vardhan Dongre · Sanil Chawla · Ashwin Lamani · Dilek Tur |
||
Workshop
|
CRAB: Cross-platform agent benchmark for multi-modal embodied language model agents Tianqi Xu · Linyao Chen · Dai-Jie Wu · Yanjun Chen · Zecheng Zhang · Xiang Yao · Zhiqiang Xie · Yongchao Chen · Shilong Liu · Bochen Qian · Philip Torr · Bernard Ghanem · Guohao Li |