139 Results
Workshop
|
Measuring AI Agent Autonomy: Towards a Scalable Approach With Code Inspection Merlin Stein · Peter Cihon · Gagan Bansal · Sam Manning |
||
Workshop
|
HoneyComb: A Flexible LLM-Based Agent System for Materials Science Huan Zhang · Yu Song · Ziyu Hou · Santiago Miret · Bang Liu |
||
Workshop
|
CausalQuest: Collecting Natural Causal Questions for AI Agents Roberto Ceraolo · Dmitrii Kharlapenko · Amélie Reymond · Rada Mihalcea · Bernhard Schölkopf · Mrinmaya Sachan · Zhijing Jin |
||
Workshop
|
Sat 16:39 |
HoneyComb: A Flexible LLM-Based Agent System for Materials Science Huan Zhang · Yu Song · Ziyu Hou · Santiago Miret · Bang Liu |
|
Workshop
|
CRAB: Cross-platform agent benchmark for multi-modal embodied language model agents Tianqi Xu · Linyao Chen · Dai-Jie Wu · Yanjun Chen · Zecheng Zhang · Xiang Yao · Zhiqiang Xie · Yongchao Chen · Shilong Liu · Bochen Qian · Philip Torr · Bernard Ghanem · Guohao Li |
||
Workshop
|
Simulation System Towards Solving Societal-Scale Manipulation Maximilian Puelma Touzel · Sneheel Sarangi · Austin Welch · Gayatri K · Dan Zhao · Zachary Yang · Hao Yu · Tom Gibbs · Ethan Kosak-Hine · Andreea Musulan · Camille Thibault · Reihaneh Rabbany · Jean-François Godbout · Kellin Pelrine |
||
Workshop
|
Protecting Users From Themselves: Safeguarding Contextual Privacy in Interactions with Conversational Agents Kieleh Ngong Ivoline Clarisse · Swanand Kadhe · Hao Wang · Keerthiram Murugesan · Justin D Weisz · Amit Dhurandhar · Karthikeyan Natesan Ramamurthy |