116 Results
Poster | Wed 9:00 | Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness to Model Misspecification | Takumi Tanabe · Rei Sato · Kazuto Fukuchi · Jun Sakuma · Youhei Akimoto
Poster | Thu 9:00 | Direct Advantage Estimation | Hsiao-Ru Pan · Nico Gürtler · Alexander Neitz · Bernhard Schölkopf
Workshop | | Dynamic Collaborative Multi-Agent Reinforcement Learning Communication for Autonomous Drone Reforestation | Philipp Siedler
Workshop | | Multi-Agent Policy Transfer via Task Relationship Modeling | Rong-Jun Qin · Feng Chen · Tonghan Wang · Lei Yuan · Xiaoran Wu · Yipeng Kang · Zongzhang Zhang · Chongjie Zhang · Yang Yu
Poster | Wed 14:00 | Shield Decentralization for Safe Multi-Agent Reinforcement Learning | Daniel Melcer · Christopher Amato · Stavros Tripakis
Poster | Wed 9:00 | Queue Up Your Regrets: Achieving the Dynamic Capacity Region of Multiplayer Bandits | Ilai Bistritz · Nicholas Bambos
Panel | Tue 9:15 | Panel 1B-1: Online Minimax Multiobjective… & Minimax-Optimal Multi-Agent RL… | Gen Li · Georgy Noarov
Workshop | | Understanding Redundancy in Discrete Multi-Agent Communication | Jonathan Thomas · Raul Santos-Rodriguez · Robert Piechocki
Poster | Tue 14:00 | Monte Carlo Augmented Actor-Critic for Sparse Reward Deep Reinforcement Learning from Suboptimal Demonstrations | Albert Wilcox · Ashwin Balakrishna · Jules Dedieu · Wyame Benslimane · Daniel Brown · Ken Goldberg
Poster | Wed 14:00 | Relational Reasoning via Set Transformers: Provable Efficiency and Applications to MARL | Fengzhuo Zhang · Boyi Liu · Kaixin Wang · Vincent Tan · Zhuoran Yang · Zhaoran Wang
Poster | Tue 9:00 | Non-Linear Coordination Graphs | Yipeng Kang · Tonghan Wang · Qianlan Yang · Xiaoran Wu · Chongjie Zhang
Workshop | | On- and Offline Multi-agent Reinforcement Learning for Disease Mitigation using Human Mobility Data | Sofia Hurtado · Radu Marculescu