firstbacksecondback
273 Results
Poster
|
Tue 15:15 |
Statistical and Computational Trade-off in Multi-Agent Multi-Armed Bandits Filippo Vannella · Alexandre Proutiere · Jaeseong Jeong |
|
Workshop
|
Extreme Event Prediction with Multi-agent Reinforcement Learning-based Parametrization of Atmospheric and Oceanic Turbulence Rambod Mojgani · Daniel Waelchli · Yifei Guan · Petros Koumoutsakos · Pedram Hassanzadeh |
||
Poster
|
Wed 8:45 |
Decentralized Randomly Distributed Multi-agent Multi-armed Bandit with Heterogeneous Rewards Mengfan Xu · Diego Klabjan |
|
Poster
|
Wed 8:45 |
Mutual-Information Regularized Multi-Agent Policy Iteration Wang · Deheng Ye · Zongqing Lu |
|
Poster
|
Tue 8:45 |
Diverse Conventions for Human-AI Collaboration Bidipta Sarkar · Andy Shih · Dorsa Sadigh |
|
Poster
|
Wed 8:45 |
Blocked Collaborative Bandits: Online Collaborative Filtering with Per-Item Budget Constraints Soumyabrata Pal · Arun Suggala · Karthikeyan Shanmugam · Prateek Jain |
|
Poster
|
Thu 8:45 |
Emergent Communication for Rules Reasoning Yuxuan Guo · Yifan Hao · Rui Zhang · Enshuai Zhou · Zidong Du · xishan zhang · Xinkai Song · Yuanbo Wen · Yongwei Zhao · Xuehai Zhou · Jiaming Guo · Qi Yi · Shaohui Peng · Di Huang · Ruizhi Chen · Qi Guo · Yunji Chen |
|
Poster
|
Wed 8:45 |
Zero-sum Polymatrix Markov Games: Equilibrium Collapse and Efficient Computation of Nash Equilibria Fivos Kalogiannis · Ioannis Panageas |
|
Poster
|
Thu 8:45 |
History Filtering in Imperfect Information Games: Algorithms and Complexity Christopher Solinas · Doug Rebstock · Nathan Sturtevant · Michael Buro |
|
Poster
|
Wed 8:45 |
Scalable Primal-Dual Actor-Critic Method for Safe Multi-Agent RL with General Utilities Donghao Ying · Yunkai Zhang · Yuhao Ding · Alec Koppel · Javad Lavaei |
|
Poster
|
Thu 15:00 |
A Robust and Opponent-Aware League Training Method for StarCraft II Ruozi Huang · Xipeng Wu · Hongsheng Yu · Zhong Fan · Haobo Fu · Qiang Fu · Wei Yang |
|
Poster
|
Tue 8:45 |
Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization Xiangsen Wang · Haoran Xu · Yinan Zheng · Xianyuan Zhan |