Timezone: »
Communication lays the foundation for human cooperation. It is also crucial for multi-agent cooperation. However, existing work focuses on broadcast communication, which is not only impractical but also leads to information redundancy that could even impair the learning process. To tackle these difficulties, we propose Individually Inferred Communication (I2C), a simple yet effective model to enable agents to learn a prior for agent-agent communication. The prior knowledge is learned via causal inference and realized by a feed-forward neural network that maps the agent's local observation to a belief about who to communicate with. The influence of one agent on another is inferred via the joint action-value function in multi-agent reinforcement learning and quantified to label the necessity of agent-agent communication. Furthermore, the agent policy is regularized to better exploit communicated messages. Empirically, we show that I2C can not only reduce communication overhead but also improve the performance in a variety of multi-agent cooperative scenarios, comparing to existing methods.
Author Information
gang Ding (Peking University)
Tiejun Huang (Peking University)
Zongqing Lu (Peking University)
Related Events (a corresponding poster, oral, or spotlight)
-
2020 Poster: Learning Individually Inferred Communication for Multi-Agent Cooperation »
Wed. Dec 9th 05:00 -- 07:00 AM Room Poster Session 2 #580
More from the Same Authors
-
2022 Poster: Model-Based Opponent Modeling »
XiaoPeng Yu · Jiechuan Jiang · Wanpeng Zhang · Haobin Jiang · Zongqing Lu -
2022 Poster: Learning to Share in Networked Multi-Agent Reinforcement Learning »
Yuxuan Yi · Ge Li · Yaowei Wang · Zongqing Lu -
2022 Poster: Double Check Your State Before Trusting It: Confidence-Aware Bidirectional Offline Model-Based Imagination »
Jiafei Lyu · Xiu Li · Zongqing Lu -
2022 Poster: I2Q: A Fully Decentralized Q-Learning Algorithm »
Jiechuan Jiang · Zongqing Lu -
2022 Poster: Adaptation Accelerating Sampling-based Bayesian Inference in Attractor Neural Networks »
Xingsi Dong · Zilong Ji · Tianhao Chu · Tiejun Huang · Wenhao Zhang · Si Wu -
2022 Poster: SNN-RAT: Robustness-enhanced Spiking Neural Network through Regularized Adversarial Training »
Jianhao Ding · Tong Bu · Zhaofei Yu · Tiejun Huang · Jian Liu -
2022 Poster: Mildly Conservative Q-Learning for Offline Reinforcement Learning »
Jiafei Lyu · Xiaoteng Ma · Xiu Li · Zongqing Lu -
2022 Poster: Training Spiking Neural Networks with Event-driven Backpropagation »
Yaoyu Zhu · Zhaofei Yu · Wei Fang · Xiaodong Xie · Tiejun Huang · Timothée Masquelier -
2022 Poster: Temporal Effective Batch Normalization in Spiking Neural Networks »
Chaoteng Duan · Jianhao Ding · Shiyan Chen · Zhaofei Yu · Tiejun Huang -
2022 Poster: Learning Optical Flow from Continuous Spike Streams »
Rui Zhao · Ruiqin Xiong · Jing Zhao · Zhaofei Yu · Xiaopeng Fan · Tiejun Huang -
2022 Poster: Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning »
Yuanpei Chen · Tianhao Wu · Shengjie Wang · Xidong Feng · Jiechuan Jiang · Zongqing Lu · Stephen McAleer · Hao Dong · Song-Chun Zhu · Yaodong Yang -
2022 : State Advantage Weighting for Offline RL »
Jiafei Lyu · aicheng Gong · Le Wan · Zongqing Lu · Xiu Li -
2023 Poster: Slow and Weak Attractor Computation Embedded in Fast and Strong E-I Balanced Neural Dynamics »
Xiaohan Lin · Liyuan Li · Boxin Shi · Tiejun Huang · Yuanyuan Mi · Si Wu -
2023 Poster: Enhancing Motion Deblurring in High-Speed Scenes with Spike Streams »
Shiyan Chen · Jiyuan Zhang · Yajing Zheng · Zhaofei Yu · Tiejun Huang -
2023 Poster: Learning to Ignore: Mutual-Information Regularized Multi-Agent Policy Iteration »
Jiangxing Wang · Deheng Ye · Zongqing Lu -
2023 Poster: Learning from Visual Observation via Offline Pretrained State-to-Go Transformer »
Bohan Zhou · Ke Li · Jiechuan Jiang · Zongqing Lu -
2023 Poster: Exploring Loss Functions for Time-based Training Strategy in Spiking Neural Networks »
Yaoyu Zhu · Wei Fang · Xiaodong Xie · Tiejun Huang · Zhaofei Yu -
2023 Poster: Unsupervised Optical Flow Estimation with Dynamic Timing Representation for Spike Camera »
Lujie Xia · gang Ding · Rui Zhao · Jiyuan Zhang · Lei Ma · Zhaofei Yu · Tiejun Huang · Ruiqin Xiong -
2022 Spotlight: Mildly Conservative Q-Learning for Offline Reinforcement Learning »
Jiafei Lyu · Xiaoteng Ma · Xiu Li · Zongqing Lu -
2022 Spotlight: Double Check Your State Before Trusting It: Confidence-Aware Bidirectional Offline Model-Based Imagination »
Jiafei Lyu · Xiu Li · Zongqing Lu -
2022 Spotlight: Training Spiking Neural Networks with Event-driven Backpropagation »
Yaoyu Zhu · Zhaofei Yu · Wei Fang · Xiaodong Xie · Tiejun Huang · Timothée Masquelier -
2022 Spotlight: Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning »
Yuanpei Chen · Tianhao Wu · Shengjie Wang · Xidong Feng · Jiechuan Jiang · Zongqing Lu · Stephen McAleer · Hao Dong · Song-Chun Zhu · Yaodong Yang -
2022 Spotlight: Lightning Talks 2A-2 »
Harikrishnan N B · Jianhao Ding · Juha Harviainen · Yizhen Wang · Lue Tao · Oren Mangoubi · Tong Bu · Nisheeth Vishnoi · Mohannad Alhanahnah · Mikko Koivisto · Aditi Kathpalia · Lei Feng · Nithin Nagaraj · Hongxin Wei · Xiaozhu Meng · Petteri Kaski · Zhaofei Yu · Tiejun Huang · Ke Wang · Jinfeng Yi · Jian Liu · Sheng-Jun Huang · Mihai Christodorescu · Songcan Chen · Somesh Jha -
2022 Spotlight: SNN-RAT: Robustness-enhanced Spiking Neural Network through Regularized Adversarial Training »
Jianhao Ding · Tong Bu · Zhaofei Yu · Tiejun Huang · Jian Liu -
2022 Poster: Oscillatory Tracking of Continuous Attractor Neural Networks Account for Phase Precession and Procession of Hippocampal Place Cells »
Tianhao Chu · Zilong Ji · Junfeng Zuo · Wenhao Zhang · Tiejun Huang · Yuanyuan Mi · Si Wu -
2021 Poster: Noisy Adaptation Generates Lévy Flights in Attractor Neural Networks »
Xingsi Dong · Tianhao Chu · Tiejun Huang · Zilong Ji · Si Wu -
2021 Poster: Deep Residual Learning in Spiking Neural Networks »
Wei Fang · Zhaofei Yu · Yanqi Chen · Tiejun Huang · Timothée Masquelier · Yonghong Tian -
2020 Poster: UnModNet: Learning to Unwrap a Modulo Image for High Dynamic Range Imaging »
Chu Zhou · Hang Zhao · Jin Han · Chang Xu · Chao Xu · Tiejun Huang · Boxin Shi -
2019 Poster: Learning Fairness in Multi-Agent Systems »
Jiechuan Jiang · Zongqing Lu -
2019 Poster: Push-pull Feedback Implements Hierarchical Information Retrieval Efficiently »
Xiao Liu · Xiaolong Zou · Zilong Ji · Gengshuo Tian · Yuanyuan Mi · Tiejun Huang · K. Y. Michael Wong · Si Wu -
2018 Poster: Learning Attentional Communication for Multi-Agent Cooperation »
Jiechuan Jiang · Zongqing Lu