TiKick: Toward Playing Multi-agent Football Full Games from Single-agent Demonstrations
Shiyu Huang · Wenze Chen · Longfei Zhang · Shizhen Xu · Ziyang Li · Fengming Zhu · Deheng Ye · Ting Chen · Jun Zhu

Deep reinforcement learning (DRL) has achieved super-human performance on complex video games (e.g., StarCraft II and Dota II). However, current DRL systems still struggle with multi-agent coordination, sparse rewards, stochastic environments, and related challenges. To address these challenges, we use a football video game, Google Research Football (GRF), as our testbed and develop an end-to-end learning-based AI system (denoted TiKick) to complete this challenging task. In this work, we first generate a large replay dataset from the self-play of single-agent experts, which are obtained from league training. We then develop a new offline algorithm to learn a powerful multi-agent AI from the fixed single-agent dataset. To the best of our knowledge, TiKick is the first learning-based AI system that can take over the multi-agent Google Research Football full game, whereas previous work could either control only a single agent or experiment on toy academic scenarios. Extensive experiments further show that our pre-trained model can accelerate the training of modern multi-agent algorithms and that our method achieves state-of-the-art performance on various academic scenarios.

Author Information

Shiyu Huang (Tsinghua University)

I am a fifth-year Ph.D. student in the Department of Computer Science and Technology, Tsinghua University, China, advised by Prof. Jun Zhu and Prof. Ting Chen. My research interests lie at the intersection of computer vision, reinforcement learning, and deep learning. I have also spent time working at Huawei Noah's Ark Lab, Tencent AI Lab, Carnegie Mellon University, and SenseTime Inc. I am also the founder of the TARTRL group.

Wenze Chen (Tsinghua University)
Longfei Zhang (National University of Defense Technology)
Shizhen Xu (RealAI)
Ziyang Li (Tencent AI Lab)
Fengming Zhu (Tencent AI Lab)
Deheng Ye (Tencent)
Ting Chen (Tsinghua University)
Jun Zhu (Tsinghua University)
