Skip to yearly menu bar Skip to main content


Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning

Anton Bakhtin ⋅ David Wu ⋅ Adam Lerer ⋅ Jonathan Gray ⋅ Athul Jacob ⋅ Gabriele Farina ⋅ Alexander Miller ⋅ Noam Brown

Abstract

Video

Chat is not available.