88 Results
Workshop
|
Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning Anton Bakhtin · David Wu · Adam Lerer · Jonathan Gray · Athul Jacob · Gabriele Farina · Alexander Miller · Noam Brown |
||
Poster
|
Thu 9:00 |
Nocturne: a scalable driving benchmark for bringing multi-agent learning one step closer to the real world Eugene Vinitsky · Nathan Lichtlé · Xiaomeng Yang · Brandon Amos · Jakob Foerster |
|
Workshop
|
Sat 8:35 |
Towards Credible Human Evaluation of Open-Domain Dialog Systems Using Interactive Setup Sijia Liu · Patrick Lange · Behnam Hedayatnia · Alexandros Papangelis · Di Jin · Andrew Wirth · Yang Liu · Dilek Hakkani-Tur |
|
Poster
|
Tue 9:00 |
Fine-tuning language models to find agreement among humans with diverse preferences Michiel Bakker · Martin Chadwick · Hannah Sheahan · Michael Tessler · Lucy Campbell-Gillingham · Jan Balaguer · Nat McAleese · Amelia Glaese · John Aslanides · Matt Botvinick · Christopher Summerfield |