Skip to yearly menu bar Skip to main content


Evaluating Language Models' Evaluations of Games

Katie Collins · Cedegao (Ced) Zhang · Graham Todd · Lance Ying · Mauricio Barba · Ryan Liu · Adrian Weller · Ionatan Kuperwajs · Catherine Wong · Josh Tenenbaum · Tom Griffiths

Abstract

Chat is not available.